llvm-project

Commit Graph

Author	SHA1	Message	Date
Nico Weber	2ca46867e1	Add an inttypes.h wrapper that fixes up some macros in Microsoft mode. Before MSVS2015, MSVS's headers disagree about int32_t and PRIx32 and so on. Provide a wrapper header to fix this, so that -Wformat can still be used. Fixes PR23412. llvm-svn: 240741	2015-06-26 00:13:18 +00:00
Sean Silva	d0de76a3da	Remove `requires` for x86 CPU features. Ever since the target attributes change, we don't need to guard these headers with `requires`. Actually it's a bit worse, because if we do then they are included textually under the covers, causing declarations to appear in submodules they aren't supposed to be in. llvm-svn: 240720	2015-06-25 23:22:11 +00:00
Eric Christopher	3d920eed5d	Move xtest to its own file to match the gcc header organization. llvm-svn: 239926	2015-06-17 18:42:07 +00:00
Eric Christopher	29b78091e7	Update comments on HLE, RTM, and ADX support for intrinsics. llvm-svn: 239925	2015-06-17 18:42:03 +00:00
Eric Christopher	9fc7fb274e	Update the intel intrinsic headers to use the target attribute support. This involved removing the conditional inclusion and replacing them with target attributes matching the original conditional inclusion and checks. The testcase update removes the macro checks for each file and replaces them with usage of the __target__ attribute, e.g.: int __attribute__((__target__(("sse3")))) foo(int a) { _mm_mwait(0, 0); return 4; } This usage does require the enclosing function have the requisite __target__ attribute for inlining and code generation - also for any macro intrinsic uses in the enclosing function. There's no change for existing uses of the intrinsic headers. llvm-svn: 239883	2015-06-17 07:09:32 +00:00
Eric Christopher	4d185168e9	Use a define for per-file function attributes for the Intel intrinsic headers. This is a precursor to changing them to use the new target attribute code. llvm-svn: 239882	2015-06-17 07:09:20 +00:00
Eric Christopher	5a9bec104b	Use a macro for the omnipresent attributes on header functions in Intrin.h. Saves some typing and if someone wants to change them it makes it much easier. llvm-svn: 239782	2015-06-15 23:20:35 +00:00
Luke Cheeseman	59b2d83909	This patch implements clang support for the ACLE special register intrinsics in section 10.1, __arm_{w,r}sr{,p,64}. This includes arm_acle.h definitions with builtins and codegen to support these, the intrinsics are implemented by generating read/write_register calls which get appropriately lowered in the backend based on the register string provided. SemaChecking is also implemented to fault invalid parameters. Differential Revision: http://reviews.llvm.org/D9697 llvm-svn: 239737	2015-06-15 17:51:01 +00:00
Nemanja Ivanovic	b17f1129fa	Clang support for vector quad bit permute and gather instructions through builtins This patch corresponds to review: http://reviews.llvm.org/D10095 This is for just two instructions and related builtins: vbpermq vgbbd llvm-svn: 239506	2015-06-11 06:25:36 +00:00
Bill Seurer	703e8486ec	[PowerPC] Reformat altivec.h with clang-format This revision just fixes the formatting of altivec.h. llvm-svn: 239408	2015-06-09 14:39:47 +00:00
David Majnemer	81ecbf45d4	Revert accidental commit This change was unrelated to r239170. llvm-svn: 239176	2015-06-05 18:24:55 +00:00
David Majnemer	cdffc36c11	[AST] There is no message for C++1z-style static_assert We would crash in the DeclPrinter trying to pretty-print the static_assert message. C++1z-style assertions don't have a message so we would crash. This fixes PR23756. llvm-svn: 239170	2015-06-05 18:03:58 +00:00
Bill Seurer	8be14f11ce	[PowerPC] This revision adds 68 of the missing "Predefined Functions for Vector Programming" from appendix A of the OpenPOWER ABI for Linux Supplement document. I also added tests for the new functions and updated another test that was looking for specific line numbers in error messages from altivec.h. https://llvm.org/bugs/show_bug.cgi?id=23679 http://reviews.llvm.org/D10131 llvm-svn: 239066	2015-06-04 18:45:44 +00:00
Ekaterina Romanova	2e81434552	Added doxygen comments for the intrinsics. llvm-svn: 238386	2015-05-28 01:25:25 +00:00
John Thompson	b7892ffc69	It appears these exports are needed, as wmmintrin.h includes them. llvm-svn: 238345	2015-05-27 18:26:41 +00:00
Kit Barton	5944ee2179	This patch adds support for the vector quadword add/sub instructions introduced in POWER8. These are the Clang-related changes for http://reviews.llvm.org/D9081 vadduqm vaddeuqm vaddcuq vaddecuq vsubuqm vsubeuqm vsubcuq vsubecuq All builtins are added in altivec.h, and guarded with the POWER8_VECTOR and powerpc64 macros. http://reviews.llvm.org/D9903 llvm-svn: 238145	2015-05-25 15:52:45 +00:00
Michael Kuperstein	7619004211	[X86] Add _mm256_set_m128 and its 5 variants. Differential Revision: http://reviews.llvm.org/D9855 llvm-svn: 237778	2015-05-20 07:46:52 +00:00
Michael Kuperstein	877f3cbe84	[X86] Add _mm_broadcastsd_pd intrinsic _mm_broadcastsd_pd is basically an alias for _mm_movedup_pd, however the alias is only available from AVX2 forward. llvm-svn: 237698	2015-05-19 14:49:14 +00:00
Michael Kuperstein	6168183e04	[X86] Added _mm256_bslli_epi128 and _mm256_bsrli_epi128. These two intrinsics are alternative names for _mm256_slli_si256 and _mm256_srli_si256, respectively. llvm-svn: 237693	2015-05-19 13:05:46 +00:00
Bill Schmidt	41e14c4dfa	[PPC64] Add vector pack/unpack support from ISA 2.07 This patch adds support for the following new instructions in the Power ISA 2.07: vpksdss vpksdus vpkudus vpkudum vupkhsw vupklsw These instructions are available through the vec_packs, vec_packsu, vec_unpackh, and vec_unpackl built-in interfaces. These are lane-sensitive instructions, so the built-ins have different implementations for big- and little-endian, and the instructions must be marked as killing the vector swap optimization for now. The first three instructions perform saturating pack operations. The fourth performs a modulo pack operation, which means it can be represented with a vector shuffle, and conversely the appropriate vector shuffles may cause this instruction to be generated. The other instructions are only generated via built-in support for now. I noticed during patch preparation that the macro __VSX__ was not previously predefined when the power8-vector or direct-move features are requested. This is an error, and I've corrected that here as well. Appropriate tests have been added. There is a companion patch to llvm for the rest of this support. llvm-svn: 237500	2015-05-16 01:02:25 +00:00
Richard Smith	23d8d0338e	[modules] Fix a #include cycle when building a module for our builtin headers. xmmintrin.h includes emmintrin.h and vice versa if SSE2 is enabled. We break this cycle for a modules build, and instead make the xmmintrin.h module re-export the immintrin.h module. Also included is a fix for an assert in the serialization code if a module exports another module that was declared later in the same module map. llvm-svn: 237321	2015-05-14 00:45:20 +00:00
Elena Demikhovsky	bd5c8b9be9	AVX-512: FP compare intrinsics - changed type of CC parameter from i8 to i32 according to the spec. Added FP compare intrinsics for SKX. llvm-svn: 236715	2015-05-07 11:26:36 +00:00
Elena Demikhovsky	e7d4c2e229	AVX-512: Added AVX-512 intrinsics and tests by Asaf Badouh (asaf.badouh@intel.com) llvm-svn: 236218	2015-04-30 09:24:29 +00:00
Elena Demikhovsky	35dc8c0944	AVX-512: added intrinsics for KNL and SKX by Asaf Badouh (asaf.badouh@intel.com) llvm-svn: 235986	2015-04-28 13:28:01 +00:00
Artem Belevich	4e192df778	[cuda] Added support for CUDA built-in variables. Added cuda_builtin_vars.h which implements built-in CUDA variables using __declattr(property). Fields of built-in variables (except for warpSize) are implemented using __declattr(property) which replaces read/write of a member field with a call to a getter/setter member function, in this case with appropriate NVPTX builtin. Added a test case to check diagnostics on attempt to construct or improperly access a built-in variable. Differential Revision: http://reviews.llvm.org/D9064 llvm-svn: 235448	2015-04-21 22:14:13 +00:00
Artem Belevich	a050112bba	Revert r235398 "[cuda] Added support for CUDA built-in variables." r235398 was causing buildbot break due to missing Makefile changes. llvm-svn: 235401	2015-04-21 18:36:42 +00:00
Artem Belevich	d0a2ae054f	[cuda] Added support for CUDA built-in variables. Added cuda_builtin_vars.h which implements built-in CUDA variables using __declattr(property). Fields of built-in variables (except for warpSize) are implemented using __declattr(property) which replaces read/write of a member field with a call to a getter/setter member function, in this case with appropriate NVPTX builtin. Added a test case to check diagnostics on attempt to construct or improperly access a built-in variable. Differential Revision: http://reviews.llvm.org/D9064 llvm-svn: 235398	2015-04-21 17:39:06 +00:00
Ekaterina Romanova	b929ad7b17	_mm256_blend_epi16 is being cast to __m256d instead of __m256i. Fixing this. llvm-svn: 234560	2015-04-10 02:39:45 +00:00
Ulrich Weigand	cc67344a86	[SystemZ] Add header files to Makefile / module.modulemap This should fix build-bot failures after r233804. The patch also adds a "systemz" feature, and renames the "transactional-execution" feature to "htm", since it turns out "-" is not a legal character in module feature names. llvm-svn: 233807	2015-04-01 14:15:35 +00:00
Ulrich Weigand	3a610ebf1e	[SystemZ] Support transactional execution on zEC12 The zEC12 provides the transactional-execution facility. This is exposed to users via a set of builtin routines on other compilers. This patch adds clang support to enable those builtins. In partciular, the patch: - enables the transactional-execution feature by default on zEC12 - allows to override presence of that feature via the -mhtm/-mno-htm options - adds a predefined macro __HTM__ if the feature is enabled - adds support for the transactional-execution GCC builtins - adds Sema checking to verify the __builtin_tabort abort code - adds the s390intrin.h header file (for GCC compatibility) - adds s390 sections to the htmintrin.h and htmxlintrin.h header files Since this is first use of target-specific intrinsics on the platform, the patch creates the include/clang/Basic/BuiltinsSystemZ.def file and hooks it up in TargetBuiltins.h and lib/Basic/Targets.cpp. An associated LLVM patch adds the required LLVM IR intrinsics. For reference, the transactional-execution instructions are documented in the z/Architecture Principles of Operation for the zEC12: http://publibfp.boulder.ibm.com/cgi-bin/bookmgr/download/DZ9ZR009.pdf The associated builtins are documented in the GCC manual: http://gcc.gnu.org/onlinedocs/gcc/S_002f390-System-z-Built-in-Functions.html The htmxlintrin.h intrinsics provided for compatibility with the IBM XL compiler are documented in the "z/OS XL C/C++ Programming Guide". llvm-svn: 233804	2015-04-01 12:54:25 +00:00
Elena Demikhovsky	29da2fba46	AVX-512: added clang intrinsics for logical and, or xor for 512 bits by Asaf Badouh (asaf.badouh@intel.com) llvm-svn: 233794	2015-04-01 06:54:16 +00:00
Kit Barton	8246f28237	Add Hardware Transactional Memory (HTM) Support This patch adds Hardware Transaction Memory (HTM) support supported by ISA 2.07 (POWER8). The intrinsic support is based on GCC one [1], with both 'PowerPC HTM Low Level Built-in Functions' and 'PowerPC HTM High Level Inline Functions' implemented. Along with builtins a new driver switch is added to enable/disable HTM instruction support (-mhtm) and a header with common definitions (mostly to parse the TFHAR register value). The HTM switch also sets a preprocessor builtin HTM. The HTM usage requires a recently newer kernel with PPC HTM enabled. Tested on powerpc64 and powerpc64le. This is send along a llvm patch to enabled the builtins and option switch. [1] https://gcc.gnu.org/onlinedocs/gcc/PowerPC-Hardware-Transactional-Memory-Built-in-Functions.html Phabricator Review: http://reviews.llvm.org/D8248 llvm-svn: 233205	2015-03-25 19:41:41 +00:00
Sanjay Patel	0a6da5de55	[X86, AVX2] Replace inserti128 and extracti128 intrinsics with generic shuffles This is nearly identical to the v*f128_si256 parts of r231792 and r232052. AVX2 introduced proper integer variants of the hacked integer insert/extract C intrinsics that were created for this same functionality with AVX1. This should complete the front end fixes for insert/extract128 intrinsics. Corresponding LLVM patch to follow. llvm-svn: 232109	2015-03-12 21:54:24 +00:00
Sanjay Patel	f204b00940	Replace second (hopefully unused) access of macro input argument with zero vector to be safer. Suggested by Craig Topper in D8275. This is a follow-on to r232052. llvm-svn: 232061	2015-03-12 17:23:46 +00:00
Sanjay Patel	0c351aba25	[X86, AVX] replace vextractf128 intrinsics with generic shuffles This is very much like D8088 (checked in at r231792). Now that we've replaced the vinsertf128 intrinsics, do the same for their extract twins. Differential Revision: http://reviews.llvm.org/D8275 llvm-svn: 232052	2015-03-12 15:50:36 +00:00
Kit Barton	8553bec911	Add builtins for the 64-bit vector integer arithmetic instructions added in POWER8. These are the Clang-related changes for the instructions added to LLVM in http://reviews.llvm.org/D7959. Phabricator review: http://reviews.llvm.org/D8041 llvm-svn: 231931	2015-03-11 15:57:19 +00:00
Sanjay Patel	7f6aa52e93	[X86, AVX] Replace vinsertf128 intrinsics with generic shuffles. We want to replace as much custom x86 shuffling via intrinsics as possible because pushing the code down the generic shuffle optimization path allows for better codegen and less complexity in LLVM. This is the sibling patch for the LLVM half of this change: http://reviews.llvm.org/D8086 Differential Revision: http://reviews.llvm.org/D8088 llvm-svn: 231792	2015-03-10 15:19:26 +00:00
Nemanja Ivanovic	55e757db4a	Add Clang support for PPC cryptography builtins Review: http://reviews.llvm.org/D7951 llvm-svn: 231291	2015-03-04 21:48:22 +00:00
Juergen Ributzka	9baa03fc07	Lower _mm256_broadcastsi128_si256 directly to a vector shuffle. Originally we were using the same GCC builtins to lower this AVX2 vector intrinsic. Instead we will now lower it directly to a vector shuffle. This will not only allow LLVM to generate better code, but it will also allow us to remove the GCC intrinsics. Reviewed by Andrea This is related to rdar://problem/18742778. llvm-svn: 231081	2015-03-03 17:22:53 +00:00
Dmitri Gribenko	a586ea13a4	Restore the libc++ definition of max_align_t on Apple platforms Clang has introduced ::max_align_t in stddef.h in r201729, but libc++ was already defining std::max_align_t on Darwin because there was none in the global namespace. After that Clang commit though, libc++ started defining std::max_align_t to be a typedef for ::max_align_t, which has a different definition. This changed the ABI. This commit restores the previous definition. rdar://19919394 rdar://18557982 llvm-svn: 230292	2015-02-24 01:06:22 +00:00
Filipe Cabecinhas	d74002965e	Make the _mm256_insert_epi64 definition more consistent Use long long for the epi64 argument, like the other intrinsics. NFC since this is only defined in 64-bit mode, not in 32-bit. Fix suggested by H. J. Lu! llvm-svn: 229886	2015-02-19 19:00:33 +00:00
Filipe Cabecinhas	54a2ba8b76	[Headers] Add tests for _mm256_insert_epi64 and fix its definition Summary: The definition for _mm256_insert_epi64 was taking an int, which would get truncated before being inserted in the vector. Original patch by Joshua Magee! Reviewers: bruno, craig.topper Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D7179 llvm-svn: 229811	2015-02-19 03:02:33 +00:00
Craig Topper	a462482d98	[X86] Add _mm_bslli_si128 and _mm_bsrli_si128 as aliases of _mm_slli_si128 and _mm_srli_si128. This matches Intel documentation and gcc. llvm-svn: 229066	2015-02-13 06:04:45 +00:00
Craig Topper	51e47418d4	[X86] Simplify some code and remove some -Wshadow disables from intrinsic header. llvm-svn: 229065	2015-02-13 06:04:43 +00:00
Filipe Cabecinhas	2177fc1732	Make the byte-shift SSE intrinsics emit vector shuffles which we know the backend can handle. Also removed unused builtins. Original patch by Andrea Di Biagio! Reviewers: craig.topper, nadav Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D7199 llvm-svn: 228481	2015-02-07 01:37:09 +00:00
David Majnemer	1cf22e690d	Headers: Don't use attribute keywords which aren't reserved Instead of using 'unavailable', use '__unavailable__' llvm-svn: 228087	2015-02-04 00:26:10 +00:00
Craig Topper	53565c60e7	[X86] Add other flavors of AVX512 cmpps/cmppd intrinsics. llvm-svn: 227773	2015-02-01 22:27:40 +00:00
Craig Topper	2a898bfc67	[X86] Add the AVX512 exp2a23 intrinsics. llvm-svn: 227769	2015-02-01 21:34:11 +00:00
Craig Topper	da97c20128	[X86] Add all intrinsics for scalar rsqrt28/rcp28 to avx512erintrin.h. Add parentheses around all macro arguments. llvm-svn: 227722	2015-02-01 10:15:11 +00:00
Craig Topper	c4b852a909	[X86] Flesh out more of the avx512erintrin.h file. llvm-svn: 227719	2015-02-01 08:52:55 +00:00
Craig Topper	b01fc317c1	[X86] Use macros in AVX512ER header to allow ICE to be checked for immediate argument. llvm-svn: 227716	2015-02-01 08:05:12 +00:00
Craig Topper	67826a5883	[X86] Rename _mm512_valign_epi64/32 intrinsics to _mm512_alignr_epi64/32 to match Intel docs. Make immediate argument to them an ICE. Fix mask size for the alignd version. llvm-svn: 227713	2015-02-01 07:35:40 +00:00
Craig Topper	72c7d51251	[X86] Change rounding parameter of all the AVX512 builtins to an ICE. llvm-svn: 227712	2015-02-01 07:35:35 +00:00
Craig Topper	9fee8ab4f9	[x86] Remove tab characters from avxintrin.h. NFC. llvm-svn: 227676	2015-01-31 06:33:59 +00:00
Craig Topper	459554f164	[X86] Make order consistent between 'const' and 'int' in one of the intrinsic header files. NFC llvm-svn: 227675	2015-01-31 06:31:30 +00:00
Richard Smith	99335be950	Don't use BCPL comments here, in case someone wants to use <stdatomic.h> from C89 mode. llvm-svn: 227417	2015-01-29 03:34:39 +00:00
Hans Wennborg	2e56d950ff	Intrin.h: define _XCR_XFEATURE_ENABLED_MASK Users expect to be able to use this with _xgetbv. llvm-svn: 227270	2015-01-27 23:34:35 +00:00
Craig Topper	335e218760	[X86] Add intrinsics for AVX512 128 and 256 bit integer comparison of word and byte vectors. llvm-svn: 227186	2015-01-27 09:16:29 +00:00
Craig Topper	b4789096c0	[X86] Add AVX512 integer comparison intrinsics for word and byte vectors. llvm-svn: 227079	2015-01-26 09:24:10 +00:00
Craig Topper	2f25a5a875	[X86] Add more of the AVX512 integer comparision intrinsics. This adds 128 and 256 bit vectors of dwords and qwords. llvm-svn: 227075	2015-01-26 08:11:49 +00:00
Craig Topper	4cac1c2318	[X86] Add AVX512F integer comparision intrinsics to header file. llvm-svn: 227067	2015-01-25 23:30:07 +00:00
Adam Nemet	f893edeaea	[AVX512] Add sub-vector FP extracts Analogous to AVX2, these need to be implemented as macros to properly propagate the immediate index operand. Part of <rdar://problem/17688758> llvm-svn: 226496	2015-01-19 20:12:05 +00:00
Craig Topper	f557b09f14	[x86] Mark that the AVX-512 cmpps/cmppd builtins need an ICE for the comparison immediate. This requires converting to a macro in the header file. llvm-svn: 226421	2015-01-19 01:18:19 +00:00
Adam Nemet	c0cff244fc	[AVX512] Add intrinsics for masked aligned FP loads and stores Part of <rdar://problem/17688758> llvm-svn: 226298	2015-01-16 18:51:50 +00:00
Adam Nemet	63a951eb1c	[AVX512] Add FP unpack intrinsics These are implemented with __builtin_shufflevector just like AVX. We have some tests on the LLVM side to assert that these shufflevectors do indeed generate the corresponding unpck instruction. Part of <rdar://problem/17688758> llvm-svn: 225922	2015-01-14 01:31:17 +00:00
Ben Langmuir	c67a774e17	Add [extern_c] attribute to _Builtin_intrinsics module This allows users to import this module inside an extern "C" {} block. llvm-svn: 225835	2015-01-13 21:54:32 +00:00
Chandler Carruth	032d422d2e	Effectively revert r151058 which caused Clang's unwind.h to defer to libunwind in all cases when installed. At the time, Clang's unwind.h didn't provide huge chunks of the LSB-specified unwind interface, and was generally too aenemic to use for real software. However, it has since then become a strict superset of the APIs provided by libunwind on Linux. Notably, you cannot compile llgo's libgo library against libunwind, but you can against Clang's unwind.h. So let's just use our header. =] I've checked pretty thoroughly for any incompatibilities, and I am not aware of any. An open question is whether or not we should continue to munge GNU_SOURCE here. I didn't touch that as it potentially has compatibility implications on systems I cannot easily test -- Darwin. If a Darwin maintainer can verify that this is in fact unnecessary and remove it, cool. Until then, leaving it in makes this change a no-op there, and only really relevant on Linux systems where it is pretty clearly the right way to go. llvm-svn: 224934	2014-12-29 13:29:38 +00:00
Chandler Carruth	f3cabbd424	Add a missing declaration to our unwind.h implementation. This is necessary to be fully compatible with existing software that calls into the linux unwind code. You can find documentation of this API and why it exists in the discussion abot NPTL here: https://gcc.gnu.org/ml/gcc-patches/2003-09/msg00154.html llvm-svn: 224933	2014-12-29 13:29:36 +00:00
Chandler Carruth	28daca211c	[x86] Also add the missing type casts on the returns in the sha intrinsic header file. Along with r224822, this should restore the build bots to passing. llvm-svn: 224883	2014-12-27 11:50:51 +00:00
Craig Topper	ab70789199	[x86] Add missing typecast to __v4si to sha intrinsic header file. llvm-svn: 224882	2014-12-27 07:19:25 +00:00
Craig Topper	2094d8fe88	[x86] Add the (v)cmpps/pd/ss/sd builtins to match gcc. Use them in the sse intrinsic files. This still lower to the same intrinsics as before. This is preparation for bounds checking the immediate on the avx version of the builtin so we don't pass illegal immediates into the backend. Since SSE uses a smaller size immediate its not possible to bounds check when using a shared builtin. Rather than creating a clang specific builtin for the different immediate, I decided (after consulting with Chandler) that it was better to match gcc. llvm-svn: 224879	2014-12-27 06:59:57 +00:00
Eric Christopher	c67e1b6a2a	Make sure that vec_perm is listed as a static function in altivec.h. llvm-svn: 223871	2014-12-10 00:57:43 +00:00
Reid Kleckner	baf7709055	Implement __umulh with __int128 arithmetic Use the same approach as _umul128, but just return the high half. llvm-svn: 223316	2014-12-03 23:36:14 +00:00
David Majnemer	00973ce683	FullProduct should be _FullProduct llvm-svn: 223179	2014-12-02 23:44:40 +00:00
David Majnemer	5450763dd8	Intrin: shrx_u64 should be _shrx_u64 llvm-svn: 223176	2014-12-02 23:30:26 +00:00
David Majnemer	5f9afc59f8	Intrin: Add _umul128 Implement _umul128; it provides the high and low halves of a 128-bit multiply. We can simply use our __int128 arithmetic to implement this, we generate great code for it: movq %rdx, %rax mulq %rcx movq %rdx, (%r8) retq Differential Revision: http://reviews.llvm.org/D6486 llvm-svn: 223175	2014-12-02 23:30:24 +00:00
Reid Kleckner	e35b07ad49	Intercept __crt_va_* used by MSVC "14" Moving further into the implementor's namespace is good, but now we have one more name to intercept. llvm-svn: 222473	2014-11-20 22:44:03 +00:00
Bill Schmidt	8ff672d397	[PowerPC] Enable vec_perm for long long and double vector types for VSX VSX makes the "vector long long" and "vector double" types available. This patch enables the vec_perm interface for these types. The same builtin is generated regardless of the specified type, so no additional work or testing is needed in the back end. Tests are added to ensure this builtin is generated by the front end. llvm-svn: 221988	2014-11-14 13:10:13 +00:00
Bill Schmidt	cee13a2712	[PowerPC] Add VSX builtins for vec_div This patch adds builtin support for xvdivdp and xvdivsp, along with a new test case. The builtins are accessed using vec_div in altivec.h. Builtins are listed (mostly) alphabetically there, so inserting these changed the line numbers for deprecation warnings tested in test/Headers/altivec-intrin.c. There is a companion patch for LLVM. llvm-svn: 221984	2014-11-14 12:10:51 +00:00
Bill Schmidt	9ec8cea02b	[PowerPC] Add vec_vsx_ld and vec_vsx_st intrinsics This patch enables the vec_vsx_ld and vec_vsx_st intrinsics for PowerPC, which provide programmer access to the lxvd2x, lxvw4x, stxvd2x, and stxvw4x instructions. New code in altivec.h defines these in terms of new builtins, which are themselves defined in BuiltinsPPC.def. The builtins are converted to LLVM intrinsics in CGBuiltin.cpp. Additional code is added to builtins-ppc-vsx.c to verify the correct generation of the intrinsics. Note that I moved the other VSX builtins so all VSX builtins will be alphabetical in their own section in BuiltinsPPC.def. There is a companion patch for LLVM. llvm-svn: 221768	2014-11-12 04:19:56 +00:00
Craig Topper	8c7f251e98	Add FSGSBASE intrinsics to x86 intrinsic headers. llvm-svn: 221130	2014-11-03 06:51:41 +00:00
Craig Topper	554797f255	Remove definitions from Intrin.h that already exist in one of the other x86 intrinsic headers. Add a run line with Broadwell as the cpu type to ms-intrin.cpp test to catch some of these in the future. llvm-svn: 221127	2014-11-03 04:19:58 +00:00
Craig Topper	e1c664b136	Add _lzcnt_u32 and _lzcnt_u64 to lzcntintrin.h to match Intel documentation names for these intrinsics. llvm-svn: 221066	2014-11-01 22:50:57 +00:00
Craig Topper	a52e0d7cc0	Avoid undefined behavior in the x86 bmi header file by explicitly checking for 0 before calling __builtin_ctz. Without this the optimizers may take advantage of the undefined behavior and produce incorrect results. LLVM itself still needs to be taught to merge the zero check into the llvm.cttz with defined zero behavior. llvm-svn: 221065	2014-11-01 22:50:54 +00:00
Craig Topper	3ca55d9c41	Avoid undefined behavior in the x86 lzcnt header file by explicitly checking for 0 before calling __builtin_clz. Without this the optimizers may take advantage of the undefined behavior and produce incorrect results. LLVM itself still needs to be taught to merge the zero check into the llvm.ctlz with defined zero behavior. llvm-svn: 221064	2014-11-01 22:25:23 +00:00
Bill Schmidt	691e01d94e	[PowerPC] Initial VSX intrinsic support, with min/max for vector double Now that we have initial support for VSX, we can begin adding intrinsics for programmer access to VSX instructions. This patch performs the necessary enablement in the front end, and tests it by implementing intrinsics for minimum and maximum using the vector double data type. The main change in the front end is to no longer disallow "vector" and "double" in the same declaration (lib/Sema/DeclSpec.cpp), but "vector" and "long double" must still be disallowed. The new intrinsics are accessed via vec_max and vec_min with changes in lib/Headers/altivec.h. Note that for v4f32, we already access corresponding VMX builtins, but with VSX enabled we should use the forms that allow all 64 vector registers. The new built-ins are defined in include/clang/Basic/BuiltinsPPC.def. I've added a new test in test/CodeGen/builtins-ppc-vsx.c that is similar to, but much smaller than, builtins-ppc-altivec.c. This allows us to test VSX IR generation without duplicating CHECK lines for the existing bazillion Altivec tests. Since vector double is now legal when VSX is available, I've modified the error message, and changed where we test for it and for vector long double, since the target machine isn't visible in the old place. This serendipitously removed a not-pertinent warning about 'long' being deprecated when used with 'vector', when "vector long double" is encountered and we just want to issue an error. The existing tests test/Parser/altivec.c and test/Parser/cxx-altivec.cpp have been updated accordingly, and I've added test/Parser/vsx.c to verify that "vector double" is now legitimate with VSX enabled. There is a companion patch for LLVM. llvm-svn: 220989	2014-10-31 19:19:24 +00:00
Saleem Abdulrasool	a25fbef088	CodeGen: add __readfsdword builtin The Windows NT SDK uses __readfsdword and declares it as a compiler provided builtin (#pragma intrinsic(__readfsword). Because intrin.h is not referenced by winnt.h, it is not possible to provide an out-of-line definition for the intrinsic. Provide a proper compiler builtin definition. llvm-svn: 220859	2014-10-29 16:35:41 +00:00
NAKAMURA Takumi	a267847538	<float.h>: Don't seek #include_next if -ffreestanding for targeting mingw. llvm-svn: 220356	2014-10-22 01:25:49 +00:00
Hans Wennborg	818514b718	vadefs.h: be even more conservative and only define the macros if already defined llvm-svn: 219745	2014-10-14 23:20:25 +00:00
Hans Wennborg	752b789e7b	Sort files list in lib/Headers/CMakeLists.txt majnemer pointed out that vadefs.h was added in the wrong place. Might as well sort the rest too. llvm-svn: 219743	2014-10-14 23:15:43 +00:00
Hans Wennborg	adfd7f6ef4	MS Compat: interpose vadefs.h to fix definitions of _crt_va_{start,end,arg} (PR21247) Differential revision: http://reviews.llvm.org/D5784 llvm-svn: 219740	2014-10-14 22:35:42 +00:00
Robert Khasanov	33e7685b2a	Added new headers to CMakeLists.txt. Fix for rev219319 llvm-svn: 219325	2014-10-08 17:37:51 +00:00
Robert Khasanov	b9f3a911c9	[AVX512] Added VPCMPEQ intrinisics to headers. Added tests. Patch by Maxim Blumenthal <maxim.blumenthal@intel.com> llvm-svn: 219319	2014-10-08 17:18:13 +00:00
Bill Schmidt	cad3a5f7d4	[PATCH][Power] Fix (and deprecate) vec_lvsl and vec_lvsr for little endian The use of the vec_lvsl and vec_lvsr interfaces are discouraged for little endian targets since Power8 hardware is a minimum requirement, and Power8 provides reasonable performance for unaligned vector loads and stores. Up till now we have not provided "correct" (i.e., big- endian-compatible) code generation for these interfaces, as to do so produces poorly performing code. However, this has become the source of too many questions. With this patch, LLVM will now produce compatible code for these interfaces, but will also produce a deprecation warning message for PPC64LE when one of them is used. This should make the porting direction clearer to programmers. A similar patch has recently been committed to GCC. This patch includes a test for the warning message. There is a companion patch that adds two unit tests to projects/test-suite. llvm-svn: 219137	2014-10-06 19:02:20 +00:00
Hal Finkel	6970ac8b0a	Add an implementation of C11's stdatomic.h Adds a Clang-specific implementation of C11's stdatomic.h header. On systems, such as FreeBSD, where a stdatomic.h header is already provided, we defer to that header instead (using our __has_include_next technology). Otherwise, we provide an implementation in terms of our __c11_atomic_* intrinsics (that were created for this purpose). C11 7.1.4p1 requires function declarations for atomic_thread_fence, atomic_signal_fence, atomic_flag_test_and_set, atomic_flag_test_and_set_explicit, and atomic_flag_clear, and requires that they have external linkage. Accordingly, we provide these declarations, but if a user elides the shadowing macros and uses them, then they must have a libc (or similar) that actually provides definitions. atomic_flag is implemented using _Bool as the underlying type. This is consistent with the implementation provided by FreeBSD and also GCC 4.9 (at least when __GCC_ATOMIC_TEST_AND_SET_TRUEVAL == 1). Patch by Richard Smith (rebased and slightly edited by me -- Richard said I should drive at this point). llvm-svn: 218957	2014-10-03 04:29:40 +00:00
Richard Smith	ef99e4d88a	Fix interaction of max_align_t and modules. When building with modules enabled, we were defining max_align_t as a typedef for a different anonymous struct type each time it was included, resulting in an error if <stddef.h> is not covered by a module map and is included more than once in the same modules-enabled compilation of C11 or C++11 code. llvm-svn: 218931	2014-10-03 00:31:35 +00:00
Joerg Sonnenberger	2960178a77	Fix trailing commas in AMD define. llvm-svn: 218825	2014-10-01 21:22:17 +00:00
Joerg Sonnenberger	e028e05a7e	Add the various signature macros. llvm-svn: 218824	2014-10-01 21:21:42 +00:00
Joerg Sonnenberger	cf0740454d	Rename bit_RDRAND to bit_RDRND to match GCC's version of this header. llvm-svn: 218823	2014-10-01 21:21:16 +00:00
Robert Khasanov	ea13042cf2	[x86] Fixed argument types in intrinsics: _addcarryx_u64 _addcarry_u64 _subborrow_u64 Thanks Pasi Parviainen for notice. llvm-svn: 218376	2014-09-24 06:45:23 +00:00
Akira Hatanaka	416efb5f90	Fix bugs in cpuid.h. This commit makes two changes: - Remove the push and pop instructions that were saving and restoring %ebx before and after cpuid in 32-bit pic mode. We were doing this to ensure we don't lose the GOT address in pic register %ebx, but this isn't necessary because the GOT address is kept in a virtual register. - In 64-bit mode, preserve base register %rbx around cpuid. This fixes PR20311 and rdar://problem/17686779. llvm-svn: 218173	2014-09-20 01:31:09 +00:00
Robert Khasanov	2c589bcc5e	[x86] Add _addcarry_u{32\|64} and _subborrow_u{32\|64}. They are added to adxintrin.h but outside __ADX__ block. These intrinics generates adc and sbb correspondingly that were available before ADX llvm-svn: 218118	2014-09-19 10:29:22 +00:00
Robert Khasanov	83c419b349	[x86] Added _addcarryx_u32, _addcarryx_u64 intrinsics llvm-svn: 218117	2014-09-19 10:17:06 +00:00
Yi Kong	a8833f0c28	arm_acle: Fix error in ROR implementation The logic in calculating the rotate amount was flawed. Thanks Pasi Parviainen for pointing out! llvm-svn: 216669	2014-08-28 15:25:52 +00:00
Yi Kong	623393f31e	arm_acle: Implement data processing intrinsics Summary: ACLE 2.0 section 9.2 defines the following "miscellaneous data processing intrinsics": `__clz`, `__cls`, `__ror`, `__rev`, `__rev16`, `__revsh` and `__rbit`. `__clz` has already been implemented in the arm_acle.h header file. The rest are not supported yet. This patch completes ACLE data processing intrinsics. Reviewers: t.p.northover, rengolin Reviewed By: rengolin Subscribers: aemerson, mroth, llvm-commits Differential Revision: http://reviews.llvm.org/D4983 llvm-svn: 216658	2014-08-28 09:44:07 +00:00
Yi Kong	6891746cd8	arm_acle: Add mappings for dbg intrinsic This completes all ACLE hint intrinsics. llvm-svn: 216453	2014-08-26 12:48:11 +00:00
Yi Kong	0705e0065e	arm_acle: Implement swap intrinsic Insert the LDREX/STREX instruction sequence specified in ARM ACLE 2.0, as SWP instruction is deprecated since ARMv6. llvm-svn: 216446	2014-08-26 09:50:54 +00:00
Yi Kong	70cf4c626e	arm_acle.h: Small cleanup Since __SIZEOF_LONG_LONG__ is always defined as 8 on ARM targets, there's no point in checking this. NFC. Patch by Moritz Roth. llvm-svn: 215697	2014-08-15 08:53:22 +00:00
Adam Nemet	2278fcbf0c	[AVX512] Add FMA intrinsics Part of <rdar://problem/17688758> llvm-svn: 215666	2014-08-14 17:17:57 +00:00
Yi Kong	45a09319bf	ARM: Add mappings for ACLE prefetch intrinsics Implement __pld, __pldx, __pli and __plix builtin intrinsics as specified in ARM ACLE 2.0. llvm-svn: 215599	2014-08-13 23:20:15 +00:00
Adam Nemet	4abc07cb75	[AVX512] Add intrinsics for FP scalar broadcasts Similar approach to the set1 intrinsics is used: implement in terms of vector initializers and then ensure with an LLVM test that a broadcast is generated at the end. Part of <rdar://problem/17688758> llvm-svn: 215486	2014-08-13 00:29:01 +00:00
Adam Nemet	5bf7baa938	[AVX512] Add intrinsic for valignd/q Note that similar to palingr, we could further optimize these to emit shufflevector when the shift count is <=64. This however does not change the overall design that unlike palignr we would still need the LLVM intrinsic corresponding to this intruction to handle the >64 cases. (palignr uses the psrldq intrinsic in this case.) llvm-svn: 214891	2014-08-05 17:28:23 +00:00
Bill Schmidt	ccbe0a8022	[PPC64LE] Fix wrong IR for vec_sld and vec_vsldoi My original LE implementation of the vsldoi instruction, with its altivec.h interfaces vec_sld and vec_vsldoi, produces incorrect shufflevector operations in the LLVM IR. Correct code is generated because the back end handles the incorrect shufflevector in a consistent manner. This patch and a companion patch for LLVM correct this problem by removing the fixup from altivec.h and the corresponding fixup from the PowerPC back end. Several test cases are also modified to reflect the now-correct LLVM IR. The vec_sums and vec_vsumsws interfaces in altivec.h are also fixed, because they used vec_perm calls intended to be recognized as vsldoi instructions. These vec_perm calls are now replaced with code that more clearly shows the intent of the transformation. llvm-svn: 214801	2014-08-04 23:21:26 +00:00
Adam Nemet	da82bcc4dd	[AVX512] Add unaligned FP load intrinsics Part of <rdar://problem/17688758> llvm-svn: 214380	2014-07-31 04:00:39 +00:00
Adam Nemet	2db1d2fb32	[AVX512] Add intrinsic for knot Part of <rdar://problem/17688758> llvm-svn: 214316	2014-07-30 16:51:27 +00:00
Adam Nemet	c871ff95f3	[AVX512] Add some of the FP cast intrinsics Part of <rdar://problem/17688758> llvm-svn: 214315	2014-07-30 16:51:24 +00:00
Adam Nemet	f42e7a274a	[AVX512] Add set1 intrinsics (Dropped the byte and word variants from the patch. Turns out these are not part of AVX512F but only AVX512BW/VL.) Part of <rdar://problem/17688758> llvm-svn: 214314	2014-07-30 16:51:22 +00:00
Joerg Sonnenberger	3d9478cf3a	Change __INTx_TYPE__ to be always signed. This changes the value for char-based types from "char" to "signed char". Adjust stdint.h to use __INTx_TYPE__ directly without prefixing it with signed and to use __UINTx_TYPE__ for unsigned ones. The value of __INTx_TYPE__ now matches GCC. llvm-svn: 214119	2014-07-28 21:06:22 +00:00
Adam Nemet	fce1ad0b99	[AVX512] Add non-masking FP store intrinsics Part of <rdar://problem/17688758> llvm-svn: 214099	2014-07-28 17:14:45 +00:00
Adam Nemet	a3ebe6214b	[AVX512] Add FP add/sub/mul intrinsics Part of <rdar://problem/17688758> llvm-svn: 214098	2014-07-28 17:14:42 +00:00
Adam Nemet	0d5bb5530d	[AVX512] Reorder functions in avx512fintrin.h There is no functional change here. The idea is to have a similar order and categories of functions that we have in avxintrin.h. llvm-svn: 214097	2014-07-28 17:14:40 +00:00
Adam Nemet	9a3ea60a2c	[AVX512] Bring the formatting of avx512fintrin.h closer to avxintrin.h llvm-svn: 214096	2014-07-28 17:14:38 +00:00
Yi Kong	cd08139865	Add module map entry for ARM ACLE header file llvm-svn: 213731	2014-07-23 09:00:21 +00:00
Elena Demikhovsky	bd1a49bf81	AVX-512: I added new headers to makefiles. It should resolve tests fail. If it will not, I'm reverting the both commits. llvm-svn: 213645	2014-07-22 12:08:25 +00:00
Elena Demikhovsky	fcc6df310d	AVX-512: Added intrinsics to clang. The set is small, that what I have right now. Everybody is welcome to add more. llvm-svn: 213641	2014-07-22 11:31:39 +00:00
Viktor Kutuzov	99400a5a34	Revert D3908 due to issues on Mac platforms llvm-svn: 213450	2014-07-19 05:58:38 +00:00
Yi Kong	28d7b02687	ARM: Add ACLE memory barrier intrinsic mapping llvm-svn: 213261	2014-07-17 12:45:17 +00:00
Yi Kong	472e521cec	ARM: Add NOP intrinsic mapping in arm_acle.h llvm-svn: 212950	2014-07-14 15:32:29 +00:00
Saleem Abdulrasool	07257fe14e	Headers: add hint intrinsics to arm_acle.h This adds the ARM ACLE hint intrinsic wrappers to arm_acle.h. These need to be protected with a !defined(_MSC_VER) since MSVC (and thus clang in compatibility mode) provide these wrappers as proper builtin intrinsics. llvm-svn: 212891	2014-07-12 23:27:26 +00:00
Yi Kong	4e00ce7d0c	Improve comments of ARM ACLE header file and tests Include section number in ARM ACLE specification for easier navigation. llvm-svn: 212887	2014-07-12 22:48:13 +00:00
Viktor Kutuzov	63537656c6	Add clang headers that fix machine-dependent definitions on FreeBSD 9.2 Differential Revision: http://reviews.llvm.org/D3908 llvm-svn: 212689	2014-07-10 08:43:39 +00:00
Nico Weber	a62cffae52	Don't pull in setjmp.h in -ffreestanding compiles. Also provide _setjmpex(). r200243 put in _setjmp() and _setjmpex() behind a comment since jmp_buf wasn't available. r200344 added jmp_buf and put in _setjmp(), but missed _setjmpex(). llvm-svn: 212557	2014-07-08 18:34:46 +00:00
Nico Weber	1287091373	Replace a few // comments with /**/ comments in headers, for consistency. llvm-svn: 212556	2014-07-08 18:29:27 +00:00
Saleem Abdulrasool	c4ebb129b7	Headers: conditionalise more declarations Protect MMX specific declarations under a __MMX__ guard. This header can be included on non-x86 architectures (e.g. ARM) which do not support the MMX ISA. Use the preprocessor to prevent these declarations from being processed. llvm-svn: 212512	2014-07-08 05:46:04 +00:00
Saleem Abdulrasool	60df0615b6	Headers: mark arm_acle.h with extern "C" Although the functions are marked as always_inline, the compiler with which they are used may not honour the extended attributes and emit them as functions. In such a case, indicate that they should have extern "C" linkage and should not be mangled in C++ style if used within C++. llvm-svn: 212511	2014-07-08 05:46:00 +00:00
Renato Golin	47843efcf6	Add the __qdbl intrinsic to the arm_acle.h header Patch by: Moritz Roth llvm-svn: 212264	2014-07-03 10:14:52 +00:00
Yaron Keren	672efea2e9	Added standard macro guard. In case __GNUC_VA_LIST was not defined or defined identically before there will not be any change in functionality. MinGW-w64 defines __GNUC_VA_LIST as #define __GNUC_VA_LIST which is different than the definition here, causing a warning without the guard. llvm-svn: 212183	2014-07-02 15:25:03 +00:00
Andrea Di Biagio	eb606a3c27	[x86] Add Clang support for intrinsic __rdpmc. This patch adds intrinsic __rdpmc to header file 'ia32intrin.h'. Intrinsic __rdmpc can be used to read performance monitoring counters. It is implemented as a direct call to __builtin_ia32_rdpmc. It takes as input a value representing the index of the performance counter to read. The value of the performance counter is then returned as a unsigned 64-bit quantity. llvm-svn: 212053	2014-06-30 18:23:58 +00:00
Yi Kong	a44c4d7173	Introduce arm_acle.h supporting existing LLVM builtin intrinsics Summary: This patch introduces ACLE header file, implementing extensions that can be directly mapped to existing Clang intrinsics. It implements for both AArch32 and AArch64. Reviewers: t.p.northover, compnerd, rengolin Reviewed By: compnerd, rengolin Subscribers: rnk, echristo, compnerd, aemerson, mroth, cfe-commits Differential Revision: http://reviews.llvm.org/D4296 llvm-svn: 211962	2014-06-27 21:25:42 +00:00
Saleem Abdulrasool	702eefed9a	Headers: be a bit more careful about inline asm Conditionally include x86intrin.h if we are building for x86 or x86_64. Conditionalise definition of inline assembly routines which use x86 or x86_64 inline assembly. This is needed as clang can target Windows on ARM where these definitions may be included into user code. llvm-svn: 211716	2014-06-25 16:48:40 +00:00
Saleem Abdulrasool	114efe0dc8	CodeGen: improve ms instrincics support Add support for _InterlockedCompareExchangePointer, _InterlockExchangePointer, _InterlockExchange. These are available as a compiler intrinsic on ARM and x86. These are used directly by the Windows SDK headers without use of the intrin header. llvm-svn: 211216	2014-06-18 20:51:10 +00:00
Bill Schmidt	1cf7c64fa5	[PPC64LE] Run some existing Altivec tests on powerpc64le as well There are several Altivec tests that formerly ran only on big-endian targets (and in some cases only on 32-bit targets). It is useful to verify these on little-endian targets as well. While testing these, I discovered a typo in <altivec.h>. This is also fixed by this patch. llvm-svn: 210928	2014-06-13 18:30:06 +00:00
Bill Schmidt	56a6967000	[PPC64LE] Fix vec_sld and vec_vsldoi for little endian The vec_sld and vec_vsldoi interfaces perform a left-shift on vector arguments for both big and little endian. However, because they rely on the vec_perm interface which is endian-dependent, the permutation vector needs to be reversed for LE to get the proper shift direction. I've added some extra testing for these interfaces for LE in the builtins-ppc-altivec.c. llvm-svn: 210657	2014-06-11 15:48:46 +00:00
Bill Schmidt	7f6596bb13	[PPC64LE] Implement little-endian semantics for vec_sums The PowerPC vsumsws instruction, accessed via vec_sums, is defined architecturally with a big-endian bias, in that the second input vector and the result always reference big-endian element 3 (little-endian element 0). For ease of porting, the programmer wants elements 3 in both cases. To provide this semantics, for little endian we generate a permute for the second input vector prior to the vsumsws instruction, and generate a permute for the result vector following the vsumsws instruction. The correctness of this code is tested by the new sums.c test added in a previous patch, as well as the modifications to builtins-ppc-altivec.c in the present patch. llvm-svn: 210449	2014-06-09 03:31:47 +00:00
Bill Schmidt	d7c53a91df	[PPC64LE] Implement little-endian semantics for vec_unpack[hl] The PowerPC vector-unpack-high and vector-unpack-low instructions are defined architecturally with a big-endian bias, in that the vector element numbering is assumed to be "left to right" regardless of whether the processor is in big-endian or little-endian mode. This effectively reverses the meaning of "high" and "low." Such a definition is unnatural for little-endian code generation. To facilitate ease of porting, the vec_unpackh and vec_unpackl interfaces are designed to use natural element ordering, so that elements are numbered according to little-endian design principles when code is generated for a little-endian target. The desired semantics can be achieved by using the opposite instruction for little-endian mode. That is, when a call to vec_unpackh appears in the code, a vector-unpack-low is generated, and when a call to vec_unpackl appears in the code, a vector-unpack-high is generated. The correctness of this code is tested by the new unpack.c test added in a previous patch, as well as the modifications to builtins-ppc-altivec.c in the present patch. Note that these interfaces were originally incorrectly implemented when they take a vector pixel argument. This patch corrects this implementation for both big- and little-endian code generation. llvm-svn: 210391	2014-06-07 02:20:52 +00:00
Bill Schmidt	7f0a5c5141	[PPC64LE] Update builtins-ppc-altivec.c for PPC64 and PPC64LE The Altivec builtin test case test/CodeGen/builtins-ppc-altivec.c has always been executed only for 32-bit PowerPC. These tests are equally valid for 64-bit PowerPC. This patch updates the test to be run for three targets: powerpc-unknown-unknown, powerpc64-unknown-unknown, and powerpc64le-unknown-unknown. The expected code generation changes for some of the Altivec builtins for little endian, so this patch adds new CHECK-LE variants to the test for the powerpc64le target. These tests satisfy the testing requirements for some previous patches committed over the last couple of days for lib/Headers/altivec.h: r210279 for vec_perm, r210337 for vec_mul[eo], and r210340 for vec_pack. llvm-svn: 210384	2014-06-06 23:12:00 +00:00
Bill Schmidt	8a7b4f18bd	[PPC64LE] Implement little-endian semantics for vec_pack family The PowerPC vector-pack instructions are defined architecturally with a big-endian bias, in that the vector element numbering is assumed to be "left to right" regardless of whether the processor is in big-endian or little-endian mode. This definition is unnatural for little-endian code generation. To facilitate ease of porting, the vec_pack and related interfaces are designed to use natural element ordering, so that elements are numbered according to little-endian design principles when code is generated for a little-endian target. The vec_pack calls are implemented as calls to vec_perm, specifying selection of the odd-numbered vector elements. For little endian, this means the odd-numbered elements counting from the right end of the register. Since the underlying instructions count from the left end, we must instead select the even-numbered vector elements for little endian to achieve the desired semantics. The correctness of this code is tested by the new pack.c test added in a previous patch. I plan to later make the existing ppc32 Altivec compile-time tests work for ppc64 and ppc64le as well. llvm-svn: 210340	2014-06-06 15:10:47 +00:00
Bill Schmidt	7c0114f6e3	[PPC64LE] Implement little-endian semantics for vec_mul[eo] The PowerPC vector-multiply-even and vector-multiply-odd instructions are defined architecturally with a big-endian bias, in that the vector element numbering is assumed to be "left to right" regardless of whether the processor is in big-endian or little-endian mode. This definition is unnatural for little-endian code generation. To facilitate ease of porting, the vec_mule and vec_mulo interfacs are designed to use natural element ordering, so that elements are numbered according to little-endian design principles when code is generated for a little-endian target. The desired semantics can be achieved by using the opposite instruction for little-endian mode. That is, when a call to vec_mule appears in the code, a vector-multiply-odd is generated, and when a call to vec_mulo appears in the code, a vector-multiply-even is generated. The correctness of this code is tested by the new mult-even-odd.c test added in a previous patch. I plan to later make the existing ppc32 Altivec compile-time tests work for ppc64 and ppc64le as well. llvm-svn: 210337	2014-06-06 14:45:06 +00:00
Bill Schmidt	f7e289c0f2	[PPC64LE] Implement little-endian semantics for vec_perm The PowerPC vperm (vector permute) instruction is defined architecturally with a big-endian bias, in that the two input vectors are assumed to be concatenated "left to right" and the elements of the combined input vector are assumed to be numbered from "left to right" (i.e., with element 0 referencing the high-order element). This definition is unnatural for little-endian code generation. To facilitate ease of porting, the vec_perm interface is designed to use natural element ordering, so that elements are numbered according to little-endian design principles when code is generated for a little-endian target. The desired semantics can be achieved with the vperm instruction provided that the two input vector registers are reversed, and the permute control vector is complemented. The complementing is performed using an xor with a vector containing all one bits. Only the rightmost 5 bits of each element of the permute control vector are relevant, so it would be possible to complement the vector with respect to a <16xi8> vector containing all 31s. However, when the permute control vector is not a constant, using 255 instead has the advantage that the vec_xor can be recognized during code generation as a vnor instruction. (Power8 introduces a vnand instruction which could alternatively be generated.) The correctness of this code is tested by the new perm.c test added in a previous patch. I plan to later make the existing ppc32 Altivec compile-time tests work for ppc64 and ppc64le as well. llvm-svn: 210279	2014-06-05 19:07:40 +00:00
Adam Nemet	286ae08e7d	Implement AVX1 vbroadcast intrinsics with vector initializers These intrinsics are special because they directly take a memory operand (AVX2 adds the register counterparts). Typically, other non-memop intrinsics take registers and then it's left to isel to fold memory operands. In order to LICM intrinsics directly reading memory, we require that no stores are in the loop (LICM) or that the folded load accesses constant memory (MachineLICM). When neither is the case we fail to hoist a loop-invariant broadcast. We can work around this limitation if we expose the load as a regular load and then just implement the broadcast using the vector initializer syntax. This exposes the load to LICM and other optimizations. At the IR level this is translated into a series of insertelements. The sequence is already recognized as a broadcast so there is no impact on the quality of codegen. _mm256_broadcast_pd and _mm256_broadcast_ps are not updated by this patch because right now we lack the DAG-combiner smartness to recover the broadcast instructions. This will be tackled in a follow-on. There will be completing changes on the LLVM side to remove the LLVM intrinsics and to auto-upgrade bitcode files. Fixes <rdar://problem/16494520> llvm-svn: 209846	2014-05-29 20:47:29 +00:00
Sanjay Patel	1585fb94ab	added Intel's BMI intrinsic variants (fixes PR19431 - http://llvm.org/bugs/show_bug.cgi?id=19431) llvm-svn: 209769	2014-05-28 20:26:57 +00:00
Akira Hatanaka	5d28ea1451	Fix a bug in xmmintrin.h. The last step of _mm_cvtps_pi16 should use _mm_packs_pi32, which is a function that reads two __m64 values and packs four 32-bit values into four 16-bit values. <rdar://problem/16873717> llvm-svn: 209489	2014-05-23 00:38:07 +00:00
Timur Iskhodzhanov	a27b044166	Define the InterlockedCompareExchange64 intrinsic on 32-bits too llvm-svn: 208699	2014-05-13 13:59:05 +00:00
Filipe Cabecinhas	5d289b48b1	Patched clang to emit x86 blends as shufflevectors. Summary: Most of the clang header patch by Simon Pilgrim @ SCEE. Also fixed (or added) clang tests for these intrinsics. LLVM tests to make sure we get the blend instruction out of these shufflevectors are at http://reviews.llvm.org/D3600 Reviewers: eli.friedman, craig.topper, rafael Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D3601 llvm-svn: 208664	2014-05-13 02:37:02 +00:00
Nico Weber	272bcf6768	Let stddef.h respect __need_{wchar_t, size_t, NULL, ptrdiff_t, wint_t}. glibc expects that stddef.h only defines a single thing if either of these defines is set. For example, before this change, a C file containing #include <stdlib.h> int ptrdiff_t = 0; would compile with gcc but not with clang. Now it compiles with clang too. This also fixes PR12997, where older versions of the Linux headers would define NULL incorrectly, and glibc would define __need_NULL and expect stddef.h to redefine NULL with the correct definition. llvm-svn: 207606	2014-04-30 04:35:09 +00:00
Nico Weber	f077c51a70	Revert r207482; I fail at reading IRC. llvm-svn: 207483	2014-04-29 01:25:49 +00:00
Nico Weber	8af28c1e61	Let stddef.h redefine NULL if __need_NULL is set, as needed by glibc, PR12997. See the bug and the cfe-commits thread "[patch] Let stddef.h redefine NULL if __need_NULL is set" for discussion. Fixes PR12997 and is similar to the __need_wint_t bits already in this file. llvm-svn: 207482	2014-04-29 01:19:21 +00:00
Hans Wennborg	ac156e2225	Intrin.h: remove __rdtsc and __rdtscp declarations Since r207132, these are defined in ia32intrin.h. llvm-svn: 207134	2014-04-24 18:40:06 +00:00
Andrea Di Biagio	7ceec07cf6	[X86] Add Clang support for intrinsics __rdtsc and __rdtscp. This patch: 1. Adds a definition for two new GCCBuiltins in BuiltinsX86.def: __builtin_ia32_rdtsc; __builtin_ia32_rdtscp; 2. Replaces the already existing definition of intrinsic __rdtsc in ia32intrin.h with a simple call to the new GCC builtin __builtin_ia32_rdtsc. 3. Adds a definition for the new intrinsic __rdtscp in ia32intrin.h llvm-svn: 207132	2014-04-24 18:26:35 +00:00
Ben Langmuir	47d1ca4838	Rename lib/Headers/module.map to module.modulemap Don't install a file using the legacy spelling. llvm-svn: 206431	2014-04-17 00:52:48 +00:00
Reid Kleckner	6df5254d6f	intrin.h: Fix up bugs in the cr3 and msr intrinsics Don't include input and output regs in clobbers. Prefix some identifiers with __. Add a memory constraint to __readcr3 to prevent reordering. This constraint is heavy handed, but conservatively correct. Thanks to PaX Team for the suggestions. llvm-svn: 205778	2014-04-08 17:49:16 +00:00
Reid Kleckner	592dc61acf	intrin.h: Implement __readmsr, __readcr3, and __writecr3 Fixes PR19301. Based on a patch from Steven Graf! llvm-svn: 205751	2014-04-08 00:28:22 +00:00
Alexey Volkov	ae43aae96a	Added _rdtsc intrinsics by Robert Khasanov Differential Revision: http://llvm-reviews.chandlerc.com/D3212 llvm-svn: 205172	2014-03-31 08:08:46 +00:00
Tim Northover	fe7a445bf7	Install: add arm_neon.h header back I'd gone too far pruning aarch64_simd.h this time and took out one instance of arm_neon.h. This should restore us to the status quo. llvm-svn: 205111	2014-03-29 17:35:34 +00:00
Tim Northover	dca92dbc82	Remove stray references to aarch64_simd.h They were causing the autotools builds to fail. llvm-svn: 205103	2014-03-29 15:21:06 +00:00
Tim Northover	a2ee433c8d	ARM64: initial clang support commit. This adds Clang support for the ARM64 backend. There are definitely still some rough edges, so please bring up any issues you see with this patch. As with the LLVM commit though, we think it'll be more useful for merging with AArch64 from within the tree. llvm-svn: 205100	2014-03-29 15:09:45 +00:00
Reid Kleckner	7dd8bc0a84	Intrin.h: Implement _InterlockedExchangePointer llvm-svn: 204827	2014-03-26 16:09:48 +00:00
Hans Wennborg	a316933e09	MS intrinsics: __interlockedbittestandset(64) (PR19054) llvm-svn: 203816	2014-03-13 17:05:09 +00:00
Hans Wennborg	d9be72ec44	MS intrinsics: implement the __movs and __stos intrinsics (PR19054) llvm-svn: 203722	2014-03-12 22:00:32 +00:00
Hans Wennborg	a4421e03fa	MS intrinsics: implement __readgs{byte,word,dword,qword} (PR19054) llvm-svn: 203715	2014-03-12 21:09:05 +00:00
Hans Wennborg	dd0f5304f6	MS intrinsics: don't declare __readeflags and __writeeflags in Intrin.h They're already defined in ia32intrin.h, and this would cause including Intrin.h in 64-bit mode to fail because of conflicting types. Update ms-intrin.cpp to also run in 64-bit mode to catch things like this. llvm-svn: 203714	2014-03-12 21:09:03 +00:00
David Majnemer	1e57976ec0	Headers: Provide an ABI compatible max_align_t when _MSC_VER is defined Summary: Our usual definition of max_align_t wouldn't match up with MSVC if it was used in a template argument. Reviewers: chandlerc, rsmith, rnk Reviewed By: chandlerc CC: cfe-commits Differential Revision: http://llvm-reviews.chandlerc.com/D2924 llvm-svn: 202911	2014-03-04 23:43:48 +00:00
Roman Divacky	b8322b13f8	The wmmintrin.h header includes two different sub-headers: one for AES support and one for PCLMUL support. The current immintrin.h header only includes wmmintrin.h if AES support is enabled. It should include it if either AES or PCLMUL is enabled (GCC's version of immintrin.h does this). Patch by John Baldwin! llvm-svn: 202871	2014-03-04 18:26:12 +00:00
Argyrios Kyrtzidis	7ffeea4ef3	[CMake] Add the newly introduced compiler header. llvm-svn: 202792	2014-03-04 06:28:23 +00:00
Alexey Bataev	af02c1c003	Fix for r202778 - Implement __readeflags and __writeeflags intrinsics (renamed res to __res) llvm-svn: 202784	2014-03-04 03:42:58 +00:00
Alexey Bataev	7cab007902	Implement __readeflags and __writeeflags intrinsics llvm-svn: 202778	2014-03-04 03:03:03 +00:00
Warren Hunt	0dc28ea301	[_mm_prefetch] Returning previously deleted comment. No functional change. It's unclear if the word FIXME is relevant given that the macro behaves as intended. llvm-svn: 201920	2014-02-22 00:47:24 +00:00
Warren Hunt	20e4a5d2af	Reapply 201734 but with appropriate gcc compatibility Because GCC incorrectly defines _mm_prefetch to take anything that casts to void, people have started using that behavior. The previous patch that made _mm_prefetch actually take a const char broke compatibility with existing code. This update to the patch leaves the macro that defines _mm_prefetch with the (void*) cast when _MSC_VER is not defined. llvm-svn: 201901	2014-02-21 23:08:53 +00:00
Daniel Jasper	2f0f297bdb	Revert r201734 and r201742. This breaks backwards compatibility with existing code. Previously, this was defined as #define _mm_prefetch(a, sel) (__builtin_prefetch((void )(a), 0, (sel))) Which basically accepts any pointer. Changing this to char simply breaks a lot of existing code. I have tried changing char* to "const void*", which seems to be the right thing as per Intel specification this should work on basically any pointer. However, apparently this breaks windows compatibility (because of a conflicting declaration in windows.h). So, we probably need to #ifdef this based on whether clang is compiling for windows. According to Chandler, this might be done by introducing an additional symbol to a fake type in BuiltinsX86.def and then condition the type expansion on the platform. llvm-svn: 201775	2014-02-20 11:10:48 +00:00
Chandler Carruth	7ce956ded4	Fix two pedantic issues with our builtin headers. The __STDC_VERSION__ for C99 is '199901L' and we shouldn't be comparing it with anything else. Neither of these should have had any impact in practice. llvm-svn: 201738	2014-02-19 23:38:18 +00:00
Warren Hunt	40d6f29ad8	Add _mm_prefetch and some others as MS builtins This patch adds several built-ins that are required for ms compatibility. _mm_prefetch must be a built-in because it takes a compile-time constant argument and our prior approach of using a #define to the current built-in doesn't work in the presence of re-declaration of _mm_prefetch. The others can be obtained by including the windows system headers. If a user includes the windows system headers but not intrin.h they still need to work and therefore must be built-in because we don't get a chance to implement them in intrin.h in this case. llvm-svn: 201734	2014-02-19 23:20:20 +00:00
Richard Smith	294e59a33b	Remove a broken attempt to cope with someone #undef'ing __has_include_next. This was broken because __has_include_next(...) would not be valid in a preprocessor condition if __has_include_next is not defined. llvm-svn: 201731	2014-02-19 22:53:42 +00:00
Chandler Carruth	e813984b43	Teach Clang to provide ::max_align_t in C11 and C++11 modes. This definition is not chosen idly. There is an unfortunate reality with max_align_t -- the specific nature of its definition leaks into the ABI almost immediately. Because it is part of C11 and C++11 it becomes essential for it to match with other systems on that ABI. There is an effort to discourage any further use of this construct as a consequence -- using max_align_t introduces an immediate ABI problem. We can never update it to have larger alignment even as the microarchitecture changes to necessitate higher alignment. =/ The particular definition here exactly matches the ABI of GCC's chosen ::max_align_t definition, for better or worse. This was written with the help of Richard Smith who was decoding the exact ABI implications of the selected definition in GCC. Notably, in-register arguments are impacted by the particular definition chosen. =/ No one is under the illusion that this is a "good" or "useful" definition of max_align_t, and we are working with the standards committee to specify a more useful interface to address this need. llvm-svn: 201729	2014-02-19 22:35:01 +00:00
Hans Wennborg	12fb89ec51	MS Intrin.h: implement __cpuidex and simplify __cpuid The two identical implementations of __cpuid for X86 / X86_64 were leftovers from my first iteration on the patch that implemented it. llvm-svn: 200568	2014-01-31 19:44:55 +00:00
Hans Wennborg	1fd6dd3616	Intrin.h: include setjmp.h to get a jmp_buf definition This makes sure that the ms-intrin.cpp test passes by providing a mock setjmp.h as a test input. llvm-svn: 200344	2014-01-28 23:01:59 +00:00
Hans Wennborg	740a4d6e46	Intrin.h: implement __rdtsc and __halt llvm-svn: 200343	2014-01-28 22:55:01 +00:00
Reid Kleckner	33630907d6	Revert "intrin.h: include setjmp.h to get a jmp_buf definition" This failed the ms-intrin.cpp test. This reverts commit r200237. This also comments out the _setjmpex declaration for now so that intrin.h will work on x64 targets. llvm-svn: 200243	2014-01-27 19:32:42 +00:00
Reid Kleckner	f08d658d48	Add implementations of some MSVC intrinsics Adds an implementation for _InterlockedCompareExchangePointer() and __faststorefence(). Patch by David Ziman! llvm-svn: 200239	2014-01-27 19:16:35 +00:00
Reid Kleckner	9b8dcebbca	intrin.h: include setjmp.h to get a jmp_buf definition This fixes an error on our _setjmpex declaration for 64-bit code and allows us to declare _setjmp for 32-bit code. llvm-svn: 200237	2014-01-27 19:14:09 +00:00
Reid Kleckner	924eb2afdc	Add 'static __inline__' to MSVC intrinsics with implementations This avoids warnings visible with -Wsystem-headers. llvm-svn: 200235	2014-01-27 18:48:02 +00:00
Eric Christopher	58b404398e	One more intrinsic. llvm-svn: 200061	2014-01-25 01:38:30 +00:00
Eric Christopher	439137ea32	Add missing intrinsics, fix a couple of typos in intrinsic names, and remove duplicate declarations. llvm-svn: 199992	2014-01-24 12:13:47 +00:00
Hans Wennborg	74ca0c4105	Add implementations of __readfs{byte,word,dword,qword} to Intrin.h Differential Revision: http://llvm-reviews.chandlerc.com/D2606 llvm-svn: 199958	2014-01-24 00:52:39 +00:00
Hans Wennborg	2ed8880346	Intrin.h: fix definitions of _Interlocked{In,De}crement16 The declarations seem correct, but the definitions were using chars instead of shorts. llvm-svn: 199923	2014-01-23 19:15:39 +00:00
NAKAMURA Takumi	c28a9a2c33	[CMake] Deprecate CLANG_RUNTIME_OUTPUT_INTDIR and CLANG_LIBRARY_OUTPUT_INTDIR. LLVM_*_OUTPUT_INTDIR should be available everywhere. It was my mistake when I introduced INTDIR stuff. llvm-svn: 199597	2014-01-19 13:00:01 +00:00
Hans Wennborg	854f7d34ec	Add implementations of _cpuid and _xgetbv to Intrin.h The _cpuid() implementation is the same as in lib/Headers/cpuid.h with the parameter names adjusted to match the interface. _xgetbv just does what the Intel manual says. Differential Revision: http://llvm-reviews.chandlerc.com/D2564 llvm-svn: 199439	2014-01-16 23:39:35 +00:00
NAKAMURA Takumi	baa9f533fe	[CMake][VS][XCode] Restruct the output directory layout more comfortable, ${BINARY_DIR}/${BUILD_MODE}/(bin\|lib) We have been seeing nasty directory layout with CMake multiconfig, such as, bin/Release/clang.exe lib/clang/3.x/... lib/Release/clang/3.x/.. (duplicated) Move the layout similar to autoconf's; Release/bin/clang.exe Release/lib/clang/3.x/... Checked on Visual Studio 10. Could you guys please confirm my change on XCode(and other multiconfig builders)? Note: Don't set variables CMAKE_*_OUTPUT_DIRECTORY any more, or a certain builder, for eaxample, msbuild.exe, would be confused. llvm-svn: 198205	2013-12-30 06:48:30 +00:00
NAKAMURA Takumi	38b8c938e8	[CMake] clang/lib/Headers: Install just-generated ${CMAKE_CURRENT_BINARY_DIR}/arm_neon.h, instead of copied arm_neon.h. llvm-svn: 197852	2013-12-21 01:56:00 +00:00
NAKAMURA Takumi	ea0c73b84e	clang/lib/Headers/CMakeLists.txt: Revert part of r197395. It should not be staged yet. llvm-svn: 197441	2013-12-17 00:02:38 +00:00
Nico Weber	ef9a766555	Add bit_FXSAVE as an alias for bit_FXSR, for gcc compat. llvm-svn: 197399	2013-12-16 17:54:57 +00:00
NAKAMURA Takumi	a8c958de47	[CMake] Introduce CLANG_RUNTIME_OUTPUT_INTDIR and CLANG_LIBRARY_OUTPUT_INTDIR. llvm-svn: 197395	2013-12-16 16:03:21 +00:00
Alp Toker	d480b1bf34	Fix a SSE2 intrinsics typo Full discourse at: http://lists.cs.uiuc.edu/pipermail/cfe-commits/Week-of-Mon-20131104/092514.html http://lists.cs.uiuc.edu/pipermail/llvmdev/2013-November/068124.html Patch by Dimitry Andric and Alexey Dokuchaev! llvm-svn: 195558	2013-11-23 22:11:57 +00:00
JF Bastien	1334d0aedf	Define [U]LLONG_{MIN,MAX} for C++11, add tests. Add tests for limits.h, not just [U]LLONG_{MIN,MAX}. llvm-svn: 193506	2013-10-27 19:00:49 +00:00
Manman Ren	c94122e05b	Intrinsics: fix extract & insert when index is out of bound. Now, all extract & insert intrinsics should have the correct and operation to ignore higher bits. rdar://15250497 llvm-svn: 193267	2013-10-23 20:33:14 +00:00
Manman Ren	be38b9e15f	_mm_extract_epi16: use "& 7" when index is out of bound. This is in line with implementation of _mm_extract_pi16. rdar://15250497 llvm-svn: 193187	2013-10-22 19:24:42 +00:00
Reid Kleckner	00d33a5cb1	Add implementations of the MSVC barrier intrinsics Summary: These are deprecated in VS 2012 according to MSDN. They don't actually compile down to any code. They prevent the compiler from reordering memory accesses across the barrier, which is what a memory-clobbering volatile asm does. Reviewers: echristo CC: cfe-commits Differential Revision: http://llvm-reviews.chandlerc.com/D1954 llvm-svn: 192860	2013-10-17 01:29:48 +00:00
Ted Kremenek	854cc293a7	Suppress useless -Wshadow warning when using _mm* macros from emmintrin.h Fixes <rdar://problem/10679282>. I'm not completely satisfied with this patch. Sprinkling "diagnostic ignored" _Pragmas throughout this file is gross, but I couldn't suppress it for the entire file. llvm-svn: 192143	2013-10-07 23:51:11 +00:00
Craig Topper	d335c9da22	Use logical/arithmetic operations instead of builtins in tbmintrin.h. This way we can remove the intrinsic support from the backend. llvm-svn: 192036	2013-10-05 17:08:42 +00:00
Craig Topper	d867805739	Change __builtin_ia32_bextri_u64 to take an i64imm to match up with LLVM backend changes. An explicit cast is still needed in tbmintrin.h to convert any big integer down to i32imm. Patch from Yunzhong Gao. llvm-svn: 191872	2013-10-03 04:21:19 +00:00
Warren Hunt	2731e3e4ef	Fixing implementation of bittestandset in Intrin.h. llvm-svn: 191783	2013-10-01 17:12:40 +00:00
Warren Hunt	3f98794718	Changing __X86_64__ to __x86_64__ in Intrin.h. llvm-svn: 191700	2013-09-30 21:08:05 +00:00
Yunzhong Gao	f4e0b1047a	Adding intrinsics to the clang front end for the x86 TBM instruction set. Differential Revision: http://llvm-reviews.chandlerc.com/D1751 llvm-svn: 191681	2013-09-30 17:25:14 +00:00
Warren Hunt	41a993f6f8	Typo correction: _int64 -> __int64. llvm-svn: 191592	2013-09-28 00:15:41 +00:00
Warren Hunt	d6ffae91d5	Implements some of the more commonly used intrinsics in Intrin.h Differential Revision: http://llvm-reviews.chandlerc.com/D1766 llvm-svn: 191590	2013-09-27 23:57:26 +00:00
Craig Topper	61f71c3903	Remove some stray underscores from copyright block. Fix first line length to match length of the one after the copyright block. llvm-svn: 191483	2013-09-27 03:57:18 +00:00
Hans Wennborg	736a02931e	Provide inline definitions of _Unwind_GetIP etc. for ARM in unwind.h These symbols were showing up as undefined when trying to link programs on Android. We should match libgcc's behaviour and provide inline definitions of these on ARM. It seems unwind.h on ARM/Darwin doesn't provide inline definitions, so we just declare them for that platform. llvm-svn: 191406	2013-09-25 22:34:03 +00:00
Eli Friedman	3cd55f49ab	Fix argument types of some AVX2 intrinsics. This fix makes our headers consistent with gcc. PR17312. llvm-svn: 191248	2013-09-23 23:52:04 +00:00
Eli Friedman	f9d8c6cebb	Add _mm_stream_si64 intrinsic. While I'm here, also fix the alignment computation for the whole family of intrinsics. PR17298. llvm-svn: 191243	2013-09-23 23:38:39 +00:00
Eli Friedman	9b04f41899	Fix return type of _mm_extract_epi8 etc. PR17300. llvm-svn: 191120	2013-09-21 00:05:25 +00:00
Ben Langmuir	ed6e97d2c3	Fix ifdef macro missed in previous commit llvm-svn: 191003	2013-09-19 14:07:14 +00:00
Ben Langmuir	6efe3a886e	Move sha intrinsics to immintrin.h This is consistent with ICC and Intel's SHA-enabled GCC version. llvm-svn: 191002	2013-09-19 14:00:22 +00:00
Ben Langmuir	58078d0103	Add C intrinsics for Intel SHA Extensions Intrinsics added shaintrin.h, which is included from x86intrin.h if __SHA__ is enabled. SHA implies SSE2, which is needed for the __m128i type. Also add the -msha/-mno-sha option. llvm-svn: 190999	2013-09-19 13:22:04 +00:00
Reid Kleckner	f0e232287a	Fix ifdef ordering at the end of Intrin.h from r190965 Test that intrin.h at least parses in C++ TUs. llvm-svn: 190978	2013-09-19 00:19:53 +00:00
Eric Christopher	cc87253f90	Fix closing brace around ifdef. llvm-svn: 190965	2013-09-18 22:40:18 +00:00
Eric Christopher	0db88a7d7e	The intrinsics should all have C linkage. llvm-svn: 190963	2013-09-18 22:24:01 +00:00
Eric Christopher	e276f88947	Add Intrin.h to the cmake files. llvm-svn: 190199	2013-09-06 20:11:28 +00:00
Eric Christopher	fb4b433bbb	Typo. llvm-svn: 189710	2013-08-31 00:27:38 +00:00
Eric Christopher	d1428bf635	Add initial clang targeted compatible decls for Intrin.h. Step towards a windows compatible builtin header. Currently uses x86intrin.h for implementing intel intrinsics in a clang specific manner. llvm-svn: 189709	2013-08-31 00:22:48 +00:00
Peter Collingbourne	6c77e72659	Two more definitions required by libsupc++ (_sleb128_t and _uleb128_t) Differential Revision: http://llvm-reviews.chandlerc.com/D1542 llvm-svn: 189558	2013-08-29 01:56:22 +00:00
Peter Collingbourne	7ac84bd808	80 cols. llvm-svn: 189538	2013-08-28 23:32:22 +00:00
Peter Collingbourne	ec1cb850d1	Add missing definitions to unwind.h. Original patch by Charles Davis. llvm-svn: 189535	2013-08-28 23:16:49 +00:00
Ted Kremenek	80655be83f	[CMake] use combination of CMAKE_RUNTIME_OUTPUT_DIRECTORY and CMAKE_LIBRARY_OUTPUT_DIRECTORY to install clang headers for Xcode builds. llvm-svn: 189443	2013-08-28 05:38:43 +00:00
Ted Kremenek	2894000b48	Revert "Use CMAKE_RUNTIME_OUTPUT_DIRECTORY instead of LLVM_BINARY_DIR for installing Clang headers." This appears to be breaking the buildbots. llvm-svn: 189426	2013-08-28 00:07:08 +00:00
Ted Kremenek	ae2c8776d0	Use CMAKE_RUNTIME_OUTPUT_DIRECTORY instead of LLVM_BINARY_DIR for installing Clang headers. llvm-svn: 189414	2013-08-27 23:20:26 +00:00
Ted Kremenek	e4a0cac4a8	Revert "[CMake] Use CLANG_BINARY_DIR instead of LLVM_BINARY_DIR as installation path for Clang headers." This was breaking some tests. Will investigate. llvm-svn: 189403	2013-08-27 20:46:01 +00:00
Ted Kremenek	8ff42222ba	[CMake] Use CLANG_BINARY_DIR instead of LLVM_BINARY_DIR as installation path for Clang headers. llvm-svn: 189402	2013-08-27 20:41:18 +00:00
Juergen Ributzka	2c2dbf4542	Fix the name and the type of the argument for intrinisc _mm256_broadcastsi128_si256 to align with the Intel documentation. This fixes bug PR 16581 and rdar:14747994. llvm-svn: 188609	2013-08-17 16:40:09 +00:00
Craig Topper	c5244512c8	Use a shuffle with undef elements instead of inserting 0s in the 128-bit to 256-bit casting intrinsics to improve performance. Thanks to Katya Romanova for identifying this issue. llvm-svn: 187716	2013-08-05 06:17:21 +00:00
Roman Divacky	4dcb5dbb53	This patch implements __get_cpuid_max() as an inline and __cpuid() and __cpuid_count() as macros to be compatible with GCC's cpuid.h. It also adds bit_<foo> constants for the various feature bits as described in version 039 (May 2011) of Intel's SDM Volume 2 in the description of the CPUID instruction. The list of bit_<foo> constants is a bit exhaustive (GCC doesn't do near this many). More bits could be added from a newer version of SDM if desired. Patch by John Baldwin! llvm-svn: 186696	2013-07-19 17:28:36 +00:00
Richard Smith	49e56440f9	Add missing include guards into headers in lib/Headers. While it may appear that these headers should not be included more than once, they are in fact included twice when building our builtins module (in order for it to generate submodules for them), and without this, any modular build enabling AVX and including any builtin header fails. Testing this is tricky because including any of these headers in a modular build is liable to fail, due to unrelated builtin headers in the same module including headers which might not be available on the system running the tests. Suggestion on that front are welcome (but we're getting close to being able to run a buildbot that has modules enabled for all tests, which would nicely solve the testing problem). llvm-svn: 186275	2013-07-14 05:41:45 +00:00
Manman Ren	9bb34d66b3	X86 intrinsics: cmpge\|gt\|nge\|ngt_ss\|_sd These intrinsics should return the comparision result in the low bits and keep the high bits of the first source operand. When calling to builtin functions, the source operands are swapped and the high bits of the second source operand are kept. To fix the issue, an extra shufflevector is used. rdar://14153896 llvm-svn: 184110	2013-06-17 19:42:49 +00:00
Douglas Gregor	ae3a4dfac0	Even in a modules world, people will depend on the weird xmmintrin.h -> emmintrin.h forwarding. llvm-svn: 183585	2013-06-07 22:49:44 +00:00
Douglas Gregor	5cad45bc89	Add arm_neon.h to the builtin intrinsics module map. Fixes <rdar://problem/13933913>. llvm-svn: 182268	2013-05-20 14:07:18 +00:00
Richard Smith	0646c86dcb	Fix the return type of the complex creal functions. Patch by YunZhong Gao, modified to use _Static_assert and to check __STDC_HOSTED__ by me. llvm-svn: 181527	2013-05-09 17:41:19 +00:00
Benjamin Kramer	4baf67a61b	xopintrin.h: Add wrappers for all flavors of _mm_com. GCC defines only the wrappers, MSVC defines both, we define both now too. PR15844. llvm-svn: 181514	2013-05-09 15:07:46 +00:00
Benjamin Kramer	fd57b195a3	Add include guards to prfchwintrin.h. llvm-svn: 181513	2013-05-09 15:07:39 +00:00
Hans Wennborg	4c02be3b83	Make sure we define wchar_t related macros correctly in -fms-extensions mode. This adds a test to make sure we define _WCHAR_T_DEFINED and _NATIVE_WCHAR_T_DEFINED correctly in the preprocessor, and updates stddef.h to set it when typedeffing wchar_t. llvm-svn: 180918	2013-05-02 13:12:32 +00:00
Hans Wennborg	b2175b25a7	Fix typo in a stddef.h comment: s/risze_t/rsize_t/ llvm-svn: 180916	2013-05-02 10:36:31 +00:00
Benjamin Kramer	beea351287	Fix header comment. llvm-svn: 180268	2013-04-25 16:14:14 +00:00
Reid Kleckner	7ab75b3f68	Avoid names like __in that conflict with SAL in builtin headers Microsoft's Source Annotation Language (SAL) defines a bunch of keywords for annotating the inputs and outputs of functions. Empty definitions for the keywords are provided by <stdlib.h> -> <crtdefs.h> -> <sal.h>. This makes it basically impossible to include MSVC's stdlib.h and Clang's *mmintrin.h headers at the same time if they have variables named __in. As a workaround, I've renamed those variables. This fixes the Modules/compiler_builtins.m test which was XFAILed, presumably due to this conflict. llvm-svn: 179860	2013-04-19 17:00:14 +00:00
Argyrios Kyrtzidis	08dff958e9	[CMake] Create the directory before creating the link to the clang headers. llvm-svn: 179782	2013-04-18 18:54:03 +00:00
Daniel Dunbar	95f1de3de5	Headers: Add support for ISO9899:2011 rsize_t. llvm-svn: 179427	2013-04-12 23:24:56 +00:00
Richard Smith	2362829734	tl;dr: Teach Clang to work around g++ changing its workaround to glibc's implementation of C99's attempt to control the C++ standard. sigh The C99 standard says that certain macros in <stdint.h>, such as SIZE_MAX, should not be defined when the header is included in C++ mode, unless __STDC_LIMIT_MACROS and __STDC_CONSTANT_MACROS are defined. The C++11 standard says "Thanks, but no thanks" and C11 removed this rule, but various C library implementations (such as glibc) follow C99 anyway. g++ prior to 4.8 worked around the C99 / glibc behavior by defining __STDC__MACROS in <cstdint>, which was incorrect, because <stdint.h> is supposed to provide these macros too. g++ 4.8 works around it by defining __STDC__MACROS in its builtin <stdint.h> header. This change makes Clang act like g++ 4.8 in this regard: our <stdint.h> now countermands any attempt by the C library to implement the undesired C99 rules, by defining the __STDC_*_MACROS first. Unlike g++, we do this even in C++98 mode, since that was the intent of the C++ committee, matches the behavior required in C11, and matches our built-in implementation of <stdint.h>. llvm-svn: 179419	2013-04-12 22:11:07 +00:00
Richard Smith	584f7dcc0e	Add tests that build modules for our builtin headers, and fix two buglets exposed by doing so. llvm-svn: 178736	2013-04-04 02:55:24 +00:00
Argyrios Kyrtzidis	41686481f4	[cmake] Add clang-headers as a dependency of libclang and if we have to copy them for the IDE case, also create a symlink inside the libclang.dylib directory. llvm-svn: 178372	2013-03-29 21:51:40 +00:00
Michael Liao	ffaae3511a	Add RDSEED intrinsic support defined in AVX2 extension llvm-svn: 178331	2013-03-29 05:17:55 +00:00
Michael Liao	4442f796a4	Add XTEST intrinsic defined in TSX extension llvm-svn: 178330	2013-03-29 05:14:06 +00:00
Argyrios Kyrtzidis	95aa0b77f2	Revert "[lib/Headers] Define NULL as __DARWIN_NULL when on __APPLE__." Per feedback by Doug, we should avoid platform-specific implementations in lib/Headers as much as possible. This reverts commit r178110. llvm-svn: 178181	2013-03-27 21:22:45 +00:00
Argyrios Kyrtzidis	fff55a028b	[lib/Headers] Break the module import cycle between _Builtin_intrinsics.sse and _Builtin_intrinsics.sse2 Module "sse" implicitly exports module "sse2". This is bad because we also have module "sse2" export module "sse" (as intended) so we end up with a cycle in the module import graph: 1. sse2 -> (also imports) sse 2. sse -> (also imports) sse2 To eliminate the cycle remove 2.; importing module "sse2" will also import module "sse", but just importing module "sse" will not also import module "sse2". rdar://13240552 llvm-svn: 178117	2013-03-27 05:12:34 +00:00
Argyrios Kyrtzidis	0909d3c5ed	[lib/Headers] Define NULL as __DARWIN_NULL when on __APPLE__. This makes it identical with the system definition. llvm-svn: 178110	2013-03-27 01:25:37 +00:00
Michael Liao	74f4eaf4dc	Add PRFCHW intrinsic support - Add head 'prfchwintrin.h' to define '_m_prefetchw' which is mapped to LLVM/clang prefetch builtin - Add option '-mprfchw' to enable PRFCHW feature and pre-define '__PRFCHW__' macro llvm-svn: 178041	2013-03-26 17:52:08 +00:00
Douglas Gregor	96efb4a442	<rdar://problem/13479214> Make Clang's <stddef.h> robust against system headers defining size_t/ptrdiff_t/wchar_t. Clang's <stddef.h> provides definitions for the C standard library types size_t, ptrdiff_t, and wchar_t. However, the system's C standard library headers tend to provide the same typedefs, and the two generally avoid each other using the macros _SIZE_T/_PTRDIFF_T/_WCHAR_T. With modules, however, we need to see all of the places where these types are defined, so provide the typedefs (ignoring the macros) when modules are enabled. llvm-svn: 177686	2013-03-22 00:10:49 +00:00
Anton Yartsev	a3c9ba364e	PR15480: fixed second parameter types of vec_lde, vec_lvebx, vec_lvehx, and vec_lvewx according to AltiVec Programming Interface Manual llvm-svn: 176789	2013-03-10 16:25:43 +00:00
Richard Smith	8acb4044d8	libstdc++'s <cstdalign> #includes <stdalign.h> and expects it to guard against being included in C++. Don't define alignof or alignas in this case. Note that the C++11 standard is broken in various ways here (it refers to the contents of <stdalign.h> in C99, where that header did not exist, and doesn't mention the alignas macro at all), but we do our best to do what it intended. llvm-svn: 175708	2013-02-21 02:17:58 +00:00
Daniel Dunbar	230cc79394	[Headers] Use standard builtin defines instead of typeof trickery. - The trickery can confuse more basic source processors, in particular the Unix conformance tool that wants to scan headers. llvm-svn: 174475	2013-02-06 00:38:13 +00:00
Richard Smith	4dab709484	C11: Provide the missing half of <stdalign.h> llvm-svn: 173900	2013-01-30 06:33:54 +00:00
Richard Smith	0015f09877	Parsing support for C11's _Noreturn keyword. No semantics yet. llvm-svn: 172761	2013-01-17 22:16:11 +00:00
David Blaikie	5bb700360c	Readd an open paren that was lost while reformatting code. llvm-svn: 172669	2013-01-16 23:13:42 +00:00
David Blaikie	3302f2bd46	PR14964: intrinsic headers using non-reserved identifiers Several of the intrinsic headers were using plain non-reserved identifiers. C++11 17.6.4.3.2 [global.names] p1 reservers names containing a double begining with an underscore followed by an uppercase letter for any use. I think I got them all, but open to being corrected. For the most part I didn't bother updating function-like macro parameter names because I don't believe they're subject to any such collission - though some function-like macros already follow this convention (I didn't update them in part because the churn was more significant as several function-like macros use the double underscore prefixed version of the same name as a parameter in their implementation) llvm-svn: 172666	2013-01-16 23:08:36 +00:00
Benjamin Kramer	696651429d	unwind.h: Add include guards and don't mess with visibility if HIDE_EXPORTS is specified. For GCC compatibility. llvm-svn: 171991	2013-01-09 19:54:57 +00:00
Logan Chien	4d401b47d1	Code cleanup: Remove trailing whitespace in unwind.h. llvm-svn: 167915	2012-11-14 06:33:58 +00:00
Michael Liao	625a875f05	Add clang support of RTM from TSX - New options '-mrtm'/'-mno-rtm' are added to enable/disable RTM feature - Builtin macro '__RTM__' is defined if RTM feature is enabled - RTM intrinsic header is added and introduces 3 new intrinsics, namely '_xbegin', '_xend', and '_xabort'. - 3 new builtins are added to keep compatible with gcc, namely '__builtin_ia32_xbegin', '__builtin_ia32_xend', and '__builtin_ia32_xabort'. - Test cases for pre-defined macro and new intrinsic codegen are added. llvm-svn: 167665	2012-11-10 05:17:46 +00:00
Douglas Gregor	dc779abb8b	Split the instrinsic header wmmintrin.h into AES and PCLMUL parts, so that we can model them as separate submodules. llvm-svn: 167420	2012-11-05 23:30:26 +00:00
Douglas Gregor	10b4f2a20c	Fix module map for SSE4a builtins llvm-svn: 167399	2012-11-05 20:41:30 +00:00
Douglas Gregor	4c69859b56	Make cpuid.h actually work with -std=c99 <rdar://problem/12552716>. While we're here, extend the module map to cover most of the newly-added instrinsic headers. Only wmmintrin.h is missing, because it needs to be split into AES/PCLMUL subheaders (as a separate commit). llvm-svn: 167398	2012-11-05 20:11:10 +00:00
Ulrich Weigand	9936f137eb	Add "static" to some functions in altivec.c where it was missing. llvm-svn: 167148	2012-10-31 18:17:07 +00:00
Manman Ren	5750c1c07e	X86 SSE Intrinsics: update header for sqrt_ss, rsqrt_ss and rcp_ss. There intrinsics pass through the upper FP values from the input. rdar://12558838 llvm-svn: 166743	2012-10-26 00:25:10 +00:00
NAKAMURA Takumi	16ff8fdb57	clang/lib/Headers/CMakeLists.txt: Add f16cintrin.h. llvm-svn: 165688	2012-10-11 01:10:04 +00:00
Manman Ren	a45358c284	X86: add F16C support in Clang Support the following intrinsics: _mm_cvtph_ps, _mm256_cvtph_ps, _mm_cvtps_ph, _mm256_cvtps_ph rdar://12407875 llvm-svn: 165685	2012-10-11 00:59:55 +00:00
Michael Liao	4a7f8c23e0	Add intrinsic of MULX in BMI2 header llvm-svn: 165325	2012-10-05 18:50:09 +00:00
Logan Chien	774442162d	Add struct keyword before _Unwind_Context. In the C programming language, we have to add the "struct" keyword. Otherwise, the compiler will emit error message. llvm-svn: 164665	2012-09-26 06:35:17 +00:00
Benjamin Kramer	a43b6999ff	Add _rdrand{16,32,64}_step intrinsics to immintrin.h llvm-svn: 160118	2012-07-12 09:33:03 +00:00
Craig Topper	6490bdcf72	Rename tzcnt intrinsics to match gcc. llvm-svn: 159515	2012-07-02 06:52:51 +00:00
Douglas Gregor	158dec5d20	std::nullptr_t support in MS headers, from João Matos. llvm-svn: 159448	2012-06-29 18:28:41 +00:00
Manman Ren	f865ba0c0e	X86: add more GATHER intrinsics in Clang Support the following intrinsics: _mm_i32gather_pd, _mm256_i32gather_pd, _mm_i64gather_pd, _mm256_i64gather_pd, _mm_i32gather_ps, _mm256_i32gather_ps, _mm_i64gather_ps, _mm256_i64gather_ps, _mm_i32gather_epi64, _mm256_i32gather_epi64, _mm_i64gather_epi64, _mm256_i64gather_epi64, _mm_i32gather_epi32, _mm256_i32gather_epi32, _mm_i64gather_epi32, _mm256_i64gather_epi32 llvm-svn: 159410	2012-06-29 05:19:13 +00:00
Manman Ren	86c3250b82	X86: add more GATHER intrinsics in Clang Corrected type for index of _mm256_mask_i32gather_pd from 256-bit to 128-bit Corrected types for src\|dst\|mask of _mm256_mask_i64gather_ps from 256-bit to 128-bit Support the following intrinsics: _mm_mask_i32gather_epi64, _mm256_mask_i32gather_epi64, _mm_mask_i64gather_epi64, _mm256_mask_i64gather_epi64, _mm_mask_i32gather_epi32, _mm256_mask_i32gather_epi32, _mm_mask_i64gather_epi32, _mm256_mask_i64gather_epi32 llvm-svn: 159403	2012-06-29 00:54:35 +00:00
Manman Ren	add5e9e289	X86: add GATHER intrinsics (AVX2) in Clang Support the following intrinsics: _mm_mask_i32gather_pd, _mm256_mask_i32gather_pd, _mm_mask_i64gather_pd _mm256_mask_i64gather_pd, _mm_mask_i32gather_ps, _mm256_mask_i32gather_ps _mm_mask_i64gather_ps, _mm256_mask_i64gather_ps llvm-svn: 159222	2012-06-26 19:55:09 +00:00
NAKAMURA Takumi	f500be025a	Headers/xopintrin.h: Try to fix r158492. Did you mean, mm256? llvm-svn: 158521	2012-06-15 13:37:44 +00:00
Craig Topper	9e28bf9345	Add XOP frcz instrinsics. llvm-svn: 158492	2012-06-15 06:33:42 +00:00
Craig Topper	db0fbf0a50	Add XOP permute intrinsics. llvm-svn: 158351	2012-06-12 06:03:35 +00:00
Craig Topper	ce8dbaadb6	Add XOP shift and compare intrinsics. llvm-svn: 158300	2012-06-11 07:01:43 +00:00
Craig Topper	a3c5fbf54b	Add XOP vprot* instruction intrinsics llvm-svn: 158292	2012-06-10 07:47:32 +00:00
Craig Topper	02b3d81a97	More XOP intrinsics llvm-svn: 158287	2012-06-10 02:46:15 +00:00
Craig Topper	33b6d5e20b	Begin adding XOP intrinsics llvm-svn: 158286	2012-06-10 00:39:38 +00:00
Craig Topper	2b1eda344a	Add fma3 intrinsic header file. llvm-svn: 157913	2012-06-04 03:42:47 +00:00
Craig Topper	3f122a7636	Add builtin for pclmulqdq instruction. llvm-svn: 157733	2012-05-31 05:18:48 +00:00
Craig Topper	9fd12db1c0	Update FIXME. ABM is already covered by LZCNT and POPCNT. llvm-svn: 157676	2012-05-30 04:49:49 +00:00
Benjamin Kramer	1ab16ba501	Install ammintrin.h in the cmake build. llvm-svn: 157639	2012-05-29 19:36:17 +00:00
Benjamin Kramer	ba6e2528fa	Add an ammintrin.h header for SSE4a intrinsics. This is a clean-room implementation based on public documentation and I tried to validate it as much as possible against gcc. llvm-svn: 157638	2012-05-29 19:10:17 +00:00
Chandler Carruth	4496c44e5f	Remove the 'intrin.h' builtin header file and its tests for now. After discussion with several people, including Doug Gregor, we've decided to change our approach here. If you have questions about this header file, the commit removing it, etc., please reach out to me off-list. llvm-svn: 156322	2012-05-07 20:46:58 +00:00
Chad Rosier	87622b8b84	Get rid of storelv4si builtin as it can be expressed directly. This is general goodness because it provides opportunites to cleanup things. For example, uint64_t t1(__m128i vA) { uint64_t Alo; _mm_storel_epi64((__m128i*)&Alo, vA); return Alo; } was generating movq %xmm0, -8(%rbp) movq -8(%rbp), %rax and now generates movd %xmm0, %rax rdar://11282581 llvm-svn: 155924	2012-05-01 18:11:51 +00:00
Nico Weber	cb93142e1f	Expand #include_next in float.h from mingw to _msc_ver. A test for this is checking if this compiles: #include <float.h> inline bool IsFinite(const double& number) { return _finite(number) != 0; } That depends however on either mingw or msvc being installed, and chapuni tells me there might be issues with float.h on mingw, so no automated test is added. llvm-svn: 155507	2012-04-24 23:43:40 +00:00
Nico Weber	1d725ecf93	Let NULL and MSVC headers coexist better. Fixes the two issues mentioned in PR12146. llvm-svn: 155490	2012-04-24 21:27:01 +00:00
Aaron Ballman	0ae8c946a4	Adding information about what intrinsics still need to be implemented for MSVC compatibility. llvm-svn: 155441	2012-04-24 12:30:37 +00:00
Chandler Carruth	ff90611253	Fix a typo spotted by Matt. llvm-svn: 155427	2012-04-24 05:59:48 +00:00
Chandler Carruth	3dfb6d84c6	Introduce an initial sketch of a MSVC compatible 'intrin.h' builtin header, along with a stub test to make sure it compiles in the appropriate modes. Thanks to Aaron Ballman for working with me to figure out the initial strategy here, and to Nico for reviewing and pestering me to actually commit it. llvm-svn: 155425	2012-04-24 05:23:54 +00:00
Craig Topper	26e74e50b6	Convert vperm2f128 and vperm2i128 intrinsics back to using llvm intrinsics. Unfortunately, these instructions have behavior that can't be modeled with shuffle vector. llvm-svn: 154906	2012-04-17 05:16:56 +00:00
Craig Topper	8e57855ea0	Change _mm256_permute4x64_epi64 and _mm256_permute4x64_pd to use builtin_shufflevector instead of specific builtins. Old builtins will be removed from llvm now that vpermq/vpermpd are supported by shuffle lowering code. llvm-svn: 154777	2012-04-15 22:18:10 +00:00
Chad Rosier	2c5154224b	Fix the signatures for the _mm256_storeu2_* intrinsics. PR12532 llvm-svn: 154591	2012-04-12 16:29:08 +00:00
Craig Topper	74c17c65e4	Correctly check argument types for some vector macros in smmintrin.h. Put parentheses around uses of vector macro arguments. llvm-svn: 153732	2012-03-30 07:01:17 +00:00
Craig Topper	97f042f2d6	Add _mm_minpos_epu16 to smmintrin.h. Fixes PR12399. llvm-svn: 153726	2012-03-30 05:41:28 +00:00
Craig Topper	678a53c350	Fix shuffle vector calculation for mm_permute_ps. Fixes PR 12401. llvm-svn: 153724	2012-03-30 05:09:18 +00:00
Rafael Espindola	c31d004ece	unwind.h fix for -fvisibility=hidden users. This fixes firefox build in a system with libunwind installed. Patch by Jeffrey Yasskin! llvm-svn: 153633	2012-03-29 03:37:17 +00:00
Chad Rosier	f8df4f4e3b	[avx] Define the _mm256_loadu2_xxx and _mm256_storeu2_xxx intrinsics. From the Intel Optimization Reference Manual, Section 11.6.2. When data cannot be aligned or alignment is not known, 16-byte memory accesses may provide better performance. rdar://11076953 llvm-svn: 153091	2012-03-20 16:40:00 +00:00
Howard Hinnant	ebab2b0660	* tgmath_logb.patch implements the missing logb function (see C99 standard 7.22, paragraph 5). * tgmath_fabs_complex.patch corrects the return types for the complex fabs functions. These must be non-complex float/double/long double (see C99 standard 7.22, paragraph 4 and 7.3.8.1). Patch contributed by Kristof Beyls. llvm-svn: 151276	2012-02-23 20:22:10 +00:00
Jeffrey Yasskin	a09e62a042	Allow linux builds to take advantage of libunwind to get unwind.h if that's installed. llvm-svn: 151058	2012-02-21 16:20:12 +00:00
Chandler Carruth	a2a5410e6d	Add 3dNOW intrinsic header to x86intrin.h, conditioned on __3dNOW__ to match the behavior of GCC. Also add a test for these intrinsics, which apparently have zero tests. =[ Not surprisingly, Clang crashed when compiling these. Fix the bug in CodeGen where we failed to bitcast the argument type to x86mmx prior to calling the LLVM intrinsic. This fixes an assert on the new 3dnow-builtins.c test. This is one issue impacting the efforts to get Clang to emulate the Microsoft intrinsics headers -- 3dnow intrinsics are implictitly made available there. llvm-svn: 150948	2012-02-20 07:35:45 +00:00
Craig Topper	e5ea3b0239	Remove vperm2f* and vperm2i builtins. Same effect can be achieved with builtin_shufflevector. llvm-svn: 150064	2012-02-08 07:33:36 +00:00
Craig Topper	fec9f8edb7	Remove vpermilp* builtins. Same effect can be achieved with builtin_shufflevector. llvm-svn: 150056	2012-02-08 05:16:54 +00:00
Eli Friedman	96efec99eb	Add C11 FLT_TRUE_MIN and friends. <rdar://problem/10812837>. llvm-svn: 149949	2012-02-07 01:02:19 +00:00
Nick Lewycky	d0ba3793aa	Comment mystery code. llvm-svn: 149742	2012-02-04 02:16:48 +00:00
Nick Lewycky	51a009092c	Make _mm_cmpgt_epi8 immute to -funsigned-char. llvm-svn: 149725	2012-02-03 23:57:48 +00:00
Douglas Gregor	3ec6663be0	Back out my heinous hack that tricked the module generation mechanism into using non-absolute system includes (<foo>)... ... and introduce another hack that is simultaneously more heineous and more effective. We whitelist Clang-supplied headers that augment or override system headers (such as float.h, stdarg.h, and tgmath.h). For these headers, Clang does not provide a module mapping. Instead, a system-supplied module map can refer to these headers in a system module, and Clang will look both in its own include directory and wherever the system-supplied module map suggests, then adds either or both headers. The end result is that Clang-supplied headers get merged into the system-supplied module for the C standard library. As a drive-by, fix up a few dependencies in the _Builtin_instrinsics module. llvm-svn: 149611	2012-02-02 18:42:48 +00:00
Douglas Gregor	232e3431e2	Split compiler builtin module into "stdlib" builtins and "intrinsic" builds, and bring mm_alloc.h into the fold. Start playing some tricks with these builtin modules to mirror the include_next tricks that the headers already perform. llvm-svn: 149434	2012-01-31 21:57:50 +00:00
Douglas Gregor	56435b49e0	Remove tgmath.h from the module map for now, because it currently causes a cyclic module dependency due to its inclusion of math.h and complex.h. I'll take another shot at it later. llvm-svn: 149283	2012-01-30 22:22:39 +00:00
Douglas Gregor	71022cac1f	Fix typo spotted by Sebastian. Thanks! llvm-svn: 149257	2012-01-30 18:49:05 +00:00
Craig Topper	d6d3a05b4f	Cleanup 3dnow builtin handling. Most of them were already handled by LLVM connecting intrinsics and builtins in IntrinsicsX86.td. llvm-svn: 149233	2012-01-30 08:18:19 +00:00
Douglas Gregor	0070c0bfbe	Introduce TargetInfo::hasFeature() to query various feature names in each of the targets. Use this for module requirements, so that we can pin the availability of certain modules to certain target features, e.g., provide a module for xmmintrin.h only when SSE support is available. Use these feature names to provide a nearly-complete module map for Clang's built-in headers. Only mm_alloc.h and unwind.h are missing, and those two are fairly specialized at the moment. Finishes <rdar://problem/10710060>. llvm-svn: 149227	2012-01-30 06:38:25 +00:00
Douglas Gregor	c93a872206	Just disable the compiler-builtins module test on MSVC for now llvm-svn: 149214	2012-01-29 23:53:54 +00:00
Douglas Gregor	e8f900bdcc	Teach tgmath.h to only include <complex.h> if it's available. llvm-svn: 149213	2012-01-29 23:40:50 +00:00
Douglas Gregor	80928be137	Alternate fix to the modules failures that doesn't require us to tweak tgmath.h llvm-svn: 149210	2012-01-29 22:47:19 +00:00
Douglas Gregor	b9f9aea13c	If there's no math.h, then tgmath.h should just be empty llvm-svn: 149209	2012-01-29 22:35:57 +00:00
Douglas Gregor	3f09de6442	Introduce a module map for (some of) the compiler-supplied headers. The remaining headers require more sophisticated requirements; they'll be handled separately. Part of <rdar://problem/10710060>. llvm-svn: 149206	2012-01-29 20:52:14 +00:00
Craig Topper	9e9301a83a	Represent 256-bit unaligned loads natively and remove the builtins. Similar change was made for 128-bit versions a while back. llvm-svn: 148919	2012-01-25 04:26:17 +00:00
Douglas Gregor	38f3981a99	On Darwin, use the system's <unwind.h> whenever it is available. Clang's <unwind.h> isn't ready for prime time. Fixes <rdar://problem/10733587>. llvm-svn: 148807	2012-01-24 15:12:50 +00:00
Bob Wilson	51897ec79b	Fix a typo: _MM_FLUSH_ZERO_OFF has the wrong value. rdar://10716672 llvm-svn: 148711	2012-01-23 18:27:24 +00:00
Evgeniy Stepanov	5dfe9f2b2f	Extend unwind.h with the ARM unwinder interface. These declarations come from the sample code in the "Exception Handling ABI for the ARM Architecture" document. llvm-svn: 148469	2012-01-19 11:39:05 +00:00
Joerg Sonnenberger	c2f91c37e8	Don't depend on undefined macros being 0, there are options for the preprocessor to warn about it. llvm-svn: 147466	2012-01-03 19:22:38 +00:00
NAKAMURA Takumi	96d77daa49	clang/lib/Headers/CMakeLists.txt: Unbreak cmake build. llvm-svn: 147373	2011-12-30 10:38:16 +00:00
Craig Topper	b4ceb6fd52	Add FMA4 intrinsics. llvm-svn: 147372	2011-12-30 09:15:03 +00:00
Craig Topper	ba418d8e91	Remove an accidental change from r147370. Would only break if the new fma4 flag was used. llvm-svn: 147371	2011-12-30 07:35:49 +00:00
Craig Topper	ffdb46ceef	Add FMA4 feature flag. Intrinsics coming soon. Also make sse4a feature flag imply sse3. Matches gcc behavior. llvm-svn: 147370	2011-12-30 07:33:42 +00:00
Richard Smith	6b751dc2c6	Unbreak cmake build after r147340. llvm-svn: 147355	2011-12-29 21:42:29 +00:00
Craig Topper	1de8348db7	Add popcnt feature flag to match gcc. This flag is implied when sse42 is enabled, but can be disabled separately. Move popcnt intrinsics to popcntintrin.h to match gcc. llvm-svn: 147340	2011-12-29 16:10:46 +00:00
NAKAMURA Takumi	9a3f299f0e	clang/lib/Headers/CMakeLists.txt: Unbreak cmake build to add bmi2intrin.h since r147275. llvm-svn: 147276	2011-12-26 03:20:06 +00:00
Craig Topper	c334dd68a7	Add BMI2 intrinsics. llvm-svn: 147275	2011-12-26 02:31:10 +00:00
NAKAMURA Takumi	dceeeb8918	lib/Headers/CMakeLists.txt: Fix cmake build since r147263, for two missing headers. llvm-svn: 147266	2011-12-25 12:47:46 +00:00
Craig Topper	a06d4a1c40	Add the rest of the BMI intrinsics. llvm-svn: 147265	2011-12-25 07:27:12 +00:00
Craig Topper	f2855ade2b	Add intrinsics for lzcnt and tzcnt instructions. llvm-svn: 147263	2011-12-25 06:25:37 +00:00
Craig Topper	22967d4a61	Add BMI, BMI2, and LZCNT feature flags to enable adding intrinsics. llvm-svn: 147262	2011-12-25 05:06:45 +00:00
Craig Topper	175543ac78	Add last of the AVX2 intrinsics except for gather. llvm-svn: 147253	2011-12-24 17:20:15 +00:00
Craig Topper	9f00948a82	Add AVX2 permute intrinsics. Also add parentheses on some macro arguments in other intrinsic headers. llvm-svn: 147241	2011-12-24 07:55:14 +00:00
Craig Topper	9479895928	Add AVX2 intrinsics for FP vbroadcast, vbroadcasti128, and vpblendd. llvm-svn: 147239	2011-12-24 05:19:29 +00:00
Craig Topper	a6fdbd1807	Intrinsics for AVX2 unpack instructions. llvm-svn: 147237	2011-12-24 03:58:43 +00:00
Craig Topper	f4bb952533	More AVX2 intrinsics for shift, psign, some shuffles, and psadbw. llvm-svn: 147236	2011-12-24 03:28:57 +00:00
Craig Topper	235a365d58	Add AVX2 multiply intrinsics. llvm-svn: 147219	2011-12-23 08:31:16 +00:00
Craig Topper	1f2460ad43	Add AVX2 intrinsics for max, min, sign extend, and zero extend. llvm-svn: 147141	2011-12-22 09:18:58 +00:00
Craig Topper	a73baa8050	Add a few more AVX2 intrinsics and fix the type strings on a couple SSE intrinsics. llvm-svn: 147048	2011-12-21 08:35:05 +00:00
Craig Topper	3fe5ac40db	Add AVX2 horizontal add/sub intrinsics. llvm-svn: 147047	2011-12-21 08:17:40 +00:00
Craig Topper	a89747dd1e	Add AVX2 intrinsics for pavg, pblend, and pcmp instructions. Also remove unneeded builtins for SSE pcmp. Change SSE pcmpeqq and pcmpgtq to not use builtins and just use vector == and >. llvm-svn: 146969	2011-12-20 09:55:26 +00:00
Craig Topper	a557e1c122	Add AVX2 intrinsics for and, andn, or, and xor. llvm-svn: 146862	2011-12-19 09:03:48 +00:00
Craig Topper	94aba2c260	More AVX2 intrinsic support including saturating add/sub and palignr. llvm-svn: 146857	2011-12-19 07:03:25 +00:00
Craig Topper	dec792ebb5	Begin adding AVX2 intrinsics. Necessitated increasing the number of bits used to store builtinID when serializing identifier table. llvm-svn: 146855	2011-12-19 05:04:33 +00:00
Chad Rosier	7caca84ce4	Fix _mm_permute_ps and _mm256_permute_ps AVX intrinsics to use "I" (ICE) markings. Fix avxintrin.h to take them into account. Part of rdar://10595450 llvm-svn: 146810	2011-12-17 01:51:05 +00:00
Chad Rosier	93375d5fa5	Revert r146797, which was a partial revert of r146791; It was correct in the first place. The permutevar_* (note the var) intrinsics use ymm/mem. llvm-svn: 146807	2011-12-17 01:39:56 +00:00
Chad Rosier	0adfe7aa2f	Fix _mm256_extractf128_* AVX intrinsics to use "I" (ICE) markings. Fix avxintrin.h to take them into account. Part of rdar://10595450 llvm-svn: 146804	2011-12-17 01:22:27 +00:00
Chad Rosier	3648646b2b	Partial revert of r146791; vpermilps/vpermilpd instructions accepts ymm/mem/imm8. llvm-svn: 146797	2011-12-17 00:50:42 +00:00
Chad Rosier	060d03be1c	Fix _mm256_round_pd, _mm256_round_ps, _mm_permute_pd and _mm256_permute_pd AVX intrinsics to use "I" (ICE) markings. Fix avxintrin.h to take them into account. Part of rdar://10595450 llvm-svn: 146791	2011-12-17 00:15:26 +00:00
Chad Rosier	33d22d8def	Fix vinsertf128_* AVX intrinsics to use "I" (ICE) markings. Fix avxintrin.h to take them into account. rdar://10590282 llvm-svn: 146758	2011-12-16 21:40:31 +00:00
Chad Rosier	9138fea25e	Fix vperm2f128_* AVX intrinsics to use "I" (ICE) markings. Fix avxintrin.h to take them into account. rdar://10576962 llvm-svn: 146757	2011-12-16 21:07:34 +00:00
Bob Wilson	16c4195548	Fix obvious error in _mm_test_all_zeros. PR11565. Patch by Mathias Gaunard! llvm-svn: 146565	2011-12-14 17:17:16 +00:00
Chandler Carruth	222c66db38	Fix a blatant typo or cut/paste-o reported by users of this header. llvm-svn: 146251	2011-12-09 09:23:55 +00:00
Rafael Espindola	7a284b2e78	Use default visibility in the the symbols declared in unwind.h. This matches the behavior of gcc's unwind.h. llvm-svn: 146208	2011-12-09 00:08:01 +00:00
Rafael Espindola	18c7920d6b	Add a minimal unwind.h that knows how to forward to the system one in systems that have it in /usr/include (only OS X Lion so far). llvm-svn: 146140	2011-12-08 05:01:39 +00:00
Daniel Dunbar	e946e361ab	Headers: wmmintrin.h only needs xmmintrin.h. - Fixes <rdar://problem/10261246> clang -maes option is not sufficient to include <wmmintrin.h> llvm-svn: 145939	2011-12-06 16:17:54 +00:00
Rafael Espindola	488ea473db	Install cpuid.h when building with cmake too. llvm-svn: 145935	2011-12-06 15:46:47 +00:00
Rafael Espindola	49118520de	Fix comment. llvm-svn: 145271	2011-11-28 20:05:27 +00:00
Rafael Espindola	0618d14edf	Error on non x86 architectures. llvm-svn: 145185	2011-11-27 15:21:33 +00:00
Rafael Espindola	fd03d0b733	Fix file name in comments. llvm-svn: 145184	2011-11-27 15:13:54 +00:00
Rafael Espindola	d086573a4d	Add the minimum implementation of cpuid.h. This works on "modern" intel cpus and on clang, which seams to handled "=b" correctly even when ebx is the PIC register. llvm-svn: 145149	2011-11-26 20:53:19 +00:00
Eli Friedman	f16beb3942	Fix some additional x86 intrinsics to use "I" (ICE) markings. Fix *mmintrin.h to take them into account. <rdar://problem/10341145> llvm-svn: 144246	2011-11-10 00:11:13 +00:00
Eli Friedman	9586cdb01e	Misc fixes to pcmp*stri. llvm-svn: 144073	2011-11-08 04:13:51 +00:00
Bob Wilson	c9b97cc1da	Fix vector macros to correctly check argument types. <rdar://problem/10261670> llvm-svn: 143792	2011-11-05 06:08:06 +00:00
Eli Friedman	89c11337ba	Add _mm_comige_sd to emmintrin.h, since I apparently forgot to do this in r138769. <rdar://problem/10230751> llvm-svn: 141310	2011-10-06 20:31:50 +00:00
Peter Collingbourne	d937a99465	Clang-side build system infrastructure for multiple tblgens. llvm-svn: 141267	2011-10-06 01:52:10 +00:00
Peter Collingbourne	2f3cf4b158	Add support for alignment-specifiers in C1X and C++11, remove support for the C++0x draft [[align]] attribute and add the C1X standard header file stdalign.h llvm-svn: 140796	2011-09-29 18:04:28 +00:00
Eli Friedman	9bb51adcce	Tweak *mmintrin.h so that they don't make any bad assumptions about alignment (which probably has little effect in practice, but better to get it right). Make the load in _mm_loadh_pi and _mm_loadl_pi a single LLVM IR instruction to make optimizing easier for CodeGen. rdar://10054986 llvm-svn: 139874	2011-09-15 23:15:27 +00:00
Eric Christopher	bd202c0496	Remove WCHAR_MIN and WCHAR_MAX from limits.h. According to posix and c99 these should be in stdint.h - and they already are. Fixes rdar://10097036. llvm-svn: 139332	2011-09-08 23:25:25 +00:00
Eli Friedman	f8cb480528	Add missing function _mm_ucomige_sd to emmintrin.h. PR10803. llvm-svn: 138769	2011-08-29 21:26:24 +00:00
Bruno Cardoso Lopes	7a98a7e681	Fix _mm256_shuffle_ps mask! Example, for mask=203, Instead of: <i32 3, i32 2, i32 8, i32 11, i32 3, i32 6, i32 12, i32 15> generate: <i32 3, i32 2, i32 8, i32 11, i32 7, i32 6, i32 12, i32 15> llvm-svn: 138411	2011-08-23 23:29:45 +00:00
Howard Hinnant	854a3966d4	http://llvm.org/bugs/show_bug.cgi?id=10472 llvm-svn: 135927	2011-07-25 18:09:56 +00:00
Nick Lewycky	c3218637bd	Fix typo. llvm-svn: 135473	2011-07-19 08:48:08 +00:00
Alexis Hunt	8cb46bb51c	Implement a __WCHAR_UNSIGNED__ macro and use it to include WCHAR_MIN and WCHAR_MAX in limits.h, thus solving the problem where the system header thinks it knows better. llvm-svn: 135455	2011-07-19 00:50:57 +00:00
NAKAMURA Takumi	a185c78d4d	lib/Headers/mm_malloc.h: Use __mingw_aligned_malloc() in _mm_malloc() on mingw. By default, mingw does not have _mm_alloc() nor _aligned_malloc(). llvm-svn: 135388	2011-07-18 11:13:50 +00:00
Douglas Gregor	c9c40ce861	Teach Clang's <float.h> to also include MinGW's <float.h>, which provides additional system definitions, from Ruben Van Boxem llvm-svn: 134407	2011-07-05 14:17:04 +00:00
Douglas Gregor	21ad5dfc69	Define va_copy when in C++0x mode; C++0x picked it up from C99. llvm-svn: 133438	2011-06-20 15:03:22 +00:00
Bill Wendling	03e7e430c3	Add 'may_alias' attribute. Noticed by Eli. llvm-svn: 131278	2011-05-13 01:24:00 +00:00
Bill Wendling	502931fad9	Represent the unaligned loads natively. These are converted into a call to the correct unaligned load. llvm-svn: 131268	2011-05-13 00:11:39 +00:00
Bill Wendling	e106c34817	LLVM doesn't always optimize away the four loads from this: (__m128){ p[0], p[1], p[2], p[3] } which produces really bad code. This could be done in instcombine, but it's probably better to do it in the front-end instead. <rdar://problem/9424836> llvm-svn: 131237	2011-05-12 19:02:15 +00:00
Eli Friedman	8ba29d8e7f	PR9866: Fix the implementation of _mm_loadl_pd and _mm_loadh_pd to not make bad assumptions about the alignment of the double* argument. llvm-svn: 131052	2011-05-07 18:59:31 +00:00
Eli Friedman	cb59baaa20	PR9849: Fix _mm_setr_pi32 and friends to actually work correctly. They broke with the MMX rewrite a while back. llvm-svn: 130945	2011-05-05 20:21:54 +00:00
Eli Friedman	fe0739dffb	Some small improvements to the builtin (-ffreestanding) stdint.h; in particular, make sure to handle WCHAR_MIN correctly. llvm-svn: 130618	2011-04-30 19:02:59 +00:00
Chris Lattner	f03406f103	don't use compound literals in MM macros, since they will be instantiated into user code which may warn about them with -pedantic. Patch by Jonathan Sauer! llvm-svn: 130149	2011-04-25 20:42:40 +00:00
Eli Friedman	4547752402	PR9772: Fix the definition of WINT_MIN and WINT_MAX on Linux -ffreestanding. llvm-svn: 129907	2011-04-21 05:45:45 +00:00
Michael J. Spencer	1737c9e0b5	Add mm3dnow.h. llvm-svn: 129572	2011-04-15 15:11:21 +00:00
Chris Lattner	57540c5be0	fix a bunch of comment typos found by codespell. Patch by Luis Felipe Strano Moraes! llvm-svn: 129559	2011-04-15 05:22:18 +00:00
Bill Wendling	2c1c33552d	Remove comment that snuck in there. llvm-svn: 129434	2011-04-13 10:05:14 +00:00
Bill Wendling	b9c9e34cb3	Just use a native "load" instead of translating the builtin later. Clang can take it! I wasn't able to get __builtin_ia32_loaddqu to transform into an unaligned load...I'll have to look into it further. llvm-svn: 129427	2011-04-13 05:58:17 +00:00
John McCall	91a528841b	Implement the AVX cmp builtins as macros instead of static inlines. Patch by Syoyo Fujita! Reviewed by Chris Lattner! Checked in by me! llvm-svn: 128984	2011-04-06 03:37:51 +00:00
Ted Kremenek	f49e1dd86d	Add '#ifndef _PTRDIFF_T' guard around definition of ptrdiff_t. Fixes <rdar://problem/9210154>. llvm-svn: 128578	2011-03-30 21:43:52 +00:00
NAKAMURA Takumi	1a780ee8a9	lib/Headers/mm_malloc.h: On Windows, we can expect _mm_malloc would be provided as macro by <malloc.h>. llvm-svn: 127654	2011-03-15 02:32:43 +00:00
Oscar Fuentes	6401523405	CMake: updated list of installable header files. PR9321. llvm-svn: 126572	2011-02-27 13:33:31 +00:00
Oscar Fuentes	15fe190027	Put targets on folders, if the IDE supports the feature. Requires CMake 2.8.3 or newer. llvm-svn: 126094	2011-02-20 22:06:44 +00:00
Oscar Fuentes	6f72540e46	New function for tablegenning: clang_tablegen. llvm-svn: 126093	2011-02-20 22:06:32 +00:00
Anton Yartsev	b9734cd4eb	Optimized IR for vec_splat llvm-svn: 120610	2010-12-01 21:59:31 +00:00
Chandler Carruth	45c2fb1e69	Undo part of my previous commit to mm_malloc.h, going back to the use of stdlib.h. There were numerous problems with forward declaring 'malloc' and 'free', but the most important is that these are reserved by POSIX and may be implemented via a function-like macro. As suggested by Dale Johannesen, I'm instead guarding the only include of this in our builtin headers with __STDC_HOSTED__, and I've removed the include of the header from the test suite. I'll discuss with folks whether we want to have a hosted section of the test suite or not, and add it (and perhaps other tests) back there if that's the direction. llvm-svn: 119958	2010-11-22 08:06:31 +00:00
Anton Yartsev	f2a1345a34	turned pointers into pointers to const in function parameters in all functions/builtins accepting pointers to a const-qualified type according to PIM and "Language Extensions for CBEA" llvm-svn: 119376	2010-11-16 20:09:36 +00:00
Chandler Carruth	5eef9ba483	Futher reduce the includes of our builtin headers, and teach limits.h to avoid include_next when not hosted or unavailable. This follows the pattern in stdint.h and allows these headers to work even in a freestanding configuration without a standard library. llvm-svn: 119343	2010-11-16 10:07:43 +00:00
Douglas Gregor	c4a821860c	Fix CMake installation of arm_neon.h llvm-svn: 116835	2010-10-19 18:06:10 +00:00
NAKAMURA Takumi	a8792e514a	lib/Headers/stddef.h: wint_t should be defined whenever <stddef.h> is included with __need_wint_t. llvm-svn: 116794	2010-10-19 03:42:41 +00:00
Eric Christopher	8a8673ea39	From scratch rewrite of mm_malloc.h. Patch by Matthew Beaumont-Gay! llvm-svn: 116771	2010-10-18 23:38:51 +00:00
Anton Yartsev	73d4023114	support for AltiVec extensions from the Cell architecture llvm-svn: 116478	2010-10-14 14:37:46 +00:00
Douglas Gregor	bd82998e35	Eliminate CIndexer::getClangPath(), since libclang no longer depends on the presence of a 'clang' executable. Simplify CIndexer::getClangResourcesPath() a bit. Patch up the CMake makefiles to install headers into two locations in the build tree, for those silly cases where 'clang' will end up looking into the wrong build directory for headers. llvm-svn: 116260	2010-10-11 23:17:59 +00:00
Chris Lattner	07704f1d7e	the mmx intrinsic for pshufw should map to the IR intrinsic, not to a shufflevector. Otherwise it doesn't turn into a pshufw. This bug was introduced in the mmx rewrite. llvm-svn: 115423	2010-10-02 21:32:59 +00:00
Chris Lattner	1750cb037d	__builtin_ia32_psrldqi128 too llvm-svn: 115301	2010-10-01 06:58:49 +00:00
Chris Lattner	81f347fe6d	the second argument to __builtin_ia32_pslldqi128 must be an immediate, so it needs to be called from a macro, not a function. This is a necessary but insufficient step towards fixing PR8221 llvm-svn: 115299	2010-10-01 06:52:23 +00:00
Dale Johannesen	39d6f4b95c	Clang part of MMX rewrite (goes with 115243). llvm-svn: 115244	2010-09-30 23:57:50 +00:00
Douglas Gregor	1f7d02fb6d	Define _Bool, bool, true, and false macros in <stdbool.h> when we're in a GNU-compatible C++ dialect. Fixes <rdar://problem/8477819>. llvm-svn: 115028	2010-09-29 04:57:11 +00:00
Bill Wendling	11191f11b8	Accidentally committed some temporary changes on my branch when reverting patches. llvm-svn: 114936	2010-09-28 01:28:56 +00:00
Bill Wendling	6d8c442e08	Temporarily revert 114929 114925 114924 114921. It looked like they (or at least one of them) was causing a series of failures: http://google1.osuosl.org:8011/builders/clang-x86_64-darwin10-selfhost/builds/4518 svn merge -c -114929 https://llvm.org/svn/llvm-project/cfe/trunk --- Reverse-merging r114929 into '.': U include/clang/Sema/Sema.h U include/clang/AST/DeclCXX.h U lib/Sema/SemaDeclCXX.cpp U lib/Sema/SemaTemplateInstantiateDecl.cpp U lib/Sema/SemaDecl.cpp U lib/Sema/SemaTemplateInstantiate.cpp U lib/AST/DeclCXX.cpp svn merge -c -114925 https://llvm.org/svn/llvm-project/cfe/trunk --- Reverse-merging r114925 into '.': G include/clang/AST/DeclCXX.h G lib/Sema/SemaDeclCXX.cpp G lib/AST/DeclCXX.cpp svn merge -c -114924 https://llvm.org/svn/llvm-project/cfe/trunk --- Reverse-merging r114924 into '.': G include/clang/AST/DeclCXX.h G lib/Sema/SemaDeclCXX.cpp G lib/Sema/SemaDecl.cpp G lib/AST/DeclCXX.cpp U lib/AST/ASTContext.cpp svn merge -c -114921 https://llvm.org/svn/llvm-project/cfe/trunk --- Reverse-merging r114921 into '.': G include/clang/AST/DeclCXX.h G lib/Sema/SemaDeclCXX.cpp G lib/Sema/SemaDecl.cpp G lib/AST/DeclCXX.cpp llvm-svn: 114933	2010-09-28 01:09:49 +00:00
Anton Yartsev	79d6af3839	formatted everything to fit within 80 columns llvm-svn: 114249	2010-09-18 00:39:16 +00:00
Chris Lattner	ee8df8f167	fix PR7192 by defining wchar_t in a more conventional way. The type of L"x" can change based on command line arguments. llvm-svn: 113127	2010-09-05 23:29:49 +00:00
Chris Lattner	212a492063	fix incorrect MM_HINT_ definitions, PR8011 llvm-svn: 112283	2010-08-27 20:10:06 +00:00
Eric Christopher	2a9898f0a2	Move some type defines from smmintrin.h to emmintrin.h to match where gcc defines them. llvm-svn: 112146	2010-08-26 02:09:25 +00:00
Nick Lewycky	0b84914da0	Add x86intrin.h which is generic x86 intrinsics for more than just Intel. Thus far, this just #include's immintrin.h for compatibility. llvm-svn: 111785	2010-08-22 20:38:05 +00:00
Benjamin Kramer	6f35f3cd80	Disallow direct inclusion of avxintrin.h. Users should include immintrin.h instead. This matches GCC's behavior. llvm-svn: 111692	2010-08-20 23:00:03 +00:00
Benjamin Kramer	65b9f7b255	Add immintrin meta header. - This is the official way to get AVX intrinsics, we might want to disallow direct inclusion of avxintrin.h, just like GCC does. llvm-svn: 111660	2010-08-20 18:04:07 +00:00
Chris Lattner	1b55b75e24	alphabeticalize llvm-svn: 111654	2010-08-20 17:24:02 +00:00
Chris Lattner	21a597a31d	hopefully unbreak the msvc buildbot. llvm-svn: 111653	2010-08-20 17:23:33 +00:00
Benjamin Kramer	ae8ea1f715	Fix header comments. llvm-svn: 111645	2010-08-20 16:47:17 +00:00
Chris Lattner	9052c35479	fix some vector extractions to return properly zero extended values (instead of sign extending) to match ICC. GCC is changing this in a series of their own PRs (e.g. 41323). llvm-svn: 111637	2010-08-20 16:08:33 +00:00
Anton Yartsev	583a1cf7b5	support for predicates with bool/pixel arguments llvm-svn: 111515	2010-08-19 11:57:49 +00:00
Anton Yartsev	fc83c60755	support for the rest of AltiVec functions with bool/pixel arguments and return values (except predicates) llvm-svn: 111511	2010-08-19 03:21:36 +00:00
Anton Yartsev	9e96898032	support for vec_perm and all dependent functions (vec_mergeh, vec_mergel, vec_pack, vec_sld, vec_splat) with bool/pixel arguments and return values llvm-svn: 111509	2010-08-19 03:00:09 +00:00
Anton Yartsev	2cc136d4e3	support for vec_add, vec_adds, vec_and, vec_andc with bool arguments llvm-svn: 111141	2010-08-16 16:22:12 +00:00
Anton Yartsev	bfd0f96e79	first test commit llvm-svn: 110941	2010-08-12 18:51:55 +00:00
Bruno Cardoso Lopes	8c333153e0	Fix define inserting a comma :) llvm-svn: 110839	2010-08-11 18:45:43 +00:00
Bruno Cardoso Lopes	65954ffc69	Remove 256-bit cast built-ins and make the AVX intrinsic call llvm __builtin_shufflevector with the appropriate arguments llvm-svn: 110771	2010-08-11 02:14:38 +00:00
Bruno Cardoso Lopes	a4f1930b75	Remove 256-bit unpack built-ins and make the AVX intrinsic call llvm __builtin_shufflevector with the appropriate arguments llvm-svn: 110768	2010-08-11 01:43:24 +00:00
Bruno Cardoso Lopes	e712a135b7	Remove 256-bit shuffle built-ins and make the AVX intrinsic call llvm __builtin_shufflevector with the appropriate arguments llvm-svn: 110766	2010-08-11 01:17:34 +00:00

... 7 8 9 10 11 ...

1061 Commits