llvm-project

Commit Graph

Author	SHA1	Message	Date
Michael Zuckerman	0165e7669c	[CLANG][AVX512][BUILTIN] Adding PSRLV builtin Differential Revision: http://reviews.llvm.org/D17718 llvm-svn: 262326	2016-03-01 13:03:45 +00:00
Michael Zuckerman	1ac360cca4	[CLANG] [AVX512] [BUILTIN] Adding PSRA{Q\|D\|QI\|DI}{128\|256\|512} builtin Differential Revision: http://reviews.llvm.org/D17693 llvm-svn: 262321	2016-03-01 11:38:16 +00:00
Logan Chien	3267ca225d	Add ARM EHABI-related constants to unwind.h. Adds a number of constants, defined in the ARM EHABI spec, to the Clang lib/Headers/unwind.h header. This is prerequisite for landing http://reviews.llvm.org/D15781, as previously discussed there. Patch by Timon Van Overveldt. llvm-svn: 262178	2016-02-28 15:01:42 +00:00
Michael Zuckerman	431b0e18b4	[CLANG] [AVX512] [BUILTIN] Adding PSLL{V\|W\|Wi}{128\|256\|512} builtin Differential Revision: http://reviews.llvm.org/D17685 llvm-svn: 262177	2016-02-28 07:39:34 +00:00
Chris Bieneman	2c6c01a4fc	[CMake] Fixing install-clang-headers dependencies to depend on generating the headers. llvm-svn: 261911	2016-02-25 18:39:19 +00:00
Justin Lebar	d7a35492ad	[CUDA] Add conversion operators for threadIdx, blockIdx, gridDim, and blockDim to uint3 and dim3. Summary: This lets you write, e.g. uint3 a = threadIdx; uint3 b = blockIdx; dim3 c = gridDim; dim3 d = blockDim; which is legal in nvcc, but was not legal in clang. The fact that e.g. the type of threadIdx is not actually uint3 is still observable, but now you have to try to observe it. Reviewers: tra Subscribers: echristo, cfe-commits Differential Revision: http://reviews.llvm.org/D17561 llvm-svn: 261777	2016-02-24 21:49:33 +00:00
Justin Lebar	c8dae5378b	[CUDA] Add hack so code which includes "curand.h" doesn't break. Summary: curand.h includes curand_mtgp32_kernel.h. In host mode, this header redefines threadIdx and blockDim, giving them their "proper" types of uint3 and dim3, respectively. clang has its own plan for these variables -- their types are magic builtin classes. So these redefinitions are incompatible. As a hack, we force-include the offending CUDA header and use #defines to get the right types for threadIdx and blockDim. Reviewers: tra Subscribers: echristo, cfe-commits Differential Revision: http://reviews.llvm.org/D17562 llvm-svn: 261776	2016-02-24 21:49:31 +00:00
Michael Zuckerman	6c317515e4	[CLANG] [AVX512] [BUILTIN] Adding PSHUF{L\|H}W{128\|256\|512} builtin to clang . Differential Revision: http://reviews.llvm.org/D17539 llvm-svn: 261755	2016-02-24 17:39:35 +00:00
Michael Zuckerman	e98cc7477f	[CLANG] [AVX512] [BUILTIN] Adding prorv{d\|q}{128\|256\|512} builtin to clang Differential Revision: http://reviews.llvm.org/D17512 llvm-svn: 261641	2016-02-23 15:59:47 +00:00
Michael Zuckerman	4924c7a2b5	[CLANG] [AVX512] [BUILTIN] Adding pro{lv\|r}{d\|q}{128\|256\|512} builtin to clang Adding closer to the end of macro }->}) Differential Revision: http://reviews.llvm.org/D17506 llvm-svn: 261638	2016-02-23 14:23:53 +00:00
Michael Zuckerman	0231f1649b	[CLANG] [AVX512] [BUILTIN] Adding pro{lv\|r}{d\|q}{128\|256\|512} builtin to clang Differential Revision: http://reviews.llvm.org/D17506 llvm-svn: 261635	2016-02-23 13:41:13 +00:00
Michael Zuckerman	477e0a326b	[CLANG] [AVX512] [BUILTIN] Adding prol{d\|q\|w}{128\|256\|512} builtin to clang . Fixing problem with the lib/include/avx512vlintrin.h file. Adding one more _ to the prefix of _extension__ -> __extension__. Differential Revision: http://reviews.llvm.org/D16985 llvm-svn: 261518	2016-02-22 09:42:57 +00:00
Michael Zuckerman	38a2727764	[CLANG] [AVX512] [BUILTIN] Adding prol{d\|q\|w}{128\|256\|512} builtin to clang . Differential Revision: http://reviews.llvm.org/D16985 llvm-svn: 261516	2016-02-22 09:05:41 +00:00
Michael Zuckerman	7a33dce4ef	[CLANG] [AVX512] [BUILTIN] Adding pmovzx{b\|d\|w}{w\|d\|q}{128\|256\|512} builtin to clang Differential Revision: http://reviews.llvm.org/D16961 llvm-svn: 261471	2016-02-21 14:00:11 +00:00
David Majnemer	7a0d7d6be9	Remove a duplicate declaration specifier from _ReadBarrier This fixes PR26675. llvm-svn: 261388	2016-02-20 00:57:00 +00:00
Michael Zuckerman	7cdb72f7ea	[CLANG] [AVX512] [BUILTIN] Adding pmovsx{b\|d\|w}{w\|d\|q}{128\|256\|512} builtin to clang Differential Revision: http://reviews.llvm.org/D16955 llvm-svn: 261196	2016-02-18 09:09:34 +00:00
Artem Belevich	7f522b7876	Added missing '__'. llvm-svn: 260719	2016-02-12 20:26:43 +00:00
Eric Christopher	39a84d0b9b	Update functions in clang supplied headers to use the compiler reserved namespace for arguments. llvm-svn: 260647	2016-02-12 02:22:53 +00:00
Richard Smith	66a7385e27	<float.h>: do not define DECIMAL_DIG in -std=c89 mode; this macro was added in C99. Patch by Jorge Teixeira! llvm-svn: 260639	2016-02-12 01:15:33 +00:00
Eric Christopher	0466c7ce23	Use __ before argument names in provided headers. llvm-svn: 260631	2016-02-12 00:32:23 +00:00
Richard Smith	b473e1e473	In C11, provide macros FLT_DECIMAL_DIG, DBL_DECIMAL_DIG, and LDBL_DECIMAL_DIG in <float.h>. Patch by Jorge Teixeira! llvm-svn: 260577	2016-02-11 19:57:37 +00:00
Ekaterina Romanova	a61946d551	This patch adds doxygen comments for all the intrinsincs in the header file f16cintrin.h. The doxygen comments are automatically generated based on Sony's intrinsics document. Differential Revision: http://reviews.llvm.org/D17021 llvm-svn: 260333	2016-02-10 00:12:24 +00:00
Ekaterina Romanova	d416747803	This patch adds doxygen comments for all the intrinsincs in the header file pmmintrin.h. The doxygen comments are automatically generated based on Sony's intrinsics document. Differential Revision: http://reviews.llvm.org/D16913 llvm-svn: 260160	2016-02-08 22:35:09 +00:00
Igor Breger	9c2a0bfa13	AVX512: Change builtin function name for scalar intrinsics. Add "mask" to function name to reflect the function behavior. Differential Revision: http://reviews.llvm.org/D16957 llvm-svn: 260088	2016-02-08 12:36:48 +00:00
Artem Belevich	2aad2b3500	[CUDA] Bug 26497 : Remove wrappers for variants provided by CUDA headers. ... and pull global-scope ones into std namespace with using-declaration. Differential Revision: http://reviews.llvm.org/D16932 llvm-svn: 259944	2016-02-05 22:54:05 +00:00
Artem Belevich	7b660e2604	[CUDA] added declarations for device-side system calls ...and std:: wrappers for free/malloc. llvm-svn: 259690	2016-02-03 20:53:58 +00:00
Ekaterina Romanova	0e19cf2dd8	This patch adds doxygen comments for the intrinsincs in the header file __wmmintrin_aes.h. The doxygen comments are automatically generated based on Sony's intrinsics document. Differential Revision: http://reviews.llvm.org/D16562 llvm-svn: 259275	2016-01-29 23:59:00 +00:00
Ekaterina Romanova	deec50a3d2	This patch adds doxygen comments for the intrinsincs in the header file __wmmintrin_pclmul.h. The doxygen comments are automatically generated based on Sony's intrinsics document. Differential Revision: http://reviews.llvm.org/D15999 llvm-svn: 259239	2016-01-29 20:37:14 +00:00
Artem Belevich	c5f41a34e5	[CUDA] Implemented device-side support functions in <cmath>. CUDA expects math functions in std:: namespace to work on device side. In order to make it work with clang without allowing device-side code generation for functions w/o appropriate target attributes, this patch provides device-side implementations for <cmath> functions. Most of them call global-scope math functions provided by CUDA headers. In few cases we use clang builtins. Tested out-of tree by compiling and running thrust's unit_tests. https://github.com/thrust/thrust/tree/master/testing Differential Revision: http://reviews.llvm.org/D16593 llvm-svn: 258880	2016-01-26 23:37:29 +00:00
Chris Bieneman	2bf68c6c1c	Remove autoconf support Summary: This patch is provided in preparation for removing autoconf on 1/26. The proposal to remove autoconf on 1/26 was discussed on the llvm-dev thread here: http://lists.llvm.org/pipermail/llvm-dev/2016-January/093875.html "This is the way [autoconf] ends Not with a bang but a whimper." -T.S. Eliot Reviewers: chandlerc, grosbach, bob.wilson, echristo Subscribers: klimek, cfe-commits Differential Revision: http://reviews.llvm.org/D16472 llvm-svn: 258862	2016-01-26 21:30:40 +00:00
Justin Lebar	3039a593db	[CUDA] Make printf work. Summary: The code in CGCUDACall is largely based on a patch written by Eli Bendersky: http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20140324/210218.html That patch implemented an LLVM pass lowering printf to vprintf; this one does something similar, but in Clang codegen. Reviewers: echristo Subscribers: cfe-commits, jhen, tra, majnemer Differential Revision: http://reviews.llvm.org/D16372 llvm-svn: 258642	2016-01-23 21:28:14 +00:00
Ekaterina Romanova	08d1f2431d	2 missing intrinsics _cvtss_sh and _mm_cvtps_ph were added to the intrinsics header f16intrin.h Differential Revision: http://reviews.llvm.org/D16177 llvm-svn: 258492	2016-01-22 06:50:50 +00:00
Adam Nemet	e708747129	[AVX512] Fix typo in r226298 Hal noticed that the double/float got mixed up on the parameters for these. llvm-svn: 258108	2016-01-19 02:02:25 +00:00
Kyle Butt	436ff85b63	[PPC] Add long long/double support for vec_cts, vec_ctu and vec_ctf Add long long/double support for vec_cts, vec_ctu and vec_ctf. Similar to this change in GCC: https://gcc.gnu.org/ml/gcc-patches/2014-08/msg02653.html Patch by Tim Shen. llvm-svn: 257135	2016-01-08 02:00:48 +00:00
David Majnemer	30f9bfd574	Reimplement __readeflags and __writeeflags on top of intrinsics Lean on LLVM to provide this functionality now that it provides the necessary intrinsics. llvm-svn: 256686	2016-01-01 06:50:08 +00:00
Asaf Badouh	a9d1e18f48	[X86][PKU] add clang intrinsic for {RD\|WR}PKRU Differential Revision: http://reviews.llvm.org/D15837 llvm-svn: 256672	2015-12-31 14:14:07 +00:00
Eric Christopher	7f7d9bea6f	Fix up comment in header. llvm-svn: 256508	2015-12-28 19:07:46 +00:00
Michael Kuperstein	591278c08d	[X86] Add missing m64/int64 conversions Define the 64-bit equivalents of _m_to_int and _m_from_int. Differential Revision: http://reviews.llvm.org/D15572 llvm-svn: 256122	2015-12-20 12:37:18 +00:00
Michael Kuperstein	beae026738	[X86] Add signed aliases for popcnt intrinsics The Intel manual documents both an unsigned form (_mm_popcnt_u32) and a signed form (_popcnt32) of the intrinsic. Add the missing signed form. Differential Revision: http://reviews.llvm.org/D15568 llvm-svn: 256121	2015-12-20 12:35:35 +00:00
Artem Belevich	8e9ba042a6	[CUDA] runtime wrapper header tweaks * Pull in host-only implementations of few CUDA-specific math functions. * #nclude <cmath> early to prevent its inclusion from CUDA headers after they've messed with __THROW macro. llvm-svn: 255933	2015-12-17 22:25:22 +00:00
Artem Belevich	7fda3c9ff3	[CUDA] renamed cuda_runtime.h wrapper to __cuda_runtime.h Currently it's easy to break CUDA compilation by passing "-isystem /path/to/cuda/include" to compiler which leads to compiler including real cuda_runtime.h from there instead of the wrapper we need. Renaming the wrapper ensures that we can include the wrapper regardless of user-specified include paths and files. Differential Revision: http://reviews.llvm.org/D15534 llvm-svn: 255802	2015-12-16 18:51:59 +00:00
Asaf Badouh	5e4248b4e0	[x86][avx512] more changes in intrinsics to be align with gcc format Differential Revision: http://reviews.llvm.org/D15328 llvm-svn: 255012	2015-12-08 12:34:38 +00:00
Asaf Badouh	3e5111e313	[avx512] rename gcc intrinsics to be align with gcc format rename the gcc intrinsics suffix : _mask ->_round Differential Revision: http://reviews.llvm.org/D15284 llvm-svn: 254906	2015-12-07 13:14:22 +00:00
Paul Robinson	941bc91518	Move _mm256_cvtps_ph and _mm256_cvtph_ps to immintrin.h. This more closely matches their locations as described by Intel documentation, and lets us remove a pair of redundant typedefs. Differential Revision: http://reviews.llvm.org/D15127 llvm-svn: 254528	2015-12-02 18:41:52 +00:00
Craig Topper	5ec97a7b9b	[X86] Improve codegen for AVX2 gather with an all 1s mask. Use undefined instead of setzero as the pass through input since its going to be fully overwritten. Use cmpeq of two zero vectors to produce the all 1s vector. Casting -1 to a double and vectorizing causes a constant load of a -1.0 floating point value. llvm-svn: 254389	2015-12-01 07:12:59 +00:00
Craig Topper	e20b8c68ed	[X86] _mm256_permutevar8x32_ps should take an integer vector for its shuffle index input. llvm-svn: 254270	2015-11-29 22:53:32 +00:00
Craig Topper	3a71f35a67	[X86] Remove temporary variables from intrinsic macros. NFC llvm-svn: 254247	2015-11-29 06:50:33 +00:00
Argyrios Kyrtzidis	dcb5653516	[CMake] Add a specific 'install-clang-headers' target. llvm-svn: 253636	2015-11-20 02:24:03 +00:00
Artem Belevich	c29db84419	[CUDA] Added a wrapper header for inclusion of stock CUDA headers. Header files that come with CUDA are assuming split host/device compilation and are not usable by clang out of the box. With a bit of preprocessor magic it's possible to twist them into something clang can use. This wrapper always includes CUDA headers exactly the same way during host and device compilation passes and produces identical preprocessed content during host and device side compilation for sm_35 GPUs. Device compilation passes for older GPUs will see a smaller subset of device functions supported by particular GPU. The wrapper assumes specific contents of CUDA header files and works only with CUDA 7.0 and 7.5. Differential Revision: http://reviews.llvm.org/D13171 llvm-svn: 253388	2015-11-17 22:28:52 +00:00
Hans Wennborg	1acf955a6a	bmiintrin.h: Allow using the tzcnt intrinsics for non-BMI targets The tzcnt intrinsics are used non non-BMI targets by code (e.g. ffmpeg) that uses it as a potentially faster BSF. The TZCNT instruction is special in that it's encoded in a backward-compatible way and behaves as BSF on non-BMI targets. Differential Revision: http://reviews.llvm.org/D14748 llvm-svn: 253358	2015-11-17 18:46:48 +00:00
Oliver Stannard	7aa90f5735	[ARM,AArch64] Fix __rev16l and __rev16ll intrinsics These two intrinsics are defined in arm_acle.h. __rev16l needs to rotate by 16 bits, bit it was actually rotating by 2 bits. For AArch64, where long is 64 bits, this would still be wrong. __rev16ll was incorrect, it reversed the bytes in each 32-bit word, rather than each 16-bit halfword. The correct implementation is to apply __rev16 to the top and bottom words of the 64-bit value. For AArch32 targets, these get compiled down to the hardware rev16 instruction at -O1 and above. For AArch64 targets, the 64-bit ones get compiled to two 32-bit rev16 instructions, because there is not currently a pattern for the 64-bit rev16 instruction. Differential Revision: http://reviews.llvm.org/D14609 llvm-svn: 253211	2015-11-16 14:58:50 +00:00
Craig Topper	fb79b5f273	[X86] Add 'pause' builtin that's already in llvm and use it instead of inline assembly to implement _mm_pause. llvm-svn: 252712	2015-11-11 08:13:33 +00:00
Craig Topper	a5455524c2	[X86] Use __builtin_ia32_paddq and __builtin_ia32_psubq to implement a couple intrinsics that were supposed to operate on MMX registers. Otherwise we end up operating on GPRs. Throw in a test for _mm_mul_su32 while I was there. llvm-svn: 252711	2015-11-11 08:00:41 +00:00
Craig Topper	880f60b7b3	[X86] Header formatting fixes. NFC llvm-svn: 252710	2015-11-11 08:00:39 +00:00
Craig Topper	d619eaaae4	[X86] Add missing typecasts in intrinsic macros. This should make them more robust against inputs that aren't already the right type. llvm-svn: 252700	2015-11-11 03:47:10 +00:00
Craig Topper	19744ee6ad	[X86] Change pointer type in AVX2 gather builtins to be the scalar type instead of the vector type. This matches gcc and removes extras casts. llvm-svn: 252697	2015-11-11 02:51:18 +00:00
Craig Topper	fd778eebac	[X86] Use setzero instead of set1(0) in a few places in intrinsic headers. llvm-svn: 252587	2015-11-10 05:08:08 +00:00
Craig Topper	7148166785	[X86] Remove temporary variables from macros in x86 intrinsic headers. Prevents duplicate names appearing from multiple macro expansions. NFC llvm-svn: 252586	2015-11-10 05:08:05 +00:00
Craig Topper	166f8b20a3	[X86] Fix bad intrinsic header comment. NFC. llvm-svn: 252585	2015-11-10 05:08:00 +00:00
Craig Topper	991d499457	Fix a couple intrinsic header comments. NFC llvm-svn: 251900	2015-11-03 06:16:31 +00:00
Eric Christopher	99af5b2ea7	Handle target builtin options that are all required rather than only one of a group of possibilities. This changes the syntax in the builtin files to represent: , as the and operator \| as the or operator The former syntax matches how the backend tablegen files represent multiple subtarget features being required. Updated the builtin and intrinsic headers accordingly for the new syntax. llvm-svn: 251388	2015-10-27 06:11:03 +00:00
Andrea Di Biagio	8bb12d0a77	[x86] Fix maskload/store intrinsic definitions in avxintrin.h According to the Intel documentation, the mask operand of a maskload and maskstore intrinsics is always a vector of packed integer/long integer values. This patch introduces the following two changes: 1. It fixes the avx maskload/store intrinsic definitions in avxintrin.h. 2. It changes BuiltinsX86.def to match the correct gcc definitions for avx maskload/store (see D13861 for more details). Differential Revision: http://reviews.llvm.org/D13861 llvm-svn: 250816	2015-10-20 11:19:54 +00:00
Craig Topper	e33f51fa91	[X86] Add fxsr feature name for fxsave/fxrestore builtins. llvm-svn: 250498	2015-10-16 06:22:36 +00:00
Peter Collingbourne	e919b0f9ad	Headers: Switch some headers to LF line endings for consistency. llvm-svn: 250388	2015-10-15 10:33:27 +00:00
Hans Wennborg	4ca00afd7c	Intrin.h: implement __emul and __emulu llvm-svn: 250301	2015-10-14 16:24:28 +00:00
Eric Christopher	525334cf6c	Add subtarget feature support for 3dnowa to the 3dnowa intrinsics. llvm-svn: 250202	2015-10-13 18:40:17 +00:00
Amjad Aboud	2b9b8a5921	[X86] Add XSAVE intrinsic family Add intrinsics for the XSAVE instructions (XSAVE/XSAVE64/XRSTOR/XRSTOR64) XSAVEOPT instructions (XSAVEOPT/XSAVEOPT64) XSAVEC instructions (XSAVEC/XSAVEC64) XSAVES instructions (XSAVES/XSAVES64/XRSTORS/XRSTORS64) Differential Revision: http://reviews.llvm.org/D13014 llvm-svn: 250158	2015-10-13 12:29:35 +00:00
Ahmed Bougacha	7dfaaf3891	[Headers][X86] Fix stream_load (movntdqa) to accept const. Per Intel intrinsics guide: - _mm256_stream_load_si256 takes `__m256i const ' - _mm_stream_load_si128 takes `__m128i ', for no good reason. Let's accept const for both. llvm-svn: 249213	2015-10-02 23:29:26 +00:00
Chandler Carruth	cbe6411401	Fix the SSE4 byte sign extension in a cleaner way, and more thoroughly test that our intrinsics behave the same under -fsigned-char and -funsigned-char. This further testing uncovered that AVX-2 has a broken cmpgt for 8-bit elements, and has for a long time. This is fixed in the same way as SSE4 handles the case. The other ISA extensions currently work correctly because they use specific instruction intrinsics. As soon as they are rewritten in terms of generic IR, they will need to add these special casts. I've added the necessary testing to catch this however, so we shouldn't have to chase it down again. I considered changing the core typedef to be signed, but that seems like a bad idea. Notably, it would be an ABI break if anyone is reaching into the innards of the intrinsic headers and passing __v16qi on an API boundary. I can't be completely confident that this wouldn't happen due to a macro expanding in a lambda, etc., so it seems much better to leave it alone. It also matches GCC's behavior exactly. A fun side note is that for both GCC and Clang, -funsigned-char really does change the semantics of __v16qi. To observe this, consider: % cat x.cc #include <smmintrin.h> #include <iostream> int main() { __v16qi a = { 1, -1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0}; __v16qi b = _mm_set1_epi8(-1); std::cout << (int)(a / b)[0] << ", " << (int)(a / b)[1] << '\n'; } % clang++ -o x x.cc && ./x -1, 1 % clang++ -funsigned-char -o x x.cc && ./x 0, 1 However, while this may be surprising, both Clang and GCC agree. Differential Revision: http://reviews.llvm.org/D13324 llvm-svn: 249097	2015-10-01 23:40:12 +00:00
Chandler Carruth	9143378db0	Patch over a really horrible bug in our vector builtins that showed up recently when we started using direct conversion to model sign extension. The __v16qi type we use for SSE v16i8 vectors is defined in terms of 'char' which may or may not be signed! This causes us to generate pmovsx and pmovzx depending on the setting of -funsigned-char. This patch just forms an explicitly signed type and uses that to formulate the sign extension. While this gets the correct behavior (which we now verify with the enhanced test) this is just the tip of the ice berg. Now that I know what to look for, I have found errors of this sort throughout our vector code. Fortunately, this is the only specific place where I know of users actively having their code miscompiled by Clang due to this, so I'm keeping the fix for those users minimal and targeted. I'll be sending a proper email for discussion of how to fix these systematically, what the implications are, and just how widely broken this is... From what I can tell, we have never shipped a correct set of builtin headers for x86 when users rely on -funsigned-char. Oops. llvm-svn: 248980	2015-10-01 02:21:34 +00:00
Nemanja Ivanovic	a0deee530b	Forgot to remove a FIXME that has been fixed. NFC. llvm-svn: 248815	2015-09-29 18:20:59 +00:00
Nemanja Ivanovic	236904ea9e	Addition of interfaces the FE to conform to Table A-2 of ELF V2 ABI V1.1 This patch corresponds to review: http://reviews.llvm.org/D13190 Implemented the following interfaces to conform to ELF V2 ABI version 1.1. vector signed __int128 vec_adde (vector signed __int128, vector signed __int128, vector signed __int128); vector unsigned __int128 vec_adde (vector unsigned __int128, vector unsigned __int128, vector unsigned __int128); vector signed __int128 vec_addec (vector signed __int128, vector signed __int128, vector signed __int128); vector unsigned __int128 vec_addec (vector unsigned __int128, vector unsigned __int128, vector unsigned __int128); vector signed int vec_addc(vector signed int __a, vector signed int __b); vector bool char vec_cmpge (vector signed char __a, vector signed char __b); vector bool char vec_cmpge (vector unsigned char __a, vector unsigned char __b); vector bool short vec_cmpge (vector signed short __a, vector signed short __b); vector bool short vec_cmpge (vector unsigned short __a, vector unsigned short __b); vector bool int vec_cmpge (vector signed int __a, vector signed int __b); vector bool int vec_cmpge (vector unsigned int __a, vector unsigned int __b); vector bool char vec_cmple (vector signed char __a, vector signed char __b); vector bool char vec_cmple (vector unsigned char __a, vector unsigned char __b); vector bool short vec_cmple (vector signed short __a, vector signed short __b); vector bool short vec_cmple (vector unsigned short __a, vector unsigned short __b); vector bool int vec_cmple (vector signed int __a, vector signed int __b); vector bool int vec_cmple (vector unsigned int __a, vector unsigned int __b); vector double vec_double (vector signed long long __a); vector double vec_double (vector unsigned long long __a); vector bool char vec_eqv(vector bool char __a, vector bool char __b); vector bool short vec_eqv(vector bool short __a, vector bool short __b); vector bool int vec_eqv(vector bool int __a, vector bool int __b); vector bool long long vec_eqv(vector bool long long __a, vector bool long long __b); vector signed short vec_madd(vector signed short __a, vector signed short __b, vector signed short __c); vector signed short vec_madd(vector signed short __a, vector unsigned short __b, vector unsigned short __c); vector signed short vec_madd(vector unsigned short __a, vector signed short __b, vector signed short __c); vector unsigned short vec_madd(vector unsigned short __a, vector unsigned short __b, vector unsigned short __c); vector bool long long vec_mergeh(vector bool long long __a, vector bool long long __b); vector bool long long vec_mergel(vector bool long long __a, vector bool long long __b); vector bool char vec_nand(vector bool char __a, vector bool char __b); vector bool short vec_nand(vector bool short __a, vector bool short __b); vector bool int vec_nand(vector bool int __a, vector bool int __b); vector bool long long vec_nand(vector bool long long __a, vector bool long long __b); vector bool char vec_orc(vector bool char __a, vector bool char __b); vector bool short vec_orc(vector bool short __a, vector bool short __b); vector bool int vec_orc(vector bool int __a, vector bool int __b); vector bool long long vec_orc(vector bool long long __a, vector bool long long __b); vector signed long long vec_sub(vector signed long long __a, vector signed long long __b); vector signed long long vec_sub(vector bool long long __a, vector signed long long __b); vector signed long long vec_sub(vector signed long long __a, vector bool long long __b); vector unsigned long long vec_sub(vector unsigned long long __a, vector unsigned long long __b); vector unsigned long long vec_sub(vector bool long long __a, vector unsigned long long __b); vector unsigned long long vec_sub(vector unsigned long long __V2 ABI V1.1 http://ror float vec_sub(vector float __a, vector float __b); unsigned char vec_extract(vector bool char __a, int __b); signed short vec_extract(vector signed short __a, int __b); unsigned short vec_extract(vector bool short __a, int __b); signed int vec_extract(vector signed int __a, int __b); unsigned int vec_extract(vector bool int __a, int __b); signed long long vec_extract(vector signed long long __a, int __b); unsigned long long vec_extract(vector unsigned long long __a, int __b); unsigned long long vec_extract(vector bool long long __a, int __b); double vec_extract(vector double __a, int __b); vector bool char vec_insert(unsigned char __a, vector bool char __b, int __c); vector signed short vec_insert(signed short __a, vector signed short __b, int __c); vector bool short vec_insert(unsigned short __a, vector bool short __b, int __c); vector signed int vec_insert(signed int __a, vector signed int __b, int __c); vector bool int vec_insert(unsigned int __a, vector bool int __b, int __c); vector signed long long vec_insert(signed long long __a, vector signed long long __b, int __c); vector unsigned long long vec_insert(unsigned long long __a, vector unsigned long long __b, int __c); vector bool long long vec_insert(unsigned long long __a, vector bool long long __b, int __c); vector double vec_insert(double __a, vector double __b, int __c); vector signed long long vec_splats(signed long long __a); vector unsigned long long vec_splats(unsigned long long __a); vector signed __int128 vec_splats(signed __int128 __a); vector unsigned __int128 vec_splats(unsigned __int128 __a); vector double vec_splats(double __a); int vec_all_eq(vector double __a, vector double __b); int vec_all_ge(vector double __a, vector double __b); int vec_all_gt(vector double __a, vector double __b); int vec_all_le(vector double __a, vector double __b); int vec_all_lt(vector double __a, vector double __b); int vec_all_nan(vector double __a); int vec_all_ne(vector double __a, vector double __b); int vec_all_nge(vector double __a, vector double __b); int vec_all_ngt(vector double __a, vector double __b); int vec_any_eq(vector double __a, vector double __b); int vec_any_ge(vector double __a, vector double __b); int vec_any_gt(vector double __a, vector double __b); int vec_any_le(vector double __a, vector double __b); int vec_any_lt(vector double __a, vector double __b); int vec_any_ne(vector double __a, vector double __b); vector unsigned char vec_sbox_be (vector unsigned char); vector unsigned char vec_cipher_be (vector unsigned char, vector unsigned char); vector unsigned char vec_cipherlast_be (vector unsigned char, vector unsigned char); vector unsigned char vec_ncipher_be (vector unsigned char, vector unsigned char); vector unsigned char vec_ncipherlast_be (vector unsigned char, vector unsigned char); vector unsigned int vec_shasigma_be (vector unsigned int, const int, const int); vector unsigned long long vec_shasigma_be (vector unsigned long long, const int, const int); vector unsigned short vec_pmsum_be (vector unsigned char, vector unsigned char); vector unsigned int vec_pmsum_be (vector unsigned short, vector unsigned short); vector unsigned long long vec_pmsum_be (vector unsigned int, vector unsigned int); vector unsigned __int128 vec_pmsum_be (vector unsigned long long, vector unsigned long long); vector unsigned char vec_gb (vector unsigned char); vector unsigned long long vec_bperm (vector unsigned __int128 __a, vector unsigned char __b); Removed the folowing interfaces either because their signatures have changed in version 1.1 of the ABI or because they were implemented for ELF V2 ABI but have actually been deprecated in version 1.1. vector signed char vec_eqv(vector bool char __a, vector signed char __b); vector signed char vec_eqv(vector signed char __a, vector bool char __b); vector unsigned char vec_eqv(vector bool char __a, vector unsigned char __b); vector unsigned char vec_eqv(vector unsigned char __a, vector bool char __b); vector signed short vec_eqv(vector bool short __a, vector signed short __b); vector signed short vec_eqv(vector signed short __a, vector bool short __b); vector unsigned short vec_eqv(vector bool short __a, vector unsigned short __b); vector unsigned short vec_eqv(vector unsigned short __a, vector bool short __b); vector signed int vec_eqv(vector bool int __a, vector signed int __b); vector signed int vec_eqv(vector signed int __a, vector bool int __b); vector unsigned int vec_eqv(vector bool int __a, vector unsigned int __b); vector unsigned int vec_eqv(vector unsigned int __a, vector bool int __b); vector signed long long vec_eqv(vector bool long long __a, vector signed long long __b); vector signed long long vec_eqv(vector signed long long __a, vector bool long long __b); vector unsigned long long vec_eqv(vector bool long long __a, vector unsigned long long __b); vector unsigned long long vec_eqv(vector unsigned long long __a, vector bool long long __b); vector float vec_eqv(vector bool int __a, vector float __b); vector float vec_eqv(vector float __a, vector bool int __b); vector double vec_eqv(vector bool long long __a, vector double __b); vector double vec_eqv(vector double __a, vector bool long long __b); vector unsigned short vec_nand(vector bool short __a, vector unsigned short __b); llvm-svn: 248813	2015-09-29 18:13:34 +00:00
Nico Weber	1f22a34409	ms Intrin.h: Fix __movsw's and __stosw's inline asm. Before, clang's internal assembler would reject the inline asm in clang's Intrin.h. To make sure this doesn't happen for other Intrin.h functions using __asm__ blocks, add 32-bit and 64-bit codegen tests for Intrin.h. Sadly, these tests discovered that __readcr3 and __writecr3 have bad implementations in 64-bit builds. This will have to be fixed in a follow-up. llvm-svn: 248234	2015-09-22 00:46:21 +00:00
Michael Kuperstein	a10dff946e	[X86] Make f16c intrinsics accessible through emmintrin.h, per Intel docs Differential Revision: http://reviews.llvm.org/D13015 llvm-svn: 248156	2015-09-21 13:34:47 +00:00
Michael Kuperstein	5c2cb0eee2	[X86] Fix some non-reserved parameter names in intrinsic headers Differential Revision: http://reviews.llvm.org/D13009 llvm-svn: 248150	2015-09-21 11:45:27 +00:00
Simon Pilgrim	12919f7e49	[X86][SSE] Replace 128-bit SSE41 PMOVSX intrinsics with native IR 128-bit vector integer sign extensions correctly lower to the pmovsx instructions even for debug builds. This patch removes the builtins and reimplements the _mm_cvtepi_epi intrinsics __using builtin_shufflevector (to extract the bottom most subvector) and __builtin_convertvector (to actually perform the sign extension). Differential Revision: http://reviews.llvm.org/D12835 llvm-svn: 248092	2015-09-19 15:12:38 +00:00
Asaf Badouh	2718051dd7	re-apply r.247881 fixed the tests. llvm-svn: 247892	2015-09-17 14:53:37 +00:00
Asaf Badouh	8a61250709	revert r.247881 due to tests failures llvm-svn: 247883	2015-09-17 13:09:33 +00:00
Asaf Badouh	a0e5e71ef1	[X86][AVX512DQ] add new intrinsics convert i64 to FP and vice versa reduceps & reducepd rangeps & rangepd all in their 512bit versions Differential Revision: http://reviews.llvm.org/D11716 llvm-svn: 247881	2015-09-17 11:56:04 +00:00
Sean Silva	e4c3760a9f	Clean up trailing whitespace in the builtin headers llvm-svn: 247498	2015-09-12 02:55:19 +00:00
Simon Pilgrim	5aba9925c0	[X86][SSE] Add _mm_undefined_* intrinsics Added missing SSE/AVX 'undefined' intrinsics (PR24040): _mm_undefined_pd, _mm_undefined_ps + _mm_undefined_si128 _mm256_undefined_pd, _mm256_undefined_ps + _mm256_undefined_si256 _mm512_undefined, _mm512_undefined_ps, _mm512_undefined_pd + _mm512_undefined_epi32 Added builtin intrinsicss: __builtin_ia32_undef128, __builtin_ia32_undef256 + __builtin_ia32_undef512 Differential Revision: http://reviews.llvm.org/D12052 llvm-svn: 246083	2015-08-26 21:17:12 +00:00
Simon Pilgrim	fbb8904411	[X86] Remove unnecessary MMX declarations from Intrin.h As discussed in PR23648 - the intrinsics _m_from_int, _m_to_int and _m_prefetch are defined in mmintrin.h and prfchwintrin.h so we don't need to in Intrin.h Added tests for _m_from_int and _m_to_int D11338 already added a test for _m_prefetch Differential Revision: http://reviews.llvm.org/D12272 llvm-svn: 245975	2015-08-25 21:27:46 +00:00
Michael Kuperstein	b62c5bc64d	Revert r245923 since it breaks mingw. llvm-svn: 245929	2015-08-25 11:42:31 +00:00
Michael Kuperstein	2c8f9c2c23	[X86] Expose the various _rot intrinsics on non-MS platforms _rotl, _rotwl and _lrotl (and their right-shift counterparts) are official x86 intrinsics, and should be supported regardless of environment. This is in contrast to _rotl8, _rotl16, and _rotl64 which are MS-specific. Note that the MS documentation for _lrotl is different from the Intel documentation. Intel explicitly documents it as a 64-bit rotate, while for MS, since sizeof(unsigned long) for MSVC is always 4, a 32-bit rotate is implied. Differential Revision: http://reviews.llvm.org/D12271 llvm-svn: 245923	2015-08-25 07:21:33 +00:00
Ahmed Bougacha	5e354cb547	[Headers][X86] Use __builtin_shufflevector in AVX2 broadcasts. This lets us optimize them better. We agreed to remove the intrinsics, instead of combining them later, as, at -O0, we generate the expected instructions. Plus, it's a nice cleanup. Differential Revision: http://reviews.llvm.org/D10556 llvm-svn: 245605	2015-08-20 20:27:21 +00:00
Michael Kuperstein	d7b9392f59	[X86] Add support for _MM_ALIGN16 Differential Revision: http://reviews.llvm.org/D11753 llvm-svn: 244201	2015-08-06 08:24:38 +00:00
Asaf Badouh	c68e347c25	[X86][AVX512VLBW] add pack, cvt, mulhi and madd intrinsics Differential Revision: http://reviews.llvm.org/D11642 llvm-svn: 243867	2015-08-03 07:51:00 +00:00
Asaf Badouh	73b639f650	[X86][AVX512VLDQ] add reduce/range/cvt intrinsics add 128 & 256 width intrinsic versions of reduce/range and cvt i64 to FP and vice versa Differential Revision: http://reviews.llvm.org/D11598 llvm-svn: 243848	2015-08-02 12:43:08 +00:00
Ulrich Weigand	ca25643a05	[SystemZ] Add support for vecintrin.h vector built-in functions This patch adds support for the System Z vector built-in functions. The API-defined header file has the name vecintrin.h. The user-level functions are defined in the same style as the clang version of altivec.h, making heavy use of the __overloadable__ and __always_inline__ attributes. Where possible the functions expand to generic operations rather than specific built-in functions, in the hope that that form can be optimised better. Where a built-in routine is specified to require an immediate integer argument, the __enable_if__ attribute is used to verify the argument is in fact constant and in the appropriate range. Based on a patch by Richard Sandiford. llvm-svn: 243643	2015-07-30 14:10:43 +00:00
Asaf Badouh	d6cb100bc2	[X86][AVX512BW] Remove whitespaces llvm-svn: 243623	2015-07-30 06:52:26 +00:00
Asaf Badouh	1998eb2077	[X86][AVX512BW] add convert i16 to i8 and unpack intrinsics Differential Revision: http://reviews.llvm.org/D11564 llvm-svn: 243514	2015-07-29 12:34:20 +00:00
Asaf Badouh	a6c31703ac	[X86][AVX512BW] Replace attributes with __DEFAULT_FN_ATTRS llvm-svn: 243512	2015-07-29 12:22:19 +00:00
Asaf Badouh	93aa4c808a	[X86][AVX512VL] add AVX512VL intrinsics 4 out of 4 Differential Revision: http://reviews.llvm.org/D11526 llvm-svn: 243409	2015-07-28 12:04:40 +00:00
Asaf Badouh	b7cf71b63d	[X86][AVX512VL] add AVX512VL intrinsics 3 out of 4 http://reviews.llvm.org/D11526 llvm-svn: 243406	2015-07-28 11:14:09 +00:00
Asaf Badouh	78ee5cc8e1	[X86][AVX512VL] add AVX512VL intrinsics 2 out of 4 http://reviews.llvm.org/D11526 llvm-svn: 243402	2015-07-28 10:30:56 +00:00
Asaf Badouh	74da38706e	[X86][AVX512VL] add AVX512VL intrinsics 1 out of 4 http://reviews.llvm.org/D11526 llvm-svn: 243394	2015-07-28 08:26:14 +00:00
Simon Pilgrim	f81966d04b	[X86] Add missing _m_prefetch intrinsic The 3DNOW/PRFCHW cpu targets define both the PREFETCHW (set cache line modified) and PREFETCH (set cache line exclusive) instructions but only the _m_prefetchw (PREFETCHW) intrinsic is included in the header. This patch adds the missing _m_prefetch intrinsic. I'm basing this off AMD documentation - the intel docs on the support for PREFETCHW isn't clear whether Silvermont/Broadwell properly support PREFETCH but given that the intrinsic implementation is a default __builtin_prefetch call, it is safe whatever. Fix for PR23648 Differential Revision: http://reviews.llvm.org/D11338 llvm-svn: 243305	2015-07-27 19:01:52 +00:00
Asaf Badouh	f6a58b6dff	[X86][AVX512F] Add FP scalar intrinsics intrinsics for: add/sub/mul/div/min/max in their FP scalar versions Differential Revision: http://reviews.llvm.org/D11418 llvm-svn: 243009	2015-07-23 12:13:32 +00:00
Asaf Badouh	7d99966e91	[X86][AVX512BW] add madd and maddubs intrinsics Differential Revision: http://reviews.llvm.org/D11420 llvm-svn: 242986	2015-07-23 07:07:25 +00:00
Asaf Badouh	ffeb624483	[X86][AVX512F] add FP arithmetic intrinsics add/div/mul/sub include rounding versions Differential Revision: http://reviews.llvm.org/D11354 llvm-svn: 242790	2015-07-21 15:27:28 +00:00
Asaf Badouh	d4419ca657	[X86][AVX512BW] add clang intrinsics for pmulhrsw / pmulhuw / pmulhw also made minor fix in "test_mm512_maskz_permutex2var_epi16" Differential Revision: http://reviews.llvm.org/D11336 llvm-svn: 242635	2015-07-19 08:47:31 +00:00
David Majnemer	6b8e297089	[Intrin.h] Use compiler builtins to model memory barriers _ReadBarrier, _WriteBarrier, and _ReadWriteBarrier are essentially memory barriers of one form or another. Model these as atomic_signal_fence(ATOMIC_SEQ_CST). __faststorefence is a curious intrinsic. It's single purpose seems to an alternative to mfence when that instruction is slow. However, mfence is not always slow and is, in general, preferable to a 'lock or' sequence on certain CPUs. Give the compiler freedom to select the best sequence to get a fence. llvm-svn: 242378	2015-07-16 03:13:02 +00:00
Bill Schmidt	8da737a18a	[PPC64LE] Fix vec_sld semantics for little endian The vec_sld interface provides access to the vsldoi instruction. Unlike most of the vec_* interfaces, we do not attempt to change the generated code for vec_sld based on the endian mode. It is too difficult to correctly infer the desired semantics because of different element types, and the corrected instruction sequence is expensive, involving loading a permute control vector and performing a generalized permute. For GCC, this was implemented as "Don't touch the vec_sld" implementation. When it came time for the LLVM implementation, I did the same thing. However, this was hasty and incorrect. In LLVM's version of altivec.h, vec_sld was previously defined in terms of the vec_perm interface. Because vec_perm semantics are adjusted for little endian, this means that leaving vec_sld untouched causes it to generate something different for LE than for BE. Not good. This patch adjusts the form of vec_perm that is used for vec_sld and vec_vsldoi, effectively undoing the modifications so that the same vsldoi instruction will be generated for both BE and LE. There is an accompanying back-end patch to take care of some small ripple effects caused by these changes. llvm-svn: 242297	2015-07-15 15:45:53 +00:00
Nemanja Ivanovic	6c363ed67a	Add missing builtins to altivec.h for ABI compliance (vol. 4) This patch corresponds to review: http://reviews.llvm.org/D11184 A number of new interfaces for altivec.h (as mandated by the ABI): vector float vec_cpsgn(vector float, vector float) vector double vec_cpsgn(vector double, vector double) vector double vec_or(vector bool long long, vector double) vector double vec_or(vector double, vector bool long long) vector double vec_re(vector double) vector signed char vec_cntlz(vector signed char) vector unsigned char vec_cntlz(vector unsigned char) vector short vec_cntlz(vector short) vector unsigned short vec_cntlz(vector unsigned short) vector int vec_cntlz(vector int) vector unsigned int vec_cntlz(vector unsigned int) vector signed long long vec_cntlz(vector signed long long) vector unsigned long long vec_cntlz(vector unsigned long long) vector signed char vec_nand(vector bool signed char, vector signed char) vector signed char vec_nand(vector signed char, vector bool signed char) vector signed char vec_nand(vector signed char, vector signed char) vector unsigned char vec_nand(vector bool unsigned char, vector unsigned char) vector unsigned char vec_nand(vector unsigned char, vector bool unsigned char) vector unsigned char vec_nand(vector unsigned char, vector unsigned char) vector short vec_nand(vector bool short, vector short) vector short vec_nand(vector short, vector bool short) vector short vec_nand(vector short, vector short) vector unsigned short vec_nand(vector bool unsigned short, vector unsigned short) vector unsigned short vec_nand(vector unsigned short, vector bool unsigned short) vector unsigned short vec_nand(vector unsigned short, vector unsigned short) vector int vec_nand(vector bool int, vector int) vector int vec_nand(vector int, vector bool int) vector int vec_nand(vector int, vector int) vector unsigned int vec_nand(vector bool unsigned int, vector unsigned int) vector unsigned int vec_nand(vector unsigned int, vector bool unsigned int) vector unsigned int vec_nand(vector unsigned int, vector unsigned int) vector signed long long vec_nand(vector bool long long, vector signed long long) vector signed long long vec_nand(vector signed long long, vector bool long long) vector signed long long vec_nand(vector signed long long, vector signed long long) vector unsigned long long vec_nand(vector bool long long, vector unsigned long long) vector unsigned long long vec_nand(vector unsigned long long, vector bool long long) vector unsigned long long vec_nand(vector unsigned long long, vector unsigned long long) vector signed char vec_orc(vector bool signed char, vector signed char) vector signed char vec_orc(vector signed char, vector bool signed char) vector signed char vec_orc(vector signed char, vector signed char) vector unsigned char vec_orc(vector bool unsigned char, vector unsigned char) vector unsigned char vec_orc(vector unsigned char, vector bool unsigned char) vector unsigned char vec_orc(vector unsigned char, vector unsigned char) vector short vec_orc(vector bool short, vector short) vector short vec_orc(vector short, vector bool short) vector short vec_orc(vector short, vector short) vector unsigned short vec_orc(vector bool unsigned short, vector unsigned short) vector unsigned short vec_orc(vector unsigned short, vector bool unsigned short) vector unsigned short vec_orc(vector unsigned short, vector unsigned short) vector int vec_orc(vector bool int, vector int) vector int vec_orc(vector int, vector bool int) vector int vec_orc(vector int, vector int) vector unsigned int vec_orc(vector bool unsigned int, vector unsigned int) vector unsigned int vec_orc(vector unsigned int, vector bool unsigned int) vector unsigned int vec_orc(vector unsigned int, vector unsigned int) vector signed long long vec_orc(vector bool long long, vector signed long long) vector signed long long vec_orc(vector signed long long, vector bool long long) vector signed long long vec_orc(vector signed long long, vector signed long long) vector unsigned long long vec_orc(vector bool long long, vector unsigned long long) vector unsigned long long vec_orc(vector unsigned long long, vector bool long long) vector unsigned long long vec_orc(vector unsigned long long, vector unsigned long long) vector signed char vec_div(vector signed char, vector signed char) vector unsigned char vec_div(vector unsigned char, vector unsigned char) vector signed short vec_div(vector signed short, vector signed short) vector unsigned short vec_div(vector unsigned short, vector unsigned short) vector signed int vec_div(vector signed int, vector signed int) vector unsigned int vec_div(vector unsigned int, vector unsigned int) vector signed long long vec_div(vector signed long long, vector signed long long) vector unsigned long long vec_div(vector unsigned long long, vector unsigned long long) vector unsigned char vec_mul(vector unsigned char, vector unsigned char) vector unsigned int vec_mul(vector unsigned int, vector unsigned int) vector unsigned long long vec_mul(vector unsigned long long, vector unsigned long long) vector unsigned short vec_mul(vector unsigned short, vector unsigned short) vector signed char vec_mul(vector signed char, vector signed char) vector signed int vec_mul(vector signed int, vector signed int) vector signed long long vec_mul(vector signed long long, vector signed long long) vector signed short vec_mul(vector signed short, vector signed short) vector signed long long vec_mergeh(vector signed long long, vector signed long long) vector signed long long vec_mergeh(vector signed long long, vector bool long long) vector signed long long vec_mergeh(vector bool long long, vector signed long long) vector unsigned long long vec_mergeh(vector unsigned long long, vector unsigned long long) vector unsigned long long vec_mergeh(vector unsigned long long, vector bool long long) vector unsigned long long vec_mergeh(vector bool long long, vector unsigned long long) vector double vec_mergeh(vector double, vector double) vector double vec_mergeh(vector double, vector bool long long) vector double vec_mergeh(vector bool long long, vector double) vector signed long long vec_mergel(vector signed long long, vector signed long long) vector signed long long vec_mergel(vector signed long long, vector bool long long) vector signed long long vec_mergel(vector bool long long, vector signed long long) vector unsigned long long vec_mergel(vector unsigned long long, vector unsigned long long) vector unsigned long long vec_mergel(vector unsigned long long, vector bool long long) vector unsigned long long vec_mergel(vector bool long long, vector unsigned long long) vector double vec_mergel(vector double, vector double) vector double vec_mergel(vector double, vector bool long long) vector double vec_mergel(vector bool long long, vector double) vector signed int vec_pack(vector signed long long, vector signed long long) vector unsigned int vec_pack(vector unsigned long long, vector unsigned long long) vector bool int vec_pack(vector bool long long, vector bool long long) llvm-svn: 242171	2015-07-14 17:50:27 +00:00
Asaf Badouh	1626545667	[x86] add 2 bit to ObjCOrBuiltinID and new intrinsics add 2 bit to ObjCOrBuiltinID (changed from 11bits to 13bits), see discussion in Add new intrinsics support that already covered by the BE. All the intrinsics are covered by tests Differential Revision: http://reviews.llvm.org/D10893 llvm-svn: 242144	2015-07-14 14:02:45 +00:00
David Majnemer	e0b863f4c7	[Intrin.h] Use __ATOMIC_SEQ_CST instead of '5' No functionality change is intended. llvm-svn: 242087	2015-07-13 23:39:37 +00:00
David Majnemer	56e466745d	[Intrin.h] Make the variable names more consistent No functionality change intended. llvm-svn: 242086	2015-07-13 23:38:56 +00:00
David Majnemer	8dadce78ed	Intrin.h: Don't invade the program's namespace The program is permitted to have stuff like '#define x' in it so avoid using identifiers not reserved for the implementation. llvm-svn: 242010	2015-07-13 02:53:23 +00:00
David Majnemer	3c8ea5f3f8	Intrin.h: Clean up our atomic intrinsics Three things: - The atomic intrinsics mandate memory barriers, let's start emitting some. - We don't need to manually create RMW operations, we can just do __atomic_fetch_foo instead of performing __atomic_foo_fetch and undoing foo. - Don't use inline assembly, we don't need it for these intrinsics. This fixes PR24101. llvm-svn: 242009	2015-07-13 02:53:19 +00:00
Nemanja Ivanovic	26c3534b84	Add missing builtins to altivec.h for ABI compliance (vol. 3) This patch corresponds to review: http://reviews.llvm.org/D10972 Fix for the handling of dependent features that are enabled by default on some CPU's (such as -mvsx, -mpower8-vector). Also provides a number of new interfaces or fixes existing ones in altivec.h. Changed signatures to conform to ABI: vector short vec_perm(vector signed short, vector signed short, vector unsigned char) vector int vec_perm(vector signed int, vector signed int, vector unsigned char) vector long long vec_perm(vector signed long long, vector signed long long, vector unsigned char) vector signed char vec_sld(vector signed char, vector signed char, const int) vector unsigned char vec_sld(vector unsigned char, vector unsigned char, const int) vector bool char vec_sld(vector bool char, vector bool char, const int) vector unsigned short vec_sld(vector unsigned short, vector unsigned short, const int) vector signed short vec_sld(vector signed short, vector signed short, const int) vector signed int vec_sld(vector signed int, vector signed int, const int) vector unsigned int vec_sld(vector unsigned int, vector unsigned int, const int) vector float vec_sld(vector float, vector float, const int) vector signed char vec_splat(vector signed char, const int) vector unsigned char vec_splat(vector unsigned char, const int) vector bool char vec_splat(vector bool char, const int) vector signed short vec_splat(vector signed short, const int) vector unsigned short vec_splat(vector unsigned short, const int) vector bool short vec_splat(vector bool short, const int) vector pixel vec_splat(vector pixel, const int) vector signed int vec_splat(vector signed int, const int) vector unsigned int vec_splat(vector unsigned int, const int) vector bool int vec_splat(vector bool int, const int) vector float vec_splat(vector float, const int) Added a VSX path to: vector float vec_round(vector float) Added interfaces: vector signed char vec_eqv(vector signed char, vector signed char) vector signed char vec_eqv(vector bool char, vector signed char) vector signed char vec_eqv(vector signed char, vector bool char) vector unsigned char vec_eqv(vector unsigned char, vector unsigned char) vector unsigned char vec_eqv(vector bool char, vector unsigned char) vector unsigned char vec_eqv(vector unsigned char, vector bool char) vector signed short vec_eqv(vector signed short, vector signed short) vector signed short vec_eqv(vector bool short, vector signed short) vector signed short vec_eqv(vector signed short, vector bool short) vector unsigned short vec_eqv(vector unsigned short, vector unsigned short) vector unsigned short vec_eqv(vector bool short, vector unsigned short) vector unsigned short vec_eqv(vector unsigned short, vector bool short) vector signed int vec_eqv(vector signed int, vector signed int) vector signed int vec_eqv(vector bool int, vector signed int) vector signed int vec_eqv(vector signed int, vector bool int) vector unsigned int vec_eqv(vector unsigned int, vector unsigned int) vector unsigned int vec_eqv(vector bool int, vector unsigned int) vector unsigned int vec_eqv(vector unsigned int, vector bool int) vector signed long long vec_eqv(vector signed long long, vector signed long long) vector signed long long vec_eqv(vector bool long long, vector signed long long) vector signed long long vec_eqv(vector signed long long, vector bool long long) vector unsigned long long vec_eqv(vector unsigned long long, vector unsigned long long) vector unsigned long long vec_eqv(vector bool long long, vector unsigned long long) vector unsigned long long vec_eqv(vector unsigned long long, vector bool long long) vector float vec_eqv(vector float, vector float) vector float vec_eqv(vector bool int, vector float) vector float vec_eqv(vector float, vector bool int) vector double vec_eqv(vector double, vector double) vector double vec_eqv(vector bool long long, vector double) vector double vec_eqv(vector double, vector bool long long) vector bool long long vec_perm(vector bool long long, vector bool long long, vector unsigned char) vector double vec_round(vector double) vector double vec_splat(vector double, const int) vector bool long long vec_splat(vector bool long long, const int) vector signed long long vec_splat(vector signed long long, const int) vector unsigned long long vec_splat(vector unsigned long long, vector bool int vec_sld(vector bool int, vector bool int, const int) vector bool short vec_sld(vector bool short, vector bool short, const int) llvm-svn: 241904	2015-07-10 13:11:34 +00:00
Nemanja Ivanovic	e00fa61412	Add the missing return statements from revision 241399. llvm-svn: 241405	2015-07-05 10:54:10 +00:00
Nemanja Ivanovic	1c7ad715ec	Add missing builtins to altivec.h for ABI compliance (vol. 2) This patch corresponds to review: http://reviews.llvm.org/D10875 The bulk of the second round of additions to altivec.h. The following interfaces were added: vector double vec_floor(vector double) vector double vec_madd(vector double, vector double, vector double) vector float vec_msub(vector float, vector float, vector float) vector double vec_msub(vector double, vector double, vector double) vector float vec_mul(vector float, vector float) vector double vec_mul(vector double, vector double) vector float vec_nmadd(vector float, vector float, vector float) vector double vec_nmadd(vector double, vector double, vector double) vector double vec_nmsub(vector double, vector double, vector double) vector double vec_nor(vector double, vector double) vector double vec_or(vector double, vector double) vector float vec_rint(vector float) vector double vec_rint(vector double) vector float vec_nearbyint(vector float) vector double vec_nearbyint(vector double) vector float vec_sqrt(vector float) vector double vec_sqrt(vector double) vector double vec_rsqrte(vector double) vector double vec_sel(vector double, vector double, vector unsigned long long) vector double vec_sel(vector double, vector double, vector unsigned long long) vector double vec_sub(vector double, vector double) vector double vec_trunc(vector double) vector double vec_xor(vector double, vector double) vector double vec_xor(vector double, vector bool long long) vector double vec_xor(vector bool long long, vector double) New VSX paths for the following interfaces: vector float vec_madd(vector float, vector float, vector float) vector float vec_nmsub(vector float, vector float, vector float) vector float vec_rsqrte(vector float) vector float vec_trunc(vector float) vector float vec_floor(vector float) llvm-svn: 241399	2015-07-05 06:40:52 +00:00
Kit Barton	b61173e791	This patch adds support for the vector merge even word and vector merge odd word instructions introduced in POWER8. These are the Clang-related changes for http://reviews.llvm.org/D10704 All builtins are added in altivec.h and guarded with the POWER8_VECTOR macro. Phabricator review: http://reviews.llvm.org/D10736 llvm-svn: 241293	2015-07-02 19:29:05 +00:00
Michael Kuperstein	e45af54cdb	[X86] Rename DEFAULT_FN_ATTR macro to __DEFAULT_FN_ATTR llvm-svn: 241065	2015-06-30 13:36:19 +00:00
Michael Kuperstein	9101a98bd0	[X86] Add missing undef of DEFAULT_FN_ATTRS in FXSR intrinsics llvm-svn: 241055	2015-06-30 10:18:54 +00:00
Michael Kuperstein	a3c7b74208	[X86] Add FXSR intrinsics Add intrinsics for the FXSR instructions (FXSAVE/FXSAVE64/FXRSTOR/FXRSTOR64) These were previously declared in Intrin.h for MSVC compatibility, but now that we have them implemented, these declarations can be removed. llvm-svn: 241053	2015-06-30 09:45:38 +00:00
Asaf Badouh	a45b7cab7b	[x86][AVX512CD] Add conflict and lzcnt intrinsics in their 512bit versions include tests review http://reviews.llvm.org/D10795 llvm-svn: 240941	2015-06-29 12:51:53 +00:00
Asaf Badouh	4002ce4834	[X86][AVX512BW] Add more intrinsics support: Blend, abs, packs, adds, subs, avg, max, min, permute. all the intrinsics are covered by tests review: http://reviews.llvm.org/D10799 llvm-svn: 240937	2015-06-29 12:16:40 +00:00
Elena Demikhovsky	c563c2c61a	AVX-512: Implemented AVX-512 FMA intrinsics and tests. by Igor Breger http://reviews.llvm.org/D10797 llvm-svn: 240928	2015-06-29 09:20:57 +00:00
Nemanja Ivanovic	2f1f926e34	Add missing builtins to altivec.h for ABI compliance (vol. 1) This patch corresponds to review: http://reviews.llvm.org/D10637 This is the first round of additions of missing builtins listed in the ABI document. More to come (this builds onto what seurer already addes). This patch adds: vector signed long long vec_abs(vector signed long long) vector double vec_abs(vector double) vector signed long long vec_add(vector signed long long, vector signed long long) vector unsigned long long vec_add(vector unsigned long long, vector unsigned long long) vector double vec_add(vector double, vector double) vector double vec_and(vector bool long long, vector double) vector double vec_and(vector double, vector bool long long) vector double vec_and(vector double, vector double) vector signed long long vec_and(vector signed long long, vector signed long long) vector double vec_andc(vector bool long long, vector double) vector double vec_andc(vector double, vector bool long long) vector double vec_andc(vector double, vector double) vector signed long long vec_andc(vector signed long long, vector signed long long) vector double vec_ceil(vector double) vector bool long long vec_cmpeq(vector double, vector double) vector bool long long vec_cmpge(vector double, vector double) vector bool long long vec_cmpge(vector signed long long, vector signed long long) vector bool long long vec_cmpge(vector unsigned long long, vector unsigned long long) vector bool long long vec_cmpgt(vector double, vector double) vector bool long long vec_cmple(vector double, vector double) vector bool long long vec_cmple(vector signed long long, vector signed long long) vector bool long long vec_cmple(vector unsigned long long, vector unsigned long long) vector bool long long vec_cmplt(vector double, vector double) vector bool long long vec_cmplt(vector signed long long, vector signed long long) vector bool long long vec_cmplt(vector unsigned long long, vector unsigned long long) llvm-svn: 240821	2015-06-26 19:27:20 +00:00
Nico Weber	ac64b97771	Add new file from r240741 to CMakeLists.txt. llvm-svn: 240743	2015-06-26 00:19:32 +00:00
Nico Weber	2ca46867e1	Add an inttypes.h wrapper that fixes up some macros in Microsoft mode. Before MSVS2015, MSVS's headers disagree about int32_t and PRIx32 and so on. Provide a wrapper header to fix this, so that -Wformat can still be used. Fixes PR23412. llvm-svn: 240741	2015-06-26 00:13:18 +00:00
Sean Silva	d0de76a3da	Remove `requires` for x86 CPU features. Ever since the target attributes change, we don't need to guard these headers with `requires`. Actually it's a bit worse, because if we do then they are included textually under the covers, causing declarations to appear in submodules they aren't supposed to be in. llvm-svn: 240720	2015-06-25 23:22:11 +00:00
Eric Christopher	3d920eed5d	Move xtest to its own file to match the gcc header organization. llvm-svn: 239926	2015-06-17 18:42:07 +00:00
Eric Christopher	29b78091e7	Update comments on HLE, RTM, and ADX support for intrinsics. llvm-svn: 239925	2015-06-17 18:42:03 +00:00
Eric Christopher	9fc7fb274e	Update the intel intrinsic headers to use the target attribute support. This involved removing the conditional inclusion and replacing them with target attributes matching the original conditional inclusion and checks. The testcase update removes the macro checks for each file and replaces them with usage of the __target__ attribute, e.g.: int __attribute__((__target__(("sse3")))) foo(int a) { _mm_mwait(0, 0); return 4; } This usage does require the enclosing function have the requisite __target__ attribute for inlining and code generation - also for any macro intrinsic uses in the enclosing function. There's no change for existing uses of the intrinsic headers. llvm-svn: 239883	2015-06-17 07:09:32 +00:00
Eric Christopher	4d185168e9	Use a define for per-file function attributes for the Intel intrinsic headers. This is a precursor to changing them to use the new target attribute code. llvm-svn: 239882	2015-06-17 07:09:20 +00:00
Eric Christopher	5a9bec104b	Use a macro for the omnipresent attributes on header functions in Intrin.h. Saves some typing and if someone wants to change them it makes it much easier. llvm-svn: 239782	2015-06-15 23:20:35 +00:00
Luke Cheeseman	59b2d83909	This patch implements clang support for the ACLE special register intrinsics in section 10.1, __arm_{w,r}sr{,p,64}. This includes arm_acle.h definitions with builtins and codegen to support these, the intrinsics are implemented by generating read/write_register calls which get appropriately lowered in the backend based on the register string provided. SemaChecking is also implemented to fault invalid parameters. Differential Revision: http://reviews.llvm.org/D9697 llvm-svn: 239737	2015-06-15 17:51:01 +00:00
Nemanja Ivanovic	b17f1129fa	Clang support for vector quad bit permute and gather instructions through builtins This patch corresponds to review: http://reviews.llvm.org/D10095 This is for just two instructions and related builtins: vbpermq vgbbd llvm-svn: 239506	2015-06-11 06:25:36 +00:00
Bill Seurer	703e8486ec	[PowerPC] Reformat altivec.h with clang-format This revision just fixes the formatting of altivec.h. llvm-svn: 239408	2015-06-09 14:39:47 +00:00
David Majnemer	81ecbf45d4	Revert accidental commit This change was unrelated to r239170. llvm-svn: 239176	2015-06-05 18:24:55 +00:00
David Majnemer	cdffc36c11	[AST] There is no message for C++1z-style static_assert We would crash in the DeclPrinter trying to pretty-print the static_assert message. C++1z-style assertions don't have a message so we would crash. This fixes PR23756. llvm-svn: 239170	2015-06-05 18:03:58 +00:00
Bill Seurer	8be14f11ce	[PowerPC] This revision adds 68 of the missing "Predefined Functions for Vector Programming" from appendix A of the OpenPOWER ABI for Linux Supplement document. I also added tests for the new functions and updated another test that was looking for specific line numbers in error messages from altivec.h. https://llvm.org/bugs/show_bug.cgi?id=23679 http://reviews.llvm.org/D10131 llvm-svn: 239066	2015-06-04 18:45:44 +00:00
Ekaterina Romanova	2e81434552	Added doxygen comments for the intrinsics. llvm-svn: 238386	2015-05-28 01:25:25 +00:00
John Thompson	b7892ffc69	It appears these exports are needed, as wmmintrin.h includes them. llvm-svn: 238345	2015-05-27 18:26:41 +00:00
Kit Barton	5944ee2179	This patch adds support for the vector quadword add/sub instructions introduced in POWER8. These are the Clang-related changes for http://reviews.llvm.org/D9081 vadduqm vaddeuqm vaddcuq vaddecuq vsubuqm vsubeuqm vsubcuq vsubecuq All builtins are added in altivec.h, and guarded with the POWER8_VECTOR and powerpc64 macros. http://reviews.llvm.org/D9903 llvm-svn: 238145	2015-05-25 15:52:45 +00:00
Michael Kuperstein	7619004211	[X86] Add _mm256_set_m128 and its 5 variants. Differential Revision: http://reviews.llvm.org/D9855 llvm-svn: 237778	2015-05-20 07:46:52 +00:00
Michael Kuperstein	877f3cbe84	[X86] Add _mm_broadcastsd_pd intrinsic _mm_broadcastsd_pd is basically an alias for _mm_movedup_pd, however the alias is only available from AVX2 forward. llvm-svn: 237698	2015-05-19 14:49:14 +00:00
Michael Kuperstein	6168183e04	[X86] Added _mm256_bslli_epi128 and _mm256_bsrli_epi128. These two intrinsics are alternative names for _mm256_slli_si256 and _mm256_srli_si256, respectively. llvm-svn: 237693	2015-05-19 13:05:46 +00:00
Bill Schmidt	41e14c4dfa	[PPC64] Add vector pack/unpack support from ISA 2.07 This patch adds support for the following new instructions in the Power ISA 2.07: vpksdss vpksdus vpkudus vpkudum vupkhsw vupklsw These instructions are available through the vec_packs, vec_packsu, vec_unpackh, and vec_unpackl built-in interfaces. These are lane-sensitive instructions, so the built-ins have different implementations for big- and little-endian, and the instructions must be marked as killing the vector swap optimization for now. The first three instructions perform saturating pack operations. The fourth performs a modulo pack operation, which means it can be represented with a vector shuffle, and conversely the appropriate vector shuffles may cause this instruction to be generated. The other instructions are only generated via built-in support for now. I noticed during patch preparation that the macro __VSX__ was not previously predefined when the power8-vector or direct-move features are requested. This is an error, and I've corrected that here as well. Appropriate tests have been added. There is a companion patch to llvm for the rest of this support. llvm-svn: 237500	2015-05-16 01:02:25 +00:00
Richard Smith	23d8d0338e	[modules] Fix a #include cycle when building a module for our builtin headers. xmmintrin.h includes emmintrin.h and vice versa if SSE2 is enabled. We break this cycle for a modules build, and instead make the xmmintrin.h module re-export the immintrin.h module. Also included is a fix for an assert in the serialization code if a module exports another module that was declared later in the same module map. llvm-svn: 237321	2015-05-14 00:45:20 +00:00
Elena Demikhovsky	bd5c8b9be9	AVX-512: FP compare intrinsics - changed type of CC parameter from i8 to i32 according to the spec. Added FP compare intrinsics for SKX. llvm-svn: 236715	2015-05-07 11:26:36 +00:00
Elena Demikhovsky	e7d4c2e229	AVX-512: Added AVX-512 intrinsics and tests by Asaf Badouh (asaf.badouh@intel.com) llvm-svn: 236218	2015-04-30 09:24:29 +00:00
Elena Demikhovsky	35dc8c0944	AVX-512: added intrinsics for KNL and SKX by Asaf Badouh (asaf.badouh@intel.com) llvm-svn: 235986	2015-04-28 13:28:01 +00:00
Artem Belevich	4e192df778	[cuda] Added support for CUDA built-in variables. Added cuda_builtin_vars.h which implements built-in CUDA variables using __declattr(property). Fields of built-in variables (except for warpSize) are implemented using __declattr(property) which replaces read/write of a member field with a call to a getter/setter member function, in this case with appropriate NVPTX builtin. Added a test case to check diagnostics on attempt to construct or improperly access a built-in variable. Differential Revision: http://reviews.llvm.org/D9064 llvm-svn: 235448	2015-04-21 22:14:13 +00:00
Artem Belevich	a050112bba	Revert r235398 "[cuda] Added support for CUDA built-in variables." r235398 was causing buildbot break due to missing Makefile changes. llvm-svn: 235401	2015-04-21 18:36:42 +00:00
Artem Belevich	d0a2ae054f	[cuda] Added support for CUDA built-in variables. Added cuda_builtin_vars.h which implements built-in CUDA variables using __declattr(property). Fields of built-in variables (except for warpSize) are implemented using __declattr(property) which replaces read/write of a member field with a call to a getter/setter member function, in this case with appropriate NVPTX builtin. Added a test case to check diagnostics on attempt to construct or improperly access a built-in variable. Differential Revision: http://reviews.llvm.org/D9064 llvm-svn: 235398	2015-04-21 17:39:06 +00:00
Ekaterina Romanova	b929ad7b17	_mm256_blend_epi16 is being cast to __m256d instead of __m256i. Fixing this. llvm-svn: 234560	2015-04-10 02:39:45 +00:00
Ulrich Weigand	cc67344a86	[SystemZ] Add header files to Makefile / module.modulemap This should fix build-bot failures after r233804. The patch also adds a "systemz" feature, and renames the "transactional-execution" feature to "htm", since it turns out "-" is not a legal character in module feature names. llvm-svn: 233807	2015-04-01 14:15:35 +00:00
Ulrich Weigand	3a610ebf1e	[SystemZ] Support transactional execution on zEC12 The zEC12 provides the transactional-execution facility. This is exposed to users via a set of builtin routines on other compilers. This patch adds clang support to enable those builtins. In partciular, the patch: - enables the transactional-execution feature by default on zEC12 - allows to override presence of that feature via the -mhtm/-mno-htm options - adds a predefined macro __HTM__ if the feature is enabled - adds support for the transactional-execution GCC builtins - adds Sema checking to verify the __builtin_tabort abort code - adds the s390intrin.h header file (for GCC compatibility) - adds s390 sections to the htmintrin.h and htmxlintrin.h header files Since this is first use of target-specific intrinsics on the platform, the patch creates the include/clang/Basic/BuiltinsSystemZ.def file and hooks it up in TargetBuiltins.h and lib/Basic/Targets.cpp. An associated LLVM patch adds the required LLVM IR intrinsics. For reference, the transactional-execution instructions are documented in the z/Architecture Principles of Operation for the zEC12: http://publibfp.boulder.ibm.com/cgi-bin/bookmgr/download/DZ9ZR009.pdf The associated builtins are documented in the GCC manual: http://gcc.gnu.org/onlinedocs/gcc/S_002f390-System-z-Built-in-Functions.html The htmxlintrin.h intrinsics provided for compatibility with the IBM XL compiler are documented in the "z/OS XL C/C++ Programming Guide". llvm-svn: 233804	2015-04-01 12:54:25 +00:00
Elena Demikhovsky	29da2fba46	AVX-512: added clang intrinsics for logical and, or xor for 512 bits by Asaf Badouh (asaf.badouh@intel.com) llvm-svn: 233794	2015-04-01 06:54:16 +00:00
Kit Barton	8246f28237	Add Hardware Transactional Memory (HTM) Support This patch adds Hardware Transaction Memory (HTM) support supported by ISA 2.07 (POWER8). The intrinsic support is based on GCC one [1], with both 'PowerPC HTM Low Level Built-in Functions' and 'PowerPC HTM High Level Inline Functions' implemented. Along with builtins a new driver switch is added to enable/disable HTM instruction support (-mhtm) and a header with common definitions (mostly to parse the TFHAR register value). The HTM switch also sets a preprocessor builtin HTM. The HTM usage requires a recently newer kernel with PPC HTM enabled. Tested on powerpc64 and powerpc64le. This is send along a llvm patch to enabled the builtins and option switch. [1] https://gcc.gnu.org/onlinedocs/gcc/PowerPC-Hardware-Transactional-Memory-Built-in-Functions.html Phabricator Review: http://reviews.llvm.org/D8248 llvm-svn: 233205	2015-03-25 19:41:41 +00:00
Sanjay Patel	0a6da5de55	[X86, AVX2] Replace inserti128 and extracti128 intrinsics with generic shuffles This is nearly identical to the v*f128_si256 parts of r231792 and r232052. AVX2 introduced proper integer variants of the hacked integer insert/extract C intrinsics that were created for this same functionality with AVX1. This should complete the front end fixes for insert/extract128 intrinsics. Corresponding LLVM patch to follow. llvm-svn: 232109	2015-03-12 21:54:24 +00:00
Sanjay Patel	f204b00940	Replace second (hopefully unused) access of macro input argument with zero vector to be safer. Suggested by Craig Topper in D8275. This is a follow-on to r232052. llvm-svn: 232061	2015-03-12 17:23:46 +00:00
Sanjay Patel	0c351aba25	[X86, AVX] replace vextractf128 intrinsics with generic shuffles This is very much like D8088 (checked in at r231792). Now that we've replaced the vinsertf128 intrinsics, do the same for their extract twins. Differential Revision: http://reviews.llvm.org/D8275 llvm-svn: 232052	2015-03-12 15:50:36 +00:00
Kit Barton	8553bec911	Add builtins for the 64-bit vector integer arithmetic instructions added in POWER8. These are the Clang-related changes for the instructions added to LLVM in http://reviews.llvm.org/D7959. Phabricator review: http://reviews.llvm.org/D8041 llvm-svn: 231931	2015-03-11 15:57:19 +00:00
Sanjay Patel	7f6aa52e93	[X86, AVX] Replace vinsertf128 intrinsics with generic shuffles. We want to replace as much custom x86 shuffling via intrinsics as possible because pushing the code down the generic shuffle optimization path allows for better codegen and less complexity in LLVM. This is the sibling patch for the LLVM half of this change: http://reviews.llvm.org/D8086 Differential Revision: http://reviews.llvm.org/D8088 llvm-svn: 231792	2015-03-10 15:19:26 +00:00
Nemanja Ivanovic	55e757db4a	Add Clang support for PPC cryptography builtins Review: http://reviews.llvm.org/D7951 llvm-svn: 231291	2015-03-04 21:48:22 +00:00
Juergen Ributzka	9baa03fc07	Lower _mm256_broadcastsi128_si256 directly to a vector shuffle. Originally we were using the same GCC builtins to lower this AVX2 vector intrinsic. Instead we will now lower it directly to a vector shuffle. This will not only allow LLVM to generate better code, but it will also allow us to remove the GCC intrinsics. Reviewed by Andrea This is related to rdar://problem/18742778. llvm-svn: 231081	2015-03-03 17:22:53 +00:00
Dmitri Gribenko	a586ea13a4	Restore the libc++ definition of max_align_t on Apple platforms Clang has introduced ::max_align_t in stddef.h in r201729, but libc++ was already defining std::max_align_t on Darwin because there was none in the global namespace. After that Clang commit though, libc++ started defining std::max_align_t to be a typedef for ::max_align_t, which has a different definition. This changed the ABI. This commit restores the previous definition. rdar://19919394 rdar://18557982 llvm-svn: 230292	2015-02-24 01:06:22 +00:00
Filipe Cabecinhas	d74002965e	Make the _mm256_insert_epi64 definition more consistent Use long long for the epi64 argument, like the other intrinsics. NFC since this is only defined in 64-bit mode, not in 32-bit. Fix suggested by H. J. Lu! llvm-svn: 229886	2015-02-19 19:00:33 +00:00
Filipe Cabecinhas	54a2ba8b76	[Headers] Add tests for _mm256_insert_epi64 and fix its definition Summary: The definition for _mm256_insert_epi64 was taking an int, which would get truncated before being inserted in the vector. Original patch by Joshua Magee! Reviewers: bruno, craig.topper Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D7179 llvm-svn: 229811	2015-02-19 03:02:33 +00:00
Craig Topper	a462482d98	[X86] Add _mm_bslli_si128 and _mm_bsrli_si128 as aliases of _mm_slli_si128 and _mm_srli_si128. This matches Intel documentation and gcc. llvm-svn: 229066	2015-02-13 06:04:45 +00:00
Craig Topper	51e47418d4	[X86] Simplify some code and remove some -Wshadow disables from intrinsic header. llvm-svn: 229065	2015-02-13 06:04:43 +00:00
Filipe Cabecinhas	2177fc1732	Make the byte-shift SSE intrinsics emit vector shuffles which we know the backend can handle. Also removed unused builtins. Original patch by Andrea Di Biagio! Reviewers: craig.topper, nadav Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D7199 llvm-svn: 228481	2015-02-07 01:37:09 +00:00
David Majnemer	1cf22e690d	Headers: Don't use attribute keywords which aren't reserved Instead of using 'unavailable', use '__unavailable__' llvm-svn: 228087	2015-02-04 00:26:10 +00:00
Craig Topper	53565c60e7	[X86] Add other flavors of AVX512 cmpps/cmppd intrinsics. llvm-svn: 227773	2015-02-01 22:27:40 +00:00
Craig Topper	2a898bfc67	[X86] Add the AVX512 exp2a23 intrinsics. llvm-svn: 227769	2015-02-01 21:34:11 +00:00
Craig Topper	da97c20128	[X86] Add all intrinsics for scalar rsqrt28/rcp28 to avx512erintrin.h. Add parentheses around all macro arguments. llvm-svn: 227722	2015-02-01 10:15:11 +00:00
Craig Topper	c4b852a909	[X86] Flesh out more of the avx512erintrin.h file. llvm-svn: 227719	2015-02-01 08:52:55 +00:00
Craig Topper	b01fc317c1	[X86] Use macros in AVX512ER header to allow ICE to be checked for immediate argument. llvm-svn: 227716	2015-02-01 08:05:12 +00:00
Craig Topper	67826a5883	[X86] Rename _mm512_valign_epi64/32 intrinsics to _mm512_alignr_epi64/32 to match Intel docs. Make immediate argument to them an ICE. Fix mask size for the alignd version. llvm-svn: 227713	2015-02-01 07:35:40 +00:00
Craig Topper	72c7d51251	[X86] Change rounding parameter of all the AVX512 builtins to an ICE. llvm-svn: 227712	2015-02-01 07:35:35 +00:00
Craig Topper	9fee8ab4f9	[x86] Remove tab characters from avxintrin.h. NFC. llvm-svn: 227676	2015-01-31 06:33:59 +00:00
Craig Topper	459554f164	[X86] Make order consistent between 'const' and 'int' in one of the intrinsic header files. NFC llvm-svn: 227675	2015-01-31 06:31:30 +00:00
Richard Smith	99335be950	Don't use BCPL comments here, in case someone wants to use <stdatomic.h> from C89 mode. llvm-svn: 227417	2015-01-29 03:34:39 +00:00
Hans Wennborg	2e56d950ff	Intrin.h: define _XCR_XFEATURE_ENABLED_MASK Users expect to be able to use this with _xgetbv. llvm-svn: 227270	2015-01-27 23:34:35 +00:00
Craig Topper	335e218760	[X86] Add intrinsics for AVX512 128 and 256 bit integer comparison of word and byte vectors. llvm-svn: 227186	2015-01-27 09:16:29 +00:00
Craig Topper	b4789096c0	[X86] Add AVX512 integer comparison intrinsics for word and byte vectors. llvm-svn: 227079	2015-01-26 09:24:10 +00:00
Craig Topper	2f25a5a875	[X86] Add more of the AVX512 integer comparision intrinsics. This adds 128 and 256 bit vectors of dwords and qwords. llvm-svn: 227075	2015-01-26 08:11:49 +00:00
Craig Topper	4cac1c2318	[X86] Add AVX512F integer comparision intrinsics to header file. llvm-svn: 227067	2015-01-25 23:30:07 +00:00
Adam Nemet	f893edeaea	[AVX512] Add sub-vector FP extracts Analogous to AVX2, these need to be implemented as macros to properly propagate the immediate index operand. Part of <rdar://problem/17688758> llvm-svn: 226496	2015-01-19 20:12:05 +00:00
Craig Topper	f557b09f14	[x86] Mark that the AVX-512 cmpps/cmppd builtins need an ICE for the comparison immediate. This requires converting to a macro in the header file. llvm-svn: 226421	2015-01-19 01:18:19 +00:00
Adam Nemet	c0cff244fc	[AVX512] Add intrinsics for masked aligned FP loads and stores Part of <rdar://problem/17688758> llvm-svn: 226298	2015-01-16 18:51:50 +00:00
Adam Nemet	63a951eb1c	[AVX512] Add FP unpack intrinsics These are implemented with __builtin_shufflevector just like AVX. We have some tests on the LLVM side to assert that these shufflevectors do indeed generate the corresponding unpck instruction. Part of <rdar://problem/17688758> llvm-svn: 225922	2015-01-14 01:31:17 +00:00
Ben Langmuir	c67a774e17	Add [extern_c] attribute to _Builtin_intrinsics module This allows users to import this module inside an extern "C" {} block. llvm-svn: 225835	2015-01-13 21:54:32 +00:00
Chandler Carruth	032d422d2e	Effectively revert r151058 which caused Clang's unwind.h to defer to libunwind in all cases when installed. At the time, Clang's unwind.h didn't provide huge chunks of the LSB-specified unwind interface, and was generally too aenemic to use for real software. However, it has since then become a strict superset of the APIs provided by libunwind on Linux. Notably, you cannot compile llgo's libgo library against libunwind, but you can against Clang's unwind.h. So let's just use our header. =] I've checked pretty thoroughly for any incompatibilities, and I am not aware of any. An open question is whether or not we should continue to munge GNU_SOURCE here. I didn't touch that as it potentially has compatibility implications on systems I cannot easily test -- Darwin. If a Darwin maintainer can verify that this is in fact unnecessary and remove it, cool. Until then, leaving it in makes this change a no-op there, and only really relevant on Linux systems where it is pretty clearly the right way to go. llvm-svn: 224934	2014-12-29 13:29:38 +00:00
Chandler Carruth	f3cabbd424	Add a missing declaration to our unwind.h implementation. This is necessary to be fully compatible with existing software that calls into the linux unwind code. You can find documentation of this API and why it exists in the discussion abot NPTL here: https://gcc.gnu.org/ml/gcc-patches/2003-09/msg00154.html llvm-svn: 224933	2014-12-29 13:29:36 +00:00
Chandler Carruth	28daca211c	[x86] Also add the missing type casts on the returns in the sha intrinsic header file. Along with r224822, this should restore the build bots to passing. llvm-svn: 224883	2014-12-27 11:50:51 +00:00
Craig Topper	ab70789199	[x86] Add missing typecast to __v4si to sha intrinsic header file. llvm-svn: 224882	2014-12-27 07:19:25 +00:00
Craig Topper	2094d8fe88	[x86] Add the (v)cmpps/pd/ss/sd builtins to match gcc. Use them in the sse intrinsic files. This still lower to the same intrinsics as before. This is preparation for bounds checking the immediate on the avx version of the builtin so we don't pass illegal immediates into the backend. Since SSE uses a smaller size immediate its not possible to bounds check when using a shared builtin. Rather than creating a clang specific builtin for the different immediate, I decided (after consulting with Chandler) that it was better to match gcc. llvm-svn: 224879	2014-12-27 06:59:57 +00:00
Eric Christopher	c67e1b6a2a	Make sure that vec_perm is listed as a static function in altivec.h. llvm-svn: 223871	2014-12-10 00:57:43 +00:00
Reid Kleckner	baf7709055	Implement __umulh with __int128 arithmetic Use the same approach as _umul128, but just return the high half. llvm-svn: 223316	2014-12-03 23:36:14 +00:00
David Majnemer	00973ce683	FullProduct should be _FullProduct llvm-svn: 223179	2014-12-02 23:44:40 +00:00
David Majnemer	5450763dd8	Intrin: shrx_u64 should be _shrx_u64 llvm-svn: 223176	2014-12-02 23:30:26 +00:00
David Majnemer	5f9afc59f8	Intrin: Add _umul128 Implement _umul128; it provides the high and low halves of a 128-bit multiply. We can simply use our __int128 arithmetic to implement this, we generate great code for it: movq %rdx, %rax mulq %rcx movq %rdx, (%r8) retq Differential Revision: http://reviews.llvm.org/D6486 llvm-svn: 223175	2014-12-02 23:30:24 +00:00
Reid Kleckner	e35b07ad49	Intercept __crt_va_* used by MSVC "14" Moving further into the implementor's namespace is good, but now we have one more name to intercept. llvm-svn: 222473	2014-11-20 22:44:03 +00:00
Bill Schmidt	8ff672d397	[PowerPC] Enable vec_perm for long long and double vector types for VSX VSX makes the "vector long long" and "vector double" types available. This patch enables the vec_perm interface for these types. The same builtin is generated regardless of the specified type, so no additional work or testing is needed in the back end. Tests are added to ensure this builtin is generated by the front end. llvm-svn: 221988	2014-11-14 13:10:13 +00:00
Bill Schmidt	cee13a2712	[PowerPC] Add VSX builtins for vec_div This patch adds builtin support for xvdivdp and xvdivsp, along with a new test case. The builtins are accessed using vec_div in altivec.h. Builtins are listed (mostly) alphabetically there, so inserting these changed the line numbers for deprecation warnings tested in test/Headers/altivec-intrin.c. There is a companion patch for LLVM. llvm-svn: 221984	2014-11-14 12:10:51 +00:00
Bill Schmidt	9ec8cea02b	[PowerPC] Add vec_vsx_ld and vec_vsx_st intrinsics This patch enables the vec_vsx_ld and vec_vsx_st intrinsics for PowerPC, which provide programmer access to the lxvd2x, lxvw4x, stxvd2x, and stxvw4x instructions. New code in altivec.h defines these in terms of new builtins, which are themselves defined in BuiltinsPPC.def. The builtins are converted to LLVM intrinsics in CGBuiltin.cpp. Additional code is added to builtins-ppc-vsx.c to verify the correct generation of the intrinsics. Note that I moved the other VSX builtins so all VSX builtins will be alphabetical in their own section in BuiltinsPPC.def. There is a companion patch for LLVM. llvm-svn: 221768	2014-11-12 04:19:56 +00:00
Craig Topper	8c7f251e98	Add FSGSBASE intrinsics to x86 intrinsic headers. llvm-svn: 221130	2014-11-03 06:51:41 +00:00
Craig Topper	554797f255	Remove definitions from Intrin.h that already exist in one of the other x86 intrinsic headers. Add a run line with Broadwell as the cpu type to ms-intrin.cpp test to catch some of these in the future. llvm-svn: 221127	2014-11-03 04:19:58 +00:00
Craig Topper	e1c664b136	Add _lzcnt_u32 and _lzcnt_u64 to lzcntintrin.h to match Intel documentation names for these intrinsics. llvm-svn: 221066	2014-11-01 22:50:57 +00:00
Craig Topper	a52e0d7cc0	Avoid undefined behavior in the x86 bmi header file by explicitly checking for 0 before calling __builtin_ctz. Without this the optimizers may take advantage of the undefined behavior and produce incorrect results. LLVM itself still needs to be taught to merge the zero check into the llvm.cttz with defined zero behavior. llvm-svn: 221065	2014-11-01 22:50:54 +00:00
Craig Topper	3ca55d9c41	Avoid undefined behavior in the x86 lzcnt header file by explicitly checking for 0 before calling __builtin_clz. Without this the optimizers may take advantage of the undefined behavior and produce incorrect results. LLVM itself still needs to be taught to merge the zero check into the llvm.ctlz with defined zero behavior. llvm-svn: 221064	2014-11-01 22:25:23 +00:00
Bill Schmidt	691e01d94e	[PowerPC] Initial VSX intrinsic support, with min/max for vector double Now that we have initial support for VSX, we can begin adding intrinsics for programmer access to VSX instructions. This patch performs the necessary enablement in the front end, and tests it by implementing intrinsics for minimum and maximum using the vector double data type. The main change in the front end is to no longer disallow "vector" and "double" in the same declaration (lib/Sema/DeclSpec.cpp), but "vector" and "long double" must still be disallowed. The new intrinsics are accessed via vec_max and vec_min with changes in lib/Headers/altivec.h. Note that for v4f32, we already access corresponding VMX builtins, but with VSX enabled we should use the forms that allow all 64 vector registers. The new built-ins are defined in include/clang/Basic/BuiltinsPPC.def. I've added a new test in test/CodeGen/builtins-ppc-vsx.c that is similar to, but much smaller than, builtins-ppc-altivec.c. This allows us to test VSX IR generation without duplicating CHECK lines for the existing bazillion Altivec tests. Since vector double is now legal when VSX is available, I've modified the error message, and changed where we test for it and for vector long double, since the target machine isn't visible in the old place. This serendipitously removed a not-pertinent warning about 'long' being deprecated when used with 'vector', when "vector long double" is encountered and we just want to issue an error. The existing tests test/Parser/altivec.c and test/Parser/cxx-altivec.cpp have been updated accordingly, and I've added test/Parser/vsx.c to verify that "vector double" is now legitimate with VSX enabled. There is a companion patch for LLVM. llvm-svn: 220989	2014-10-31 19:19:24 +00:00
Saleem Abdulrasool	a25fbef088	CodeGen: add __readfsdword builtin The Windows NT SDK uses __readfsdword and declares it as a compiler provided builtin (#pragma intrinsic(__readfsword). Because intrin.h is not referenced by winnt.h, it is not possible to provide an out-of-line definition for the intrinsic. Provide a proper compiler builtin definition. llvm-svn: 220859	2014-10-29 16:35:41 +00:00
NAKAMURA Takumi	a267847538	<float.h>: Don't seek #include_next if -ffreestanding for targeting mingw. llvm-svn: 220356	2014-10-22 01:25:49 +00:00
Hans Wennborg	818514b718	vadefs.h: be even more conservative and only define the macros if already defined llvm-svn: 219745	2014-10-14 23:20:25 +00:00
Hans Wennborg	752b789e7b	Sort files list in lib/Headers/CMakeLists.txt majnemer pointed out that vadefs.h was added in the wrong place. Might as well sort the rest too. llvm-svn: 219743	2014-10-14 23:15:43 +00:00
Hans Wennborg	adfd7f6ef4	MS Compat: interpose vadefs.h to fix definitions of _crt_va_{start,end,arg} (PR21247) Differential revision: http://reviews.llvm.org/D5784 llvm-svn: 219740	2014-10-14 22:35:42 +00:00
Robert Khasanov	33e7685b2a	Added new headers to CMakeLists.txt. Fix for rev219319 llvm-svn: 219325	2014-10-08 17:37:51 +00:00
Robert Khasanov	b9f3a911c9	[AVX512] Added VPCMPEQ intrinisics to headers. Added tests. Patch by Maxim Blumenthal <maxim.blumenthal@intel.com> llvm-svn: 219319	2014-10-08 17:18:13 +00:00
Bill Schmidt	cad3a5f7d4	[PATCH][Power] Fix (and deprecate) vec_lvsl and vec_lvsr for little endian The use of the vec_lvsl and vec_lvsr interfaces are discouraged for little endian targets since Power8 hardware is a minimum requirement, and Power8 provides reasonable performance for unaligned vector loads and stores. Up till now we have not provided "correct" (i.e., big- endian-compatible) code generation for these interfaces, as to do so produces poorly performing code. However, this has become the source of too many questions. With this patch, LLVM will now produce compatible code for these interfaces, but will also produce a deprecation warning message for PPC64LE when one of them is used. This should make the porting direction clearer to programmers. A similar patch has recently been committed to GCC. This patch includes a test for the warning message. There is a companion patch that adds two unit tests to projects/test-suite. llvm-svn: 219137	2014-10-06 19:02:20 +00:00
Hal Finkel	6970ac8b0a	Add an implementation of C11's stdatomic.h Adds a Clang-specific implementation of C11's stdatomic.h header. On systems, such as FreeBSD, where a stdatomic.h header is already provided, we defer to that header instead (using our __has_include_next technology). Otherwise, we provide an implementation in terms of our __c11_atomic_* intrinsics (that were created for this purpose). C11 7.1.4p1 requires function declarations for atomic_thread_fence, atomic_signal_fence, atomic_flag_test_and_set, atomic_flag_test_and_set_explicit, and atomic_flag_clear, and requires that they have external linkage. Accordingly, we provide these declarations, but if a user elides the shadowing macros and uses them, then they must have a libc (or similar) that actually provides definitions. atomic_flag is implemented using _Bool as the underlying type. This is consistent with the implementation provided by FreeBSD and also GCC 4.9 (at least when __GCC_ATOMIC_TEST_AND_SET_TRUEVAL == 1). Patch by Richard Smith (rebased and slightly edited by me -- Richard said I should drive at this point). llvm-svn: 218957	2014-10-03 04:29:40 +00:00
Richard Smith	ef99e4d88a	Fix interaction of max_align_t and modules. When building with modules enabled, we were defining max_align_t as a typedef for a different anonymous struct type each time it was included, resulting in an error if <stddef.h> is not covered by a module map and is included more than once in the same modules-enabled compilation of C11 or C++11 code. llvm-svn: 218931	2014-10-03 00:31:35 +00:00
Joerg Sonnenberger	2960178a77	Fix trailing commas in AMD define. llvm-svn: 218825	2014-10-01 21:22:17 +00:00
Joerg Sonnenberger	e028e05a7e	Add the various signature macros. llvm-svn: 218824	2014-10-01 21:21:42 +00:00
Joerg Sonnenberger	cf0740454d	Rename bit_RDRAND to bit_RDRND to match GCC's version of this header. llvm-svn: 218823	2014-10-01 21:21:16 +00:00
Robert Khasanov	ea13042cf2	[x86] Fixed argument types in intrinsics: _addcarryx_u64 _addcarry_u64 _subborrow_u64 Thanks Pasi Parviainen for notice. llvm-svn: 218376	2014-09-24 06:45:23 +00:00
Akira Hatanaka	416efb5f90	Fix bugs in cpuid.h. This commit makes two changes: - Remove the push and pop instructions that were saving and restoring %ebx before and after cpuid in 32-bit pic mode. We were doing this to ensure we don't lose the GOT address in pic register %ebx, but this isn't necessary because the GOT address is kept in a virtual register. - In 64-bit mode, preserve base register %rbx around cpuid. This fixes PR20311 and rdar://problem/17686779. llvm-svn: 218173	2014-09-20 01:31:09 +00:00
Robert Khasanov	2c589bcc5e	[x86] Add _addcarry_u{32\|64} and _subborrow_u{32\|64}. They are added to adxintrin.h but outside __ADX__ block. These intrinics generates adc and sbb correspondingly that were available before ADX llvm-svn: 218118	2014-09-19 10:29:22 +00:00
Robert Khasanov	83c419b349	[x86] Added _addcarryx_u32, _addcarryx_u64 intrinsics llvm-svn: 218117	2014-09-19 10:17:06 +00:00
Yi Kong	a8833f0c28	arm_acle: Fix error in ROR implementation The logic in calculating the rotate amount was flawed. Thanks Pasi Parviainen for pointing out! llvm-svn: 216669	2014-08-28 15:25:52 +00:00
Yi Kong	623393f31e	arm_acle: Implement data processing intrinsics Summary: ACLE 2.0 section 9.2 defines the following "miscellaneous data processing intrinsics": `__clz`, `__cls`, `__ror`, `__rev`, `__rev16`, `__revsh` and `__rbit`. `__clz` has already been implemented in the arm_acle.h header file. The rest are not supported yet. This patch completes ACLE data processing intrinsics. Reviewers: t.p.northover, rengolin Reviewed By: rengolin Subscribers: aemerson, mroth, llvm-commits Differential Revision: http://reviews.llvm.org/D4983 llvm-svn: 216658	2014-08-28 09:44:07 +00:00
Yi Kong	6891746cd8	arm_acle: Add mappings for dbg intrinsic This completes all ACLE hint intrinsics. llvm-svn: 216453	2014-08-26 12:48:11 +00:00
Yi Kong	0705e0065e	arm_acle: Implement swap intrinsic Insert the LDREX/STREX instruction sequence specified in ARM ACLE 2.0, as SWP instruction is deprecated since ARMv6. llvm-svn: 216446	2014-08-26 09:50:54 +00:00
Yi Kong	70cf4c626e	arm_acle.h: Small cleanup Since __SIZEOF_LONG_LONG__ is always defined as 8 on ARM targets, there's no point in checking this. NFC. Patch by Moritz Roth. llvm-svn: 215697	2014-08-15 08:53:22 +00:00
Adam Nemet	2278fcbf0c	[AVX512] Add FMA intrinsics Part of <rdar://problem/17688758> llvm-svn: 215666	2014-08-14 17:17:57 +00:00
Yi Kong	45a09319bf	ARM: Add mappings for ACLE prefetch intrinsics Implement __pld, __pldx, __pli and __plix builtin intrinsics as specified in ARM ACLE 2.0. llvm-svn: 215599	2014-08-13 23:20:15 +00:00
Adam Nemet	4abc07cb75	[AVX512] Add intrinsics for FP scalar broadcasts Similar approach to the set1 intrinsics is used: implement in terms of vector initializers and then ensure with an LLVM test that a broadcast is generated at the end. Part of <rdar://problem/17688758> llvm-svn: 215486	2014-08-13 00:29:01 +00:00
Adam Nemet	5bf7baa938	[AVX512] Add intrinsic for valignd/q Note that similar to palingr, we could further optimize these to emit shufflevector when the shift count is <=64. This however does not change the overall design that unlike palignr we would still need the LLVM intrinsic corresponding to this intruction to handle the >64 cases. (palignr uses the psrldq intrinsic in this case.) llvm-svn: 214891	2014-08-05 17:28:23 +00:00
Bill Schmidt	ccbe0a8022	[PPC64LE] Fix wrong IR for vec_sld and vec_vsldoi My original LE implementation of the vsldoi instruction, with its altivec.h interfaces vec_sld and vec_vsldoi, produces incorrect shufflevector operations in the LLVM IR. Correct code is generated because the back end handles the incorrect shufflevector in a consistent manner. This patch and a companion patch for LLVM correct this problem by removing the fixup from altivec.h and the corresponding fixup from the PowerPC back end. Several test cases are also modified to reflect the now-correct LLVM IR. The vec_sums and vec_vsumsws interfaces in altivec.h are also fixed, because they used vec_perm calls intended to be recognized as vsldoi instructions. These vec_perm calls are now replaced with code that more clearly shows the intent of the transformation. llvm-svn: 214801	2014-08-04 23:21:26 +00:00
Adam Nemet	da82bcc4dd	[AVX512] Add unaligned FP load intrinsics Part of <rdar://problem/17688758> llvm-svn: 214380	2014-07-31 04:00:39 +00:00
Adam Nemet	2db1d2fb32	[AVX512] Add intrinsic for knot Part of <rdar://problem/17688758> llvm-svn: 214316	2014-07-30 16:51:27 +00:00
Adam Nemet	c871ff95f3	[AVX512] Add some of the FP cast intrinsics Part of <rdar://problem/17688758> llvm-svn: 214315	2014-07-30 16:51:24 +00:00
Adam Nemet	f42e7a274a	[AVX512] Add set1 intrinsics (Dropped the byte and word variants from the patch. Turns out these are not part of AVX512F but only AVX512BW/VL.) Part of <rdar://problem/17688758> llvm-svn: 214314	2014-07-30 16:51:22 +00:00
Joerg Sonnenberger	3d9478cf3a	Change __INTx_TYPE__ to be always signed. This changes the value for char-based types from "char" to "signed char". Adjust stdint.h to use __INTx_TYPE__ directly without prefixing it with signed and to use __UINTx_TYPE__ for unsigned ones. The value of __INTx_TYPE__ now matches GCC. llvm-svn: 214119	2014-07-28 21:06:22 +00:00
Adam Nemet	fce1ad0b99	[AVX512] Add non-masking FP store intrinsics Part of <rdar://problem/17688758> llvm-svn: 214099	2014-07-28 17:14:45 +00:00
Adam Nemet	a3ebe6214b	[AVX512] Add FP add/sub/mul intrinsics Part of <rdar://problem/17688758> llvm-svn: 214098	2014-07-28 17:14:42 +00:00
Adam Nemet	0d5bb5530d	[AVX512] Reorder functions in avx512fintrin.h There is no functional change here. The idea is to have a similar order and categories of functions that we have in avxintrin.h. llvm-svn: 214097	2014-07-28 17:14:40 +00:00
Adam Nemet	9a3ea60a2c	[AVX512] Bring the formatting of avx512fintrin.h closer to avxintrin.h llvm-svn: 214096	2014-07-28 17:14:38 +00:00
Yi Kong	cd08139865	Add module map entry for ARM ACLE header file llvm-svn: 213731	2014-07-23 09:00:21 +00:00
Elena Demikhovsky	bd1a49bf81	AVX-512: I added new headers to makefiles. It should resolve tests fail. If it will not, I'm reverting the both commits. llvm-svn: 213645	2014-07-22 12:08:25 +00:00
Elena Demikhovsky	fcc6df310d	AVX-512: Added intrinsics to clang. The set is small, that what I have right now. Everybody is welcome to add more. llvm-svn: 213641	2014-07-22 11:31:39 +00:00
Viktor Kutuzov	99400a5a34	Revert D3908 due to issues on Mac platforms llvm-svn: 213450	2014-07-19 05:58:38 +00:00
Yi Kong	28d7b02687	ARM: Add ACLE memory barrier intrinsic mapping llvm-svn: 213261	2014-07-17 12:45:17 +00:00
Yi Kong	472e521cec	ARM: Add NOP intrinsic mapping in arm_acle.h llvm-svn: 212950	2014-07-14 15:32:29 +00:00
Saleem Abdulrasool	07257fe14e	Headers: add hint intrinsics to arm_acle.h This adds the ARM ACLE hint intrinsic wrappers to arm_acle.h. These need to be protected with a !defined(_MSC_VER) since MSVC (and thus clang in compatibility mode) provide these wrappers as proper builtin intrinsics. llvm-svn: 212891	2014-07-12 23:27:26 +00:00
Yi Kong	4e00ce7d0c	Improve comments of ARM ACLE header file and tests Include section number in ARM ACLE specification for easier navigation. llvm-svn: 212887	2014-07-12 22:48:13 +00:00
Viktor Kutuzov	63537656c6	Add clang headers that fix machine-dependent definitions on FreeBSD 9.2 Differential Revision: http://reviews.llvm.org/D3908 llvm-svn: 212689	2014-07-10 08:43:39 +00:00
Nico Weber	a62cffae52	Don't pull in setjmp.h in -ffreestanding compiles. Also provide _setjmpex(). r200243 put in _setjmp() and _setjmpex() behind a comment since jmp_buf wasn't available. r200344 added jmp_buf and put in _setjmp(), but missed _setjmpex(). llvm-svn: 212557	2014-07-08 18:34:46 +00:00
Nico Weber	1287091373	Replace a few // comments with /**/ comments in headers, for consistency. llvm-svn: 212556	2014-07-08 18:29:27 +00:00
Saleem Abdulrasool	c4ebb129b7	Headers: conditionalise more declarations Protect MMX specific declarations under a __MMX__ guard. This header can be included on non-x86 architectures (e.g. ARM) which do not support the MMX ISA. Use the preprocessor to prevent these declarations from being processed. llvm-svn: 212512	2014-07-08 05:46:04 +00:00
Saleem Abdulrasool	60df0615b6	Headers: mark arm_acle.h with extern "C" Although the functions are marked as always_inline, the compiler with which they are used may not honour the extended attributes and emit them as functions. In such a case, indicate that they should have extern "C" linkage and should not be mangled in C++ style if used within C++. llvm-svn: 212511	2014-07-08 05:46:00 +00:00
Renato Golin	47843efcf6	Add the __qdbl intrinsic to the arm_acle.h header Patch by: Moritz Roth llvm-svn: 212264	2014-07-03 10:14:52 +00:00
Yaron Keren	672efea2e9	Added standard macro guard. In case __GNUC_VA_LIST was not defined or defined identically before there will not be any change in functionality. MinGW-w64 defines __GNUC_VA_LIST as #define __GNUC_VA_LIST which is different than the definition here, causing a warning without the guard. llvm-svn: 212183	2014-07-02 15:25:03 +00:00
Andrea Di Biagio	eb606a3c27	[x86] Add Clang support for intrinsic __rdpmc. This patch adds intrinsic __rdpmc to header file 'ia32intrin.h'. Intrinsic __rdmpc can be used to read performance monitoring counters. It is implemented as a direct call to __builtin_ia32_rdpmc. It takes as input a value representing the index of the performance counter to read. The value of the performance counter is then returned as a unsigned 64-bit quantity. llvm-svn: 212053	2014-06-30 18:23:58 +00:00
Yi Kong	a44c4d7173	Introduce arm_acle.h supporting existing LLVM builtin intrinsics Summary: This patch introduces ACLE header file, implementing extensions that can be directly mapped to existing Clang intrinsics. It implements for both AArch32 and AArch64. Reviewers: t.p.northover, compnerd, rengolin Reviewed By: compnerd, rengolin Subscribers: rnk, echristo, compnerd, aemerson, mroth, cfe-commits Differential Revision: http://reviews.llvm.org/D4296 llvm-svn: 211962	2014-06-27 21:25:42 +00:00
Saleem Abdulrasool	702eefed9a	Headers: be a bit more careful about inline asm Conditionally include x86intrin.h if we are building for x86 or x86_64. Conditionalise definition of inline assembly routines which use x86 or x86_64 inline assembly. This is needed as clang can target Windows on ARM where these definitions may be included into user code. llvm-svn: 211716	2014-06-25 16:48:40 +00:00
Saleem Abdulrasool	114efe0dc8	CodeGen: improve ms instrincics support Add support for _InterlockedCompareExchangePointer, _InterlockExchangePointer, _InterlockExchange. These are available as a compiler intrinsic on ARM and x86. These are used directly by the Windows SDK headers without use of the intrin header. llvm-svn: 211216	2014-06-18 20:51:10 +00:00
Bill Schmidt	1cf7c64fa5	[PPC64LE] Run some existing Altivec tests on powerpc64le as well There are several Altivec tests that formerly ran only on big-endian targets (and in some cases only on 32-bit targets). It is useful to verify these on little-endian targets as well. While testing these, I discovered a typo in <altivec.h>. This is also fixed by this patch. llvm-svn: 210928	2014-06-13 18:30:06 +00:00
Bill Schmidt	56a6967000	[PPC64LE] Fix vec_sld and vec_vsldoi for little endian The vec_sld and vec_vsldoi interfaces perform a left-shift on vector arguments for both big and little endian. However, because they rely on the vec_perm interface which is endian-dependent, the permutation vector needs to be reversed for LE to get the proper shift direction. I've added some extra testing for these interfaces for LE in the builtins-ppc-altivec.c. llvm-svn: 210657	2014-06-11 15:48:46 +00:00
Bill Schmidt	7f6596bb13	[PPC64LE] Implement little-endian semantics for vec_sums The PowerPC vsumsws instruction, accessed via vec_sums, is defined architecturally with a big-endian bias, in that the second input vector and the result always reference big-endian element 3 (little-endian element 0). For ease of porting, the programmer wants elements 3 in both cases. To provide this semantics, for little endian we generate a permute for the second input vector prior to the vsumsws instruction, and generate a permute for the result vector following the vsumsws instruction. The correctness of this code is tested by the new sums.c test added in a previous patch, as well as the modifications to builtins-ppc-altivec.c in the present patch. llvm-svn: 210449	2014-06-09 03:31:47 +00:00
Bill Schmidt	d7c53a91df	[PPC64LE] Implement little-endian semantics for vec_unpack[hl] The PowerPC vector-unpack-high and vector-unpack-low instructions are defined architecturally with a big-endian bias, in that the vector element numbering is assumed to be "left to right" regardless of whether the processor is in big-endian or little-endian mode. This effectively reverses the meaning of "high" and "low." Such a definition is unnatural for little-endian code generation. To facilitate ease of porting, the vec_unpackh and vec_unpackl interfaces are designed to use natural element ordering, so that elements are numbered according to little-endian design principles when code is generated for a little-endian target. The desired semantics can be achieved by using the opposite instruction for little-endian mode. That is, when a call to vec_unpackh appears in the code, a vector-unpack-low is generated, and when a call to vec_unpackl appears in the code, a vector-unpack-high is generated. The correctness of this code is tested by the new unpack.c test added in a previous patch, as well as the modifications to builtins-ppc-altivec.c in the present patch. Note that these interfaces were originally incorrectly implemented when they take a vector pixel argument. This patch corrects this implementation for both big- and little-endian code generation. llvm-svn: 210391	2014-06-07 02:20:52 +00:00
Bill Schmidt	7f0a5c5141	[PPC64LE] Update builtins-ppc-altivec.c for PPC64 and PPC64LE The Altivec builtin test case test/CodeGen/builtins-ppc-altivec.c has always been executed only for 32-bit PowerPC. These tests are equally valid for 64-bit PowerPC. This patch updates the test to be run for three targets: powerpc-unknown-unknown, powerpc64-unknown-unknown, and powerpc64le-unknown-unknown. The expected code generation changes for some of the Altivec builtins for little endian, so this patch adds new CHECK-LE variants to the test for the powerpc64le target. These tests satisfy the testing requirements for some previous patches committed over the last couple of days for lib/Headers/altivec.h: r210279 for vec_perm, r210337 for vec_mul[eo], and r210340 for vec_pack. llvm-svn: 210384	2014-06-06 23:12:00 +00:00
Bill Schmidt	8a7b4f18bd	[PPC64LE] Implement little-endian semantics for vec_pack family The PowerPC vector-pack instructions are defined architecturally with a big-endian bias, in that the vector element numbering is assumed to be "left to right" regardless of whether the processor is in big-endian or little-endian mode. This definition is unnatural for little-endian code generation. To facilitate ease of porting, the vec_pack and related interfaces are designed to use natural element ordering, so that elements are numbered according to little-endian design principles when code is generated for a little-endian target. The vec_pack calls are implemented as calls to vec_perm, specifying selection of the odd-numbered vector elements. For little endian, this means the odd-numbered elements counting from the right end of the register. Since the underlying instructions count from the left end, we must instead select the even-numbered vector elements for little endian to achieve the desired semantics. The correctness of this code is tested by the new pack.c test added in a previous patch. I plan to later make the existing ppc32 Altivec compile-time tests work for ppc64 and ppc64le as well. llvm-svn: 210340	2014-06-06 15:10:47 +00:00
Bill Schmidt	7c0114f6e3	[PPC64LE] Implement little-endian semantics for vec_mul[eo] The PowerPC vector-multiply-even and vector-multiply-odd instructions are defined architecturally with a big-endian bias, in that the vector element numbering is assumed to be "left to right" regardless of whether the processor is in big-endian or little-endian mode. This definition is unnatural for little-endian code generation. To facilitate ease of porting, the vec_mule and vec_mulo interfacs are designed to use natural element ordering, so that elements are numbered according to little-endian design principles when code is generated for a little-endian target. The desired semantics can be achieved by using the opposite instruction for little-endian mode. That is, when a call to vec_mule appears in the code, a vector-multiply-odd is generated, and when a call to vec_mulo appears in the code, a vector-multiply-even is generated. The correctness of this code is tested by the new mult-even-odd.c test added in a previous patch. I plan to later make the existing ppc32 Altivec compile-time tests work for ppc64 and ppc64le as well. llvm-svn: 210337	2014-06-06 14:45:06 +00:00
Bill Schmidt	f7e289c0f2	[PPC64LE] Implement little-endian semantics for vec_perm The PowerPC vperm (vector permute) instruction is defined architecturally with a big-endian bias, in that the two input vectors are assumed to be concatenated "left to right" and the elements of the combined input vector are assumed to be numbered from "left to right" (i.e., with element 0 referencing the high-order element). This definition is unnatural for little-endian code generation. To facilitate ease of porting, the vec_perm interface is designed to use natural element ordering, so that elements are numbered according to little-endian design principles when code is generated for a little-endian target. The desired semantics can be achieved with the vperm instruction provided that the two input vector registers are reversed, and the permute control vector is complemented. The complementing is performed using an xor with a vector containing all one bits. Only the rightmost 5 bits of each element of the permute control vector are relevant, so it would be possible to complement the vector with respect to a <16xi8> vector containing all 31s. However, when the permute control vector is not a constant, using 255 instead has the advantage that the vec_xor can be recognized during code generation as a vnor instruction. (Power8 introduces a vnand instruction which could alternatively be generated.) The correctness of this code is tested by the new perm.c test added in a previous patch. I plan to later make the existing ppc32 Altivec compile-time tests work for ppc64 and ppc64le as well. llvm-svn: 210279	2014-06-05 19:07:40 +00:00
Adam Nemet	286ae08e7d	Implement AVX1 vbroadcast intrinsics with vector initializers These intrinsics are special because they directly take a memory operand (AVX2 adds the register counterparts). Typically, other non-memop intrinsics take registers and then it's left to isel to fold memory operands. In order to LICM intrinsics directly reading memory, we require that no stores are in the loop (LICM) or that the folded load accesses constant memory (MachineLICM). When neither is the case we fail to hoist a loop-invariant broadcast. We can work around this limitation if we expose the load as a regular load and then just implement the broadcast using the vector initializer syntax. This exposes the load to LICM and other optimizations. At the IR level this is translated into a series of insertelements. The sequence is already recognized as a broadcast so there is no impact on the quality of codegen. _mm256_broadcast_pd and _mm256_broadcast_ps are not updated by this patch because right now we lack the DAG-combiner smartness to recover the broadcast instructions. This will be tackled in a follow-on. There will be completing changes on the LLVM side to remove the LLVM intrinsics and to auto-upgrade bitcode files. Fixes <rdar://problem/16494520> llvm-svn: 209846	2014-05-29 20:47:29 +00:00
Sanjay Patel	1585fb94ab	added Intel's BMI intrinsic variants (fixes PR19431 - http://llvm.org/bugs/show_bug.cgi?id=19431) llvm-svn: 209769	2014-05-28 20:26:57 +00:00
Akira Hatanaka	5d28ea1451	Fix a bug in xmmintrin.h. The last step of _mm_cvtps_pi16 should use _mm_packs_pi32, which is a function that reads two __m64 values and packs four 32-bit values into four 16-bit values. <rdar://problem/16873717> llvm-svn: 209489	2014-05-23 00:38:07 +00:00
Timur Iskhodzhanov	a27b044166	Define the InterlockedCompareExchange64 intrinsic on 32-bits too llvm-svn: 208699	2014-05-13 13:59:05 +00:00
Filipe Cabecinhas	5d289b48b1	Patched clang to emit x86 blends as shufflevectors. Summary: Most of the clang header patch by Simon Pilgrim @ SCEE. Also fixed (or added) clang tests for these intrinsics. LLVM tests to make sure we get the blend instruction out of these shufflevectors are at http://reviews.llvm.org/D3600 Reviewers: eli.friedman, craig.topper, rafael Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D3601 llvm-svn: 208664	2014-05-13 02:37:02 +00:00
Nico Weber	272bcf6768	Let stddef.h respect __need_{wchar_t, size_t, NULL, ptrdiff_t, wint_t}. glibc expects that stddef.h only defines a single thing if either of these defines is set. For example, before this change, a C file containing #include <stdlib.h> int ptrdiff_t = 0; would compile with gcc but not with clang. Now it compiles with clang too. This also fixes PR12997, where older versions of the Linux headers would define NULL incorrectly, and glibc would define __need_NULL and expect stddef.h to redefine NULL with the correct definition. llvm-svn: 207606	2014-04-30 04:35:09 +00:00
Nico Weber	f077c51a70	Revert r207482; I fail at reading IRC. llvm-svn: 207483	2014-04-29 01:25:49 +00:00
Nico Weber	8af28c1e61	Let stddef.h redefine NULL if __need_NULL is set, as needed by glibc, PR12997. See the bug and the cfe-commits thread "[patch] Let stddef.h redefine NULL if __need_NULL is set" for discussion. Fixes PR12997 and is similar to the __need_wint_t bits already in this file. llvm-svn: 207482	2014-04-29 01:19:21 +00:00
Hans Wennborg	ac156e2225	Intrin.h: remove __rdtsc and __rdtscp declarations Since r207132, these are defined in ia32intrin.h. llvm-svn: 207134	2014-04-24 18:40:06 +00:00
Andrea Di Biagio	7ceec07cf6	[X86] Add Clang support for intrinsics __rdtsc and __rdtscp. This patch: 1. Adds a definition for two new GCCBuiltins in BuiltinsX86.def: __builtin_ia32_rdtsc; __builtin_ia32_rdtscp; 2. Replaces the already existing definition of intrinsic __rdtsc in ia32intrin.h with a simple call to the new GCC builtin __builtin_ia32_rdtsc. 3. Adds a definition for the new intrinsic __rdtscp in ia32intrin.h llvm-svn: 207132	2014-04-24 18:26:35 +00:00
Ben Langmuir	47d1ca4838	Rename lib/Headers/module.map to module.modulemap Don't install a file using the legacy spelling. llvm-svn: 206431	2014-04-17 00:52:48 +00:00
Reid Kleckner	6df5254d6f	intrin.h: Fix up bugs in the cr3 and msr intrinsics Don't include input and output regs in clobbers. Prefix some identifiers with __. Add a memory constraint to __readcr3 to prevent reordering. This constraint is heavy handed, but conservatively correct. Thanks to PaX Team for the suggestions. llvm-svn: 205778	2014-04-08 17:49:16 +00:00
Reid Kleckner	592dc61acf	intrin.h: Implement __readmsr, __readcr3, and __writecr3 Fixes PR19301. Based on a patch from Steven Graf! llvm-svn: 205751	2014-04-08 00:28:22 +00:00
Alexey Volkov	ae43aae96a	Added _rdtsc intrinsics by Robert Khasanov Differential Revision: http://llvm-reviews.chandlerc.com/D3212 llvm-svn: 205172	2014-03-31 08:08:46 +00:00
Tim Northover	fe7a445bf7	Install: add arm_neon.h header back I'd gone too far pruning aarch64_simd.h this time and took out one instance of arm_neon.h. This should restore us to the status quo. llvm-svn: 205111	2014-03-29 17:35:34 +00:00
Tim Northover	dca92dbc82	Remove stray references to aarch64_simd.h They were causing the autotools builds to fail. llvm-svn: 205103	2014-03-29 15:21:06 +00:00
Tim Northover	a2ee433c8d	ARM64: initial clang support commit. This adds Clang support for the ARM64 backend. There are definitely still some rough edges, so please bring up any issues you see with this patch. As with the LLVM commit though, we think it'll be more useful for merging with AArch64 from within the tree. llvm-svn: 205100	2014-03-29 15:09:45 +00:00
Reid Kleckner	7dd8bc0a84	Intrin.h: Implement _InterlockedExchangePointer llvm-svn: 204827	2014-03-26 16:09:48 +00:00
Hans Wennborg	a316933e09	MS intrinsics: __interlockedbittestandset(64) (PR19054) llvm-svn: 203816	2014-03-13 17:05:09 +00:00
Hans Wennborg	d9be72ec44	MS intrinsics: implement the __movs and __stos intrinsics (PR19054) llvm-svn: 203722	2014-03-12 22:00:32 +00:00
Hans Wennborg	a4421e03fa	MS intrinsics: implement __readgs{byte,word,dword,qword} (PR19054) llvm-svn: 203715	2014-03-12 21:09:05 +00:00
Hans Wennborg	dd0f5304f6	MS intrinsics: don't declare __readeflags and __writeeflags in Intrin.h They're already defined in ia32intrin.h, and this would cause including Intrin.h in 64-bit mode to fail because of conflicting types. Update ms-intrin.cpp to also run in 64-bit mode to catch things like this. llvm-svn: 203714	2014-03-12 21:09:03 +00:00
David Majnemer	1e57976ec0	Headers: Provide an ABI compatible max_align_t when _MSC_VER is defined Summary: Our usual definition of max_align_t wouldn't match up with MSVC if it was used in a template argument. Reviewers: chandlerc, rsmith, rnk Reviewed By: chandlerc CC: cfe-commits Differential Revision: http://llvm-reviews.chandlerc.com/D2924 llvm-svn: 202911	2014-03-04 23:43:48 +00:00
Roman Divacky	b8322b13f8	The wmmintrin.h header includes two different sub-headers: one for AES support and one for PCLMUL support. The current immintrin.h header only includes wmmintrin.h if AES support is enabled. It should include it if either AES or PCLMUL is enabled (GCC's version of immintrin.h does this). Patch by John Baldwin! llvm-svn: 202871	2014-03-04 18:26:12 +00:00
Argyrios Kyrtzidis	7ffeea4ef3	[CMake] Add the newly introduced compiler header. llvm-svn: 202792	2014-03-04 06:28:23 +00:00
Alexey Bataev	af02c1c003	Fix for r202778 - Implement __readeflags and __writeeflags intrinsics (renamed res to __res) llvm-svn: 202784	2014-03-04 03:42:58 +00:00
Alexey Bataev	7cab007902	Implement __readeflags and __writeeflags intrinsics llvm-svn: 202778	2014-03-04 03:03:03 +00:00
Warren Hunt	0dc28ea301	[_mm_prefetch] Returning previously deleted comment. No functional change. It's unclear if the word FIXME is relevant given that the macro behaves as intended. llvm-svn: 201920	2014-02-22 00:47:24 +00:00
Warren Hunt	20e4a5d2af	Reapply 201734 but with appropriate gcc compatibility Because GCC incorrectly defines _mm_prefetch to take anything that casts to void, people have started using that behavior. The previous patch that made _mm_prefetch actually take a const char broke compatibility with existing code. This update to the patch leaves the macro that defines _mm_prefetch with the (void*) cast when _MSC_VER is not defined. llvm-svn: 201901	2014-02-21 23:08:53 +00:00
Daniel Jasper	2f0f297bdb	Revert r201734 and r201742. This breaks backwards compatibility with existing code. Previously, this was defined as #define _mm_prefetch(a, sel) (__builtin_prefetch((void )(a), 0, (sel))) Which basically accepts any pointer. Changing this to char simply breaks a lot of existing code. I have tried changing char* to "const void*", which seems to be the right thing as per Intel specification this should work on basically any pointer. However, apparently this breaks windows compatibility (because of a conflicting declaration in windows.h). So, we probably need to #ifdef this based on whether clang is compiling for windows. According to Chandler, this might be done by introducing an additional symbol to a fake type in BuiltinsX86.def and then condition the type expansion on the platform. llvm-svn: 201775	2014-02-20 11:10:48 +00:00

... 4 5 6 7 8 ...

1032 Commits