llvm-project

Author	SHA1	Message	Date
Michael Kuperstein	e45af54cdb	[X86] Rename DEFAULT_FN_ATTR macro to __DEFAULT_FN_ATTR llvm-svn: 241065	2015-06-30 13:36:19 +00:00
Eric Christopher	9fc7fb274e	Update the intel intrinsic headers to use the target attribute support. This involved removing the conditional inclusion and replacing them with target attributes matching the original conditional inclusion and checks. The testcase update removes the macro checks for each file and replaces them with usage of the __target__ attribute, e.g.: int __attribute__((__target__(("sse3")))) foo(int a) { _mm_mwait(0, 0); return 4; } This usage does require the enclosing function have the requisite __target__ attribute for inlining and code generation - also for any macro intrinsic uses in the enclosing function. There's no change for existing uses of the intrinsic headers. llvm-svn: 239883	2015-06-17 07:09:32 +00:00
Eric Christopher	4d185168e9	Use a define for per-file function attributes for the Intel intrinsic headers. This is a precursor to changing them to use the new target attribute code. llvm-svn: 239882	2015-06-17 07:09:20 +00:00
Michael Kuperstein	877f3cbe84	[X86] Add _mm_broadcastsd_pd intrinsic. _mm_broadcastsd_pd is basically an alias for _mm_movedup_pd; however, the alias is only available from AVX2 forward. llvm-svn: 237698	2015-05-19 14:49:14 +00:00
Michael Kuperstein	6168183e04	[X86] Added _mm256_bslli_epi128 and _mm256_bsrli_epi128. These two intrinsics are alternative names for _mm256_slli_si256 and _mm256_srli_si256, respectively. llvm-svn: 237693	2015-05-19 13:05:46 +00:00
Ekaterina Romanova	b929ad7b17	_mm256_blend_epi16 is being cast to __m256d instead of __m256i. Fixing this. llvm-svn: 234560	2015-04-10 02:39:45 +00:00
Sanjay Patel	0a6da5de55	[X86, AVX2] Replace inserti128 and extracti128 intrinsics with generic shuffles. This is nearly identical to the v*f128_si256 parts of r231792 and r232052. AVX2 introduced proper integer variants of the hacked integer insert/extract C intrinsics that were created for this same functionality with AVX1. This should complete the front end fixes for insert/extract128 intrinsics. Corresponding LLVM patch to follow. llvm-svn: 232109	2015-03-12 21:54:24 +00:00
Juergen Ributzka	9baa03fc07	Lower _mm256_broadcastsi128_si256 directly to a vector shuffle. Originally we were using the same GCC builtins to lower this AVX2 vector intrinsic. Instead we will now lower it directly to a vector shuffle. This will not only allow LLVM to generate better code, but it will also allow us to remove the GCC intrinsics. Reviewed by Andrea. This is related to rdar://problem/18742778. llvm-svn: 231081	2015-03-03 17:22:53 +00:00
Filipe Cabecinhas	5d289b48b1	Patched clang to emit x86 blends as shufflevectors. Summary: Most of the clang header patch by Simon Pilgrim @ SCEE. Also fixed (or added) clang tests for these intrinsics. LLVM tests to make sure we get the blend instruction out of these shufflevectors are at http://reviews.llvm.org/D3600 Reviewers: eli.friedman, craig.topper, rafael Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D3601 llvm-svn: 208664	2014-05-13 02:37:02 +00:00
Eli Friedman	3cd55f49ab	Fix argument types of some AVX2 intrinsics. This fix makes our headers consistent with gcc. PR17312. llvm-svn: 191248	2013-09-23 23:52:04 +00:00
Juergen Ributzka	2c2dbf4542	Fix the name and the type of the argument for intrinsic _mm256_broadcastsi128_si256 to align with the Intel documentation. This fixes bug PR 16581 and rdar:14747994. llvm-svn: 188609	2013-08-17 16:40:09 +00:00
Richard Smith	49e56440f9	Add missing include guards into headers in lib/Headers. While it may appear that these headers should not be included more than once, they are in fact included twice when building our builtins module (in order for it to generate submodules for them), and without this, any modular build enabling AVX and including any builtin header fails. Testing this is tricky because including any of these headers in a modular build is liable to fail, due to unrelated builtin headers in the same module including headers which might not be available on the system running the tests. Suggestions on that front are welcome (but we're getting close to being able to run a buildbot that has modules enabled for all tests, which would nicely solve the testing problem). llvm-svn: 186275	2013-07-14 05:41:45 +00:00
David Blaikie	3302f2bd46	PR14964: intrinsic headers using non-reserved identifiers. Several of the intrinsic headers were using plain non-reserved identifiers. C++11 17.6.4.3.2 [global.names] p1 reserves names containing a double underscore or beginning with an underscore followed by an uppercase letter for any use. I think I got them all, but open to being corrected. For the most part I didn't bother updating function-like macro parameter names because I don't believe they're subject to any such collision, though some function-like macros already follow this convention (I didn't update them in part because the churn was more significant, as several function-like macros use the double underscore prefixed version of the same name as a parameter in their implementation). llvm-svn: 172666	2013-01-16 23:08:36 +00:00
Manman Ren	f865ba0c0e	X86: add more GATHER intrinsics in Clang. Support the following intrinsics: _mm_i32gather_pd, _mm256_i32gather_pd, _mm_i64gather_pd, _mm256_i64gather_pd, _mm_i32gather_ps, _mm256_i32gather_ps, _mm_i64gather_ps, _mm256_i64gather_ps, _mm_i32gather_epi64, _mm256_i32gather_epi64, _mm_i64gather_epi64, _mm256_i64gather_epi64, _mm_i32gather_epi32, _mm256_i32gather_epi32, _mm_i64gather_epi32, _mm256_i64gather_epi32 llvm-svn: 159410	2012-06-29 05:19:13 +00:00
Manman Ren	86c3250b82	X86: add more GATHER intrinsics in Clang. Corrected type for index of _mm256_mask_i32gather_pd from 256-bit to 128-bit. Corrected types for src|dst|mask of _mm256_mask_i64gather_ps from 256-bit to 128-bit. Support the following intrinsics: _mm_mask_i32gather_epi64, _mm256_mask_i32gather_epi64, _mm_mask_i64gather_epi64, _mm256_mask_i64gather_epi64, _mm_mask_i32gather_epi32, _mm256_mask_i32gather_epi32, _mm_mask_i64gather_epi32, _mm256_mask_i64gather_epi32 llvm-svn: 159403	2012-06-29 00:54:35 +00:00
Manman Ren	add5e9e289	X86: add GATHER intrinsics (AVX2) in Clang. Support the following intrinsics: _mm_mask_i32gather_pd, _mm256_mask_i32gather_pd, _mm_mask_i64gather_pd, _mm256_mask_i64gather_pd, _mm_mask_i32gather_ps, _mm256_mask_i32gather_ps, _mm_mask_i64gather_ps, _mm256_mask_i64gather_ps llvm-svn: 159222	2012-06-26 19:55:09 +00:00
Craig Topper	26e74e50b6	Convert vperm2f128 and vperm2i128 intrinsics back to using llvm intrinsics. Unfortunately, these instructions have behavior that can't be modeled with shuffle vector. llvm-svn: 154906	2012-04-17 05:16:56 +00:00
Craig Topper	8e57855ea0	Change _mm256_permute4x64_epi64 and _mm256_permute4x64_pd to use builtin_shufflevector instead of specific builtins. Old builtins will be removed from llvm now that vpermq/vpermpd are supported by shuffle lowering code. llvm-svn: 154777	2012-04-15 22:18:10 +00:00
Craig Topper	74c17c65e4	Correctly check argument types for some vector macros in smmintrin.h. Put parentheses around uses of vector macro arguments. llvm-svn: 153732	2012-03-30 07:01:17 +00:00
Craig Topper	e5ea3b0239	Remove vperm2f* and vperm2i builtins. Same effect can be achieved with builtin_shufflevector. llvm-svn: 150064	2012-02-08 07:33:36 +00:00
Craig Topper	175543ac78	Add last of the AVX2 intrinsics except for gather. llvm-svn: 147253	2011-12-24 17:20:15 +00:00
Craig Topper	9f00948a82	Add AVX2 permute intrinsics. Also add parentheses on some macro arguments in other intrinsic headers. llvm-svn: 147241	2011-12-24 07:55:14 +00:00
Craig Topper	9479895928	Add AVX2 intrinsics for FP vbroadcast, vbroadcasti128, and vpblendd. llvm-svn: 147239	2011-12-24 05:19:29 +00:00
Craig Topper	a6fdbd1807	Intrinsics for AVX2 unpack instructions. llvm-svn: 147237	2011-12-24 03:58:43 +00:00
Craig Topper	f4bb952533	More AVX2 intrinsics for shift, psign, some shuffles, and psadbw. llvm-svn: 147236	2011-12-24 03:28:57 +00:00
Craig Topper	235a365d58	Add AVX2 multiply intrinsics. llvm-svn: 147219	2011-12-23 08:31:16 +00:00
Craig Topper	1f2460ad43	Add AVX2 intrinsics for max, min, sign extend, and zero extend. llvm-svn: 147141	2011-12-22 09:18:58 +00:00
Craig Topper	a73baa8050	Add a few more AVX2 intrinsics and fix the type strings on a couple SSE intrinsics. llvm-svn: 147048	2011-12-21 08:35:05 +00:00
Craig Topper	3fe5ac40db	Add AVX2 horizontal add/sub intrinsics. llvm-svn: 147047	2011-12-21 08:17:40 +00:00
Craig Topper	a89747dd1e	Add AVX2 intrinsics for pavg, pblend, and pcmp instructions. Also remove unneeded builtins for SSE pcmp. Change SSE pcmpeqq and pcmpgtq to not use builtins and just use vector == and >. llvm-svn: 146969	2011-12-20 09:55:26 +00:00
Craig Topper	a557e1c122	Add AVX2 intrinsics for and, andn, or, and xor. llvm-svn: 146862	2011-12-19 09:03:48 +00:00
Craig Topper	94aba2c260	More AVX2 intrinsic support including saturating add/sub and palignr. llvm-svn: 146857	2011-12-19 07:03:25 +00:00
Craig Topper	dec792ebb5	Begin adding AVX2 intrinsics. Necessitated increasing the number of bits used to store builtinID when serializing identifier table. llvm-svn: 146855	2011-12-19 05:04:33 +00:00

33 Commits