llvm-project

Commit Graph

Author	SHA1	Message	Date
Eric Christopher	cd875efa78	Canonicalize some of the x86 builtin tests and either remove or comment about optimization options. llvm-svn: 250271	2015-10-14 05:40:21 +00:00
Ahmed Bougacha	7dfaaf3891	[Headers][X86] Fix stream_load (movntdqa) to accept const. Per Intel intrinsics guide: - _mm256_stream_load_si256 takes `__m256i const ' - _mm_stream_load_si128 takes `__m128i ', for no good reason. Let's accept const for both. llvm-svn: 249213	2015-10-02 23:29:26 +00:00
Chandler Carruth	9143378db0	Patch over a really horrible bug in our vector builtins that showed up recently when we started using direct conversion to model sign extension. The __v16qi type we use for SSE v16i8 vectors is defined in terms of 'char' which may or may not be signed! This causes us to generate pmovsx and pmovzx depending on the setting of -funsigned-char. This patch just forms an explicitly signed type and uses that to formulate the sign extension. While this gets the correct behavior (which we now verify with the enhanced test) this is just the tip of the ice berg. Now that I know what to look for, I have found errors of this sort throughout our vector code. Fortunately, this is the only specific place where I know of users actively having their code miscompiled by Clang due to this, so I'm keeping the fix for those users minimal and targeted. I'll be sending a proper email for discussion of how to fix these systematically, what the implications are, and just how widely broken this is... From what I can tell, we have never shipped a correct set of builtin headers for x86 when users rely on -funsigned-char. Oops. llvm-svn: 248980	2015-10-01 02:21:34 +00:00
Simon Pilgrim	12919f7e49	[X86][SSE] Replace 128-bit SSE41 PMOVSX intrinsics with native IR 128-bit vector integer sign extensions correctly lower to the pmovsx instructions even for debug builds. This patch removes the builtins and reimplements the _mm_cvtepi_epi intrinsics __using builtin_shufflevector (to extract the bottom most subvector) and __builtin_convertvector (to actually perform the sign extension). Differential Revision: http://reviews.llvm.org/D12835 llvm-svn: 248092	2015-09-19 15:12:38 +00:00
Simon Pilgrim	ff88a0da31	[X86]][SSE3] Added SSE41 IR + assembly codegen builtin tests Transferred SSE41 instructions from sse-builtins.c llvm-svn: 246947	2015-09-06 16:38:17 +00:00

5 Commits