llvm-project

Commit Graph

Author	SHA1	Message	Date
Evan Cheng	25b09295f8	Added X86 SSE2 intrinsics which can be represented as vector_shuffles. This is a temporary workaround for the 2-wide vector_shuffle problem (i.e. its mask would have type v2i32 which is not legal). llvm-svn: 27964	2006-04-24 23:34:56 +00:00
Evan Cheng	d03631ee76	Add a new entry. llvm-svn: 27963	2006-04-24 23:30:10 +00:00
Evan Cheng	5c2bfb069e	Special case handling two wide build_vector(0, x). llvm-svn: 27961	2006-04-24 22:58:52 +00:00
Evan Cheng	63bd4d3730	Some missing movlps, movhps, movlpd, and movhpd patterns. llvm-svn: 27960	2006-04-24 21:58:20 +00:00
Evan Cheng	b0461080e4	A little bit more build_vector enhancement for v8i16 cases. llvm-svn: 27959	2006-04-24 18:01:45 +00:00
Evan Cheng	2f9b0bcbd5	Remove a completed entry. llvm-svn: 27958	2006-04-24 17:38:16 +00:00
Evan Cheng	ab0ee6340c	MakeMIInst() should handle jump table index operands. llvm-svn: 27955	2006-04-24 05:37:35 +00:00
Chris Lattner	f110527a29	Add a note llvm-svn: 27954	2006-04-23 19:47:09 +00:00
Evan Cheng	b4f31dd1a8	MOVL shuffle (i.e. movd or movss / movsd from memory) of undef, V2 == V2 llvm-svn: 27953	2006-04-23 06:35:19 +00:00
Nate Begeman	9f0b13c885	Optimized stores to the constant pool, while cool, are unnecessary. llvm-svn: 27948	2006-04-22 22:31:45 +00:00
Nate Begeman	4ca2ea5b43	JumpTable support! What this represents is working asm and jit support for x86 and ppc for 100% dense switch statements when relocations are non-PIC. This support will be extended and enhanced in the coming days to support PIC, and less dense forms of jump tables. llvm-svn: 27947	2006-04-22 18:53:45 +00:00
Evan Cheng	e728efdfce	Don't do all the lowering stuff for 2-wide build_vector's. Also, minor optimization for shuffle of undef. llvm-svn: 27946	2006-04-22 08:34:05 +00:00
Evan Cheng	16ef94f4e8	Fix a performance regression. Use {p}shuf* when there are only two distinct elements in a build_vector. llvm-svn: 27945	2006-04-22 06:21:46 +00:00
Chris Lattner	c8afdfec52	Teach the JIT how to relocate LI, this fixes the JIT on Prolangs-C/TimberWolfMC llvm-svn: 27943	2006-04-22 06:17:56 +00:00
Evan Cheng	14215c36b6	Revamp build_vector lowering to take advantage of movss and movd instructions. movd always clear the top 96 bits and movss does so when it's loading the value from memory. The net result is codegen for 4-wide shuffles is much improved. It is near optimal if one or more elements is a zero. e.g. __m128i test(int a, int b) { return _mm_set_epi32(0, 0, b, a); } compiles to _test: movd 8(%esp), %xmm1 movd 4(%esp), %xmm0 punpckldq %xmm1, %xmm0 ret compare to gcc: _test: subl $12, %esp movd 20(%esp), %xmm0 movd 16(%esp), %xmm1 punpckldq %xmm0, %xmm1 movq %xmm1, %xmm0 movhps LC0, %xmm0 addl $12, %esp ret or icc: _test: movd 4(%esp), %xmm0 #5.10 movd 8(%esp), %xmm3 #5.10 xorl %eax, %eax #5.10 movd %eax, %xmm1 #5.10 punpckldq %xmm1, %xmm0 #5.10 movd %eax, %xmm2 #5.10 punpckldq %xmm2, %xmm3 #5.10 punpckldq %xmm3, %xmm0 #5.10 ret #5.10 There are still room for improvement, for example the FP variant of the above example: __m128 test(float a, float b) { return _mm_set_ps(0.0, 0.0, b, a); } _test: movss 8(%esp), %xmm1 movss 4(%esp), %xmm0 unpcklps %xmm1, %xmm0 xorps %xmm1, %xmm1 movlhps %xmm1, %xmm0 ret The xorps and movlhps are unnecessary. This will require post legalizer optimization to handle. llvm-svn: 27939	2006-04-21 23:03:30 +00:00
Nate Begeman	57a32f0bc1	Fix the comment llvm-svn: 27938	2006-04-21 22:11:27 +00:00
Nate Begeman	516b393992	Change the PPC JIT to use a Static relocation model llvm-svn: 27937	2006-04-21 22:04:15 +00:00
Chris Lattner	3e62d4b289	fix thinko llvm-svn: 27935	2006-04-21 21:05:22 +00:00
Chris Lattner	e1f9ab7d53	add some low-prio notes llvm-svn: 27934	2006-04-21 21:03:21 +00:00
Evan Cheng	e8b5180044	Now generating perfect (I think) code for "vector set" with a single non-zero scalar value. e.g. _mm_set_epi32(0, a, 0, 0); ==> movd 4(%esp), %xmm0 pshufd $69, %xmm0, %xmm0 _mm_set_epi8(0, 0, 0, 0, 0, a, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0); ==> movzbw 4(%esp), %ax movzwl %ax, %eax pxor %xmm0, %xmm0 pinsrw $5, %eax, %xmm0 llvm-svn: 27923	2006-04-21 01:05:10 +00:00
Chris Lattner	99d3da9d2c	Fix the CodeGen/PowerPC/buildvec_canonicalize.ll regression last night. llvm-svn: 27908	2006-04-20 19:01:30 +00:00
Chris Lattner	d1c3a067ee	add a note llvm-svn: 27907	2006-04-20 18:49:28 +00:00
Chris Lattner	3e5521799c	remove some v9 specific code llvm-svn: 27900	2006-04-20 18:33:11 +00:00
Chris Lattner	2a875285f7	Remove this obsolete file llvm-svn: 27895	2006-04-20 18:16:45 +00:00
Chris Lattner	ac61195539	This target is no longer built. The ,v files now live in the reoptimizer. llvm-svn: 27885	2006-04-20 17:15:44 +00:00
Evan Cheng	60f0b8998e	- Added support to turn "vector clear elements", e.g. pand V, <-1, -1, 0, -1> to a vector shuffle. - VECTOR_SHUFFLE lowering change in preparation for more efficient codegen of vector shuffle with zero (or any splat) vector. llvm-svn: 27875	2006-04-20 08:58:49 +00:00
Chris Lattner	0cd0065c58	Make sure that the new instructions selected have the right type. This fixes CodeGen/PowerPC/2006-04-19-vmaddfp-crash.ll llvm-svn: 27868	2006-04-20 05:58:10 +00:00
Evan Cheng	15c264b753	Handle v2i64 BUILD_VECTOR custom lowering correctly. v2i64 is a legal type, but i64 is not. If possible, change a i64 op to a f64 (e.g. load, constant) and then cast it back. llvm-svn: 27849	2006-04-20 00:11:39 +00:00
Evan Cheng	4a1b0d3292	isSplatMask() bug: first element can be an undef. llvm-svn: 27847	2006-04-19 23:28:59 +00:00
Evan Cheng	a3caaee503	- Added support to do aribitrary 4 wide shuffle with no more than three instructions. - Fixed a commute vector_shuff bug. llvm-svn: 27845	2006-04-19 22:48:17 +00:00
Evan Cheng	6d5297dac3	Prefer {p}unpack* and movdup over {p}shuf as well. llvm-svn: 27844	2006-04-19 21:15:24 +00:00
Evan Cheng	52df74000a	Renamed AddedCost to AddedComplexity. llvm-svn: 27843	2006-04-19 20:38:28 +00:00
Evan Cheng	b416a25174	- Renamed AddedCost to AddedComplexity. - Added more movhlps and movlhps patterns. llvm-svn: 27842	2006-04-19 20:37:34 +00:00
Evan Cheng	7855e4d032	Commute vector_shuffle to match more movlhps, movlp{s\|d} cases. llvm-svn: 27840	2006-04-19 20:35:22 +00:00
Evan Cheng	cc7abc6c38	More mov{h\|l}p{d\|s} patterns. llvm-svn: 27836	2006-04-19 18:20:17 +00:00
Evan Cheng	aeb09ccdd3	- More mov{h\|l}ps patterns. - Increase cost (complexity) of patterns which match mov{h\|l}ps ops. These are preferred over shufps in most cases. llvm-svn: 27835	2006-04-19 18:11:52 +00:00
Evan Cheng	aa3325e925	Allow "let AddedCost = n in" to increase pattern complexity. llvm-svn: 27834	2006-04-19 18:07:24 +00:00
Chris Lattner	05bbec5020	add a note llvm-svn: 27832	2006-04-19 16:22:38 +00:00
Chris Lattner	a922a516b0	add a note llvm-svn: 27828	2006-04-19 05:55:06 +00:00
Chris Lattner	bfab82817a	Add a note. llvm-svn: 27827	2006-04-19 05:53:27 +00:00
Evan Cheng	3823aa1d0f	- PEXTRW cannot take a memory location as its first source operand. - PINSRWrmi encoding bug. llvm-svn: 27818	2006-04-18 21:59:43 +00:00
Evan Cheng	43f4ef4ffb	SHUFP{S\|D}, PSHUF* encoding bugs. Left out the mask immediate operand. llvm-svn: 27817	2006-04-18 21:56:36 +00:00
Evan Cheng	a179ea631d	Name change for clarity sake llvm-svn: 27816	2006-04-18 21:55:35 +00:00
Evan Cheng	09e36ef710	Encoding bug: CMPPSrmi, CMPPDrmi dropped operand 2 (condtion immediate). llvm-svn: 27815	2006-04-18 21:31:08 +00:00
Evan Cheng	d799d680f4	Name change for clarity sake llvm-svn: 27814	2006-04-18 21:29:50 +00:00
Evan Cheng	0ee281f37c	Left a pattern out llvm-svn: 27813	2006-04-18 21:29:08 +00:00
Chris Lattner	34c901b50e	These are correctly encoded by the JIT. I checked :) llvm-svn: 27810	2006-04-18 19:03:38 +00:00
Chris Lattner	197d762232	add a note llvm-svn: 27809	2006-04-18 18:30:19 +00:00
Chris Lattner	518834c67e	Fix a crash on: void foo2(vector float A, vector float B) { vector float C = (vector float)vec_cmpeq(A, B); if (!vec_any_eq(A, B)) B = (vector float){0,0,0,0}; A = C; } llvm-svn: 27808	2006-04-18 18:28:22 +00:00
Evan Cheng	e2d25a1a50	Fixed an encoding bug: movd from XMM to R32. llvm-svn: 27807	2006-04-18 18:19:00 +00:00

1 2 3 4 5 ...

5231 Commits