llvm-project

Commit Graph

Author	SHA1	Message	Date
Reid Spencer	9597684ba7	Remove traces of Burg utility now that its gone and not needed. llvm-svn: 27902	2006-04-20 18:42:24 +00:00
Reid Spencer	06502b6135	Burg not needed any more now that SparcV9 is gone. llvm-svn: 27901	2006-04-20 18:39:19 +00:00
Chris Lattner	3e5521799c	remove some v9 specific code llvm-svn: 27900	2006-04-20 18:33:11 +00:00
Chris Lattner	dcc1f995eb	This field no longer exists llvm-svn: 27899	2006-04-20 18:32:41 +00:00
Chris Lattner	778509c844	Don't fill in fields that no longer exist. llvm-svn: 27898	2006-04-20 18:32:22 +00:00
Chris Lattner	f2a5922fa9	Remove a bunch of dead stuff, shrinkifying TargetInstrDescriptor significantly. llvm-svn: 27897	2006-04-20 18:32:02 +00:00
Chris Lattner	7d7ed24b96	Remove some obsolete interfaces llvm-svn: 27896	2006-04-20 18:17:21 +00:00
Chris Lattner	2a875285f7	Remove this obsolete file llvm-svn: 27895	2006-04-20 18:16:45 +00:00
Chris Lattner	862755b95b	Remove some of the obvious v9-specific cruft llvm-svn: 27894	2006-04-20 18:09:13 +00:00
Chris Lattner	a38c3580bd	Remove some of the obvious V9-specific cruft llvm-svn: 27893	2006-04-20 18:08:53 +00:00
Evan Cheng	73b12f2f53	Vector extract element test case. llvm-svn: 27892	2006-04-20 17:59:30 +00:00
Chris Lattner	d5737be0f0	Remove V9 jit support llvm-svn: 27891	2006-04-20 17:52:00 +00:00
Evan Cheng	aecd41384f	Vector insert test case. llvm-svn: 27890	2006-04-20 17:50:10 +00:00
Chris Lattner	ec86eace63	allow this dir to get pruned llvm-svn: 27889	2006-04-20 17:45:33 +00:00
Chris Lattner	1798687dba	Remove this target's reg tests llvm-svn: 27888	2006-04-20 17:44:51 +00:00
Chris Lattner	5197dfadf2	Fails with all sparcs llvm-svn: 27887	2006-04-20 17:43:41 +00:00
Chris Lattner	7991e85b2e	Remove V9 llvm-svn: 27886	2006-04-20 17:42:23 +00:00
Chris Lattner	ac61195539	This target is no longer built. The ,v files now live in the reoptimizer. llvm-svn: 27885	2006-04-20 17:15:44 +00:00
Chris Lattner	53f4499b22	Never link in sparcv9 llvm-svn: 27884	2006-04-20 17:07:46 +00:00
Chris Lattner	8fe3dbceb0	Never build SparcV9 llvm-svn: 27883	2006-04-20 17:01:19 +00:00
Chris Lattner	d0a3a32eae	remove a dead prototype llvm-svn: 27882	2006-04-20 15:45:54 +00:00
Andrew Lenharth	f89e630b2f	Make code match cvs commit message :) llvm-svn: 27881	2006-04-20 15:41:37 +00:00
Andrew Lenharth	61eae29ad6	If we can convert the return pointer type into an integer that IntPtrType can be converted to losslessly, we can continue the conversion to a direct call. llvm-svn: 27880	2006-04-20 14:56:47 +00:00
Andrew Lenharth	b950dbea0b	can we cast between pointers and IntPtrType llvm-svn: 27879	2006-04-20 14:54:17 +00:00
Reid Spencer	8923c09997	Add a missing =back to eliminate error. llvm-svn: 27878	2006-04-20 14:17:47 +00:00
Evan Cheng	3ee104c852	v16i8 splat with 2 punpcklbw and a single pshufd. llvm-svn: 27877	2006-04-20 09:05:16 +00:00
Evan Cheng	f2c5fe9139	Another shuffle test. For 4-wide shuffle, no more than 3 {p}shuf*. llvm-svn: 27876	2006-04-20 09:01:54 +00:00
Evan Cheng	60f0b8998e	- Added support to turn "vector clear elements", e.g. pand V, <-1, -1, 0, -1> to a vector shuffle. - VECTOR_SHUFFLE lowering change in preparation for more efficient codegen of vector shuffle with zero (or any splat) vector. llvm-svn: 27875	2006-04-20 08:58:49 +00:00
Evan Cheng	a320abc494	Turn a VAND into a VECTOR_SHUFFLE is applicable. DAG combiner can turn a VAND V, <-1, 0, -1, -1>, i.e. vector clear elements, into a vector shuffle with a zero vector. It only does so when TLI tells it the xform is profitable. llvm-svn: 27874	2006-04-20 08:56:16 +00:00
Evan Cheng	8d6c229f8c	Added a virtual method isVectorClearMaskLegal to TLI. It is similar to isShuffleMaskLegal, used to determine if it makes sense to turn a "vector clear" (e.g. pand V, <0, -1, 0, -1> to a shuffle of the vector and a zero vector. llvm-svn: 27873	2006-04-20 08:54:13 +00:00
Evan Cheng	2bd632a02a	Added a test case for , e.g. xform pand <0, 0, -1, -1> to a shuffle. llvm-svn: 27872	2006-04-20 08:51:03 +00:00
Evan Cheng	059676f77b	Added a movhlps, movlhps test case. llvm-svn: 27871	2006-04-20 08:47:47 +00:00
Chris Lattner	171db236e5	Don't hardcode in 1.5 for the website, just use 'CVS'. llvm-svn: 27870	2006-04-20 06:24:16 +00:00
Chris Lattner	a23665ee78	This is old, out of date, and isn't linked to by anything. llvm-svn: 27869	2006-04-20 06:15:48 +00:00
Chris Lattner	0cd0065c58	Make sure that the new instructions selected have the right type. This fixes CodeGen/PowerPC/2006-04-19-vmaddfp-crash.ll llvm-svn: 27868	2006-04-20 05:58:10 +00:00
Chris Lattner	4ae41a3556	New testcase for a codegen crash llvm-svn: 27867	2006-04-20 05:57:43 +00:00
Tanya Lattner	cf46848098	Changing domain name llvm-svn: 27865	2006-04-20 05:51:53 +00:00
Chris Lattner	bc1b262725	Implement folding of a bunch of binops with undef llvm-svn: 27863	2006-04-20 05:39:12 +00:00
Chris Lattner	499abb5695	Update llvmgcc4 tarball names llvm-svn: 27861	2006-04-20 05:08:23 +00:00
Tanya Lattner	3687d08f8e	Removed listing of llvm releases after 1.4, and said "1.4 and newer" llvm-svn: 27860	2006-04-20 05:05:12 +00:00
Tanya Lattner	5a423b3c4f	Made warning red. llvm-svn: 27859	2006-04-20 04:57:19 +00:00
Tanya Lattner	011f7359d6	Document is out of date.. added warning and link to llvm-config. llvm-svn: 27858	2006-04-20 04:55:50 +00:00
Tanya Lattner	1a96c7158c	Fixed up comment on xfail for llvmgcc version. llvm-svn: 27857	2006-04-20 04:47:55 +00:00
Tanya Lattner	79674968b6	Added note about being able to XFAIL based on llvmgcc version. llvm-svn: 27856	2006-04-20 04:45:59 +00:00
Tanya Lattner	3962e5965a	Removed cvs mirror comment llvm-svn: 27855	2006-04-20 04:38:16 +00:00
Tanya Lattner	fb76291234	Minor fixes for the release. llvm-svn: 27854	2006-04-20 04:35:34 +00:00
Chris Lattner	6400606664	This has been fixed! Thanks Reid. llvm-svn: 27853	2006-04-20 04:24:28 +00:00
Chris Lattner	d05559828a	Yeah that's right! llvm-svn: 27852	2006-04-20 04:22:06 +00:00
Chris Lattner	70dfe24866	Fixes from Tanya llvm-svn: 27851	2006-04-20 04:01:31 +00:00
Reid Spencer	df65ba121b	Add in missing #defines for _OpenBSD_ systems. llvm-svn: 27850	2006-04-20 00:18:39 +00:00
Evan Cheng	15c264b753	Handle v2i64 BUILD_VECTOR custom lowering correctly. v2i64 is a legal type, but i64 is not. If possible, change a i64 op to a f64 (e.g. load, constant) and then cast it back. llvm-svn: 27849	2006-04-20 00:11:39 +00:00
Reid Spencer	48b9203a40	Allow OpenBSD to be recognized as a UNIX platform. llvm-svn: 27848	2006-04-19 23:47:16 +00:00
Evan Cheng	4a1b0d3292	isSplatMask() bug: first element can be an undef. llvm-svn: 27847	2006-04-19 23:28:59 +00:00
Chris Lattner	73eb58e1a2	Simplify some code llvm-svn: 27846	2006-04-19 23:17:50 +00:00
Evan Cheng	a3caaee503	- Added support to do aribitrary 4 wide shuffle with no more than three instructions. - Fixed a commute vector_shuff bug. llvm-svn: 27845	2006-04-19 22:48:17 +00:00
Evan Cheng	6d5297dac3	Prefer {p}unpack* and movdup over {p}shuf as well. llvm-svn: 27844	2006-04-19 21:15:24 +00:00
Evan Cheng	52df74000a	Renamed AddedCost to AddedComplexity. llvm-svn: 27843	2006-04-19 20:38:28 +00:00
Evan Cheng	b416a25174	- Renamed AddedCost to AddedComplexity. - Added more movhlps and movlhps patterns. llvm-svn: 27842	2006-04-19 20:37:34 +00:00
Evan Cheng	9235d848b7	Rename AddedCost to AddedComplexity. llvm-svn: 27841	2006-04-19 20:36:09 +00:00
Evan Cheng	7855e4d032	Commute vector_shuffle to match more movlhps, movlp{s\|d} cases. llvm-svn: 27840	2006-04-19 20:35:22 +00:00
Chris Lattner	f2f4aedc6e	Final piece to get relinked .o files buildable universal on Darwin. llvm-svn: 27839	2006-04-19 18:45:29 +00:00
Chris Lattner	7d17a77d5e	Regenerate llvm-svn: 27838	2006-04-19 18:38:19 +00:00
Chris Lattner	b3305fb203	When on darwin, compiler_flags need to be percolated down to the 'gcc -r' command line so that relinked .o files can be built universal. llvm-svn: 27837	2006-04-19 18:34:41 +00:00
Evan Cheng	cc7abc6c38	More mov{h\|l}p{d\|s} patterns. llvm-svn: 27836	2006-04-19 18:20:17 +00:00
Evan Cheng	aeb09ccdd3	- More mov{h\|l}ps patterns. - Increase cost (complexity) of patterns which match mov{h\|l}ps ops. These are preferred over shufps in most cases. llvm-svn: 27835	2006-04-19 18:11:52 +00:00
Evan Cheng	aa3325e925	Allow "let AddedCost = n in" to increase pattern complexity. llvm-svn: 27834	2006-04-19 18:07:24 +00:00
Chris Lattner	7b96902d35	Alpha too! llvm-svn: 27833	2006-04-19 17:20:48 +00:00
Chris Lattner	05bbec5020	add a note llvm-svn: 27832	2006-04-19 16:22:38 +00:00
Andrew Lenharth	02f9df3b7b	Another simple case type merge case to try llvm-svn: 27831	2006-04-19 15:34:34 +00:00
Andrew Lenharth	edf349aba6	deal with memchr llvm-svn: 27830	2006-04-19 15:34:02 +00:00
Andrew Lenharth	7f2cee3d3e	friendlier error message llvm-svn: 27829	2006-04-19 15:33:35 +00:00
Chris Lattner	a922a516b0	add a note llvm-svn: 27828	2006-04-19 05:55:06 +00:00
Chris Lattner	bfab82817a	Add a note. llvm-svn: 27827	2006-04-19 05:53:27 +00:00
Chris Lattner	263fc2a642	grammaro llvm-svn: 27826	2006-04-19 04:21:57 +00:00
Chris Lattner	cb2170fb37	Fix a bug owen noticed llvm-svn: 27825	2006-04-19 04:21:16 +00:00
Chris Lattner	c3e92b56a3	Change wording llvm-svn: 27824	2006-04-19 04:12:01 +00:00
Chris Lattner	e9e46a746d	add a note llvm-svn: 27823	2006-04-19 04:05:21 +00:00
Chris Lattner	0b7fd71e1c	add some more notes llvm-svn: 27822	2006-04-19 04:02:47 +00:00
Andrew Lenharth	7c8be502e9	stupid stuff llvm-svn: 27821	2006-04-19 03:45:25 +00:00
Andrew Lenharth	2bdd6fe9ef	fix printing call graphs llvm-svn: 27820	2006-04-18 23:45:19 +00:00
Andrew Lenharth	3e642d012a	I understand now. Shoot. llvm-svn: 27819	2006-04-18 22:36:11 +00:00
Evan Cheng	3823aa1d0f	- PEXTRW cannot take a memory location as its first source operand. - PINSRWrmi encoding bug. llvm-svn: 27818	2006-04-18 21:59:43 +00:00
Evan Cheng	43f4ef4ffb	SHUFP{S\|D}, PSHUF* encoding bugs. Left out the mask immediate operand. llvm-svn: 27817	2006-04-18 21:56:36 +00:00
Evan Cheng	a179ea631d	Name change for clarity sake llvm-svn: 27816	2006-04-18 21:55:35 +00:00
Evan Cheng	09e36ef710	Encoding bug: CMPPSrmi, CMPPDrmi dropped operand 2 (condtion immediate). llvm-svn: 27815	2006-04-18 21:31:08 +00:00
Evan Cheng	d799d680f4	Name change for clarity sake llvm-svn: 27814	2006-04-18 21:29:50 +00:00
Evan Cheng	0ee281f37c	Left a pattern out llvm-svn: 27813	2006-04-18 21:29:08 +00:00
Andrew Lenharth	f70cb84083	llvm.memc* improvements. helps PA a lot in some specmarks llvm-svn: 27812	2006-04-18 20:59:52 +00:00
Andrew Lenharth	49e188d7f7	llvm.memc* improvements. helps PA a lot in some specmarks llvm-svn: 27811	2006-04-18 19:54:11 +00:00
Chris Lattner	34c901b50e	These are correctly encoded by the JIT. I checked :) llvm-svn: 27810	2006-04-18 19:03:38 +00:00
Chris Lattner	197d762232	add a note llvm-svn: 27809	2006-04-18 18:30:19 +00:00
Chris Lattner	518834c67e	Fix a crash on: void foo2(vector float A, vector float B) { vector float C = (vector float)vec_cmpeq(A, B); if (!vec_any_eq(A, B)) B = (vector float){0,0,0,0}; A = C; } llvm-svn: 27808	2006-04-18 18:28:22 +00:00
Evan Cheng	e2d25a1a50	Fixed an encoding bug: movd from XMM to R32. llvm-svn: 27807	2006-04-18 18:19:00 +00:00
Chris Lattner	1e174c87c3	pretty print node name llvm-svn: 27806	2006-04-18 18:05:58 +00:00
Chris Lattner	9754d142a4	Implement an important entry from README_ALTIVEC: If an altivec predicate compare is used immediately by a branch, don't use a (serializing) MFCR instruction to read the CR6 register, which requires a compare to get it back to CR's. Instead, just branch on CR6 directly. :) For example, for: void foo2(vector float A, vector float B) { if (!vec_any_eq(A, B)) *B = (vector float){0,0,0,0}; } We now generate: _foo2: mfspr r2, 256 oris r5, r2, 12288 mtspr 256, r5 lvx v2, 0, r4 lvx v3, 0, r3 vcmpeqfp. v2, v3, v2 bne cr6, LBB1_2 ; UnifiedReturnBlock LBB1_1: ; cond_true vxor v2, v2, v2 stvx v2, 0, r4 mtspr 256, r2 blr LBB1_2: ; UnifiedReturnBlock mtspr 256, r2 blr instead of: _foo2: mfspr r2, 256 oris r5, r2, 12288 mtspr 256, r5 lvx v2, 0, r4 lvx v3, 0, r3 vcmpeqfp. v2, v3, v2 mfcr r3, 2 rlwinm r3, r3, 27, 31, 31 cmpwi cr0, r3, 0 beq cr0, LBB1_2 ; UnifiedReturnBlock LBB1_1: ; cond_true vxor v2, v2, v2 stvx v2, 0, r4 mtspr 256, r2 blr LBB1_2: ; UnifiedReturnBlock mtspr 256, r2 blr This implements CodeGen/PowerPC/vec_br_cmp.ll. llvm-svn: 27804	2006-04-18 17:59:36 +00:00
Chris Lattner	11a9ac51e8	new testcase llvm-svn: 27803	2006-04-18 17:56:30 +00:00
Chris Lattner	68c16a201e	move some stuff around, clean things up llvm-svn: 27802	2006-04-18 17:52:36 +00:00
Chris Lattner	bfc2c68386	Teach the codegen about instructions used for SSE spill code, allowing it to optimize cases where it has to spill a lot llvm-svn: 27801	2006-04-18 16:44:51 +00:00
Nate Begeman	f776fc2c98	Fix a copy & paste error from long ago. llvm-svn: 27800	2006-04-18 16:03:18 +00:00
Chris Lattner	89e761c19d	Add some more notes, many still missing llvm-svn: 27799	2006-04-18 06:32:08 +00:00
Reid Spencer	b687ce80cd	Have the AutoRegen.sh script prompt the user for the LLVM src and obj directories if it can't find them. Then, replace those values into the configure.ac script and pass them to the LLVM_CONFIG_PROJECT so that the values become the default for llvm_src and llvm_obj variables. In this way the user is required to input this exactly once, and the scripts take it from there. llvm-svn: 27798	2006-04-18 06:27:47 +00:00
Reid Spencer	c81081ab5e	Make it possible to default the llvm_src and llvm_obj variables based on the arguments to the macro. This better supports the AutoRegen.sh script in projects/sample/autoconf. llvm-svn: 27797	2006-04-18 06:25:37 +00:00
Chris Lattner	9f87173df3	add a bunch of stuff, pieces still missing llvm-svn: 27796	2006-04-18 06:18:36 +00:00
Chris Lattner	9232c8c1c5	Add a warning. llvm-svn: 27795	2006-04-18 05:31:20 +00:00
Chris Lattner	3af67456dd	Add a warning llvm-svn: 27794	2006-04-18 05:26:10 +00:00
Chris Lattner	96d50487c9	Use vmladduhm to do v8i16 multiplies which is faster and simpler than doing even/odd halves. Thanks to Nate telling me what's what. llvm-svn: 27793	2006-04-18 04:28:57 +00:00
Chris Lattner	d6d82aa889	Implement v16i8 multiply with this code: vmuloub v5, v3, v2 vmuleub v2, v3, v2 vperm v2, v2, v5, v4 This implements CodeGen/PowerPC/vec_mul.ll. With this, v16i8 multiplies are 6.79x faster than before. Overall, UnitTests/Vector/multiplies.c is now 2.45x faster with LLVM than with GCC. Remove the 'integer multiplies' todo from the README file. llvm-svn: 27792	2006-04-18 03:57:35 +00:00
Chris Lattner	48786e4887	Add tests for v8i16 and v16i8 llvm-svn: 27791	2006-04-18 03:54:50 +00:00
Evan Cheng	4d36a36900	Correct comments llvm-svn: 27790	2006-04-18 03:45:01 +00:00
Chris Lattner	7e439874cb	Lower v8i16 multiply into this code: li r5, lo16(LCPI1_0) lis r6, ha16(LCPI1_0) lvx v4, r6, r5 vmulouh v5, v3, v2 vmuleuh v2, v3, v2 vperm v2, v2, v5, v4 where v4 is: LCPI1_0: ; <16 x ubyte> .byte 2 .byte 3 .byte 18 .byte 19 .byte 6 .byte 7 .byte 22 .byte 23 .byte 10 .byte 11 .byte 26 .byte 27 .byte 14 .byte 15 .byte 30 .byte 31 This is 5.07x faster on the G5 (measured) than lowering to scalar code + loads/stores. llvm-svn: 27789	2006-04-18 03:43:48 +00:00
Chris Lattner	a2cae1bb10	Custom lower v4i32 multiplies into a cute sequence, instead of having legalize scalarize the sequence into 4 mullw's and a bunch of load/store traffic. This speeds up v4i32 multiplies 4.1x (measured) on a G5. This implements PowerPC/vec_mul.ll llvm-svn: 27788	2006-04-18 03:24:30 +00:00
Chris Lattner	2dea154035	new testcase llvm-svn: 27787	2006-04-18 03:22:16 +00:00
Evan Cheng	0ef233509b	Another entry llvm-svn: 27786	2006-04-18 01:22:57 +00:00
Chris Lattner	3db2056315	Fix a build failure on Vladimir's tester. llvm-svn: 27785	2006-04-18 00:21:25 +00:00
Evan Cheng	e008bd3d27	Another entry. llvm-svn: 27784	2006-04-18 00:21:01 +00:00
Evan Cheng	5421206c4b	Use movss to insert_vector_elt(v, s, 0). llvm-svn: 27782	2006-04-17 22:45:49 +00:00
Chris Lattner	36dd7c98d1	Turn x86 unaligned load/store intrinsics into aligned load/store instructions if the pointer is known aligned. llvm-svn: 27781	2006-04-17 22:26:56 +00:00
Chris Lattner	916ae0775e	Fix handling of calls in functions that use vectors. This fixes a crash on the code in GCC PR26546. llvm-svn: 27780	2006-04-17 22:10:08 +00:00
Evan Cheng	6e5e205841	Use two pinsrw to insert an element into v4i32 / v4f32 vector. llvm-svn: 27779	2006-04-17 22:04:06 +00:00
Chris Lattner	63a5cdc423	remove done item llvm-svn: 27778	2006-04-17 21:52:03 +00:00
Chris Lattner	6bd68ae81e	Don't diddle VRSAVE if no registers need to be added/removed from it. This allows us to codegen functions as: _test_rol: vspltisw v2, -12 vrlw v2, v2, v2 blr instead of: _test_rol: mfvrsave r2, 256 mr r3, r2 mtvrsave r3 vspltisw v2, -12 vrlw v2, v2, v2 mtvrsave r2 blr Testcase here: CodeGen/PowerPC/vec_vrsave.ll llvm-svn: 27777	2006-04-17 21:48:13 +00:00
Chris Lattner	efe2b3f2fc	New testcase, shouldn't touch vrsave llvm-svn: 27776	2006-04-17 21:48:03 +00:00
Chris Lattner	bec79b4a59	Add a MachineInstr::eraseFromParent convenience method. llvm-svn: 27775	2006-04-17 21:35:41 +00:00
Chris Lattner	9fcad09b1b	Add some convenience methods. llvm-svn: 27774	2006-04-17 21:35:08 +00:00
Evan Cheng	22c06f054b	Encoding bug llvm-svn: 27773	2006-04-17 21:33:57 +00:00
Chris Lattner	72d7c27069	Vectors that are known live-in and live-out are clearly already marked in the vrsave register for the caller. This allows us to codegen a function as: _test_rol: mfspr r2, 256 mr r3, r2 mtspr 256, r3 vspltisw v2, -12 vrlw v2, v2, v2 mtspr 256, r2 blr instead of: _test_rol: mfspr r2, 256 oris r3, r2, 40960 mtspr 256, r3 vspltisw v0, -12 vrlw v2, v0, v0 mtspr 256, r2 blr llvm-svn: 27772	2006-04-17 21:22:06 +00:00
Chris Lattner	14c4972b6d	Prefer to allocate V2-V5 before V0,V1. This lets us generate code like this: vspltisw v2, -12 vrlw v2, v2, v2 instead of: vspltisw v0, -12 vrlw v2, v0, v0 when a function is returning a value. llvm-svn: 27771	2006-04-17 21:19:12 +00:00
Chris Lattner	6df094b4ab	Move some knowledge about registers out of the code emitter into the register info. llvm-svn: 27770	2006-04-17 21:07:20 +00:00
Chris Lattner	0f28d48da2	Use a small table instead of macros to do this conversion. llvm-svn: 27769	2006-04-17 20:59:25 +00:00
Evan Cheng	5022b3426e	Implement v8i16, v16i8 splat using unpckl + pshufd. llvm-svn: 27768	2006-04-17 20:43:08 +00:00
Chris Lattner	c070c621ac	implement returns of a vector, testcase here: CodeGen/X86/vec_return.ll llvm-svn: 27767	2006-04-17 20:32:50 +00:00
Chris Lattner	e757ae6534	New testcase llvm-svn: 27766	2006-04-17 20:32:27 +00:00
Chris Lattner	326870b40b	Codegen insertelement with constant insertion points as scalar_to_vector and a shuffle. For this: void %test2(<4 x float>* %F, float %f) { %tmp = load <4 x float>* %F ; <<4 x float>> [#uses=2] %tmp3 = add <4 x float> %tmp, %tmp ; <<4 x float>> [#uses=1] %tmp2 = insertelement <4 x float> %tmp3, float %f, uint 2 ; <<4 x float>> [#uses=2] %tmp6 = add <4 x float> %tmp2, %tmp2 ; <<4 x float>> [#uses=1] store <4 x float> %tmp6, <4 x float>* %F ret void } we now get this on X86 (which will get better): _test2: movl 4(%esp), %eax movaps (%eax), %xmm0 addps %xmm0, %xmm0 movaps %xmm0, %xmm1 shufps $3, %xmm1, %xmm1 movaps %xmm0, %xmm2 shufps $1, %xmm2, %xmm2 unpcklps %xmm1, %xmm2 movss 8(%esp), %xmm1 unpcklps %xmm1, %xmm0 unpcklps %xmm2, %xmm0 addps %xmm0, %xmm0 movaps %xmm0, (%eax) ret instead of: _test2: subl $28, %esp movl 32(%esp), %eax movaps (%eax), %xmm0 addps %xmm0, %xmm0 movaps %xmm0, (%esp) movss 36(%esp), %xmm0 movss %xmm0, 8(%esp) movaps (%esp), %xmm0 addps %xmm0, %xmm0 movaps %xmm0, (%eax) addl $28, %esp ret llvm-svn: 27765	2006-04-17 19:21:01 +00:00
Chris Lattner	e54133cfba	Make sure to check splats of every constant we can, handle splat(31) by being a bit more clever, add support for odd splats from -31 to -17. llvm-svn: 27764	2006-04-17 18:09:22 +00:00
Evan Cheng	bf0d13c54f	Incorrect foldMemoryOperand entries llvm-svn: 27763	2006-04-17 18:06:12 +00:00
Evan Cheng	5112b5c544	Errors in patterns preventing load folding llvm-svn: 27762	2006-04-17 18:05:01 +00:00
Jeff Cohen	e3955a05e4	Add checks for __OpenBSD__. llvm-svn: 27761	2006-04-17 17:55:41 +00:00
Chris Lattner	264c908e3a	Teach the ppc backend to use rol and vsldoi to generate splatted constants. This implements vec_constants.ll:test_vsldoi and test_rol llvm-svn: 27760	2006-04-17 17:55:10 +00:00
Chris Lattner	8cdba16d5e	Some more cases that can be generated with two instructions llvm-svn: 27759	2006-04-17 17:54:18 +00:00
Chris Lattner	26fb8d9393	add a note llvm-svn: 27758	2006-04-17 17:29:41 +00:00
Evan Cheng	b3b41c4f3d	FP SETOLT, SETOLT, SETUGE, SETUGT conditions were implemented incorrectly llvm-svn: 27755	2006-04-17 07:24:10 +00:00
Chris Lattner	1b3806ace5	Make some code more general, adding support for constant formation of several new patterns. llvm-svn: 27754	2006-04-17 06:58:41 +00:00
Chris Lattner	9a3859b339	New testcases llvm-svn: 27753	2006-04-17 06:58:16 +00:00
Chris Lattner	f8dd76df5b	Learn how to make odd splatted constants in range [17,29]. This implements PowerPC/vec_constants.ll:test_29. llvm-svn: 27752	2006-04-17 06:07:44 +00:00
Chris Lattner	02440a996b	new testcase llvm-svn: 27751	2006-04-17 06:06:50 +00:00
Chris Lattner	2a099c04c1	Pull some code out into a helper function. Effeciently codegen even splats in the range [-32,30]. This allows us to codegen <30,30,30,30> as: vspltisw v0, 15 vadduwm v2, v0, v0 instead of as a cp load. llvm-svn: 27750	2006-04-17 06:00:21 +00:00
Chris Lattner	31b7d89e66	New testcase llvm-svn: 27749	2006-04-17 05:58:22 +00:00
Chris Lattner	071ad01ceb	Implement a TODO: for any shuffle that can be viewed as a v4[if]32 shuffle, if it can be implemented in 3 or fewer discrete altivec instructions, codegen it as such. This implements Regression/CodeGen/PowerPC/vec_perf_shuffle.ll llvm-svn: 27748	2006-04-17 05:28:54 +00:00
Chris Lattner	6e98b49b54	new testcase, these shuffles can be implemented with discrete instructions, and shouldn't be lowered to vperm. llvm-svn: 27747	2006-04-17 05:27:31 +00:00
Chris Lattner	85bfa3c2bc	Regenerate with adjusted costs llvm-svn: 27746	2006-04-17 05:26:20 +00:00

1 2 3 4 5 ...

24389 Commits