llvm-project

Commit Graph

Author	SHA1	Message	Date
Chris Lattner	ab882abce8	add support for using vxor to build zero vectors. This implements Regression/CodeGen/PowerPC/vec_zero.ll llvm-svn: 27059	2006-03-24 07:48:08 +00:00
Evan Cheng	082c8785ef	Handle BUILD_VECTOR with all zero elements. llvm-svn: 27056	2006-03-24 07:29:27 +00:00
Chris Lattner	f5efddf80b	Gabor points out that we can't spell. :) llvm-svn: 27049	2006-03-24 07:12:19 +00:00
Evan Cheng	a91d8a5b43	All v2f64 shuffle cases can be handled. llvm-svn: 27044	2006-03-24 06:40:32 +00:00
Evan Cheng	2595a687da	More efficient v2f64 shuffle using movlhps, movhlps, unpckhpd, and unpcklpd. llvm-svn: 27040	2006-03-24 02:58:06 +00:00
Evan Cheng	6afb3c2de7	A new entry llvm-svn: 27039	2006-03-24 02:57:03 +00:00
Reid Spencer	f9c3dcfdc1	Ignore the burg output files. llvm-svn: 27033	2006-03-24 02:21:35 +00:00
Evan Cheng	d27fb3e85e	Handle more shuffle cases with SHUFP* instructions. llvm-svn: 27024	2006-03-24 01:18:28 +00:00
Evan Cheng	4b5b4e373b	Typo llvm-svn: 27008	2006-03-23 23:24:51 +00:00
Chris Lattner	cbcfe46556	add a note llvm-svn: 27000	2006-03-23 21:28:44 +00:00
Evan Cheng	f842ea57bb	Typo llvm-svn: 26997	2006-03-23 20:26:04 +00:00
Chris Lattner	81137629e0	Add PPC vector bit-convert support llvm-svn: 26995	2006-03-23 19:54:27 +00:00
Jim Laskey	3c43609f1f	Add support to locate local variables in frames (early version.) llvm-svn: 26994	2006-03-23 18:12:57 +00:00
Jim Laskey	cf0166fbeb	Change interface to DwarfWriter. llvm-svn: 26991	2006-03-23 18:09:44 +00:00
Jim Laskey	267d39d128	Modify how CBE handles #lines. llvm-svn: 26990	2006-03-23 18:08:29 +00:00
Chris Lattner	ce0206e119	Fix the encodings of these new instructions, hopefully fixing the JIT failures from last night llvm-svn: 26981	2006-03-23 16:13:50 +00:00
Evan Cheng	82ed4a42f9	Following icc's lead: use movdqa to load / store 128-bit integer vectors llvm-svn: 26980	2006-03-23 07:44:07 +00:00
Chris Lattner	6f95ab7abb	Eliminate IntrinsicLowering from TargetMachine. Make the CBE and V9 backends create their own, since they're the only ones that use it. llvm-svn: 26974	2006-03-23 05:43:16 +00:00
Chris Lattner	811dd8d009	remove always-null IntrinsicLowering argument. llvm-svn: 26971	2006-03-23 05:28:02 +00:00
Evan Cheng	7055878170	Add v4i32 <-> v4f32 bitconvert patterns. llvm-svn: 26969	2006-03-23 02:36:37 +00:00
Evan Cheng	b9b0550dc6	Add 128-bit integer vector load and add (for testing). llvm-svn: 26967	2006-03-23 01:57:24 +00:00
Nate Begeman	fb6e02931c	Add support for 8 bit immediates with 16/32 bit cmp instructions llvm-svn: 26966	2006-03-23 01:29:48 +00:00
Evan Cheng	021bb7c956	Added a ValueType operand to isShuffleMaskLegal(). For now, x86 will not do 64-bit vector shuffle. llvm-svn: 26964	2006-03-22 22:07:06 +00:00
Evan Cheng	ed794cd27b	SHUFP* are two address code. llvm-svn: 26959	2006-03-22 20:08:18 +00:00
Evan Cheng	bc04722860	Some clean up. llvm-svn: 26957	2006-03-22 19:22:18 +00:00
Evan Cheng	d4e1557941	- Supposely movlhps is faster / better than unpcklpd. - Don't forget pshufd is only available with sse2. llvm-svn: 26956	2006-03-22 19:16:21 +00:00
Evan Cheng	68ad48bd1a	- Implement X86ISelLowering::isShuffleMaskLegal(). We currently only support splat and PSHUFD cases. - Clean up shuffle / splat matching code. llvm-svn: 26954	2006-03-22 18:59:22 +00:00
Evan Cheng	8fdbdf20cd	- VECTOR_SHUFFLE of v4i32 / v4f32 with undef second vector always matches PSHUFD. We can make permutes entries which point to the undef pointing anything we want. - Change some names to appease Chris. llvm-svn: 26951	2006-03-22 08:01:21 +00:00
Chris Lattner	e24cf9dfa1	add a note llvm-svn: 26950	2006-03-22 07:33:46 +00:00
Evan Cheng	3617caf526	Fix PSHUF* and SHUF* jit code emission problems llvm-svn: 26949	2006-03-22 07:10:28 +00:00
Chris Lattner	eccf46950c	This has been implemented. Tweak it into another note llvm-svn: 26944	2006-03-22 05:33:23 +00:00
Chris Lattner	4a66d69433	When possible, custom lower 32-bit SINT_TO_FP to this: _foo2: extsw r2, r3 std r2, -8(r1) lfd f0, -8(r1) fcfid f0, f0 frsp f1, f0 blr instead of this: _foo2: lis r2, ha16(LCPI2_0) lis r4, 17200 xoris r3, r3, 32768 stw r3, -4(r1) stw r4, -8(r1) lfs f0, lo16(LCPI2_0)(r2) lfd f1, -8(r1) fsub f0, f1, f0 frsp f1, f0 blr This speeds up Misc/pi from 2.44s->2.09s with LLC and from 3.01->2.18s with llcbeta (16.7% and 38.1% respectively). llvm-svn: 26943	2006-03-22 05:30:33 +00:00
Chris Lattner	77373d1bea	Add support for "ri" addressing modes where the immediate is a 14-bit field which is shifted left two bits before use. Instructions like STD use this addressing mode. llvm-svn: 26942	2006-03-22 05:26:03 +00:00
Chris Lattner	f5e36c8bc0	fix a warning llvm-svn: 26941	2006-03-22 04:18:34 +00:00
Evan Cheng	d097e67544	Some splat and shuffle support. llvm-svn: 26940	2006-03-22 02:53:00 +00:00
Evan Cheng	b1d3c64d1f	Add a couple more pseudo instructions. llvm-svn: 26939	2006-03-22 02:52:03 +00:00
Chris Lattner	4e7371758f	Fix the JIT encoding of the VAForm_1 instructions, including vmaddfp llvm-svn: 26935	2006-03-22 01:44:36 +00:00
Evan Cheng	baea59c61c	Didn't mean to check this in. No MMX support yet. llvm-svn: 26933	2006-03-21 23:04:23 +00:00
Evan Cheng	d5e905d762	- Use movaps to store 128-bit vector integers. - Each scalar to vector v8i16 and v16i8 is a any_extend followed by a movd. llvm-svn: 26932	2006-03-21 23:01:21 +00:00
Chris Lattner	00f4683bf6	These targets don't support EXTRACT_VECTOR_ELT, though, in time, X86 will. llvm-svn: 26930	2006-03-21 20:51:05 +00:00
Chris Lattner	3a2ae6ad3c	Don't emit pseudo instructions! llvm-svn: 26926	2006-03-21 20:19:37 +00:00
Nate Begeman	013127981a	Update readme llvm-svn: 26924	2006-03-21 18:58:20 +00:00
Chris Lattner	139eac5b71	Print absolute memory references like this: lwz r2, 8(0) instead of this: lwz r2, 8(r0) This fixes the llc/llc-beta failures on PPC last night. llvm-svn: 26922	2006-03-21 17:21:13 +00:00
Evan Cheng	2d819f5fa4	Combine 2 entries llvm-svn: 26921	2006-03-21 07:18:26 +00:00
Evan Cheng	aeebc96099	Add a note about x86 register coallescing llvm-svn: 26920	2006-03-21 07:12:57 +00:00
Evan Cheng	1208d9179a	- Remove scalar to vector pseudo ops. They are just wrong. - Handle FR32 to VR128:v4f32 and FR64 to VR128:v2f64 with aliases of MOVAPS and MOVAPD. Mark them as move instructions and hope they will be deleted. llvm-svn: 26919	2006-03-21 07:09:35 +00:00
Chris Lattner	bda7310ef7	With Evan's latest tblgen patch, this code is obsolete, thanks Evan! llvm-svn: 26917	2006-03-21 06:37:40 +00:00
Chris Lattner	d2132f87d7	When codegen'ing vector MUL using VFMADD, add the 0, don't mul the 0. llvm-svn: 26913	2006-03-21 00:51:38 +00:00
Chris Lattner	f194834161	minor note llvm-svn: 26912	2006-03-21 00:47:09 +00:00
Evan Cheng	e4d1416239	x86 ISD::SCALAR_TO_VECTOR support. llvm-svn: 26911	2006-03-21 00:33:35 +00:00
Evan Cheng	fb872b41c0	Junk unused vector register classes. llvm-svn: 26910	2006-03-21 00:30:59 +00:00
Chris Lattner	c8b16d00b9	Handle constant addresses more efficiently, folding the low bits into the disp field of the load/store if possible. This compiles CodeGen/PowerPC/load-constant-addr.ll to: _test: lis r2, 2838 lfs f1, 26848(r2) blr instead of: _test: lis r2, 2838 ori r2, r2, 26848 lfs f1, 0(r2) blr llvm-svn: 26908	2006-03-20 22:38:22 +00:00
Chris Lattner	6d74b09da7	remove dead variable llvm-svn: 26907	2006-03-20 22:37:23 +00:00
Chris Lattner	a1bc294f0c	Fix a couple of bugs in permute/splat generate, thanks to Nate for actually figuring these out! :) llvm-svn: 26904	2006-03-20 18:26:51 +00:00
Chris Lattner	eda030da04	reenable this hack, the tblgen version isn't quite ready llvm-svn: 26902	2006-03-20 17:54:43 +00:00
Chris Lattner	f96d523b8f	Fix the pattern for VADDUWM, add i32 splat llvm-svn: 26901	2006-03-20 17:51:58 +00:00
Evan Cheng	89f3cff0f5	Use tblgen'd VECTOR_SHUFFLE selection code. llvm-svn: 26900	2006-03-20 08:14:16 +00:00
Chris Lattner	a9a1313386	Add support for generating vspltw, instead of a vperm instruction with a constant pool load. This generates significantly nicer code for splats. When tblgen gets bugfixed, we can remove the custom selection code. llvm-svn: 26898	2006-03-20 06:51:10 +00:00
Chris Lattner	a8fbb6dd3d	Implement PPC::isSplatShuffleMask and PPC::getVSPLTImmediate. llvm-svn: 26897	2006-03-20 06:37:44 +00:00
Chris Lattner	ffc475689b	fix duplicate definition errors llvm-svn: 26896	2006-03-20 06:33:01 +00:00
Chris Lattner	80b6bd2746	Add a build_vector node llvm-svn: 26895	2006-03-20 06:18:01 +00:00
Chris Lattner	382f356bd9	Check in some intermediate code that adds a skeleton for matching vsplt* instructions llvm-svn: 26894	2006-03-20 06:15:45 +00:00
Evan Cheng	e6448448c2	Move a few things around. llvm-svn: 26893	2006-03-20 06:04:52 +00:00
Chris Lattner	e4e1ac37ba	add vector_shuffle llvm-svn: 26891	2006-03-20 05:40:45 +00:00
Chris Lattner	93d99f9928	fix typo llvm-svn: 26889	2006-03-20 05:05:55 +00:00
Chris Lattner	366b2514fa	add vsplat instructions, fix sched description for vperm llvm-svn: 26888	2006-03-20 04:47:33 +00:00
Chris Lattner	a8713b1ee6	Custom lower arbitrary VECTOR_SHUFFLE's to VPERM. TODO: leave specific ones as VECTOR_SHUFFLE's and turn them into specialized operations like vsplt* llvm-svn: 26887	2006-03-20 01:53:53 +00:00
Chris Lattner	0a8b4eaee9	Claim to have v16i8 for perm masks llvm-svn: 26886	2006-03-20 01:53:02 +00:00
Chris Lattner	e7a058de7d	add the vperm instruction llvm-svn: 26883	2006-03-20 01:00:56 +00:00
Chris Lattner	d16f6fdd49	add a note with a testcase llvm-svn: 26877	2006-03-19 22:27:41 +00:00
Chris Lattner	169e6238ad	Add a note about the MUL -> FMADD vector bug. llvm-svn: 26874	2006-03-19 22:08:08 +00:00
Evan Cheng	f7c2e3628b	Vector undef's llvm-svn: 26870	2006-03-19 09:38:54 +00:00
Chris Lattner	7e9440a4fc	Custom lower SCALAR_TO_VECTOR into lve*x. llvm-svn: 26868	2006-03-19 06:55:52 +00:00
Chris Lattner	b1ee9c7e24	PPC doesn't have SCALAR_TO_VECTOR llvm-svn: 26865	2006-03-19 06:17:19 +00:00
Chris Lattner	5b595af956	add support for vector undef llvm-svn: 26863	2006-03-19 06:10:09 +00:00
Evan Cheng	0a03f789c2	Remind us of exit value substitution llvm-svn: 26862	2006-03-19 06:09:23 +00:00
Evan Cheng	5111c81a3c	Turning on LSR by default llvm-svn: 26861	2006-03-19 06:08:49 +00:00
Evan Cheng	66a9c0dea7	Remember which tests are hurt by LSR. llvm-svn: 26860	2006-03-19 06:08:11 +00:00
Chris Lattner	0c9eb670bb	minor fixes llvm-svn: 26857	2006-03-19 05:43:01 +00:00
Chris Lattner	ea6468758d	notes llvm-svn: 26856	2006-03-19 05:33:30 +00:00
Chris Lattner	431c90c9fa	we don't use lmw/stmw. When we want them they are easy enough to add llvm-svn: 26853	2006-03-19 04:33:37 +00:00
Chris Lattner	f7b6e7212f	rename these nodes llvm-svn: 26848	2006-03-19 01:13:28 +00:00
Evan Cheng	9bf978dc20	Use the generic vector register classes VR64 / VR128 rather than V4F32, V8I16, etc. llvm-svn: 26838	2006-03-18 01:23:20 +00:00
Nate Begeman	21f87d0e4c	Fix subfic to match subc by default instead of sub so that it is correctly cost-modeled as producing a flag. This fixes the test I just added for neg llvm-svn: 26835	2006-03-17 22:41:37 +00:00
Evan Cheng	b09a56f3a4	Darwin should use _setjmp/_longjmp instead of setjmp/longjmp. llvm-svn: 26833	2006-03-17 20:31:41 +00:00
Evan Cheng	4f674921d6	Move some pattern fragments to the right files. llvm-svn: 26831	2006-03-17 19:55:52 +00:00
Chris Lattner	388fc4d9fb	Disable x86 fastcc from passing args in registers llvm-svn: 26824	2006-03-17 17:27:47 +00:00
Chris Lattner	43798850f9	Parameterize the number of integer arguments to pass in registers llvm-svn: 26818	2006-03-17 05:10:20 +00:00
Evan Cheng	bfc2e97383	Also fold MOV8r0, MOV16r0, MOV32r0 + store to MOV8mi, MOV16mi, and MOV32mi. llvm-svn: 26817	2006-03-17 02:36:22 +00:00
Evan Cheng	aca7915b70	Add some missing entries to X86RegisterInfo::foldMemoryOperand(). e.g. ADD32ri8. llvm-svn: 26816	2006-03-17 02:25:01 +00:00
Evan Cheng	27750f3287	- Nuke 16-bit SBB instructions. We'll never use them. - Nuke a bogus comment. llvm-svn: 26815	2006-03-17 02:24:04 +00:00
Nate Begeman	bb01d4f272	Remove BRTWOWAY* Make the PPC backend not dependent on BRTWOWAY_CC and make the branch selector smarter about the code it generates, fixing a case in the readme. llvm-svn: 26814	2006-03-17 01:40:33 +00:00
Chris Lattner	8bf1c59e7f	remove dead variable llvm-svn: 26813	2006-03-16 23:52:08 +00:00
Evan Cheng	c11fcceec5	A new entry. llvm-svn: 26810	2006-03-16 22:44:22 +00:00
Nate Begeman	fb0e36fa56	Notes on how to kill the eeevil brtwoway, and make ppc branch selector more target independant, generate better code, and be less conservative. llvm-svn: 26809	2006-03-16 22:37:48 +00:00
Chris Lattner	1e6dfa4c1f	Strangely, calls clobber call-clobbered vector regs. Whodathoughtit? llvm-svn: 26808	2006-03-16 22:35:59 +00:00
Chris Lattner	325bb46315	add a note llvm-svn: 26807	2006-03-16 22:25:55 +00:00
Chris Lattner	91400bd413	teach the ppc backend how to spill/reload vector regs llvm-svn: 26806	2006-03-16 22:24:02 +00:00
Chris Lattner	6e90062416	add callee saved vector regs llvm-svn: 26805	2006-03-16 22:07:06 +00:00
Evan Cheng	f75555feb9	Bug fix: condition inverted. llvm-svn: 26804	2006-03-16 22:02:48 +00:00
Evan Cheng	20931a798e	Added a way for TargetLowering to specify what values can be used as the scale component of the target addressing mode. llvm-svn: 26802	2006-03-16 21:47:42 +00:00
Chris Lattner	0b27047a6c	in functions that use a lot of callee saved regs, this can be more than 5 instructions away. llvm-svn: 26801	2006-03-16 21:31:45 +00:00
Chris Lattner	fd9f3e8ed3	Add support for copying registers. still needed: spilling and reloading them llvm-svn: 26800	2006-03-16 20:03:58 +00:00
Chris Lattner	ad74844bfa	set TransformToType correctly for vector types. llvm-svn: 26797	2006-03-16 19:50:01 +00:00
Nate Begeman	32e73f9881	Another case we could do better on. llvm-svn: 26795	2006-03-16 18:50:44 +00:00
Chris Lattner	1678a6c477	Save/restore VRSAVE once per function, not once per block. llvm-svn: 26793	2006-03-16 18:25:23 +00:00
Chris Lattner	4b41e40621	add support for the bitconvert node llvm-svn: 26789	2006-03-16 01:29:53 +00:00
Nate Begeman	2e1fde7c5c	Update scheduling info for vrsave instruction llvm-svn: 26776	2006-03-15 05:25:05 +00:00
Chris Lattner	5271a1f9b5	add a note llvm-svn: 26762	2006-03-14 19:31:24 +00:00
Chris Lattner	ab1ed2aa96	Fix an off by one error that caused PPC LLC failures last night. llvm-svn: 26758	2006-03-14 17:56:49 +00:00
Chris Lattner	30402be175	transformation implemented llvm-svn: 26754	2006-03-14 06:57:34 +00:00
Evan Cheng	0f9d6534f5	PPC LSR pass should use target lowering hooks. llvm-svn: 26743	2006-03-13 23:56:51 +00:00
Evan Cheng	2dd2c652b2	Added getTargetLowering() to TargetMachine. Refactored targets to support this. llvm-svn: 26742	2006-03-13 23:20:37 +00:00
Evan Cheng	60f495100a	Update llvm-svn: 26741	2006-03-13 23:19:10 +00:00
Evan Cheng	af598d2461	Add LSR hooks. llvm-svn: 26740	2006-03-13 23:18:16 +00:00
Chris Lattner	2b8eb375d7	Handle builtins that directly correspond to GCC builtins. llvm-svn: 26737	2006-03-13 23:09:05 +00:00
Chris Lattner	02e2c18c9c	For functions that use vector registers, save VRSAVE, mark used registers, and update it on entry to each function, then restore it on exit. This compiles: void func(vfloat a, vfloat b, vfloat c) { a = b c + c; } to this: _func: mfspr r2, 256 oris r6, r2, 49152 mtspr 256, r6 lvx v0, 0, r5 lvx v1, 0, r4 vmaddfp v0, v1, v0, v0 stvx v0, 0, r3 mtspr 256, r2 blr GCC produces this (which has additional stack accesses): _func: mfspr r0,256 stw r0,-4(r1) oris r0,r0,0xc000 mtspr 256,r0 lvx v0,0,r5 lvx v1,0,r4 lwz r12,-4(r1) vmaddfp v0,v0,v1,v0 stvx v0,0,r3 mtspr 256,r12 blr llvm-svn: 26733	2006-03-13 21:52:10 +00:00
Jim Laskey	acb6e34277	Handle the removal of the debug chain. llvm-svn: 26729	2006-03-13 13:07:37 +00:00
Chris Lattner	fe4c7fb7ae	remove two implemented items llvm-svn: 26728	2006-03-13 06:52:22 +00:00
Chris Lattner	3d761b6211	I can't convince myself that this is safe, remove the recursive call. llvm-svn: 26725	2006-03-13 06:42:16 +00:00
Chris Lattner	ec9d0bc3ec	Fix a couple of bugs that broke the alpha tester build llvm-svn: 26722	2006-03-13 05:23:59 +00:00
Chris Lattner	4fbb612685	Handle cracked instructions in dispatch group formation. llvm-svn: 26721	2006-03-13 05:20:04 +00:00
Chris Lattner	7579cfb1a0	Mark instructions that are cracked by the PPC970 decoder as such. llvm-svn: 26720	2006-03-13 05:15:10 +00:00
Chris Lattner	51348c5f27	Several big changes: 1. Use flags on the instructions in the .td file to indicate the PPC970 unit type instead of a table in the .cpp file. Much cleaner. 2. Change the hazard recognizer to build d-groups according to the actual algorithm used, not my flawed understanding of it. 3. Model "must be in the first slot" and "must be the only instr in a group" accurately. llvm-svn: 26719	2006-03-12 09:13:49 +00:00
Chris Lattner	d03132a409	blr is a branch too llvm-svn: 26710	2006-03-11 21:49:49 +00:00
Chris Lattner	4e56b686f1	add an example llvm-svn: 26709	2006-03-11 20:20:40 +00:00
Chris Lattner	003f633036	add a note llvm-svn: 26708	2006-03-11 20:17:08 +00:00
Chris Lattner	c2447e8b59	teach the JIT to encode vector registers llvm-svn: 26697	2006-03-10 20:19:50 +00:00
Evan Cheng	306c13a8fb	Add option -enable-x86-lsr to enable x86 loop strength reduction pass. llvm-svn: 26665	2006-03-09 21:51:28 +00:00
Chris Lattner	f136299635	add a note llvm-svn: 26661	2006-03-09 20:13:21 +00:00
Andrew Lenharth	43e569c95f	these are copies too llvm-svn: 26653	2006-03-09 18:18:51 +00:00
Chris Lattner	7e7dccd3ab	remove some now-dead code llvm-svn: 26652	2006-03-09 18:07:49 +00:00
Andrew Lenharth	70236fc12f	fcopysign for mixed mode llvm-svn: 26651	2006-03-09 17:56:33 +00:00
Andrew Lenharth	ebfd94fa1d	relax fcopysign llvm-svn: 26649	2006-03-09 17:47:22 +00:00
Andrew Lenharth	4a87e7d9a3	alpha and llvm have different oppinions on which arg is the sign bit llvm-svn: 26647	2006-03-09 17:41:50 +00:00
Andrew Lenharth	16b96d2cb4	Alpha Scheduling classes llvm-svn: 26643	2006-03-09 17:16:45 +00:00
Andrew Lenharth	ed7a293b44	fcopysign and get rid of dsnode cruft. custom PA runtimes make this better in some senses llvm-svn: 26641	2006-03-09 14:58:25 +00:00
Andrew Lenharth	b8a06a7c6c	fcopysign support llvm-svn: 26640	2006-03-09 14:57:36 +00:00
Chris Lattner	e363fdf318	Add support for 'special' llvm globals like debug info and static ctors/dtors. llvm-svn: 26628	2006-03-09 06:14:35 +00:00
Chris Lattner	920e661e50	a couple of miscellaneous things. llvm-svn: 26625	2006-03-09 01:39:46 +00:00
Jim Laskey	8f0a95f664	Add #line support for CBE. llvm-svn: 26621	2006-03-08 19:31:15 +00:00
Duraid Madina	5005b01c20	doo de doo llvm-svn: 26614	2006-03-08 06:18:46 +00:00
Chris Lattner	543832d39d	Change the interface for getting a target HazardRecognizer to be more clean. llvm-svn: 26608	2006-03-08 04:25:59 +00:00
Chris Lattner	a8dd636192	add a note llvm-svn: 26605	2006-03-08 00:25:47 +00:00
Evan Cheng	70b25efa57	X86ISD::REP_STOS and X86ISD::REP_MOVS now produces a flag. llvm-svn: 26604	2006-03-07 23:34:23 +00:00
Evan Cheng	adc7093fc1	Use rep/stosl; and Count 0x3; rep/stosb for memset with 4 byte aligned dest. and variable value. Similarly for memcpy. llvm-svn: 26603	2006-03-07 23:29:39 +00:00
Chris Lattner	207291fd1a	Two things: 1. Don't emit debug info, or other llvm.metadata to the .cbe.c file. 2. Mark static ctors/dtors as such, so that bugpoint works on C++ code compiled with the new CFE. llvm-svn: 26602	2006-03-07 22:58:23 +00:00
Jim Laskey	313570fb17	Use "llvm.metadata" section for debug globals. Filter out these globals in the asm printer. llvm-svn: 26599	2006-03-07 22:00:35 +00:00
Chris Lattner	907e13c742	add another missing store. llvm-svn: 26595	2006-03-07 16:26:48 +00:00
Chris Lattner	8c73d80b08	add a couple more load/store instrs, add a newline to the end of file. llvm-svn: 26594	2006-03-07 16:19:46 +00:00
Nate Begeman	3e3219cc0a	This kinda sorta implements "things that have to lead a dispatch group". llvm-svn: 26591	2006-03-07 08:30:27 +00:00
Chris Lattner	675567f77c	add some new instructions to the classifier. With this, we correctly insert a nop into Freebench/neural, which speeds it up from 136->129s (~5.4%). llvm-svn: 26590	2006-03-07 07:14:55 +00:00
Chris Lattner	05ad128dca	add some comments that describe what we model llvm-svn: 26588	2006-03-07 06:44:19 +00:00
Chris Lattner	2cab13573c	Implement a very very simple hazard recognizer for LSU rejects and ctr set/read flushes llvm-svn: 26587	2006-03-07 06:32:48 +00:00
Chris Lattner	883cefc656	add a note llvm-svn: 26585	2006-03-07 04:42:59 +00:00
Chris Lattner	bccb0e07f0	add a note llvm-svn: 26583	2006-03-07 02:46:26 +00:00
Evan Cheng	a4a4ceb478	- Emit subsections_via_symbols for Darwin. - Conditionalize Dwarf debugging output (Darwin only for now). llvm-svn: 26582	2006-03-07 02:23:26 +00:00
Evan Cheng	30d7b70b73	Enable Dwarf debugging info. llvm-svn: 26581	2006-03-07 02:02:57 +00:00
Chris Lattner	ea79d9fd73	implement TII::insertNoop llvm-svn: 26562	2006-03-05 23:49:55 +00:00
Chris Lattner	5032c32d30	add a note llvm-svn: 26549	2006-03-05 20:00:08 +00:00
Chris Lattner	c726a5c31f	Do not fold (add (shl x, c1), (shl c2, c1)) -> (shl (add x, c2), c1), we want to canonicalize the other way. llvm-svn: 26547	2006-03-05 19:52:57 +00:00
Chris Lattner	9c7f50376a	Copysign needs to be expanded everywhere. Note that Alpha and IA64 should implement copysign as a native op if they have it. llvm-svn: 26541	2006-03-05 05:08:37 +00:00
Chris Lattner	c2dd7aae71	add a note for something evan noticed llvm-svn: 26539	2006-03-05 01:15:18 +00:00
Chris Lattner	8d8b4cf63d	Implemented. llvm-svn: 26536	2006-03-04 23:33:44 +00:00
Chris Lattner	c9a318d8fa	Add a note llvm-svn: 26523	2006-03-04 08:44:51 +00:00
Evan Cheng	c66fd44541	Add an entry llvm-svn: 26520	2006-03-04 07:49:50 +00:00
Evan Cheng	6dc73297c3	MEMSET / MEMCPY lowering bugs: we can't issue a single WORD / DWORD version of rep/stos and rep/mov if the count is not a constant. We could do rep/stosl; and $count, 3; rep/stosb For now, I will lower them to memset / memcpy calls. We will revisit this after a little bit experiment. Also need to take care of the trailing bytes even if the count is a constant. Since the max. number of trailing bytes are 3, we will simply issue loads / stores. llvm-svn: 26517	2006-03-04 02:48:56 +00:00
Chris Lattner	e43e5c0697	add a note llvm-svn: 26513	2006-03-04 01:19:34 +00:00
Evan Cheng	084a102b17	Typo llvm-svn: 26512	2006-03-04 01:12:00 +00:00
Evan Cheng	a7fb285c60	Number of NodeTypes now exceeds 128. llvm-svn: 26503	2006-03-03 06:58:59 +00:00
Chris Lattner	b203355298	Split the valuetypes out of Target.td into ValueTypes.td llvm-svn: 26490	2006-03-03 01:55:26 +00:00
Chris Lattner	ad3c974a77	remove the read/write port/io intrinsics. llvm-svn: 26479	2006-03-03 00:19:58 +00:00
Chris Lattner	9067500e2e	add a note llvm-svn: 26472	2006-03-02 22:34:38 +00:00
Chris Lattner	60a60f4b1e	Implement CodeGen/PowerPC/or-addressing-mode.ll, which is also PR668. llvm-svn: 26450	2006-03-01 07:14:48 +00:00
Chris Lattner	3cb349a068	add a note llvm-svn: 26448	2006-03-01 06:36:20 +00:00
Chris Lattner	27f5345b1f	Compile this: void foo(float a, int b) { b = a; } to this: _foo: fctiwz f0, f1 stfiwx f0, 0, r4 blr instead of this: _foo: fctiwz f0, f1 stfd f0, -8(r1) lwz r2, -4(r1) stw r2, 0(r4) blr This implements CodeGen/PowerPC/stfiwx.ll, and also incidentally does the right thing for GCC bugzilla 26505. llvm-svn: 26447	2006-03-01 05:50:56 +00:00
Chris Lattner	f418435819	Use a target-specific dag-combine to implement CodeGen/PowerPC/fp-int-fp.ll. llvm-svn: 26445	2006-03-01 04:57:39 +00:00
Chris Lattner	4a2eeea671	Add interfaces for targets to provide target-specific dag combiner optimizations. llvm-svn: 26442	2006-03-01 04:52:55 +00:00
Evan Cheng	1926427351	Vector op lowering. llvm-svn: 26438	2006-03-01 01:11:20 +00:00
Evan Cheng	91c574b642	New type v2f32. llvm-svn: 26435	2006-03-01 01:06:22 +00:00
Evan Cheng	0e69f45b07	Another entry. llvm-svn: 26430	2006-02-28 23:38:49 +00:00
Evan Cheng	990c3602bd	Don't match x << 1 to LEAL. It's better to emit x + x. llvm-svn: 26429	2006-02-28 21:13:57 +00:00
Chris Lattner	b9f35f06bc	Add a subtarget feature for the stfiwx instruction. I know the G5 has it, but I don't know what other PPC impls do. If someone could update the proc table, I would appreciate it :) llvm-svn: 26421	2006-02-28 07:08:22 +00:00
Chris Lattner	872810da6c	remove implemented item llvm-svn: 26418	2006-02-28 06:36:04 +00:00
Nate Begeman	f918ed2e33	readme updates llvm-svn: 26405	2006-02-27 22:08:36 +00:00
Chris Lattner	ec185f7843	Don't print constant initializers, they may span lines now. llvm-svn: 26403	2006-02-27 20:09:23 +00:00
Jim Laskey	8f2c1021b4	Removed dependency on how operands are printed (want multi-line.) llvm-svn: 26399	2006-02-27 10:29:04 +00:00
Chris Lattner	ab8164042a	Implement bit propagation through sub nodes, this (re)implements PowerPC/div-2.ll llvm-svn: 26392	2006-02-27 01:00:42 +00:00
Chris Lattner	a60751dd43	Check RHS simplification before LHS simplification to avoid infinitely looping on PowerPC/small-arguments.ll llvm-svn: 26389	2006-02-27 00:36:27 +00:00
Chris Lattner	27220f8958	Just like we use the RHS of an AND to simplify the LHS, use the LHS to simplify the RHS. This allows for the elimination of many thousands of ands from multisource, and compiles CodeGen/PowerPC/and-elim.ll:test2 into this: _test2: srwi r2, r3, 1 xori r3, r2, 40961 blr instead of this: _test2: rlwinm r2, r3, 31, 17, 31 xori r2, r2, 40961 rlwinm r3, r2, 0, 16, 31 blr llvm-svn: 26388	2006-02-27 00:22:28 +00:00
Chris Lattner	118ddba929	Add a bunch of missed cases. Perhaps the most significant of which is that assertzext produces zero bits. llvm-svn: 26386	2006-02-26 23:36:02 +00:00
Evan Cheng	877ab55e06	ConstantPoolIndex is now the displacement portion of the address (rather than base). llvm-svn: 26382	2006-02-26 09:12:34 +00:00
Evan Cheng	75b8783aaf	Fixed ConstantPoolIndex operand asm print bug. This fixed 2005-07-17-INT-To-FP and 2005-05-12-Int64ToFP. llvm-svn: 26380	2006-02-26 08:28:12 +00:00
Evan Cheng	77d86ff8fc	* Cleaned up addressing mode matching code. * Cleaned up and tweaked LEA cost analysis code. Removed some hacks. * Handle ADD $X, c to MOV32ri $X+c. These patterns cannot be autogen'd and they need to be matched before LEA. llvm-svn: 26376	2006-02-25 10:09:08 +00:00
Evan Cheng	1c557bfeb5	Updates. llvm-svn: 26375	2006-02-25 10:04:07 +00:00
Evan Cheng	1fac3b3360	* Allow mul, shl nodes to be codegen'd as LEA (if appropriate). * Add patterns to handle GlobalAddress, ConstantPool, etc. MOV32ri to materialize these nodes in registers. ADD32ri to handle %reg + GA, etc. MOV32mi to handle store GA, etc. to memory. llvm-svn: 26374	2006-02-25 10:02:21 +00:00
Evan Cheng	e4a8b74e4f	ConstantPoolIndex is now the displacement field of addressing mode. llvm-svn: 26373	2006-02-25 09:56:50 +00:00
Evan Cheng	994700101e	Added a common about the need for X86ISD::Wrapper. llvm-svn: 26372	2006-02-25 09:55:19 +00:00
Evan Cheng	ed169db8a5	Added an offset field to ConstantPoolSDNode. llvm-svn: 26371	2006-02-25 09:54:52 +00:00
Evan Cheng	42d5ac557c	Fix an obvious bug exposed when we are doing ADD X, 4 ==> MOV32ri $X+4, ... llvm-svn: 26366	2006-02-25 01:37:02 +00:00
Chris Lattner	7674d90fa1	Add memory printing support for PPC. Input memory operands now work with inline asms! :) llvm-svn: 26365	2006-02-24 20:27:40 +00:00
Chris Lattner	a1ec1ddd59	Implement selection of inline asm memory operands llvm-svn: 26348	2006-02-24 02:13:12 +00:00
Chris Lattner	2a9e1e3e74	Recognize memory operand codes llvm-svn: 26345	2006-02-24 01:10:46 +00:00
Evan Cheng	0ed48fe601	PPC JIT relocation model should be DynamicNoPIC. llvm-svn: 26338	2006-02-23 22:18:07 +00:00
Evan Cheng	e0ed6ec13f	- Clean up the lowering and selection code of ConstantPool, GlobalAddress, and ExternalSymbol. - Use C++ code (rather than tblgen'd selection code) to match the above mentioned leaf nodes. Do not mutate and nodes and do not record the selection in CodeGenMap. These nodes should be safe to duplicate. This is a performance win. llvm-svn: 26335	2006-02-23 20:41:18 +00:00
Chris Lattner	1bad2546d0	Implement the PPC inline asm "L" modifier. This allows us to compile: long long test(long long X) { __asm__("foo %0 %L0 %1 %L1" : "=r"(X): "r"(X)); return X; } to: foo r2 r3 r2 r3 llvm-svn: 26333	2006-02-23 19:31:10 +00:00
Chris Lattner	16f08f53b1	"." isn't enough to get a private label on linux, use ".L". llvm-svn: 26327	2006-02-23 05:25:02 +00:00
Chris Lattner	2bacf981bf	add a small and simple case. llvm-svn: 26326	2006-02-23 05:17:43 +00:00
Evan Cheng	f4448cee66	A couple of new entries. llvm-svn: 26325	2006-02-23 02:50:21 +00:00
Evan Cheng	1f342c2884	PIC related bug fixes. 1. Various asm printer bug. 2. Lowering bug. Now TargetGlobalAddress is wrapped in X86ISD::TGAWrapper. llvm-svn: 26324	2006-02-23 02:43:52 +00:00
Evan Cheng	7eabbfd618	X86 codegen tweak to use lea in another case: Suppose base == %eax and it has multiple uses, then instead of movl %eax, %ecx addl $8, %ecx use leal 8(%eax), %ecx. llvm-svn: 26323	2006-02-23 00:13:58 +00:00
Evan Cheng	7714a59d91	Missing .globl for weak / link-once .text symbols. llvm-svn: 26321	2006-02-22 23:59:57 +00:00
Chris Lattner	2e124af406	Don't return registers from register classes that aren't legal. llvm-svn: 26317	2006-02-22 23:00:51 +00:00
Evan Cheng	73136dfecc	- Added option -relocation-model to set relocation model. Valid values include static, pic, dynamic-no-pic, and default. PPC and x86 default is dynamic-no-pic for Darwin, pic for others. - Removed options -enable-pic and -ppc-static. llvm-svn: 26315	2006-02-22 20:19:42 +00:00
Jim Laskey	2fa33a989d	Coordinate activities with llvm-gcc4 and dwarf. llvm-svn: 26314	2006-02-22 19:02:11 +00:00
Evan Cheng	9e252e3bcf	Added MMX, SSE1, and SSE2 vector instructions and some simple patterns. Fixed some existing bugs (wrong predicates, prefixes) at the same time. llvm-svn: 26310	2006-02-22 02:26:30 +00:00
Chris Lattner	7ad77dfc2a	split register class handling from explicit physreg handling. llvm-svn: 26308	2006-02-22 00:56:39 +00:00
Chris Lattner	7bb4696dc3	Updates to match change of getRegForInlineAsmConstraint prototype llvm-svn: 26305	2006-02-21 23:11:00 +00:00
Evan Cheng	d58478161f	One more round of reorg so sabre doesn't freak out. :-) llvm-svn: 26303	2006-02-21 20:00:20 +00:00
Evan Cheng	6fc1162855	A big more cleaning up. llvm-svn: 26302	2006-02-21 19:30:30 +00:00
Evan Cheng	8711b6bff3	Moving things to their proper places. llvm-svn: 26301	2006-02-21 19:26:52 +00:00
Evan Cheng	6e595b9fd8	Split instruction info into multiple files, one for each of x87, MMX, and SSE. llvm-svn: 26300	2006-02-21 19:13:53 +00:00
Chris Lattner	0a08f44704	missed optzn llvm-svn: 26299	2006-02-21 18:29:44 +00:00
Chris Lattner	747cf60696	The HasNoV9 hack isn't needed here, now that tblgen knows that CustomDAGSchedInserter instructions are expensive. llvm-svn: 26298	2006-02-21 18:04:32 +00:00
Evan Cheng	d57203c0a1	Added separate alias instructions for SSE logical ops that operate on non-packed types. llvm-svn: 26297	2006-02-21 02:24:38 +00:00
Evan Cheng	afffe63fc1	Added MMX and XMM packed integer move instructions, movd and movq. llvm-svn: 26296	2006-02-21 01:39:57 +00:00
Evan Cheng	fa57a0add9	Added SSE2 128-bit integer packed types: V16I8, V8I16, V4I32, and V2I64. Added generic vector types: VR64 and VR128. llvm-svn: 26295	2006-02-21 01:38:21 +00:00
Evan Cheng	43070b7541	Added x86 integer vector types: 64-bit packed byte integer (v16i8), 64-bit packed word integer (v8i16), and 64-bit packed doubleword integer (v2i32). llvm-svn: 26294	2006-02-20 22:34:53 +00:00
Evan Cheng	4547400ae2	Some updates llvm-svn: 26292	2006-02-20 19:58:27 +00:00
Evan Cheng	d13778eb30	If SSE3 is available, promote FP_TO_UINT i32 to FP_TO_SINT i64 to take advantage of fisttpll. llvm-svn: 26288	2006-02-18 07:26:17 +00:00
Nate Begeman	983ca89714	Add a fold for add that exchanges it with a constant shift if possible, so that the shift may be more easily folded into other operations. llvm-svn: 26286	2006-02-18 02:43:25 +00:00
Evan Cheng	70af620709	Added fisttp for fp to int conversion. llvm-svn: 26283	2006-02-18 02:36:28 +00:00
Evan Cheng	06c2e6d1b3	Disable PIC for JIT. llvm-svn: 26281	2006-02-18 01:49:25 +00:00
Evan Cheng	5caed8a231	Jit does not support PIC yet. llvm-svn: 26278	2006-02-18 00:57:10 +00:00
Evan Cheng	5588de9415	x86 / Darwin PIC support. llvm-svn: 26273	2006-02-18 00:15:05 +00:00
Evan Cheng	5f99760ae7	Moved PICEnabled to include/llvm/Target/TargetOptions.h llvm-svn: 26272	2006-02-18 00:08:58 +00:00
Chris Lattner	07a2677e43	unbreak the build llvm-svn: 26260	2006-02-17 07:09:27 +00:00
Evan Cheng	593bea73ba	Unbreak x86 be llvm-svn: 26259	2006-02-17 07:01:52 +00:00
Nate Begeman	5965bd19f8	kill ADD_PARTS & SUB_PARTS and replace them with fancy new ADDC, ADDE, SUBC and SUBE nodes that actually expose what's going on and allow for significant simplifications in the targets. llvm-svn: 26255	2006-02-17 05:43:56 +00:00
Chris Lattner	67c21b6c46	add note about div by power of 2 llvm-svn: 26253	2006-02-17 04:20:13 +00:00
Jeff Cohen	0d62ebd13f	Fix bug noticed by VC++. llvm-svn: 26252	2006-02-17 02:12:18 +00:00
Nate Begeman	3920ce4d8d	Whoops, didn't mean to check this in yet. llvm-svn: 26250	2006-02-17 00:56:19 +00:00
Nate Begeman	4a0dc0c8f6	Add a missing and useful pat frag llvm-svn: 26249	2006-02-17 00:51:06 +00:00
Evan Cheng	b590d3a72b	Remind ourselves to revisit the "pxor vs. xorps/xorpd to clear XMM registers" issue. Need to do more experiments. llvm-svn: 26247	2006-02-17 00:04:28 +00:00
Nate Begeman	7e5496d5fe	Kill the x86 pattern isel. boom. llvm-svn: 26246	2006-02-17 00:03:04 +00:00
Evan Cheng	db1dbbe8d6	Remove the entry about using movapd for SSE reg-reg moves. llvm-svn: 26245	2006-02-17 00:00:58 +00:00
Evan Cheng	eb7b3380fd	pxor (for FLD0SS) encoding was missing the OpSize prefix. llvm-svn: 26244	2006-02-16 23:59:30 +00:00
Chris Lattner	936cc9fe53	Remove the skeleton target, it doesn't produce useful code and there are other small targets that do that can be learned from. They also have the added advantage of being tested :) llvm-svn: 26243	2006-02-16 23:14:50 +00:00
Evan Cheng	24c461b51e	1. Use pxor instead of xoraps / xorapd to clear FR32 / FR64 registers. This proves to be worth 20% on Ptrdist/ks. Might be related to dependency breaking support. 2. Added FsMOVAPSrr and FsMOVAPDrr as aliases to MOVAPSrr and MOVAPDrr. These are used for FR32 / FR64 reg-to-reg copies. 3. Tell reg-allocator to generate MOVSSrm / MOVSDrm and MOVSSmr / MOVSDmr to spill / restore FsMOVAPSrr and FsMOVAPDrr. llvm-svn: 26241	2006-02-16 22:45:17 +00:00
Evan Cheng	3f99628939	Use movaps / movapd to spill / restore V4F4 / V2F8 registers. llvm-svn: 26240	2006-02-16 21:20:26 +00:00

... 3 4 5 6 7 ...

5094 Commits