llvm-project

Commit Graph

Author	SHA1	Message	Date
Evan Cheng	dd487d865b	More logical ops patterns llvm-svn: 27257	2006-03-30 07:33:32 +00:00
Evan Cheng	c58ef7deeb	Add support for _mm_cmp{cc}_ss and _mm_cmp{cc}_ps intrinsics llvm-svn: 27256	2006-03-30 06:21:22 +00:00
Evan Cheng	593310016d	Add 128-bit pmovmskb intrinsic support. llvm-svn: 27255	2006-03-30 00:33:26 +00:00
Evan Cheng	c5cf9bba05	Change SSE pack operation definitions to fit what the intrinsics expected. For example, packsswb actually creates a v16i8 from a pair of v8i16. But since the intrinsic specification forces the output type to match the operands. llvm-svn: 27254	2006-03-29 23:53:14 +00:00
Evan Cheng	b7fedffc78	- Added some SSE2 128-bit packed integer ops. - Added SSE2 128-bit integer pack with signed saturation ops. - Added pshufhw and pshuflw ops. llvm-svn: 27252	2006-03-29 23:07:14 +00:00
Evan Cheng	acc336475e	Need to special case splat after all. Make the second operand of splat vector_shuffle undef. llvm-svn: 27250	2006-03-29 19:02:40 +00:00
Evan Cheng	3cf95747c7	Floating point logical operation patterns should match bit_convert. Or else integer vector logical operations would match andp{s\|d} instead of pand. llvm-svn: 27248	2006-03-29 18:47:40 +00:00
Evan Cheng	500ec16578	- More shuffle related bug fixes. - Whenever possible use ops of the right packed types for vector shuffles / splats. llvm-svn: 27246	2006-03-29 03:04:49 +00:00
Evan Cheng	3a1c4e75de	Another entry about shuffles. llvm-svn: 27245	2006-03-29 03:03:46 +00:00
Evan Cheng	da59b0d2a8	- Only use pshufd for v4i32 vector shuffles. - Other shuffle related fixes. llvm-svn: 27244	2006-03-29 01:30:51 +00:00
Evan Cheng	38b34296d0	Added aliases to scalar SSE instructions, e.g. addss, to match x86 intrinsics. The source operands type are v4sf with upper bits passes through. Added matching code for these. llvm-svn: 27240	2006-03-28 23:51:43 +00:00
Evan Cheng	8160fd3d42	Fixing buggy code. llvm-svn: 27239	2006-03-28 23:41:33 +00:00
Jim Laskey	d1aa1638c6	Expose base register for DwarfWriter. Refactor code accordingly. llvm-svn: 27225	2006-03-28 13:48:33 +00:00
Jim Laskey	457e54efc1	Added missing paren on behalf of Ramana Radhakrishnan. llvm-svn: 27223	2006-03-28 10:17:11 +00:00
Evan Cheng	21e5476deb	Missed X86::isUNPCKHMask llvm-svn: 27222	2006-03-28 08:27:15 +00:00
Evan Cheng	be2d9a0e99	movlps and movlpd should be modeled as two address code. llvm-svn: 27221	2006-03-28 07:01:28 +00:00
Evan Cheng	dc57ae0711	Update llvm-svn: 27220	2006-03-28 06:55:45 +00:00
Evan Cheng	4e7374ff8a	Typo llvm-svn: 27219	2006-03-28 06:53:49 +00:00
Evan Cheng	1a194a5264	* Prefer using operation of matching types. e.g unpcklpd rather than movlhps. * Bug fixes. llvm-svn: 27218	2006-03-28 06:50:32 +00:00
Evan Cheng	08b473c619	Added a couple of entries about movhps and movlhps. llvm-svn: 27212	2006-03-28 02:49:12 +00:00
Evan Cheng	3765fadef6	All unpack cases are now being handled. llvm-svn: 27211	2006-03-28 02:44:05 +00:00
Evan Cheng	2bc3280659	- Clean up / consoladate various shuffle masks. - Some misc. bug fixes. - Use MOVHPDrm to load from m64 to upper half of a XMM register. llvm-svn: 27210	2006-03-28 02:43:26 +00:00
Evan Cheng	5df75889db	Model unpack lower and interleave as vector_shuffle so we can lower the intrinsics as such. llvm-svn: 27200	2006-03-28 00:39:58 +00:00
Jim Laskey	fa53b276d0	Translate llvm target registers to dwarf register numbers properly. llvm-svn: 27180	2006-03-27 20:18:45 +00:00
Chris Lattner	018e17c8de	unbreak the build llvm-svn: 27174	2006-03-27 16:52:45 +00:00
Evan Cheng	9b9cc4fb39	Use pcmpeq to generate vector of all ones. llvm-svn: 27167	2006-03-27 07:00:16 +00:00
Nate Begeman	ed728c1291	SelectionDAGISel can now natively handle Switch instructions, in the same manner that the LowerSwitch LLVM to LLVM pass does: emitting a binary search tree of basic blocks. The new approach has several advantages: it is faster, it generates significantly smaller code in many cases, and it paves the way for implementing dense switch tables as a jump table by handling switches directly in the instruction selector. This functionality is currently only enabled on x86, but should be safe for every target. In anticipation of making it the default, the cfg is now properly updated in the x86, ppc, and sparc select lowering code. llvm-svn: 27156	2006-03-27 01:32:24 +00:00
Nate Begeman	68cc9d4540	Readme note llvm-svn: 27152	2006-03-26 19:19:27 +00:00
Evan Cheng	ed6184aef2	Remove X86:isZeroVector, use ISD::isBuildVectorAllZeros instead; some fixes / cleanups llvm-svn: 27150	2006-03-26 09:53:12 +00:00
Evan Cheng	3e4d38eea5	Added missing (any_extend (load ...)) patterns. llvm-svn: 27120	2006-03-25 09:45:48 +00:00
Evan Cheng	2bc0941e2a	Build arbitrary vector with more than 2 distinct scalar elements with a series of unpack and interleave ops. llvm-svn: 27119	2006-03-25 09:37:23 +00:00
Chris Lattner	5d70a7c4a5	#include Intrinsics.h into all dag isels llvm-svn: 27109	2006-03-25 06:47:10 +00:00
Evan Cheng	79e500ec74	Added SSE cachebility ops llvm-svn: 27103	2006-03-25 06:03:26 +00:00
Evan Cheng	1aaa7280cd	Instruction encoding bug llvm-svn: 27102	2006-03-25 06:00:03 +00:00
Evan Cheng	6f7d31ea50	Added 128-bit packed integer subtraction. llvm-svn: 27096	2006-03-25 01:33:37 +00:00
Evan Cheng	8e481df625	Added CVTTPS2PI. llvm-svn: 27095	2006-03-25 01:31:59 +00:00
Evan Cheng	980c4d5b46	Added CVTSS2SI. llvm-svn: 27094	2006-03-25 01:00:18 +00:00
Evan Cheng	e7ee6a5e32	Support for scalar to vector with zero extension. llvm-svn: 27091	2006-03-24 23:15:12 +00:00
Evan Cheng	2f0277bf48	Added LDMXCSR llvm-svn: 27087	2006-03-24 22:28:37 +00:00
Chris Lattner	97599f1211	plug the intrinsics into the patterns for movmsk* llvm-svn: 27083	2006-03-24 21:49:18 +00:00
Jim Laskey	f0729b4067	Add dwarf register numbering to register data. llvm-svn: 27081	2006-03-24 21:15:58 +00:00
Evan Cheng	082c8785ef	Handle BUILD_VECTOR with all zero elements. llvm-svn: 27056	2006-03-24 07:29:27 +00:00
Chris Lattner	f5efddf80b	Gabor points out that we can't spell. :) llvm-svn: 27049	2006-03-24 07:12:19 +00:00
Evan Cheng	a91d8a5b43	All v2f64 shuffle cases can be handled. llvm-svn: 27044	2006-03-24 06:40:32 +00:00
Evan Cheng	2595a687da	More efficient v2f64 shuffle using movlhps, movhlps, unpckhpd, and unpcklpd. llvm-svn: 27040	2006-03-24 02:58:06 +00:00
Evan Cheng	6afb3c2de7	A new entry llvm-svn: 27039	2006-03-24 02:57:03 +00:00
Evan Cheng	d27fb3e85e	Handle more shuffle cases with SHUFP* instructions. llvm-svn: 27024	2006-03-24 01:18:28 +00:00
Evan Cheng	f842ea57bb	Typo llvm-svn: 26997	2006-03-23 20:26:04 +00:00
Jim Laskey	3c43609f1f	Add support to locate local variables in frames (early version.) llvm-svn: 26994	2006-03-23 18:12:57 +00:00
Jim Laskey	cf0166fbeb	Change interface to DwarfWriter. llvm-svn: 26991	2006-03-23 18:09:44 +00:00
Chris Lattner	ce0206e119	Fix the encodings of these new instructions, hopefully fixing the JIT failures from last night llvm-svn: 26981	2006-03-23 16:13:50 +00:00
Evan Cheng	82ed4a42f9	Following icc's lead: use movdqa to load / store 128-bit integer vectors llvm-svn: 26980	2006-03-23 07:44:07 +00:00
Chris Lattner	6f95ab7abb	Eliminate IntrinsicLowering from TargetMachine. Make the CBE and V9 backends create their own, since they're the only ones that use it. llvm-svn: 26974	2006-03-23 05:43:16 +00:00
Evan Cheng	7055878170	Add v4i32 <-> v4f32 bitconvert patterns. llvm-svn: 26969	2006-03-23 02:36:37 +00:00
Evan Cheng	b9b0550dc6	Add 128-bit integer vector load and add (for testing). llvm-svn: 26967	2006-03-23 01:57:24 +00:00
Nate Begeman	fb6e02931c	Add support for 8 bit immediates with 16/32 bit cmp instructions llvm-svn: 26966	2006-03-23 01:29:48 +00:00
Evan Cheng	021bb7c956	Added a ValueType operand to isShuffleMaskLegal(). For now, x86 will not do 64-bit vector shuffle. llvm-svn: 26964	2006-03-22 22:07:06 +00:00
Evan Cheng	ed794cd27b	SHUFP* are two address code. llvm-svn: 26959	2006-03-22 20:08:18 +00:00
Evan Cheng	bc04722860	Some clean up. llvm-svn: 26957	2006-03-22 19:22:18 +00:00
Evan Cheng	d4e1557941	- Supposely movlhps is faster / better than unpcklpd. - Don't forget pshufd is only available with sse2. llvm-svn: 26956	2006-03-22 19:16:21 +00:00
Evan Cheng	68ad48bd1a	- Implement X86ISelLowering::isShuffleMaskLegal(). We currently only support splat and PSHUFD cases. - Clean up shuffle / splat matching code. llvm-svn: 26954	2006-03-22 18:59:22 +00:00
Evan Cheng	8fdbdf20cd	- VECTOR_SHUFFLE of v4i32 / v4f32 with undef second vector always matches PSHUFD. We can make permutes entries which point to the undef pointing anything we want. - Change some names to appease Chris. llvm-svn: 26951	2006-03-22 08:01:21 +00:00
Evan Cheng	3617caf526	Fix PSHUF* and SHUF* jit code emission problems llvm-svn: 26949	2006-03-22 07:10:28 +00:00
Chris Lattner	f5e36c8bc0	fix a warning llvm-svn: 26941	2006-03-22 04:18:34 +00:00
Evan Cheng	d097e67544	Some splat and shuffle support. llvm-svn: 26940	2006-03-22 02:53:00 +00:00
Evan Cheng	b1d3c64d1f	Add a couple more pseudo instructions. llvm-svn: 26939	2006-03-22 02:52:03 +00:00
Evan Cheng	baea59c61c	Didn't mean to check this in. No MMX support yet. llvm-svn: 26933	2006-03-21 23:04:23 +00:00
Evan Cheng	d5e905d762	- Use movaps to store 128-bit vector integers. - Each scalar to vector v8i16 and v16i8 is a any_extend followed by a movd. llvm-svn: 26932	2006-03-21 23:01:21 +00:00
Chris Lattner	00f4683bf6	These targets don't support EXTRACT_VECTOR_ELT, though, in time, X86 will. llvm-svn: 26930	2006-03-21 20:51:05 +00:00
Evan Cheng	2d819f5fa4	Combine 2 entries llvm-svn: 26921	2006-03-21 07:18:26 +00:00
Evan Cheng	aeebc96099	Add a note about x86 register coallescing llvm-svn: 26920	2006-03-21 07:12:57 +00:00
Evan Cheng	1208d9179a	- Remove scalar to vector pseudo ops. They are just wrong. - Handle FR32 to VR128:v4f32 and FR64 to VR128:v2f64 with aliases of MOVAPS and MOVAPD. Mark them as move instructions and hope they will be deleted. llvm-svn: 26919	2006-03-21 07:09:35 +00:00
Evan Cheng	e4d1416239	x86 ISD::SCALAR_TO_VECTOR support. llvm-svn: 26911	2006-03-21 00:33:35 +00:00
Evan Cheng	fb872b41c0	Junk unused vector register classes. llvm-svn: 26910	2006-03-21 00:30:59 +00:00
Chris Lattner	80b6bd2746	Add a build_vector node llvm-svn: 26895	2006-03-20 06:18:01 +00:00
Evan Cheng	e6448448c2	Move a few things around. llvm-svn: 26893	2006-03-20 06:04:52 +00:00
Chris Lattner	d16f6fdd49	add a note with a testcase llvm-svn: 26877	2006-03-19 22:27:41 +00:00
Evan Cheng	f7c2e3628b	Vector undef's llvm-svn: 26870	2006-03-19 09:38:54 +00:00
Evan Cheng	5111c81a3c	Turning on LSR by default llvm-svn: 26861	2006-03-19 06:08:49 +00:00
Evan Cheng	66a9c0dea7	Remember which tests are hurt by LSR. llvm-svn: 26860	2006-03-19 06:08:11 +00:00
Chris Lattner	f7b6e7212f	rename these nodes llvm-svn: 26848	2006-03-19 01:13:28 +00:00
Evan Cheng	9bf978dc20	Use the generic vector register classes VR64 / VR128 rather than V4F32, V8I16, etc. llvm-svn: 26838	2006-03-18 01:23:20 +00:00
Evan Cheng	b09a56f3a4	Darwin should use _setjmp/_longjmp instead of setjmp/longjmp. llvm-svn: 26833	2006-03-17 20:31:41 +00:00
Evan Cheng	4f674921d6	Move some pattern fragments to the right files. llvm-svn: 26831	2006-03-17 19:55:52 +00:00
Chris Lattner	388fc4d9fb	Disable x86 fastcc from passing args in registers llvm-svn: 26824	2006-03-17 17:27:47 +00:00
Chris Lattner	43798850f9	Parameterize the number of integer arguments to pass in registers llvm-svn: 26818	2006-03-17 05:10:20 +00:00
Evan Cheng	bfc2e97383	Also fold MOV8r0, MOV16r0, MOV32r0 + store to MOV8mi, MOV16mi, and MOV32mi. llvm-svn: 26817	2006-03-17 02:36:22 +00:00
Evan Cheng	aca7915b70	Add some missing entries to X86RegisterInfo::foldMemoryOperand(). e.g. ADD32ri8. llvm-svn: 26816	2006-03-17 02:25:01 +00:00
Evan Cheng	27750f3287	- Nuke 16-bit SBB instructions. We'll never use them. - Nuke a bogus comment. llvm-svn: 26815	2006-03-17 02:24:04 +00:00
Nate Begeman	bb01d4f272	Remove BRTWOWAY* Make the PPC backend not dependent on BRTWOWAY_CC and make the branch selector smarter about the code it generates, fixing a case in the readme. llvm-svn: 26814	2006-03-17 01:40:33 +00:00
Evan Cheng	c11fcceec5	A new entry. llvm-svn: 26810	2006-03-16 22:44:22 +00:00
Evan Cheng	f75555feb9	Bug fix: condition inverted. llvm-svn: 26804	2006-03-16 22:02:48 +00:00
Evan Cheng	20931a798e	Added a way for TargetLowering to specify what values can be used as the scale component of the target addressing mode. llvm-svn: 26802	2006-03-16 21:47:42 +00:00
Evan Cheng	2dd2c652b2	Added getTargetLowering() to TargetMachine. Refactored targets to support this. llvm-svn: 26742	2006-03-13 23:20:37 +00:00
Evan Cheng	af598d2461	Add LSR hooks. llvm-svn: 26740	2006-03-13 23:18:16 +00:00
Evan Cheng	306c13a8fb	Add option -enable-x86-lsr to enable x86 loop strength reduction pass. llvm-svn: 26665	2006-03-09 21:51:28 +00:00
Chris Lattner	920e661e50	a couple of miscellaneous things. llvm-svn: 26625	2006-03-09 01:39:46 +00:00
Evan Cheng	70b25efa57	X86ISD::REP_STOS and X86ISD::REP_MOVS now produces a flag. llvm-svn: 26604	2006-03-07 23:34:23 +00:00
Evan Cheng	adc7093fc1	Use rep/stosl; and Count 0x3; rep/stosb for memset with 4 byte aligned dest. and variable value. Similarly for memcpy. llvm-svn: 26603	2006-03-07 23:29:39 +00:00
Jim Laskey	313570fb17	Use "llvm.metadata" section for debug globals. Filter out these globals in the asm printer. llvm-svn: 26599	2006-03-07 22:00:35 +00:00
Evan Cheng	a4a4ceb478	- Emit subsections_via_symbols for Darwin. - Conditionalize Dwarf debugging output (Darwin only for now). llvm-svn: 26582	2006-03-07 02:23:26 +00:00
Evan Cheng	30d7b70b73	Enable Dwarf debugging info. llvm-svn: 26581	2006-03-07 02:02:57 +00:00
Chris Lattner	9c7f50376a	Copysign needs to be expanded everywhere. Note that Alpha and IA64 should implement copysign as a native op if they have it. llvm-svn: 26541	2006-03-05 05:08:37 +00:00
Chris Lattner	c2dd7aae71	add a note for something evan noticed llvm-svn: 26539	2006-03-05 01:15:18 +00:00
Evan Cheng	c66fd44541	Add an entry llvm-svn: 26520	2006-03-04 07:49:50 +00:00
Evan Cheng	6dc73297c3	MEMSET / MEMCPY lowering bugs: we can't issue a single WORD / DWORD version of rep/stos and rep/mov if the count is not a constant. We could do rep/stosl; and $count, 3; rep/stosb For now, I will lower them to memset / memcpy calls. We will revisit this after a little bit experiment. Also need to take care of the trailing bytes even if the count is a constant. Since the max. number of trailing bytes are 3, we will simply issue loads / stores. llvm-svn: 26517	2006-03-04 02:48:56 +00:00
Evan Cheng	084a102b17	Typo llvm-svn: 26512	2006-03-04 01:12:00 +00:00
Chris Lattner	ad3c974a77	remove the read/write port/io intrinsics. llvm-svn: 26479	2006-03-03 00:19:58 +00:00
Evan Cheng	1926427351	Vector op lowering. llvm-svn: 26438	2006-03-01 01:11:20 +00:00
Evan Cheng	0e69f45b07	Another entry. llvm-svn: 26430	2006-02-28 23:38:49 +00:00
Evan Cheng	990c3602bd	Don't match x << 1 to LEAL. It's better to emit x + x. llvm-svn: 26429	2006-02-28 21:13:57 +00:00
Evan Cheng	877ab55e06	ConstantPoolIndex is now the displacement portion of the address (rather than base). llvm-svn: 26382	2006-02-26 09:12:34 +00:00
Evan Cheng	75b8783aaf	Fixed ConstantPoolIndex operand asm print bug. This fixed 2005-07-17-INT-To-FP and 2005-05-12-Int64ToFP. llvm-svn: 26380	2006-02-26 08:28:12 +00:00
Evan Cheng	77d86ff8fc	* Cleaned up addressing mode matching code. * Cleaned up and tweaked LEA cost analysis code. Removed some hacks. * Handle ADD $X, c to MOV32ri $X+c. These patterns cannot be autogen'd and they need to be matched before LEA. llvm-svn: 26376	2006-02-25 10:09:08 +00:00
Evan Cheng	1c557bfeb5	Updates. llvm-svn: 26375	2006-02-25 10:04:07 +00:00
Evan Cheng	1fac3b3360	* Allow mul, shl nodes to be codegen'd as LEA (if appropriate). * Add patterns to handle GlobalAddress, ConstantPool, etc. MOV32ri to materialize these nodes in registers. ADD32ri to handle %reg + GA, etc. MOV32mi to handle store GA, etc. to memory. llvm-svn: 26374	2006-02-25 10:02:21 +00:00
Evan Cheng	e4a8b74e4f	ConstantPoolIndex is now the displacement field of addressing mode. llvm-svn: 26373	2006-02-25 09:56:50 +00:00
Evan Cheng	994700101e	Added a common about the need for X86ISD::Wrapper. llvm-svn: 26372	2006-02-25 09:55:19 +00:00
Evan Cheng	ed169db8a5	Added an offset field to ConstantPoolSDNode. llvm-svn: 26371	2006-02-25 09:54:52 +00:00
Evan Cheng	42d5ac557c	Fix an obvious bug exposed when we are doing ADD X, 4 ==> MOV32ri $X+4, ... llvm-svn: 26366	2006-02-25 01:37:02 +00:00
Evan Cheng	e0ed6ec13f	- Clean up the lowering and selection code of ConstantPool, GlobalAddress, and ExternalSymbol. - Use C++ code (rather than tblgen'd selection code) to match the above mentioned leaf nodes. Do not mutate and nodes and do not record the selection in CodeGenMap. These nodes should be safe to duplicate. This is a performance win. llvm-svn: 26335	2006-02-23 20:41:18 +00:00
Chris Lattner	16f08f53b1	"." isn't enough to get a private label on linux, use ".L". llvm-svn: 26327	2006-02-23 05:25:02 +00:00
Chris Lattner	2bacf981bf	add a small and simple case. llvm-svn: 26326	2006-02-23 05:17:43 +00:00
Evan Cheng	f4448cee66	A couple of new entries. llvm-svn: 26325	2006-02-23 02:50:21 +00:00
Evan Cheng	1f342c2884	PIC related bug fixes. 1. Various asm printer bug. 2. Lowering bug. Now TargetGlobalAddress is wrapped in X86ISD::TGAWrapper. llvm-svn: 26324	2006-02-23 02:43:52 +00:00
Evan Cheng	7eabbfd618	X86 codegen tweak to use lea in another case: Suppose base == %eax and it has multiple uses, then instead of movl %eax, %ecx addl $8, %ecx use leal 8(%eax), %ecx. llvm-svn: 26323	2006-02-23 00:13:58 +00:00
Evan Cheng	7714a59d91	Missing .globl for weak / link-once .text symbols. llvm-svn: 26321	2006-02-22 23:59:57 +00:00
Evan Cheng	73136dfecc	- Added option -relocation-model to set relocation model. Valid values include static, pic, dynamic-no-pic, and default. PPC and x86 default is dynamic-no-pic for Darwin, pic for others. - Removed options -enable-pic and -ppc-static. llvm-svn: 26315	2006-02-22 20:19:42 +00:00
Evan Cheng	9e252e3bcf	Added MMX, SSE1, and SSE2 vector instructions and some simple patterns. Fixed some existing bugs (wrong predicates, prefixes) at the same time. llvm-svn: 26310	2006-02-22 02:26:30 +00:00
Chris Lattner	7ad77dfc2a	split register class handling from explicit physreg handling. llvm-svn: 26308	2006-02-22 00:56:39 +00:00
Chris Lattner	7bb4696dc3	Updates to match change of getRegForInlineAsmConstraint prototype llvm-svn: 26305	2006-02-21 23:11:00 +00:00
Evan Cheng	d58478161f	One more round of reorg so sabre doesn't freak out. :-) llvm-svn: 26303	2006-02-21 20:00:20 +00:00
Evan Cheng	6fc1162855	A big more cleaning up. llvm-svn: 26302	2006-02-21 19:30:30 +00:00
Evan Cheng	8711b6bff3	Moving things to their proper places. llvm-svn: 26301	2006-02-21 19:26:52 +00:00
Evan Cheng	6e595b9fd8	Split instruction info into multiple files, one for each of x87, MMX, and SSE. llvm-svn: 26300	2006-02-21 19:13:53 +00:00
Evan Cheng	d57203c0a1	Added separate alias instructions for SSE logical ops that operate on non-packed types. llvm-svn: 26297	2006-02-21 02:24:38 +00:00
Evan Cheng	afffe63fc1	Added MMX and XMM packed integer move instructions, movd and movq. llvm-svn: 26296	2006-02-21 01:39:57 +00:00
Evan Cheng	fa57a0add9	Added SSE2 128-bit integer packed types: V16I8, V8I16, V4I32, and V2I64. Added generic vector types: VR64 and VR128. llvm-svn: 26295	2006-02-21 01:38:21 +00:00
Evan Cheng	43070b7541	Added x86 integer vector types: 64-bit packed byte integer (v16i8), 64-bit packed word integer (v8i16), and 64-bit packed doubleword integer (v2i32). llvm-svn: 26294	2006-02-20 22:34:53 +00:00
Evan Cheng	4547400ae2	Some updates llvm-svn: 26292	2006-02-20 19:58:27 +00:00
Evan Cheng	d13778eb30	If SSE3 is available, promote FP_TO_UINT i32 to FP_TO_SINT i64 to take advantage of fisttpll. llvm-svn: 26288	2006-02-18 07:26:17 +00:00
Evan Cheng	70af620709	Added fisttp for fp to int conversion. llvm-svn: 26283	2006-02-18 02:36:28 +00:00
Evan Cheng	06c2e6d1b3	Disable PIC for JIT. llvm-svn: 26281	2006-02-18 01:49:25 +00:00
Evan Cheng	5caed8a231	Jit does not support PIC yet. llvm-svn: 26278	2006-02-18 00:57:10 +00:00
Evan Cheng	5588de9415	x86 / Darwin PIC support. llvm-svn: 26273	2006-02-18 00:15:05 +00:00
Chris Lattner	07a2677e43	unbreak the build llvm-svn: 26260	2006-02-17 07:09:27 +00:00
Evan Cheng	593bea73ba	Unbreak x86 be llvm-svn: 26259	2006-02-17 07:01:52 +00:00
Nate Begeman	5965bd19f8	kill ADD_PARTS & SUB_PARTS and replace them with fancy new ADDC, ADDE, SUBC and SUBE nodes that actually expose what's going on and allow for significant simplifications in the targets. llvm-svn: 26255	2006-02-17 05:43:56 +00:00
Chris Lattner	67c21b6c46	add note about div by power of 2 llvm-svn: 26253	2006-02-17 04:20:13 +00:00
Evan Cheng	b590d3a72b	Remind ourselves to revisit the "pxor vs. xorps/xorpd to clear XMM registers" issue. Need to do more experiments. llvm-svn: 26247	2006-02-17 00:04:28 +00:00
Nate Begeman	7e5496d5fe	Kill the x86 pattern isel. boom. llvm-svn: 26246	2006-02-17 00:03:04 +00:00
Evan Cheng	db1dbbe8d6	Remove the entry about using movapd for SSE reg-reg moves. llvm-svn: 26245	2006-02-17 00:00:58 +00:00
Evan Cheng	eb7b3380fd	pxor (for FLD0SS) encoding was missing the OpSize prefix. llvm-svn: 26244	2006-02-16 23:59:30 +00:00
Evan Cheng	24c461b51e	1. Use pxor instead of xoraps / xorapd to clear FR32 / FR64 registers. This proves to be worth 20% on Ptrdist/ks. Might be related to dependency breaking support. 2. Added FsMOVAPSrr and FsMOVAPDrr as aliases to MOVAPSrr and MOVAPDrr. These are used for FR32 / FR64 reg-to-reg copies. 3. Tell reg-allocator to generate MOVSSrm / MOVSDrm and MOVSSmr / MOVSDmr to spill / restore FsMOVAPSrr and FsMOVAPDrr. llvm-svn: 26241	2006-02-16 22:45:17 +00:00
Evan Cheng	3f99628939	Use movaps / movapd to spill / restore V4F4 / V2F8 registers. llvm-svn: 26240	2006-02-16 21:20:26 +00:00
Nate Begeman	8a77efe4f7	Rework the SelectionDAG-based implementations of SimplifyDemandedBits and ComputeMaskedBits to match the new improved versions in instcombine. Tested against all of multisource/benchmarks on ppc. llvm-svn: 26238	2006-02-16 21:11:51 +00:00
Evan Cheng	01afec2adb	MOVAPSrr and MOVAPDrr instruction format should be MRMSrcReg. llvm-svn: 26234	2006-02-16 19:34:41 +00:00
Evan Cheng	ae82498e81	Use movaps / movapd (instead of movss / movsd) to do FR32 / FR64 reg to reg transfer. According to the Intel P4 Optimization Manual: Moves that write a portion of a register can introduce unwanted dependences. The movsd reg, reg instruction writes only the bottom 64 bits of a register, not to all 128 bits. This introduces a dependence on the preceding instruction that produces the upper 64 bits (even if those bits are not longer wanted). The dependence inhibits register renaming, and thereby reduces parallelism. Not to mention movaps is shorter than movss. llvm-svn: 26226	2006-02-16 01:50:02 +00:00
Evan Cheng	03c1e6f48e	A bit more memset / memcpy optimization. Turns them into calls to memset / memcpy if 1) buffer(s) are not DWORD aligned, 2) size is not known to be greater or equal to some minimum value (currently 128). llvm-svn: 26224	2006-02-16 00:21:07 +00:00
Evan Cheng	7a6c21ac26	Remove an entry. llvm-svn: 26197	2006-02-15 01:56:48 +00:00
Evan Cheng	2d23c9f1ab	Use .zerofill on x86/darwin. llvm-svn: 26196	2006-02-15 01:56:23 +00:00
Evan Cheng	aacc4c3b4c	cvtsd2ss / cvtss2sd encoding bug. llvm-svn: 26193	2006-02-15 00:31:03 +00:00
Evan Cheng	665c26ab40	movaps, movapd encoding bug. llvm-svn: 26192	2006-02-15 00:11:37 +00:00
Chris Lattner	e3c793a71a	new note llvm-svn: 26186	2006-02-14 22:19:54 +00:00
Evan Cheng	f84774ed46	Don't special case XS, XD prefixes. llvm-svn: 26183	2006-02-14 21:52:51 +00:00
Evan Cheng	fb7b5ef74b	Bug fix: XS, XD prefixes were being emitted twice. XMM registers were not being handled. llvm-svn: 26182	2006-02-14 21:45:24 +00:00
Evan Cheng	43b72f4421	Duh llvm-svn: 26180	2006-02-14 20:37:37 +00:00
Evan Cheng	ad8c20cd2b	Remove -disable-x86-sse llvm-svn: 26179	2006-02-14 20:30:14 +00:00
Evan Cheng	4b40a42653	Rename maxStoresPerMemSet to maxStoresPerMemset, etc. llvm-svn: 26174	2006-02-14 08:38:30 +00:00
Evan Cheng	f976d79f78	Add a entry. llvm-svn: 26173	2006-02-14 08:25:32 +00:00
Evan Cheng	6a37456d73	Set maxStoresPerMemSet to 16. Ditto for maxStoresPerMemCpy and maxStoresPerMemMove. Although the last one is not used. llvm-svn: 26172	2006-02-14 08:25:08 +00:00
Evan Cheng	40b6eb9973	Enable SSE (for the right subtargets) llvm-svn: 26169	2006-02-14 08:07:58 +00:00
Chris Lattner	62c3484e43	Switch targets over to using SelectionDAG::getCALLSEQ_START to create CALLSEQ_START nodes. llvm-svn: 26143	2006-02-13 09:00:43 +00:00
Evan Cheng	a86ba85dc5	Prevent certain nodes that have already been selected from being folded into X86 addressing mode. Currently we do not allow any node whose target node produces a chain as well as any node that is at the root of the addressing mode expression tree. llvm-svn: 26117	2006-02-11 02:05:36 +00:00
Evan Cheng	2b6f78b664	Nicer code. :-) llvm-svn: 26111	2006-02-10 22:46:26 +00:00
Evan Cheng	d49cc3634e	Added X86 isel debugging stuff. llvm-svn: 26110	2006-02-10 22:24:32 +00:00
Evan Cheng	907be3e24c	Remove a completed entry; add a new entry about fisttp op llvm-svn: 26105	2006-02-10 05:48:15 +00:00
Evan Cheng	101e4b916a	Match tblgen change. llvm-svn: 26096	2006-02-09 22:12:53 +00:00
Evan Cheng	d1b82d8db0	Match getTargetNode() changes (now return SDNode* instead of SDOperand). llvm-svn: 26085	2006-02-09 07:17:49 +00:00
Chris Lattner	d94a3d2c8a	provide an explicit alignment for cp entries llvm-svn: 26069	2006-02-09 02:15:30 +00:00
Evan Cheng	6dc90ca172	Change Select() from SDOperand Select(SDOperand N); to void Select(SDOperand &Result, SDOperand N); llvm-svn: 26067	2006-02-09 00:37:58 +00:00
Chris Lattner	2e07d6370a	Darwin doesn't support #APP/#NO_APP llvm-svn: 26066	2006-02-08 23:42:22 +00:00
Chris Lattner	b4fc050f0f	add a simple optimization llvm-svn: 26062	2006-02-08 17:47:22 +00:00
Chris Lattner	b7e074ab9b	more email -> README moving llvm-svn: 26054	2006-02-08 07:12:07 +00:00
Chris Lattner	45bb34b715	Add some random notes, not high-prio llvm-svn: 26052	2006-02-08 06:52:06 +00:00
Evan Cheng	adeb8fb5a2	Fixed a local common symbol bug. llvm-svn: 26044	2006-02-07 23:32:58 +00:00
Evan Cheng	ec212fb66d	For ELF, .comm takes alignment value as the optional 3rd argument. It must be specified in bytes. llvm-svn: 26043	2006-02-07 21:54:08 +00:00
Evan Cheng	5a76680de1	Darwin ABI issues: weak, linkonce, etc. dynamic-no-pic support is complete. Also fixed a function stub bug. Added weak and linkonce support for x86 Linux. llvm-svn: 26038	2006-02-07 08:38:37 +00:00
Evan Cheng	227e469c25	Remind myself to add PIC and static asm printer support. llvm-svn: 26037	2006-02-07 08:35:44 +00:00
Chris Lattner	d62a3bfa66	Eliminate the printCallOperand method, using a 'call' modifier on printOperand instead. llvm-svn: 26025	2006-02-06 23:41:19 +00:00
Evan Cheng	d5f2ba0d6f	- Update load folding checks to match those auto-generated by tblgen. - Manually select SDOperand's returned by TryFoldLoad which make up the load address. llvm-svn: 26012	2006-02-06 06:02:33 +00:00
Evan Cheng	54cb1833a4	Use SelectRoot() as entry of any tblgen based isel. llvm-svn: 25997	2006-02-05 06:46:41 +00:00
Evan Cheng	d19d51f414	Re-commit the last bit of change that was backed out. llvm-svn: 25983	2006-02-05 05:25:07 +00:00
Chris Lattner	c070cb685d	Use getPreferredAlignmentLog. llvm-svn: 25980	2006-02-05 01:45:04 +00:00
Chris Lattner	22b4edfb42	Temporarily revert this patch, which probably breaks with the tblgen patch reverted. llvm-svn: 25971	2006-02-04 09:24:16 +00:00
Evan Cheng	ce87cac555	Complex pattern's custom matcher should not call Select() on any operands. Select them afterwards if it returns true. llvm-svn: 25968	2006-02-04 08:50:49 +00:00
Evan Cheng	0a977c95aa	Remove an unnecessary predicate. llvm-svn: 25954	2006-02-04 02:23:01 +00:00
Evan Cheng	11613a5219	Separate FILD and FILD_FLAG, the later is only used for SSE2. It produces a flag so it can be flagged to a FST. llvm-svn: 25953	2006-02-04 02:20:30 +00:00
Chris Lattner	a1d312c6ea	remove an old comment llvm-svn: 25940	2006-02-03 18:59:39 +00:00
Chris Lattner	23d55f2547	Remove the X86PeepholeOptimizerPass, a truly horrible old hack that is now obsolete. yaay :) llvm-svn: 25939	2006-02-03 18:54:24 +00:00
Chris Lattner	c408558638	When rewriting frame instructions, emit the appropriate small-immediate instruction when possible. llvm-svn: 25938	2006-02-03 18:20:04 +00:00
Chris Lattner	a23b04acdb	remove some target-indep and implemented notes llvm-svn: 25930	2006-02-03 06:22:11 +00:00
Chris Lattner	a1eac9b978	the X86 backend no longer needs to delete its own noop copies llvm-svn: 25923	2006-02-03 02:59:58 +00:00
Chris Lattner	5123346708	fix operand numbers llvm-svn: 25915	2006-02-02 20:38:12 +00:00
Chris Lattner	bb53acd03c	Move isLoadFrom/StoreToStackSlot from MRegisterInfo to TargetInstrInfo,a far more logical place. Other methods should also be moved if anyoneis interested. :) llvm-svn: 25913	2006-02-02 20:12:32 +00:00
Chris Lattner	246ee44c8f	implement isStoreToStackSlot llvm-svn: 25911	2006-02-02 20:00:41 +00:00
Chris Lattner	0acc90c67e	add a method llvm-svn: 25910	2006-02-02 19:57:16 +00:00
Chris Lattner	d8208c3665	more notes llvm-svn: 25908	2006-02-02 19:43:28 +00:00
Chris Lattner	d3f033e8e0	add a note, I have no idea how important this is. llvm-svn: 25907	2006-02-02 19:16:34 +00:00
Chris Lattner	4b2ec8af23	implemented, testcase here: test/Regression/CodeGen/X86/compare-add.ll llvm-svn: 25899	2006-02-02 06:36:48 +00:00
Evan Cheng	d3908f79cb	Update. llvm-svn: 25896	2006-02-02 02:40:17 +00:00
Evan Cheng	d8fba3a1ee	Fix a erroneous comment. llvm-svn: 25894	2006-02-02 00:28:23 +00:00
Chris Lattner	6132a87cf4	more notes llvm-svn: 25890	2006-02-01 23:38:08 +00:00
Evan Cheng	b3ea2677a4	Tell codegen MOVAPSrr and MOVAPDrr are copies. llvm-svn: 25889	2006-02-01 23:03:16 +00:00
Evan Cheng	f1ed826c2a	Added SSE entries to foldMemoryOperand(). llvm-svn: 25888	2006-02-01 23:02:25 +00:00
Evan Cheng	8b40cde148	Rearrange code to my liking. :) llvm-svn: 25887	2006-02-01 23:01:57 +00:00
Chris Lattner	2f7650f9dc	another note llvm-svn: 25883	2006-02-01 21:44:48 +00:00
Nate Begeman	7e7f439f85	Fix some of the stuff in the PPC README file, and clean up legalization of the SELECT_CC, BR_CC, and BRTWOWAY_CC nodes. llvm-svn: 25875	2006-02-01 07:19:44 +00:00
Chris Lattner	3da1bb520e	add a note, I'll take care of this after nate commits his big patch llvm-svn: 25873	2006-02-01 06:40:32 +00:00
Evan Cheng	9e350cd6ad	- Use xor to clear integer registers (set R, 0). - Added a new format for instructions where the source register is implied and it is same as the destination register. Used for pseudo instructions that clear the destination register. llvm-svn: 25872	2006-02-01 06:13:50 +00:00
Evan Cheng	c404b5748c	Remove another entry. llvm-svn: 25871	2006-02-01 06:08:48 +00:00
Chris Lattner	b0a76b0981	Another regression from the pattern isel llvm-svn: 25867	2006-02-01 01:44:25 +00:00
Evan Cheng	a24617f5d4	Return's chain should be matching either the chain produced by the value or the chain going into the load. llvm-svn: 25863	2006-02-01 01:19:32 +00:00
Evan Cheng	e1ce4d7115	When folding a load into a return of SSE value, check the chain to ensure the memory location has not been clobbered. llvm-svn: 25861	2006-02-01 00:20:21 +00:00
Evan Cheng	bc1fcd074e	Remove an item. It's done. llvm-svn: 25860	2006-02-01 00:15:53 +00:00
Evan Cheng	5659ca8f47	Be smarter about whether to store the SSE return value in memory. If it is already available in memory, do a fld directly from there. llvm-svn: 25859	2006-01-31 23:19:54 +00:00
Chris Lattner	64387c3e9c	turning these into 'adds' would require extra copies llvm-svn: 25858	2006-01-31 22:59:46 +00:00
Evan Cheng	72d5c256c9	- Allow XMM load (for scalar use) to be folded into ANDP* and XORP. - Use XORP to implement fneg. llvm-svn: 25857	2006-01-31 22:28:30 +00:00
Evan Cheng	a91eb48547	Remove entries on fabs and fneg. These are done. llvm-svn: 25856	2006-01-31 22:26:21 +00:00
Chris Lattner	c642aa5e1c	* Fix 80-column violations * Rename hasSSE -> hasSSE1 to avoid my continual confusion with 'has any SSE'. * Add inline asm constraint specification. llvm-svn: 25854	2006-01-31 19:43:35 +00:00
Evan Cheng	2dd217b88f	Added custom lowering of fabs llvm-svn: 25831	2006-01-31 03:14:29 +00:00
Chris Lattner	d916e78b0a	Another high-prio selection performance bug llvm-svn: 25828	2006-01-31 02:10:06 +00:00
Chris Lattner	2b70a6f853	more mumbling llvm-svn: 25826	2006-01-31 00:45:37 +00:00
Chris Lattner	b521361fb9	add some notes llvm-svn: 25825	2006-01-31 00:20:38 +00:00
Evan Cheng	45df7f84ff	Don't generate complex sequence for SETOLE, SETOLT, SETULT, and SETUGT. Flip the order of the compare operands and generate SETOGT, SETOGE, SETUGE, and SETULE instead. llvm-svn: 25824	2006-01-30 23:41:35 +00:00
Evan Cheng	08390f6a21	i64 -> f32, f32 -> i64 and some clean up. llvm-svn: 25818	2006-01-30 22:13:22 +00:00
Evan Cheng	5b97fcf0f5	Always use FP stack instructions to perform i64 to f64 as well as f64 to i64 conversions. SSE does not have instructions to handle these tasks. llvm-svn: 25817	2006-01-30 08:02:57 +00:00
Chris Lattner	f0b24d2dc0	Move MaskedValueIsZero from the DAGCombiner to the TargetLowering interface,making isMaskedValueZeroForTargetNode simpler, and useable from other partsof the compiler. llvm-svn: 25803	2006-01-30 04:09:27 +00:00
Chris Lattner	c6fa0282d2	adjust prototype llvm-svn: 25798	2006-01-30 03:49:07 +00:00
Chris Lattner	3c6a950653	add another note llvm-svn: 25789	2006-01-29 09:46:06 +00:00
Chris Lattner	dabee1f655	add some performance notes from looking at sgefa llvm-svn: 25788	2006-01-29 09:42:20 +00:00
Chris Lattner	7c7cbde0e5	add a high-priority SSE issue from sgefa llvm-svn: 25787	2006-01-29 09:14:47 +00:00
Chris Lattner	5a7a22c9dd	add a missed optimization llvm-svn: 25786	2006-01-29 09:08:15 +00:00
Reid Spencer	0c05a2c99c	Add a note about lowering llvm.memset, llvm.memcpy, and llvm.memmove to a few stores under certain conditions. llvm-svn: 25777	2006-01-29 06:48:25 +00:00
Chris Lattner	35d20a4c00	remove now-dead code, the legalizer takes care of this for us llvm-svn: 25776	2006-01-29 06:45:31 +00:00
Chris Lattner	132177e103	The FP stack doesn't support UNDEF, ask the legalizer to legalize it instead of lying and saying we have it. llvm-svn: 25775	2006-01-29 06:44:22 +00:00
Chris Lattner	61c9a8e942	Targets all now request ConstantFP to be legalized into TargetConstantFP. 'fpimm' in .td files is now TargetConstantFP. llvm-svn: 25771	2006-01-29 06:26:08 +00:00
Jeff Cohen	4ab39e43e8	Fix typo. llvm-svn: 25760	2006-01-29 03:45:35 +00:00
Jeff Cohen	8643ea67b1	Flesh out AMD family/models. llvm-svn: 25755	2006-01-28 20:30:18 +00:00
Jeff Cohen	58ca0be9af	Correctly determine CPU vendor. llvm-svn: 25754	2006-01-28 19:48:34 +00:00

... 3 4 5 6 7 ...

1788 Commits