llvm-project

Commit Graph

Author	SHA1	Message	Date
Dale Johannesen	74503f0b31	Add OpSize to 16-bit ADC and SBB. llvm-svn: 72045	2009-05-18 21:41:59 +00:00
Dale Johannesen	1df0e80380	Fill in the missing patterns for ADC and SBB. Some comment cleanup. llvm-svn: 72022	2009-05-18 17:44:15 +00:00
Dan Gohman	faf75c8c9a	Convert a subtract into a negate and an add when it helps x86 address folding. llvm-svn: 71446	2009-05-11 18:02:53 +00:00
Chris Lattner	be9fa506ad	Add basic support for code generation of addrspace(257) -> FS relative on x86. Patch by Zoltan Varga! llvm-svn: 70992	2009-05-05 18:52:19 +00:00
Dan Gohman	db3a57ec5c	Set mayLoad on MOVZX32_NOREXrm8 too. llvm-svn: 70466	2009-04-30 03:11:48 +00:00
Evan Cheng	99578674fd	Mark MOV8mr_NOREX and MOV8rm_NOREX as mayStore / mayLoad respectively. llvm-svn: 70461	2009-04-30 00:58:57 +00:00
Nate Begeman	8d6d4b9289	2nd attempt, fixing SSE4.1 issues and implementing feedback from duncan. PR2957 ISD::VECTOR_SHUFFLE now stores an array of integers representing the shuffle mask internal to the node, rather than taking a BUILD_VECTOR of ConstantSDNodes as the shuffle mask. A value of -1 represents UNDEF. In addition to eliminating the creation of illegal BUILD_VECTORS just to represent shuffle masks, we are better about canonicalizing the shuffle mask, resulting in substantially better code for some classes of shuffles. llvm-svn: 70225	2009-04-27 18:41:29 +00:00
Dan Gohman	2986972118	Rename GR8_ABCD to GR8_ABCD_L and create GR8_ABCD_H, and use these to precisely describe the h-register subreg register classes. Thanks to Jakob Stoklund Olesen for spotting this and for the initial patch! Also, make getStoreRegOpcode and getLoadRegOpcode aware of the needs of h registers. llvm-svn: 70211	2009-04-27 16:41:36 +00:00
Dan Gohman	ec542ca65e	Rename GR8_, GR16_, GR32_, and GR64_ to GR8_ABCD, GR16_ABCD, GR32_ABCD, and GR64_ABCD, respectively, to help describe them. llvm-svn: 70210	2009-04-27 16:33:14 +00:00
Dan Gohman	ba99bddf1f	Break up long multi-mnemonic strings into separate lines for readability. llvm-svn: 70209	2009-04-27 15:13:28 +00:00
Mon P Wang	e15bf109be	Revised 68749 to allow matching of load/stores for address spaces < 256. llvm-svn: 70197	2009-04-27 07:22:10 +00:00
Rafael Espindola	c1396a2313	Fix PR 4004 by including the call to __tls_get_addr in X86tlsaddr. This is not very elegant, but neither is the tls specification :-( llvm-svn: 69968	2009-04-24 12:59:40 +00:00
Rafael Espindola	b93db668b3	Revert 69952. Causes testsuite failures on linux x86-64. llvm-svn: 69967	2009-04-24 12:40:33 +00:00
Nate Begeman	bb881d66f4	PR2957 ISD::VECTOR_SHUFFLE now stores an array of integers representing the shuffle mask internal to the node, rather than taking a BUILD_VECTOR of ConstantSDNodes as the shuffle mask. A value of -1 represents UNDEF. In addition to eliminating the creation of illegal BUILD_VECTORS just to represent shuffle masks, we are better about canonicalizing the shuffle mask, resulting in substantially better code for some classes of shuffles. A clean up of x86 shuffle code, and some canonicalizing in DAGCombiner is next. llvm-svn: 69952	2009-04-24 03:42:54 +00:00
Rafael Espindola	47ed1f5293	TLS_addr64 and TLS_addr32 define RDI and EAX. They don't use them. This fixes PR4002. llvm-svn: 69672	2009-04-21 08:22:09 +00:00
Rafael Espindola	355fe12c82	For general dynamic TLS access we must use leaq foo@TLSGD(%rip), %rdi as part of the instruction sequence. Using a register other than %rdi and then copying it to %rdi is not valid. llvm-svn: 69350	2009-04-17 14:35:58 +00:00
Dan Gohman	de7b3e74be	Fix 80-column violations. llvm-svn: 69204	2009-04-15 19:48:57 +00:00
Dan Gohman	7913ea5e4a	Add a new MOV8rr_NOREX, and make X86's copyRegToReg use it when either the source or destination is a physical h register. This fixes sqlite3 with the post-RA scheduler enabled. llvm-svn: 69111	2009-04-15 00:04:23 +00:00
Dan Gohman	6c1426308c	Rename COPY_TO_SUBCLASS to COPY_TO_REGCLASS, and generalize it accordingly. Thanks to Jakob Stoklund Olesen for pointing out how this might be useful. llvm-svn: 68986	2009-04-13 21:06:25 +00:00
Dan Gohman	57d6bd36b2	Implement x86 h-register extract support. - Add patterns for h-register extract, which avoids a shift and mask, and in some cases a temporary register. - Add address-mode matching for turning (X>>(8-n))&(255<<n), where n is a valid address-mode scale value, into an h-register extract and a scaled-offset address. - Replace X86's MOV32to32_ and related instructions with the new target-independent COPY_TO_SUBREG instruction. On x86-64 there are complicated constraints on h registers, and CodeGen doesn't currently provide a high-level way to express all of them, so they are handled with a bunch of special code. This code currently only supports extracts where the result is used by a zero-extend or a store, though these are fairly common. These transformations are not always beneficial; since there are only 4 h registers, they sometimes require extra move instructions, and this sometimes increases register pressure because it can force out values that would otherwise be in one of those registers. However, this appears to be relatively uncommon. llvm-svn: 68962	2009-04-13 16:09:41 +00:00
Chris Lattner	428f71623b	a few fixes to "addrspace(256) is reference offset of GS segment register". It turns out that there are still several problems with this, will file a bugzilla. llvm-svn: 68749	2009-04-10 00:16:23 +00:00
Rafael Espindola	3b2df10c9e	Re-apply 68552. Tested by bootstrapping llvm-gcc and using that to build llvm. llvm-svn: 68645	2009-04-08 21:14:34 +00:00
Bill Wendling	4aa25b79f9	Temporarily revert r68552. This was causing a failure in the self-hosting LLVM builds. --- Reverse-merging (from foreign repository) r68552 into '.': U test/CodeGen/X86/tls8.ll U test/CodeGen/X86/tls10.ll U test/CodeGen/X86/tls2.ll U test/CodeGen/X86/tls6.ll U lib/Target/X86/X86Instr64bit.td U lib/Target/X86/X86InstrSSE.td U lib/Target/X86/X86InstrInfo.td U lib/Target/X86/X86RegisterInfo.cpp U lib/Target/X86/X86ISelLowering.cpp U lib/Target/X86/X86CodeEmitter.cpp U lib/Target/X86/X86FastISel.cpp U lib/Target/X86/X86InstrInfo.h U lib/Target/X86/X86ISelDAGToDAG.cpp U lib/Target/X86/AsmPrinter/X86ATTAsmPrinter.cpp U lib/Target/X86/AsmPrinter/X86IntelAsmPrinter.cpp U lib/Target/X86/AsmPrinter/X86ATTAsmPrinter.h U lib/Target/X86/AsmPrinter/X86IntelAsmPrinter.h U lib/Target/X86/X86ISelLowering.h U lib/Target/X86/X86InstrInfo.cpp U lib/Target/X86/X86InstrBuilder.h U lib/Target/X86/X86RegisterInfo.td llvm-svn: 68560	2009-04-07 22:35:25 +00:00
Rafael Espindola	1edda06792	Reduce code duplication on the TLS implementation. This introduces a small regression on the generated code quality in the case we are just computing addresses, not loading values. Will work on it and on X86-64 support. llvm-svn: 68552	2009-04-07 21:37:46 +00:00
Evan Cheng	a84a318873	When optimzing a mul by immediate into two, the resulting mul's should get a x86 specific node to avoid dag combiner from hacking on them further. llvm-svn: 68066	2009-03-30 21:36:47 +00:00
Rafael Espindola	997b74ac61	add 8 and 16 bit TLS moves. add a fixme note on how to remove code duplication. llvm-svn: 66932	2009-03-13 19:39:55 +00:00
Rafael Espindola	71144973f3	Improve sext and zext of TLS variables. llvm-svn: 66922	2009-03-13 18:37:06 +00:00
Evan Cheng	2a332aa866	Re-apply 66024 with fixes: 1. Fixed indirect call to immediate address assembly. 2. Fixed JIT encoding by making the address pc-relative. llvm-svn: 66803	2009-03-12 18:15:39 +00:00
Dan Gohman	5637df37cd	Revert r66024. The JIT encoding for CALLpcrel32 is wrong -- see PR3773, and the assembly text output uses an indirect call ("call *") instead of a direct call. llvm-svn: 66735	2009-03-11 23:01:47 +00:00
Rafael Espindola	294943c99b	optimize i8 and i16 tls values. llvm-svn: 66725	2009-03-11 22:40:04 +00:00
Dan Gohman	c719d73eec	Don't use plain INC32 and DEC32 on x86-64; it needs INC64_32r and INC64_16r, because these instructions are encoded differently on x86-64. This fixes JIT regressions on x86-64 in kimwitu++ and others. llvm-svn: 66207	2009-03-05 21:32:23 +00:00
Dan Gohman	55d7b2ac4f	Re-apply 66008, now that the unfoldMemoryOperand bug is fixed. llvm-svn: 66058	2009-03-04 19:44:21 +00:00
Evan Cheng	9edd616b59	Fix PR3666: isel calls to constant addresses. llvm-svn: 66024	2009-03-04 06:48:53 +00:00
Dan Gohman	6728f892be	Revert r66004 for now; it's causing a variety of test failures. llvm-svn: 66008	2009-03-04 03:54:19 +00:00
Dan Gohman	fe8d71f42a	Teach the x86 backend to eliminate "test" instructions by using the EFLAGS result from add, sub, inc, and dec instructions in simple cases. llvm-svn: 66004	2009-03-04 02:33:24 +00:00
Dan Gohman	3a72265d41	Add '(implicit EFLAGS)' for AND, OR, XOR, NEG, INC, and DEC instructions. These aren't used yet. llvm-svn: 65965	2009-03-03 19:53:46 +00:00
Evan Cheng	64fdacc27f	A few more isAsCheapAsAMove. llvm-svn: 63852	2009-02-05 08:42:55 +00:00
Evan Cheng	1bc8af207e	Implement multiple with overflow by 2 with an add instruction. llvm-svn: 63090	2009-01-27 03:30:42 +00:00
Nate Begeman	5eca265519	Map address space 256 to gs; similar mappings could be supported for the other x86 segments. address space 0 is stack/default, 1-255 are reserved for client use. llvm-svn: 62980	2009-01-26 01:24:32 +00:00
Evan Cheng	201501995f	Favors generating "not" over "xor -1". For example. unsigned test(unsigned a) { return ~a; } llvm used to generate: movl $4294967295, %eax xorl 4(%esp), %eax Now it generates: movl 4(%esp), %eax notl %eax It's 3 bytes shorter. llvm-svn: 62661	2009-01-21 02:09:05 +00:00
Dan Gohman	b8f5ba6781	Disable the register+memory forms of the bt instructions for now. Thanks to Eli for pointing out that these forms don't ignore the high bits of their index operands, and as such are not immediately suitable for use by isel. llvm-svn: 62194	2009-01-13 23:23:30 +00:00
Dan Gohman	0fdf71cb9d	Add bt instructions that take immediate operands. llvm-svn: 62180	2009-01-13 20:33:23 +00:00
Dan Gohman	eb2591bbdd	Fix a few more JIT encoding issues in the BT instructions. llvm-svn: 62179	2009-01-13 20:32:45 +00:00
Dan Gohman	8e8d1da35a	Add patterns to match conditional moves with loads folded into their left operand, rather than their right. Do this by commuting the operands and inverting the condition. llvm-svn: 61842	2009-01-07 01:00:24 +00:00
Dan Gohman	7e47cc7cda	Define instructions for cmovo and cmovno. llvm-svn: 61836	2009-01-07 00:35:10 +00:00
Dan Gohman	33e6fcd56f	X86_COND_C and X86_COND_NC are alternate mnemonics for X86_COND_B and X86_COND_AE, respectively. llvm-svn: 61835	2009-01-07 00:15:08 +00:00
Evan Cheng	4c91aa3418	Do not isel load folding bt instructions for pentium m, core, core2, and AMD processors. These are significantly slower than a load followed by a bt of a register. llvm-svn: 61557	2009-01-02 05:35:45 +00:00
Chris Lattner	1b8c9f795a	Fix some JIT encodings. llvm-svn: 61425	2008-12-25 01:32:49 +00:00
Chris Lattner	d1dfdab973	BT memory operands load from their address operand. llvm-svn: 61424	2008-12-25 01:27:10 +00:00
Dan Gohman	25a767d7f4	Add instruction patterns and encodings for the x86 bt instructions. llvm-svn: 61400	2008-12-23 22:45:23 +00:00
Bill Wendling	c4499feb1a	- Use patterns instead of creating completely new instruction matching patterns, which are identical to the original patterns. - Change the multiply with overflow so that we distinguish between signed and unsigned multiplication. Currently, unsigned multiplication with overflow isn't working! llvm-svn: 60963	2008-12-12 21:15:41 +00:00
Bill Wendling	1a317678bc	Redo the arithmetic with overflow architecture. I was changing the semantics of ISD::ADD to emit an implicit EFLAGS. This was horribly broken. Instead, replace the intrinsic with an ISD::SADDO node. Then custom lower that into an X86ISD::ADD node with a associated SETCC that checks the correct condition code (overflow or carry). Then that gets lowered into the correct X86::ADDOvf instruction. Similar for SUB and MUL instructions. llvm-svn: 60915	2008-12-12 00:56:36 +00:00
Bill Wendling	db8ec2d75a	Add sub/mul overflow intrinsics. This currently doesn't have a target-independent way of determining overflow on multiplication. It's very tricky. Patch by Zoltan Varga! llvm-svn: 60800	2008-12-09 22:08:41 +00:00
Nick Lewycky	f9e2394009	Fix typo, psuedo -> pseudo. llvm-svn: 60651	2008-12-07 03:49:52 +00:00
Dan Gohman	69cc2cbbff	Rename isSimpleLoad to canFoldAsLoad, to better reflect its meaning. llvm-svn: 60487	2008-12-03 18:15:48 +00:00
Bill Wendling	122c515809	Reapply r60382. This time, don't mark "ADC" nodes with "implicit EFLAGS". llvm-svn: 60385	2008-12-02 00:07:05 +00:00
Bill Wendling	351b6659ad	Temporarily revert r60382. It caused CodeGen/X86/i2k.ll and others to fail. llvm-svn: 60383	2008-12-01 23:44:08 +00:00
Bill Wendling	a435b1aebc	- Have "ADD" instructions return an implicit EFLAGS. - Add support for seto, setno, setc, and setnc instructions. llvm-svn: 60382	2008-12-01 23:30:42 +00:00
Bill Wendling	751a694ad3	Generate something sensible for an [SU]ADDO op when the overflow/carry flag is the conditional for the BRCOND statement. For instance, it will generate: addl %eax, %ecx jo LOF instead of addl %eax, %ecx ; About 10 instructions to compare the signs of LHS, RHS, and sum. jl LOF llvm-svn: 60123	2008-11-26 22:37:40 +00:00
Dan Gohman	c8d2b0135a	Don't set neverHasSideEffects on x86's divide instructions, since they trap on divide-by-zero, and this side effect is otherwise unmodeled. llvm-svn: 59551	2008-11-18 21:29:14 +00:00
Nicolas Geoffray	db30612fc4	Generate code for TLS instructions. llvm-svn: 58141	2008-10-25 15:22:06 +00:00
Evan Cheng	0fcc89b596	Add implicit defs of XMM8 to XMM15 on 32-bit call instructions. While this is not technically true, it tells tblgen that these instructions "clobber" the entire XMM register file. llvm-svn: 57723	2008-10-17 21:02:22 +00:00
Dan Gohman	ca0546facc	Fun x86 encoding tricks: when adding an immediate value of 128, use a SUB instruction instead of an ADD, because -128 can be encoded in an 8-bit signed immediate field, while +128 can't be. This avoids the need for a 32-bit immediate field in this case. A similar optimization applies to 64-bit adds with 0x80000000, with the 32-bit signed immediate field. To support this, teach tablegen how to handle 64-bit constants. llvm-svn: 57663	2008-10-17 01:33:43 +00:00
Dan Gohman	a39b0a1f05	Define patterns for shld and shrd that match immediate shift counts, and patterns that match dynamic shift counts when the subtract is obscured by a truncate node. Add DAGCombiner support for recognizing rotate patterns when the shift counts are defined by truncate nodes. Fix and simplify the code for commuting shld and shrd instructions to work even when the given instruction doesn't have a parent, and when the caller needs a new instruction. These changes allow LLVM to use the shld, shrd, rol, and ror instructions on x86 to replace equivalent code using two shifts and an or in many more cases. llvm-svn: 57662	2008-10-17 01:23:35 +00:00
Dan Gohman	29ad439782	Now that predicates can be composed, simplify several of the predicates by extending simple predicates to create more complex predicates instead of duplicating the logic for the simple predicates. This doesn't reduce much redundancy in DAGISelEmitter.cpp's generated source yet; that will require improvements to DAGISelEmitter.cpp's instruction sorting, to make it more effectively group nodes with similar predicates together. llvm-svn: 57565	2008-10-15 06:50:19 +00:00
Chris Lattner	2753955fc0	Change CALLSEQ_BEGIN and CALLSEQ_END to take TargetConstant's as parameters instead of raw Constants. This prevents the constants from being selected by the isel pass, fixing PR2735. llvm-svn: 57385	2008-10-11 22:08:30 +00:00
Dale Johannesen	422ef88f31	Model hardwired inputs & outputs of x86 8-bit divides correctly. Fixes local RA miscompilation of gcc.c-torture/execute/20020904-1.c -O0. llvm-svn: 57257	2008-10-07 18:54:28 +00:00
Dale Johannesen	8c36a1c09c	Make atomic Swap work, 64-bit on x86-32. Make it all work in non-pic mode. llvm-svn: 57034	2008-10-03 22:25:52 +00:00
Dale Johannesen	5d60c1ebb1	Pass MemOperand through for 64-bit atomics on 32-bit, incidentally making the case where the memop is a pointer deref work. Fix cmp-and-swap regression. llvm-svn: 57027	2008-10-03 19:41:08 +00:00
Dale Johannesen	867d549fce	Handle some 64-bit atomics on x86-32, some of the time. llvm-svn: 56963	2008-10-02 18:53:47 +00:00
Dan Gohman	6388dde98e	Split x86's ADJCALLSTACK instructions into 32-bit and 64-bit forms. This allows the 64-bit forms to use+def RSP instead of ESP. This doesn't fix any real bugs today, but it is more precise and it makes the debug dumps on x86-64 look more consistent. Also, add some comments describing the CALL instructions' physreg operand uses and defs. llvm-svn: 56925	2008-10-01 18:28:06 +00:00
Dan Gohman	bb3c5019f8	Mark CALL instructions as having a Use of ESP/RSP. llvm-svn: 56911	2008-10-01 04:14:30 +00:00
Evan Cheng	82237f2f42	Fix PR2835. Do not change the width of a volatile load. llvm-svn: 56792	2008-09-29 17:26:18 +00:00
Evan Cheng	7d6fa97567	Implement "punpckldq %xmm0, $xmm0" as "pshufd $0x50, %xmm0, %xmm" unless optimizing for code size. llvm-svn: 56711	2008-09-26 23:41:32 +00:00
Evan Cheng	f8ead16b50	Fix patterns for SSE4.1 move and sign extend instructions. Also add instructions which fold VZEXT_MOVL and VZEXT_LOAD. llvm-svn: 56594	2008-09-24 23:27:55 +00:00
Bill Wendling	24c79f28b1	Reverting r56249. On further investigation, this functionality isn't needed. Apologies for the thrashing. llvm-svn: 56251	2008-09-16 21:48:12 +00:00
Bill Wendling	8bc392fb1d	- Change "ExternalSymbolSDNode" to "SymbolSDNode". - Add linkage to SymbolSDNode (default to external). - Change ISD::ExternalSymbol to ISD::Symbol. - Change ISD::TargetExternalSymbol to ISD::TargetSymbol These changes pave the way to allowing SymbolSDNodes with non-external linkage. llvm-svn: 56249	2008-09-16 21:12:30 +00:00
Dan Gohman	effb894453	Rename ConstantSDNode::getValue to getZExtValue, for consistency with ConstantInt. This led to fixing a bug in TargetLowering.cpp using getValue instead of getAPIntValue. llvm-svn: 56159	2008-09-12 16:56:44 +00:00
Evan Cheng	cfb7f3abdf	Transform (x << (y&31)) -> (x << y). This takes advantage of the fact x86 shift instructions 2nd operand (shift count) is limited to 0 to 31 (or 63 in the x86-64 case). llvm-svn: 55558	2008-08-30 02:03:58 +00:00
Dale Johannesen	41be0d4445	Split the ATOMIC NodeType's to include the size, e.g. ATOMIC_LOAD_ADD_{8,16,32,64} instead of ATOMIC_LOAD_ADD. Increased the Hardcoded Constant OpActionsCapacity to match. Large but boring; no functional change. This is to support partial-word atomics on ppc; i8 is not a valid type there, so by the time we get to lowering, the ATOMIC_LOAD nodes looks the same whether the type was i8 or i32. The information can be added to the AtomicSDNode, but that is the largest SDNode; I don't fully understand the SDNode allocation, but it is sensitive to the largest node size, so increasing that must be bad. This is the alternative. llvm-svn: 55457	2008-08-28 02:44:49 +00:00
Bill Wendling	fc4f64eed0	Reverting r55190, r55191, and r55192. They broke the build with this error message: {standard input}:17:bad register name `%sil' make[4]: * [libgcc/./_addvsi3.o] Error 1 make[4]: * Waiting for unfinished jobs.... {standard input}:23:bad register name `%dil' {standard input}:28:bad register name `%dil' make[4]: * [libgcc/./_addvdi3.o] Error 1 {standard input}:18:bad register name `%sil' make[4]: * [libgcc/./_subvsi3.o] Error 1 llvm-svn: 55200	2008-08-22 20:51:05 +00:00
Dan Gohman	736779f088	Anyext tweaks for x86. When extloading a value to i32 or i64, choose instructions that define the full 32 or 64-bit value. When anyexting from i8 to i16 or i32, it's not necessary to zero out the high portion of the register. llvm-svn: 55190	2008-08-22 19:19:31 +00:00
Dan Gohman	814f291664	Move the handling of ANY_EXTEND, SIGN_EXTEND_INREG, and TRUNCATE out of X86ISelDAGToDAG.cpp C++ code and into tablegen code. Among other things, using tablegen for these things makes them friendlier to FastISel. Tablegen can handle the case of i8 subregs on x86-32, but currently the C++ code for that case uses MVT::Flag in a tricky way, and it happens to schedule better in some cases. So for now, leave the C++ code in place to handle the i8 case on x86-32. llvm-svn: 55078	2008-08-20 21:27:32 +00:00
Dan Gohman	8823b0d245	Tablegen generated code already tests the opcode value, so it's not necessary to use dyn_cast in these predicates. llvm-svn: 55055	2008-08-20 15:24:22 +00:00
Bill Wendling	f00f3055d8	Revert r55018 and apply the correct "fix" for the 64-bit sub_and_fetch atomic. Just expand it like the other X-bit sub_and_fetches. llvm-svn: 55023	2008-08-20 00:28:16 +00:00
Bill Wendling	e79740851f	Add support for the __sync_sub_and_fetch atomics and friends for X86. The code was already present, but not hooked up to anything. llvm-svn: 55018	2008-08-19 23:09:18 +00:00
Dale Johannesen	5afbf510aa	Add support for 8 and 16 bit forms of __sync builtins on X86. Change "lock" instructions to be on a separate line. This is needed to work around a bug in the Darwin assembler. llvm-svn: 54999	2008-08-19 18:47:28 +00:00
Dan Gohman	91c2c432c0	Re-introduce the 8-bit subreg zext-inreg patterns for x86-32, this time using MOV32to32_ and MOV16to16_. Thanks to Evan for suggesting this. llvm-svn: 54418	2008-08-06 18:27:21 +00:00
Dan Gohman	04f4c833e9	xchg does not modify FLAGS. llvm-svn: 54411	2008-08-06 15:52:50 +00:00
Dan Gohman	86b06335aa	Reapply r54147 with a constraint to only use the 8-bit subreg form on x86-64, to avoid the problem with x86-32 having GPRs that don't have 8-bit subregs. Also, change several 16-bit instructions to use equivalent 32-bit instructions. These have a smaller encoding and avoid partial-register updates. llvm-svn: 54223	2008-07-30 18:09:17 +00:00
Dan Gohman	43105328d3	Revert 54147. llvm-svn: 54148	2008-07-29 01:02:18 +00:00
Dan Gohman	26ec56c75c	Add x86 isel patterns to match what would be a ZERO_EXTEND_INREG operation, which is represented in codegen as an 'and' operation. This matches them with movz instructions, instead of leaving them to be matched by and instructions with an immediate field. llvm-svn: 54147	2008-07-28 22:18:25 +00:00
Anton Korobeynikov	2d29ee06cd	Fix encoding of atomic compare and swap for i64 llvm-svn: 53911	2008-07-22 16:22:48 +00:00
Mon P Wang	6a490371c9	Added MemOperands to Atomic operations since Atomics touches memory. Added abstract class MemSDNode for any Node that have an associated MemOperand Changed atomic.lcs => atomic.cmp.swap, atomic.las => atomic.load.add, and atomic.lss => atomic.load.sub llvm-svn: 52706	2008-06-25 08:15:39 +00:00
Evan Cheng	1d260dfa3b	XOR32rr, etc. are not AsCheapAsMove, but MOV32ri, etc. are. llvm-svn: 52454	2008-06-18 08:13:07 +00:00
Andrew Lenharth	f88d50bfcc	add missing atomic intrinsic from gcc llvm-svn: 52270	2008-06-14 05:48:15 +00:00
Dan Gohman	bd3390c73a	Teach the DAGISelEmitter to not compute the variable_ops operand index for the input pattern in terms of the output pattern. Instead keep track of how many fixed operands the input pattern actually has, and have the input matching code pass the output-emitting function that index value. This simplifies the code, disentangles variables_ops from the support for predication operations, and makes variable_ops more robust. llvm-svn: 51808	2008-05-31 02:11:25 +00:00
Dan Gohman	96af4ddb62	Add patterns for CALL32m and CALL64m. They aren't matched in most cases due to an isel deficiency already noted in lib/Target/X86/README.txt, but they can be matched in this fold-call.ll testcase, for example. This is interesting mainly because it exposes a tricky tblgen bug; tblgen was incorrectly computing the starting index for variable_ops in the case of a complex pattern. llvm-svn: 51706	2008-05-29 21:50:34 +00:00
Dan Gohman	6e582c449f	Fix a tblgen problem handling variable_ops in tblgen instruction definitions. This adds a new construct, "discard", for indicating that a named node in the input matching pattern is to be discarded, instead of corresponding to a node in the output pattern. This allows tblgen to know where the arguments for the varaible_ops are supposed to begin. This fixes "rdar://5791600", whatever that is ;-). llvm-svn: 51699	2008-05-29 19:57:41 +00:00
Bill Wendling	0252be178d	XOR?RI instructions aren't as cheap as moves. llvm-svn: 51664	2008-05-29 03:46:36 +00:00
Bill Wendling	7a1a8eb6e2	Implement "AsCheapAsAMove" for some obviously cheap instructions: xor and the like. llvm-svn: 51662	2008-05-29 01:02:09 +00:00
Evan Cheng	6f34ed0d36	Doh. Alignment is in bytes, not in bits. llvm-svn: 51092	2008-05-14 02:49:43 +00:00
Evan Cheng	f8ab712fa9	- Fix the pasto in the fix for a previous pasto. - Incorporate Chris' comment suggestion. llvm-svn: 51061	2008-05-13 18:59:59 +00:00
Evan Cheng	595e226085	- Don't treat anyext 16-bit load as a 32-bit load if it's volatile. - Correct a pasto. llvm-svn: 51054	2008-05-13 16:45:56 +00:00
Evan Cheng	3f40c69083	On x86, it's safe to treat i32 load anyext as a normal i32 load. Ditto for i8 anyext load to i16. llvm-svn: 51019	2008-05-13 00:54:02 +00:00
Dan Gohman	0863b19ae6	Fix a copy+paste bug; pseudo-instructions shouldn't have encoding information. llvm-svn: 50997	2008-05-12 20:22:45 +00:00
Mon P Wang	3e58393c3d	Added addition atomic instrinsics and, or, xor, min, and max. llvm-svn: 50663	2008-05-05 19:05:59 +00:00
Anton Korobeynikov	9205c8562c	Add General Dynamic TLS model for X86-64. Some parts looks really ugly (look for tlsaddr pattern), but should work. Work is in progress, more models will follow llvm-svn: 50630	2008-05-04 21:36:32 +00:00
Arnold Schwaighofer	be0de34ede	Tail call optimization improvements: Move platform independent code (lowering of possibly overwritten arguments, check for tail call optimization eligibility) from target X86ISelectionLowering.cpp to TargetLowering.h and SelectionDAGISel.cpp. Initial PowerPC tail call implementation: Support ppc32 implemented and tested (passes my tests and test-suite llvm-test). Support ppc64 implemented and half tested (passes my tests). On ppc tail call optimization is performed if caller and callee are fastcc call is a tail call (in tail call position, call followed by ret) no variable argument lists or byval arguments option -tailcallopt is enabled Supported: * non pic tail calls on linux/darwin * module-local tail calls on linux(PIC/GOT)/darwin(PIC) * inter-module tail calls on darwin(PIC) If constraints are not met a normal call will be emitted. A test checking the argument lowering behaviour on x86-64 was added. llvm-svn: 50477	2008-04-30 09:16:33 +00:00
Evan Cheng	6d653b58f9	Fix MMX_MOVQ2DQrr pattern. It's illegal to do a bitconvert from a smaller type to a larger one. llvm-svn: 50278	2008-04-25 18:19:54 +00:00
Evan Cheng	7f4240a47c	xchg which references a memory operand does not need to lock prefix. Atomicity is guaranteed. llvm-svn: 49946	2008-04-19 01:20:30 +00:00
Evan Cheng	00bd8d904a	- Fix atomic operation JIT encoding. - Remove unused instructions. llvm-svn: 49921	2008-04-18 20:55:36 +00:00
Evan Cheng	5879213597	Also support Intel asm syntax. llvm-svn: 49878	2008-04-17 23:35:10 +00:00
Evan Cheng	4704baa555	Fix assembly code for atomic operations. llvm-svn: 49869	2008-04-17 21:26:35 +00:00
Nate Begeman	7417348a7e	80 col fix llvm-svn: 49569	2008-04-12 00:47:57 +00:00
Evan Cheng	29e62a59f3	Allow certain lea instructions to be rematerialized. llvm-svn: 48855	2008-03-27 01:41:09 +00:00
Arnold Schwaighofer	7da2bceb3b	Don't loose incoming argument registers. Fix documentation style. llvm-svn: 48545	2008-03-19 16:39:45 +00:00
Evan Cheng	0e7b00d79f	Replace all target specific implicit def instructions with a target independent one: TargetInstrInfo::IMPLICIT_DEF. llvm-svn: 48380	2008-03-15 00:03:38 +00:00
Christopher Lamb	dd55d3f1b2	Get rid of a pseudo instruction and replace it with subreg based operation on real instructions, ridding the asm printers of the hack used to do this previously. In the process, update LowerSubregs to be careful about eliminating copies that have side affects. Note: the coalescer will have to be careful about this too, when it starts coalescing insert_subreg nodes. llvm-svn: 48329	2008-03-13 05:47:01 +00:00
Christopher Lamb	aa7c2105de	Recommitting parts of r48130. These do not appear to cause the observed failures. llvm-svn: 48223	2008-03-11 10:09:17 +00:00
Chris Lattner	1bd44363f2	Change the model for FP Stack return to use fp operands on the RET instruction instead of using FpSET_ST0_32. This also generalizes the code to handling returning of multiple FP results. llvm-svn: 48209	2008-03-11 03:23:40 +00:00
Evan Cheng	d4e1d9eeb2	Revert 48125, 48126, and 48130 for now to unbreak some x86-64 tests. llvm-svn: 48167	2008-03-10 19:31:26 +00:00
Christopher Lamb	4ba3f0430b	Allow insert_subreg into implicit, target-specific values. Change insert/extract subreg instructions to be able to be used in TableGen patterns. Use the above features to reimplement an x86-64 pseudo instruction as a pattern. llvm-svn: 48130	2008-03-10 06:12:08 +00:00
Andrew Lenharth	357061a74d	64bit CAS on 32bit x86. llvm-svn: 47929	2008-03-05 01:15:49 +00:00
Evan Cheng	59d58ab8c4	80 column violations. llvm-svn: 47878	2008-03-04 03:20:06 +00:00
Evan Cheng	33ff36321e	Remove -always-fold-and-in-test. llvm-svn: 47871	2008-03-04 00:40:35 +00:00
Andrew Lenharth	20bcdba9ca	good catch anton llvm-svn: 47800	2008-03-01 23:18:21 +00:00
Andrew Lenharth	f5c90ec12c	make CAS work llvm-svn: 47799	2008-03-01 22:27:48 +00:00
Andrew Lenharth	d032c33300	all but CAS working on x86 llvm-svn: 47798	2008-03-01 21:52:34 +00:00
Andrew Lenharth	0070dd1de3	Add lock prefix support to x86. Also add the instructions necessary for the atomic ops. They are still marked pseudo, since I cannot figure out what format to use, but they are the correct opcode. llvm-svn: 47795	2008-03-01 13:37:02 +00:00
Andrew Lenharth	95528943e9	Atomic op support. If any gcc test uses __sync builtins, it might start failing on archs that haven't implemented them yet llvm-svn: 47430	2008-02-21 06:45:13 +00:00
Evan Cheng	b6b69208ba	Poorly named option. llvm-svn: 47400	2008-02-20 20:57:32 +00:00
Evan Cheng	5ce8dd93ef	Add hidden option -x86-fold-and-in-test to test the effect the test / and folding change. llvm-svn: 47351	2008-02-19 23:36:51 +00:00
Chris Lattner	97b9662f78	Don't fold and's into test instructions if they have multiple uses. This compiles test-nofold.ll into: _test: movl $15, %ecx andl 4(%esp), %ecx testl %ecx, %ecx movl $42, %eax cmove %ecx, %eax ret instead of: _test: movl 4(%esp), %eax movl %eax, %ecx andl $15, %ecx testl $15, %eax movl $42, %eax cmove %ecx, %eax ret llvm-svn: 47330	2008-02-19 17:37:35 +00:00
Evan Cheng	a20a773654	Fix a x86-64 codegen deficiency. Allow gv + offset when using rip addressing mode. Before: _main: subq $8, %rsp leaq _X(%rip), %rax movsd 8(%rax), %xmm1 movss _X(%rip), %xmm0 call _t xorl %ecx, %ecx movl %ecx, %eax addq $8, %rsp ret Now: _main: subq $8, %rsp movsd _X+8(%rip), %xmm1 movss _X(%rip), %xmm0 call _t xorl %ecx, %ecx movl %ecx, %eax addq $8, %rsp ret Notice there is another idiotic codegen issue that needs to be fixed asap: xorl %ecx, %ecx movl %ecx, %eax llvm-svn: 46850	2008-02-07 08:53:49 +00:00
Nate Begeman	e14fdfaecd	SSE 4.1 Intrinsics and detection llvm-svn: 46681	2008-02-03 07:18:54 +00:00
Duncan Sands	95d46ef887	The last pieces needed for loading arbitrary precision integers. This won't actually work (and most of the code is dead) unless the new legalization machinery is turned on. While there, I rationalized the handling of i1, and removed some bogus (and unused) sextload patterns. For i1, this could result in microscopically better code for some architectures (not X86). It might also result in worse code if annotating with AssertZExt nodes turns out to be more harmful than helpful. llvm-svn: 46280	2008-01-23 20:39:46 +00:00
Chris Lattner	1ea55cf816	This commit changes: 1. Legalize now always promotes truncstore of i1 to i8. 2. Remove patterns and gunk related to truncstore i1 from targets. 3. Rename the StoreXAction stuff to TruncStoreAction in TLI. 4. Make the TLI TruncStoreAction table a 2d table to handle from/to conversions. 5. Mark a wide variety of invalid truncstores as such in various targets, e.g. X86 currently doesn't support truncstore of any of its integer types. 6. Add legalize support for truncstores with invalid value input types. 7. Add a dag combine transform to turn store(truncate) into truncstore when safe. The later allows us to compile CodeGen/X86/storetrunc-fp.ll to: _foo: fldt 20(%esp) fldt 4(%esp) faddp %st(1) movl 36(%esp), %eax fstps (%eax) ret instead of: _foo: subl $4, %esp fldt 24(%esp) fldt 8(%esp) faddp %st(1) fstps (%esp) movl 40(%esp), %eax movss (%esp), %xmm0 movss %xmm0, (%eax) addl $4, %esp ret llvm-svn: 46140	2008-01-17 19:59:44 +00:00
Chris Lattner	9a249b0ce5	rename SDTRet -> SDTNone. Move definition of 'trap' sdnode up from x86 instrinfo to targetselectiondag.td. llvm-svn: 46017	2008-01-15 22:02:54 +00:00
Chris Lattner	3c3fefde06	no need to expand ISD::TRAP to X86ISD::TRAP, just match ISD::TRAP. llvm-svn: 46015	2008-01-15 21:58:22 +00:00
Anton Korobeynikov	59e6d533bd	Fix JIT encoding of trap/ud2 instruction llvm-svn: 46012	2008-01-15 21:40:02 +00:00
Anton Korobeynikov	6bbbc4cbfa	For PR1839: add initial support for __builtin_trap. llvm-gcc part is missed as well as PPC codegen llvm-svn: 46001	2008-01-15 07:02:33 +00:00
Chris Lattner	806dd0e2ac	remove xchg and shift-reg-by-1 instructions, which are dead. llvm-svn: 45870	2008-01-11 18:00:50 +00:00
Chris Lattner	9283173061	more flags set right llvm-svn: 45860	2008-01-11 07:18:17 +00:00
Chris Lattner	8e60f2c996	IMPLICIT_USE and IMPLICIT_DEF are dead, remove them. llvm-svn: 45838	2008-01-10 19:27:54 +00:00
Chris Lattner	317332fc2a	Start inferring side effect information more aggressively, and fix many bugs in the x86 backend where instructions were not marked maystore/mayload, and perf issues where instructions were not marked neverHasSideEffects. It would be really nice if we could write patterns for copy instructions. I have audited all the x86 instructions down to MOVDQAmr. The flags on others and on other targets are probably not right in all cases, but no clients currently use this info that are enabled by default. llvm-svn: 45829	2008-01-10 07:59:24 +00:00
Chris Lattner	2e38f2458c	rename X86InstrX86-64.td -> X86Instr64bit.td llvm-svn: 45826	2008-01-10 05:50:42 +00:00
Chris Lattner	aca7ca3730	remove explicit sets of 'neverHasSideEffects' that can now be inferred from the instr patterns. llvm-svn: 45824	2008-01-10 05:45:39 +00:00
Chris Lattner	94de7bc3aa	get def use info more correct. llvm-svn: 45821	2008-01-10 05:12:37 +00:00
Chris Lattner	b296b0f1c1	The pic base can't be duplicated. llvm-svn: 45668	2008-01-06 23:49:32 +00:00
Chris Lattner	a4ce4f6987	rename isLoad -> isSimpleLoad due to evan's desire to have such a predicate. llvm-svn: 45667	2008-01-06 23:38:27 +00:00
Chris Lattner	9fa8ae6c6b	getting the pic base has no side effects. llvm-svn: 45618	2008-01-05 03:54:32 +00:00
Evan Cheng	f55b7381af	Combine MovePCtoStack + POP32r into one instruction MOVPC32r so it can be moved if needed. llvm-svn: 45605	2008-01-05 00:41:47 +00:00
Chris Lattner	f3ebc3f3d2	Remove attribution from file headers, per discussion on llvmdev. llvm-svn: 45418	2007-12-29 20:36:04 +00:00
Evan Cheng	f4f52dbc8c	Fix JIT code emission of X86::MovePCtoStack. llvm-svn: 45307	2007-12-22 02:26:46 +00:00
Bill Wendling	b3d85a5d4b	Add "mayHaveSideEffects" and "neverHasSideEffects" flags to some instructions. I based what flag to set on whether it was already marked as "isRematerializable". If there was a further check to determine if it's "really" rematerializable, then I marked it as "mayHaveSideEffects" and created a check in the X86 back-end similar to the remat one. llvm-svn: 45132	2007-12-17 23:07:56 +00:00
Evan Cheng	a56e6ff9a7	Fix bsf / bsr jit encoding. llvm-svn: 45037	2007-12-14 18:49:43 +00:00
Dan Gohman	9d2e9e376f	Fix Intel asm syntax for the bsr and bsf instructions. llvm-svn: 45030	2007-12-14 15:10:00 +00:00
Evan Cheng	0e6408124e	Fix ctlz and cttz. llvm definition requires them to return number of bits in of the src type when value is zero. llvm-svn: 45029	2007-12-14 08:30:15 +00:00
Evan Cheng	e9fbc3f014	Implement ctlz and cttz with bsr and bsf. llvm-svn: 45024	2007-12-14 02:13:44 +00:00
Evan Cheng	827d30db19	Fold some and + shift in x86 addressing mode. llvm-svn: 44970	2007-12-13 00:43:27 +00:00
Evan Cheng	6e68381e02	Implicit def instructions, e.g. X86::IMPLICIT_DEF_GR32, are always re-materializable and they should not be spilled. llvm-svn: 44960	2007-12-12 23:12:09 +00:00
Bill Wendling	77b13af9a6	Unifacalize the CALLSEQ{START,END} stuff. llvm-svn: 44045	2007-11-13 09:19:02 +00:00
Bill Wendling	f359fed9f9	Unify CALLSEQ_{START,END}. They take 4 parameters: the chain, two stack adjustment fields, and an optional flag. If there is a "dynamic_stackalloc" in the code, make sure that it's bracketed by CALLSEQ_START and CALLSEQ_END. If not, then there is the potential for the stack to be changed while the stack's being used by another instruction (like a call). This can only result in tears... llvm-svn: 44037	2007-11-13 00:44:25 +00:00
Owen Anderson	933b5b7e62	Add a flag for indirect branch instructions. Target maintainers: please check that the instructions for your target are correctly marked. llvm-svn: 44012	2007-11-12 07:39:39 +00:00
Evan Cheng	35ff79370b	Local spiller optimization: Turn a store folding instruction into a load folding instruction. e.g. xorl %edi, %eax movl %eax, -32(%ebp) movl -36(%ebp), %eax orl %eax, -32(%ebp) => xorl %edi, %eax orl -36(%ebp), %eax mov %eax, -32(%ebp) This enables the unfolding optimization for a subsequent instruction which will also eliminate the newly introduced store instruction. llvm-svn: 43192	2007-10-19 21:23:22 +00:00
Arnold Schwaighofer	1f0da1fefb	Corrected many typing errors. And removed 'nest' parameter handling for fastcc from X86CallingConv.td. This means that nested functions are not supported for calling convention 'fastcc'. llvm-svn: 42934	2007-10-12 21:30:57 +00:00
Arnold Schwaighofer	9ccea99165	Added tail call optimization to the x86 back end. It can be enabled by passing -tailcallopt to llc. The optimization is performed if the following conditions are satisfied: * caller/callee are fastcc * elf/pic is disabled OR elf/pic enabled + callee is in module + callee has visibility protected or hidden llvm-svn: 42870	2007-10-11 19:40:01 +00:00
Evan Cheng	1151ffde70	Commute x86 cmove instructions by swapping the operands and change the condition to its inverse. Testing this as llcbeta llvm-svn: 42661	2007-10-05 23:13:21 +00:00
Evan Cheng	97eba74a52	ADC and SBB uses EFLAGS. llvm-svn: 42640	2007-10-05 17:59:57 +00:00
Evan Cheng	5fb5a1f389	Enabling new condition code modeling scheme. llvm-svn: 42459	2007-09-29 00:00:36 +00:00
Evan Cheng	1f516560d1	Stop inventing new words. :-) llvm-svn: 42429	2007-09-28 01:35:02 +00:00
Evan Cheng	edfc5b2204	Pessimisively assume ADJCALLSTACKDOWN / ADJCALLSTACKUP (which becomes sub / add) clobbers EFLAGS. llvm-svn: 42426	2007-09-28 01:19:48 +00:00
Evan Cheng	b93de587cb	Some assemblers do not recognize aliases pushfd, pushfq, popfd, and popfq. Just emit them as pushf and popf. llvm-svn: 42371	2007-09-26 21:28:00 +00:00
Evan Cheng	b4b352656a	Typos: POPQ -> POPFQ, POPD -> POPFD. llvm-svn: 42348	2007-09-26 06:38:29 +00:00
Evan Cheng	0a6f47cff9	Add pushf{d\|q}, popf{d\|q} to push and pop EFLAGS register. llvm-svn: 42335	2007-09-26 01:29:06 +00:00
Evan Cheng	e95f391ef1	Added support for new condition code modeling scheme (i.e. physical register dependency). These are a bunch of instructions that are duplicated so the x86 backend can support both the old and new schemes at the same time. They will be deleted after all the kinks are worked out. llvm-svn: 42285	2007-09-25 01:57:46 +00:00
Dan Gohman	071efe28bb	Fix the syntax for the .loc directive in preparation for using it. llvm-svn: 42268	2007-09-24 19:25:06 +00:00
Dale Johannesen	e36c400255	Fix PR 1681. When X86 target uses +sse -sse2, keep f32 in SSE registers and f64 in x87. This is effectively a new codegen mode. Change addLegalFPImmediate to permit float and double variants to do different things. Adjust callers. llvm-svn: 42246	2007-09-23 14:52:20 +00:00
Evan Cheng	483e1ce16e	Add implicit def of EFLAGS on those instructions that may modify flags. llvm-svn: 41962	2007-09-14 21:48:26 +00:00
Evan Cheng	3e18e504ae	Remove (somewhat confusing) Imp<> helper, use let Defs = [], Uses = [] instead. llvm-svn: 41863	2007-09-11 19:55:27 +00:00
Evan Cheng	cef2c0efcc	TableGen no longer emit CopyFromReg nodes for implicit results in physical registers. The scheduler is now responsible for emitting them. llvm-svn: 41781	2007-09-07 23:59:02 +00:00
Dan Gohman	a95cbb0007	Avoid storing and reloading zeros and other constants from stack slots by flagging the associated instructions as being trivially rematerializable. llvm-svn: 41775	2007-09-07 21:32:51 +00:00
Evan Cheng	c2081fe573	Mark load instructions with isLoad = 1. llvm-svn: 41595	2007-08-30 05:49:43 +00:00
Dale Johannesen	b1888e73ad	Long double patch 4 of N: initial x87 implementation. Lots of problems yet but some simple things work. llvm-svn: 40847	2007-08-05 18:49:15 +00:00
Evan Cheng	473c5111c3	Switch some multiplication instructions over to the new scheme for testing. llvm-svn: 40723	2007-08-02 05:48:35 +00:00
Evan Cheng	763cdfd371	Mac OS X X86-64 low 4G address not available. llvm-svn: 40701	2007-08-01 23:45:51 +00:00
Evan Cheng	6f2ce6b842	Be more precise. llvm-svn: 40689	2007-08-01 20:22:37 +00:00
Dan Gohman	54ec4bfa5f	Change the x86 assembly output to use tab characters to separate the mnemonics from their operands instead of single spaces. This makes the assembly output a little more consistent with various other compilers (f.e. GCC), and slightly easier to read. Also, update the regression tests accordingly. llvm-svn: 40648	2007-07-31 20:11:57 +00:00
Evan Cheng	12c6be84ff	Redo and generalize previously removed opt for pinsrw: (vextract (v4i32 bc (v4f32 s2v (f32 load ))), 0) -> (i32 load ) llvm-svn: 40628	2007-07-31 08:04:03 +00:00
Christopher Lamb	5fecb80efa	Change the x86 backend to use extract_subreg for truncation operations. Passes DejaGnu, SingleSource and MultiSource. llvm-svn: 40578	2007-07-29 01:24:57 +00:00
Dan Gohman	c9edd977ea	In the .loc directive, print the fields as "debug" fields, so they don't get decorated as if for immediate fields for instructions. llvm-svn: 40529	2007-07-26 15:24:15 +00:00
Evan Cheng	ac1591be42	No more noResults. llvm-svn: 40132	2007-07-21 00:34:19 +00:00
Evan Cheng	94b5a80b93	Change instruction description to split OperandList into OutOperandList and InOperandList. This gives one piece of important information: # of results produced by an instruction. An example of the change: def ADD32rr : I<0x01, MRMDestReg, (ops GR32:$dst, GR32:$src1, GR32:$src2), "add{l} {$src2, $dst\|$dst, $src2}", [(set GR32:$dst, (add GR32:$src1, GR32:$src2))]>; => def ADD32rr : I<0x01, MRMDestReg, (outs GR32:$dst), (ins GR32:$src1, GR32:$src2), "add{l} {$src2, $dst\|$dst, $src2}", [(set GR32:$dst, (add GR32:$src1, GR32:$src2))]>; llvm-svn: 40033	2007-07-19 01:14:50 +00:00
Anton Korobeynikov	383a324735	Long live the exception handling! This patch fills the last necessary bits to enable exceptions handling in LLVM. Currently only on x86-32/linux. In fact, this patch adds necessary intrinsics (and their lowering) which represent really weird target-specific gcc builtins used inside unwinder. After corresponding llvm-gcc patch will land (easy) exceptions should be more or less workable. However, exceptions handling support should not be thought as 'finished': I expect many small and not so small glitches everywhere. llvm-svn: 39855	2007-07-14 14:06:15 +00:00
Dan Gohman	e8c1e428f2	Revert the earlier change that removed the M_REMATERIALIZABLE machine instruction flag, and use the flag along with a virtual member function hook for targets to override if there are instructions that are only trivially rematerializable with specific operands (i.e. constant pool loads). llvm-svn: 37728	2007-06-26 00:48:07 +00:00
Dan Gohman	9e82064924	Replace M_REMATERIALIZIBLE and the newly-added isOtherReMaterializableLoad with a general target hook to identify rematerializable instructions. Some instructions are only rematerializable with specific operands, such as loads from constant pools, while others are always rematerializable. This hook allows both to be identified as being rematerializable with the same mechanism. llvm-svn: 37644	2007-06-19 01:48:05 +00:00
Nate Begeman	4060c7ac63	Reference correct header llvm-svn: 36834	2007-05-06 04:00:55 +00:00
Bill Wendling	157d7ee7e5	Add SSSE3 as a feature of Core2. Add MMX registers to the list of registers clobbered by a call. llvm-svn: 36448	2007-04-25 21:31:48 +00:00
Lauro Ramos Venancio	6db679a49a	X86 TLS: optimize the implementation of "local exec" model. llvm-svn: 36359	2007-04-23 01:28:10 +00:00
Lauro Ramos Venancio	efb8077ddd	X86 TLS: fix and optimize the implementation of "initial exec" model. llvm-svn: 36355	2007-04-22 22:50:52 +00:00
Lauro Ramos Venancio	2518889872	Implement "general dynamic", "initial exec" and "local exec" TLS models for X86 32 bits. llvm-svn: 36283	2007-04-20 21:38:10 +00:00
Anton Korobeynikov	8b7aab009e	Implemented correct stack probing on mingw/cygwin for dynamic alloca's. Also, fixed static case in presence of eax livin. This fixes PR331 PS: Why don't we still have push/pop instructions? :) llvm-svn: 36195	2007-04-17 09:20:00 +00:00
Bill Wendling	f099841573	Add support for our first SSSE3 instruction "pmulhrsw". llvm-svn: 35869	2007-04-10 22:10:25 +00:00
Evan Cheng	61eee86487	Mark re-materializable instructions. llvm-svn: 35230	2007-03-21 00:16:56 +00:00
Jim Laskey	f9e5445ed4	Make LABEL a builtin opcode. llvm-svn: 33537	2007-01-26 14:34:52 +00:00
Chris Lattner	197aa9634e	Fix a misencoding of CBW and CWD. This fixes PR1030. llvm-svn: 33486	2007-01-24 18:31:00 +00:00
Evan Cheng	dd60ca029c	- Switch X86-64 JIT to large code size model. - Re-enable some codegen niceties for X86-64 static relocation model codegen. - Clean ups, etc. llvm-svn: 32238	2006-12-05 19:50:18 +00:00
Evan Cheng	ae1cd75af7	- Use a different wrapper node for RIP-relative GV, etc. - Proper support for both small static and PIC modes under X86-64 - Some (non-optimal) support for medium modes. llvm-svn: 32046	2006-11-30 21:55:46 +00:00
Evan Cheng	28a9e9b230	Custom lower READCYCLECOUNTER for x86-64. llvm-svn: 32017	2006-11-29 08:28:13 +00:00
Chris Lattner	9ac6442db6	remove dead/redundant vars llvm-svn: 31435	2006-11-03 23:48:56 +00:00
Reid Spencer	b51b5c0b1f	Add debug support for X86/ELF targets (Linux). This allows llvm-gcc4 generated object modules to be debugged with gdb. Hopefully this helps pre-release debugging. llvm-svn: 31299	2006-10-30 22:32:30 +00:00
Evan Cheng	ab51cf2e78	Merge ISD::TRUNCSTORE to ISD::STORE. Switch to using StoreSDNode. llvm-svn: 30945	2006-10-13 21:14:26 +00:00
Chris Lattner	adcaf294d7	Move the Imp tblgen class from the X86 backend to common code. llvm-svn: 30907	2006-10-12 17:49:27 +00:00
Chris Lattner	b5cda3daa9	Mark ADJCALLSTACKUP/DOWN as clobbering ESP so that virtregmap will notice that it can't assume ESP is unmodified across the instrs. llvm-svn: 30905	2006-10-12 17:42:56 +00:00
Evan Cheng	577ef7694e	Add properties to ComplexPattern. llvm-svn: 30891	2006-10-11 21:03:53 +00:00
Evan Cheng	e71fe34d75	Reflects ISD::LOAD / ISD::LOADX / LoadSDNode changes. llvm-svn: 30844	2006-10-09 20:57:25 +00:00
Evan Cheng	1da0ab2f58	Delete dead code; fix 80 col violations. llvm-svn: 30583	2006-09-22 21:43:59 +00:00
Evan Cheng	4259a0f654	X86ISD::CMP now produces a chain as well as a flag. Make that the chain operand of a conditional branch to allow load folding into CMP / TEST instructions. llvm-svn: 30241	2006-09-11 02:19:56 +00:00
Evan Cheng	11b0a5dbd4	Committing X86-64 support. llvm-svn: 30177	2006-09-08 06:48:29 +00:00
Chris Lattner	dc4ff5311f	Eliminate X86ISD::TEST, using X86ISD::CMP instead. Match X86ISD::CMP patterns using test, which provides nice simplifications like: - movl %edi, %ecx - andl $2, %ecx - cmpl $0, %ecx + testl $2, %edi je LBB1_11 #cond_next90 There are a couple of dagiselemitter deficiencies that this exposes, they will be handled later. llvm-svn: 30156	2006-09-07 20:33:45 +00:00
Evan Cheng	a9411c0977	Consistency. llvm-svn: 30152	2006-09-07 19:03:48 +00:00
Evan Cheng	81b645a76b	CALLSEQ_* produces chain even if that's not needed. llvm-svn: 29603	2006-08-11 09:03:33 +00:00
Evan Cheng	683b966485	Clean up. llvm-svn: 29228	2006-07-20 21:37:39 +00:00
Evan Cheng	02d8836cd5	INC / DEC instructions have shorter code size than ADD32ri8, etc. llvm-svn: 29194	2006-07-19 00:27:29 +00:00
Evan Cheng	d5a086ab12	Emit inc / dec of registers as one byte instruction. llvm-svn: 29110	2006-07-11 19:49:49 +00:00
Evan Cheng	fa9e60895b	Add shift and rotate by 1 instructions / patterns. llvm-svn: 28980	2006-06-29 00:36:51 +00:00
Evan Cheng	2aed9ebded	Remove dead code. llvm-svn: 28938	2006-06-27 20:34:14 +00:00
Evan Cheng	c8734381ac	X86 call instructions can take variable number of operands. Parameters of vector types are passed via XMM registers. llvm-svn: 28789	2006-06-14 22:24:55 +00:00
Evan Cheng	7ae8632cb4	Incorrect AT&T opcode. llvm-svn: 28666	2006-06-02 21:09:10 +00:00
Evan Cheng	cfaffdd335	Rename ASM modifier trunc8, trunc16 to subreg8, subreg16. llvm-svn: 28606	2006-05-31 22:34:26 +00:00
Evan Cheng	cf70c7f42d	Sign extender llvm-svn: 28603	2006-05-31 22:05:11 +00:00
Evan Cheng	734e1e241b	A addressing mode folding enhancement: Fold c2 in (x << c1) \| c2 where (c2 < c1) e.g. int test(int x) { return (x << 3) + 7; } This can be codegen'd as: leal 7(,%eax,8), %eax llvm-svn: 28550	2006-05-30 06:59:36 +00:00
Evan Cheng	b9ac06bb33	Remove unused patterns. llvm-svn: 28417	2006-05-20 01:40:16 +00:00
Evan Cheng	7b8feb27c8	- Use exact-width integer types, e.g. int32_t, to avoid confusion. - Fix a couple of minor bugs in i16immSExt8 and i16immZExt8. - Added loadiPTR fragment used for indirect jumps and calls. llvm-svn: 28392	2006-05-19 18:40:54 +00:00
Evan Cheng	1c8ef9832f	Explicitly specify MOV32mi can only be used store 32-bit GV, etc. llvm-svn: 28390	2006-05-19 07:30:36 +00:00
Evan Cheng	e59042d004	Use generic iPTR instead i32 to represent pointer type. llvm-svn: 28371	2006-05-17 21:21:41 +00:00
Evan Cheng	9fee442e63	X86 integer register classes naming changes. Make them consistent with FP, vector classes. llvm-svn: 28324	2006-05-16 07:21:53 +00:00
Evan Cheng	9733bde74c	Fixing truncate. Previously we were emitting truncate from r16 to r8 as movw. That is we promote the destination operand to r16. So %CH = TRUNC_R16_R8 %BP is emitted as movw %bp, %cx. This is incorrect. If %cl is live, it would be clobbered. Ideally we want to do the opposite, that is emitted it as movb ??, %ch But this is not possible since %bp does not have a r8 sub-register. We are now defining a new register class R16_ which is a subclass of R16 containing only those 16-bit registers that have r8 sub-registers (i.e. AX - DX). We isel the truncate to two instructions, a MOV16to16_ to copy the value to the R16_ class, followed by a TRUNC_R16_R8. Due to bug 770, the register colaescer is not going to coalesce between R16 and R16_. That will be fixed later so we can eliminate the MOV16to16_. Right now, it can only be eliminated if we are lucky that source and destination registers are the same. llvm-svn: 28164	2006-05-08 08:01:26 +00:00
Evan Cheng	52c22512b9	Need extload patterns after Chris' DAG combiner changes llvm-svn: 28127	2006-05-05 08:23:07 +00:00
Evan Cheng	ddb6cc1d8e	Better implementation of truncate. ISel matches it to a pseudo instruction that gets emitted as movl (for r32 to i16, i8) or a movw (for r16 to i8). And if the destination gets allocated a subregister of the source operand, then the instruction will not be emitted at all. llvm-svn: 28119	2006-05-05 05:40:20 +00:00
Evan Cheng	f4f3f0d25f	Make x86 isel lowering produce tailcall nodes. They are match to normal calls for now. Patch contributed by Alexander Friedman. llvm-svn: 27994	2006-04-27 08:40:39 +00:00
Nate Begeman	9f0b13c885	Optimized stores to the constant pool, while cool, are unnecessary. llvm-svn: 27948	2006-04-22 22:31:45 +00:00
Nate Begeman	4ca2ea5b43	JumpTable support! What this represents is working asm and jit support for x86 and ppc for 100% dense switch statements when relocations are non-PIC. This support will be extended and enhanced in the coming days to support PIC, and less dense forms of jump tables. llvm-svn: 27947	2006-04-22 18:53:45 +00:00
Evan Cheng	ebf1006d16	- More efficient extract_vector_elt with shuffle and movss, movsd, movd, etc. - Some bug fixes and naming inconsistency fixes. llvm-svn: 27377	2006-04-03 20:53:28 +00:00
Evan Cheng	3e4d38eea5	Added missing (any_extend (load ...)) patterns. llvm-svn: 27120	2006-03-25 09:45:48 +00:00
Chris Lattner	ce0206e119	Fix the encodings of these new instructions, hopefully fixing the JIT failures from last night llvm-svn: 26981	2006-03-23 16:13:50 +00:00
Nate Begeman	fb6e02931c	Add support for 8 bit immediates with 16/32 bit cmp instructions llvm-svn: 26966	2006-03-23 01:29:48 +00:00
Evan Cheng	9bf978dc20	Use the generic vector register classes VR64 / VR128 rather than V4F32, V8I16, etc. llvm-svn: 26838	2006-03-18 01:23:20 +00:00
Evan Cheng	4f674921d6	Move some pattern fragments to the right files. llvm-svn: 26831	2006-03-17 19:55:52 +00:00
Evan Cheng	27750f3287	- Nuke 16-bit SBB instructions. We'll never use them. - Nuke a bogus comment. llvm-svn: 26815	2006-03-17 02:24:04 +00:00
Evan Cheng	70b25efa57	X86ISD::REP_STOS and X86ISD::REP_MOVS now produces a flag. llvm-svn: 26604	2006-03-07 23:34:23 +00:00
Evan Cheng	30d7b70b73	Enable Dwarf debugging info. llvm-svn: 26581	2006-03-07 02:02:57 +00:00
Chris Lattner	ad3c974a77	remove the read/write port/io intrinsics. llvm-svn: 26479	2006-03-03 00:19:58 +00:00
Evan Cheng	1fac3b3360	* Allow mul, shl nodes to be codegen'd as LEA (if appropriate). * Add patterns to handle GlobalAddress, ConstantPool, etc. MOV32ri to materialize these nodes in registers. ADD32ri to handle %reg + GA, etc. MOV32mi to handle store GA, etc. to memory. llvm-svn: 26374	2006-02-25 10:02:21 +00:00
Evan Cheng	e0ed6ec13f	- Clean up the lowering and selection code of ConstantPool, GlobalAddress, and ExternalSymbol. - Use C++ code (rather than tblgen'd selection code) to match the above mentioned leaf nodes. Do not mutate and nodes and do not record the selection in CodeGenMap. These nodes should be safe to duplicate. This is a performance win. llvm-svn: 26335	2006-02-23 20:41:18 +00:00
Evan Cheng	1f342c2884	PIC related bug fixes. 1. Various asm printer bug. 2. Lowering bug. Now TargetGlobalAddress is wrapped in X86ISD::TGAWrapper. llvm-svn: 26324	2006-02-23 02:43:52 +00:00
Evan Cheng	9e252e3bcf	Added MMX, SSE1, and SSE2 vector instructions and some simple patterns. Fixed some existing bugs (wrong predicates, prefixes) at the same time. llvm-svn: 26310	2006-02-22 02:26:30 +00:00
Evan Cheng	d58478161f	One more round of reorg so sabre doesn't freak out. :-) llvm-svn: 26303	2006-02-21 20:00:20 +00:00
Evan Cheng	6fc1162855	A big more cleaning up. llvm-svn: 26302	2006-02-21 19:30:30 +00:00
Evan Cheng	8711b6bff3	Moving things to their proper places. llvm-svn: 26301	2006-02-21 19:26:52 +00:00
Evan Cheng	6e595b9fd8	Split instruction info into multiple files, one for each of x87, MMX, and SSE. llvm-svn: 26300	2006-02-21 19:13:53 +00:00
Evan Cheng	d57203c0a1	Added separate alias instructions for SSE logical ops that operate on non-packed types. llvm-svn: 26297	2006-02-21 02:24:38 +00:00
Evan Cheng	afffe63fc1	Added MMX and XMM packed integer move instructions, movd and movq. llvm-svn: 26296	2006-02-21 01:39:57 +00:00
Evan Cheng	43070b7541	Added x86 integer vector types: 64-bit packed byte integer (v16i8), 64-bit packed word integer (v8i16), and 64-bit packed doubleword integer (v2i32). llvm-svn: 26294	2006-02-20 22:34:53 +00:00
Evan Cheng	70af620709	Added fisttp for fp to int conversion. llvm-svn: 26283	2006-02-18 02:36:28 +00:00
Evan Cheng	5588de9415	x86 / Darwin PIC support. llvm-svn: 26273	2006-02-18 00:15:05 +00:00
Nate Begeman	5965bd19f8	kill ADD_PARTS & SUB_PARTS and replace them with fancy new ADDC, ADDE, SUBC and SUBE nodes that actually expose what's going on and allow for significant simplifications in the targets. llvm-svn: 26255	2006-02-17 05:43:56 +00:00
Evan Cheng	eb7b3380fd	pxor (for FLD0SS) encoding was missing the OpSize prefix. llvm-svn: 26244	2006-02-16 23:59:30 +00:00
Evan Cheng	24c461b51e	1. Use pxor instead of xoraps / xorapd to clear FR32 / FR64 registers. This proves to be worth 20% on Ptrdist/ks. Might be related to dependency breaking support. 2. Added FsMOVAPSrr and FsMOVAPDrr as aliases to MOVAPSrr and MOVAPDrr. These are used for FR32 / FR64 reg-to-reg copies. 3. Tell reg-allocator to generate MOVSSrm / MOVSDrm and MOVSSmr / MOVSDmr to spill / restore FsMOVAPSrr and FsMOVAPDrr. llvm-svn: 26241	2006-02-16 22:45:17 +00:00
Evan Cheng	01afec2adb	MOVAPSrr and MOVAPDrr instruction format should be MRMSrcReg. llvm-svn: 26234	2006-02-16 19:34:41 +00:00
Evan Cheng	aacc4c3b4c	cvtsd2ss / cvtss2sd encoding bug. llvm-svn: 26193	2006-02-15 00:31:03 +00:00
Evan Cheng	665c26ab40	movaps, movapd encoding bug. llvm-svn: 26192	2006-02-15 00:11:37 +00:00
Chris Lattner	d62a3bfa66	Eliminate the printCallOperand method, using a 'call' modifier on printOperand instead. llvm-svn: 26025	2006-02-06 23:41:19 +00:00
Evan Cheng	0a977c95aa	Remove an unnecessary predicate. llvm-svn: 25954	2006-02-04 02:23:01 +00:00
Evan Cheng	11613a5219	Separate FILD and FILD_FLAG, the later is only used for SSE2. It produces a flag so it can be flagged to a FST. llvm-svn: 25953	2006-02-04 02:20:30 +00:00
Evan Cheng	8b40cde148	Rearrange code to my liking. :) llvm-svn: 25887	2006-02-01 23:01:57 +00:00
Evan Cheng	9e350cd6ad	- Use xor to clear integer registers (set R, 0). - Added a new format for instructions where the source register is implied and it is same as the destination register. Used for pseudo instructions that clear the destination register. llvm-svn: 25872	2006-02-01 06:13:50 +00:00
Evan Cheng	72d5c256c9	- Allow XMM load (for scalar use) to be folded into ANDP* and XORP. - Use XORP to implement fneg. llvm-svn: 25857	2006-01-31 22:28:30 +00:00
Chris Lattner	c642aa5e1c	* Fix 80-column violations * Rename hasSSE -> hasSSE1 to avoid my continual confusion with 'has any SSE'. * Add inline asm constraint specification. llvm-svn: 25854	2006-01-31 19:43:35 +00:00
Evan Cheng	2dd217b88f	Added custom lowering of fabs llvm-svn: 25831	2006-01-31 03:14:29 +00:00
Evan Cheng	5b97fcf0f5	Always use FP stack instructions to perform i64 to f64 as well as f64 to i64 conversions. SSE does not have instructions to handle these tasks. llvm-svn: 25817	2006-01-30 08:02:57 +00:00
Chris Lattner	132177e103	The FP stack doesn't support UNDEF, ask the legalizer to legalize it instead of lying and saying we have it. llvm-svn: 25775	2006-01-29 06:44:22 +00:00
Evan Cheng	63045d221b	AT&T assembly convention: registers are in lower case. llvm-svn: 25714	2006-01-27 22:53:29 +00:00
Evan Cheng	cde9e30bc6	x86 CPU detection and proper subtarget support llvm-svn: 25679	2006-01-27 08:10:46 +00:00
Chris Lattner	1240574609	PHI and INLINEASM are now built-in instructions provided by Target.td llvm-svn: 25674	2006-01-27 01:46:15 +00:00
Evan Cheng	97c68f0f5c	Remove the uses of STATUS flag register. Rely on node property SDNPInFlag, SDNPOutFlag, and SDNPOptInFlag instead. llvm-svn: 25629	2006-01-26 00:29:36 +00:00
Chris Lattner	6f33eaeb81	Emit the copies out of call return registers after the ISD::CALLSEQ_END node, fixing fastcc and the case where a function has a frame pointer due to dynamic allocas. llvm-svn: 25580	2006-01-24 05:17:12 +00:00
Evan Cheng	468fecdc99	Rename fcmovae to fcmovnb and fcmova to fcmovnbe (following Intel manual). Some assemblers can't recognize the aliases. llvm-svn: 25494	2006-01-21 02:55:41 +00:00
Evan Cheng	cce748d316	A few more SH{L\|R}D peepholes. llvm-svn: 25473	2006-01-20 01:13:30 +00:00
Evan Cheng	8591b9f254	Added i16 SH{L\|R}D patterns. llvm-svn: 25468	2006-01-19 23:26:24 +00:00
Evan Cheng	91007126c2	adc and sbb need an incoming flag to ensure it reads the carry flag from add / sub. llvm-svn: 25444	2006-01-19 06:53:20 +00:00
Evan Cheng	a7bfbe996e	Two peepholes: (or (x >> c) \| (y << (32 - c))) ==> (shrd x, y, c) (or (x << c) \| (y >> (32 - c))) ==> (shld x, y, c) llvm-svn: 25438	2006-01-19 01:56:29 +00:00
Evan Cheng	14417ed99c	Zero extending load from i1 to i8. llvm-svn: 25391	2006-01-17 07:02:46 +00:00
Evan Cheng	bec9d720b0	Bug fixes: fpGETRESULT should produces a flag result and X86ISD::FST should read a flag. llvm-svn: 25378	2006-01-17 00:19:47 +00:00
Evan Cheng	c14bb1026b	More typo's llvm-svn: 25375	2006-01-16 23:26:53 +00:00
Evan Cheng	64eeed27d9	Some typo's llvm-svn: 25374	2006-01-16 22:48:46 +00:00
Evan Cheng	911c68d7a8	Fix FP_TO_INT**_IN_MEM lowering. llvm-svn: 25368	2006-01-16 21:21:29 +00:00
Evan Cheng	2494ce49f0	Added patterns for 8-bit multiply llvm-svn: 25338	2006-01-15 10:05:20 +00:00
Nate Begeman	2fba8a3aaa	bswap implementation llvm-svn: 25312	2006-01-14 03:14:10 +00:00
Evan Cheng	3bc25e8a54	A typo. llvm-svn: 25307	2006-01-14 01:18:49 +00:00
Evan Cheng	392c7d2779	Add truncstore i1 patterns. llvm-svn: 25296	2006-01-13 21:45:19 +00:00
Evan Cheng	6305e50ee1	Fix sint_to_fp (fild*) support. llvm-svn: 25257	2006-01-12 22:54:21 +00:00
Evan Cheng	c993d4522d	Specify transformation from GlobalAddress to TargetGlobalAddress and ExternalSymbol to TargetExternalSymbol. llvm-svn: 25253	2006-01-12 19:36:31 +00:00
Evan Cheng	84dc9b55f0	X86ISD::SETCC (e.g. SETEr) produces a flag (so multiple SETCC can be linked together). llvm-svn: 25247	2006-01-12 08:27:59 +00:00
Evan Cheng	b94db9e9a4	* Materialize GlobalAddress and ExternalSym with MOV32ri rather than LEA32r. * Do not lower GlobalAddress to TargetGlobalAddress. Let isel does it. llvm-svn: 25246	2006-01-12 07:56:47 +00:00
Evan Cheng	6d2ab04463	Added ROTL and ROTR. llvm-svn: 25232	2006-01-11 23:20:05 +00:00
Evan Cheng	ae986f1f1e	Support for MEMCPY and MEMSET. llvm-svn: 25226	2006-01-11 22:15:48 +00:00
Evan Cheng	bc7a0f44bd	* Add special entry code main() (to set x87 to 64-bit precision). * Allow a register node as SelectAddr() base. * ExternalSymbol -> TargetExternalSymbol as direct function callee. * Use X86::ESP register rather than CopyFromReg(X86::ESP) as stack ptr for call parmater passing. llvm-svn: 25207	2006-01-11 06:09:51 +00:00
Evan Cheng	339edad775	SSE cmov support. llvm-svn: 25190	2006-01-11 00:33:36 +00:00
Evan Cheng	efaf5c56fd	* fp to sint patterns. * fiadd, fisub, etc. llvm-svn: 25189	2006-01-10 22:22:02 +00:00
Evan Cheng	73a1ad975e	FP_TO_INT*_IN_MEM and x87 FP Select support. llvm-svn: 25188	2006-01-10 20:26:56 +00:00
Evan Cheng	7c4486215f	* Added undef patterns. * Some reorg. llvm-svn: 25163	2006-01-09 23:10:28 +00:00
Evan Cheng	9c249c37f8	Support for ADD_PARTS, SUB_PARTS, SHL_PARTS, SHR_PARTS, and SRA_PARTS. llvm-svn: 25158	2006-01-09 18:33:28 +00:00
Evan Cheng	53dd0ac226	Addd (shl x, 1) ==> (shl x, x) peepholes. llvm-svn: 25123	2006-01-06 02:31:59 +00:00
Evan Cheng	172fce7050	* Fast call support. * FP cmp, setcc, etc. llvm-svn: 25117	2006-01-06 00:43:03 +00:00
Evan Cheng	a5ae6e8320	Added ConstantFP patterns. llvm-svn: 25108	2006-01-05 02:08:37 +00:00
Evan Cheng	45e19098a6	DAG based isel call support. llvm-svn: 25103	2006-01-05 00:27:02 +00:00
Evan Cheng	14c53b45f5	Added field noResults to Instruction. Currently tblgen cannot tell which operands in the operand list are results so it assumes the first one is a result. This is bad. Ideally we would fix this by separating results from inputs, e.g. (res R32:$dst), (ops R32:$src1, R32:$src2). But that's a more distruptive change. Adding 'let noResults = 1' is the workaround to tell tblgen that the instruction does not produces a result. It works for now since tblgen does not support instructions which produce multiple results. llvm-svn: 25017	2005-12-26 09:11:45 +00:00
Evan Cheng	9ae486047e	* Removed the use of FLAG. Now use hasFlagIn and hasFlagOut instead. * Added a pseudo instruction (for each target) that represent "return void". This is a workaround for lack of optional flag operand (return void is not lowered so it does not have a flag operand.) llvm-svn: 24997	2005-12-23 22:14:32 +00:00
Evan Cheng	5c59d49630	More X86 floating point patterns. llvm-svn: 24990	2005-12-23 07:31:11 +00:00
Evan Cheng	dfad8ed54e	Bye bye HACKTROCITY. llvm-svn: 24935	2005-12-22 02:26:21 +00:00
Evan Cheng	9cdc16c6d3	* Fix a GlobalAddress lowering bug. * Teach DAG combiner about X86ISD::SETCC by adding a TargetLowering hook. llvm-svn: 24921	2005-12-21 23:05:39 +00:00
Evan Cheng	02767195bb	Oops. Accidentally deleted RET pattern. It's still needed for return void; llvm-svn: 24920	2005-12-21 22:22:16 +00:00
Evan Cheng	c1583dbd63	* Added support for X86 RET with an additional operand to specify number of bytes to pop off stack. * Added support for X86 SETCC. llvm-svn: 24917	2005-12-21 20:21:51 +00:00
Chris Lattner	0dcdd83c0e	This was meant to go in llvm-svn: 24900	2005-12-21 07:50:26 +00:00
Chris Lattner	f431ad4477	Rewrite FP stackifier support in the X86InstrInfo.td file, splitting patterns that were overloaded to work before and after the stackifier runs. With the new clean world, it is possible to write patterns for these instructions: woo! This also adds a few simple patterns here and there, though there are a lot still missing. These should be easy to add though. :) See the comments under "Floating Point Stack Support" for more details on the new world order. This patch as absolutely no effect on the generated code, woo! llvm-svn: 24899	2005-12-21 07:47:04 +00:00
Chris Lattner	988827a482	Wrap some long lines: no functionality change llvm-svn: 24898	2005-12-21 05:34:58 +00:00
Evan Cheng	a74ce62746	* Added lowering hook for external weak global address. It inserts a load for Darwin. * Added lowering hook for ISD::RET. It inserts CopyToRegs for the return value (or store / fld / copy to ST(0) for floating point value). This eliminate the need to write C++ code to handle RET with variable number of operands. llvm-svn: 24888	2005-12-21 02:39:21 +00:00
Evan Cheng	5c0b4df483	SSE2 floating point load / store patterns. SSE2 fp to int conversion patterns. llvm-svn: 24886	2005-12-20 22:59:51 +00:00
Evan Cheng	5815a6e455	Added X86 readport patterns. llvm-svn: 24879	2005-12-20 07:38:38 +00:00
Evan Cheng	6fc31046aa	X86 conditional branch support. llvm-svn: 24870	2005-12-19 23:12:38 +00:00
Chris Lattner	db8e888fb5	eliminate some redundancy llvm-svn: 24781	2005-12-17 19:47:05 +00:00
Evan Cheng	b06925d1dd	Added anyext, modelled as zext on X86. llvm-svn: 24759	2005-12-17 01:47:57 +00:00
Evan Cheng	cb19390ead	Added support for cmp, test, and conditional move instructions. llvm-svn: 24756	2005-12-17 01:24:02 +00:00
Evan Cheng	74151ba279	* Promote all 1 bit entities to 8 bit. * Handling extload (1 bit -> 8 bit) and remove C++ code that handle 1 bit zextload. llvm-svn: 24726	2005-12-15 19:49:23 +00:00
Evan Cheng	305c6a73b5	Added frameindex, constpool, globaladdr, and externalsym as root nodes of leaaddr. llvm-svn: 24724	2005-12-15 08:31:04 +00:00
Evan Cheng	bc9344477e	Use MOV8rm to load 1 bit value. llvm-svn: 24721	2005-12-15 00:59:17 +00:00
Evan Cheng	c273900dd8	Added sext and zext patterns. llvm-svn: 24705	2005-12-14 02:22:27 +00:00
Evan Cheng	229f0ee6d7	Add load + store folding srl and sra patterns. llvm-svn: 24696	2005-12-13 07:24:22 +00:00
Evan Cheng	acec857b1a	Beautify a few patterns. llvm-svn: 24690	2005-12-13 02:40:18 +00:00
Evan Cheng	89c6db4baf	Some shl patterns which do load + store folding. llvm-svn: 24689	2005-12-13 02:34:51 +00:00
Evan Cheng	108beceb0f	A few helper fragments for loads. e.g. (i8 (load addr:$src)) -> (loadi8 addr:$src). Only to improve readibility. llvm-svn: 24688	2005-12-13 01:57:51 +00:00
Evan Cheng	ddd5ae5a22	Add and, or, and xor patterns which fold load + stores. llvm-svn: 24687	2005-12-13 01:41:36 +00:00
Evan Cheng	e5a94a03e2	Add inc + dec patterns which fold load + stores. llvm-svn: 24686	2005-12-13 01:02:47 +00:00
Evan Cheng	bde9e6fca6	Add neg and not patterns which fold load + stores. llvm-svn: 24685	2005-12-13 00:54:44 +00:00
Evan Cheng	c414d563f0	Missed a couple redundant explicit type casts. llvm-svn: 24684	2005-12-13 00:25:07 +00:00
Evan Cheng	62e6808aa5	Fix some bad choice of names: i16SExt8 ->i16immSExt8, etc. llvm-svn: 24683	2005-12-13 00:14:11 +00:00
Evan Cheng	86b2cf22d2	* Split immSExt8 to i16SExt8 and i32SExt8 for i16 and i32 immediate operands. This enables the removal of some explicit type casts. * Rename immZExt8 to i16ZExt8 as well. llvm-svn: 24682	2005-12-13 00:01:09 +00:00
Evan Cheng	3e52756928	Add some integer mul patterns. llvm-svn: 24681	2005-12-12 23:47:46 +00:00
Evan Cheng	af3fe8217a	Add some sub patterns. llvm-svn: 24675	2005-12-12 21:54:05 +00:00
Evan Cheng	e80248b378	Add a few more add / store patterns. e.g. ADD32mi8. llvm-svn: 24670	2005-12-12 19:45:23 +00:00
Evan Cheng	0d6cfee704	* Added X86 store patterns. * Added X86 dec patterns. llvm-svn: 24654	2005-12-10 00:48:20 +00:00
Evan Cheng	275a3ed80c	Added patterns for ADD8rm, etc. These fold load operands. e.g. addb 4(%esp), %al llvm-svn: 24648	2005-12-09 22:48:48 +00:00
Evan Cheng	f039648614	Added explicit type field to ComplexPattern. llvm-svn: 24637	2005-12-08 02:15:07 +00:00
Evan Cheng	c9fab31098	* Added intelligence to X86 LEA addressing mode matching routine so it returns false if the match is not profitable. e.g. leal 1(%eax), %eax. * Added patterns for X86 integer loads and LEA32. llvm-svn: 24635	2005-12-08 02:01:35 +00:00
Evan Cheng	c0c190239d	Remove unnecessary let hasCtrlDep=1 now it can be inferred. llvm-svn: 24611	2005-12-05 23:09:43 +00:00
Chris Lattner	3c0b8f577d	Several things: 1. Remove redundant type casts now that PR673 is implemented. 2. Implement the OUTir instructions correctly. The port number really is* a 16-bit value, but the patterns should only match if the number is 0-255. Update the patterns so they now match. 3. Fix patterns for shifts to reflect that the shift amount is always an i8, not an i16 as they were believed to be before. This previous fib stopped working when we started knowing that CL has type i8. 4. Change use of i16i8imm in SH*ri patterns to all be imm. llvm-svn: 24599	2005-12-05 02:40:25 +00:00
Evan Cheng	95cb763818	Added isel patterns for RET, JMP, and WRITEPORT. llvm-svn: 24588	2005-12-04 08:19:43 +00:00
Evan Cheng	4b02426130	Proper support for shifts with register shift value. llvm-svn: 24559	2005-12-01 00:43:55 +00:00
Nate Begeman	6f8c1ace6e	No longer track value types for asm printer operands, and remove them as an argument to every operand printing function. Requires some slight tweaks to x86, the only user. llvm-svn: 24541	2005-11-30 18:54:35 +00:00
Chris Lattner	9c7af08bc9	Fix a bug in a recent patch that broke shifts llvm-svn: 24526	2005-11-30 05:11:18 +00:00
Evan Cheng	72ab335858	Add more X86 ISel patterns. llvm-svn: 24520	2005-11-29 19:38:52 +00:00
Chris Lattner	d1061ac8d1	encode rdtsc correctly llvm-svn: 24435	2005-11-20 22:13:18 +00:00
Andrew Lenharth	0bf68ae434	The second patch of X86 support for read cycle counter. llvm-svn: 24430	2005-11-20 21:41:10 +00:00
Chris Lattner	d7102c4980	Teach the x86 backend about the register constraints of its addressing mode. Patch by Evan Cheng llvm-svn: 24423	2005-11-19 07:01:30 +00:00
Chris Lattner	57ce97862d	add more patterns, patch by Evan Cheng. llvm-svn: 24406	2005-11-18 01:04:42 +00:00
Chris Lattner	2bf458af92	Add patterns for some 16-bit immediate instructions, patch contributed by Evan Cheng. llvm-svn: 24384	2005-11-17 02:01:55 +00:00
Chris Lattner	5930d3df3d	Add patterns for several simple instructions that take i32 immediates. Patch contributed by Evan Cheng! llvm-svn: 24382	2005-11-16 22:59:19 +00:00
Nate Begeman	9d7008b08d	Properly split f32 and f64 into separate register classes for scalar sse fp fixing a bunch of nasty hackery llvm-svn: 23735	2005-10-14 22:06:00 +00:00
Chris Lattner	2e84be22a8	give all operands names llvm-svn: 23356	2005-09-14 21:10:24 +00:00
Chris Lattner	423d7cbbf8	add a few missing cases llvm-svn: 22891	2005-08-19 00:41:29 +00:00
Chris Lattner	e2967ac53d	Give ADJCALLSTACKDOWN/UP the correct operands. Give a whole bunch of other stuff variable operands, particularly FP. The FP stackifier is playing fast and loose with operands here, so we have to mark them all as variable. This will have to be fixed before we can dag->dag the X86 backend. The solution is for the pre-stackifier and post-stackifier instructions to all be disjoint. llvm-svn: 22890	2005-08-19 00:38:22 +00:00
Nate Begeman	8d394eb703	Scalar SSE: load +0.0 -> xorps/xorpd Scalar SSE: a < b ? c : 0.0 -> cmpss, andps Scalar SSE: float -> i16 needs to be promoted llvm-svn: 22637	2005-08-03 23:26:28 +00:00
Nate Begeman	a0b5e035ea	Get closer to fully working scalar FP in SSE regs. This gets singlesource working, and Olden/power. llvm-svn: 22441	2005-07-15 00:38:55 +00:00
Nate Begeman	8a0933608a	First round of support for doing scalar FP using the SSE2 ISA extension and XMM registers. There are many known deficiencies and fixmes, which will be addressed ASAP. The major benefit of this work is that it will allow the LLVM register allocator to allocate FP registers across basic blocks. The x86 backend will still default to x87 style FP. To enable this work, you must pass -enable-sse-scalar-fp and either -sse2 or -sse3 to llc. An example before and after would be for: double foo(double *P) { double Sum = 0; int i; for (i = 0; i < 1000; ++i) Sum += P[i]; return Sum; } The inner loop looks like the following: x87: .LBB_foo_1: # no_exit fldl (%esp) faddl (%eax,%ecx,8) fstpl (%esp) incl %ecx cmpl $1000, %ecx #FP_REG_KILL jne .LBB_foo_1 # no_exit SSE2: addsd (%eax,%ecx,8), %xmm0 incl %ecx cmpl $1000, %ecx #FP_REG_KILL jne .LBB_foo_1 # no_exit llvm-svn: 22340	2005-07-06 18:59:04 +00:00
Nate Begeman	db32921535	Initial set of .td file changes necessary to get scalar fp in xmm registers working. The instruction selector changes will hopefully be coming later this week once they are debugged. This is necessary to support the darwin x86 FP model, and is recommended by intel as the replacement for x87. As a bonus, the register allocator knows how to deal with these registers across basic blocks, unliky the FP stackifier. This leads to significantly better codegen in several cases. llvm-svn: 22300	2005-06-27 21:20:31 +00:00
Chris Lattner	3f5a98d1f4	Add markers in the asm file for tail calls, add a new ADJSTACKPTRri sorta-pseudo-instruction llvm-svn: 22042	2005-05-15 03:10:37 +00:00
Chris Lattner	6b5fa91a63	Yes, calltarget is the operand of the day. llvm-svn: 22040	2005-05-15 01:10:30 +00:00
Chris Lattner	f0649db870	Add some new instructions llvm-svn: 22036	2005-05-14 23:35:21 +00:00
Chris Lattner	6e4c2302e6	add 'ret imm' instruction llvm-svn: 21945	2005-05-13 17:56:48 +00:00
Chris Lattner	46b5ca4310	Fix the syntax of the i/o instructions, these are obviously unused. llvm-svn: 21829	2005-05-09 20:49:20 +00:00
Chris Lattner	61827484c7	Add some new X86 instrs, patch contributed by Morten Ofstad llvm-svn: 21608	2005-04-28 21:50:05 +00:00
Chris Lattner	c21db6b15c	add signed versions of the extra precision multiplies llvm-svn: 21106	2005-04-06 04:19:22 +00:00
Chris Lattner	2d451658a6	add an fabs instr llvm-svn: 21006	2005-04-02 04:31:56 +00:00
Chris Lattner	0ce80cd542	Fix spelling, patch contributed by Gabor Greif! llvm-svn: 20343	2005-02-27 06:18:25 +00:00
Chris Lattner	0edf9535b9	Add rotate instructions. llvm-svn: 19690	2005-01-19 07:50:03 +00:00
Chris Lattner	d54845f530	Improve coverage of the X86 instruction set by adding 16-bit shift doubles. llvm-svn: 19687	2005-01-19 07:31:24 +00:00
Chris Lattner	2947801735	Teach the code generator that shrd/shld is commutable if it has an immediate. This allows us to generate this: foo: mov %EAX, DWORD PTR [%ESP + 4] mov %EDX, DWORD PTR [%ESP + 8] shld %EDX, %EDX, 2 shl %EAX, 2 ret instead of this: foo: mov %EAX, DWORD PTR [%ESP + 4] mov %ECX, DWORD PTR [%ESP + 8] mov %EDX, %EAX shrd %EDX, %ECX, 30 shl %EAX, 2 ret Note the magically transmogrifying immediate. llvm-svn: 19686	2005-01-19 07:11:01 +00:00
Chris Lattner	5b589ec0c4	Add conditional moves for the parity flag. llvm-svn: 19437	2005-01-10 22:09:33 +00:00
Chris Lattner	d4bb2bbce1	ADC and IMUL are also commutable. llvm-svn: 19264	2005-01-03 01:27:59 +00:00
Chris Lattner	295e45e60e	Two changes here: 1. Add new instructions for checking parity flags: JP, JNP, SETP, SETNP. 2. Set the isCommutable and isPromotableTo3Address bits on several instructions. llvm-svn: 19246	2005-01-02 02:35:46 +00:00
John Criswell	04570265a5	Correct the name of stosd for the AT&T syntax: It's stosl (l for long == 32 bit). llvm-svn: 17658	2004-11-10 04:48:15 +00:00
Chris Lattner	93867e516a	Remove debugging code, fix encoding problem. This fixes the problems the JIT had last night. llvm-svn: 16766	2004-10-06 14:31:50 +00:00
Chris Lattner	e9bfa5a2a4	Add some new instructions. Fix the asm string for sbb32rr llvm-svn: 16759	2004-10-06 04:01:02 +00:00
Chris Lattner	8bbde2fb33	Convert some missed patterns to support AT&T style llvm-svn: 16645	2004-10-04 07:23:07 +00:00
Chris Lattner	2e99778aad	Apparently the GNU assembler has a HUGE hack to be compatible with really old and broken AT&T syntax assemblers. The problem with this hack is that SOME forms of the fdiv and fsub instructions have the 'r' bit inverted. This was a real pain to figure out, but is trivially easy to support: thus we are now bug compatible with gas and gcc. llvm-svn: 16644	2004-10-04 07:08:46 +00:00
Chris Lattner	af69503332	Fix incorrect suffix llvm-svn: 16642	2004-10-04 05:20:16 +00:00
Chris Lattner	e1a2826d51	Fix some more missed suffixes and swapped operands llvm-svn: 16641	2004-10-04 01:38:10 +00:00
Chris Lattner	a488f04f3e	Add missing suffixes to FP instructions for AT&T mode llvm-svn: 16640	2004-10-04 00:43:31 +00:00
Chris Lattner	4e59a14909	Add support to the instruction patterns for AT&T style output, which will hopefully lead to the death of the 'GasBugWorkaroundEmitter'. This also includes changes to wrap the whole file to 80 columns! Woot! :) Note that the AT&T style output has not been tested at all. llvm-svn: 16638	2004-10-03 20:35:00 +00:00
Alkis Evlogimenos	371403193c	Use a shorter form to express implicit use/defs in FpGETRESULT and FpSETRESULT. llvm-svn: 16247	2004-09-08 18:29:31 +00:00
Alkis Evlogimenos	8b700215ed	A call instruction should implicitely define ST0 since the return value is returned in that register. The pseudo instructions FpGETRESULT and FpSETRESULT shold also have an implicity use and def of ST0 repsecitvely. llvm-svn: 16246	2004-09-08 16:54:54 +00:00
Chris Lattner	eb34c59930	Remove a bunch of ad-hoc target-specific flags that were only used by the old asmprinter. llvm-svn: 15660	2004-08-11 07:12:04 +00:00
Chris Lattner	a0bafce127	Add asmprintergen support for the last X86 instruction that needs it: pcrelative calls. llvm-svn: 15657	2004-08-11 06:59:12 +00:00
Chris Lattner	5f4b65e57d	Scrunch memoperands, add a few more for floating point memops Eliminate the FPI*m classes, converting them to use FPI instead. llvm-svn: 15655	2004-08-11 06:50:10 +00:00
Chris Lattner	6dd0474edd	Make FPI take asm string and operand list llvm-svn: 15653	2004-08-11 05:54:16 +00:00
Chris Lattner	c52899c3c7	Nuke the Imi patterns, by asmprintergenifying all users. llvm-svn: 15652	2004-08-11 05:31:07 +00:00
Chris Lattner	f5c767038a	X86 instructions that read-modify-write memory are not LLVM two-address instructions. llvm-svn: 15651	2004-08-11 05:07:25 +00:00
Chris Lattner	0d7bc2c5da	Get rid of the Im8, Im16, Im32 classes, converting more instructions over to asmprintergeneration llvm-svn: 15650	2004-08-11 04:31:00 +00:00
Chris Lattner	09ee05bcdf	Convert asmprinter to new style of instruction printer Start asmprintergen'ifying machine instrs with memory operands. llvm-svn: 15646	2004-08-11 02:25:00 +00:00
Chris Lattner	ce5fb7db1c	This is purely a formatting patch that gets us closer to the mecca of fitting X86InstrInfo.td into 80 columns llvm-svn: 15629	2004-08-10 21:21:30 +00:00
Chris Lattner	116fc25d79	Drop the first argument of FPI, and asmprinterify fxch llvm-svn: 15628	2004-08-10 21:02:13 +00:00
Chris Lattner	ead14c1a07	This purely mechanical patch gives the "I" tblgen class operand list and asm string operands, and adjusts all users to pass them in instead of using II. llvm-svn: 15624	2004-08-10 20:17:41 +00:00
Chris Lattner	c4eb5951d5	Convert Ii32 instructions over to use the asmprinter generator llvm-svn: 15621	2004-08-10 19:06:36 +00:00
Chris Lattner	9f49a91b44	Convert the Ii16 instructions over llvm-svn: 15606	2004-08-10 16:22:02 +00:00
Chris Lattner	4d66b78036	Convert all Ii8 instructions over to the autogenerated asmprinter. llvm-svn: 15605	2004-08-10 16:09:54 +00:00
Chris Lattner	2b47c02b64	Convert all I<> instructions to asmformat. Delete the 'name' field of all instructions that have asmformats. llvm-svn: 15403	2004-08-01 09:52:59 +00:00
Chris Lattner	27fcf976f2	Eliminate 3 of the X86 printImplicit* flags. llvm-svn: 15398	2004-08-01 08:23:17 +00:00
Chris Lattner	f6bd77190e	Convert more instructions over to the asmprinter llvm-svn: 15396	2004-08-01 08:13:11 +00:00
Chris Lattner	275d98dcbb	Switch more instructions over to using the asmprinter. Fix bugs in the emission of in/out instructions (missing %'s on registers). llvm-svn: 15393	2004-08-01 07:44:35 +00:00
Chris Lattner	321763358b	Specify an asm string and operands lists for a bunch of instructions. This only really covers no-operand instructions so far. llvm-svn: 15387	2004-08-01 06:01:00 +00:00
Chris Lattner	70d2260eb9	Entirely eliminate all patterns and expanders from this file. We shall go with an incremental approach rather than a revolutionary approach. llvm-svn: 15379	2004-08-01 03:25:01 +00:00
Chris Lattner	66a13e230d	Mark barrier instructions. Execution does not fall through uncond branches or return intructions. llvm-svn: 15356	2004-07-31 02:10:53 +00:00
Chris Lattner	5ed9113e14	No really, these are dead now llvm-svn: 14145	2004-06-11 04:50:14 +00:00
Chris Lattner	b35f47627d	Now that compare instructions aren't lumped in with the other twoargfp instructions, we can get rid of the FpUCOM/FpUCOMi pseudo instructions, which makes stuff simpler and faster. llvm-svn: 14144	2004-06-11 04:49:02 +00:00
Chris Lattner	0876edf122	Introduce a new FP instruction type to separate the compare cases from the twoarg cases. llvm-svn: 14143	2004-06-11 04:41:24 +00:00
Chris Lattner	a0cfedef3a	Add support for the setp instructions llvm-svn: 14140	2004-06-11 04:30:06 +00:00
Chris Lattner	a340febe52	Add immediate forms of in/out. Use let to shorten lines llvm-svn: 12895	2004-04-13 17:19:31 +00:00
Chris Lattner	a24f986333	Fix issues that the local allocator has dealing with instructions that implicitly use ST(0) llvm-svn: 12855	2004-04-12 03:02:48 +00:00
Chris Lattner	2e2b0ceab9	No really, fix printing for LLC. I gotta get a way for CVS to whine at me if I have unsaved emacs buffers, geeze... llvm-svn: 12854	2004-04-12 01:52:04 +00:00
Chris Lattner	ba1038e0f3	Correct printing for LLC and the encoding for the JIT llvm-svn: 12853	2004-04-12 01:50:04 +00:00
Chris Lattner	6c84d4ca44	Add two new instructions llvm-svn: 12850	2004-04-12 01:38:55 +00:00
Chris Lattner	b6e0b58fbc	Add some new instructions llvm-svn: 12838	2004-04-11 20:24:15 +00:00
John Criswell	c28c3b625f	Changes recommended by Chris: InstSelectSimple.cpp: Change the checks for proper I/O port address size into an exit() instead of an assertion. Assertions aren't used in Release builds, and handling this error should be graceful (not that this counts as graceful, but it's more graceful). Modified the generation of the IN/OUT instructions to have 0 arguments. X86InstrInfo.td: Added the OpSize attribute to the 16 bit IN and OUT instructions. llvm-svn: 12786	2004-04-08 22:39:13 +00:00
John Criswell	10db062d41	Added the llvm.readport and llvm.writeport intrinsics for x86. These do I/O port instructions on x86. The specific code sequence is tailored to the parameters and return value of the intrinsic call. Added the ability for implicit defintions to be printed in the Instruction Printer. Added the ability for RawFrm instruction to print implict uses and defintions with correct comma output. This required adjustment to some methods so that a leading comma would or would not be printed. llvm-svn: 12782	2004-04-08 20:31:47 +00:00
Chris Lattner	ba33ae5831	Fix incorrect encoding of some ADC and SBB instuctions llvm-svn: 12710	2004-04-06 19:20:32 +00:00
Chris Lattner	9366f0347d	The sbb instructions really ARE sbb's, not adc's llvm-svn: 12682	2004-04-06 02:02:11 +00:00
Alkis Evlogimenos	fe66caa9a0	Fix type in comments llvm-svn: 12611	2004-04-02 16:02:50 +00:00
Alkis Evlogimenos	d186ed02e4	Add more ADC and SBB variants llvm-svn: 12607	2004-04-02 07:11:10 +00:00
Chris Lattner	9fe1646804	Add FP conditional move instructions, which annoyingly have special properties that require the asmwriter to be extended (printing implicit uses before the explicit operands) llvm-svn: 12574	2004-03-31 22:02:13 +00:00
Chris Lattner	1563983d81	Fix some serious bugs in the cmov descriptions, which didn't cause a problem because we never generated them Make indentation a bit more consistent llvm-svn: 12549	2004-03-30 20:18:02 +00:00
Alkis Evlogimenos	804dc659b6	Add LAHF instruction llvm-svn: 12424	2004-03-15 17:20:14 +00:00
Alkis Evlogimenos	9884bda541	Add support for a wider range of CMOV instructions. llvm-svn: 12336	2004-03-12 17:59:56 +00:00
Alkis Evlogimenos	8a3f2f3600	Differentiate between extended precision floats (80-bit) and double precision floats (64-bit) llvm-svn: 12254	2004-03-09 03:37:54 +00:00
Alkis Evlogimenos	d6f62ba55b	Add memory operand version of conditional move. llvm-svn: 12190	2004-03-07 03:19:11 +00:00
Alkis Evlogimenos	0824ffc697	Use correct template for SHLD and SHRD instructions so that the memory operand size is correctly specified. llvm-svn: 11997	2004-02-29 09:19:40 +00:00
Alkis Evlogimenos	ea81b79a97	A big X86 instruction rename. The instructions are renamed to make their names more decriptive. A name consists of the base name, a default operand size followed by a character per operand with an optional special size. For example: ADD8rr -> add, 8-bit register, 8-bit register IMUL16rmi -> imul, 16-bit register, 16-bit memory, 16-bit immediate IMUL16rmi8 -> imul, 16-bit register, 16-bit memory, 8-bit immediate MOVSX32rm16 -> movsx, 32-bit register, 16-bit memory llvm-svn: 11995	2004-02-29 08:50:03 +00:00
Alkis Evlogimenos	876f6f96d0	Use correct template for ADC instruction with memory operands. llvm-svn: 11974	2004-02-29 02:18:17 +00:00
Alkis Evlogimenos	fa63580517	SHLD and SHRD take 32-bit operands but an 8-bit immediate. Rename them to denote this fact. llvm-svn: 11972	2004-02-28 23:46:44 +00:00
Alkis Evlogimenos	4953ae085a	Floating point loads/stores act on memory operands. Rename them to denote this fact. llvm-svn: 11971	2004-02-28 23:42:35 +00:00
Alkis Evlogimenos	c6948fa762	Rename instruction templates to be easier to the human eye to parse. The name is now I (operand size)*. For example: Im32 -> instruction with 32-bit memory operands. Im16i8 -> instruction with 16-bit memory operands and 8 bit immediate operands. llvm-svn: 11970	2004-02-28 23:09:03 +00:00
Alkis Evlogimenos	194939086d	Each instruction now has both an ImmType and a MemType. This describes the size of the immediate and the memory operand on instructions that use them. This resolves problems with instructions that take both a memory and an immediate operand but their sizes differ (i.e. ADDmi32b). llvm-svn: 11967	2004-02-28 22:02:05 +00:00
Alkis Evlogimenos	2debead504	Do not generate instructions with mismatched memory/immediate sized operands. The X86 backend doesn't handle them properly right now. llvm-svn: 11944	2004-02-28 06:01:43 +00:00
Alkis Evlogimenos	24b3d0bdae	Further comment updates. llvm-svn: 11933	2004-02-28 03:20:31 +00:00
Alkis Evlogimenos	f87966b8c4	Update comments. llvm-svn: 11932	2004-02-28 03:12:31 +00:00
Alkis Evlogimenos	2dbc79df84	My previous commit broke the jit. The shift instructions always take an 8-bit immediate. So mark the shifts that take immediates as taking an 8-bit argument. The rest with the implicit use of CL are marked appropriately. A bug still exists: def SHLDmri32 : I2A8 <"shld", 0xA4, MRMDestMem>, TB; // [mem32] <<= [mem32],R32 imm8 The immediate in the above instruction is 8-bit but the memory reference is 32-bit. The printer prints this as an 8-bit reference which confuses the assembler. Same with SHRDmri32. llvm-svn: 11931	2004-02-28 02:56:26 +00:00
Alkis Evlogimenos	b10b04c5ec	Fix argument size for SHL, SHR, SAR, SHLD and SHRD families of instructions. llvm-svn: 11923	2004-02-27 19:46:30 +00:00
Alkis Evlogimenos	75ed0f67bf	Fix encoding of ADD and SUB family of instructions. Also rearrange them so that they are consistent with AND, XOR, etc... llvm-svn: 11922	2004-02-27 18:57:00 +00:00
Alkis Evlogimenos	58270fcf1f	Rename MRMS[0-7]{r,m} to MRM[0-7]{r,m}. llvm-svn: 11921	2004-02-27 18:55:12 +00:00
Alkis Evlogimenos	9476b7cbe5	Add memory operand folding support for the SETcc family of instructions. llvm-svn: 11907	2004-02-27 16:13:37 +00:00
Alkis Evlogimenos	8d99063b38	Add memory operand folding support for SHLD and SHRD instructions. llvm-svn: 11905	2004-02-27 15:03:18 +00:00
Alkis Evlogimenos	3537404299	Add memory operand folding support for SHL, SHR and SAR, SHLD instructions. llvm-svn: 11903	2004-02-27 09:28:43 +00:00
Alkis Evlogimenos	f020dfb43c	Rename SHL, SHR, SAR, SHLD and SHLR instructions to make them consistent with the rest and also pepare for the addition of their memory operand variants. llvm-svn: 11902	2004-02-27 06:57:05 +00:00
Chris Lattner	378157c3d7	Add a new cmove instruction llvm-svn: 11722	2004-02-23 01:16:05 +00:00
Alkis Evlogimenos	6401b22fc2	Fix argument size for MOVSX and MOVZX instructions. llvm-svn: 11576	2004-02-18 16:20:40 +00:00
Alkis Evlogimenos	47ea17a852	These store to memory too. llvm-svn: 11558	2004-02-17 17:53:48 +00:00
Chris Lattner	49794be442	These store to memory, not read from it. llvm-svn: 11556	2004-02-17 17:46:50 +00:00
Alkis Evlogimenos	546513ccfd	Add TEST and XCHG memory operand support. llvm-svn: 11550	2004-02-17 15:48:42 +00:00
Alkis Evlogimenos	f08064b714	Add OR and XOR memory operand support. llvm-svn: 11549	2004-02-17 15:33:14 +00:00
Alkis Evlogimenos	e5585328d8	Add memory operand folding support for MUL, DIV, IDIV, NEG, NOT, MOVSX, and MOVZX. llvm-svn: 11546	2004-02-17 09:14:23 +00:00
Alkis Evlogimenos	574c7c9ce9	Add CMP{rm,mr,mi}{8,16,32}, INCm{8,16,32} and DECm{8,16,32} instructions. llvm-svn: 11544	2004-02-17 08:49:00 +00:00
Alkis Evlogimenos	d5ce14ddd1	Add SUB{rm,mr,mi}{8,16,32} instructions. llvm-svn: 11543	2004-02-17 08:17:40 +00:00
Alkis Evlogimenos	b591e5de31	Add support for ADC{rm.mr}32 and SBB{rm,mr}32. llvm-svn: 11540	2004-02-17 08:06:31 +00:00
Chris Lattner	3abcdf3b90	Fix the mneumonics for the mov instructions to have the source and destination order in the correct sense!! Arg! llvm-svn: 11530	2004-02-17 06:28:19 +00:00
Chris Lattner	ebd90733b0	Fix the last crimes against nature that used the 'ir' ordering to use the 'ri' ordering instead... no it's not possible to store a register into an immediate! llvm-svn: 11529	2004-02-17 06:24:02 +00:00
Chris Lattner	288e043e1b	Rename MOVi[mr] instructions to MOV[rm]i llvm-svn: 11527	2004-02-17 06:16:44 +00:00
Chris Lattner	c9586411cf	Add mem forms of AND instructions llvm-svn: 11521	2004-02-17 05:25:50 +00:00
Chris Lattner	818bcec247	Rename the IMULri* instructions to IMULrri, as they are actually three address instructions. Add forms of these instructions that read from memory llvm-svn: 11518	2004-02-17 04:26:43 +00:00
Alkis Evlogimenos	f6ce2e313a	Add two more variants of add. Update comments. llvm-svn: 11510	2004-02-16 23:48:42 +00:00
Chris Lattner	544c9781db	Add some ADD instructions that take memory operands for Alkis llvm-svn: 11502	2004-02-16 18:19:31 +00:00
Chris Lattner	2e4acc0a73	Add support for the 'pop' instruction llvm-svn: 11451	2004-02-14 21:06:02 +00:00
Chris Lattner	d6a39eaa70	Urg, right. These need an input value... llvm-svn: 11443	2004-02-14 04:47:23 +00:00
Chris Lattner	8bed37595d	add 'rep stos[bwd]' instructions llvm-svn: 11441	2004-02-14 04:45:37 +00:00
Chris Lattner	8dc99feeaf	Add support for the rep movs[bwd] instructions, and emit them when code generating the llvm.memcpy intrinsic. llvm-svn: 11351	2004-02-12 17:53:22 +00:00
Alkis Evlogimenos	dbf4b42fde	IMULri* instructions do not require their first two registers operands to be the same (IOW they are not two address instructions). llvm-svn: 11117	2004-02-04 17:21:04 +00:00
Chris Lattner	3c8c72c54f	Add the ftst instruction llvm-svn: 11095	2004-02-03 07:27:50 +00:00
Chris Lattner	63b61e8739	No need to declare implicit uses/defs of ST0 llvm-svn: 11081	2004-02-02 19:57:45 +00:00
Chris Lattner	30d26ac561	Generate the fchs instruction to negate a floating point number llvm-svn: 11078	2004-02-02 19:31:38 +00:00
Alkis Evlogimenos	68cff6bf4d	Remove floating point killer pass. This is now implemented in the instruction selector by adding a new pseudo-instruction FP_REG_KILL. This instruction implicitly defines all x86 fp registers and is a terminator so that passes which add machine code at the end of basic blocks (like phi elimination) do not add instructions between it and the branch or return instruction. llvm-svn: 10562	2003-12-20 16:22:59 +00:00
John Criswell	29265fe981	Added LLVM copyright header. llvm-svn: 9321	2003-10-21 15:17:13 +00:00
Chris Lattner	6acb1bedb1	Emit x86 instructions for: A = B op C, where A and B are 16-bit registers, C is a constant which can be sign-extended from 8 bits without value loss, and op is one of: add, sub, imul, and, or, xor. This allows the JIT to emit the one byte version of the constant instead of the two or 4 byte version. Because these instructions are very common, this can save a LOT of code space. For example, I sampled two benchmarks, 176.gcc and 254.gap. BM Old New Reduction 176.gcc 2673621 2548962 4.89% 254.gap 498261 475104 4.87% Note that while the percentage is not spectacular, this did eliminate 124.6 _KILOBYTES_ of codespace from gcc. Not bad. Note that this doesn't effect the llc version at all, because the assembler already does this optimization. llvm-svn: 9284	2003-10-20 05:53:31 +00:00
Chris Lattner	97e1b55723	* Rename X86::IMULr16 -> X86::IMULrr16 * Implement R1 = R2 * C where R1 and R2 are 32 or 16 bits. This avoids an extra copy into a register, reducing register pressure. llvm-svn: 9278	2003-10-20 03:42:58 +00:00
Chris Lattner	55a8ef0cc8	Add some new instructions. Wheee llvm-svn: 9266	2003-10-19 19:25:35 +00:00
Chris Lattner	7235d86507	Add support for unconditional branches and for emitting JE instructions llvm-svn: 7872	2003-08-15 04:50:49 +00:00
Chris Lattner	7606fa0d41	Add basic support for 16 and 32 bit function arguments! llvm-svn: 7755	2003-08-11 21:30:00 +00:00
Chris Lattner	2923637f63	Add (ret int) expander so that we can at least write testcases llvm-svn: 7730	2003-08-11 15:48:00 +00:00
Chris Lattner	7fed97d00a	Add patterns for multiply, and, or, and xor llvm-svn: 7725	2003-08-11 15:23:25 +00:00
Chris Lattner	19d25b3c41	add a pattern for RET, immediates no longer need to be explicitly typed llvm-svn: 7635	2003-08-06 15:31:35 +00:00
Chris Lattner	7c257321c7	This is the real fix for the previous register allocator problem. Physical registers should not float around. llvm-svn: 7587	2003-08-05 00:48:47 +00:00
Chris Lattner	148747e162	Add patterns for (mov R, R) (mov R, I) and subtracts. The moves are to enable testing, the subtracts are because I was in the neighborhood. llvm-svn: 7581	2003-08-04 21:18:19 +00:00
Chris Lattner	44cdcf013f	Change comments into something that TableGen can read! llvm-svn: 7580	2003-08-04 21:08:29 +00:00
Chris Lattner	2551080937	transition to using let instead of set llvm-svn: 7564	2003-08-04 04:59:56 +00:00
Chris Lattner	59a4a91703	Add new TableGen instruction definitions llvm-svn: 7537	2003-08-03 21:54:21 +00:00

... 9 10 11 12 13 ...

1004 Commits