llvm-project

Commit Graph

Author	SHA1	Message	Date
Anton Korobeynikov	5643cb7ecc	whitespace cleanup llvm-svn: 52859	2008-06-28 11:08:09 +00:00
Anton Korobeynikov	4e9dfe8391	Make intel asmprinter child of generic asmprinter, not x86 shared asm printer. This leads to some code duplication, which will be resolved later. llvm-svn: 52858	2008-06-28 11:07:54 +00:00
Anton Korobeynikov	bc7cce6b74	Cleanup llvm-svn: 52857	2008-06-28 11:07:35 +00:00
Anton Korobeynikov	44e99f47ad	Whitespace cleanup llvm-svn: 52856	2008-06-28 11:07:18 +00:00
Anton Korobeynikov	266f1cc1e4	Use StringSet instead of std::set<std::string> llvm-svn: 52836	2008-06-27 21:22:49 +00:00
Anton Korobeynikov	c1e80a759f	Provide correct encoding for PPC LWARX instructions. Patch by Gary Benson! llvm-svn: 52828	2008-06-27 16:10:20 +00:00
Owen Anderson	4f024862f6	Cache subregister relationships in a set in TargetRegisterInfo to allow faster lookups. This speeds up LiveVariables from 0.6279s to 0.6165s on kimwitu++. llvm-svn: 52818	2008-06-27 06:56:04 +00:00
Matthijs Kooijman	f61fd54237	Make LLVM compile on DragonFly BSD (PR2499). Patch by Hasso Tepper! llvm-svn: 52781	2008-06-26 10:36:58 +00:00
Dale Johannesen	a2de8eab61	Fixes the last x86-64 test failure in compat.exp: <16 x float> is 64-byte aligned (for some reason), which gets us into the stack realignment code. The computation changing FP-relative offsets to SP-relative was broken, assiging a spill temp to a location also used for parameter passing. This fixes it by rounding up the stack frame to a multiple of the largest alignment (I concluded it wasn't fixable without doing this, but I'm not very sure.) llvm-svn: 52750	2008-06-26 01:51:13 +00:00
Evan Cheng	3fc2372d3a	- Fix a x86 vector isel bug: illegal transformation of a vector_shuffle into a shift. - Add a readme entry for a missing vector_shuffle optimization that results in awful codegen. llvm-svn: 52740	2008-06-25 20:52:59 +00:00
Chris Lattner	d3406fc2a7	Switch the PPC backend and target-independent JIT to use the libsystem InvalidateInstructionCache method instead of calling through a hook on the JIT. This is a host feature, not a target feature. llvm-svn: 52734	2008-06-25 17:18:44 +00:00
Dan Gohman	906b630f83	SimpleInstructionSelector is here no more. llvm-svn: 52725	2008-06-25 16:38:59 +00:00
Dan Gohman	aa01afd47c	Remove the OrigVT member from AtomicSDNode, as it is redundant with the base SDNode's VTList. llvm-svn: 52722	2008-06-25 16:07:49 +00:00
Mon P Wang	6a490371c9	Added MemOperands to Atomic operations since Atomics touches memory. Added abstract class MemSDNode for any Node that have an associated MemOperand Changed atomic.lcs => atomic.cmp.swap, atomic.las => atomic.load.add, and atomic.lss => atomic.load.sub llvm-svn: 52706	2008-06-25 08:15:39 +00:00
Evan Cheng	73db52ebf8	Enable two-address remat by default. llvm-svn: 52701	2008-06-25 01:16:38 +00:00
Dale Johannesen	e5f4ffbdf1	Add v2f32 (MMX) type to X86. Support is primitive: load,store,call,return,bitcast. This is enough to make call and return work. llvm-svn: 52691	2008-06-24 22:01:44 +00:00
Evan Cheng	3f2ceac565	If it's determined safe, remat MOV32r0 (i.e. xor r, r) and others as it is instead of using the longer MOV32ri instruction. llvm-svn: 52670	2008-06-24 07:10:51 +00:00
Dan Gohman	02a2aaf2e7	Add a note about a potential PIC optimization. llvm-svn: 52663	2008-06-24 00:53:07 +00:00
Dan Gohman	76600aa35c	Fixes for being compiled PIC on Linux. This isn't the most general solution possible, but it's a fairly simple one. Based on a patch from the OpenGTL project! llvm-svn: 52662	2008-06-24 00:50:01 +00:00
Dan Gohman	1f2b2a4abe	Remove unnecessary #includes. llvm-svn: 52613	2008-06-22 19:21:26 +00:00
Dan Gohman	55083d5dd3	Use MachineBasicBlock::transferSuccessors. llvm-svn: 52594	2008-06-21 20:21:19 +00:00
Eli Friedman	8d66e98c92	Fix a bug with <8 x i16> shuffle lowering on X86 where parts of the shuffle could be skipped. The check is invalid because the loop index i doesn't correspond to the element actually inserted. The correct check is already done a few lines earlier, for whether the element is already in the right spot, so this shouldn't have any effect on the codegen for code that was already correct. llvm-svn: 52486	2008-06-19 06:09:51 +00:00
Evan Cheng	2dbba985d5	Unneeded include's. llvm-svn: 52478	2008-06-19 01:21:02 +00:00
Evan Cheng	1d260dfa3b	XOR32rr, etc. are not AsCheapAsMove, but MOV32ri, etc. are. llvm-svn: 52454	2008-06-18 08:13:07 +00:00
Evan Cheng	f6a1466829	Unbreak DECLARE isel in pic mode. llvm-svn: 52439	2008-06-18 02:48:27 +00:00
Anton Korobeynikov	f51ed6a161	Add one more 'magic' define :) llvm-svn: 52420	2008-06-17 17:57:43 +00:00
Anton Korobeynikov	8e5d9214ba	Unbreak non-PPC builds llvm-svn: 52419	2008-06-17 17:38:31 +00:00
Anton Korobeynikov	7d7dcd52db	Provide generic hooks for icache invalidation. Add PPC implementation. Patch by Gary Benson! llvm-svn: 52418	2008-06-17 17:30:05 +00:00
Evan Cheng	e47ca0940f	Rather than avoiding to wrap ISD::DECLARE GV operand in X86ISD::Wrapper, simply handle it at dagisel time with x86 specific isel code. llvm-svn: 52377	2008-06-17 02:01:22 +00:00
Evan Cheng	a5e30076a0	Horizontal-add instructions are not commutative. llvm-svn: 52363	2008-06-16 21:16:24 +00:00
Evan Cheng	b90be27f8c	mpsadbw is commutable. llvm-svn: 52352	2008-06-16 20:25:59 +00:00
Chris Lattner	8b69e8a647	Add support for icache invalidation on non-darwin ppc systems. Patch by Gary Benson! llvm-svn: 52332	2008-06-16 17:04:06 +00:00
Evan Cheng	03553bb59a	Add option to commuteInstruction() which forces it to create a new (commuted) instruction. llvm-svn: 52308	2008-06-16 07:33:11 +00:00
Chris Lattner	91f4a0ff58	Switch from generating the int128 typedefs based on targetdata to generating them based on the end-compiler's capabilities. This fixes PR2453 llvm-svn: 52297	2008-06-16 04:25:29 +00:00
Andrew Lenharth	f88d50bfcc	add missing atomic intrinsic from gcc llvm-svn: 52270	2008-06-14 05:48:15 +00:00
Duncan Sands	8651e9c584	Disable some DAG combiner optimizations that may be wrong for volatile loads and stores. In fact this is almost all of them! There are three types of problems: (1) it is wrong to change the width of a volatile memory access. These may be used to do memory mapped i/o, in which case a load can have an effect even if the result is not used. Consider loading an i32 but only using the lower 8 bits. It is wrong to change this into a load of an i8, because you are no longer tickling the other three bytes. It is also unwise to make a load/store wider. For example, changing an i16 load into an i32 load is wrong no matter how aligned things are, since the fact of loading an additional 2 bytes can have i/o side-effects. (2) it is wrong to change the number of volatile load/stores: they may be counted by the hardware. (3) it is wrong to change a volatile load/store that requires one memory access into one that requires several. For example on x86-32, you can store a double in one processor operation, but to store an i64 requires two (two i32 stores). In a multi-threaded program you may want to bitcast an i64 to a double and store as a double because that will occur atomically, and be indivisible to other threads. So it would be wrong to convert the store-of-double into a store of an i64, because this will become two i32 stores - no longer atomic. My policy here is to say that the number of processor operations for an illegal operation is undefined. So it is alright to change a store of an i64 (requires at least two stores; but could be validly lowered to memcpy for example) into a store of double (one processor op). In short, if the new store is legal and has the same size then I say that the transform is ok. It would also be possible to say that transforms are always ok if before they were illegal, whether after they are illegal or not, but that's more awkward to do and I doubt it buys us anything much. However this exposed an interesting thing - on x86-32 a store of i64 is considered legal! That is because operations are marked legal by default, regardless of whether the type is legal or not. In some ways this is clever: before type legalization this means that operations on illegal types are considered legal; after type legalization there are no illegal types so now operations are only legal if they really are. But I consider this to be too cunning for mere mortals. Better to do things explicitly by testing AfterLegalize. So I have changed things so that operations with illegal types are considered illegal - indeed they can never map to a machine operation. However this means that the DAG combiner is more conservative because before it was "accidentally" performing transforms where the type was illegal because the operation was nonetheless marked legal. So in a few such places I added a check on AfterLegalize, which I suppose was actually just forgotten before. This causes the DAG combiner to do slightly more than it used to, which resulted in the X86 backend blowing up because it got a slightly surprising node it wasn't expecting, so I tweaked it. llvm-svn: 52254	2008-06-13 19:07:40 +00:00
Anton Korobeynikov	729c4e95e2	Properly lower DYNAMIC_STACKALLOC - bracket all black magic with CALLSEQ_BEGIN & CALLSEQ_END. llvm-svn: 52225	2008-06-11 20:16:42 +00:00
Dan Gohman	6e384fc28e	CPPBackend support for extractvalue and insertvalue. llvm-svn: 52147	2008-06-09 14:12:10 +00:00
Dan Gohman	7be3fc7c97	Abort on an unrecognized opcode. llvm-svn: 52146	2008-06-09 14:09:13 +00:00
Dan Gohman	62f63f4320	Update the CPP backend for the ConstantFP::get API change. llvm-svn: 52144	2008-06-09 14:08:11 +00:00
Rafael Espindola	29479df2ac	add support for PIC on linux x86-64 llvm-svn: 52139	2008-06-09 09:52:31 +00:00
Duncan Sands	11dd424539	Remove comparison methods for MVT. The main cause of apint codegen failure is the DAG combiner doing the wrong thing because it was comparing MVT's using < rather than comparing the number of bits. Removing the < method makes this mistake impossible to commit. Instead, add helper methods for comparing bits and use them. llvm-svn: 52098	2008-06-08 20:54:56 +00:00
Bruno Cardoso Lopes	041604ba9f	Added FP instruction formats. llvm-svn: 52086	2008-06-08 01:39:36 +00:00
Bill Wendling	b7272db9f6	Temporarily reverting r52056. It's causing PPC to fail to bootstrap. llvm-svn: 52085	2008-06-08 01:36:24 +00:00
Bruno Cardoso Lopes	f09c372191	Added support for FP Registers llvm-svn: 52079	2008-06-07 21:32:41 +00:00
Evan Cheng	1a0835017a	Revert r52046. It broke cbe on x86 / Mac OS X. llvm-svn: 52071	2008-06-07 07:50:29 +00:00
Evan Cheng	0b8f2c53a2	Typo. llvm-svn: 52062	2008-06-06 21:00:10 +00:00
Evan Cheng	9bf9110d93	PPC preferred loop alignment is 16. llvm-svn: 52056	2008-06-06 19:50:46 +00:00
Anton Korobeynikov	f69bc3df9b	Handle assembler identifiers specially in CBE. This fixes PR2418. llvm-svn: 52046	2008-06-06 16:08:26 +00:00
Duncan Sands	13237ac3b9	Wrap MVT::ValueType in a struct to get type safety and better control the abstraction. Rename the type to MVT. To update out-of-tree patches, the main thing to do is to rename MVT::ValueType to MVT, and rewrite expressions like MVT::getSizeInBits(VT) in the form VT.getSizeInBits(). Use VT.getSimpleVT() to extract a MVT::SimpleValueType for use in switch statements (you will get an assert failure if VT is an extended value type - these shouldn't exist after type legalization). This results in a small speedup of codegen and no new testsuite failures (x86-64 linux). llvm-svn: 52044	2008-06-06 12:08:01 +00:00
Bruno Cardoso Lopes	1a6e0d613f	Added custom isel for MUL, SDIVREM, UDIVREM, SMUL_LOHI and UMUL_LOHI nodes MUL is not anymore directly matched because its a pseudoinstruction. LogicI class fixed to zero-extend immediates. llvm-svn: 52036	2008-06-06 06:37:31 +00:00
Bruno Cardoso Lopes	4eed3afda0	Added custom SELECT_CC lowering Added special isel for ADDE,SUBE and new patterns to match SUBC,ADDC llvm-svn: 52031	2008-06-06 00:58:26 +00:00
Evan Cheng	9e76c047d1	Don't break strict aliasing. llvm-svn: 52026	2008-06-05 22:59:21 +00:00
Chris Lattner	c596ec04e1	Rewrite a bunch of the CBE's inline asm code, giving it the ability to handle indirect input operands. This fixes PR2407. llvm-svn: 51952	2008-06-04 18:03:28 +00:00
Duncan Sands	fc3c489b52	Change packed struct layout so that field sizes are the same as in unpacked structs, only field positions differ. This only matters for structs containing x86 long double or an apint; it may cause backwards compatibility problems if someone has bitcode containing a packed struct with a field of one of those types. The issue is that only 10 bytes are needed to hold an x86 long double: the store size is 10 bytes, but the ABI size is 12 or 16 bytes (linux/ darwin) which comes from rounding the store size up by the alignment. Because it seemed silly not to pack an x86 long double into 10 bytes in a packed struct, this is what was done. I now think this was a mistake. Reserving the ABI size for an x86 long double field even in a packed struct makes things more uniform: the ABI size is now always used when reserving space for a type. This means that developers are less likely to make mistakes. It also makes life easier for the CBE which otherwise could not represent all LLVM packed structs (PR2402). Front-end people might need to adjust the way they create LLVM structs - see following change to llvm-gcc. llvm-svn: 51928	2008-06-04 08:21:45 +00:00
Bruno Cardoso Lopes	326a03732e	Some Mips minor fixes Added support for mips little endian arch => mipsel llvm-svn: 51923	2008-06-04 01:45:25 +00:00
Dale Johannesen	355b74acc2	Add StringConstantPrefix to control what the assembler names of string constants look like. llvm-svn: 51909	2008-06-03 18:09:06 +00:00
Scott Michel	d831cc49e5	Add necessary 64-bit support so that gcc frontend compiles (mostly). Current issue is operand promotion for setcc/select... but looks like the fundamental stuff is implemented for CellSPU. llvm-svn: 51884	2008-06-02 22:18:03 +00:00
Dan Gohman	4e8a512f80	Implement CBE support for first-class structs and array values, and insertvalue and extractvalue instructions. First-class array values are not trivial because C doesn't support them. The approach I took here is to wrap all arrays in structs. Feedback is welcome. The 2007-01-15-NamedArrayType.ll test needed to be modified because it has a "not grep" for a string that now exists, because array types now have associated struct types, and those struct types have names. llvm-svn: 51881	2008-06-02 21:30:49 +00:00
Rafael Espindola	d04cd22ff4	Don't use the GOT for symbols that are not externally visible. llvm-svn: 51865	2008-06-02 07:52:43 +00:00
Bruno Cardoso Lopes	bdedc148a8	Fixed flag issue that was generating infinite loop while in list scheduling. llvm-svn: 51833	2008-06-01 03:49:39 +00:00
Nick Lewycky	035fe6f716	Peer through sext/zext when looking for not(cmp). llvm-svn: 51819	2008-05-31 19:01:33 +00:00
Nick Lewycky	69a51cbd6d	Yay us! Every one of these examples turns into icmp/zext/ret. llvm-svn: 51818	2008-05-31 18:20:26 +00:00
Chris Lattner	666d664595	Fix the CBE's handling of instructions whose result is an i1. Previously, we did not truncate the value down to i1 with (x&1). This caused a problem when the computation of x was nontrivial, for example, "add i1 1, 1" would return 2 instead of 0. This makes the testcase compile into: ... llvm_cbe_t = (((llvm_cbe_r == 0u) + (llvm_cbe_r == 0u))&1); llvm_cbe_u = (((unsigned int )(bool )llvm_cbe_t)); ... instead of: ... llvm_cbe_t = ((llvm_cbe_r == 0u) + (llvm_cbe_r == 0u)); llvm_cbe_u = (((unsigned int )(bool )llvm_cbe_t)); ... This fixes a miscompilation of mediabench/adpcm/rawdaudio/rawdaudio and 403.gcc with the CBE, regressions from LLVM 2.2. Tanya, please pull this into the release branch. llvm-svn: 51813	2008-05-31 09:23:55 +00:00
Dan Gohman	bd3390c73a	Teach the DAGISelEmitter to not compute the variable_ops operand index for the input pattern in terms of the output pattern. Instead keep track of how many fixed operands the input pattern actually has, and have the input matching code pass the output-emitting function that index value. This simplifies the code, disentangles variables_ops from the support for predication operations, and makes variable_ops more robust. llvm-svn: 51808	2008-05-31 02:11:25 +00:00
Evan Cheng	864541aa7b	Fix indentation. llvm-svn: 51792	2008-05-30 22:39:18 +00:00
Bill Wendling	b0aa651259	Add the "AsCheapAsAMove" flag to some 64-bit xor instructions. llvm-svn: 51761	2008-05-30 06:47:04 +00:00
Dan Gohman	96af4ddb62	Add patterns for CALL32m and CALL64m. They aren't matched in most cases due to an isel deficiency already noted in lib/Target/X86/README.txt, but they can be matched in this fold-call.ll testcase, for example. This is interesting mainly because it exposes a tricky tblgen bug; tblgen was incorrectly computing the starting index for variable_ops in the case of a complex pattern. llvm-svn: 51706	2008-05-29 21:50:34 +00:00
Bill Wendling	33e396d041	Remove more iostream header includes. Needed to implement a "FlushStream" function to flush a specified std::ostream. llvm-svn: 51705	2008-05-29 21:46:33 +00:00
Dan Gohman	6e582c449f	Fix a tblgen problem handling variable_ops in tblgen instruction definitions. This adds a new construct, "discard", for indicating that a named node in the input matching pattern is to be discarded, instead of corresponding to a node in the output pattern. This allows tblgen to know where the arguments for the varaible_ops are supposed to begin. This fixes "rdar://5791600", whatever that is ;-). llvm-svn: 51699	2008-05-29 19:57:41 +00:00
Dan Gohman	714663ab94	Expand small memmovs using inline code. Set the X86 threshold for expanding memmove to a more plausible value, now that it's actually being used. llvm-svn: 51696	2008-05-29 19:42:22 +00:00
Evan Cheng	5e28227dbd	Implement vector shift up / down and insert zero with ps{rl}lq / ps{rl}ldq. llvm-svn: 51667	2008-05-29 08:22:04 +00:00
Bill Wendling	0252be178d	XOR?RI instructions aren't as cheap as moves. llvm-svn: 51664	2008-05-29 03:46:36 +00:00
Bill Wendling	7a1a8eb6e2	Implement "AsCheapAsAMove" for some obviously cheap instructions: xor and the like. llvm-svn: 51662	2008-05-29 01:02:09 +00:00
Bill Wendling	3f6bb2713e	Add a flag to indicate that an instruction is as cheap (or cheaper) than a move instruction to execute. This can be used for transformations (like two-address conversion) to remat an instruction instead of generating a "move" instruction. The idea is to decrease the live ranges and register pressure and all that jazz. llvm-svn: 51660	2008-05-28 22:54:52 +00:00
Nate Begeman	e993b80ef5	Update some comments noticed in a recent checkin llvm-svn: 51644	2008-05-28 16:31:36 +00:00
Chris Lattner	f8910ab6db	Add chain inputs for loads. llvm-svn: 51635	2008-05-28 04:25:57 +00:00
Chris Lattner	633cd5949b	Fix CodeGen/Generic/2005-10-21-longlonggtu.ll on ia64. llvm-svn: 51634	2008-05-28 04:14:30 +00:00
Chris Lattner	724895625b	loads should get chains. THis helps but does not solve CodeGen/Generic/2003-05-27-phifcmpd.ll on ia64. llvm-svn: 51633	2008-05-28 04:06:52 +00:00
Chris Lattner	d2c3e86cc3	Fix 2006-04-28-Sign-extend-bool.ll for ia64. llvm-svn: 51632	2008-05-28 04:00:06 +00:00
Chris Lattner	c2fb8d7e2b	reindent. llvm-svn: 51631	2008-05-28 03:59:32 +00:00
Dan Gohman	68bddb8966	Fix the encoding for two more "rm" instructions that were using MRMSrcReg. llvm-svn: 51630	2008-05-28 01:50:19 +00:00
Mon P Wang	5e3faf2343	Fixed X86 encoding error CVTPS2PD and CVTPD2PS when the source operand is a memory location llvm-svn: 51626	2008-05-28 00:42:27 +00:00
Nate Begeman	f1e18c7c44	Don't attempt to create VZEXT_LOAD out of an extload. This an issue where the code generator would do something like this: f64 = load f32 <anyext>, f32mem v2f64 = insertelt undef, %0, 0 v2f64 = insertelt %1, 0.0, 1 into v2f64 = vzext_load f32mem which on x86 is movsd, when you really wanted a cvtss2sd/movsd pair. llvm-svn: 51624	2008-05-28 00:24:25 +00:00
Duncan Sands	698348dfac	Fix some constructs that gcc-4.4 warns about. llvm-svn: 51591	2008-05-27 11:50:51 +00:00
Chris Lattner	305fcd493f	Add FreeBSD/PPC support, patch by Marcel Moolenaar! llvm-svn: 51538	2008-05-24 04:58:48 +00:00
Evan Cheng	91a2e56b06	Eliminate x86.sse2.punpckh.qdq and x86.sse2.punpckl.qdq. llvm-svn: 51533	2008-05-24 02:56:30 +00:00
Evan Cheng	2146270c9b	Eliminate x86.sse2.movs.d, x86.sse2.shuf.pd, x86.sse2.unpckh.pd, and x86.sse2.unpckl.pd intrinsics. These will be lowered into shuffles. llvm-svn: 51531	2008-05-24 02:14:05 +00:00
Duncan Sands	91dea27d4c	Tweak how ConstantFP80Ty constants are output so that gcc doesn't warn about them. llvm-svn: 51529	2008-05-24 01:00:52 +00:00
Dale Johannesen	18cc4d3ea4	Put initialized const weak objects into correct sections on ppc32 darwin. g++.dg/abi/key2.C llvm-svn: 51527	2008-05-24 00:10:20 +00:00
Evan Cheng	8647b875cc	This is done. llvm-svn: 51526	2008-05-24 00:10:13 +00:00
Evan Cheng	6f8cfac755	Remove x86.sse2.loadh.pd and x86.sse2.loadl.pd. These will be lowered into load and shuffle instructions. llvm-svn: 51522	2008-05-24 00:07:29 +00:00
Dale Johannesen	002e554ce9	Add a missed CommonLinkage check. llvm-svn: 51503	2008-05-23 21:33:27 +00:00
Evan Cheng	04d24edcbb	Use movlps / movhps to modify low / high half of 16-byet memory location. llvm-svn: 51501	2008-05-23 21:23:16 +00:00
Dan Gohman	66eea1b9b3	Elaborate on the entry on integer vector multiplication by constants. llvm-svn: 51491	2008-05-23 18:05:39 +00:00
Evan Cheng	01b7fffb29	Fix a duplicated pattern. llvm-svn: 51490	2008-05-23 18:00:18 +00:00
Dan Gohman	3388d022ac	Use PMULDQ for v2i64 multiplies when SSE4.1 is available. And add load-folding table entries for PMULDQ and PMULLD. llvm-svn: 51489	2008-05-23 17:49:40 +00:00
Evan Cheng	d25cb8e0d2	New entry. llvm-svn: 51487	2008-05-23 17:28:11 +00:00
Dan Gohman	0f731017dd	Fix another isFirstClassType that now needs to be isSingleValueType. This fixes recent CBE regressions. llvm-svn: 51483	2008-05-23 16:57:00 +00:00
Chris Lattner	3546c2b4e4	we compile multiply-by-constant into horrible code. Doesn't sse4 have some instruction for doing this? llvm-svn: 51473	2008-05-23 04:29:53 +00:00
Evan Cheng	f3be7a7ea7	Bug: rcpps can only folds a load if the address is 16-byte aligned. Fixed many 'ps' load folding patterns in X86InstrSSE.td which are missing the proper alignment checks. Also fixed some 80 col. violations. llvm-svn: 51462	2008-05-23 00:37:07 +00:00
Dale Johannesen	6b4dcc1c14	Put const weak stuff in appropriate section on Darwin. g++.dg/abi/key2.C llvm-svn: 51458	2008-05-23 00:16:59 +00:00
Evan Cheng	97b020e61e	X86CodeEmitter should not set PIC style to None at initialization time. This will break codegen if relocation model is changed to PIC_ later. llvm-svn: 51455	2008-05-22 23:55:24 +00:00
Evan Cheng	53963b775e	Add missing patterns. llvm-svn: 51435	2008-05-22 18:56:56 +00:00
Chris Lattner	3d1797ccaa	fix an off-by-one error in my previous patch, don't treat the callee as a incoming arg. llvm-svn: 51422	2008-05-22 06:29:38 +00:00
Chris Lattner	79be90c3c7	Add support for multiple-return values in inline asm. This should get inline asm working as well as it did previously with the CBE with the new MRV support for inline asm. llvm-svn: 51420	2008-05-22 06:19:37 +00:00
Evan Cheng	f945f94397	movsd and movq do not require 16-byte alignment. This fixes vec_set-5.ll on Linux. llvm-svn: 51327	2008-05-20 18:24:47 +00:00
Evan Cheng	974722b16f	runOnMachineFunction should set IsPIC because relocation model may have been changed. llvm-svn: 51291	2008-05-20 01:56:59 +00:00
Dale Johannesen	5bf742f2aa	Handle quoted names when constructing $stub's, $non_lazy_ptr's and $lazy_ptr's. llvm-svn: 51277	2008-05-19 21:38:18 +00:00
Chris Lattner	1bafa6c886	trip count computation deficiency llvm-svn: 51222	2008-05-17 15:37:38 +00:00
Dale Johannesen	2e1d5e487b	Record weak external linkage in a case where we were missing it. gcc.dg/darwin-weakimport-2.c. Handle common and weak differently for darwin ppc32. llvm-svn: 51201	2008-05-16 20:09:25 +00:00
Gabor Greif	e1f6e4b21d	API change for {BinaryOperator\|CmpInst\|CastInst}::create*() --> Create. Legacy interfaces will be in place for some time. (Merge from use-diet branch.) llvm-svn: 51200	2008-05-16 19:29:10 +00:00
Dale Johannesen	876dd3ca9e	Treat common as distinct from weak global on Darwin x86. llvm-svn: 51172	2008-05-16 00:52:06 +00:00
Evan Cheng	29e59ad6c9	Fix typos and comments. llvm-svn: 51165	2008-05-15 22:13:02 +00:00
Evan Cheng	ef377adca0	Make use of vector load and store operations to implement memcpy, memmove, and memset. Currently only X86 target is taking advantage of these. llvm-svn: 51140	2008-05-15 08:39:06 +00:00
Evan Cheng	ad2b7a7b97	Silence warnings. llvm-svn: 51129	2008-05-14 20:33:21 +00:00
Dale Johannesen	ce4396bc92	Add CommonLinkage; currently tentative definitions are represented as "weak", but there are subtle differences in some cases on Darwin, so we need both. The intent is that "common" will behave identically to "weak" unless somebody changes their target to do something else. No functional change as yet. llvm-svn: 51118	2008-05-14 20:12:51 +00:00
Sanjiv Gupta	7fc6027873	Detabification. Fixed indentation and spacing. Changed cout to DOUT, and TODOs to FIXMEs. Other changes as per coding conventions. llvm-svn: 51105	2008-05-14 11:31:39 +00:00
Nicolas Geoffray	fbdca96469	Fix typo in ParameterAttribute fields usage. Add an include to make the Cpp backend output compilable. llvm-svn: 51095	2008-05-14 07:52:03 +00:00
Sanjiv Gupta	1f8c9ef4cc	Fixed the file description header at the top to remove the developer name. llvm-svn: 51094	2008-05-14 06:50:01 +00:00
Evan Cheng	6f34ed0d36	Doh. Alignment is in bytes, not in bits. llvm-svn: 51092	2008-05-14 02:49:43 +00:00
Dan Gohman	eabd647cd5	Change target-specific classes to use more precise static types. This eliminates the need for several awkward casts, including the last dynamic_cast under lib/Target. llvm-svn: 51091	2008-05-14 01:58:56 +00:00
Chris Lattner	03ce206143	add a note llvm-svn: 51062	2008-05-13 19:56:20 +00:00
Evan Cheng	f8ab712fa9	- Fix the pasto in the fix for a previous pasto. - Incorporate Chris' comment suggestion. llvm-svn: 51061	2008-05-13 18:59:59 +00:00
Chris Lattner	d17f58ae6e	add a note llvm-svn: 51060	2008-05-13 18:48:54 +00:00
Nate Begeman	6645714f16	Fix one more encoding bug. llvm-svn: 51057	2008-05-13 17:52:09 +00:00
Evan Cheng	595e226085	- Don't treat anyext 16-bit load as a 32-bit load if it's volatile. - Correct a pasto. llvm-svn: 51054	2008-05-13 16:45:56 +00:00
Sanjiv Gupta	4394c2376c	Adding files for Microchip's PIC16 target. A brief description about PIC16: =============================== PIC16 is an 8-bit microcontroller with only one 8-bit register which is the accumulator. All arithmetic/load/store operations are 8-bit only. The architecture has two address spaces: program and data. The program memory is divided into 2K pages and the data memory is divided into banks of 128 byte, with only 80 usable bytes, resulting in an non-contiguous data memory. It supports direct data memory access (by specifying the address as part of the instruction) and indirect data and program memory access (in an unorthodox fashion which utilize a 16 bit pointer register). Two classes of registers exist: (8-bit class which is only one accumulator) (16-bit class, which contains one or more 16 bit pointer(s)) llvm-svn: 51027	2008-05-13 09:02:57 +00:00
Evan Cheng	1120279ae6	Instead of a vector load, shuffle and then extract an element. Load the element from address with an offset. pshufd $1, (%rdi), %xmm0 movd %xmm0, %eax => movl 4(%rdi), %eax llvm-svn: 51026	2008-05-13 08:35:03 +00:00
Nate Begeman	50f7ef30bb	Fix and encoding error in the psrad xmm, imm8 instruction. llvm-svn: 51020	2008-05-13 01:47:52 +00:00
Evan Cheng	3f40c69083	On x86, it's safe to treat i32 load anyext as a normal i32 load. Ditto for i8 anyext load to i16. llvm-svn: 51019	2008-05-13 00:54:02 +00:00
Dan Gohman	d78c400b5b	Clean up the use of static and anonymous namespaces. This turned up several things that were neither in an anonymous namespace nor static but not intended to be global. llvm-svn: 51017	2008-05-13 00:00:25 +00:00
Nate Begeman	b87e63a730	Teach Legalize how to scalarize VSETCC Teach X86 a few more vsetcc patterns. Custom lowering for unsupported ones is next. llvm-svn: 51009	2008-05-12 23:09:43 +00:00
Evan Cheng	b980f6fb3d	Xform bitconvert(build_pair(load a, load b)) to a single load if the load locations are at the right offset from each other. llvm-svn: 51008	2008-05-12 23:04:07 +00:00
Bill Wendling	1e11768a4f	Constify the machine instruction passed into the "is{Trivially,Really}ReMaterializable" methods. llvm-svn: 51001	2008-05-12 20:54:26 +00:00
Nate Begeman	d875c3e2fd	Initial X86 codegen support for VSETCC. llvm-svn: 51000	2008-05-12 20:34:32 +00:00
Dan Gohman	0863b19ae6	Fix a copy+paste bug; pseudo-instructions shouldn't have encoding information. llvm-svn: 50997	2008-05-12 20:22:45 +00:00
Evan Cheng	2609d5e779	Refactor isConsecutiveLoad from X86 to TargetLowering so DAG combiner can make use of it. llvm-svn: 50991	2008-05-12 19:56:52 +00:00
Nate Begeman	cfcb56091b	Add support for vicmp/vfcmp codegen, more legalize support coming. This is necessary to unbreak the build. llvm-svn: 50988	2008-05-12 19:40:03 +00:00
Dan Gohman	906716c40f	Fix a compile error on compilers that still want a return value in a non-void function that calls abort. llvm-svn: 50969	2008-05-12 16:17:19 +00:00
Anton Korobeynikov	a38e72d247	Add note llvm-svn: 50959	2008-05-11 14:33:15 +00:00
Evan Cheng	71b9afb053	When transforming a vector_shuffle to a load, the base address must not be an undef. llvm-svn: 50940	2008-05-10 06:46:49 +00:00
Dan Gohman	3c0e11af64	For now, abort when an ISD::VAARG is encountered on x86-64, rather than silently generate invalid code. llvm-gcc does not currently use VAArgInst; it lowers va_arg in the front-end. llvm-svn: 50930	2008-05-10 01:26:14 +00:00
Evan Cheng	da2587cedc	Some clean up. llvm-svn: 50929	2008-05-10 00:59:18 +00:00
Evan Cheng	bb48d55a88	If movl top bits are undef, let it be selected to movlps, etc. llvm-svn: 50928	2008-05-10 00:58:41 +00:00
Evan Cheng	867af2678f	Add a pattern to do move the low element of a v4f32 and zero extend the rest. llvm-svn: 50922	2008-05-09 23:37:55 +00:00
Evan Cheng	961339bbdb	Handle a few more cases of folding load i64 into xmm and zero top bits. Note, some of the code will be moved into target independent part of DAG combiner in a subsequent patch. llvm-svn: 50918	2008-05-09 21:53:03 +00:00
Evan Cheng	0360ecbec1	Use movq to move low half of XMM register and zero-extend the rest. llvm-svn: 50874	2008-05-08 22:35:02 +00:00
Evan Cheng	78af38c392	Handle vector move / load which zero the destination register top bits (i.e. movd, movq, movss (addr), movsd (addr)) with X86 specific dag combine. llvm-svn: 50838	2008-05-08 00:57:18 +00:00
Duncan Sands	e2b0bf43a7	Output correct exception handling and frame info on x86-64 linux. This causes no regressions on 32 bit linux and 32 bit ppc. More tests pass on 64 bit ppc with no regressions. I didn't turn on eh on 64 bit linux because the intrinsics needed to compile the eh runtime aren't done yet. But if you turn it on and link with the mainline runtime then eh seems to work fine on x86-64 linux with this patch. Thanks to Dale for testing. The main point of the patch is that if you output that some object is encoded using 4 bytes you had better not output 8 bytes for it: the patch makes everything consistent. llvm-svn: 50825	2008-05-07 19:11:09 +00:00
Chris Lattner	888594bdf4	Match things like 'armv5tejl-unknown-linux-gnu' for PR2290 llvm-svn: 50698	2008-05-06 02:29:28 +00:00
Dan Gohman	6a2da37c0e	Make several variable declarations static. llvm-svn: 50696	2008-05-06 01:53:16 +00:00
Chris Lattner	6e2bf7c67e	add a micro optzn. llvm-svn: 50681	2008-05-05 23:19:45 +00:00
Mon P Wang	310a38d51e	Improved generated code for atomic operators llvm-svn: 50677	2008-05-05 22:56:23 +00:00
Evan Cheng	dbfcce37fe	Code clean up. No functionality change. llvm-svn: 50675	2008-05-05 22:12:23 +00:00
Mon P Wang	3e58393c3d	Added addition atomic instrinsics and, or, xor, min, and max. llvm-svn: 50663	2008-05-05 19:05:59 +00:00
Dan Gohman	b42c28c3dc	Fix IsLinux being uninitialized on non-Linux targets. llvm-svn: 50660	2008-05-05 18:43:07 +00:00
Anton Korobeynikov	4b0386ce62	Fix 80col violation llvm-svn: 50654	2008-05-05 17:08:59 +00:00
Dan Gohman	6fd71c6512	Use a dedicated IsLinux flag instead of an ELFLinux TargetType. llvm-svn: 50649	2008-05-05 16:11:31 +00:00
Dan Gohman	bcde172222	Add AsmPrinter support for emitting a directive to declare that the code being generated does not require an executable stack. Also, add target-specific code to make use of this on Linux on x86. llvm-svn: 50634	2008-05-05 00:28:39 +00:00
Anton Korobeynikov	9205c8562c	Add General Dynamic TLS model for X86-64. Some parts looks really ugly (look for tlsaddr pattern), but should work. Work is in progress, more models will follow llvm-svn: 50630	2008-05-04 21:36:32 +00:00
Evan Cheng	d9481366e3	Select vector shift with non-immediate i32 shift amount operand by first moving the operand into the right register. llvm-svn: 50619	2008-05-04 09:15:50 +00:00
Evan Cheng	cdf22f2953	Add separate intrinsics for MMX / SSE shifts with i32 integer operands. This allow us to simplify the horribly complicated matching code. llvm-svn: 50601	2008-05-03 00:52:09 +00:00
Evan Cheng	fa8f9f937a	Undo r50574. We are already ensuring the folded load address is 16-byte aligned. llvm-svn: 50578	2008-05-02 17:01:01 +00:00
Evan Cheng	4f9cd9181e	80 column violation. llvm-svn: 50575	2008-05-02 07:53:32 +00:00
Evan Cheng	50f82f2c8e	Not safe folding a load + FsXORPSrr into FsXORPSrm. It's loading a FR64 value but the load folding variant expects a 16-byte aligned address. llvm-svn: 50574	2008-05-02 07:50:58 +00:00
Arnold Schwaighofer	be0de34ede	Tail call optimization improvements: Move platform independent code (lowering of possibly overwritten arguments, check for tail call optimization eligibility) from target X86ISelectionLowering.cpp to TargetLowering.h and SelectionDAGISel.cpp. Initial PowerPC tail call implementation: Support ppc32 implemented and tested (passes my tests and test-suite llvm-test). Support ppc64 implemented and half tested (passes my tests). On ppc tail call optimization is performed if caller and callee are fastcc call is a tail call (in tail call position, call followed by ret) no variable argument lists or byval arguments option -tailcallopt is enabled Supported: * non pic tail calls on linux/darwin * module-local tail calls on linux(PIC/GOT)/darwin(PIC) * inter-module tail calls on darwin(PIC) If constraints are not met a normal call will be emitted. A test checking the argument lowering behaviour on x86-64 was added. llvm-svn: 50477	2008-04-30 09:16:33 +00:00
Scott Michel	c3a1910a07	Bug fixes and updates for CellSPU, syncing up with trunk. Most notable fixes are target-specific lowering of frame indices, fix constants generated for the FSMBI instruction, and fixing SPUTargetLowering::computeMaskedBitsFor- TargetNode(). llvm-svn: 50462	2008-04-30 00:30:08 +00:00
Anton Korobeynikov	0acc739817	Don't do stupid things: doInitialization(Module&) is not applicable to ModulePass :) llvm-svn: 50433	2008-04-29 18:16:22 +00:00
Dan Gohman	da44054867	Fix the SVOffset values for loads and stores produced by memcpy/memset expansion. It was a bug for the SVOffset value to be used in the actual address calculations. llvm-svn: 50359	2008-04-28 17:15:20 +00:00
Anton Korobeynikov	fac70f2f44	Fix FP return for Win64 ABI llvm-svn: 50342	2008-04-28 07:40:07 +00:00
Anton Korobeynikov	e183b3cd76	Properly lower vararg's FORMAL_ARGUMENTS node on win64 llvm-svn: 50325	2008-04-27 23:15:03 +00:00
Anton Korobeynikov	b5096e9c73	Handle fp80 for win64 llvm-svn: 50324	2008-04-27 22:54:09 +00:00
Chris Lattner	724539c001	A few inline asm cleanups: - Make targetlowering.h fit in 80 cols. - Make LowerAsmOperandForConstraint const. - Make lowerXConstraint -> LowerXConstraint - Make LowerXConstraint return a const char* instead of taking a string byref. llvm-svn: 50312	2008-04-26 23:02:14 +00:00
Chris Lattner	b4224cda3b	no need to implement this method and just have it call the default impl. llvm-svn: 50311	2008-04-26 22:59:59 +00:00
Evan Cheng	1e78184a99	Extract the lower 64-bit if a MMX value is passed in a XMM register. llvm-svn: 50292	2008-04-25 20:13:28 +00:00
Evan Cheng	5ba02020e6	Fix illegal MMX_MOVDQ2Qrr pattern. vector_extract result must be a scalar value. llvm-svn: 50291	2008-04-25 20:12:46 +00:00
Evan Cheng	ccde6dd016	Special handling for MMX values being passed in either GPR64 or lower 64-bits of XMM registers. llvm-svn: 50289	2008-04-25 19:11:04 +00:00
Evan Cheng	6d653b58f9	Fix MMX_MOVQ2DQrr pattern. It's illegal to do a bitconvert from a smaller type to a larger one. llvm-svn: 50278	2008-04-25 18:19:54 +00:00
Chris Lattner	33bd24bd92	add a note llvm-svn: 50267	2008-04-25 17:25:00 +00:00
Evan Cheng	715eaa031c	80 col violation. llvm-svn: 50266	2008-04-25 17:21:40 +00:00
Evan Cheng	59834d1c7a	Not checking for intrinsics which do not have a chain operand. llvm-svn: 50260	2008-04-25 08:55:28 +00:00
Evan Cheng	051da5deaa	- Switch from std::set to SmallPtrSet. - Add comments. llvm-svn: 50259	2008-04-25 08:22:20 +00:00
Evan Cheng	df38b35a1e	MMX argument passing fixes: On Darwin / Linux x86-32, v8i8, v4i16, v2i32 values are passed in MM[0-2]. On Darwin / Linux x86-32, v1i64 values are passed in memory. On Darwin x86-64, v8i8, v4i16, v2i32 values are passed in XMM[0-7]. On Darwin x86-64, v1i64 values are passed in 64-bit GPRs. llvm-svn: 50257	2008-04-25 07:56:45 +00:00
Chris Lattner	741c7a3b49	Loosen up an assertion to allow intrinsics. I really have no idea what this code (findNonImmUse) does, so I'm only guessing that this is the right thing. It would be really really nice if this had comments and perhaps switched to SmallPtrSet (hint hint) :) This fixes rdar://5886601, a crash on gcc.target/i386/sse4_1-pblendw.c llvm-svn: 50252	2008-04-25 05:13:01 +00:00
Evan Cheng	9165e165dc	Fix bug in x86 memcpy / memset lowering. If there are trailing bytes not handled by rep instructions, a new memcpy / memset is introduced for them. However, since source / destination addresses are already adjusted, their offsets should be zero. llvm-svn: 50239	2008-04-25 00:26:43 +00:00
Dan Gohman	c107d0020d	Make these variables static. llvm-svn: 50196	2008-04-23 23:15:23 +00:00
Anton Korobeynikov	1ae135c87b	Drop dead includes llvm-svn: 50192	2008-04-23 22:44:03 +00:00
Anton Korobeynikov	9dcc3e97a4	Adjust option names for C++ backend llvm-svn: 50190	2008-04-23 22:37:03 +00:00
Anton Korobeynikov	78695035c4	First step of implementing PR1538: move llvm2cpp logic to new 'target' llvm-svn: 50189	2008-04-23 22:29:24 +00:00
Dan Gohman	d871fa5cb6	Initial CBE support for multiple return values. llvm-svn: 50187	2008-04-23 21:49:29 +00:00
Anton Korobeynikov	0d6df367f1	Fix typo llvm-svn: 50169	2008-04-23 18:24:25 +00:00
Anton Korobeynikov	965babda19	Only allow increase of max alignment value llvm-svn: 50168	2008-04-23 18:23:50 +00:00
Anton Korobeynikov	c1534dca56	Be over-conservative: scan for all used virtual registers and calculate maximal stack alignment in assumption, that there will be spill of vector register. llvm-svn: 50167	2008-04-23 18:23:30 +00:00
Anton Korobeynikov	2659011b70	Add X86 Maximal Stack Alignment Calculator Pass before RA llvm-svn: 50166	2008-04-23 18:23:05 +00:00
Anton Korobeynikov	156550ae79	Do proper book-keeping of offsets and prologue/epilogue code for stack realignment llvm-svn: 50163	2008-04-23 18:21:27 +00:00
Anton Korobeynikov	89a0a017fb	If stack realignment is used - incoming args will use EBP as base register and locals - ESP llvm-svn: 50162	2008-04-23 18:21:02 +00:00
Anton Korobeynikov	ba5129073c	Eastimate required stack alignment early, so we can decide, whether we will need frame pointer or not llvm-svn: 50161	2008-04-23 18:20:17 +00:00
Anton Korobeynikov	c756b460d9	Cleanup llvm-svn: 50159	2008-04-23 18:19:23 +00:00
Anton Korobeynikov	a8aac3db3f	Simplify llvm-svn: 50158	2008-04-23 18:18:36 +00:00
Anton Korobeynikov	cb195f511d	Make stack alignment options global for all targets llvm-svn: 50157	2008-04-23 18:18:10 +00:00
Anton Korobeynikov	9328fbc4c7	Provide option for enabling-disabling stack realignment llvm-svn: 50156	2008-04-23 18:17:11 +00:00
Anton Korobeynikov	ca150edda6	Disable stack realignment for functions with dynamic-sized alloca's llvm-svn: 50155	2008-04-23 18:16:43 +00:00
Anton Korobeynikov	a7495260ee	Provide ABI-correct stack alignment llvm-svn: 50154	2008-04-23 18:16:16 +00:00
Anton Korobeynikov	8843487e16	Provide convenient helpers for some operations llvm-svn: 50153	2008-04-23 18:15:48 +00:00
Anton Korobeynikov	2ccafa47ac	Whitespace cleanup llvm-svn: 50152	2008-04-23 18:15:11 +00:00
Dan Gohman	f166d2d0d6	Implement an x86-64 ABI detail of passing structs by hidden first argument. The x86-64 ABI requires the incoming value of %rdi to be copied to %rax on exit from a function that is returning a large C struct. Also, add a README-X86-64 entry detailing the missed optimization opportunity and proposing an alternative approach. llvm-svn: 50075	2008-04-21 23:59:07 +00:00
Dan Gohman	db08f5218e	Fix the encoding of the MMX movd that moves from MMX to 64-bit GPR. llvm-svn: 50053	2008-04-21 19:52:29 +00:00
Chris Lattner	a89143f1e0	Add an ugly note. llvm-svn: 50029	2008-04-21 04:46:30 +00:00
Nicolas Geoffray	984e7199cc	Don't forget to update the current operand when getting the size of an instruction. llvm-svn: 50007	2008-04-20 23:36:47 +00:00
Chris Lattner	470ab00c76	A better fix for my previous patch, MOVZQI2PQIrr just requires SSE2. llvm-svn: 49986	2008-04-20 05:52:46 +00:00
Chris Lattner	3b18762f40	Switch to using Simplified ConstantFP::get API. llvm-svn: 49977	2008-04-20 00:41:09 +00:00
Evan Cheng	5102bd9359	64-bit atomic operations. llvm-svn: 49949	2008-04-19 02:30:38 +00:00
Evan Cheng	5e7ee0a002	Also LXCHG64 -> XCHG64rm. llvm-svn: 49948	2008-04-19 02:05:42 +00:00
Evan Cheng	51096affb5	PPC32 atomic operations. llvm-svn: 49947	2008-04-19 01:30:48 +00:00
Evan Cheng	7f4240a47c	xchg which references a memory operand does not need to lock prefix. Atomicity is guaranteed. llvm-svn: 49946	2008-04-19 01:20:30 +00:00
Dan Gohman	ad4071a9e1	Fix the handling of va_copy on x86-64. As of llvm-gcc r49920 llvm-gcc is now lowering va_copy on x86-64, so this completes the fix for PR2230. llvm-svn: 49922	2008-04-18 20:55:41 +00:00
Evan Cheng	00bd8d904a	- Fix atomic operation JIT encoding. - Remove unused instructions. llvm-svn: 49921	2008-04-18 20:55:36 +00:00
Evan Cheng	5879213597	Also support Intel asm syntax. llvm-svn: 49878	2008-04-17 23:35:10 +00:00
Evan Cheng	4704baa555	Fix assembly code for atomic operations. llvm-svn: 49869	2008-04-17 21:26:35 +00:00
Evan Cheng	147cb764b5	Don't forget about sub-register indices when rematting instructions. llvm-svn: 49830	2008-04-16 23:44:44 +00:00
Dale Johannesen	c1279f5e4b	Unbreak build on x86-64. llvm-svn: 49822	2008-04-16 22:24:33 +00:00
Nicolas Geoffray	a7557dfe71	Correlate stubs with functions in JIT: when emitting a stub, the JIT tells the memory manager which function the stub will resolve. llvm-svn: 49814	2008-04-16 20:46:05 +00:00
Nicolas Geoffray	ae84bbdbed	Infrastructure for getting the machine code size of a function and an instruction. X86, PowerPC and ARM are implemented llvm-svn: 49809	2008-04-16 20:10:13 +00:00
Evan Cheng	a15cee1036	Initialize X863DNowLevel. llvm-svn: 49808	2008-04-16 19:03:02 +00:00
Roman Levenstein	a3ee1a38a3	Ongoing work on improving the instruction selection infrastructure: Rename SDOperandImpl back to SDOperand. Introduce the SDUse class that represents a use of the SDNode referred by an SDOperand. Now it is more similar to Use/Value classes. Patch is approved by Dan Gohman. llvm-svn: 49795	2008-04-16 16:15:27 +00:00
Dan Gohman	d43d3beeb0	Add support for the form of the SSE41 extractps instruction that puts its result in a 32-bit GPR. llvm-svn: 49762	2008-04-16 02:32:24 +00:00
Dan Gohman	8c99ccaf96	Recreate the size SDNode instead of reusing the old one in the x86 memcpy lowering code; this ensures that the size node has the desired result type. This fixes a regression from r49572 with @llvm.memcpy.i64 on x86-32. llvm-svn: 49761	2008-04-16 01:32:32 +00:00
Dan Gohman	3dd8ba6235	Remove X86_64SRet; it isn't used anymore. llvm-svn: 49759	2008-04-16 00:24:30 +00:00
Dan Gohman	01a5d36d9d	Add movd instructions to move from MMX registers to 64-bit GPR registers on x86-64. llvm-svn: 49757	2008-04-15 23:55:07 +00:00
Nicolas Geoffray	7000c8f1aa	Change Divided flag to Split, as suggested by Evan llvm-svn: 49715	2008-04-15 08:08:50 +00:00
Dan Gohman	4fff979a43	Remove unnecessary <sstream> includes. llvm-svn: 49681	2008-04-14 20:40:47 +00:00
Dan Gohman	2505d86783	Fix const-correctness issues with the SrcValue handling in the memory intrinsic expansion code. llvm-svn: 49666	2008-04-14 17:55:48 +00:00
Dale Johannesen	876224b1e8	Reverse sense of unwind-tables option. This means stack tracebacks on Darwin x86-64 won't work by default; nevertheless, everybody but me thinks this is a good idea. llvm-svn: 49663	2008-04-14 17:54:17 +00:00
Nicolas Geoffray	dcc2eda5fc	Add a divided flag for the first piece of an argument divided into mulitple parts. Fixes PR1643 llvm-svn: 49611	2008-04-13 13:40:22 +00:00
Anton Korobeynikov	b9f38f38fa	Provide option for stack alignment override llvm-svn: 49593	2008-04-12 22:12:22 +00:00
Arnold Schwaighofer	634fc9a33a	This patch corrects the handling of byval arguments for tailcall optimized x86-64 (and x86) calls so that they work (... at least for my test cases). Should fix the following problems: Problem 1: When i introduced the optimized handling of arguments for tail called functions (using a sequence of copyto/copyfrom virtual registers instead of always lowering to top of the stack) i did not handle byval arguments correctly e.g they did not work at all :). Problem 2: On x86-64 after the arguments of the tail called function are moved to their registers (which include ESI/RSI etc), tail call optimization performs byval lowering which causes xSI,xDI, xCX registers to be overwritten. This is handled in this patch by moving the arguments to virtual registers first and after the byval lowering the arguments are moved from those virtual registers back to RSI/RDI/RCX. llvm-svn: 49584	2008-04-12 18:11:06 +00:00
Dan Gohman	544ab2c50b	Drop ISD::MEMSET, ISD::MEMMOVE, and ISD::MEMCPY, which are not Legal on any current target and aren't optimized in DAGCombiner. Instead of using intermediate nodes, expand the operations, choosing between simple loads/stores, target-specific code, and library calls, immediately. Previously, the code to emit optimized code for these operations was only used at initial SelectionDAG construction time; now it is used at all times. This fixes some cases where rep;movs was being used for small copies where simple loads/stores would be better. This also cleans up code that checks for alignments less than 4; let the targets make that decision instead of doing it in target-independent code. This allows x86 to use rep;movs in low-alignment cases. Also, this fixes a bug that resulted in the use of rep;stos for memsets of 0 with non-constant memory size when the alignment was at least 4. It's better to use the library in this case, which can be significantly faster when the size is large. This also preserves more SourceValue information when memory intrinsics are lowered into simple loads/stores. llvm-svn: 49572	2008-04-12 04:36:06 +00:00
Dan Gohman	8c7cf88f7e	Fix a bug that prevented x86-64 from using rep.movsq for 8-byte-aligned data. llvm-svn: 49571	2008-04-12 02:35:39 +00:00
Nate Begeman	7417348a7e	80 col fix llvm-svn: 49569	2008-04-12 00:47:57 +00:00
Chris Lattner	aeb23a8a34	add a note, this is actually not too bad to implement. llvm-svn: 49466	2008-04-10 05:54:50 +00:00
Chris Lattner	c692188075	move the x86-32 part of PR2108 here. llvm-svn: 49465	2008-04-10 05:37:47 +00:00
Chris Lattner	ad75302497	Fix the x86-64 side of PR2108 by adding a v2f64 version of MOVZQI2PQIrr. This would be better handled as a dag combine (with the goal of eliminating the bitconvert) but I don't know how to do that safely. Thoughts welcome. llvm-svn: 49463	2008-04-10 05:13:43 +00:00
Dan Gohman	33b3300178	Make isVectorClearMaskLegal's operand list const. llvm-svn: 49446	2008-04-09 20:09:42 +00:00
Dan Gohman	3d074a3125	Add XMM1 as a second return value register for f32 and f64 on x86-64. This is needed for the x86-64-ABI handling of structs that contain floating-point members that are returned by value. llvm-svn: 49441	2008-04-09 17:54:37 +00:00
Dan Gohman	cbf87313a2	Add DX as a second return value register for i16 on x86. llvm-svn: 49440	2008-04-09 17:53:38 +00:00
Dale Johannesen	4c0c018bc5	Rename -disable-required-unwind-tables to unwind-tables-optional. llvm-svn: 49389	2008-04-08 18:07:49 +00:00
Dale Johannesen	fe767621ca	Handle the situation in 2008-01-25-EmptyFunction.ll correctly when unwind info is being generated. llvm-svn: 49366	2008-04-08 00:37:56 +00:00
Dale Johannesen	344aec2952	Implement new llc flag -disable-required-unwind-tables. Corresponds to -fno-unwind-tables (usually default in gcc). llvm-svn: 49361	2008-04-08 00:10:24 +00:00
Dan Gohman	3bc3ddd638	Rename MemOperand to MachineMemOperand. This was suggested by review feedback from Chris quite a while ago. No functionality change. llvm-svn: 49348	2008-04-07 19:35:22 +00:00
Roman Levenstein	51f532f92d	Re-commit of the r48822, where the infinite looping problem discovered by Dan Gohman is fixed. llvm-svn: 49330	2008-04-07 10:06:32 +00:00
Gabor Greif	e9ecc68d8f	API changes for class Use size reduction, wave 1. Specifically, introduction of XXX::Create methods for Users that have a potentially variable number of Uses. llvm-svn: 49277	2008-04-06 20:25:17 +00:00
Evan Cheng	f77b5ef3d0	Favors pshufd over shufps when shuffling elements from one vector. pshufd is faster than shufps. llvm-svn: 49244	2008-04-05 00:30:36 +00:00
Torok Edwin	b20e659770	strdup needs <cstring>. This fixes a build error with g++-4.3. llvm-svn: 49218	2008-04-04 16:08:00 +00:00
Evan Cheng	6c66bd368e	Re-enable SSE4. llvm-svn: 49158	2008-04-03 08:53:29 +00:00
Evan Cheng	6db4b4cc65	Fix x86-64 encoding bug. REX prefix must always follow 0x0F prefix. For example, extractps in 64bit mode: 66 REX 0F 3A 17, not 66 0F 3A REX 17. llvm-svn: 49157	2008-04-03 08:53:17 +00:00
Evan Cheng	d9129d1de3	Cosmetic llvm-svn: 49156	2008-04-03 07:45:18 +00:00
Evan Cheng	3063c5546e	Temporarily disabling SSE4 until we fix the encoding issues. llvm-svn: 49129	2008-04-03 04:49:54 +00:00
Evan Cheng	025cea1126	Backing out 48222 temporarily. llvm-svn: 49124	2008-04-03 03:13:16 +00:00
Dan Gohman	bd72cea737	Suppress the 128-bit integer typedef on 32-bit targets, because it causes compile errors. llvm-svn: 49122	2008-04-02 23:52:49 +00:00
Dan Gohman	39d8b26322	Partial CBackend support for 128-bit integers. This is needed now that llvm-gcc is lowering appropriately-sized struct returns to i128 on x86-64. llvm-svn: 49109	2008-04-02 19:40:14 +00:00
Dale Johannesen	8780ecbbac	Cosmetic changes per EH patch review feedback. llvm-svn: 49096	2008-04-02 17:04:45 +00:00
Anton Korobeynikov	20c9e4cbee	Add new CC lowering rule: provide a list of registers, which can be 'shadowed', when some another register is used for argument passing. Currently is used on Win64. llvm-svn: 49079	2008-04-02 05:23:57 +00:00
Dale Johannesen	fd967cf3fa	Recommitting EH patch; this should answer most of the review feedback. -enable-eh is still accepted but doesn't do anything. EH intrinsics use Dwarf EH if the target supports that, and are handled by LowerInvoke otherwise. The separation of the EH table and frame move data is, I think, logically figured out, but either one still causes full EH info to be generated (not sure how to split the metadata correctly). MachineModuleInfo::needsFrameInfo is no longer used and is removed. llvm-svn: 49064	2008-04-02 00:25:04 +00:00
Evan Cheng	b86595fb0a	ReMat of load from stub in pic mode extends the life of pic base. Currently spiller doesn't do a good job of estimating the impact. Disable for now. llvm-svn: 49059	2008-04-01 23:26:12 +00:00
Evan Cheng	19a6dd9f2a	Remove unnecessary and non-deterministic checking code. Re-enable remat of load from gv stub. llvm-svn: 49054	2008-04-01 21:38:20 +00:00
Dan Gohman	cb9f8f6e4e	Don't use __bzero for memset if the second argument isn't zero. llvm-svn: 49050	2008-04-01 20:56:18 +00:00
Dan Gohman	980d7200c1	Speculatively micro-optimize memory-zeroing calls on Darwin 10. llvm-svn: 49048	2008-04-01 20:38:36 +00:00
Dale Johannesen	5e4e051c2a	Revert 49006 for the moment. llvm-svn: 49046	2008-04-01 20:00:57 +00:00
Evan Cheng	306e3dcff4	Disabling remat of load from gv stub (temporarily) again to fix llvmgcc bootstrap miscompare. llvm-svn: 49037	2008-04-01 07:33:13 +00:00
Evan Cheng	86e476b7cb	Unbreak ARM / Thumb soft FP support. llvm-svn: 49012	2008-04-01 01:50:16 +00:00
Dale Johannesen	efa81a6979	Accept 'y' constraint (MMX) in inline asm. llvm-svn: 49011	2008-04-01 00:57:48 +00:00
Dale Johannesen	7d02cf3c9c	Emit exception handling info for functions which are not marked nounwind, or for all functions when -enable-eh is set, provided the target supports Dwarf EH. llvm-gcc generates nounwind in the right places; other FEs will need to do so also. Given such a FE, -enable-eh should no longer be needed. llvm-svn: 49006	2008-03-31 23:40:23 +00:00
Evan Cheng	e4f77c69ac	It's not safe to fold a load from GV stub or constantpool into a two-address use. llvm-svn: 49002	2008-03-31 23:19:51 +00:00
Evan Cheng	ed6e34fe41	Move reMaterialize() from TargetRegisterInfo to TargetInstrInfo. llvm-svn: 48995	2008-03-31 20:40:39 +00:00
Evan Cheng	1973a46cd3	Re-apply 48911. llvm-svn: 48977	2008-03-31 07:54:19 +00:00
Nick Lewycky	9fb8908457	Moved from PR1570. llvm-svn: 48965	2008-03-30 19:07:11 +00:00
Chris Lattner	0f760dfe09	Fix "Control reaches the end of non-void function" warnings, patch by David Chisnall. llvm-svn: 48963	2008-03-30 18:22:13 +00:00
Dan Gohman	fd2eb00cc2	Fix a tokenfactor node to use the load chain rather than the load value. This fixes PR2177. llvm-svn: 48932	2008-03-28 23:45:16 +00:00
Evan Cheng	b8654202dd	Backing out 48911 for now. It's breaking stuff. llvm-svn: 48922	2008-03-28 17:49:06 +00:00
Evan Cheng	81e0c9a32c	New entry. llvm-svn: 48912	2008-03-28 07:07:06 +00:00
Evan Cheng	9ae4d7b719	Load from stub is already re-materializable. llvm-svn: 48911	2008-03-28 06:49:25 +00:00
Evan Cheng	308e564693	Code clean up. llvm-svn: 48856	2008-03-27 01:45:11 +00:00
Evan Cheng	29e62a59f3	Allow certain lea instructions to be rematerialized. llvm-svn: 48855	2008-03-27 01:41:09 +00:00
Evan Cheng	4fb07c6500	Remove an unused command line option. llvm-svn: 48854	2008-03-27 01:30:24 +00:00
Roman Levenstein	358e04a185	Use a linked data structure for the uses lists of an SDNode, just like LLVM Value/Use does and MachineRegisterInfo/MachineOperand does. This allows constant time for all uses list maintenance operations. The idea was suggested by Chris. Reviewed by Evan and Dan. Patch is tested and approved by Dan. On normal use-cases compilation speed is not affected. On very big basic blocks there are compilation speedups in the range of 15-20% or even better. llvm-svn: 48822	2008-03-26 12:39:26 +00:00
Evan Cheng	292063603e	Fix some SSE4.1 instruction encoding bugs. llvm-svn: 48815	2008-03-26 08:11:49 +00:00
Dale Johannesen	ad6c23d5e9	Use ## for comment delimiter on darwin x86-32, so llvm's output .s files will go through gcc -std=c99 without triggering preprocesser errors. Approach suggested by Daveed Vandevoorde. llvm-svn: 48808	2008-03-25 23:29:30 +00:00
Evan Cheng	ddc58ff92a	Smaller function alignment when optimizing for size. llvm-svn: 48805	2008-03-25 22:29:46 +00:00
Evan Cheng	88c44ef91f	Rename option -optimizefor-size to -optimize-size. llvm-svn: 48804	2008-03-25 22:28:39 +00:00
Dan Gohman	c60c67fc37	Add explicit keywords. llvm-svn: 48801	2008-03-25 22:06:05 +00:00
Dan Gohman	bdc24adaaf	A quick nm audit turned up several fixed tables and objects that were marked read-write. Use const so that they can be allocated in a read-only segment. llvm-svn: 48800	2008-03-25 21:45:14 +00:00
Devang Patel	246a52740b	Add optimize-for-size knob. llvm-svn: 48793	2008-03-25 21:02:35 +00:00
Dan Gohman	883cbfd0ba	Add CMP32mr and friends to the load-unfolding table. Among other things, this allows the scheduler to unfold a load operand in the 2008-01-08-SchedulerCrash.ll testcase, so it now successfully clones the comparison to avoid a pushf+popf. llvm-svn: 48777	2008-03-25 16:53:19 +00:00
Evan Cheng	50b536eef9	Add \t after .set. Fix by Jay Freeman. llvm-svn: 48753	2008-03-24 23:36:49 +00:00
Bill Wendling	6306183df3	Use the bit size of the operand instead of the hard-coded 32 to generate the mask. llvm-svn: 48750	2008-03-24 23:16:37 +00:00
Evan Cheng	615488ab45	- SSE4.1 extractfps extracts a f32 into a gr32 register. Very useful! Not. Fix the instruction specification and teaches lowering code to use it only when the only use is a store instruction. llvm-svn: 48746	2008-03-24 21:52:23 +00:00
Evan Cheng	58db865d6e	Remove duplicated entries. llvm-svn: 48714	2008-03-23 22:56:07 +00:00
Anton Korobeynikov	1fdd5e9133	Minor typo fixes. Also add another FIXME. llvm-svn: 48710	2008-03-23 20:32:06 +00:00
Anton Korobeynikov	17fb491469	Add license header llvm-svn: 48707	2008-03-23 14:53:18 +00:00

... 4 5 6 7 8 ...

8737 Commits