llvm-project

Commit Graph

Author	SHA1	Message	Date
Dan Gohman	31c8123d07	Replace some std::vectors that showed up in heap profiling with SmallVectors. Change the signature of TargetLowering::LowerArguments to avoid returning a vector by value, and update the two targets which still use this directly, Sparc and IA64, accordingly. llvm-svn: 52917	2008-06-30 20:31:15 +00:00
Dan Gohman	328e26d0ac	Correct the allocation size for CCState's UsedRegs member, which only needs one bit for each register. UsedRegs is a SmallVector sized at 16, so this eliminates a heap allocation/free for every call and return processed by Legalize on most targets. llvm-svn: 52915	2008-06-30 20:25:31 +00:00
Duncan Sands	9e08148f29	ExpungeNode is only needed for new nodes! This fixes CodeGen/PowerPC/2008-06-19-LegalizerCrash.ll when using the new LegalizeTypes infrastructure. llvm-svn: 52903	2008-06-30 16:43:45 +00:00
Duncan Sands	36410f6cde	Support for VAARG. As noted in a comment, this is wrong for types like x86 long double and i1, but no worse than what is done in LegalizeDAG. llvm-svn: 52898	2008-06-30 13:55:15 +00:00
Duncan Sands	dd5354df89	Support for promoting select_cc operands. llvm-svn: 52895	2008-06-30 11:50:11 +00:00
Duncan Sands	1ae6ef83ee	Revert the SelectionDAG optimization that makes it impossible to create a MERGE_VALUES node with only one result: sometimes it is useful to be able to create a node with only one result out of one of the results of a node with more than one result, for example because the new node will eventually be used to replace a one-result node using ReplaceAllUsesWith, cf X86TargetLowering::ExpandFP_TO_SINT. On the other hand, most users of MERGE_VALUES don't need this and for them the optimization was valuable. So add a new utility method getMergeValues for creating MERGE_VALUES nodes which by default performs the optimization. Change almost everywhere to use getMergeValues (and tidy some stuff up at the same time). llvm-svn: 52893	2008-06-30 10:19:09 +00:00
Evan Cheng	da3db11db3	- Re-apply 52748 and friends with fix. GetConstantStringInfo() returns an empty string for ConstantAggregateZero case which surprises selectiondag. - Correctly handle memcpy from constant string which is zero-initialized. llvm-svn: 52891	2008-06-30 07:31:25 +00:00
Chris Lattner	9d3740ed1c	Implement split and scalarize for SELECT_CC, fixing PR2504 llvm-svn: 52887	2008-06-30 02:43:01 +00:00
Anton Korobeynikov	a7c583d584	Revert (52748 and friends): Move GetConstantStringInfo to lib/Analysis. Remove string output routine from Constant. Update all callers. Change debug intrinsic api slightly to accomodate move of routine, these now return values instead of strings. This unbreaks llvm-gcc bootstrap. llvm-svn: 52884	2008-06-29 17:57:03 +00:00
Chris Lattner	3cffa471d9	Really fix the bootstrap failure. llvm-svn: 52854	2008-06-28 06:24:50 +00:00
Chris Lattner	1701328675	Add back the capability to include nul characters in strings with GetConstantStringInfo. This will hopefully restore llvm-gcc to happy bootstrap land. llvm-svn: 52851	2008-06-28 05:33:32 +00:00
Dan Gohman	6f7b5a6392	When folding a bitcast into a load or store, preserve the alignment information of the original load or store, which is checked to be at least as good, and possibly better. llvm-svn: 52849	2008-06-28 00:45:22 +00:00
Chris Lattner	735705bc3e	simplify this check, GetConstantStringInfo validates that a global is constant already. No functionality change. llvm-svn: 52812	2008-06-27 03:18:41 +00:00
Bill Wendling	c758698d2c	Refactor the DebugInfoDesc stuff out of the MachineModuleInfo file. Clean up some uses of std::vector, where it's return std::vector by value. Yuck! llvm-svn: 52800	2008-06-27 00:09:40 +00:00
Chris Lattner	df1cbdd645	duncan points out that isOperationLegal includes a check for type legality. Thanks Duncan! llvm-svn: 52786	2008-06-26 17:16:00 +00:00
Eric Christopher	d0ab9c47e6	Move GetConstantStringInfo to lib/Analysis. Remove string output routine from Constant. Update all callers. Change debug intrinsic api slightly to accomodate move of routine, these now return values instead of strings. llvm-svn: 52748	2008-06-26 00:31:12 +00:00
Chris Lattner	b1e66ce3bb	when we know the signbit of an input to uint_to_fp is zero, change it to sint_to_fp on targets where that is cheaper (and visaversa of course). This allows us to compile uint_to_fp to: _test: movl 4(%esp), %eax shrl $23, %eax cvtsi2ss %eax, %xmm0 movl 8(%esp), %eax movss %xmm0, (%eax) ret instead of: .align 3 LCPI1_0: ## double .long 0 ## double least significant word 4.5036e+15 .long 1127219200 ## double most significant word 4.5036e+15 .text .align 4,0x90 .globl _test _test: subl $12, %esp movl 16(%esp), %eax shrl $23, %eax movl %eax, (%esp) movl $1127219200, 4(%esp) movsd (%esp), %xmm0 subsd LCPI1_0, %xmm0 cvtsd2ss %xmm0, %xmm0 movl 20(%esp), %eax movss %xmm0, (%eax) addl $12, %esp ret llvm-svn: 52747	2008-06-26 00:16:49 +00:00
Evan Cheng	3fc2372d3a	- Fix a x86 vector isel bug: illegal transformation of a vector_shuffle into a shift. - Add a readme entry for a missing vector_shuffle optimization that results in awful codegen. llvm-svn: 52740	2008-06-25 20:52:59 +00:00
Duncan Sands	33ff5c8d0d	Add support for expanding PPC 128 bit floats. For this it is convenient to permit floats to be used with EXTRACT_ELEMENT, so I tweaked things to allow that. I also added libcalls for ppcf128 to i32 forms of FP_TO_XINT, since they exist in libgcc and this case can certainly occur (and does occur in the testsuite) - before the i64 libcall was being used. Also, the XINT_TO_FP result seemed to be wrong when the argument is an i128: the wrong fudge factor was added (the i32 and i64 cases were handled directly, but the i128 code fell through to some generic softening code which seemed to think it was i64 to f32!). So I fixed it by adding a fudge factor that I found in my breakfast cereal. llvm-svn: 52739	2008-06-25 20:24:48 +00:00
Duncan Sands	6920b254ad	Add/complete support for integer and float select_cc and friends. This code could be factorized a bit but I'm not sure that it's worth it. llvm-svn: 52724	2008-06-25 16:34:21 +00:00
Dan Gohman	aa01afd47c	Remove the OrigVT member from AtomicSDNode, as it is redundant with the base SDNode's VTList. llvm-svn: 52722	2008-06-25 16:07:49 +00:00
Mon P Wang	6a490371c9	Added MemOperands to Atomic operations since Atomics touches memory. Added abstract class MemSDNode for any Node that have an associated MemOperand Changed atomic.lcs => atomic.cmp.swap, atomic.las => atomic.load.add, and atomic.lss => atomic.load.sub llvm-svn: 52706	2008-06-25 08:15:39 +00:00
Dan Gohman	0d8a61eb60	Use the new PriorityQueue in ScheduleDAGList too, which also needs arbitrary-element removal. llvm-svn: 52654	2008-06-23 23:40:09 +00:00
Dan Gohman	fa63cc4e91	Move a DenseMap's declaration outside of a loop, and just call clear() on each iteration. This avoids allocating and deallocating all of DenseMap's memory on each iteration. llvm-svn: 52642	2008-06-23 21:15:00 +00:00
Dan Gohman	b4e2637e9b	Duncan pointed out this code could be tidied. llvm-svn: 52624	2008-06-23 15:29:14 +00:00
Duncan Sands	d803ce29a8	Port some integer multiplication fixes from LegalizeDAG. Bail out with an error if there is no libcall available for the given size of integer. llvm-svn: 52622	2008-06-23 15:15:44 +00:00
Duncan Sands	73ceffa3df	Support for expanding the result of EXTRACT_ELEMENT. llvm-svn: 52621	2008-06-23 15:08:15 +00:00
Duncan Sands	bc12dce8bb	Cleanup up LegalizeTypes handling of loads and stores. llvm-svn: 52620	2008-06-23 14:19:45 +00:00
Duncan Sands	5fb92e58de	Make custom lowering of ADD work correctly. This fixes PR2476; patch by Richard Osborne. The same problem exists for a bunch of other operators, but I'm ignoring this because they will be automagically fixed when the new LegalizeTypes infrastructure lands, since it already solves this problem centrally. llvm-svn: 52610	2008-06-22 09:42:16 +00:00
Dan Gohman	546505e7e1	Simplify some getNode calls. llvm-svn: 52604	2008-06-21 22:06:07 +00:00
Dan Gohman	ea0452016e	canClobberPhysRegDefs shouldn't called without checking hasPhysRegDefs; check this with an assert. llvm-svn: 52603	2008-06-21 22:05:24 +00:00
Dan Gohman	38c19aae38	Use clear() to zero an existing APInt. llvm-svn: 52601	2008-06-21 22:02:15 +00:00
Dan Gohman	14b911d929	Remove a redundant return. llvm-svn: 52585	2008-06-21 19:34:57 +00:00
Dan Gohman	46520a25a4	Remove ScheduleDAG's SUnitMap altogether. Instead, use SDNode's NodeId field, which is otherwise unused after instruction selection, as an index into the SUnit array. llvm-svn: 52583	2008-06-21 19:18:17 +00:00
Dan Gohman	a4db3352f9	Add a priority queue class, which is a wrapper around std::priority_queue and provides fairly efficient removal of arbitrary elements. Switch ScheduleDAGRRList from std::set to this new priority queue. llvm-svn: 52582	2008-06-21 18:35:25 +00:00
Duncan Sands	3bb8999719	Support for load/store of expanded float types. I don't know if a truncating store is possible here, but added support for it anyway. llvm-svn: 52577	2008-06-21 17:00:47 +00:00
Dan Gohman	e6e1348275	Change ScheduleDAG's SUnitMap from DenseMap<SDNode, vector<SUnit> > to DenseMap<SDNode, SUnit>, and adjust the way cloned SUnit nodes are handled so that only the original node needs to be in the map. This speeds up llc on 447.dealII.llvm.bc by about 2%. llvm-svn: 52576	2008-06-21 15:52:51 +00:00
Dan Gohman	4b49be1cbe	Simplify some template parameterization. llvm-svn: 52571	2008-06-21 01:08:22 +00:00
Duncan Sands	f362183c24	Share some code that is common between integer and float expansion (and sometimes vector splitting too). llvm-svn: 52548	2008-06-20 18:40:50 +00:00
Duncan Sands	49295b48eb	Rename the operation of turning a float type into an integer of the same type. Before it was "promotion", but this is confusing because it is quite different to promotion of integers. Call it "softening" instead, inspired by "soft float". llvm-svn: 52546	2008-06-20 17:49:55 +00:00
Dan Gohman	3792c470d5	Clean up some uses of std::distance, now that we have allnodes_size. llvm-svn: 52545	2008-06-20 17:15:19 +00:00
Dan Gohman	593a010c56	Teach ReturnInst lowering about aggregate return values. llvm-svn: 52522	2008-06-20 01:29:26 +00:00
Dan Gohman	44b2c57e2b	Fix the index calculations for the extractvalue lowering code. llvm-svn: 52517	2008-06-20 00:54:19 +00:00
Dan Gohman	c7a32fc8ca	Simplify the ComputeLinearIndex logic and fix a few bugs. llvm-svn: 52516	2008-06-20 00:53:00 +00:00
Evan Cheng	be0429c558	ISD::UNDEF should be expanded recursively / iteratively. llvm-svn: 52508	2008-06-19 22:01:11 +00:00
Duncan Sands	4c69995fb2	Split type expansion into ExpandInteger and ExpandFloat rather than bundling them together. Rename FloatToInt to PromoteFloat (better, if not perfect). Reorganize files by types rather than by operations. llvm-svn: 52408	2008-06-17 14:27:01 +00:00
Chris Lattner	1b08c4a709	add a new -enable-value-prop flag for llcbeta, that enables propagation of value info (sign/zero ext info) from one MBB to another. This doesn't handle much right now because of two limitations: 1) only handles zext/sext, not random bit propagation (no assert exists for this) 2) doesn't handle phis. llvm-svn: 52383	2008-06-17 06:09:18 +00:00
Duncan Sands	0ae829e5d1	Fix spelling. llvm-svn: 52381	2008-06-17 03:24:13 +00:00
Duncan Sands	37c1f5267b	Allow these transforms for types like i256 while still excluding types like i1 (not byte sized) and i120 (loading an i120 requires loading an i64, an i32, an i16 and an i8, which is expensive). llvm-svn: 52310	2008-06-16 08:14:38 +00:00
Duncan Sands	075293ff46	The transforms in visitEXTRACT_VECTOR_ELT are not valid if the load is volatile. Hopefully all wrong DAG combiner transforms of volatile loads and stores have now been caught. llvm-svn: 52293	2008-06-15 20:12:31 +00:00
Duncan Sands	0bc21c0551	LegalizeTypes support for INSERT_VECTOR_ELT with a non-constant index. llvm-svn: 52292	2008-06-15 20:00:14 +00:00
Duncan Sands	b1bfff53fe	Remove a redundant AfterLegalize check. Turn on some code when !AfterLegalize - but since this whole code section is turned off by an "if (0)" it's not really turning anything on. llvm-svn: 52276	2008-06-14 17:48:34 +00:00
Andrew Lenharth	f88d50bfcc	add missing atomic intrinsic from gcc llvm-svn: 52270	2008-06-14 05:48:15 +00:00
Duncan Sands	8651e9c584	Disable some DAG combiner optimizations that may be wrong for volatile loads and stores. In fact this is almost all of them! There are three types of problems: (1) it is wrong to change the width of a volatile memory access. These may be used to do memory mapped i/o, in which case a load can have an effect even if the result is not used. Consider loading an i32 but only using the lower 8 bits. It is wrong to change this into a load of an i8, because you are no longer tickling the other three bytes. It is also unwise to make a load/store wider. For example, changing an i16 load into an i32 load is wrong no matter how aligned things are, since the fact of loading an additional 2 bytes can have i/o side-effects. (2) it is wrong to change the number of volatile load/stores: they may be counted by the hardware. (3) it is wrong to change a volatile load/store that requires one memory access into one that requires several. For example on x86-32, you can store a double in one processor operation, but to store an i64 requires two (two i32 stores). In a multi-threaded program you may want to bitcast an i64 to a double and store as a double because that will occur atomically, and be indivisible to other threads. So it would be wrong to convert the store-of-double into a store of an i64, because this will become two i32 stores - no longer atomic. My policy here is to say that the number of processor operations for an illegal operation is undefined. So it is alright to change a store of an i64 (requires at least two stores; but could be validly lowered to memcpy for example) into a store of double (one processor op). In short, if the new store is legal and has the same size then I say that the transform is ok. It would also be possible to say that transforms are always ok if before they were illegal, whether after they are illegal or not, but that's more awkward to do and I doubt it buys us anything much. However this exposed an interesting thing - on x86-32 a store of i64 is considered legal! That is because operations are marked legal by default, regardless of whether the type is legal or not. In some ways this is clever: before type legalization this means that operations on illegal types are considered legal; after type legalization there are no illegal types so now operations are only legal if they really are. But I consider this to be too cunning for mere mortals. Better to do things explicitly by testing AfterLegalize. So I have changed things so that operations with illegal types are considered illegal - indeed they can never map to a machine operation. However this means that the DAG combiner is more conservative because before it was "accidentally" performing transforms where the type was illegal because the operation was nonetheless marked legal. So in a few such places I added a check on AfterLegalize, which I suppose was actually just forgotten before. This causes the DAG combiner to do slightly more than it used to, which resulted in the X86 backend blowing up because it got a slightly surprising node it wasn't expecting, so I tweaked it. llvm-svn: 52254	2008-06-13 19:07:40 +00:00
Duncan Sands	bf17080ec2	Sometimes (rarely) nodes held in LegalizeTypes maps can be deleted. This happens when RAUW replaces a node N with another equivalent node E, deleting the first node. Solve this by adding (N, E) to ReplacedNodes, which is already used to remap nodes to replacements. This means that deleted nodes are being allowed in maps, which can be delicate: the memory may be reused for a new node which might get confused with the old deleted node pointer hanging around in the maps, so detect this and flush out maps if it occurs (ExpungeNode). The expunging operation is expensive, however it never occurs during a llvm-gcc bootstrap or anywhere in the nightly testsuite. It occurs three times in "make check": Alpha/illegal-element-type.ll, PowerPC/illegal-element-type.ll and X86/mmx-shift.ll. If expunging proves to be too expensive then there are other more complicated ways of solving the problem. In the normal case this patch adds the overhead of a few more map lookups, which is hopefully negligable. llvm-svn: 52214	2008-06-11 11:42:12 +00:00
Dan Gohman	e38cc01244	Teach isGAPlusOffset to respect a GlobalAddressSDNode's offset value, which is something that apparently isn't used much. llvm-svn: 52158	2008-06-09 22:05:52 +00:00
Dan Gohman	6001b91d8e	CodeGen support for aggregate-value function arguments. llvm-svn: 52156	2008-06-09 21:19:23 +00:00
Duncan Sands	67d0f332d5	Various tweaks related to apint codegen. No functionality change for non-funky-sized integers. llvm-svn: 52151	2008-06-09 15:48:25 +00:00
Dan Gohman	d485e4eb5c	Handle empty aggregate values. llvm-svn: 52150	2008-06-09 15:21:47 +00:00
Duncan Sands	93b6609ae2	Remove some DAG combiner assumptions about sizes of integer types. Fix the isMask APInt method to actually work (hopefully) rather than crashing because it adds apints of different bitwidths. It looks like isShiftedMask is also broken, but I'm leaving that one to the APInt people (it is not used anywhere). llvm-svn: 52142	2008-06-09 11:32:28 +00:00
Duncan Sands	11dd424539	Remove comparison methods for MVT. The main cause of apint codegen failure is the DAG combiner doing the wrong thing because it was comparing MVT's using < rather than comparing the number of bits. Removing the < method makes this mistake impossible to commit. Instead, add helper methods for comparing bits and use them. llvm-svn: 52098	2008-06-08 20:54:56 +00:00
Dan Gohman	f6743d70ab	CodeGen support for insertvalue and extractvalue, and for loads and stores of aggregate values. llvm-svn: 52069	2008-06-07 02:02:36 +00:00
Owen Anderson	0bd08cf64c	Connect successors before creating the DAG node for the branch. This has no visible functionality change, but enables a future patch where node creation will update the CFG if it decides to create an unconditional rather than a conditional branch. llvm-svn: 52067	2008-06-07 00:00:23 +00:00
Duncan Sands	f1123e58fc	Tighten up the abstraction slightly. llvm-svn: 52045	2008-06-06 12:49:32 +00:00
Duncan Sands	13237ac3b9	Wrap MVT::ValueType in a struct to get type safety and better control the abstraction. Rename the type to MVT. To update out-of-tree patches, the main thing to do is to rename MVT::ValueType to MVT, and rewrite expressions like MVT::getSizeInBits(VT) in the form VT.getSizeInBits(). Use VT.getSimpleVT() to extract a MVT::SimpleValueType for use in switch statements (you will get an assert failure if VT is an extended value type - these shouldn't exist after type legalization). This results in a small speedup of codegen and no new testsuite failures (x86-64 linux). llvm-svn: 52044	2008-06-06 12:08:01 +00:00
Evan Cheng	976b1eee81	Fix a memcpy lowering bug. Even though the memcpy alignment is smaller than the desired alignment, the frame destination alignment may still be larger than the desired alignment. Don't change its alignment to something smaller. llvm-svn: 51970	2008-06-04 23:37:54 +00:00
Scott Michel	a7d8649f78	Fix spellnig error llvm-svn: 51917	2008-06-03 19:13:20 +00:00
Dan Gohman	057240f4f0	Fold adds and subtracts of zero immediately, instead of waiting for dagcombine to do this. llvm-svn: 51886	2008-06-02 22:27:05 +00:00
Scott Michel	d831cc49e5	Add necessary 64-bit support so that gcc frontend compiles (mostly). Current issue is operand promotion for setcc/select... but looks like the fundamental stuff is implemented for CellSPU. llvm-svn: 51884	2008-06-02 22:18:03 +00:00
Dan Gohman	9a19f33842	Remove an unused variable. llvm-svn: 51807	2008-05-31 01:44:25 +00:00
Dan Gohman	8807147ada	Remove an unused variable. llvm-svn: 51721	2008-05-30 00:56:36 +00:00
Dan Gohman	714663ab94	Expand small memmovs using inline code. Set the X86 threshold for expanding memmove to a more plausible value, now that it's actually being used. llvm-svn: 51696	2008-05-29 19:42:22 +00:00
Evan Cheng	5e28227dbd	Implement vector shift up / down and insert zero with ps{rl}lq / ps{rl}ldq. llvm-svn: 51667	2008-05-29 08:22:04 +00:00
Duncan Sands	698348dfac	Fix some constructs that gcc-4.4 warns about. llvm-svn: 51591	2008-05-27 11:50:51 +00:00
Dan Gohman	643b3a0581	Add #includes to make some dependencies explicit. llvm-svn: 51496	2008-05-23 20:40:06 +00:00
Dan Gohman	6d5f120c5c	Generalize the new code in instcombine's ComputeNumSignBits for handling and/or to handle more cases (such as this add-sitofp.ll testcase), and port it to selectiondag's ComputeNumSignBits. llvm-svn: 51469	2008-05-23 02:28:01 +00:00
Dan Gohman	396ed504f1	Use isSingleValueType instead of isFirstClassType to exclude struct and array types. llvm-svn: 51460	2008-05-23 00:34:04 +00:00
Dan Gohman	fe13618682	Port the fix for the select operator from instcombine's ComputeNumSignBits to SelectionDAG's ComputeNumSignBits. llvm-svn: 51348	2008-05-20 20:59:51 +00:00
Dan Gohman	c1a4e212a3	Code simplification. llvm-svn: 51345	2008-05-20 20:56:33 +00:00
Evan Cheng	9ac3631fa3	If the result of a BIT_CONVERT is a v1* vector, it doesn't mean its source is a v1* vector. llvm-svn: 51192	2008-05-16 17:19:05 +00:00
Duncan Sands	70424d195a	Silence the compiler warning differently. The original method caused gcc-4.2 to complain. llvm-svn: 51186	2008-05-16 09:19:16 +00:00
Nate Begeman	f79f52282c	Actually scalarize the operand to BIT_CONVERT instead of asking someone to do something with a v1 type. llvm-svn: 51160	2008-05-15 20:40:58 +00:00
Dan Gohman	12fce7751b	IR support for extractvalue and insertvalue instructions. Also, begin moving toward making structs and arrays first-class types. llvm-svn: 51157	2008-05-15 19:50:34 +00:00
Evan Cheng	ef377adca0	Make use of vector load and store operations to implement memcpy, memmove, and memset. Currently only X86 target is taking advantage of these. llvm-svn: 51140	2008-05-15 08:39:06 +00:00
Evan Cheng	4ea9d49590	Use a better idiom to silence compiler warnings. llvm-svn: 51131	2008-05-14 21:08:07 +00:00
Evan Cheng	0f7fb95e79	Really silence compiler warnings. llvm-svn: 51126	2008-05-14 20:29:30 +00:00
Evan Cheng	a5b0a8d7fe	Really silence compiler warnings. llvm-svn: 51123	2008-05-14 20:26:35 +00:00
Evan Cheng	763ec13862	Silence some compiler warnings. llvm-svn: 51115	2008-05-14 20:07:51 +00:00
Dan Gohman	3ab94df276	When bit-twiddling CondCode values for integer comparisons produces SETOEQ, is it does with (SETEQ & SETULE), map it to SETEQ. llvm-svn: 51112	2008-05-14 18:17:09 +00:00
Dan Gohman	fd3e3003f3	Whitespace cleanups. llvm-svn: 51089	2008-05-14 00:43:10 +00:00
Evan Cheng	1120279ae6	Instead of a vector load, shuffle and then extract an element. Load the element from address with an offset. pshufd $1, (%rdi), %xmm0 movd %xmm0, %eax => movl 4(%rdi), %eax llvm-svn: 51026	2008-05-13 08:35:03 +00:00
Dan Gohman	d78c400b5b	Clean up the use of static and anonymous namespaces. This turned up several things that were neither in an anonymous namespace nor static but not intended to be global. llvm-svn: 51017	2008-05-13 00:00:25 +00:00
Nate Begeman	b87e63a730	Teach Legalize how to scalarize VSETCC Teach X86 a few more vsetcc patterns. Custom lowering for unsupported ones is next. llvm-svn: 51009	2008-05-12 23:09:43 +00:00
Evan Cheng	b980f6fb3d	Xform bitconvert(build_pair(load a, load b)) to a single load if the load locations are at the right offset from each other. llvm-svn: 51008	2008-05-12 23:04:07 +00:00
Evan Cheng	2609d5e779	Refactor isConsecutiveLoad from X86 to TargetLowering so DAG combiner can make use of it. llvm-svn: 50991	2008-05-12 19:56:52 +00:00
Nate Begeman	cfcb56091b	Add support for vicmp/vfcmp codegen, more legalize support coming. This is necessary to unbreak the build. llvm-svn: 50988	2008-05-12 19:40:03 +00:00
Dan Gohman	ecb77385ab	Fix a missing break in the ISD::FLT_ROUNDS_ handling. Patch by giuma! llvm-svn: 50967	2008-05-12 16:07:15 +00:00
Anton Korobeynikov	fc2edad4ae	Turn StripPointerCast() into a method llvm-svn: 50836	2008-05-07 22:54:15 +00:00
Dan Gohman	5a3eecdfd8	Fix a bug in the ComputeMaskedBits logic for multiply. llvm-svn: 50793	2008-05-07 00:35:55 +00:00
Anton Korobeynikov	82c02b28f3	Make StripPointerCast a common function (should we mak it method of Value instead?) llvm-svn: 50775	2008-05-06 22:52:30 +00:00
Dan Gohman	6a2da37c0e	Make several variable declarations static. llvm-svn: 50696	2008-05-06 01:53:16 +00:00
Dan Gohman	38dc08f36f	Instead of enumerating each opcode that isn't handled that ComputeMaskedBits handles, just use a 'default:'. This avoids TargetLowering's list getting out of date with SelectionDAG's. llvm-svn: 50693	2008-05-06 00:53:29 +00:00
Dan Gohman	cf0e3acf16	Correct the value of LowBits in srem and urem handling in ComputeMaskedBits. llvm-svn: 50692	2008-05-06 00:51:48 +00:00
Mon P Wang	3e58393c3d	Added addition atomic instrinsics and, or, xor, min, and max. llvm-svn: 50663	2008-05-05 19:05:59 +00:00
Dan Gohman	1962c2be6a	Fix a mistake in the computation of leading zeros for udiv. llvm-svn: 50591	2008-05-02 21:30:02 +00:00
Dan Gohman	2f83b47863	Fix a typo in a comment. llvm-svn: 50562	2008-05-02 00:05:03 +00:00
Dan Gohman	ea6357828b	Use push_back(...) instead of resize(1, ...), per review feedback. llvm-svn: 50561	2008-05-02 00:03:54 +00:00
Dan Gohman	752ce50b2d	Fix uninitialized uses of the FPC variable. llvm-svn: 50558	2008-05-01 23:40:44 +00:00
Chris Lattner	d4b2a67cf3	don't randomly miscompile seto/setuo just because we are in ffastmath mode. This fixes rdar://5902801, a miscompilation of gcc.dg/builtins-8.c. Bill, please pull this into Tak. llvm-svn: 50523	2008-05-01 07:26:11 +00:00
Arnold Schwaighofer	be0de34ede	Tail call optimization improvements: Move platform independent code (lowering of possibly overwritten arguments, check for tail call optimization eligibility) from target X86ISelectionLowering.cpp to TargetLowering.h and SelectionDAGISel.cpp. Initial PowerPC tail call implementation: Support ppc32 implemented and tested (passes my tests and test-suite llvm-test). Support ppc64 implemented and half tested (passes my tests). On ppc tail call optimization is performed if caller and callee are fastcc call is a tail call (in tail call position, call followed by ret) no variable argument lists or byval arguments option -tailcallopt is enabled Supported: * non pic tail calls on linux/darwin * module-local tail calls on linux(PIC/GOT)/darwin(PIC) * inter-module tail calls on darwin(PIC) If constraints are not met a normal call will be emitted. A test checking the argument lowering behaviour on x86-64 was added. llvm-svn: 50477	2008-04-30 09:16:33 +00:00
Scott Michel	be940424b3	Fix custom target lowering for zero/any/sign_extend: make sure that DAG.UpdateNodeOperands() is called before (not after) the call to TLI.LowerOperation(). llvm-svn: 50461	2008-04-30 00:26:38 +00:00
Roman Levenstein	6b37114590	Use std::set instead of std::priority_queue for the RegReductionPriorityQueue. This removes the existing bottleneck related to the removal of elements from the middle of the queue. Also fixes a subtle bug in ScheduleDAGRRList::CapturePred: It was updating the state of the SUnit before removing it. As a result, the comparison operators were working incorrectly and this SUnit could not be removed from the queue properly. Reviewed by Evan and Dan. Approved by Dan. llvm-svn: 50412	2008-04-29 09:07:59 +00:00
Chris Lattner	5c88f7b1ad	make the vector conversion magic handle multiple results. We now compile test2/test3 to: _test2: ## InlineAsm Start set %xmm0, %xmm1 ## InlineAsm End addps %xmm1, %xmm0 ret _test3: ## InlineAsm Start set %xmm0, %xmm1 ## InlineAsm End paddd %xmm1, %xmm0 ret as expected. llvm-svn: 50389	2008-04-29 04:48:56 +00:00
Chris Lattner	f9a49c4322	add support for multiple return values in inline asm. This is a step towards PR2094. It now compiles the attached .ll file to: _sad16_sse2: movslq %ecx, %rax ## InlineAsm Start %ecx %rdx %rax %rax %r8d %rdx %rsi ## InlineAsm End ## InlineAsm Start set %eax ## InlineAsm End ret which is pretty decent for a 3 output, 4 input asm. llvm-svn: 50386	2008-04-29 04:29:54 +00:00
Evan Cheng	b96782ecbd	Fix a bug in RegsForValue::getCopyToRegs() that causes cyclical scheduling units. If it's creating multiple CopyToReg nodes that are "flagged" together, it should not create a TokenFactor for it's chain outputs: c1, f1 = CopyToReg c2, f2 = CopyToReg c3 = TokenFactor c1, c2 ... = user c3, ..., f2 Now that the two CopyToReg's and the user are "flagged" together. They effectively forms a single scheduling unit. The TokenFactor is now both an operand and a successor of the Flagged nodes. llvm-svn: 50376	2008-04-28 22:07:13 +00:00
Dan Gohman	c968c1f592	Evan pointed out that folding sext to zext may not be correct if the zext is not legal. llvm-svn: 50368	2008-04-28 18:47:17 +00:00
Dan Gohman	77ce6da378	Delete an unused constructor. llvm-svn: 50367	2008-04-28 18:28:49 +00:00
Dan Gohman	d961d30b7f	Add a comment to CreateRegForValue that clarifies the handling of aggregate types. llvm-svn: 50366	2008-04-28 18:19:43 +00:00
Dan Gohman	80c692d439	Rewrite the comments for RegsForValue and its members, and reorder some of the members for clarity. llvm-svn: 50365	2008-04-28 18:10:39 +00:00
Dan Gohman	14a05df97b	Don't call size() on each iteration of the loop. llvm-svn: 50361	2008-04-28 17:42:03 +00:00
Dan Gohman	da44054867	Fix the SVOffset values for loads and stores produced by memcpy/memset expansion. It was a bug for the SVOffset value to be used in the actual address calculations. llvm-svn: 50359	2008-04-28 17:15:20 +00:00
Dan Gohman	72ec3f4562	Teach InstCombine's ComputeMaskedBits what SelectionDAG's ComputeMaskedBits knows about cttz, ctlz, and ctpop. Teach SelectionDAG's ComputeMaskedBits what InstCombine's knows about SRem. And teach them both some things about high bits in Mul, UDiv, URem, and Sub. This allows instcombine and dagcombine to eliminate sign-extension operations in several new cases. llvm-svn: 50358	2008-04-28 17:02:21 +00:00
Dan Gohman	3eb10f758e	Teach DAGCombine to convert (sext x) to (zext x) when the sign-bit of x is known to be zero. llvm-svn: 50357	2008-04-28 16:58:24 +00:00
Chris Lattner	c9e280c78a	Another collection of random cleanups. No functionality change. llvm-svn: 50341	2008-04-28 07:16:35 +00:00
Chris Lattner	52504e78fb	Remove the SmallVector ctor that converts from a SmallVectorImpl. This conversion open the door for many nasty implicit conversion issues, and can be easily solved by initializing with (V.begin(), V.end()) when needed. This patch includes many small cleanups for sdisel also. llvm-svn: 50340	2008-04-28 06:44:42 +00:00
Chris Lattner	8c7f5ad968	switch RegsForValue::Regs to be a SmallVector to avoid heap thrash on tiny (usually single-element) vectors. llvm-svn: 50335	2008-04-28 06:02:19 +00:00
Chris Lattner	d04b818a91	move static function out of anon namespace, no functionality change. llvm-svn: 50330	2008-04-27 23:48:12 +00:00
Chris Lattner	122721843b	Another step to getting multiple result inline asm to work. llvm-svn: 50329	2008-04-27 23:44:28 +00:00
Chris Lattner	58b9ece38d	typo llvm-svn: 50316	2008-04-27 01:49:46 +00:00
Chris Lattner	2237973438	Implement a signficant optimization for inline asm: When choosing between constraints with multiple options, like "ir", test to see if we can use the 'i' constraint and go with that if possible. This produces more optimal ASM in all cases (sparing a register and an instruction to load it), and fixes inline asm like this: void test () { asm volatile (" %c0 %1 " : : "imr" (42), "imr"(14)); } Previously we would dump "42" into a memory location (which is ok for the 'm' constraint) which would cause a problem because the 'c' modifier is not valid on memory operands. Isn't it great how inline asm turns 'missed optimization' into 'compile failed'?? Incidentally, this was the todo in PowerPC/2007-04-24-InlineAsm-I-Modifier.ll Please do NOT pull this into Tak. llvm-svn: 50315	2008-04-27 00:37:18 +00:00
Chris Lattner	a937baeb9b	isa+cast -> dyn_cast llvm-svn: 50314	2008-04-27 00:16:18 +00:00
Chris Lattner	4793515a9c	Move a bunch of inline asm code out of line. llvm-svn: 50313	2008-04-27 00:09:47 +00:00
Chris Lattner	724539c001	A few inline asm cleanups: - Make targetlowering.h fit in 80 cols. - Make LowerAsmOperandForConstraint const. - Make lowerXConstraint -> LowerXConstraint - Make LowerXConstraint return a const char* instead of taking a string byref. llvm-svn: 50312	2008-04-26 23:02:14 +00:00
Dan Gohman	ca95a5f49f	Remove the code from CodeGenPrepare that moved getresult instructions to the block that defines their operands. This doesn't work in the case that the operand is an invoke, because invoke is a terminator and must be the last instruction in a block. Replace it with support in SelectionDAGISel for copying struct values into sequences of virtual registers. llvm-svn: 50279	2008-04-25 18:27:55 +00:00
Nate Begeman	6f94f61317	Pull the code to perform an INSERT_VECTOR_ELT in memory out into its own function, and then use it to fix a bug in SplitVectorOp that expected inserts to always have constant insertion indices. llvm-svn: 50273	2008-04-25 18:07:40 +00:00
Dan Gohman	e9e3891c09	Use isa instead of dyn_cast. llvm-svn: 50181	2008-04-23 20:25:16 +00:00
Dan Gohman	b418aafabf	Add support to codegen for getresult instructions with undef operands. llvm-svn: 50180	2008-04-23 20:21:29 +00:00
Dan Gohman	dc90919d2b	Fix an out-of-bounds access in -view-sunit-dags in the case of an empty ScheduleDAG. llvm-svn: 50054	2008-04-21 20:07:30 +00:00
Dale Johannesen	aac27592f0	Check we aren't trying to convert PPC long double. This fixes the testsuite failure on ppcf128-4.ll. llvm-svn: 49994	2008-04-20 18:23:46 +00:00
Chris Lattner	3b18762f40	Switch to using Simplified ConstantFP::get API. llvm-svn: 49977	2008-04-20 00:41:09 +00:00
Duncan Sands	1ec193e90b	Implement a bit more softfloat support in LegalizeTypes. Correct the load logic so that it actually works, and also teach it to handle floating point extending loads. llvm-svn: 49923	2008-04-18 20:56:03 +00:00
Duncan Sands	a8a61562af	Add some more FIXME's for indexed loads and stores. llvm-svn: 49916	2008-04-18 20:27:12 +00:00
Duncan Sands	b4e0b24e0a	Provide an explicit list of operands to MakeLibcall, rather than having it suck them out of a node. Add a bunch of new libcalls, and remove dead softfloat code (dead, because FloatToInt is used not Expand in this case). Note that indexed stores probably aren't handled properly, likewise for loads. llvm-svn: 49915	2008-04-18 20:25:14 +00:00
Dan Gohman	75c895dbc4	Remove the implicit conversion from SDOperandPtr to SDOperand*; this may fix a build error on Visual Studio. llvm-svn: 49876	2008-04-17 23:02:12 +00:00
Dan Gohman	9752a8f3b4	Correct the SrcValue information in the Expand code for va_copy. llvm-svn: 49839	2008-04-17 02:09:26 +00:00
Roman Levenstein	a3ee1a38a3	Ongoing work on improving the instruction selection infrastructure: Rename SDOperandImpl back to SDOperand. Introduce the SDUse class that represents a use of the SDNode referred by an SDOperand. Now it is more similar to Use/Value classes. Patch is approved by Dan Gohman. llvm-svn: 49795	2008-04-16 16:15:27 +00:00
Dan Gohman	82b6673c44	Fix the new scheduler assertion checks to work when the scheduler has inserted no-ops. This fixes the 2006-07-03-schedulers.ll regression on ppc32. llvm-svn: 49747	2008-04-15 22:40:14 +00:00
Nicolas Geoffray	7000c8f1aa	Change Divided flag to Split, as suggested by Evan llvm-svn: 49715	2008-04-15 08:08:50 +00:00
Dan Gohman	4370f26750	Treat EntryToken nodes as "passive" so that they aren't added to the ScheduleDAG; they don't correspond to any actual instructions so they don't need to be scheduled. This fixes a bug where the EntryToken was being scheduled multiple times in some cases, though it ended up not causing any trouble because EntryToken doesn't expand into anything. With this fixed the schedulers reliably schedule the expected number of units, so we can check this with an assertion. This requires a tweak to test/CodeGen/X86/loop-hoist.ll because it ends up getting scheduled differently in a trivial way, though it was enough to fool the prcontext+grep that the test does. llvm-svn: 49701	2008-04-15 01:22:18 +00:00
Dan Gohman	e5f21cea3e	In -view-sunit-dags, display "special" chain dependencies as cyan instead of blue to distinguish them from regular dependencies. llvm-svn: 49696	2008-04-14 23:15:07 +00:00

1 2 3 4 5 ...

2553 Commits