llvm-project

Commit Graph

Author	SHA1	Message	Date
Chris Lattner	dd8eeed096	Teach emitAlignment to handle explicit alignment requests by globals. llvm-svn: 24354	2005-11-14 19:00:06 +00:00
Jeff Cohen	cf1f782a2f	Fix operator precedence bug caught by VC++. llvm-svn: 24318	2005-11-12 00:59:01 +00:00
Andrew Lenharth	de1b5d6baa	added a chain output llvm-svn: 24306	2005-11-11 22:48:54 +00:00
Andrew Lenharth	01aa56397d	continued readcyclecounter support llvm-svn: 24300	2005-11-11 16:47:30 +00:00
Chris Lattner	4f827446da	nuke blank line llvm-svn: 24278	2005-11-10 18:49:46 +00:00
Chris Lattner	c0a1eba0ab	Get rid of casts by #including the right header llvm-svn: 24275	2005-11-10 18:36:17 +00:00
Chris Lattner	747960d21e	Compile C strings to: l1__2E_str_1: ; '.str_1' .asciz "foo" not: .align 0 l1__2E_str_1: ; '.str_1' .asciz "foo" llvm-svn: 24273	2005-11-10 18:09:27 +00:00
Chris Lattner	55a6d9067b	add support for .asciz, and enable it by default. If your target assemblerdoesn't support .asciz, just set AscizDirective to null in your asmprinter. This compiles C strings to: l1__2E_str_1: ; '.str_1' .asciz "foo" instead of: l1__2E_str_1: ; '.str_1' .ascii "foo\000" llvm-svn: 24272	2005-11-10 18:06:33 +00:00
Chris Lattner	bf4f233214	Switch the allnodes list from a vector of pointers to an ilist of nodes.This eliminates the vector, allows constant time removal of a node froma graph, and makes iteration over the all nodes list stable when adding nodes to the graph. llvm-svn: 24263	2005-11-09 23:47:37 +00:00
Chris Lattner	cd6f0f47f2	Refactor intrinsic lowering stuff out of visitCall llvm-svn: 24261	2005-11-09 19:44:01 +00:00
Chris Lattner	af3aefa10e	Handle the trivial (but common) two-op case more efficiently llvm-svn: 24259	2005-11-09 18:48:57 +00:00
Chris Lattner	619dfaa42b	Nuke noop copies. llvm-svn: 24258	2005-11-09 18:22:42 +00:00
Chris Lattner	41fd6d5d27	Fix CodeGen/X86/shift-folding.ll:test3 on X86 llvm-svn: 24256	2005-11-09 16:50:40 +00:00
Chris Lattner	35ecaa76fa	Disable some overly-aggressive checking code. This speeds up the local allocator from 23s to 11s on kc++ in debug mode. llvm-svn: 24255	2005-11-09 05:28:45 +00:00
Chris Lattner	b7cad90e55	Avoid creating a token factor node in trivially redundant cases. This eliminates almost one node per block in common cases. llvm-svn: 24254	2005-11-09 05:03:03 +00:00
Chris Lattner	43535a19b1	Handle GEP's a bit more intelligently. Fold constant indices early and turn power-of-two multiplies into shifts early to improve compile time. llvm-svn: 24253	2005-11-09 04:45:33 +00:00
Chris Lattner	c4d6050db6	Allocate the right amount of memory for this vector up front. llvm-svn: 24252	2005-11-08 23:32:44 +00:00
Chris Lattner	88fa11c3d5	Change the ValueList array for each node to be shared instead of individuallyallocated. Further, in the common case where a node has a single value, justreference an element from a small array. This is a small compile-time win. llvm-svn: 24251	2005-11-08 23:30:28 +00:00
Chris Lattner	7e4b5d33cb	Switch the operandlist/valuelist from being vectors to being just an array.This saves 12 bytes from SDNode, but doesn't speed things up substantially (our graphs apparently already fit within the cache on my g5). In any case this reduces memory usage. llvm-svn: 24249	2005-11-08 22:07:03 +00:00
Chris Lattner	3ba38cba64	Explicitly initialize some instance vars llvm-svn: 24247	2005-11-08 21:54:57 +00:00
Chris Lattner	aba48dd34c	Clean up RemoveDeadNodes significantly, by eliminating the need for a temporary set and eliminating the need to iterate whenever something is removed (which can be really slow in some cases). Thx to Jim for pointing out something silly I was getting stuck on. :) llvm-svn: 24241	2005-11-08 18:52:27 +00:00
Jim Laskey	1d2f26adcc	Let's try ignoring resource utilization on the backward pass. llvm-svn: 24231	2005-11-07 19:08:53 +00:00
Chris Lattner	629ba44e50	Always compute max align. llvm-svn: 24227	2005-11-06 17:43:20 +00:00
Nate Begeman	3ee3e69556	Add the necessary support to the ISel to allow targets to codegen the new alignment information appropriately. Includes code for PowerPC to support fixed-size allocas with alignment larger than the stack. Support for arbitrarily aligned dynamic allocas coming soon. llvm-svn: 24224	2005-11-06 09:00:38 +00:00
Jim Laskey	904dbb4a27	Fix logic bug in finding retry slot in tally. llvm-svn: 24188	2005-11-05 00:01:25 +00:00
Jim Laskey	ded4759d81	Fix a warning llvm-svn: 24187	2005-11-04 18:26:02 +00:00
Jim Laskey	e682b677c1	Scheduling now uses itinerary data. llvm-svn: 24180	2005-11-04 04:05:35 +00:00
Nate Begeman	ee065281e8	Fix a crash that Andrew noticed, and add a pair of braces to unfconfuse XCode's indenting. llvm-svn: 24159	2005-11-02 18:42:59 +00:00
Chris Lattner	17df608719	Fix a source of undefined behavior when dealing with 64-bit types. This may fix PR652. Thanks to Andrew for tracking down the problem. llvm-svn: 24145	2005-11-02 01:47:04 +00:00
Jim Laskey	5ce0538253	1. Embed and not inherit vector for NodeGroup. 2. Iterate operands and not uses (performance.) 3. Some long pending comment changes. llvm-svn: 24119	2005-10-31 12:49:09 +00:00
Chris Lattner	6871b23d02	Significantly simplify this code and make it more aggressive. Instead of having a special case hack for X86, make the hack more general: if an incoming argument register is not used in any block other than the entry block, don't copy it to a vreg. This helps us compile code like this: %struct.foo = type { int, int, [0 x ubyte] } int %test(%struct.foo* %X) { %tmp1 = getelementptr %struct.foo* %X, int 0, uint 2, int 100 %tmp = load ubyte* %tmp1 ; <ubyte> [#uses=1] %tmp2 = cast ubyte %tmp to int ; <int> [#uses=1] ret int %tmp2 } to: _test: lbz r3, 108(r3) blr instead of: _test: lbz r2, 108(r3) or r3, r2, r2 blr The (dead) copy emitted to copy r3 into a vreg for extra-block uses was increasing the live range of r3 past the load, preventing the coallescing. This implements CodeGen/PowerPC/reg-coallesce-simple.ll llvm-svn: 24115	2005-10-30 19:42:35 +00:00
Chris Lattner	dd5663dfa0	Reduce the number of copies emitted as machine instructions by generating results in vregs that will need them. In the case of something like this: CopyToReg((add X, Y), reg1024), we no longer emit code like this: reg1025 = add X, Y reg1024 = reg 1025 Instead, we emit: reg1024 = add X, Y Whoa! :) llvm-svn: 24111	2005-10-30 18:54:27 +00:00
Chris Lattner	a70878d4fb	Codegen mul by negative power of two with a shift and negate. This implements test/Regression/CodeGen/PowerPC/mul-neg-power-2.ll, producing: _foo: slwi r2, r3, 1 subfic r3, r2, 63 blr instead of: _foo: mulli r2, r3, -2 addi r3, r2, 63 blr llvm-svn: 24106	2005-10-30 06:41:49 +00:00
Chris Lattner	4b6d583d7a	Fix DSE to not nuke dead stores unless they redundant store is the same VT as the killing one. Fix fixes PR491 llvm-svn: 24034	2005-10-27 07:10:34 +00:00
Chris Lattner	d8c5c066a1	Add a simple xform that is useful for bitfield operations. llvm-svn: 24029	2005-10-27 05:06:38 +00:00
Chris Lattner	3c7974aade	Fix some spello's pointed out by Gabor Greif llvm-svn: 24019	2005-10-26 18:41:41 +00:00
Nate Begeman	d8f2a1a0f3	Allow custom lowered FP_TO_SINT ops in the check for whether a larger FP_TO_SINT is preferred to a larger FP_TO_UINT. This seems to be begging for a TLI.isOperationCustom() helper function. llvm-svn: 23992	2005-10-25 23:47:25 +00:00
Chris Lattner	3b409a85eb	Clear a bit in this file that was causing a miscompilation of 178.galgel. llvm-svn: 23980	2005-10-25 18:57:30 +00:00
Chris Lattner	476b8ddd55	Alkis agrees that that iterative scan allocator isn't going to be worked on in the future, remove it. llvm-svn: 23952	2005-10-24 04:14:30 +00:00
Jeff Cohen	11e26b52b2	When a function takes a variable number of pointer arguments, with a zero pointer marking the end of the list, the zero must be cast to the pointer type. An un-cast zero is a 32-bit int, and at least on x86_64, gcc will not extend the zero to 64 bits, thus allowing the upper 32 bits to be random junk. The new END_WITH_NULL macro may be used to annotate a such a function so that GCC (version 4 or newer) will detect the use of un-casted zero at compile time. llvm-svn: 23888	2005-10-23 04:37:20 +00:00
Andrew Lenharth	4b3932aa89	add TargetExternalSymbol llvm-svn: 23886	2005-10-23 03:40:17 +00:00
Chris Lattner	9faa5b7a9a	BuildSDIV and BuildUDIV only work for i32/i64, but they don't check that the input is that type, this caused a failure on gs on X86 last night. Move the hard checks into Build[US]Div since that is where decisions like this should be made. llvm-svn: 23881	2005-10-22 18:50:15 +00:00
Chris Lattner	75ea5b10bf	add a case missing from the dag combiner that exposed the failure on 2005-10-21-longlonggtu.ll. llvm-svn: 23875	2005-10-21 21:23:25 +00:00
Chris Lattner	e95b5745c0	Make the coallescer a bit smarter, allowing it to join more live ranges. For example, we can now join things like [0-30:0)[31-40:1)[52-59:2) with [40:60:0) if the 52-59 range is defined by a copy from the 40-60 range. The resultant range ends up being [0-30:0)[31-60:1). This fires a lot through-out the test suite (e.g. shrinking bc from 19492 -> 18509 machineinstrs) though most gains are smaller (e.g. about 50 copies eliminated from crafty). llvm-svn: 23866	2005-10-21 06:49:50 +00:00
Chris Lattner	76c97afbbc	Fix LiveInterval::getOverlapingRanges to take things in the right order (an unused method). Fix the merger so that it can merge ranges like this [10:12)[16:40) with [12:38) into [10:40) instead of bogus ranges. This sort of input will be possible for the merger coming shortly llvm-svn: 23865	2005-10-21 06:41:30 +00:00
Nate Begeman	8f62cd32ad	Fix a typo in the dag combiner, so that this can work on i64 targets llvm-svn: 23856	2005-10-21 01:51:45 +00:00
Nate Begeman	4dd383120f	Invert the TargetLowering flag that controls divide by consant expansion. Add a new flag to TargetLowering indicating if the target has really cheap signed division by powers of two, make ppc use it. This will probably go away in the future. Implement some more ISD::SDIV folds in the dag combiner Remove now dead code in the x86 backend. llvm-svn: 23853	2005-10-21 00:02:42 +00:00
Chris Lattner	b7b75e1b68	Fix a conditional so we don't access past the end of the range. Thanks to Andrew for bringing this to my attn. llvm-svn: 23850	2005-10-20 22:50:10 +00:00
Nate Begeman	7efe53d90b	Fix a couple bugs in the const div stuff where we'd generate MULHS/MULHU for types that aren't legal, and fail a divisor is less than zero comparison, which would cause us to drop a subtract. llvm-svn: 23846	2005-10-20 17:45:03 +00:00
Chris Lattner	a6efeb01f9	don't use llabs with apparently VC++ doesn't have llvm-svn: 23845	2005-10-20 17:01:00 +00:00
Chris Lattner	35852fc391	Fix order of eval problem from when I refactored this into a function. llvm-svn: 23844	2005-10-20 16:56:40 +00:00
Chris Lattner	3cf40798ab	add a new method, play around with some code. Fix a bug in the extendIntervalEndTo method. In particular, if adding [2:10) to an interval containing [0:2),[10:30), we produced [0:10),[10,30). Which is not the most smart thing to do. Now produce [0:30). llvm-svn: 23841	2005-10-20 07:39:25 +00:00
Chris Lattner	8816353040	Refactor some code, pulling it out into a function. No functionality change. llvm-svn: 23839	2005-10-20 06:06:30 +00:00
Nate Begeman	c6f067a8c4	Move the target constant divide optimization up into the dag combiner, so that the nodes can be folded with other nodes, and we can not duplicate code in every backend. Alpha will probably want this too. llvm-svn: 23835	2005-10-20 02:15:44 +00:00
Nate Begeman	5172ce641e	Teach Legalize how to do something with EXTRACT_ELEMENT when the type of the pair of elements is a legal type. llvm-svn: 23804	2005-10-19 00:06:56 +00:00
Nate Begeman	78afac2ddd	Add the ability to lower return instructions to TargetLowering. This allows us to lower legal return types to something else, to meet ABI requirements (such as that i64 be returned in two i32 regs on Darwin/ppc). llvm-svn: 23802	2005-10-18 23:23:37 +00:00
Chris Lattner	0a71a9ac86	Fix Generic/2005-10-18-ZeroSizeStackObject.ll by not requesting a zero sized stack object if either the array size or the type size is zero. llvm-svn: 23801	2005-10-18 22:14:06 +00:00
Chris Lattner	8396a308a7	remove hack llvm-svn: 23797	2005-10-18 22:11:42 +00:00
Chris Lattner	6c14c35bd7	Fold (select C, load A, load B) -> load (select C, A, B). This happens quite a lot throughout many programs. In particular, specfp triggers it a bunch for constant FP nodes when you have code like cond ? 1.0 : -1.0. If the PPC ISel exposed the loads implicit in pic references to external globals, we would be able to eliminate a load in cases like this as well: %X = external global int %Y = external global int int* %test4(bool %C) { %G = select bool %C, int* %X, int* %Y ret int* %G } Note that this breaks things that use SrcValue's (see the fixme), but since nothing uses them yet, this is ok. Also, simplify some code to use hasOneUse() on an SDOperand instead of hasNUsesOfValue directly. llvm-svn: 23781	2005-10-18 06:04:22 +00:00
Nate Begeman	418c6e4045	Implement some feedback from Chris re: constant canonicalization llvm-svn: 23777	2005-10-18 00:28:13 +00:00
Nate Begeman	bd5f41a6a6	Legalize BUILD_PAIR appropriately for upcoming 64 bit PowerPC work. llvm-svn: 23776	2005-10-18 00:27:41 +00:00
Nate Begeman	ec48a1bfbd	fold fmul X, +2.0 -> fadd X, X; llvm-svn: 23774	2005-10-17 20:40:11 +00:00
Chris Lattner	eeb2bda2fa	add a trivial fold llvm-svn: 23764	2005-10-17 01:07:11 +00:00
Chris Lattner	e540800d5a	Fix this logic. llvm-svn: 23756	2005-10-15 22:35:40 +00:00
Chris Lattner	17cc9edd33	Add a case we were missing that was causing us to fail CodeGen/PowerPC/rlwinm.ll:test3 llvm-svn: 23755	2005-10-15 22:18:08 +00:00
Chris Lattner	b986f471be	Use getExtLoad here instead of getNode, as extloads produce two values. This fixes a legalize failure on SPASS for itanium. llvm-svn: 23747	2005-10-15 20:24:07 +00:00
Nate Begeman	6e673b24d3	fold sext_in_reg, sext_in_reg where both have the same VT. This was popping up in Fourinarow. llvm-svn: 23722	2005-10-14 01:29:07 +00:00
Nate Begeman	d59e5a7abb	Relax the checking on zextload generation a bit, since as sabre pointed out you could be AND'ing with the result of a shift that shifts out all the bits you care about, in addition to a constant. Also, move over an add/sub_parts fold from legalize to the dag combiner, where it works for things other than constants. Woot! llvm-svn: 23720	2005-10-14 01:12:21 +00:00
Chris Lattner	b8282987f4	Fix the trunc(load) case, finally allowing crafty and povray to pass llvm-svn: 23718	2005-10-13 22:10:05 +00:00
Chris Lattner	dbc5ae3109	Fix some bugs in (sext (load x)) llvm-svn: 23717	2005-10-13 21:52:31 +00:00
Chris Lattner	258521d7ea	When ExpandOp'ing a [SZ]EXTLOAD, make sure to remember that the chain is also legal. Add support for ExpandOp'ing raw EXTLOADs too. llvm-svn: 23716	2005-10-13 21:44:47 +00:00
Chris Lattner	d23f4b7411	Implement PromoteOp for *EXTLOAD, allowing MallocBench/gs to Legalize llvm-svn: 23715	2005-10-13 20:07:41 +00:00
Nate Begeman	8e022b3d89	Fix the remaining DAGCombiner issues pointed out by sabre. This should fix the remainder of the failures introduced by my patch last night. llvm-svn: 23714	2005-10-13 18:34:58 +00:00
Chris Lattner	a80f1f6e72	Fix a minor bug in the dag combiner that broke pcompress2 and some other tests. llvm-svn: 23713	2005-10-13 18:16:34 +00:00
Nate Begeman	c3a89c5259	Add support to Legalize for expanding i64 sextload/zextload into hi and lo parts. This should fix the crafty and signed long long unit test failure on x86 last night. llvm-svn: 23711	2005-10-13 17:15:37 +00:00
Jim Laskey	5d7a50ac44	Inhibit instructions from being pushed before function calls. This will minimize unnecessary spilling. llvm-svn: 23710	2005-10-13 16:44:00 +00:00
Nate Begeman	02b23c6065	Move some Legalize functionality over to the DAGCombiner where it belongs. Kill some dead code. llvm-svn: 23706	2005-10-13 03:11:28 +00:00
Nate Begeman	70d28c5e32	Fix a potential bug with two combine-to's back to back that chris pointed out, where after the first CombineTo() call, the node the second CombineTo wishes to replace may no longer exist. Fix a very real bug with the truncated load optimization on little endian targets, which do not need a byte offset added to the load. llvm-svn: 23704	2005-10-12 23:18:53 +00:00
Nate Begeman	8caf81d617	More cool stuff for the dag combiner. We can now finally handle things like turning: _foo: fctiwz f0, f1 stfd f0, -8(r1) lwz r2, -4(r1) rlwinm r3, r2, 0, 16, 31 blr into _foo: fctiwz f0,f1 stfd f0,-8(r1) lhz r3,-2(r1) blr Also removed an unncessary constraint from sra -> srl conversion, which should take care of hte only reason we would ever need to handle sra in MaskedValueIsZero, AFAIK. llvm-svn: 23703	2005-10-12 20:40:40 +00:00
Jim Laskey	63b1419b74	Finally committing to the new scheduler. Still -sched=none by default. llvm-svn: 23702	2005-10-12 18:29:35 +00:00
Jim Laskey	d00db257c7	Added graphviz/gv support for MF. llvm-svn: 23700	2005-10-12 12:09:05 +00:00
Chris Lattner	514f058be1	Fix a powerpc crash on CodeGen/Generic/llvm-ct-intrinsics.ll llvm-svn: 23694	2005-10-11 17:56:34 +00:00
Chris Lattner	c38fb8e2a1	Add a canonicalization that got lost, fixing PowerPC/fold-li.ll:SUB llvm-svn: 23693	2005-10-11 06:07:15 +00:00
Chris Lattner	cc6e53e6ee	clean up some corner cases llvm-svn: 23692	2005-10-10 23:00:08 +00:00
Chris Lattner	04c737091f	Implement trivial DSE. If two stores are neighbors and store to the same location, replace them with a new store of the last value. This occurs in the same neighborhood in 197.parser, speeding it up about 1.5% llvm-svn: 23691	2005-10-10 22:31:19 +00:00
Chris Lattner	e260ed8628	Add support for CombineTo, allowing the dag combiner to replace nodes with multiple results. Use this support to implement trivial store->load forwarding, implementing CodeGen/PowerPC/store-load-fwd.ll. Though this is the most simple case and can be extended in the future, it is still useful. For example, it speeds up 197.parser by 6.2% by avoiding an LSU reject in xalloc: stw r6, lo16(l5_end_of_array)(r2) addi r2, r5, -4 stwx r5, r4, r2 - lwzx r5, r4, r2 - rlwinm r5, r5, 0, 0, 30 stwx r5, r4, r2 lwz r2, -4(r4) ori r2, r2, 1 llvm-svn: 23690	2005-10-10 22:04:48 +00:00
Nate Begeman	6828ed9bfd	Teach the DAGCombiner several new tricks, teaching it how to turn sext_inreg into zext_inreg based on the signbit (fires a lot), srem into urem, etc. llvm-svn: 23688	2005-10-10 21:26:48 +00:00
Chris Lattner	7730924067	Fix comment llvm-svn: 23686	2005-10-10 16:52:03 +00:00
Chris Lattner	3d1d4a3d12	Add ISD::ADD to MaskedValueIsZero llvm-svn: 23685	2005-10-10 16:51:40 +00:00
Chris Lattner	56e44a6da5	This function is now dead llvm-svn: 23684	2005-10-10 16:49:22 +00:00
Chris Lattner	bcfebebf22	Enable Nate's excellent DAG combiner work by default. This allows the removal of a bunch of ad-hoc and crufty code from SelectionDAG.cpp. llvm-svn: 23682	2005-10-10 16:47:10 +00:00
Chris Lattner	6a49b7cabb	add a todo for something I noticed llvm-svn: 23679	2005-10-09 22:59:08 +00:00
Chris Lattner	1d3dc00674	(X & Y) & C == 0 if either X&C or Y&C are zero llvm-svn: 23678	2005-10-09 22:12:36 +00:00
Chris Lattner	0832f2635a	When emiting a CopyFromReg and the source is already a vreg, do not bother creating a new vreg and inserting a copy: just use the input vreg directly. This speeds up the compile (e.g. about 5% on mesa with a debug build of llc) by not adding a bunch of copies and vregs to be coallesced away. On mesa, for example, this reduces the number of intervals from 168601 to 129040 going into the coallescer. llvm-svn: 23671	2005-10-09 05:58:56 +00:00
Nate Begeman	2042aa5b92	Lo and behold, the last bits of SelectionDAG.cpp have been moved over. llvm-svn: 23665	2005-10-08 00:29:44 +00:00
Chris Lattner	be4bbca0ba	remove debugging code llvm-svn: 23663	2005-10-07 15:31:26 +00:00
Chris Lattner	fb12624a3f	implement CodeGen/PowerPC/div-2.ll:test2-4 by propagating zero bits through C-X's llvm-svn: 23662	2005-10-07 15:30:32 +00:00
Chris Lattner	b27a4147d3	fix indentation llvm-svn: 23660	2005-10-07 06:37:02 +00:00
Chris Lattner	5bcd0dd811	Turn sdivs into udivs when we can prove the sign bits are clear. This implements CodeGen/PowerPC/div-2.ll llvm-svn: 23659	2005-10-07 06:10:46 +00:00
Chris Lattner	7bf8d06f02	silence a bogus GCC warning llvm-svn: 23646	2005-10-06 17:39:10 +00:00
Chris Lattner	fabe55f155	Fix the LLC regressions on X86 last night. In particular, when undoing previous copy elisions and we discover we need to reload a register, make sure to use the regclass of the original register for the reload, not the class of the current register. This avoid using 16-bit loads to reload 32-bit values. llvm-svn: 23645	2005-10-06 17:19:06 +00:00
Chris Lattner	4bbbb9eed7	Make the legalizer completely non-recursive llvm-svn: 23642	2005-10-06 01:20:27 +00:00
Nate Begeman	558beb3729	Let the combiner handle more cases llvm-svn: 23641	2005-10-05 21:44:43 +00:00
Nate Begeman	f8221c5e2c	Remove some bad code from Legalize llvm-svn: 23640	2005-10-05 21:44:10 +00:00
Nate Begeman	bd7df030d2	Check in some more DAGCombiner pieces llvm-svn: 23639	2005-10-05 21:43:42 +00:00
Chris Lattner	55149d7835	Fix a bug in the local spiller, where we could take code like this: store r12 -> [ss#2] R3 = load [ss#1] use R3 R3 = load [ss#2] R4 = load [ss#1] and turn it into this code: store R12 -> [ss#2] R3 = load [ss#1] use R3 R3 = R12 R4 = R3 <- oops! The problem was that promoting R3 = load[ss#2] to a copy missed the fact that the instruction invalidated R3 at that point. llvm-svn: 23638	2005-10-05 18:30:19 +00:00
Chris Lattner	a49e16fefa	implement visitBR_CC so that PowerPC/inverted-bool-compares.ll passes with the dag combiner. This speeds up espresso by 8%, reaching performance parity with the dag-combiner-disabled llc. llvm-svn: 23636	2005-10-05 06:47:48 +00:00
Chris Lattner	b11d15637a	fix some pastos llvm-svn: 23635	2005-10-05 06:37:22 +00:00
Chris Lattner	06f1d0f73a	Add a new HandleNode class, which is used to handle (haha) cases in the dead node elim and dag combiner passes where the root is potentially updated. This fixes a fixme in the dag combiner. llvm-svn: 23634	2005-10-05 06:35:28 +00:00
Chris Lattner	a6895d180e	Implement the code for PowerPC/inverted-bool-compares.ll, even though it that testcase still does not pass with the dag combiner. This is because not all forms of br* are folded yet. Also, when we combine a node into another one, delete the node immediately instead of waiting for the node to potentially come up in the future. llvm-svn: 23632	2005-10-05 06:11:08 +00:00
Chris Lattner	6bd8fd09b6	make sure that -view-isel-dags is the input to the isel, not the input to the second phase of dag combining llvm-svn: 23631	2005-10-05 06:09:10 +00:00
Chris Lattner	746d50a01a	Fix a crash compiling Olden/tsp llvm-svn: 23630	2005-10-05 04:45:43 +00:00
Jim Laskey	327d4298e1	Reverting to version - until problem isolated. llvm-svn: 23622	2005-10-04 16:41:51 +00:00
Nate Begeman	5da6908d65	Fix some faulty logic in the libcall inserter. Since calls return more than one value, don't bail if one of their uses happens to be a node that's not an MVT::Other when following the chain from CALLSEQ_START to CALLSEQ_END. Once we've found a CALLSEQ_START, we can just return; there's no need to tail-recurse further up the graph. Most importantly, just because something only has one use doesn't mean we should use it's one use to follow from start to end. This faulty logic caused us to follow a chain of one-use FP operations back to a much earlier call, putting a cycle in the graph from a later start to an earlier end. This is a better fix that reverting to the workaround committed earlier today. llvm-svn: 23620	2005-10-04 02:10:55 +00:00
Nate Begeman	54fb5002e5	Add back a workaround that fixes some breakages from chris's last change. Neither of us have yet figured out why this code is necessary, but stuff breaks if its not there. Still tracking this down... llvm-svn: 23617	2005-10-04 00:37:37 +00:00
Jim Laskey	409a6b204e	Refactor gathering node info and emission. llvm-svn: 23610	2005-10-03 12:30:32 +00:00
Chris Lattner	57b21f9f10	clean up this code a bit, no functionality change llvm-svn: 23609	2005-10-03 07:22:07 +00:00
Chris Lattner	5f096e2847	Break the body of the loop out into a new method llvm-svn: 23606	2005-10-03 04:47:08 +00:00
Chris Lattner	9cfccfb517	Fix a problem where the legalizer would run out of stack space on extremely large basic blocks because it was purely recursive. This switches it to an iterative/recursive hybrid. llvm-svn: 23596	2005-10-02 17:49:46 +00:00
Chris Lattner	7f718e61e8	silence a bogus warning llvm-svn: 23595	2005-10-02 16:30:51 +00:00
Chris Lattner	704d97f8b2	Add assertions to the trivial scheduler to check that the value types match up between defs and uses. llvm-svn: 23590	2005-10-02 07:10:55 +00:00
Chris Lattner	a038d901fb	Codegen CopyFromReg using the regclass that matches the valuetype of the destination vreg. llvm-svn: 23586	2005-10-02 06:34:16 +00:00
Chris Lattner	5a7bfe0b72	Add some very paranoid checking for operand/result reg class matchup For instructions that define multiple results, use the right regclass to define the result, not always the rc of result #0 llvm-svn: 23580	2005-10-01 07:45:09 +00:00
Jeff Cohen	f8a5e5ae6e	Fix VC++ warnings. llvm-svn: 23579	2005-10-01 03:57:14 +00:00
Chris Lattner	fda6944c5b	add a method llvm-svn: 23575	2005-10-01 00:17:07 +00:00
Jim Laskey	d3850457a1	typo llvm-svn: 23574	2005-10-01 00:08:23 +00:00
Jim Laskey	9d96932879	1. Simplify the gathering of node groups. 2. Printing node groups when displaying nodes. llvm-svn: 23573	2005-10-01 00:03:07 +00:00
Jim Laskey	3fe3841c2a	1. Made things node-centric (from operand). 2. Added node groups to handle flagged nodes. 3. Started weaning simple scheduling off existing emitter. llvm-svn: 23566	2005-09-30 19:15:27 +00:00
Chris Lattner	2e794c9198	now that we have a reg class to spill with, get this info from the regclass llvm-svn: 23559	2005-09-30 17:19:22 +00:00
Chris Lattner	51878189c5	Now that we have getCalleeSaveRegClasses() info, use it to pass the register class into the spill/reload methods. Targets can now rely on that argument. llvm-svn: 23556	2005-09-30 16:59:07 +00:00
Chris Lattner	5a6199f387	Change this code ot pass register classes into the stack slot spiller/reloader code. PrologEpilogInserter hasn't been updated yet though, so targets cannot use this info. llvm-svn: 23536	2005-09-30 01:29:00 +00:00
Chris Lattner	5b2be1f890	Fix two bugs in my patch earlier today that broke int->fp conversion on X86. llvm-svn: 23522	2005-09-29 06:44:39 +00:00
Jeff Cohen	b01a41a06d	Silence VC++ redeclaration warnings. llvm-svn: 23516	2005-09-29 01:59:49 +00:00
Chris Lattner	6f3b577ee6	Add FP versions of the binary operators, keeping the int and fp worlds seperate. Though I have done extensive testing, it is possible that this will break things in configs I can't test. Please let me know if this causes a problem and I'll fix it ASAP. llvm-svn: 23504	2005-09-28 22:28:18 +00:00
Chris Lattner	0fd8f9fbc9	If the target prefers it, use _setjmp/_longjmp should be used instead of setjmp/longjmp for llvm.setjmp/llvm.longjmp. llvm-svn: 23481	2005-09-27 22:15:53 +00:00
Jim Laskey	63523f98d5	Remove some redundancies. llvm-svn: 23469	2005-09-27 17:32:45 +00:00
Jim Laskey	5f2443c8a3	Addition of a simple two pass scheduler. This version is currently hacked up for testing and will require target machine info to do a proper scheduling. The simple scheduler can be turned on using -sched=simple (defaults to -sched=none) llvm-svn: 23455	2005-09-26 21:57:04 +00:00
Chris Lattner	59a05bdde6	Turn (X^C1) == C2 into X == C1^C2 iff X&~C1 = 0 (and move a function) This happens all the time on PPC for bool values, e.g. eliminating a xori in inverted-bool-compares.ll. This should be added to the dag combiner as well. llvm-svn: 23403	2005-09-23 00:55:52 +00:00
Chris Lattner	b1f8982ff0	Expose the LiveInterval interfaces as public headers. llvm-svn: 23400	2005-09-21 04:19:09 +00:00
Nate Begeman	c760f80fed	Stub out the rest of the DAG Combiner. Just need to fill in the select_cc bits and then wrap it in a convenience function for use with regular select. llvm-svn: 23389	2005-09-19 22:34:01 +00:00
Chris Lattner	2f838f2192	Teach the local spiller to turn stack slot loads into register-register copies when possible, avoiding the load (and avoiding the copy if the value is already in the right register). This patch came about when I noticed code like the following being generated: store R17 -> [SS1] ...blah... R4 = load [SS1] This was causing an LSU reject on the G5. This problem was due to the register allocator folding spill code into a reg-reg copy (producing the load), which prevented the spiller from being able to rewrite the load into a copy, despite the fact that the value was already available in a register. In the case above, we now rip out the R4 load and replace it with a R4 = R17 copy. This speeds up several programs on X86 (which spills a lot :) ), e.g. smg2k from 22.39->20.60s, povray from 12.93->12.66s, 168.wupwise from 68.54->53.83s (!), 197.parser from 7.33->6.62s (!), etc. This may have a larger impact in some cases on the G5 (by avoiding LSU rejects), though it probably won't trigger as often (less spilling in general). Targets that implement folding of loads/stores into copies should implement the isLoadFromStackSlot hook to get this. llvm-svn: 23388	2005-09-19 06:56:21 +00:00
Nate Begeman	24a7eca282	More DAG combining. Still need the branch instructions, and select_cc llvm-svn: 23371	2005-09-16 00:54:12 +00:00
Chris Lattner	d4382f0afa	If a function has liveins, and if the target requested that they be plopped into particular vregs, emit copies into the entry MBB. llvm-svn: 23331	2005-09-13 19:30:54 +00:00
Chris Lattner	2d454bf5be	Allow targets to say they don't support truncstore i1 (which includes a mask when storing to an 8-bit memory location), as most don't. llvm-svn: 23303	2005-09-10 00:20:18 +00:00
Chris Lattner	bd39c1a4c6	Add a missing #include, patch courtesy of Baptiste Lepilleur. llvm-svn: 23302	2005-09-09 23:53:39 +00:00
Chris Lattner	331b311f7b	Fix a problem duraid encountered on itanium where this folding: select (x < y), 1, 0 -> (x < y) incorrectly: the setcc returns i1 but the select returned i32. Add the zero extend as needed. llvm-svn: 23301	2005-09-09 23:00:07 +00:00
Chris Lattner	16e5cb87ba	Fix a crash viewing dags that have target nodes in them llvm-svn: 23300	2005-09-09 22:35:03 +00:00
Chris Lattner	1410003751	Use continue in the use-processing loop to make it clear what the early exits are, simplify logic, and cause things to not be nested as deeply. This also uses MRI->areAliases instead of an explicit loop. No functionality change, just code cleanup. llvm-svn: 23296	2005-09-09 20:29:51 +00:00
Nate Begeman	049b748c76	Last round of 2-node folds from SD.cpp. Will move on to 3 node ops such as setcc and select next. llvm-svn: 23295	2005-09-09 19:49:52 +00:00
Chris Lattner	ce3662f2a2	remove debugging code slaps head llvm-svn: 23294	2005-09-09 19:19:20 +00:00
Chris Lattner	c9053083eb	When spilling a live range that is used multiple times by one instruction, only add a reload live range once for the instruction. This is one step towards fixing a regalloc pessimization that Nate notice, but is later undone by the spiller (so no code is changed). llvm-svn: 23293	2005-09-09 19:17:47 +00:00
Nate Begeman	85c1cc4523	Move yet more folds over to the dag combiner from sd.cpp llvm-svn: 23278	2005-09-08 20:18:10 +00:00
Nate Begeman	2cc2c9a79c	Another round of dag combiner changes. This fixes some missing XOR folds as well as fixing how we replace old values with new values. llvm-svn: 23260	2005-09-07 23:25:52 +00:00
Chris Lattner	5d16dbd5bb	Fix a bug that Tzu-Chien Chiu noticed: live interval analysis does NOT preserve livevar llvm-svn: 23259	2005-09-07 17:34:39 +00:00
Nate Begeman	6791d63e55	Implement a common missing fold, (add (add x, c1), c2) -> (add x, c1+c2). This restores all of stanford to being identical with and without the dag combiner with the add folding turned off in sd.cpp. llvm-svn: 23258	2005-09-07 16:09:19 +00:00
Chris Lattner	fe883adfd2	Fix a bug nate ran into with replacealluseswith. In the recursive cse case, we were losing a node, causing an assertion to fail. Now we eagerly delete discovered CSE's, and provide an optional vector to keep track of these discovered equivalences. llvm-svn: 23255	2005-09-07 05:37:01 +00:00
Nate Begeman	007c650699	Add an option to the DAG Combiner to enable it for beta runs, and turn on that option for PowerPC's beta. llvm-svn: 23253	2005-09-07 00:15:36 +00:00
Nate Begeman	d23739d020	Next round of DAGCombiner changes. This version now passes all the tests I have run so far when run before Legalize. It still needs to pick up the SetCC folds, and nodes that use SetCC. llvm-svn: 23243	2005-09-06 04:43:02 +00:00
Chris Lattner	821628ff2a	Fix a checking failure in gs llvm-svn: 23235	2005-09-03 01:04:40 +00:00
Nate Begeman	7cea6ef16e	Next round of DAG Combiner changes. Just need to support multiple return values, and then we should be able to hook it up. llvm-svn: 23231	2005-09-02 21:18:40 +00:00
Chris Lattner	1a570f1fe4	Clean up some code from the last checkin llvm-svn: 23229	2005-09-02 20:32:45 +00:00
Chris Lattner	630226697f	Fix a bug in legalize where it would emit two calls to libcalls that return i64 values on targets that need that expanded to 32-bit registers. This fixes PowerPC/2005-09-02-LegalizeDuplicatesCalls.ll and speeds up 189.lucas from taking 122.72s to 81.96s on my desktop. llvm-svn: 23228	2005-09-02 20:26:58 +00:00
Chris Lattner	b95b280bee	Make sure to auto-cse nullary ops llvm-svn: 23224	2005-09-02 19:36:17 +00:00
Chris Lattner	1e89e36dcd	Fix some buggy logic where we would try to remove nodes with two operands from the binary ops map, even if they had multiple results. This latent bug caused a few failures with the dag isel last night. To prevent stuff like this from happening in the future, add some really strict checking to make sure that the CSE maps always match up with reality! llvm-svn: 23221	2005-09-02 19:15:44 +00:00
Chris Lattner	b0b4ec5655	Don't create zero sized stack objects even for array allocas with a zero number of elements. llvm-svn: 23219	2005-09-02 18:41:28 +00:00
Chris Lattner	b6cde17d29	Fix the release build, noticed by Eric van Riet Paap llvm-svn: 23215	2005-09-02 07:09:28 +00:00
Chris Lattner	d9af1aab51	Make sure to legalize assert[zs]ext's operand correctly llvm-svn: 23208	2005-09-02 01:15:01 +00:00
Chris Lattner	7138f91424	Teach live intervals to not crash on dead livein regs llvm-svn: 23206	2005-09-02 00:20:32 +00:00
Chris Lattner	a66403dbf7	For values that are live across basic blocks and need promotion, use ANY_EXTEND instead of ZERO_EXTEND to eliminate extraneous extensions. This eliminates dead zero extensions on formal arguments and other cases on PPC, implementing the newly tightened up test/Regression/CodeGen/PowerPC/small-arguments.ll test. llvm-svn: 23205	2005-09-02 00:19:37 +00:00
Chris Lattner	7753f175e6	legalize ANY_EXTEND appropriately llvm-svn: 23204	2005-09-02 00:18:10 +00:00
Chris Lattner	8c393c218b	Add support for ANY_EXTEND and add a few minor folds for it llvm-svn: 23203	2005-09-02 00:17:32 +00:00
Nate Begeman	d78d975437	Fix some code in the current node combining code, spotted when it was moved over to DAGCombiner.cpp 1. Don't assume that SetCC returns i1 when folding (xor (setcc) constant) 2. Don't duplicate code in folding AND with AssertZext that is handled by MaskedValueIsZero llvm-svn: 23196	2005-09-01 23:25:49 +00:00
Nate Begeman	2504fe2613	Implement first round of feedback from chris (there's still a couple things left to do). llvm-svn: 23195	2005-09-01 23:24:04 +00:00
Chris Lattner	975f5c9f46	It is NDEBUG not _NDEBUG llvm-svn: 23186	2005-09-01 18:44:10 +00:00
Nate Begeman	e8f78d1aab	Add the rest of the currently implemented visit routines to the switch statement in visit(). llvm-svn: 23185	2005-09-01 00:33:32 +00:00
Nate Begeman	21158fc485	First pass at the DAG Combiner. It isn't used anywhere yet, but it should be mostly functional. It currently has all folds from SelectionDAG.cpp that do not involve a condition code. llvm-svn: 23184	2005-09-01 00:19:25 +00:00
Chris Lattner	d4d10fff99	If a function has live ins/outs, print them llvm-svn: 23181	2005-08-31 22:34:59 +00:00
Chris Lattner	8a1a5f2818	Allow targets to custom expand shifts that are too large for their registers llvm-svn: 23173	2005-08-31 19:01:53 +00:00
Jeff Cohen	d8c84e3c7e	Fix VC++ precedence warnings llvm-svn: 23169	2005-08-31 02:47:06 +00:00
Nate Begeman	539e7c892c	Sigh, not my day. Fix typo. llvm-svn: 23166	2005-08-31 00:43:49 +00:00
Nate Begeman	d513d8a662	Fix a mistake in my previous patch pointed out by sabre; the AssertZext case in MaskedValueIsZero was wrong. llvm-svn: 23165	2005-08-31 00:43:08 +00:00
Nate Begeman	e07bc28cca	Remove some unnecessary casts, and add the AssertZext case to MaskedValueIsZero. llvm-svn: 23164	2005-08-31 00:27:53 +00:00
Chris Lattner	5764da422a	Allow physregs to occur in the dag with multiple types. Though I don't likethis, it is a requirement on PPC, which can have an f32 value in r3 at onepoint in a function and a f64 value in r3 at another point. :( This fixes compilation of mesa llvm-svn: 23161	2005-08-30 22:38:38 +00:00
Chris Lattner	4d602bed10	When checking the fixed intervals, don't forget to check for register aliases. This fixes PR621 and Regression/CodeGen/X86/2005-08-30-RegAllocAliasProblem.ll llvm-svn: 23158	2005-08-30 21:03:36 +00:00
Chris Lattner	61d21b1f3c	Fix FreeBench/fourinarow with the dag isel, by not adding a bogus result to SHIFT_PARTS nodes llvm-svn: 23151	2005-08-30 17:21:17 +00:00
Chris Lattner	9a4ad487f0	Fix a miscompile of PtrDist/bc. Sign extending bools is not the right thing, at least tends to expose problems elsewhere. llvm-svn: 23149	2005-08-30 16:56:19 +00:00
Nate Begeman	a3da8c4819	Remove a bogus piece of my AssertSext/AssertZext patch. oops. llvm-svn: 23148	2005-08-30 02:54:28 +00:00
Nate Begeman	43144a2fe0	Add support for AssertSext and AssertZext, folding other extensions with them. This allows for elminination of redundant extends in the entry blocks of functions on PowerPC. Add support for i32 x i32 -> i64 multiplies, by recognizing when the inputs to ISD::MUL in ExpandOp are actually just extended i32 values and not real i64 values. this allows us to codegen int mulhs(int a, int b) { return ((long long)a * b) >> 32; } as: _mulhs: mulhw r3, r4, r3 blr instead of: _mulhs: mulhwu r2, r4, r3 srawi r5, r3, 31 mullw r5, r4, r5 add r2, r2, r5 srawi r4, r4, 31 mullw r3, r4, r3 add r3, r2, r3 blr with a similar improvement on x86. llvm-svn: 23147	2005-08-30 02:44:00 +00:00
Chris Lattner	08a1e38730	Name this variable to be what it really is! llvm-svn: 23145	2005-08-30 01:58:51 +00:00
Chris Lattner	04cb82278a	Handle CopyToReg nodes with flag operands correctly llvm-svn: 23144	2005-08-30 01:57:23 +00:00
Chris Lattner	f7e5ec84c6	Add a hack to avoid some horrible code in some cases by always emitting token chains first. For this C function: int test() { int i; for (i = 0; i < 100000; ++i) foo(); } Instead of emitting this (condition before call) .LBB_test_1: ; no_exit addi r30, r30, 1 lis r2, 1 ori r2, r2, 34464 cmpw cr2, r30, r2 bl L_foo$stub bne cr2, .LBB_test_1 ; no_exit Emit this: .LBB_test_1: ; no_exit bl L_foo$stub addi r30, r30, 1 lis r2, 1 ori r2, r2, 34464 cmpw cr0, r30, r2 bne cr0, .LBB_test_1 ; no_exit Which makes it so we don't have to save/restore cr2 in the prolog/epilog of the function. This also makes the code much more similar to what the pattern isel produces. llvm-svn: 23135	2005-08-29 23:21:29 +00:00
Chris Lattner	c738d000d5	Add a new API for Nate llvm-svn: 23131	2005-08-29 21:59:31 +00:00
Andrew Lenharth	835cbb364d	Some of us cared about the the promote path llvm-svn: 23130	2005-08-29 20:46:51 +00:00
Chris Lattner	dcde1b2b6a	Fix an infinite loop on x86 llvm-svn: 23129	2005-08-29 17:30:00 +00:00
Chris Lattner	46d4c75cd1	Fix a bug in my previous patch that was using the wrong iterator. This fixes Olden/bisort among others. llvm-svn: 23124	2005-08-29 00:10:46 +00:00
Chris Lattner	87421c8658	Fix a bug in ReplaceAllUsesWith llvm-svn: 23122	2005-08-28 23:59:36 +00:00
Chris Lattner	075250bda1	Disable this code, which broke many tests last night llvm-svn: 23114	2005-08-27 16:16:51 +00:00
Chris Lattner	5ee85e89b6	fix PHI node emission for basic blocks that have select_cc's in them on ppc32 llvm-svn: 23113	2005-08-27 00:58:02 +00:00
Chris Lattner	56ca46ee04	Nate noticed that Andrew never did this. This fixes PR600 llvm-svn: 23110	2005-08-26 22:50:40 +00:00
Chris Lattner	e7a2998064	Don't copy regs that are only used in the entry block into a vreg. This changes the code generated for: short %test(short %A) { %B = xor short %A, -32768 ret short %B } to: _test: xori r2, r3, 32768 xoris r2, r2, 65535 extsh r3, r2 blr instead of: _test: rlwinm r2, r3, 0, 16, 31 xori r2, r3, 32768 xoris r2, r2, 65535 extsh r3, r2 blr llvm-svn: 23109	2005-08-26 22:49:59 +00:00
Chris Lattner	d4f43f7967	Make this code safe for when loadRegFromStackSlot inserts multiple instructions. llvm-svn: 23108	2005-08-26 22:18:32 +00:00
Chris Lattner	4a5ebe94ba	Checking types here is not safe, because multiple types can map to the same register class. llvm-svn: 23103	2005-08-26 21:39:15 +00:00
Chris Lattner	13d7c252e5	Call the InsertAtEndOfBasicBlock hook if the usesCustomDAGSchedInserter flag is set on an instruction. llvm-svn: 23098	2005-08-26 20:54:47 +00:00
Chris Lattner	373f048a79	Revampt ReplaceAllUsesWith to be more efficient and easier to use. llvm-svn: 23087	2005-08-26 18:36:28 +00:00
Chris Lattner	c30405e0ee	Change ConstantPoolSDNode to actually hold the Constant itself instead of putting it into the constant pool. This allows the isel machinery to create constants that it will end up deciding are not needed, without them ending up in the resultant function constant pool. llvm-svn: 23081	2005-08-26 17:15:30 +00:00
Chris Lattner	2091a36631	Fix a huge annoyance: SelectNodeTo took types before the opcode unlike every other SD API. Fix it to take the opcode before the types. llvm-svn: 23079	2005-08-26 16:36:26 +00:00
Chris Lattner	c6d481db7a	the 5th operand is the 4th number llvm-svn: 23074	2005-08-26 00:43:46 +00:00
Chris Lattner	5f573416cd	Add support for targets that want to custom expand select_cc in some cases. llvm-svn: 23071	2005-08-26 00:23:59 +00:00
Chris Lattner	dff50cadaa	Allow LowerOperation to return a null SDOperand in case it wants to lower some things given to it, but not all. llvm-svn: 23070	2005-08-26 00:14:16 +00:00
Chris Lattner	1cb550c603	Fix a nasty bug from a previous patch of mine llvm-svn: 23069	2005-08-26 00:13:12 +00:00
Nate Begeman	33840c3268	New fold for SELECT_CC llvm-svn: 23058	2005-08-25 20:04:38 +00:00
Chris Lattner	f9c19157df	Don't auto-cse nodes that return flags llvm-svn: 23055	2005-08-25 19:12:10 +00:00
Chris Lattner	12756be53b	add printer support for flag operands llvm-svn: 23054	2005-08-25 17:59:23 +00:00
Chris Lattner	9d28a56d55	simplify the code a bit using isOperationLegal llvm-svn: 23053	2005-08-25 17:54:58 +00:00
Chris Lattner	8a93f64efa	Add support for flag operands llvm-svn: 23050	2005-08-25 17:48:54 +00:00
Chris Lattner	407c6415b4	ADd support for TargetConstantPool nodes llvm-svn: 23041	2005-08-25 05:03:06 +00:00
Chris Lattner	bbe0e7df2c	add a new TargetFrameIndex node llvm-svn: 23035	2005-08-25 00:43:01 +00:00
Chris Lattner	45e1ce4e28	add a method llvm-svn: 23027	2005-08-24 23:00:29 +00:00
Chris Lattner	d7ee4d8671	Add ReplaceAllUsesWith that can take a vector of replacement values. Add some foldings to hopefully help the illegal setcc issue, and move some code around. llvm-svn: 23025	2005-08-24 22:44:39 +00:00
Chris Lattner	ad9565dfbe	Add support for external symbols, and support for variable arity instructions llvm-svn: 23022	2005-08-24 22:02:41 +00:00
Chris Lattner	bb8cc0acb2	Fix pasto that prevented VT ndoes from showing up in -view-isel-dags correctly llvm-svn: 23021	2005-08-24 18:30:00 +00:00
Chris Lattner	86b1658d58	teach selection dag mask tracking about the fact that select_cc operates like select. Also teach it that the bit count instructions can only set the low bits of the result, depending on the size of the input. This allows us to compile this: int %eq0(int %a) { %tmp.1 = seteq int %a, 0 ; <bool> [#uses=1] %tmp.2 = cast bool %tmp.1 to int ; <int> [#uses=1] ret int %tmp.2 } To this: _eq0: cntlzw r2, r3 srwi r3, r2, 5 blr instead of this: _eq0: cntlzw r2, r3 rlwinm r3, r2, 27, 31, 31 blr when setcc is marked illegal on ppc (which restores parity to non-illegal setcc). Thanks to Nate for pointing this out. llvm-svn: 23013	2005-08-24 16:46:55 +00:00
Chris Lattner	f12eb4d676	Start using isOperationLegal and isTypeLegal to simplify the code llvm-svn: 23012	2005-08-24 16:35:28 +00:00
Nate Begeman	45bbbb3f11	Teach SelectionDAG how to simplify a few more setcc-equivalent select_cc nodes so that backends don't have to. llvm-svn: 22999	2005-08-24 04:57:57 +00:00
Chris Lattner	99282c7b92	Make -view-isel-dags show the dag before instruction selecting, in case the target isel crashes due to unimplemented features like calls :) llvm-svn: 22997	2005-08-24 00:34:29 +00:00
Nate Begeman	72eab5dd5c	Fix optimization of select_cc seteq X, 0, 1, 0 -> srl (ctlz X), log2 X size llvm-svn: 22995	2005-08-24 00:21:28 +00:00
Chris Lattner	eeacce5a60	Implement LiveVariables.h change llvm-svn: 22994	2005-08-24 00:09:33 +00:00
Chris Lattner	469652752c	adjust to new live variables interface llvm-svn: 22992	2005-08-23 23:42:17 +00:00
Chris Lattner	774158239b	Simplify this code by using higher-level LiveVariables methods llvm-svn: 22989	2005-08-23 22:51:41 +00:00
Chris Lattner	22e91cc3b5	Keep track of which registers are related to which other registers. Use this information to avoid doing expensive interval intersections for registers that could not possible be interesting. This speeds up linscan on ia64 compiling kc++ in release mode from taking 7.82s to 4.8s(!), total itanium llc time on this program is 27.3s now. This marginally speeds up PPC and X86, but they appear to be limited by other parts of linscan, not this code. On this program, on itanium, live intervals now takes 41% of llc time. llvm-svn: 22986	2005-08-23 22:27:31 +00:00
Nate Begeman	bf8c3939d7	Teach the SelectionDAG how to transform select_cc eq, X, 0, 1, 0 into either seteq X, 0 or srl (ctlz X), size(X-1), depending on what's legal for the target. llvm-svn: 22978	2005-08-23 05:41:12 +00:00
Nate Begeman	987121a61a	Teach Legalize how to turn setcc into select_cc llvm-svn: 22977	2005-08-23 04:29:48 +00:00
Chris Lattner	834a2316a3	Try to avoid scanning the fixed list. On architectures with a non-stupid number of regs (e.g. most riscs), many functions won't need to use callee clobbered registers. Do a speculative check to see if we can get a free register without processing the fixed list (which has all of these). This saves a lot of time on machines with lots of callee clobbered regs (e.g. ppc and itanium, also x86). This reduces ppc llc compile time from 184s -> 172s on kc++. This is probably worth FAR FAR more on itanium though. llvm-svn: 22972	2005-08-22 20:59:30 +00:00
Chris Lattner	95a157ae1a	Move some code in the register assignment case that only needs to happen if we spill out of the fast path. The scan of active_ and the calls to updateSpillWeights don't need to happen unless a spill occurs. This reduces debug llc time of kc++ with ppc from 187.3s to 183.2s. llvm-svn: 22971	2005-08-22 20:20:42 +00:00
Chris Lattner	7f9e078d11	Fix a problem where constant expr shifts would not have their shift amount promoted to the right type. This fixes: IA64/2005-08-22-LegalizerCrash.ll llvm-svn: 22969	2005-08-22 17:28:31 +00:00
Chris Lattner	83b821b584	Speed up this loop a bit, based on some observations that Nate made, and add some comments. This loop really needs to be reevaluated! llvm-svn: 22966	2005-08-22 16:55:22 +00:00
Chris Lattner	92626b9bc5	Add a fast-path for register values. Add support for constant pool entries, allowing us to compile this: float %test2(float* %P) { %Q = load float* %P %R = add float %Q, 10.1 ret float %R } to this: _test2: lfs r2, 0(r3) lis r3, ha16(.CPI_test2_0) lfs r3, lo16(.CPI_test2_0)(r3) fadds f1, r2, r3 blr llvm-svn: 22962	2005-08-22 01:04:32 +00:00
Chris Lattner	466fecee19	add anew method llvm-svn: 22957	2005-08-21 22:30:30 +00:00
Chris Lattner	4866356907	Add support for frame index nodes llvm-svn: 22956	2005-08-21 19:56:04 +00:00
Chris Lattner	0548f50501	add a method llvm-svn: 22955	2005-08-21 19:48:59 +00:00
Chris Lattner	707b39fb8c	add a method llvm-svn: 22949	2005-08-21 18:49:33 +00:00
Chris Lattner	154b2bc59b	Add support for basic blocks, fix a bug in result # computation llvm-svn: 22948	2005-08-21 18:49:29 +00:00
Chris Lattner	539c3fa863	When legalizing brcond ->brcc or select -> selectcc, make sure to truncate the old condition to a one bit value. The incoming value must have been promoted, and the top bits are undefined. This causes us to generate: _test: rlwinm r2, r3, 0, 31, 31 li r3, 17 cmpwi cr0, r2, 0 bne .LBB_test_2 ; .LBB_test_1: ; li r3, 1 .LBB_test_2: ; blr instead of: _test: rlwinm r2, r3, 0, 31, 31 li r2, 17 cmpwi cr0, r3, 0 bne .LBB_test_2 ; .LBB_test_1: ; li r2, 1 .LBB_test_2: ; or r3, r2, r2 blr for: int %test(bool %c) { %retval = select bool %c, int 17, int 1 ret int %retval } llvm-svn: 22947	2005-08-21 18:03:09 +00:00
Chris Lattner	4b08ba26d8	fix bogus warning llvm-svn: 22943	2005-08-20 18:07:27 +00:00
Chris Lattner	319e65696d	Add support for global address nodes llvm-svn: 22940	2005-08-19 22:38:24 +00:00
Chris Lattner	1be7eddecf	Add support for TargetGlobalAddress nodes llvm-svn: 22938	2005-08-19 22:31:04 +00:00
Chris Lattner	6d7f814b01	Implement CopyFromReg, TokenFactor, and fix a bug in CopyToReg. This allows us to compile stuff like this: double %test(double %A, double %B, double %C, double %E) { %F = mul double %A, %A %G = add double %F, %B %H = sub double -0.0, %G %I = mul double %H, %C %J = add double %I, %E ret double %J } to: _test: fnmadd f0, f1, f1, f2 fmadd f1, f0, f3, f4 blr woot! llvm-svn: 22937	2005-08-19 21:43:53 +00:00
Chris Lattner	0875d1ab89	Fix a bug in previous commit llvm-svn: 22936	2005-08-19 21:34:13 +00:00
Chris Lattner	4990335eb8	Print physreg register nodes with target names (e.g. F1) instead of numbers llvm-svn: 22934	2005-08-19 21:21:16 +00:00
Chris Lattner	78b200eb74	Before implementing copyfromreg, we'll implement copytoreg correctly. This gets us this for the previous testcase: _test: lis r2, 0 ori r3, r2, 65535 blr Note that we actually write to r3 (the return reg) correctly now :) llvm-svn: 22933	2005-08-19 20:50:53 +00:00

... 3 4 5 6 7 ...

2106 Commits