llvm-project

Commit Graph

Author	SHA1	Message	Date
Nate Begeman	c41e1be2e8	Remove an unnecsesary file. PPC32 and PPC64 share architected registers. We will decide with subtarget support whether we ever use an i64 register class. llvm-svn: 23734	2005-10-14 18:58:46 +00:00
Chris Lattner	56f31f5408	add the integer truncate/extension operations llvm-svn: 23733	2005-10-14 06:40:20 +00:00
Chris Lattner	7d9f719d42	These are now autogenerated llvm-svn: 23731	2005-10-14 06:26:29 +00:00
Chris Lattner	9c0d3c5932	Add patterns for FP round/extend llvm-svn: 23727	2005-10-14 04:55:50 +00:00
Chris Lattner	6e83cbf7f3	add a new SDTCisOpSmallerThanOp type constraint, and implement fround/fextend in terms of it llvm-svn: 23726	2005-10-14 04:55:10 +00:00
Nate Begeman	6e673b24d3	fold sext_in_reg, sext_in_reg where both have the same VT. This was popping up in Fourinarow. llvm-svn: 23722	2005-10-14 01:29:07 +00:00
Chris Lattner	e3870fbe4a	Allow $ llvm-svn: 23721	2005-10-14 01:28:34 +00:00
Nate Begeman	d59e5a7abb	Relax the checking on zextload generation a bit, since as sabre pointed out you could be AND'ing with the result of a shift that shifts out all the bits you care about, in addition to a constant. Also, move over an add/sub_parts fold from legalize to the dag combiner, where it works for things other than constants. Woot! llvm-svn: 23720	2005-10-14 01:12:21 +00:00
Chris Lattner	b8282987f4	Fix the trunc(load) case, finally allowing crafty and povray to pass llvm-svn: 23718	2005-10-13 22:10:05 +00:00
Chris Lattner	dbc5ae3109	Fix some bugs in (sext (load x)) llvm-svn: 23717	2005-10-13 21:52:31 +00:00
Chris Lattner	258521d7ea	When ExpandOp'ing a [SZ]EXTLOAD, make sure to remember that the chain is also legal. Add support for ExpandOp'ing raw EXTLOADs too. llvm-svn: 23716	2005-10-13 21:44:47 +00:00
Chris Lattner	d23f4b7411	Implement PromoteOp for *EXTLOAD, allowing MallocBench/gs to Legalize llvm-svn: 23715	2005-10-13 20:07:41 +00:00
Nate Begeman	8e022b3d89	Fix the remaining DAGCombiner issues pointed out by sabre. This should fix the remainder of the failures introduced by my patch last night. llvm-svn: 23714	2005-10-13 18:34:58 +00:00
Chris Lattner	a80f1f6e72	Fix a minor bug in the dag combiner that broke pcompress2 and some other tests. llvm-svn: 23713	2005-10-13 18:16:34 +00:00
Nate Begeman	c3a89c5259	Add support to Legalize for expanding i64 sextload/zextload into hi and lo parts. This should fix the crafty and signed long long unit test failure on x86 last night. llvm-svn: 23711	2005-10-13 17:15:37 +00:00
Jim Laskey	5d7a50ac44	Inhibit instructions from being pushed before function calls. This will minimize unnecessary spilling. llvm-svn: 23710	2005-10-13 16:44:00 +00:00
Nate Begeman	02b23c6065	Move some Legalize functionality over to the DAGCombiner where it belongs. Kill some dead code. llvm-svn: 23706	2005-10-13 03:11:28 +00:00
Nate Begeman	70d28c5e32	Fix a potential bug with two combine-to's back to back that chris pointed out, where after the first CombineTo() call, the node the second CombineTo wishes to replace may no longer exist. Fix a very real bug with the truncated load optimization on little endian targets, which do not need a byte offset added to the load. llvm-svn: 23704	2005-10-12 23:18:53 +00:00
Nate Begeman	8caf81d617	More cool stuff for the dag combiner. We can now finally handle things like turning: _foo: fctiwz f0, f1 stfd f0, -8(r1) lwz r2, -4(r1) rlwinm r3, r2, 0, 16, 31 blr into _foo: fctiwz f0,f1 stfd f0,-8(r1) lhz r3,-2(r1) blr Also removed an unncessary constraint from sra -> srl conversion, which should take care of hte only reason we would ever need to handle sra in MaskedValueIsZero, AFAIK. llvm-svn: 23703	2005-10-12 20:40:40 +00:00
Jim Laskey	63b1419b74	Finally committing to the new scheduler. Still -sched=none by default. llvm-svn: 23702	2005-10-12 18:29:35 +00:00
Jim Laskey	d00db257c7	Added graphviz/gv support for MF. llvm-svn: 23700	2005-10-12 12:09:05 +00:00
Chris Lattner	192cd18f53	Fix (hopefully the last) issue where LSR is nondeterminstic. When pulling out CSE's of base expressions it could build a result whose order was nondet. llvm-svn: 23698	2005-10-11 18:41:04 +00:00
Chris Lattner	5c9d63da31	Fix another problem where LSR was being nondeterminstic. Also remove elements from the end of a vector instead of the beginning llvm-svn: 23697	2005-10-11 18:30:57 +00:00
Chris Lattner	b7a3894e7c	Fix another lsr-is-nondeterministic case llvm-svn: 23695	2005-10-11 18:17:57 +00:00
Chris Lattner	514f058be1	Fix a powerpc crash on CodeGen/Generic/llvm-ct-intrinsics.ll llvm-svn: 23694	2005-10-11 17:56:34 +00:00
Chris Lattner	c38fb8e2a1	Add a canonicalization that got lost, fixing PowerPC/fold-li.ll:SUB llvm-svn: 23693	2005-10-11 06:07:15 +00:00
Chris Lattner	cc6e53e6ee	clean up some corner cases llvm-svn: 23692	2005-10-10 23:00:08 +00:00
Chris Lattner	04c737091f	Implement trivial DSE. If two stores are neighbors and store to the same location, replace them with a new store of the last value. This occurs in the same neighborhood in 197.parser, speeding it up about 1.5% llvm-svn: 23691	2005-10-10 22:31:19 +00:00
Chris Lattner	e260ed8628	Add support for CombineTo, allowing the dag combiner to replace nodes with multiple results. Use this support to implement trivial store->load forwarding, implementing CodeGen/PowerPC/store-load-fwd.ll. Though this is the most simple case and can be extended in the future, it is still useful. For example, it speeds up 197.parser by 6.2% by avoiding an LSU reject in xalloc: stw r6, lo16(l5_end_of_array)(r2) addi r2, r5, -4 stwx r5, r4, r2 - lwzx r5, r4, r2 - rlwinm r5, r5, 0, 0, 30 stwx r5, r4, r2 lwz r2, -4(r4) ori r2, r2, 1 llvm-svn: 23690	2005-10-10 22:04:48 +00:00
Nate Begeman	6828ed9bfd	Teach the DAGCombiner several new tricks, teaching it how to turn sext_inreg into zext_inreg based on the signbit (fires a lot), srem into urem, etc. llvm-svn: 23688	2005-10-10 21:26:48 +00:00
Chris Lattner	7730924067	Fix comment llvm-svn: 23686	2005-10-10 16:52:03 +00:00
Chris Lattner	3d1d4a3d12	Add ISD::ADD to MaskedValueIsZero llvm-svn: 23685	2005-10-10 16:51:40 +00:00
Chris Lattner	56e44a6da5	This function is now dead llvm-svn: 23684	2005-10-10 16:49:22 +00:00
Chris Lattner	bcfebebf22	Enable Nate's excellent DAG combiner work by default. This allows the removal of a bunch of ad-hoc and crufty code from SelectionDAG.cpp. llvm-svn: 23682	2005-10-10 16:47:10 +00:00
Chris Lattner	d59a57a8d5	These definitions have been moved to common code. llvm-svn: 23681	2005-10-10 06:01:00 +00:00
Chris Lattner	d83571bbf2	Pull DAG ISel generation nodes out of the PowerPC backend to where they can be used by other targets. For those targets that want to use it, have at. :) llvm-svn: 23680	2005-10-10 06:00:30 +00:00
Chris Lattner	6a49b7cabb	add a todo for something I noticed llvm-svn: 23679	2005-10-09 22:59:08 +00:00
Chris Lattner	1d3dc00674	(X & Y) & C == 0 if either X&C or Y&C are zero llvm-svn: 23678	2005-10-09 22:12:36 +00:00
Chris Lattner	03b9eb506c	Make MaskedValueIsZero a bit more aggressive llvm-svn: 23677	2005-10-09 22:08:50 +00:00
Andrew Lenharth	1dfb85c7af	This seems useful from the original patch that added the function. If there is a reason it is not useful on a RISC type target, let me know and I will pull it out llvm-svn: 23676	2005-10-09 20:11:35 +00:00
Chris Lattner	62010c450f	Fix funky xcode indentation llvm-svn: 23674	2005-10-09 06:36:35 +00:00
Chris Lattner	eb4be8b942	Hrm, you didn't see this. llvm-svn: 23673	2005-10-09 06:24:02 +00:00
Chris Lattner	4ea0a3eaac	Fix a source of non-determinism in the backend: the order of processing IV strides dependend on the pointer order of the strides in memory. Non-determinism is bad. llvm-svn: 23672	2005-10-09 06:20:55 +00:00
Chris Lattner	0832f2635a	When emiting a CopyFromReg and the source is already a vreg, do not bother creating a new vreg and inserting a copy: just use the input vreg directly. This speeds up the compile (e.g. about 5% on mesa with a debug build of llc) by not adding a bunch of copies and vregs to be coallesced away. On mesa, for example, this reduces the number of intervals from 168601 to 129040 going into the coallescer. llvm-svn: 23671	2005-10-09 05:58:56 +00:00
Chris Lattner	89c7fa22b1	Disable formation of rlwinm instructions from SRA bases. This fixes the 177.mesa failure from last night, and fixes the CodeGen/PowerPC/2005-10-08-ArithmeticRotate.ll regression test I added. If this code cannot be fixed, it should be removed for good, but I'll leave it to Nate to decide its fate. llvm-svn: 23670	2005-10-09 05:36:17 +00:00
Nate Begeman	967ce74980	Remove another unused file. Preparing for the great "enable i64 on ppc32" merge, and using subtarget info for ptr size. llvm-svn: 23668	2005-10-08 01:32:34 +00:00
Nate Begeman	af72457fc4	Remove a file that is no longer used llvm-svn: 23666	2005-10-08 01:21:27 +00:00
Nate Begeman	2042aa5b92	Lo and behold, the last bits of SelectionDAG.cpp have been moved over. llvm-svn: 23665	2005-10-08 00:29:44 +00:00
Chris Lattner	dae96f8881	When preselecting, favor things that have low depth to select first. This is faster and uses less stack space. This reduces our stack requirement enough to compile sixtrack, and though it's a hack, should be enough until we switch to iterative isel llvm-svn: 23664	2005-10-07 22:10:27 +00:00
Chris Lattner	be4bbca0ba	remove debugging code llvm-svn: 23663	2005-10-07 15:31:26 +00:00
Chris Lattner	fb12624a3f	implement CodeGen/PowerPC/div-2.ll:test2-4 by propagating zero bits through C-X's llvm-svn: 23662	2005-10-07 15:30:32 +00:00
Chris Lattner	b27a4147d3	fix indentation llvm-svn: 23660	2005-10-07 06:37:02 +00:00
Chris Lattner	5bcd0dd811	Turn sdivs into udivs when we can prove the sign bits are clear. This implements CodeGen/PowerPC/div-2.ll llvm-svn: 23659	2005-10-07 06:10:46 +00:00
Jeff Cohen	572910c9a2	Remove useless variable. llvm-svn: 23656	2005-10-07 05:28:29 +00:00
Chris Lattner	20a244577d	add a hack to work around broken VC++ scoping rules. Thx to JeffC for pointing this out to me llvm-svn: 23655	2005-10-07 05:23:36 +00:00
Chris Lattner	e373592258	Fix a CQ regression from my patch to split F32/F64 into seperate register classes on PPC. We were emitting fmr instructions to do fp extensions, which weren't getting coallesced. This fixes Regression/CodeGen/PowerPC/fpcopy.ll llvm-svn: 23654	2005-10-07 05:00:52 +00:00
Chris Lattner	cd8b421799	Fix CodeGen/Generic/bool-to-double.ll llvm-svn: 23652	2005-10-07 04:50:48 +00:00
Chris Lattner	318622fb9f	Pull out Call, reducing stack frame size from 6032 bytes to 5184 bytes. llvm-svn: 23650	2005-10-06 19:07:45 +00:00
Chris Lattner	491b8294f4	Pull out setcc, this reduces stack frame size from 7520 to 6032 bytes llvm-svn: 23649	2005-10-06 19:03:35 +00:00
Chris Lattner	502a36935e	Pull two more methods out, reducing stack frame size from 8224 -> 7520 bytes llvm-svn: 23648	2005-10-06 18:56:10 +00:00
Chris Lattner	259e6c76f2	Add a recursive-iterative hybrid stage to attempt to reduce stack space, this helps but not enough. Start pulling cases out of PPC32DAGToDAGISel::Select. With GCC 4, this function required 8512 bytes of stack space for each invocation (GCC 3 required less than 700 bytes). Pulling this first function out gets us down to 8224. More to come :( llvm-svn: 23647	2005-10-06 18:45:51 +00:00
Chris Lattner	7bf8d06f02	silence a bogus GCC warning llvm-svn: 23646	2005-10-06 17:39:10 +00:00
Chris Lattner	fabe55f155	Fix the LLC regressions on X86 last night. In particular, when undoing previous copy elisions and we discover we need to reload a register, make sure to use the regclass of the original register for the reload, not the class of the current register. This avoid using 16-bit loads to reload 32-bit values. llvm-svn: 23645	2005-10-06 17:19:06 +00:00
Andrew Lenharth	e4c91fc9e8	This is suppose to work now llvm-svn: 23644	2005-10-06 16:54:29 +00:00
Andrew Lenharth	332df13b9e	remove VAX compatibility instruction, we will never use this llvm-svn: 23643	2005-10-06 16:53:32 +00:00
Chris Lattner	4bbbb9eed7	Make the legalizer completely non-recursive llvm-svn: 23642	2005-10-06 01:20:27 +00:00
Nate Begeman	558beb3729	Let the combiner handle more cases llvm-svn: 23641	2005-10-05 21:44:43 +00:00
Nate Begeman	f8221c5e2c	Remove some bad code from Legalize llvm-svn: 23640	2005-10-05 21:44:10 +00:00
Nate Begeman	bd7df030d2	Check in some more DAGCombiner pieces llvm-svn: 23639	2005-10-05 21:43:42 +00:00
Chris Lattner	55149d7835	Fix a bug in the local spiller, where we could take code like this: store r12 -> [ss#2] R3 = load [ss#1] use R3 R3 = load [ss#2] R4 = load [ss#1] and turn it into this code: store R12 -> [ss#2] R3 = load [ss#1] use R3 R3 = R12 R4 = R3 <- oops! The problem was that promoting R3 = load[ss#2] to a copy missed the fact that the instruction invalidated R3 at that point. llvm-svn: 23638	2005-10-05 18:30:19 +00:00
Chris Lattner	05da0d966e	silence some warnings llvm-svn: 23637	2005-10-05 17:15:09 +00:00
Chris Lattner	a49e16fefa	implement visitBR_CC so that PowerPC/inverted-bool-compares.ll passes with the dag combiner. This speeds up espresso by 8%, reaching performance parity with the dag-combiner-disabled llc. llvm-svn: 23636	2005-10-05 06:47:48 +00:00
Chris Lattner	b11d15637a	fix some pastos llvm-svn: 23635	2005-10-05 06:37:22 +00:00
Chris Lattner	06f1d0f73a	Add a new HandleNode class, which is used to handle (haha) cases in the dead node elim and dag combiner passes where the root is potentially updated. This fixes a fixme in the dag combiner. llvm-svn: 23634	2005-10-05 06:35:28 +00:00
Chris Lattner	a6895d180e	Implement the code for PowerPC/inverted-bool-compares.ll, even though it that testcase still does not pass with the dag combiner. This is because not all forms of br* are folded yet. Also, when we combine a node into another one, delete the node immediately instead of waiting for the node to potentially come up in the future. llvm-svn: 23632	2005-10-05 06:11:08 +00:00
Chris Lattner	6bd8fd09b6	make sure that -view-isel-dags is the input to the isel, not the input to the second phase of dag combining llvm-svn: 23631	2005-10-05 06:09:10 +00:00
Chris Lattner	746d50a01a	Fix a crash compiling Olden/tsp llvm-svn: 23630	2005-10-05 04:45:43 +00:00
Chris Lattner	3b793c6521	refactor a bit of code. When moving constant entries in 'Map' if the entry is the representative constant for the abstractypemap, make sure to update it as well. This fixes the bcreader failures from last night on several C++ apps. llvm-svn: 23628	2005-10-04 21:35:50 +00:00
Chris Lattner	dff59118c6	Minor speedup to avoid array searches given a Use*. This speeds up bc reading of the python test from 1:00 to 54s. llvm-svn: 23627	2005-10-04 18:47:09 +00:00
Chris Lattner	7a1450dbc6	Change the signature of replaceUsesOfWithOnConstant. The bool was always true dynamically. Finally, pass the Use* that replaceAllUsesWith has into the method for future use. llvm-svn: 23626	2005-10-04 18:13:04 +00:00
Chris Lattner	935aa922e3	For large constants (e.g. arrays and structs with many elements) just creating the keys and doing comparisons to index into 'Map' takes a lot of time. For these large constants, keep an inverse map so that 'remove' and move operations are much faster. This speeds up a release build of the bc reader on Eric's nasty python bytecode file from 1:39 to 1:00s. llvm-svn: 23624	2005-10-04 17:48:46 +00:00
Chris Lattner	5bbf60a5b6	minor cleanup/fastpath for the bcreader. This speeds up the bcreader from 1:41 -> 1:39 on the large python .bc file in a release build. llvm-svn: 23623	2005-10-04 16:52:46 +00:00
Jim Laskey	327d4298e1	Reverting to version - until problem isolated. llvm-svn: 23622	2005-10-04 16:41:51 +00:00
Chris Lattner	d1a5bc8dbd	Add a forward def llvm-svn: 23621	2005-10-04 05:09:20 +00:00
Nate Begeman	5da6908d65	Fix some faulty logic in the libcall inserter. Since calls return more than one value, don't bail if one of their uses happens to be a node that's not an MVT::Other when following the chain from CALLSEQ_START to CALLSEQ_END. Once we've found a CALLSEQ_START, we can just return; there's no need to tail-recurse further up the graph. Most importantly, just because something only has one use doesn't mean we should use it's one use to follow from start to end. This faulty logic caused us to follow a chain of one-use FP operations back to a much earlier call, putting a cycle in the graph from a later start to an earlier end. This is a better fix that reverting to the workaround committed earlier today. llvm-svn: 23620	2005-10-04 02:10:55 +00:00
Chris Lattner	8760ec73d8	implement the struct version of the array speedup, speeding up the testcase a bit more from 1:48 -> 1.40. llvm-svn: 23619	2005-10-04 01:17:50 +00:00
Chris Lattner	20b0754c41	Fix DemoteRegToStack on an invoke. This fixes PR634. llvm-svn: 23618	2005-10-04 00:44:01 +00:00
Nate Begeman	54fb5002e5	Add back a workaround that fixes some breakages from chris's last change. Neither of us have yet figured out why this code is necessary, but stuff breaks if its not there. Still tracking this down... llvm-svn: 23617	2005-10-04 00:37:37 +00:00
Chris Lattner	4c3b2b536c	Clean up the code a bit. Use isInstructionTriviallyDead to be more aggressive and more correct than use_empty(). This fixes PR635 and SimplifyCFG/2005-10-02-InvokeSimplify.ll llvm-svn: 23616	2005-10-03 23:43:43 +00:00
Chris Lattner	b64419ac40	Change ConstantArray::replaceUsesOfWithOnConstant to attempt to update constant arrays in place instead of reallocating them and replaceAllUsesOf'ing the result. This speeds up a release build of the bcreader from: 136.987u 120.866s 4:24.38 to 49.790u 49.890s 1:40.14 ... a 2.6x speedup parsing a large python bc file. llvm-svn: 23614	2005-10-03 22:51:37 +00:00
Chris Lattner	c4062ba65f	move some methods, no other changes llvm-svn: 23613	2005-10-03 21:58:36 +00:00
Chris Lattner	0144fadc17	minor microoptimizations llvm-svn: 23612	2005-10-03 21:56:24 +00:00
Chris Lattner	bad09e71d0	Use a map to cache the ModuleType information, so we can do logarithmic lookups instead of linear time lookups. This speeds up bc parsing of a large file from 137.834u 118.256s 4:27.96 to 132.611u 114.436s 4:08.53 with a release build. llvm-svn: 23611	2005-10-03 21:26:53 +00:00
Jim Laskey	409a6b204e	Refactor gathering node info and emission. llvm-svn: 23610	2005-10-03 12:30:32 +00:00
Chris Lattner	57b21f9f10	clean up this code a bit, no functionality change llvm-svn: 23609	2005-10-03 07:22:07 +00:00
Chris Lattner	afef68baff	Speed up the asm printer a lot by not printing formatted LLVM asm output for globals llvm-svn: 23608	2005-10-03 07:08:36 +00:00
Chris Lattner	5f096e2847	Break the body of the loop out into a new method llvm-svn: 23606	2005-10-03 04:47:08 +00:00
Chris Lattner	f07a587c79	Make IVUseShouldUsePostIncValue more aggressive when the use is a PHI. In particular, it should realize that phi's use their values in the pred block not the phi block itself. This change turns our em3d loop from this: _test: cmpwi cr0, r4, 0 bgt cr0, LBB_test_2 ; entry.no_exit_crit_edge LBB_test_1: ; entry.loopexit_crit_edge li r2, 0 b LBB_test_6 ; loopexit LBB_test_2: ; entry.no_exit_crit_edge li r6, 0 LBB_test_3: ; no_exit or r2, r6, r6 lwz r6, 0(r3) cmpw cr0, r6, r5 beq cr0, LBB_test_6 ; loopexit LBB_test_4: ; endif addi r3, r3, 4 addi r6, r2, 1 cmpw cr0, r6, r4 blt cr0, LBB_test_3 ; no_exit LBB_test_5: ; endif.loopexit.loopexit_crit_edge addi r3, r2, 1 blr LBB_test_6: ; loopexit or r3, r2, r2 blr into: _test: cmpwi cr0, r4, 0 bgt cr0, LBB_test_2 ; entry.no_exit_crit_edge LBB_test_1: ; entry.loopexit_crit_edge li r2, 0 b LBB_test_5 ; loopexit LBB_test_2: ; entry.no_exit_crit_edge li r6, 0 LBB_test_3: ; no_exit lwz r2, 0(r3) cmpw cr0, r2, r5 or r2, r6, r6 beq cr0, LBB_test_5 ; loopexit LBB_test_4: ; endif addi r3, r3, 4 addi r6, r6, 1 cmpw cr0, r6, r4 or r2, r6, r6 blt cr0, LBB_test_3 ; no_exit LBB_test_5: ; loopexit or r3, r2, r2 blr Unfortunately, this is actually worse code, because the register coallescer is getting confused somehow. If it were doing its job right, it could turn the code into this: _test: cmpwi cr0, r4, 0 bgt cr0, LBB_test_2 ; entry.no_exit_crit_edge LBB_test_1: ; entry.loopexit_crit_edge li r6, 0 b LBB_test_5 ; loopexit LBB_test_2: ; entry.no_exit_crit_edge li r6, 0 LBB_test_3: ; no_exit lwz r2, 0(r3) cmpw cr0, r2, r5 beq cr0, LBB_test_5 ; loopexit LBB_test_4: ; endif addi r3, r3, 4 addi r6, r6, 1 cmpw cr0, r6, r4 blt cr0, LBB_test_3 ; no_exit LBB_test_5: ; loopexit or r3, r6, r6 blr ... which I'll work on next. :) llvm-svn: 23604	2005-10-03 02:50:05 +00:00
Chris Lattner	e4ed42a426	Refactor some code into a function llvm-svn: 23603	2005-10-03 01:04:44 +00:00
Chris Lattner	360928dbed	This break is bogus and I have no idea why it was there. Basically it prevents memoizing code when IV's are used by phinodes outside of loops. In a simple example, we were getting this code before (note that r6 and r7 are isomorphic IV's): li r6, 0 or r7, r6, r6 LBB_test_3: ; no_exit lwz r2, 0(r3) cmpw cr0, r2, r5 or r2, r7, r7 beq cr0, LBB_test_5 ; loopexit LBB_test_4: ; endif addi r2, r7, 1 addi r7, r7, 1 addi r3, r3, 4 addi r6, r6, 1 cmpw cr0, r6, r4 blt cr0, LBB_test_3 ; no_exit Now we get: li r6, 0 LBB_test_3: ; no_exit or r2, r6, r6 lwz r6, 0(r3) cmpw cr0, r6, r5 beq cr0, LBB_test_6 ; loopexit LBB_test_4: ; endif addi r3, r3, 4 addi r6, r2, 1 cmpw cr0, r6, r4 blt cr0, LBB_test_3 ; no_exit this was noticed in em3d. llvm-svn: 23602	2005-10-03 00:37:33 +00:00
Chris Lattner	8fcce170cf	when checking if we should move a split edge block outside of a loop, check the presplit pred, not the post-split pred. This was causing us to make the wrong decision in some cases, leaving the critical edge block in the loop. llvm-svn: 23601	2005-10-03 00:31:52 +00:00
Chris Lattner	9cfccfb517	Fix a problem where the legalizer would run out of stack space on extremely large basic blocks because it was purely recursive. This switches it to an iterative/recursive hybrid. llvm-svn: 23596	2005-10-02 17:49:46 +00:00
Chris Lattner	7f718e61e8	silence a bogus warning llvm-svn: 23595	2005-10-02 16:30:51 +00:00
Chris Lattner	9982da2703	silence some warnings llvm-svn: 23594	2005-10-02 16:29:36 +00:00
Chris Lattner	c0e655b65d	silence a warning llvm-svn: 23593	2005-10-02 16:27:59 +00:00
Chris Lattner	68303a78ff	add patterns for float binops and fma ops llvm-svn: 23592	2005-10-02 07:46:28 +00:00
Chris Lattner	98da1d9910	Sort the cpu and features table, so that the alpha backend doesn't fail EVERY compile with an assertion that the tables are not sorted! llvm-svn: 23591	2005-10-02 07:13:52 +00:00
Chris Lattner	704d97f8b2	Add assertions to the trivial scheduler to check that the value types match up between defs and uses. llvm-svn: 23590	2005-10-02 07:10:55 +00:00
Chris Lattner	3734d204b8	another solution to the fsel issue. Instead of having 4 variants, just force the comparison to be 64-bits. This is fine because extensions from float to double are free. llvm-svn: 23589	2005-10-02 07:07:49 +00:00
Chris Lattner	9e98672962	fsel can take a different FP type for the comparison and for the result. As such split the FSEL family into 4 things instead of just two. llvm-svn: 23588	2005-10-02 06:58:23 +00:00
Chris Lattner	a17e6c486c	fix an f32/f64 type mismatch llvm-svn: 23587	2005-10-02 06:37:13 +00:00
Chris Lattner	a038d901fb	Codegen CopyFromReg using the regclass that matches the valuetype of the destination vreg. llvm-svn: 23586	2005-10-02 06:34:16 +00:00
Chris Lattner	4155ae0f74	Adjust to change in ctor llvm-svn: 23585	2005-10-02 06:23:51 +00:00
Chris Lattner	5ab9d42bb4	Minor tweak to the branch selector. When emitting a two-way branch, and if we're in a single-mbb loop, make sure to emit the backwards branch as the conditional branch instead of the uncond branch. For example, emit this: LBBl29_z__44: stw r9, 0(r15) stw r9, 4(r15) stw r9, 8(r15) stw r9, 12(r15) addi r15, r15, 16 addi r8, r8, 1 cmpw cr0, r8, r28 ble cr0, LBBl29_z__44 b LBBl29_z__48 * NOT PART OF LOOP Instead of: LBBl29_z__44: stw r9, 0(r15) stw r9, 4(r15) stw r9, 8(r15) stw r9, 12(r15) addi r15, r15, 16 addi r8, r8, 1 cmpw cr0, r8, r28 bgt cr0, LBBl29_z__48 * PART OF LOOP! b LBBl29_z__44 The former sequence has one fewer dispatch group for the loop body. llvm-svn: 23582	2005-10-01 23:06:26 +00:00
Chris Lattner	6f4dc51d6f	like the comment says, enable this llvm-svn: 23581	2005-10-01 23:02:40 +00:00
Chris Lattner	5a7bfe0b72	Add some very paranoid checking for operand/result reg class matchup For instructions that define multiple results, use the right regclass to define the result, not always the rc of result #0 llvm-svn: 23580	2005-10-01 07:45:09 +00:00
Jeff Cohen	f8a5e5ae6e	Fix VC++ warnings. llvm-svn: 23579	2005-10-01 03:57:14 +00:00
Chris Lattner	8713ebf37c	fix typo llvm-svn: 23578	2005-10-01 02:51:36 +00:00
Chris Lattner	d3eee1a09b	Modify the ppc backend to use two register classes for FP: F8RC and F4RC. These are used to represent float and double values, and the two regclasses contain the same physical registers. llvm-svn: 23577	2005-10-01 01:35:02 +00:00
Chris Lattner	fda6944c5b	add a method llvm-svn: 23575	2005-10-01 00:17:07 +00:00
Jim Laskey	d3850457a1	typo llvm-svn: 23574	2005-10-01 00:08:23 +00:00
Jim Laskey	9d96932879	1. Simplify the gathering of node groups. 2. Printing node groups when displaying nodes. llvm-svn: 23573	2005-10-01 00:03:07 +00:00
Jim Laskey	f61232354f	Should be using flag and not chain. llvm-svn: 23572	2005-09-30 23:43:37 +00:00
Nate Begeman	fbfad0b565	Remove some now-dead code. llvm-svn: 23571	2005-09-30 21:28:27 +00:00
Andrew Lenharth	49e48f6234	subtarget support for CIX and FIX extentions (the only 2 I care about right now) llvm-svn: 23569	2005-09-30 20:24:38 +00:00
Jim Laskey	3fe3841c2a	1. Made things node-centric (from operand). 2. Added node groups to handle flagged nodes. 3. Started weaning simple scheduling off existing emitter. llvm-svn: 23566	2005-09-30 19:15:27 +00:00
Chris Lattner	c9f4219cfc	Rename MRegisterDesc -> TargetRegisterDesc for consistency llvm-svn: 23564	2005-09-30 17:49:27 +00:00
Chris Lattner	81f32a2acb	trim down the target info structs now that we have a preferred spill register class for each callee save register Why is V9 maintaining these tables manually? ugh! llvm-svn: 23561	2005-09-30 17:38:36 +00:00
Chris Lattner	2e794c9198	now that we have a reg class to spill with, get this info from the regclass llvm-svn: 23559	2005-09-30 17:19:22 +00:00
Chris Lattner	88025e17c5	constant fold these calls llvm-svn: 23558	2005-09-30 17:16:59 +00:00
Chris Lattner	bb1c9ecb17	simplify this code using the new regclass info passed in llvm-svn: 23557	2005-09-30 17:12:38 +00:00
Chris Lattner	51878189c5	Now that we have getCalleeSaveRegClasses() info, use it to pass the register class into the spill/reload methods. Targets can now rely on that argument. llvm-svn: 23556	2005-09-30 16:59:07 +00:00
Chris Lattner	8688b92b86	stub out a virtual method llvm-svn: 23554	2005-09-30 06:55:18 +00:00
Chris Lattner	4984e99b83	CR registers are not used by this "target" llvm-svn: 23552	2005-09-30 06:43:58 +00:00
Chris Lattner	6169a78f46	these registers don't belong to any register classes, so don't mark them as callee save. They can never be generated by the compiler. llvm-svn: 23551	2005-09-30 06:42:24 +00:00
Chris Lattner	33ce5f8a73	Now that self referential classes are supported, get rid of a work-around. llvm-svn: 23544	2005-09-30 04:13:23 +00:00
Chris Lattner	f6d4173f75	pass extra args llvm-svn: 23539	2005-09-30 01:31:52 +00:00
Chris Lattner	64ca7cda3f	these methods get extra args llvm-svn: 23538	2005-09-30 01:30:55 +00:00
Chris Lattner	a654525c1c	Pass extra regclasses into spilling code llvm-svn: 23537	2005-09-30 01:29:42 +00:00
Chris Lattner	5a6199f387	Change this code ot pass register classes into the stack slot spiller/reloader code. PrologEpilogInserter hasn't been updated yet though, so targets cannot use this info. llvm-svn: 23536	2005-09-30 01:29:00 +00:00
Chris Lattner	08f157c5b2	Use the 32-bit version for now llvm-svn: 23534	2005-09-30 00:05:05 +00:00
Chris Lattner	027a2671ef	Add a bunch of patterns for F64 FP ops, add some more integer ops llvm-svn: 23533	2005-09-29 23:34:24 +00:00
Chris Lattner	1de5706e68	Remove code for patterns that are autogenerated llvm-svn: 23532	2005-09-29 23:33:31 +00:00
Andrew Lenharth	a7a83b9255	begining alpha subtarget support llvm-svn: 23531	2005-09-29 22:54:56 +00:00
Chris Lattner	0a1cd715d4	tblgen autogens this pattern now llvm-svn: 23530	2005-09-29 22:37:24 +00:00
Andrew Lenharth	bae1f9d790	copy and paste error llvm-svn: 23528	2005-09-29 21:11:57 +00:00
Chris Lattner	a748e3ae5b	now that tblgen is smarter, this pattern is not needed. Also, tblgen now inverts commuted versions of ANDC/ORC with the current .td file. llvm-svn: 23527	2005-09-29 19:29:15 +00:00
Chris Lattner	a554c9470b	Insert stores after phi nodes in the normal dest. This fixes LowerInvoke/2005-08-03-InvokeWithPHI.ll llvm-svn: 23525	2005-09-29 17:44:20 +00:00
Chris Lattner	02d3ba3db8	consistency with other cases, no functionality change llvm-svn: 23524	2005-09-29 17:38:52 +00:00
Chris Lattner	eca4f56646	Make the JIT default to the DAG isel instead of the pattern isel, like LLC. The Pattern isel has some strange memory corruption issues going on. :( This should have been converted over anyway, but it got forgotten somehow when switching to the dag isel. llvm-svn: 23523	2005-09-29 17:31:03 +00:00
Chris Lattner	5b2be1f890	Fix two bugs in my patch earlier today that broke int->fp conversion on X86. llvm-svn: 23522	2005-09-29 06:44:39 +00:00
Chris Lattner	87ef943a4c	Fold isascii into a simple comparison. This speeds up 197.parser by 7.4%, bringing the LLC time down to the CBE time. llvm-svn: 23521	2005-09-29 06:17:27 +00:00
Chris Lattner	5f6035feb0	remove a bunch of unneeded stuff, or self evident comments llvm-svn: 23519	2005-09-29 06:16:11 +00:00
Chris Lattner	c244e7c178	Implement a couple of memcmp folds from the todo list llvm-svn: 23517	2005-09-29 04:54:20 +00:00
Jeff Cohen	b01a41a06d	Silence VC++ redeclaration warnings. llvm-svn: 23516	2005-09-29 01:59:49 +00:00
Chris Lattner	08c319fbdd	Never rely on ReplaceAllUsesWith when selecting, use CodeGenMap instead. ReplaceAllUsesWith does not replace scalars SDOperand floating around on the stack, permitting things to be selected multiple times. llvm-svn: 23515	2005-09-29 00:59:32 +00:00
Chris Lattner	d4e9e8b7ec	Codegen ADD X, IMM -> addis/addi if needed. This implements PowerPC/fold-li.ll llvm-svn: 23514	2005-09-28 23:07:13 +00:00
Chris Lattner	b9b2e77295	Autogen MUL, move FP cases together llvm-svn: 23512	2005-09-28 22:53:16 +00:00
Chris Lattner	5769311c92	disentangle FP from INT versions of div/mul llvm-svn: 23511	2005-09-28 22:50:24 +00:00
Chris Lattner	585131baaf	Use the autogenerated matcher for ADD/SUB llvm-svn: 23510	2005-09-28 22:47:28 +00:00
Chris Lattner	f023b2cda2	add a patter for SUBFIC llvm-svn: 23509	2005-09-28 22:47:06 +00:00
Chris Lattner	21551ea5ab	Mark int binops as int-only, add FP binops. Mark FADD/FMUL as commutative but not associative. Add [SU]REM. llvm-svn: 23508	2005-09-28 22:38:27 +00:00
Chris Lattner	cd002b2461	wrap a long line llvm-svn: 23507	2005-09-28 22:30:58 +00:00
Chris Lattner	d3ea19b51a	Add FP versions of the binary operators, keeping the int and fp worlds seperate. llvm-svn: 23506	2005-09-28 22:29:58 +00:00
Chris Lattner	0815dcae3f	Add FP versions of the binary operators, keeping the int and fp worlds seperate. Though I have done extensive testing, it is possible that this will break things in configs I can't test. Please let me know if this causes a problem and I'll fix it ASAP. llvm-svn: 23505	2005-09-28 22:29:17 +00:00
Chris Lattner	6f3b577ee6	Add FP versions of the binary operators, keeping the int and fp worlds seperate. Though I have done extensive testing, it is possible that this will break things in configs I can't test. Please let me know if this causes a problem and I'll fix it ASAP. llvm-svn: 23504	2005-09-28 22:28:18 +00:00
Chris Lattner	7fe6734dff	Mark associative nodes as associative llvm-svn: 23503	2005-09-28 20:58:39 +00:00
Chris Lattner	b97b054ba7	Nate pointed out that mulh[us] are commutative as well. Thanks! llvm-svn: 23500	2005-09-28 19:01:44 +00:00
Chris Lattner	89d168ceb3	expose commutativity information llvm-svn: 23498	2005-09-28 18:27:58 +00:00
Chris Lattner	fab48b3285	All (xor *) cases are autogenerated now llvm-svn: 23497	2005-09-28 18:12:37 +00:00
Chris Lattner	037d69a404	add support for missed eqv tests llvm-svn: 23496	2005-09-28 18:10:51 +00:00
Chris Lattner	33f8e08c8f	Implement PowerPC/eqv-andc-orc-nor.ll:EQV3 llvm-svn: 23494	2005-09-28 18:04:52 +00:00
Chris Lattner	8cd7b88a88	learn to codegen not as NOR instead of xoris/xori llvm-svn: 23490	2005-09-28 17:13:15 +00:00
Chris Lattner	bb5939a436	These nodes are all autogenerated llvm-svn: 23489	2005-09-28 17:07:09 +00:00
Chris Lattner	ea7214b23d	Constant fold llvm.sqrt llvm-svn: 23487	2005-09-28 01:34:32 +00:00
Chris Lattner	3b63bb375c	add a note about a way to improve this code further, that I won't be getting to right now. llvm-svn: 23485	2005-09-27 22:44:59 +00:00
Chris Lattner	eb953f0ef8	Fix a regression in my previous patch, fixing GlobalOpt/2005-09-27-Crash.ll and PR632. llvm-svn: 23484	2005-09-27 22:28:11 +00:00
Chris Lattner	a028e7a39c	Darwin, like many BSD systems, has a setjmp/longjmp which saves the signal mask on setjmp calls and restores it on longjmp calls (both of which require syscalls). This makes the calls REALLY slow. Use _setjmp/_longjmp instead. This speeds up hexxagon from 120.31s to 15.68s: from 5.53x slower than GCC to 28% faster than GCC. llvm-svn: 23482	2005-09-27 22:18:25 +00:00
Chris Lattner	0fd8f9fbc9	If the target prefers it, use _setjmp/_longjmp should be used instead of setjmp/longjmp for llvm.setjmp/llvm.longjmp. llvm-svn: 23481	2005-09-27 22:15:53 +00:00
Chris Lattner	59dc1e082c	initialize new flag llvm-svn: 23480	2005-09-27 22:13:56 +00:00
Chris Lattner	e285f5ed8f	Avoid spilling stack slots... to stack slots. llvm-svn: 23478	2005-09-27 21:33:12 +00:00
Chris Lattner	87eb249300	Completely rewrite 'correct' eh support. This changes how setjmp insertion is performed so it is only at most once per function that contains an invoke instead of once per invoke in the function. This patch has the following perks: 1. It fixes PR631, which complains about slowness. 2. If fixes PR240, which complains about non-volatile vars being live across setjmp/longjmps. 3. It improves (but does not fix) the jmpbuf alignment issue on itanium by not forcing the jmpbufs to always be 8-bytes off the alignment of the structure. 4. It speeds up 253.perlbmk from 338s to 13.70s (a 25x improvement!), making us now about 4% faster than GCC. Further improvements are also possible. llvm-svn: 23477	2005-09-27 21:18:17 +00:00
Chris Lattner	92233d2175	Make the pass name simpler llvm-svn: 23476	2005-09-27 21:10:32 +00:00
Chris Lattner	5635cc069f	fix CBackend/2005-09-27-VolatileFuncPtr.ll llvm-svn: 23475	2005-09-27 20:52:44 +00:00
Chris Lattner	16cd356fb2	allow demotion to volatile values, add support for invoke llvm-svn: 23473	2005-09-27 19:39:00 +00:00
Chris Lattner	c628f00845	Make sure to clear the CodeGenMap after each basic block is selected to avoid cross MBB pollution. llvm-svn: 23470	2005-09-27 17:45:33 +00:00
Jim Laskey	63523f98d5	Remove some redundancies. llvm-svn: 23469	2005-09-27 17:32:45 +00:00
Chris Lattner	e7e139e8e8	Split SimpleConstantVal up into its components, so each Constant subclass getsa different enum value. This allows 'classof' for these to be really simple,not needing to call getType() anymore. This speeds up isa/dyncast/etc for constants, and also makes them smaller. For example, the text section of a release build of InstCombine.cpp shrinks from 230037 bytes to 216363 bytes, a 6% reduction. llvm-svn: 23467	2005-09-27 06:09:08 +00:00
Chris Lattner	3d27e7f27f	Add support for external calls that we know how to constant fold. This implements ctor-list-opt.ll:CTOR8 llvm-svn: 23465	2005-09-27 05:02:43 +00:00
Chris Lattner	29b2780c8a	Fix a bug where we would evaluate stores into linkonce objects which could be potentially replaced at link-time. llvm-svn: 23463	2005-09-27 04:50:03 +00:00
Chris Lattner	65a3a0918f	Implement support for static constructors with calls in them. This is useful because gccas runs globalopt before inlining. This implements ctor-list-opt.ll:CTOR7 llvm-svn: 23462	2005-09-27 04:45:34 +00:00
Chris Lattner	da1889b778	Refactor this code a bit, no functionality changes. llvm-svn: 23460	2005-09-27 04:27:01 +00:00
Chris Lattner	54ec5f2089	Move the post-lsr simplify cfg pass after lowereh, so it can clean up after eh lowering as well. llvm-svn: 23459	2005-09-27 00:14:41 +00:00
Chris Lattner	4435b149a0	minor pattern shuffling llvm-svn: 23458	2005-09-26 22:20:16 +00:00
Jim Laskey	5f2443c8a3	Addition of a simple two pass scheduler. This version is currently hacked up for testing and will require target machine info to do a proper scheduling. The simple scheduler can be turned on using -sched=simple (defaults to -sched=none) llvm-svn: 23455	2005-09-26 21:57:04 +00:00
Chris Lattner	f2f89af69a	Remove some dead code. ctor evaluation subsumes empty ctor elim llvm-svn: 23453	2005-09-26 20:38:20 +00:00
Chris Lattner	6bf2cd5735	Add support for alloca, implementing ctor-list-opt.ll:CTOR6 llvm-svn: 23452	2005-09-26 17:07:09 +00:00
Chris Lattner	46d9ff081d	Add a debug printout, fix a crash on kc++ llvm-svn: 23450	2005-09-26 07:34:35 +00:00
Chris Lattner	46af55e0e4	Implement loads/stores through GEP's of globals. This implements ctor-list-opt.ll:CTOR5. llvm-svn: 23449	2005-09-26 06:52:44 +00:00
Chris Lattner	61ff32cd70	Replace TraverseGEPInitializer with ConstantFoldLoadThroughGEPConstantExpr llvm-svn: 23447	2005-09-26 05:34:07 +00:00

... 2 3 4 5 6 ...

11351 Commits