llvm-project

Commit Graph

Author	SHA1	Message	Date
Chris Lattner	261009a4df	Autogen fsel llvm-svn: 23987	2005-10-25 20:55:47 +00:00
Chris Lattner	65845a2f7c	Expose the fextend on the DAG instead of doing it in the matcher llvm-svn: 23986	2005-10-25 20:54:57 +00:00
Chris Lattner	cd7f101c9a	Autogen a few new ppc-specific nodes llvm-svn: 23985	2005-10-25 20:41:46 +00:00
Chris Lattner	26ee5953f7	The dag isel generator generates this now llvm-svn: 23984	2005-10-25 20:36:10 +00:00
Chris Lattner	c0a201c318	Be a bit more paranoid about calling SelectNodeTo llvm-svn: 23982	2005-10-25 20:26:41 +00:00
Chris Lattner	e1fd05ebde	Fix a couple of minor bugs. The first fixes povray, the second fixes things if the dag combiner isn't run llvm-svn: 23981	2005-10-25 19:32:37 +00:00
Chris Lattner	3b409a85eb	Clear a bit in this file that was causing a miscompilation of 178.galgel. llvm-svn: 23980	2005-10-25 18:57:30 +00:00
Jim Laskey	db4621a5f5	Preparation of supporting scheduling info. Need to find info based on selected CPU. llvm-svn: 23974	2005-10-25 15:15:28 +00:00
Alkis Evlogimenos	cb67b650b5	Stop using deprecated types llvm-svn: 23973	2005-10-25 11:18:06 +00:00
Chris Lattner	76b12c4d95	do not wrap this whole file in namespace llvm llvm-svn: 23962	2005-10-24 06:38:35 +00:00
Chris Lattner	46705b2f2d	Handle allocations that, even after removing dead uses, still have more than one use (but one is a cast). This handles the very common case of: X = alloc [n x byte] Y = cast X to somethingbetter seteq X, null In order to avoid infinite looping when there are multiple casts, we only allow this if the xform is strictly increasing the alignment of the allocation. llvm-svn: 23961	2005-10-24 06:35:18 +00:00
Chris Lattner	355ecc09f8	Fix a bug where we would 'promote' an allocation from one type to another where the second has less alignment required. If we had explicit alignment support in the IR, we could handle this case, but we can't until we do. llvm-svn: 23960	2005-10-24 06:26:18 +00:00
Chris Lattner	ac87beb03a	Before promoting a malloc type, remove dead uses. This makes instcombine more effective at promoting these allocations, catching them earlier in the compile process. llvm-svn: 23959	2005-10-24 06:22:12 +00:00
Chris Lattner	216be91817	Pull some code out into a function, no functionality change llvm-svn: 23958	2005-10-24 06:03:58 +00:00
Chris Lattner	2a65d7b633	Make this build with GCC 4.1, patch contributed by Vladimir A. Merzliakov! llvm-svn: 23956	2005-10-24 04:51:35 +00:00
Chris Lattner	476b8ddd55	Alkis agrees that that iterative scan allocator isn't going to be worked on in the future, remove it. llvm-svn: 23952	2005-10-24 04:14:30 +00:00
Chris Lattner	90b0c99066	Remove this pass, it is not useful llvm-svn: 23949	2005-10-24 02:35:43 +00:00
Chris Lattner	b37336978f	Remove some beta code that no longer has an owner. llvm-svn: 23944	2005-10-24 02:32:41 +00:00
Chris Lattner	f9998d9704	Do not build the ProfilePaths directory anymore llvm-svn: 23943	2005-10-24 02:31:49 +00:00
Chris Lattner	bde3845548	DONT_BUILD_RELINKED is gone and implied by BUILD_ARCHIVE now llvm-svn: 23940	2005-10-24 02:26:13 +00:00
Chris Lattner	df88d79c08	only build .a version of this library llvm-svn: 23938	2005-10-24 02:14:49 +00:00
Chris Lattner	f07e354d8c	Only build .a file versions of these libraries, instead of .a and .o versions. This should speed up build times. llvm-svn: 23937	2005-10-24 02:11:51 +00:00
Chris Lattner	437b6116c9	There is no need to build an archive version of this library llvm-svn: 23936	2005-10-24 02:09:03 +00:00
Chris Lattner	87d4e1c130	This file is hopelessly out of date llvm-svn: 23935	2005-10-24 02:07:08 +00:00
Chris Lattner	1e050bd88b	Only build .a file versions of these libraries, instead of .a and .o versions. This should speed up build times. llvm-svn: 23934	2005-10-24 02:05:35 +00:00
Chris Lattner	8c087e962c	Only build .a file versions of these libraries, instead of .a and .o versions. This should speed up build times. llvm-svn: 23933	2005-10-24 01:59:48 +00:00
Chris Lattner	bd77fac034	Make sure that anything using the ADCE pass pulls in the UnifyFunctionExitNodes code llvm-svn: 23931	2005-10-24 01:40:23 +00:00
Chris Lattner	8df7c01299	don't bother building the archive version of this library llvm-svn: 23927	2005-10-24 01:08:20 +00:00
Chris Lattner	ca014adf8e	expose a ctor llvm-svn: 23924	2005-10-24 01:00:45 +00:00
Chris Lattner	75d4f6fbcb	implement some prototypes llvm-svn: 23920	2005-10-24 00:38:38 +00:00
Chris Lattner	b45e57c2dd	move this to the analyze tool llvm-svn: 23918	2005-10-24 00:27:36 +00:00
Chris Lattner	a551033ea8	Fix a nasty bug that was causing miscompilation of global variables on big endian 32-bit targets in some cases (e.g. PPC). This fixes several PPC JIT failures. llvm-svn: 23914	2005-10-23 23:54:56 +00:00
Chris Lattner	ac23014355	Shrinkify to match llc llvm-svn: 23912	2005-10-23 22:39:01 +00:00
Chris Lattner	d36c34822e	Simplify this, matching changes in the tblgen emitter llvm-svn: 23909	2005-10-23 22:34:25 +00:00
Chris Lattner	03bf3a1763	Simplify this due to changes in the tblgen side llvm-svn: 23908	2005-10-23 22:33:22 +00:00
Chris Lattner	abcce5c4b3	mark this as beta llvm-svn: 23906	2005-10-23 22:23:45 +00:00
Chris Lattner	8ff9df29a9	If a user requests help, give them help on both features and processors llvm-svn: 23905	2005-10-23 22:23:13 +00:00
Chris Lattner	766361e8f4	Autogen subtarget information from .td files. llvm-svn: 23904	2005-10-23 22:15:34 +00:00
Chris Lattner	4b5921d4d8	Add subtarget feature/processor defns to the .td file llvm-svn: 23903	2005-10-23 22:08:45 +00:00
Chris Lattner	a389f0d8fa	rearrange things a bit so that instructions can use subtarget features in the future. llvm-svn: 23902	2005-10-23 22:08:13 +00:00
Chris Lattner	437fd559d7	add a marker llvm-svn: 23901	2005-10-23 22:07:20 +00:00
Chris Lattner	b54070745e	add a note that Nate mentioned last week llvm-svn: 23898	2005-10-23 21:44:59 +00:00
Chris Lattner	2e81fba9cd	Put some of my random notes somewhere public llvm-svn: 23897	2005-10-23 19:52:42 +00:00
Chris Lattner	f64f8407c2	Improve help output. llvm-svn: 23893	2005-10-23 05:33:39 +00:00
Chris Lattner	0d4923b975	improve -help output llvm-svn: 23892	2005-10-23 05:28:51 +00:00
Chris Lattner	18a70c35b8	Move static functions from .h file, reduce #includes, pass strings by const&, use LowercaseString from StringExtras.h, remove extraneous space from help output. llvm-svn: 23891	2005-10-23 05:26:26 +00:00
Jeff Cohen	11e26b52b2	When a function takes a variable number of pointer arguments, with a zero pointer marking the end of the list, the zero must be cast to the pointer type. An un-cast zero is a 32-bit int, and at least on x86_64, gcc will not extend the zero to 64 bits, thus allowing the upper 32 bits to be random junk. The new END_WITH_NULL macro may be used to annotate a such a function so that GCC (version 4 or newer) will detect the use of un-casted zero at compile time. llvm-svn: 23888	2005-10-23 04:37:20 +00:00
Andrew Lenharth	c6072af580	Add several things. loads branches setcc working calls Global address External addresses now I can manage malloc calls. llvm-svn: 23887	2005-10-23 03:43:48 +00:00
Andrew Lenharth	4b3932aa89	add TargetExternalSymbol llvm-svn: 23886	2005-10-23 03:40:17 +00:00
Andrew Lenharth	5a990417f8	Well, the Constant matching pattern works. Can't say much about calls or globals yet. llvm-svn: 23884	2005-10-22 22:06:58 +00:00
Chris Lattner	42b3a5d596	This file is entirely ifdef'd out llvm-svn: 23882	2005-10-22 19:37:08 +00:00
Chris Lattner	9faa5b7a9a	BuildSDIV and BuildUDIV only work for i32/i64, but they don't check that the input is that type, this caused a failure on gs on X86 last night. Move the hard checks into Build[US]Div since that is where decisions like this should be made. llvm-svn: 23881	2005-10-22 18:50:15 +00:00
Jim Laskey	13a19453d2	Add g3 back to the mix and reorder to irritate them anal folk. Actually, it's to group appropriately and provide cues to maintainers that the lists don't need to be ordered. llvm-svn: 23880	2005-10-22 08:04:24 +00:00
Chris Lattner	c5d511c4d9	64-bit reg support should not be enabled by default, as support isn't complete. llvm-svn: 23878	2005-10-21 22:15:43 +00:00
Chris Lattner	75ea5b10bf	add a case missing from the dag combiner that exposed the failure on 2005-10-21-longlonggtu.ll. llvm-svn: 23875	2005-10-21 21:23:25 +00:00
Chris Lattner	e296949fbe	Instead of aborting if not a case we can handle specially, break out and let the generic code handle it. This fixes CodeGen/Generic/2005-10-21-longlonggtu.ll on ppc. also, reindent this code llvm-svn: 23874	2005-10-21 21:17:10 +00:00
Jim Laskey	9ed9032e22	Plugin new subtarget backend into the build. llvm-svn: 23870	2005-10-21 19:05:19 +00:00
Chris Lattner	229878b7bc	silence a release mode warning llvm-svn: 23868	2005-10-21 16:01:26 +00:00
Chris Lattner	e95b5745c0	Make the coallescer a bit smarter, allowing it to join more live ranges. For example, we can now join things like [0-30:0)[31-40:1)[52-59:2) with [40:60:0) if the 52-59 range is defined by a copy from the 40-60 range. The resultant range ends up being [0-30:0)[31-60:1). This fires a lot through-out the test suite (e.g. shrinking bc from 19492 -> 18509 machineinstrs) though most gains are smaller (e.g. about 50 copies eliminated from crafty). llvm-svn: 23866	2005-10-21 06:49:50 +00:00
Chris Lattner	76c97afbbc	Fix LiveInterval::getOverlapingRanges to take things in the right order (an unused method). Fix the merger so that it can merge ranges like this [10:12)[16:40) with [12:38) into [10:40) instead of bogus ranges. This sort of input will be possible for the merger coming shortly llvm-svn: 23865	2005-10-21 06:41:30 +00:00
Nate Begeman	fd0d55ec69	Match rotate. This does actually match the rotates in an rc5 cipher, but I haven't seen it fire on our testsuite. llvm-svn: 23863	2005-10-21 06:36:18 +00:00
Chris Lattner	5df0e36e98	My previous patch was too conservative. Reject FP and void types, but do allow pointer types. llvm-svn: 23859	2005-10-21 05:45:41 +00:00
Nate Begeman	ae5d9bd65b	Don't generate operations that aren't yet supported llvm-svn: 23858	2005-10-21 01:52:45 +00:00
Nate Begeman	62e9e5462c	Kill some now-dead code. llvm-svn: 23857	2005-10-21 01:52:20 +00:00
Nate Begeman	8f62cd32ad	Fix a typo in the dag combiner, so that this can work on i64 targets llvm-svn: 23856	2005-10-21 01:51:45 +00:00
Andrew Lenharth	a099c0131e	byte zap not immediate goodness llvm-svn: 23855	2005-10-21 01:24:05 +00:00
Nate Begeman	4dd383120f	Invert the TargetLowering flag that controls divide by consant expansion. Add a new flag to TargetLowering indicating if the target has really cheap signed division by powers of two, make ppc use it. This will probably go away in the future. Implement some more ISD::SDIV folds in the dag combiner Remove now dead code in the x86 backend. llvm-svn: 23853	2005-10-21 00:02:42 +00:00
Andrew Lenharth	a6a23b5874	Inst cleanup. As a bonus, operands are in the correct order for cmovs. Expect new stuff to pass in the JIT tonight llvm-svn: 23852	2005-10-20 23:58:36 +00:00
Chris Lattner	a553780e98	Use a literal to define ineg instead of immzero llvm-svn: 23851	2005-10-20 23:30:37 +00:00
Chris Lattner	b7b75e1b68	Fix a conditional so we don't access past the end of the range. Thanks to Andrew for bringing this to my attn. llvm-svn: 23850	2005-10-20 22:50:10 +00:00
Andrew Lenharth	d4c0ed74e4	added a few 1 operand form stuff. Seems to break regalloc on alpha. sigh llvm-svn: 23849	2005-10-20 19:39:24 +00:00
Andrew Lenharth	7e0e8234f6	add cttz and ctpop llvm-svn: 23848	2005-10-20 19:38:11 +00:00
Nate Begeman	7efe53d90b	Fix a couple bugs in the const div stuff where we'd generate MULHS/MULHU for types that aren't legal, and fail a divisor is less than zero comparison, which would cause us to drop a subtract. llvm-svn: 23846	2005-10-20 17:45:03 +00:00
Chris Lattner	a6efeb01f9	don't use llabs with apparently VC++ doesn't have llvm-svn: 23845	2005-10-20 17:01:00 +00:00
Chris Lattner	35852fc391	Fix order of eval problem from when I refactored this into a function. llvm-svn: 23844	2005-10-20 16:56:40 +00:00
Andrew Lenharth	eb0ad1863b	Sounds good, finish the intop conversion. llvm-svn: 23843	2005-10-20 14:42:48 +00:00
Nate Begeman	60bbe2d1e5	Add some more patterns for i64 on ppc llvm-svn: 23842	2005-10-20 07:51:08 +00:00
Chris Lattner	3cf40798ab	add a new method, play around with some code. Fix a bug in the extendIntervalEndTo method. In particular, if adding [2:10) to an interval containing [0:2),[10:30), we produced [0:10),[10,30). Which is not the most smart thing to do. Now produce [0:30). llvm-svn: 23841	2005-10-20 07:39:25 +00:00
Chris Lattner	8816353040	Refactor some code, pulling it out into a function. No functionality change. llvm-svn: 23839	2005-10-20 06:06:30 +00:00
Chris Lattner	0c0b38bb4c	Do NOT touch FP ops with LSR. This fixes a testcase Nate sent me from an inner loop like this: LBB_RateConvertMono8AltiVec_2: ; no_exit lis r2, ha16(.CPI_RateConvertMono8AltiVec_0) lfs f3, lo16(.CPI_RateConvertMono8AltiVec_0)(r2) fmr f3, f3 fadd f0, f2, f0 fadd f3, f0, f3 fcmpu cr0, f3, f1 bge cr0, LBB_RateConvertMono8AltiVec_2 ; no_exit to an inner loop like this: LBB_RateConvertMono8AltiVec_1: ; no_exit fsub f2, f2, f1 fcmpu cr0, f2, f1 fmr f0, f2 bge cr0, LBB_RateConvertMono8AltiVec_1 ; no_exit Doh! good catch! llvm-svn: 23838	2005-10-20 04:47:10 +00:00
Chris Lattner	fd07fcda67	Add some pattern fragments to simplify the repetitive parts of the patterns for some common ops and use them for a few examples. Andrew, if you like this, feel free to convert the rest over, if you hate it, feel free to revert. llvm-svn: 23837	2005-10-20 04:21:06 +00:00
Chris Lattner	cd4be8798f	simplify this a bit by using immediates llvm-svn: 23836	2005-10-20 03:57:03 +00:00
Nate Begeman	c6f067a8c4	Move the target constant divide optimization up into the dag combiner, so that the nodes can be folded with other nodes, and we can not duplicate code in every backend. Alpha will probably want this too. llvm-svn: 23835	2005-10-20 02:15:44 +00:00
Andrew Lenharth	794f15868a	forgot this one llvm-svn: 23833	2005-10-20 00:29:02 +00:00
Andrew Lenharth	7b69867052	ret 0; works, not much else still lots of uglyness. Maybe calls will come soon. Fixing the return value of things will be necessary to make alpha work. llvm-svn: 23832	2005-10-20 00:28:31 +00:00
John Criswell	196e8c1f58	This fixes PR638: Regression/CodeGen/Generic/2004-02-08-UnwindSupport.llx llvm-svn: 23831	2005-10-19 20:07:15 +00:00
Jim Laskey	74ab9960f2	Added InstrSchedClass to each of the PowerPC Instructions. Note that when adding new instructions that you should refer to the table at the bottom of PPCSchedule.td. llvm-svn: 23830	2005-10-19 19:51:16 +00:00
Nate Begeman	9f3c26c4ea	Write patterns for the various shl and srl patterns that don't involve doing something clever. llvm-svn: 23824	2005-10-19 18:42:01 +00:00
Jim Laskey	9761100055	Push processor descriptions to the top of target and add command line info. llvm-svn: 23820	2005-10-19 13:34:52 +00:00
Chris Lattner	c16b0c387f	now that tblgen is smarter, use integers directly. This should help Andrew too llvm-svn: 23818	2005-10-19 04:32:04 +00:00
Chris Lattner	5f37623218	teach ppc backend these are copies llvm-svn: 23813	2005-10-19 01:50:36 +00:00
Chris Lattner	5b6f4dc623	Convert these cases to patterns llvm-svn: 23811	2005-10-19 01:38:02 +00:00
Nate Begeman	9eaa6bac06	Woo, it kinda works. We now generate this atrociously bad, but correct, code for long long foo(long long a, long long b) { return a + b; } _foo: or r2, r3, r3 or r3, r4, r4 or r4, r5, r5 or r5, r6, r6 rldicr r2, r2, 32, 31 rldicl r3, r3, 0, 32 rldicr r4, r4, 32, 31 rldicl r5, r5, 0, 32 or r2, r3, r2 or r3, r5, r4 add r4, r3, r2 rldicl r2, r4, 32, 32 or r4, r4, r4 or r3, r2, r2 blr llvm-svn: 23809	2005-10-19 01:12:32 +00:00
Chris Lattner	ecdf842311	apply some tblgen majik to simplify the X register definitions llvm-svn: 23805	2005-10-19 00:17:55 +00:00
Nate Begeman	5172ce641e	Teach Legalize how to do something with EXTRACT_ELEMENT when the type of the pair of elements is a legal type. llvm-svn: 23804	2005-10-19 00:06:56 +00:00
Nate Begeman	92e77502f3	Make a new reg class for 64 bit regs that aliases the 32 bit regs. This will have to tide us over until we get real subreg support, but it prevents the PrologEpilogInserter from spilling 8 byte GPRs on a G4 processor. Add some initial support for TRUNCATE and ANY_EXTEND, but they don't currently work due to issues with ScheduleDAG. Something wll have to be figured out. llvm-svn: 23803	2005-10-19 00:05:37 +00:00
Nate Begeman	78afac2ddd	Add the ability to lower return instructions to TargetLowering. This allows us to lower legal return types to something else, to meet ABI requirements (such as that i64 be returned in two i32 regs on Darwin/ppc). llvm-svn: 23802	2005-10-18 23:23:37 +00:00
Chris Lattner	0a71a9ac86	Fix Generic/2005-10-18-ZeroSizeStackObject.ll by not requesting a zero sized stack object if either the array size or the type size is zero. llvm-svn: 23801	2005-10-18 22:14:06 +00:00
Chris Lattner	8396a308a7	remove hack llvm-svn: 23797	2005-10-18 22:11:42 +00:00
Jim Laskey	d812a2e449	Simple edits; remove unimplimented cases and clarify long haul SLU cases. llvm-svn: 23788	2005-10-18 16:59:23 +00:00
Chris Lattner	5a2fb9787b	Fix the JIT encoding of LWA, LD, STD, and STDU. llvm-svn: 23787	2005-10-18 16:51:22 +00:00
Jim Laskey	c6533006c8	Checking in first round of scheduling tablegen files. Not tied in as yet. llvm-svn: 23786	2005-10-18 16:23:40 +00:00
Chris Lattner	53b9c3ad4c	add a case llvm-svn: 23785	2005-10-18 06:30:51 +00:00
Chris Lattner	45517baf9f	Add an option to this pass. If it is set, we are allowed to internalize all but main. If it's not set, we can still internalize, but only if an explicit symbol list is provided. llvm-svn: 23783	2005-10-18 06:29:22 +00:00
Chris Lattner	6c14c35bd7	Fold (select C, load A, load B) -> load (select C, A, B). This happens quite a lot throughout many programs. In particular, specfp triggers it a bunch for constant FP nodes when you have code like cond ? 1.0 : -1.0. If the PPC ISel exposed the loads implicit in pic references to external globals, we would be able to eliminate a load in cases like this as well: %X = external global int %Y = external global int int* %test4(bool %C) { %G = select bool %C, int* %X, int* %Y ret int* %G } Note that this breaks things that use SrcValue's (see the fixme), but since nothing uses them yet, this is ok. Also, simplify some code to use hasOneUse() on an SDOperand instead of hasNUsesOfValue directly. llvm-svn: 23781	2005-10-18 06:04:22 +00:00
Nate Begeman	e74dfbb9ce	Do the right thing and enable 64 bit regs under the control of a subtarget option. Currently the only way to enable this is to specify the 64bitregs mattr flag. It is never enabled by default on any config yet. llvm-svn: 23779	2005-10-18 00:56:42 +00:00
Nate Begeman	0b71e007ef	First bits of 64 bit PowerPC stuff, currently disabled. A lot of this is purely mechanical. llvm-svn: 23778	2005-10-18 00:28:58 +00:00
Nate Begeman	418c6e4045	Implement some feedback from Chris re: constant canonicalization llvm-svn: 23777	2005-10-18 00:28:13 +00:00
Nate Begeman	bd5f41a6a6	Legalize BUILD_PAIR appropriately for upcoming 64 bit PowerPC work. llvm-svn: 23776	2005-10-18 00:27:41 +00:00
Nate Begeman	ec48a1bfbd	fold fmul X, +2.0 -> fadd X, X; llvm-svn: 23774	2005-10-17 20:40:11 +00:00
Chris Lattner	da1b152c43	Make this work for FP constantexprs llvm-svn: 23773	2005-10-17 20:18:38 +00:00
Chris Lattner	7fde91e365	Oops, X+0.0 isn't foldable, but X+-0.0 is. llvm-svn: 23772	2005-10-17 17:56:38 +00:00
Chris Lattner	32979336a7	relax this a bit, as we only support the default rounding mode llvm-svn: 23771	2005-10-17 17:49:32 +00:00
Chris Lattner	eeb2bda2fa	add a trivial fold llvm-svn: 23764	2005-10-17 01:07:11 +00:00
Nate Begeman	6cca84e43c	More PPC32 -> PPC changes, as well as merging some classes that were redundant after the change. llvm-svn: 23759	2005-10-16 05:39:50 +00:00
Chris Lattner	e540800d5a	Fix this logic. llvm-svn: 23756	2005-10-15 22:35:40 +00:00
Chris Lattner	17cc9edd33	Add a case we were missing that was causing us to fail CodeGen/PowerPC/rlwinm.ll:test3 llvm-svn: 23755	2005-10-15 22:18:08 +00:00
Nate Begeman	c0896117d3	Remove some dead code now that the dag combiner exists. llvm-svn: 23754	2005-10-15 22:08:02 +00:00
Chris Lattner	d869bec4fe	Remove some dead code: the ORI/ORIS cases are autogen'd. This makes SelectIntImmediateExpr dead. llvm-svn: 23753	2005-10-15 22:06:18 +00:00
Chris Lattner	03354280eb	prune #includes llvm-svn: 23752	2005-10-15 21:58:54 +00:00
Chris Lattner	a52969c8d6	These instructions are now autogenerated llvm-svn: 23751	2005-10-15 21:44:56 +00:00
Chris Lattner	286c1d7cfa	Add a pattern for FSQRTS llvm-svn: 23750	2005-10-15 21:44:15 +00:00
Chris Lattner	efa382616b	remove dead code llvm-svn: 23749	2005-10-15 21:40:12 +00:00
Chris Lattner	b986f471be	Use getExtLoad here instead of getNode, as extloads produce two values. This fixes a legalize failure on SPASS for itanium. llvm-svn: 23747	2005-10-15 20:24:07 +00:00
Chris Lattner	e33870d154	remove broken SRA/rlwimi case llvm-svn: 23746	2005-10-15 19:04:48 +00:00
Chris Lattner	6f3b954662	Rename PPC32.h to PPC.h This completes the grand PPC file renaming llvm-svn: 23745	2005-10-14 23:59:06 +00:00
Chris Lattner	0aa794ba5b	Merge PPCJITInfo.h and PPC32JITInfo.h. Note that the PowerPCJITInfo and PPC32JITInfo classes should be merged. llvm-svn: 23744	2005-10-14 23:53:41 +00:00
Chris Lattner	bfca1ab79d	Rename PowerPC.h to PPC.h llvm-svn: 23743	2005-10-14 23:51:18 +00:00
Chris Lattner	e80bf1b33a	Rename PowerPCInstrBuilder.h -> PPC* llvm-svn: 23742	2005-10-14 23:45:43 +00:00
Chris Lattner	2ed745a905	Nuke the PowerPCTargetMachine.h header. Note that the PowerPCTargetMachine still should be merged into the PPC32TargetMachine class llvm-svn: 23741	2005-10-14 23:44:05 +00:00
Chris Lattner	7503d46feb	Rename PowerPC.td -> PPC.td llvm-svn: 23740	2005-10-14 23:40:39 +00:00
Chris Lattner	f3b97f53b9	These are dead llvm-svn: 23739	2005-10-14 23:38:51 +00:00
Chris Lattner	0921e3bfc1	Eliminate PowerPC.td and PPC32.td, consolidating them into PPC.td llvm-svn: 23738	2005-10-14 23:37:35 +00:00
Chris Lattner	09cd9e7661	Like the comment says... llvm-svn: 23737	2005-10-14 22:48:24 +00:00
Chris Lattner	2121f3ca50	Nuke PowerPCInstrFormats.h, its contents are dead. Remove the definitions from the .td file that correspond to it llvm-svn: 23736	2005-10-14 22:44:13 +00:00
Nate Begeman	9d7008b08d	Properly split f32 and f64 into separate register classes for scalar sse fp fixing a bunch of nasty hackery llvm-svn: 23735	2005-10-14 22:06:00 +00:00
Nate Begeman	c41e1be2e8	Remove an unnecsesary file. PPC32 and PPC64 share architected registers. We will decide with subtarget support whether we ever use an i64 register class. llvm-svn: 23734	2005-10-14 18:58:46 +00:00
Chris Lattner	56f31f5408	add the integer truncate/extension operations llvm-svn: 23733	2005-10-14 06:40:20 +00:00
Chris Lattner	7d9f719d42	These are now autogenerated llvm-svn: 23731	2005-10-14 06:26:29 +00:00
Chris Lattner	9c0d3c5932	Add patterns for FP round/extend llvm-svn: 23727	2005-10-14 04:55:50 +00:00
Chris Lattner	6e83cbf7f3	add a new SDTCisOpSmallerThanOp type constraint, and implement fround/fextend in terms of it llvm-svn: 23726	2005-10-14 04:55:10 +00:00
Nate Begeman	6e673b24d3	fold sext_in_reg, sext_in_reg where both have the same VT. This was popping up in Fourinarow. llvm-svn: 23722	2005-10-14 01:29:07 +00:00
Chris Lattner	e3870fbe4a	Allow $ llvm-svn: 23721	2005-10-14 01:28:34 +00:00
Nate Begeman	d59e5a7abb	Relax the checking on zextload generation a bit, since as sabre pointed out you could be AND'ing with the result of a shift that shifts out all the bits you care about, in addition to a constant. Also, move over an add/sub_parts fold from legalize to the dag combiner, where it works for things other than constants. Woot! llvm-svn: 23720	2005-10-14 01:12:21 +00:00
Chris Lattner	b8282987f4	Fix the trunc(load) case, finally allowing crafty and povray to pass llvm-svn: 23718	2005-10-13 22:10:05 +00:00
Chris Lattner	dbc5ae3109	Fix some bugs in (sext (load x)) llvm-svn: 23717	2005-10-13 21:52:31 +00:00
Chris Lattner	258521d7ea	When ExpandOp'ing a [SZ]EXTLOAD, make sure to remember that the chain is also legal. Add support for ExpandOp'ing raw EXTLOADs too. llvm-svn: 23716	2005-10-13 21:44:47 +00:00
Chris Lattner	d23f4b7411	Implement PromoteOp for *EXTLOAD, allowing MallocBench/gs to Legalize llvm-svn: 23715	2005-10-13 20:07:41 +00:00
Nate Begeman	8e022b3d89	Fix the remaining DAGCombiner issues pointed out by sabre. This should fix the remainder of the failures introduced by my patch last night. llvm-svn: 23714	2005-10-13 18:34:58 +00:00
Chris Lattner	a80f1f6e72	Fix a minor bug in the dag combiner that broke pcompress2 and some other tests. llvm-svn: 23713	2005-10-13 18:16:34 +00:00
Nate Begeman	c3a89c5259	Add support to Legalize for expanding i64 sextload/zextload into hi and lo parts. This should fix the crafty and signed long long unit test failure on x86 last night. llvm-svn: 23711	2005-10-13 17:15:37 +00:00
Jim Laskey	5d7a50ac44	Inhibit instructions from being pushed before function calls. This will minimize unnecessary spilling. llvm-svn: 23710	2005-10-13 16:44:00 +00:00
Nate Begeman	02b23c6065	Move some Legalize functionality over to the DAGCombiner where it belongs. Kill some dead code. llvm-svn: 23706	2005-10-13 03:11:28 +00:00
Nate Begeman	70d28c5e32	Fix a potential bug with two combine-to's back to back that chris pointed out, where after the first CombineTo() call, the node the second CombineTo wishes to replace may no longer exist. Fix a very real bug with the truncated load optimization on little endian targets, which do not need a byte offset added to the load. llvm-svn: 23704	2005-10-12 23:18:53 +00:00
Nate Begeman	8caf81d617	More cool stuff for the dag combiner. We can now finally handle things like turning: _foo: fctiwz f0, f1 stfd f0, -8(r1) lwz r2, -4(r1) rlwinm r3, r2, 0, 16, 31 blr into _foo: fctiwz f0,f1 stfd f0,-8(r1) lhz r3,-2(r1) blr Also removed an unncessary constraint from sra -> srl conversion, which should take care of hte only reason we would ever need to handle sra in MaskedValueIsZero, AFAIK. llvm-svn: 23703	2005-10-12 20:40:40 +00:00
Jim Laskey	63b1419b74	Finally committing to the new scheduler. Still -sched=none by default. llvm-svn: 23702	2005-10-12 18:29:35 +00:00
Jim Laskey	d00db257c7	Added graphviz/gv support for MF. llvm-svn: 23700	2005-10-12 12:09:05 +00:00
Chris Lattner	192cd18f53	Fix (hopefully the last) issue where LSR is nondeterminstic. When pulling out CSE's of base expressions it could build a result whose order was nondet. llvm-svn: 23698	2005-10-11 18:41:04 +00:00
Chris Lattner	5c9d63da31	Fix another problem where LSR was being nondeterminstic. Also remove elements from the end of a vector instead of the beginning llvm-svn: 23697	2005-10-11 18:30:57 +00:00
Chris Lattner	b7a3894e7c	Fix another lsr-is-nondeterministic case llvm-svn: 23695	2005-10-11 18:17:57 +00:00
Chris Lattner	514f058be1	Fix a powerpc crash on CodeGen/Generic/llvm-ct-intrinsics.ll llvm-svn: 23694	2005-10-11 17:56:34 +00:00
Chris Lattner	c38fb8e2a1	Add a canonicalization that got lost, fixing PowerPC/fold-li.ll:SUB llvm-svn: 23693	2005-10-11 06:07:15 +00:00
Chris Lattner	cc6e53e6ee	clean up some corner cases llvm-svn: 23692	2005-10-10 23:00:08 +00:00
Chris Lattner	04c737091f	Implement trivial DSE. If two stores are neighbors and store to the same location, replace them with a new store of the last value. This occurs in the same neighborhood in 197.parser, speeding it up about 1.5% llvm-svn: 23691	2005-10-10 22:31:19 +00:00
Chris Lattner	e260ed8628	Add support for CombineTo, allowing the dag combiner to replace nodes with multiple results. Use this support to implement trivial store->load forwarding, implementing CodeGen/PowerPC/store-load-fwd.ll. Though this is the most simple case and can be extended in the future, it is still useful. For example, it speeds up 197.parser by 6.2% by avoiding an LSU reject in xalloc: stw r6, lo16(l5_end_of_array)(r2) addi r2, r5, -4 stwx r5, r4, r2 - lwzx r5, r4, r2 - rlwinm r5, r5, 0, 0, 30 stwx r5, r4, r2 lwz r2, -4(r4) ori r2, r2, 1 llvm-svn: 23690	2005-10-10 22:04:48 +00:00
Nate Begeman	6828ed9bfd	Teach the DAGCombiner several new tricks, teaching it how to turn sext_inreg into zext_inreg based on the signbit (fires a lot), srem into urem, etc. llvm-svn: 23688	2005-10-10 21:26:48 +00:00
Chris Lattner	7730924067	Fix comment llvm-svn: 23686	2005-10-10 16:52:03 +00:00
Chris Lattner	3d1d4a3d12	Add ISD::ADD to MaskedValueIsZero llvm-svn: 23685	2005-10-10 16:51:40 +00:00
Chris Lattner	56e44a6da5	This function is now dead llvm-svn: 23684	2005-10-10 16:49:22 +00:00
Chris Lattner	bcfebebf22	Enable Nate's excellent DAG combiner work by default. This allows the removal of a bunch of ad-hoc and crufty code from SelectionDAG.cpp. llvm-svn: 23682	2005-10-10 16:47:10 +00:00
Chris Lattner	d59a57a8d5	These definitions have been moved to common code. llvm-svn: 23681	2005-10-10 06:01:00 +00:00
Chris Lattner	d83571bbf2	Pull DAG ISel generation nodes out of the PowerPC backend to where they can be used by other targets. For those targets that want to use it, have at. :) llvm-svn: 23680	2005-10-10 06:00:30 +00:00
Chris Lattner	6a49b7cabb	add a todo for something I noticed llvm-svn: 23679	2005-10-09 22:59:08 +00:00
Chris Lattner	1d3dc00674	(X & Y) & C == 0 if either X&C or Y&C are zero llvm-svn: 23678	2005-10-09 22:12:36 +00:00
Chris Lattner	03b9eb506c	Make MaskedValueIsZero a bit more aggressive llvm-svn: 23677	2005-10-09 22:08:50 +00:00
Andrew Lenharth	1dfb85c7af	This seems useful from the original patch that added the function. If there is a reason it is not useful on a RISC type target, let me know and I will pull it out llvm-svn: 23676	2005-10-09 20:11:35 +00:00
Chris Lattner	62010c450f	Fix funky xcode indentation llvm-svn: 23674	2005-10-09 06:36:35 +00:00
Chris Lattner	eb4be8b942	Hrm, you didn't see this. llvm-svn: 23673	2005-10-09 06:24:02 +00:00
Chris Lattner	4ea0a3eaac	Fix a source of non-determinism in the backend: the order of processing IV strides dependend on the pointer order of the strides in memory. Non-determinism is bad. llvm-svn: 23672	2005-10-09 06:20:55 +00:00
Chris Lattner	0832f2635a	When emiting a CopyFromReg and the source is already a vreg, do not bother creating a new vreg and inserting a copy: just use the input vreg directly. This speeds up the compile (e.g. about 5% on mesa with a debug build of llc) by not adding a bunch of copies and vregs to be coallesced away. On mesa, for example, this reduces the number of intervals from 168601 to 129040 going into the coallescer. llvm-svn: 23671	2005-10-09 05:58:56 +00:00
Chris Lattner	89c7fa22b1	Disable formation of rlwinm instructions from SRA bases. This fixes the 177.mesa failure from last night, and fixes the CodeGen/PowerPC/2005-10-08-ArithmeticRotate.ll regression test I added. If this code cannot be fixed, it should be removed for good, but I'll leave it to Nate to decide its fate. llvm-svn: 23670	2005-10-09 05:36:17 +00:00
Nate Begeman	967ce74980	Remove another unused file. Preparing for the great "enable i64 on ppc32" merge, and using subtarget info for ptr size. llvm-svn: 23668	2005-10-08 01:32:34 +00:00
Nate Begeman	af72457fc4	Remove a file that is no longer used llvm-svn: 23666	2005-10-08 01:21:27 +00:00
Nate Begeman	2042aa5b92	Lo and behold, the last bits of SelectionDAG.cpp have been moved over. llvm-svn: 23665	2005-10-08 00:29:44 +00:00
Chris Lattner	dae96f8881	When preselecting, favor things that have low depth to select first. This is faster and uses less stack space. This reduces our stack requirement enough to compile sixtrack, and though it's a hack, should be enough until we switch to iterative isel llvm-svn: 23664	2005-10-07 22:10:27 +00:00
Chris Lattner	be4bbca0ba	remove debugging code llvm-svn: 23663	2005-10-07 15:31:26 +00:00
Chris Lattner	fb12624a3f	implement CodeGen/PowerPC/div-2.ll:test2-4 by propagating zero bits through C-X's llvm-svn: 23662	2005-10-07 15:30:32 +00:00
Chris Lattner	b27a4147d3	fix indentation llvm-svn: 23660	2005-10-07 06:37:02 +00:00
Chris Lattner	5bcd0dd811	Turn sdivs into udivs when we can prove the sign bits are clear. This implements CodeGen/PowerPC/div-2.ll llvm-svn: 23659	2005-10-07 06:10:46 +00:00
Jeff Cohen	572910c9a2	Remove useless variable. llvm-svn: 23656	2005-10-07 05:28:29 +00:00
Chris Lattner	20a244577d	add a hack to work around broken VC++ scoping rules. Thx to JeffC for pointing this out to me llvm-svn: 23655	2005-10-07 05:23:36 +00:00
Chris Lattner	e373592258	Fix a CQ regression from my patch to split F32/F64 into seperate register classes on PPC. We were emitting fmr instructions to do fp extensions, which weren't getting coallesced. This fixes Regression/CodeGen/PowerPC/fpcopy.ll llvm-svn: 23654	2005-10-07 05:00:52 +00:00
Chris Lattner	cd8b421799	Fix CodeGen/Generic/bool-to-double.ll llvm-svn: 23652	2005-10-07 04:50:48 +00:00
Chris Lattner	318622fb9f	Pull out Call, reducing stack frame size from 6032 bytes to 5184 bytes. llvm-svn: 23650	2005-10-06 19:07:45 +00:00
Chris Lattner	491b8294f4	Pull out setcc, this reduces stack frame size from 7520 to 6032 bytes llvm-svn: 23649	2005-10-06 19:03:35 +00:00
Chris Lattner	502a36935e	Pull two more methods out, reducing stack frame size from 8224 -> 7520 bytes llvm-svn: 23648	2005-10-06 18:56:10 +00:00
Chris Lattner	259e6c76f2	Add a recursive-iterative hybrid stage to attempt to reduce stack space, this helps but not enough. Start pulling cases out of PPC32DAGToDAGISel::Select. With GCC 4, this function required 8512 bytes of stack space for each invocation (GCC 3 required less than 700 bytes). Pulling this first function out gets us down to 8224. More to come :( llvm-svn: 23647	2005-10-06 18:45:51 +00:00
Chris Lattner	7bf8d06f02	silence a bogus GCC warning llvm-svn: 23646	2005-10-06 17:39:10 +00:00
Chris Lattner	fabe55f155	Fix the LLC regressions on X86 last night. In particular, when undoing previous copy elisions and we discover we need to reload a register, make sure to use the regclass of the original register for the reload, not the class of the current register. This avoid using 16-bit loads to reload 32-bit values. llvm-svn: 23645	2005-10-06 17:19:06 +00:00
Andrew Lenharth	e4c91fc9e8	This is suppose to work now llvm-svn: 23644	2005-10-06 16:54:29 +00:00
Andrew Lenharth	332df13b9e	remove VAX compatibility instruction, we will never use this llvm-svn: 23643	2005-10-06 16:53:32 +00:00
Chris Lattner	4bbbb9eed7	Make the legalizer completely non-recursive llvm-svn: 23642	2005-10-06 01:20:27 +00:00
Nate Begeman	558beb3729	Let the combiner handle more cases llvm-svn: 23641	2005-10-05 21:44:43 +00:00
Nate Begeman	f8221c5e2c	Remove some bad code from Legalize llvm-svn: 23640	2005-10-05 21:44:10 +00:00
Nate Begeman	bd7df030d2	Check in some more DAGCombiner pieces llvm-svn: 23639	2005-10-05 21:43:42 +00:00
Chris Lattner	55149d7835	Fix a bug in the local spiller, where we could take code like this: store r12 -> [ss#2] R3 = load [ss#1] use R3 R3 = load [ss#2] R4 = load [ss#1] and turn it into this code: store R12 -> [ss#2] R3 = load [ss#1] use R3 R3 = R12 R4 = R3 <- oops! The problem was that promoting R3 = load[ss#2] to a copy missed the fact that the instruction invalidated R3 at that point. llvm-svn: 23638	2005-10-05 18:30:19 +00:00
Chris Lattner	05da0d966e	silence some warnings llvm-svn: 23637	2005-10-05 17:15:09 +00:00
Chris Lattner	a49e16fefa	implement visitBR_CC so that PowerPC/inverted-bool-compares.ll passes with the dag combiner. This speeds up espresso by 8%, reaching performance parity with the dag-combiner-disabled llc. llvm-svn: 23636	2005-10-05 06:47:48 +00:00
Chris Lattner	b11d15637a	fix some pastos llvm-svn: 23635	2005-10-05 06:37:22 +00:00
Chris Lattner	06f1d0f73a	Add a new HandleNode class, which is used to handle (haha) cases in the dead node elim and dag combiner passes where the root is potentially updated. This fixes a fixme in the dag combiner. llvm-svn: 23634	2005-10-05 06:35:28 +00:00
Chris Lattner	a6895d180e	Implement the code for PowerPC/inverted-bool-compares.ll, even though it that testcase still does not pass with the dag combiner. This is because not all forms of br* are folded yet. Also, when we combine a node into another one, delete the node immediately instead of waiting for the node to potentially come up in the future. llvm-svn: 23632	2005-10-05 06:11:08 +00:00
Chris Lattner	6bd8fd09b6	make sure that -view-isel-dags is the input to the isel, not the input to the second phase of dag combining llvm-svn: 23631	2005-10-05 06:09:10 +00:00
Chris Lattner	746d50a01a	Fix a crash compiling Olden/tsp llvm-svn: 23630	2005-10-05 04:45:43 +00:00
Chris Lattner	3b793c6521	refactor a bit of code. When moving constant entries in 'Map' if the entry is the representative constant for the abstractypemap, make sure to update it as well. This fixes the bcreader failures from last night on several C++ apps. llvm-svn: 23628	2005-10-04 21:35:50 +00:00
Chris Lattner	dff59118c6	Minor speedup to avoid array searches given a Use*. This speeds up bc reading of the python test from 1:00 to 54s. llvm-svn: 23627	2005-10-04 18:47:09 +00:00
Chris Lattner	7a1450dbc6	Change the signature of replaceUsesOfWithOnConstant. The bool was always true dynamically. Finally, pass the Use* that replaceAllUsesWith has into the method for future use. llvm-svn: 23626	2005-10-04 18:13:04 +00:00
Chris Lattner	935aa922e3	For large constants (e.g. arrays and structs with many elements) just creating the keys and doing comparisons to index into 'Map' takes a lot of time. For these large constants, keep an inverse map so that 'remove' and move operations are much faster. This speeds up a release build of the bc reader on Eric's nasty python bytecode file from 1:39 to 1:00s. llvm-svn: 23624	2005-10-04 17:48:46 +00:00
Chris Lattner	5bbf60a5b6	minor cleanup/fastpath for the bcreader. This speeds up the bcreader from 1:41 -> 1:39 on the large python .bc file in a release build. llvm-svn: 23623	2005-10-04 16:52:46 +00:00
Jim Laskey	327d4298e1	Reverting to version - until problem isolated. llvm-svn: 23622	2005-10-04 16:41:51 +00:00
Chris Lattner	d1a5bc8dbd	Add a forward def llvm-svn: 23621	2005-10-04 05:09:20 +00:00
Nate Begeman	5da6908d65	Fix some faulty logic in the libcall inserter. Since calls return more than one value, don't bail if one of their uses happens to be a node that's not an MVT::Other when following the chain from CALLSEQ_START to CALLSEQ_END. Once we've found a CALLSEQ_START, we can just return; there's no need to tail-recurse further up the graph. Most importantly, just because something only has one use doesn't mean we should use it's one use to follow from start to end. This faulty logic caused us to follow a chain of one-use FP operations back to a much earlier call, putting a cycle in the graph from a later start to an earlier end. This is a better fix that reverting to the workaround committed earlier today. llvm-svn: 23620	2005-10-04 02:10:55 +00:00
Chris Lattner	8760ec73d8	implement the struct version of the array speedup, speeding up the testcase a bit more from 1:48 -> 1.40. llvm-svn: 23619	2005-10-04 01:17:50 +00:00
Chris Lattner	20b0754c41	Fix DemoteRegToStack on an invoke. This fixes PR634. llvm-svn: 23618	2005-10-04 00:44:01 +00:00
Nate Begeman	54fb5002e5	Add back a workaround that fixes some breakages from chris's last change. Neither of us have yet figured out why this code is necessary, but stuff breaks if its not there. Still tracking this down... llvm-svn: 23617	2005-10-04 00:37:37 +00:00
Chris Lattner	4c3b2b536c	Clean up the code a bit. Use isInstructionTriviallyDead to be more aggressive and more correct than use_empty(). This fixes PR635 and SimplifyCFG/2005-10-02-InvokeSimplify.ll llvm-svn: 23616	2005-10-03 23:43:43 +00:00
Chris Lattner	b64419ac40	Change ConstantArray::replaceUsesOfWithOnConstant to attempt to update constant arrays in place instead of reallocating them and replaceAllUsesOf'ing the result. This speeds up a release build of the bcreader from: 136.987u 120.866s 4:24.38 to 49.790u 49.890s 1:40.14 ... a 2.6x speedup parsing a large python bc file. llvm-svn: 23614	2005-10-03 22:51:37 +00:00
Chris Lattner	c4062ba65f	move some methods, no other changes llvm-svn: 23613	2005-10-03 21:58:36 +00:00
Chris Lattner	0144fadc17	minor microoptimizations llvm-svn: 23612	2005-10-03 21:56:24 +00:00
Chris Lattner	bad09e71d0	Use a map to cache the ModuleType information, so we can do logarithmic lookups instead of linear time lookups. This speeds up bc parsing of a large file from 137.834u 118.256s 4:27.96 to 132.611u 114.436s 4:08.53 with a release build. llvm-svn: 23611	2005-10-03 21:26:53 +00:00
Jim Laskey	409a6b204e	Refactor gathering node info and emission. llvm-svn: 23610	2005-10-03 12:30:32 +00:00
Chris Lattner	57b21f9f10	clean up this code a bit, no functionality change llvm-svn: 23609	2005-10-03 07:22:07 +00:00
Chris Lattner	afef68baff	Speed up the asm printer a lot by not printing formatted LLVM asm output for globals llvm-svn: 23608	2005-10-03 07:08:36 +00:00
Chris Lattner	5f096e2847	Break the body of the loop out into a new method llvm-svn: 23606	2005-10-03 04:47:08 +00:00
Chris Lattner	f07a587c79	Make IVUseShouldUsePostIncValue more aggressive when the use is a PHI. In particular, it should realize that phi's use their values in the pred block not the phi block itself. This change turns our em3d loop from this: _test: cmpwi cr0, r4, 0 bgt cr0, LBB_test_2 ; entry.no_exit_crit_edge LBB_test_1: ; entry.loopexit_crit_edge li r2, 0 b LBB_test_6 ; loopexit LBB_test_2: ; entry.no_exit_crit_edge li r6, 0 LBB_test_3: ; no_exit or r2, r6, r6 lwz r6, 0(r3) cmpw cr0, r6, r5 beq cr0, LBB_test_6 ; loopexit LBB_test_4: ; endif addi r3, r3, 4 addi r6, r2, 1 cmpw cr0, r6, r4 blt cr0, LBB_test_3 ; no_exit LBB_test_5: ; endif.loopexit.loopexit_crit_edge addi r3, r2, 1 blr LBB_test_6: ; loopexit or r3, r2, r2 blr into: _test: cmpwi cr0, r4, 0 bgt cr0, LBB_test_2 ; entry.no_exit_crit_edge LBB_test_1: ; entry.loopexit_crit_edge li r2, 0 b LBB_test_5 ; loopexit LBB_test_2: ; entry.no_exit_crit_edge li r6, 0 LBB_test_3: ; no_exit lwz r2, 0(r3) cmpw cr0, r2, r5 or r2, r6, r6 beq cr0, LBB_test_5 ; loopexit LBB_test_4: ; endif addi r3, r3, 4 addi r6, r6, 1 cmpw cr0, r6, r4 or r2, r6, r6 blt cr0, LBB_test_3 ; no_exit LBB_test_5: ; loopexit or r3, r2, r2 blr Unfortunately, this is actually worse code, because the register coallescer is getting confused somehow. If it were doing its job right, it could turn the code into this: _test: cmpwi cr0, r4, 0 bgt cr0, LBB_test_2 ; entry.no_exit_crit_edge LBB_test_1: ; entry.loopexit_crit_edge li r6, 0 b LBB_test_5 ; loopexit LBB_test_2: ; entry.no_exit_crit_edge li r6, 0 LBB_test_3: ; no_exit lwz r2, 0(r3) cmpw cr0, r2, r5 beq cr0, LBB_test_5 ; loopexit LBB_test_4: ; endif addi r3, r3, 4 addi r6, r6, 1 cmpw cr0, r6, r4 blt cr0, LBB_test_3 ; no_exit LBB_test_5: ; loopexit or r3, r6, r6 blr ... which I'll work on next. :) llvm-svn: 23604	2005-10-03 02:50:05 +00:00
Chris Lattner	e4ed42a426	Refactor some code into a function llvm-svn: 23603	2005-10-03 01:04:44 +00:00
Chris Lattner	360928dbed	This break is bogus and I have no idea why it was there. Basically it prevents memoizing code when IV's are used by phinodes outside of loops. In a simple example, we were getting this code before (note that r6 and r7 are isomorphic IV's): li r6, 0 or r7, r6, r6 LBB_test_3: ; no_exit lwz r2, 0(r3) cmpw cr0, r2, r5 or r2, r7, r7 beq cr0, LBB_test_5 ; loopexit LBB_test_4: ; endif addi r2, r7, 1 addi r7, r7, 1 addi r3, r3, 4 addi r6, r6, 1 cmpw cr0, r6, r4 blt cr0, LBB_test_3 ; no_exit Now we get: li r6, 0 LBB_test_3: ; no_exit or r2, r6, r6 lwz r6, 0(r3) cmpw cr0, r6, r5 beq cr0, LBB_test_6 ; loopexit LBB_test_4: ; endif addi r3, r3, 4 addi r6, r2, 1 cmpw cr0, r6, r4 blt cr0, LBB_test_3 ; no_exit this was noticed in em3d. llvm-svn: 23602	2005-10-03 00:37:33 +00:00
Chris Lattner	8fcce170cf	when checking if we should move a split edge block outside of a loop, check the presplit pred, not the post-split pred. This was causing us to make the wrong decision in some cases, leaving the critical edge block in the loop. llvm-svn: 23601	2005-10-03 00:31:52 +00:00
Chris Lattner	9cfccfb517	Fix a problem where the legalizer would run out of stack space on extremely large basic blocks because it was purely recursive. This switches it to an iterative/recursive hybrid. llvm-svn: 23596	2005-10-02 17:49:46 +00:00
Chris Lattner	7f718e61e8	silence a bogus warning llvm-svn: 23595	2005-10-02 16:30:51 +00:00
Chris Lattner	9982da2703	silence some warnings llvm-svn: 23594	2005-10-02 16:29:36 +00:00
Chris Lattner	c0e655b65d	silence a warning llvm-svn: 23593	2005-10-02 16:27:59 +00:00
Chris Lattner	68303a78ff	add patterns for float binops and fma ops llvm-svn: 23592	2005-10-02 07:46:28 +00:00
Chris Lattner	98da1d9910	Sort the cpu and features table, so that the alpha backend doesn't fail EVERY compile with an assertion that the tables are not sorted! llvm-svn: 23591	2005-10-02 07:13:52 +00:00
Chris Lattner	704d97f8b2	Add assertions to the trivial scheduler to check that the value types match up between defs and uses. llvm-svn: 23590	2005-10-02 07:10:55 +00:00
Chris Lattner	3734d204b8	another solution to the fsel issue. Instead of having 4 variants, just force the comparison to be 64-bits. This is fine because extensions from float to double are free. llvm-svn: 23589	2005-10-02 07:07:49 +00:00
Chris Lattner	9e98672962	fsel can take a different FP type for the comparison and for the result. As such split the FSEL family into 4 things instead of just two. llvm-svn: 23588	2005-10-02 06:58:23 +00:00
Chris Lattner	a17e6c486c	fix an f32/f64 type mismatch llvm-svn: 23587	2005-10-02 06:37:13 +00:00
Chris Lattner	a038d901fb	Codegen CopyFromReg using the regclass that matches the valuetype of the destination vreg. llvm-svn: 23586	2005-10-02 06:34:16 +00:00
Chris Lattner	4155ae0f74	Adjust to change in ctor llvm-svn: 23585	2005-10-02 06:23:51 +00:00
Chris Lattner	5ab9d42bb4	Minor tweak to the branch selector. When emitting a two-way branch, and if we're in a single-mbb loop, make sure to emit the backwards branch as the conditional branch instead of the uncond branch. For example, emit this: LBBl29_z__44: stw r9, 0(r15) stw r9, 4(r15) stw r9, 8(r15) stw r9, 12(r15) addi r15, r15, 16 addi r8, r8, 1 cmpw cr0, r8, r28 ble cr0, LBBl29_z__44 b LBBl29_z__48 * NOT PART OF LOOP Instead of: LBBl29_z__44: stw r9, 0(r15) stw r9, 4(r15) stw r9, 8(r15) stw r9, 12(r15) addi r15, r15, 16 addi r8, r8, 1 cmpw cr0, r8, r28 bgt cr0, LBBl29_z__48 * PART OF LOOP! b LBBl29_z__44 The former sequence has one fewer dispatch group for the loop body. llvm-svn: 23582	2005-10-01 23:06:26 +00:00

... 3 4 5 6 7 ...

11537 Commits