llvm-project

Commit Graph

Author	SHA1	Message	Date
Dan Gohman	47a7d6fafe	Factor the addressing mode and the load/store VT out of LoadSDNode and StoreSDNode into their common base class LSBaseSDNode. Member functions getLoadedVT and getStoredVT are replaced with the common getMemoryVT to simplify code that will handle both loads and stores. llvm-svn: 46538	2008-01-30 00:15:11 +00:00
Chris Lattner	89f36e6b21	Finally implement correct ordered comparisons for PPC, even though the code generated is not wonderful. This turns a miscompilation into a code quality bug (noted in the ppc readme). This fixes PR642, which is over 2 years old (!). Nate, please review this. llvm-svn: 45742	2008-01-08 06:46:30 +00:00
Chris Lattner	03ad885039	rename TargetInstrDescriptor -> TargetInstrDesc. Make MachineInstr::getDesc return a reference instead of a pointer, since it can never be null. llvm-svn: 45695	2008-01-07 07:27:27 +00:00
Chris Lattner	b0d06b4381	Move a bunch more accessors from TargetInstrInfo to TargetInstrDescriptor llvm-svn: 45680	2008-01-07 03:13:06 +00:00
Chris Lattner	a98c679de0	Rename MachineInstr::getInstrDescriptor -> getDesc(), which reflects that it is cheap and efficient to get. Move a variety of predicates from TargetInstrInfo into TargetInstrDescriptor, which makes it much easier to query a predicate when you don't have TII around. Now you can use MI->getDesc()->isBranch() instead of going through TII, and this is much more efficient anyway. Not all of the predicates have been moved over yet. Update old code that used MI->getInstrDescriptor()->Flags to use the new predicates in many places. llvm-svn: 45674	2008-01-07 01:56:04 +00:00
Chris Lattner	a10fff51d9	Rename SSARegMap -> MachineRegisterInfo in keeping with the idea that "machine" classes are used to represent the current state of the code being compiled. Given this expanded name, we can start moving other stuff into it. For now, move the UsedPhysRegs and LiveIn/LoveOuts vectors from MachineFunction into it. Update all the clients to match. This also reduces some needless #includes, such as MachineModuleInfo from MachineFunction. llvm-svn: 45467	2007-12-31 04:13:23 +00:00
Chris Lattner	f3ebc3f3d2	Remove attribution from file headers, per discussion on llvmdev. llvm-svn: 45418	2007-12-29 20:36:04 +00:00
Evan Cheng	ec271b104c	Temporary solution: added a different set of BCTRL_Macho / BCTRL_ELF with right callee-saved defs set for ppc64. llvm-svn: 43248	2007-10-23 06:42:42 +00:00
Evan Cheng	58d1eacd80	Prevent PPC::BCC first operand, the PRED number, from being isel'd into a LI instruction. llvm-svn: 37790	2007-06-29 01:25:06 +00:00
Dan Gohman	309d3d51b3	Move ComputeMaskedBits, MaskedValueIsZero, and ComputeNumSignBits from TargetLowering to SelectionDAG so that they have more convenient access to the current DAG, in preparation for the ValueType routines being changed from standalone functions to members of SelectionDAG for the pre-legalize vector type changes. llvm-svn: 37704	2007-06-22 14:59:07 +00:00
Chris Lattner	3e21eb7fd7	Fix a bug which caused us to never be able to use signed comparisons for equality comparisons of a constant. This allows us to codegen the 'sintzero' loop in PR1288 as: LBB1_1: ;cond_next li r4, 0 addi r2, r2, 1 stw r4, 0(r3) addi r3, r3, 4 cmpwi cr0, r2, -1 bne cr0, LBB1_1 ;cond_next instead of: LBB1_1: ;cond_next addi r2, r2, 1 li r4, 0 xoris r5, r2, 65535 stw r4, 0(r3) addi r3, r3, 4 cmplwi cr0, r5, 65535 bne cr0, LBB1_1 ;cond_next This implements CodeGen/PowerPC/compare-simm.ll, and also cuts 74 instructions out of kc++. llvm-svn: 35590	2007-04-02 05:59:42 +00:00
Chris Lattner	1ef9cd400d	eliminate static ctors for Statistic objects. llvm-svn: 32703	2006-12-19 22:59:26 +00:00
Jim Laskey	095e6f3044	Reduce number of instructions to load 64-bit constants. llvm-svn: 32481	2006-12-12 13:23:43 +00:00
Bill Wendling	9bfb1e1f29	What should be the last unnecessary <iostream>s in the library. llvm-svn: 32333	2006-12-07 22:21:48 +00:00
Chris Lattner	700b873130	Detemplatize the Statistic class. The only type it is instantiated with is 'unsigned'. llvm-svn: 32279	2006-12-06 17:46:33 +00:00
Evan Cheng	20350c4025	Change MachineInstr ctor's to take a TargetInstrDescriptor reference instead of opcode and number of operands. llvm-svn: 31947	2006-11-27 23:37:22 +00:00
Chris Lattner	be9377a1e3	convert PPC::BCC to use the 'pred' operand instead of separate predicate value and CR reg #. This requires swapping the order of these everywhere that touches BCC and requires us to write custom matching logic for PPCcondbranch :( llvm-svn: 31835	2006-11-17 22:37:34 +00:00
Chris Lattner	e0263794f4	rename PPC::COND_BRANCH to PPC::BCC llvm-svn: 31834	2006-11-17 22:14:47 +00:00
Chris Lattner	8c6a41ea12	start using PPC predicates more consistently. llvm-svn: 31833	2006-11-17 22:10:59 +00:00
Chris Lattner	6f5840c409	add patterns for ppc32 preinc stores. ppc64 next. llvm-svn: 31775	2006-11-16 00:41:37 +00:00
Chris Lattner	474b5b7c95	fix ldu/stu jit encoding. Swith 64-bit preinc load instrs to use memri addrmodes. llvm-svn: 31757	2006-11-15 19:55:13 +00:00
Chris Lattner	b542925b22	remove a ton of custom selection logic no longer needed llvm-svn: 31733	2006-11-14 18:43:11 +00:00
Chris Lattner	c5102bfc7c	allow the offset of a preinc'd load to be the low-part of a global. This produces this clever code: _millisecs: lis r2, ha16(_Time.1182) lwzu r3, lo16(_Time.1182)(r2) lwz r2, 4(r2) addic r4, r2, 1 addze r3, r3 blr instead of this: _millisecs: lis r2, ha16(_Time.1182) la r3, lo16(_Time.1182)(r2) lwz r2, lo16(_Time.1182)(r2) lwz r3, 4(r3) addic r4, r3, 1 addze r3, r2 blr for: long %millisecs() { %tmp = load long* %Time.1182 ; <long> [#uses=1] %tmp1 = add long %tmp, 1 ; <long> [#uses=1] ret long %tmp1 } llvm-svn: 31673	2006-11-11 04:53:30 +00:00
Chris Lattner	c9fa36d706	implement preinc support for r+i loads on ppc64 llvm-svn: 31654	2006-11-10 23:58:45 +00:00
Chris Lattner	ce6455489a	add an initial cut at preinc loads for ppc32. This is broken for ppc64 (because the 64-bit reg target versions aren't implemented yet), doesn't support r+r addr modes, and doesn't handle stores, but it works otherwise. :) This is disabled unless -enable-ppc-preinc is passed to llc for now. llvm-svn: 31621	2006-11-10 02:08:47 +00:00
Evan Cheng	6cd0909da7	Match tblegen changes. llvm-svn: 31571	2006-11-08 20:34:28 +00:00
Chris Lattner	a801fcedd3	Refactor all the addressing mode selection stuff into the isel lowering class, where it can be used for preinc formation. llvm-svn: 31536	2006-11-08 02:15:41 +00:00
Reid Spencer	de46e48420	For PR786: Turn on -Wunused and -Wno-unused-parameter. Clean up most of the resulting fall out by removing unused variables. Remaining warnings have to do with unused functions (I didn't want to delete code without review) and unused variables in generated code. Maintainers should clean up the remaining issues when they see them. All changes pass DejaGnu tests and Olden. llvm-svn: 31380	2006-11-02 20:25:50 +00:00
Chris Lattner	3e36e07db2	fix miscompilation of llvm.isunordered, where we branched on the opposite condition. This fixes miscompilation of Olden/bh and many others. llvm-svn: 31301	2006-10-30 23:02:25 +00:00
Nate Begeman	d31efd190f	Fold AND and ROTL more often llvm-svn: 30577	2006-09-22 05:01:56 +00:00
Chris Lattner	da9b1a9322	Improve PPC64 equality comparisons like PPC32 comparisons. llvm-svn: 30510	2006-09-20 04:33:27 +00:00
Chris Lattner	aa3926b7ea	Two improvements: 1. Codegen this comparison: if (X == 0x8000) as: cmplwi cr0, r3, 32768 bne cr0, LBB1_2 ;cond_next instead of: lis r2, 0 ori r2, r2, 32768 cmpw cr0, r3, r2 bne cr0, LBB1_2 ;cond_next 2. Codegen this comparison: if (X == 0x12345678) as: xoris r2, r3, 4660 cmplwi cr0, r2, 22136 bne cr0, LBB1_2 ;cond_next instead of: lis r2, 4660 ori r2, r2, 22136 cmpw cr0, r3, r2 bne cr0, LBB1_2 ;cond_next llvm-svn: 30509	2006-09-20 04:25:47 +00:00
Chris Lattner	3d27be1333	s\|llvm/Support/Visibility.h\|llvm/Support/Compiler.h\| llvm-svn: 29911	2006-08-27 12:54:02 +00:00
Evan Cheng	c3acfc0b10	Do not use getTargetNode() and SelectNodeTo() which takes more than 3 SDOperand arguments. Use the variants which take an array and number instead. llvm-svn: 29907	2006-08-27 08:14:06 +00:00
Evan Cheng	34b70eea5c	SelectNodeTo now returns a SDNode*. llvm-svn: 29901	2006-08-26 08:00:10 +00:00
Evan Cheng	61413a3d72	Select() no longer require Result operand by reference. llvm-svn: 29898	2006-08-26 05:34:46 +00:00
Evan Cheng	ab8297f92d	Match tblgen changes. llvm-svn: 29895	2006-08-26 01:07:58 +00:00
Chris Lattner	bc485fdc4c	Fix PowerPC/2006-08-15-SelectionCrash.ll and simplify selection code. llvm-svn: 29715	2006-08-15 23:48:22 +00:00
Evan Cheng	bd1c5a8fb8	Match tablegen changes. llvm-svn: 29604	2006-08-11 09:08:15 +00:00
Chris Lattner	c24a1d3093	Start eliminating temporary vectors used to create DAG nodes. Instead, pass in the start of an array and a count of operands where applicable. In many cases, the number of operands is known, so this static array can be allocated on the stack, avoiding the heap. In many other cases, a SmallVector can be used, which has the same benefit in the common cases. I updated a lot of code calling getNode that takes a vector, but ran out of time. The rest of the code should be updated, and these methods should be removed. We should also do the same thing to eliminate the methods that take a vector of MVT::ValueTypes. It would be extra nice to convert the dagiselemitter to avoid creating vectors for operands when calling getTargetNode. llvm-svn: 29566	2006-08-08 02:23:42 +00:00
Evan Cheng	b9d34bd098	Match tablegen isel changes. llvm-svn: 29549	2006-08-07 22:28:20 +00:00
Evan Cheng	b572401bea	Remove InFlightSet hack. No longer needed. llvm-svn: 29373	2006-07-28 00:47:19 +00:00
Evan Cheng	f300896420	Remove NodeDepth llvm-svn: 29338	2006-07-27 06:40:15 +00:00
Chris Lattner	2f8c2d8ef2	shrink libllvmgcc.dylib another 25K llvm-svn: 28971	2006-06-28 22:00:36 +00:00
Chris Lattner	ca9c488528	Don't match 64-bit bitfield inserts into rlwimi's. todo add rldimi. :) llvm-svn: 28944	2006-06-27 21:08:52 +00:00
Chris Lattner	f882c54505	Fix ppc64 jump tables llvm-svn: 28941	2006-06-27 20:46:17 +00:00
Chris Lattner	9a40cca40f	Fix variable shadowing issue llvm-svn: 28922	2006-06-27 00:10:13 +00:00
Chris Lattner	97b3da1519	Implement a bunch of 64-bit cleanliness work. With this, treeadd builds (but doesn't work right). llvm-svn: 28921	2006-06-27 00:04:13 +00:00
Chris Lattner	b055c8737f	Work around a nasty tblgen bug where it doesn't add operands for varargs nodes correctly. llvm-svn: 28745	2006-06-10 01:15:02 +00:00
Chris Lattner	1fbb0d38c7	Fix build failure of povray llvm-svn: 28473	2006-05-25 18:06:16 +00:00
Chris Lattner	630bbcef8d	Fix Benchmarks/MallocBench/cfrac llvm-svn: 28471	2006-05-25 16:54:16 +00:00
Evan Cheng	4af59dac0b	Assert if InflightSet is not cleared after instruction selecting a BB. llvm-svn: 28459	2006-05-25 00:24:28 +00:00
Evan Cheng	1a8e74d113	Clear HandleMap and ReplaceMap after instruction selection. Or it may cause non-deterministic behavior. llvm-svn: 28454	2006-05-24 20:46:25 +00:00
Chris Lattner	eb755fc1b3	Make PPC call lowering more aggressive, making the isel matching code simple enough to be autogenerated. llvm-svn: 28354	2006-05-17 19:00:46 +00:00
Chris Lattner	b1e9e37c58	Switch PPC over to a call-selection model where the lowering code creates the copyto/fromregs instead of making the PPCISD::CALL selection code create them. This vastly simplifies the selection code, and moves the ABI handling parts into one place. llvm-svn: 28346	2006-05-17 06:01:33 +00:00
Chris Lattner	f058f5aef1	implement passing/returning vector regs to calls, at least non-varargs calls. llvm-svn: 28341	2006-05-16 23:54:25 +00:00
Chris Lattner	a296339c87	Fix PowerPC/2006-05-12-rlwimi-crash.ll Nate, please verify that if InsertMask is 0, rlwimi shouldn't be used. This fixes the crash and causes no PPC testsuite regressions. llvm-svn: 28243	2006-05-12 16:29:37 +00:00
Nate Begeman	9b6d4c2968	Fold more shifts into inserts, and update the README llvm-svn: 28168	2006-05-08 17:38:32 +00:00
Nate Begeman	dc996b3f6c	Update some stuff now that the new rlwimi code has gone in llvm-svn: 28162	2006-05-08 02:52:38 +00:00
Nate Begeman	1333cead5b	New rlwimi implementation, which is superior to the old one. There are still a couple missed optimizations, but we now generate all the possible rlwimis for multiple inserts into the same bitfield. More regression tests to come. llvm-svn: 28156	2006-05-07 00:23:38 +00:00
Nate Begeman	4ca2ea5b43	JumpTable support! What this represents is working asm and jit support for x86 and ppc for 100% dense switch statements when relocations are non-PIC. This support will be extended and enhanced in the coming days to support PIC, and less dense forms of jump tables. llvm-svn: 27947	2006-04-22 18:53:45 +00:00
Chris Lattner	0a3d1bbca4	Add VRRC select support llvm-svn: 27543	2006-04-08 22:45:08 +00:00
Chris Lattner	6961fc76bb	Codegen vector predicate compares. llvm-svn: 27151	2006-03-26 10:06:40 +00:00
Chris Lattner	5d70a7c4a5	#include Intrinsics.h into all dag isels llvm-svn: 27109	2006-03-25 06:47:10 +00:00
Chris Lattner	f2286d5917	Like the comment says, prefer to use the implicit add done by [r+r] addressing modes than emitting an explicit add and using a base of r0. This implements Regression/CodeGen/PowerPC/mem-rr-addr-mode.ll llvm-svn: 27068	2006-03-24 17:58:06 +00:00
Chris Lattner	77373d1bea	Add support for "ri" addressing modes where the immediate is a 14-bit field which is shifted left two bits before use. Instructions like STD use this addressing mode. llvm-svn: 26942	2006-03-22 05:26:03 +00:00
Chris Lattner	bda7310ef7	With Evan's latest tblgen patch, this code is obsolete, thanks Evan! llvm-svn: 26917	2006-03-21 06:37:40 +00:00
Chris Lattner	c8b16d00b9	Handle constant addresses more efficiently, folding the low bits into the disp field of the load/store if possible. This compiles CodeGen/PowerPC/load-constant-addr.ll to: _test: lis r2, 2838 lfs f1, 26848(r2) blr instead of: _test: lis r2, 2838 ori r2, r2, 26848 lfs f1, 0(r2) blr llvm-svn: 26908	2006-03-20 22:38:22 +00:00
Chris Lattner	eda030da04	reenable this hack, the tblgen version isn't quite ready llvm-svn: 26902	2006-03-20 17:54:43 +00:00
Evan Cheng	89f3cff0f5	Use tblgen'd VECTOR_SHUFFLE selection code. llvm-svn: 26900	2006-03-20 08:14:16 +00:00
Chris Lattner	a9a1313386	Add support for generating vspltw, instead of a vperm instruction with a constant pool load. This generates significantly nicer code for splats. When tblgen gets bugfixed, we can remove the custom selection code. llvm-svn: 26898	2006-03-20 06:51:10 +00:00
Nate Begeman	bb01d4f272	Remove BRTWOWAY* Make the PPC backend not dependent on BRTWOWAY_CC and make the branch selector smarter about the code it generates, fixing a case in the readme. llvm-svn: 26814	2006-03-17 01:40:33 +00:00
Chris Lattner	1678a6c477	Save/restore VRSAVE once per function, not once per block. llvm-svn: 26793	2006-03-16 18:25:23 +00:00
Chris Lattner	ab1ed2aa96	Fix an off by one error that caused PPC LLC failures last night. llvm-svn: 26758	2006-03-14 17:56:49 +00:00
Evan Cheng	2dd2c652b2	Added getTargetLowering() to TargetMachine. Refactored targets to support this. llvm-svn: 26742	2006-03-13 23:20:37 +00:00
Chris Lattner	02e2c18c9c	For functions that use vector registers, save VRSAVE, mark used registers, and update it on entry to each function, then restore it on exit. This compiles: void func(vfloat a, vfloat b, vfloat c) { a = b c + c; } to this: _func: mfspr r2, 256 oris r6, r2, 49152 mtspr 256, r6 lvx v0, 0, r5 lvx v1, 0, r4 vmaddfp v0, v1, v0, v0 stvx v0, 0, r3 mtspr 256, r2 blr GCC produces this (which has additional stack accesses): _func: mfspr r0,256 stw r0,-4(r1) oris r0,r0,0xc000 mtspr 256,r0 lvx v0,0,r5 lvx v1,0,r4 lwz r12,-4(r1) vmaddfp v0,v0,v1,v0 stvx v0,0,r3 mtspr 256,r12 blr llvm-svn: 26733	2006-03-13 21:52:10 +00:00
Chris Lattner	51348c5f27	Several big changes: 1. Use flags on the instructions in the .td file to indicate the PPC970 unit type instead of a table in the .cpp file. Much cleaner. 2. Change the hazard recognizer to build d-groups according to the actual algorithm used, not my flawed understanding of it. 3. Model "must be in the first slot" and "must be the only instr in a group" accurately. llvm-svn: 26719	2006-03-12 09:13:49 +00:00
Chris Lattner	543832d39d	Change the interface for getting a target HazardRecognizer to be more clean. llvm-svn: 26608	2006-03-08 04:25:59 +00:00
Chris Lattner	2cab13573c	Implement a very very simple hazard recognizer for LSU rejects and ctr set/read flushes llvm-svn: 26587	2006-03-07 06:32:48 +00:00
Chris Lattner	60a60f4b1e	Implement CodeGen/PowerPC/or-addressing-mode.ll, which is also PR668. llvm-svn: 26450	2006-03-01 07:14:48 +00:00
Chris Lattner	a1ec1ddd59	Implement selection of inline asm memory operands llvm-svn: 26348	2006-02-24 02:13:12 +00:00
Nate Begeman	5965bd19f8	kill ADD_PARTS & SUB_PARTS and replace them with fancy new ADDC, ADDE, SUBC and SUBE nodes that actually expose what's going on and allow for significant simplifications in the targets. llvm-svn: 26255	2006-02-17 05:43:56 +00:00
Evan Cheng	42c01c8d39	If the false case is the current basic block, then this is a self loop. We do not want to emit "Loop: ... brcond Out; br Loop", as it adds an extra instruction in the loop. Instead, invert the condition and emit "Loop: ... br!cond Loop; br Out. Generalize the fix by moving it from PPCDAGToDAGISel to SelectionDAGLowering. llvm-svn: 26231	2006-02-16 08:27:56 +00:00
Evan Cheng	d1b82d8db0	Match getTargetNode() changes (now return SDNode* instead of SDOperand). llvm-svn: 26085	2006-02-09 07:17:49 +00:00
Evan Cheng	6dc90ca172	Change Select() from SDOperand Select(SDOperand N); to void Select(SDOperand &Result, SDOperand N); llvm-svn: 26067	2006-02-09 00:37:58 +00:00
Evan Cheng	bfa4b7cc75	Complex pattern isel code shouldn't select nodes. llvm-svn: 26010	2006-02-05 08:45:01 +00:00
Evan Cheng	54cb1833a4	Use SelectRoot() as entry of any tblgen based isel. llvm-svn: 25997	2006-02-05 06:46:41 +00:00
Chris Lattner	f424a66524	Use PPCISD::CALL instead of ISD::CALL llvm-svn: 25717	2006-01-27 23:34:02 +00:00
Chris Lattner	de02d7727f	Add explicit #includes of <iostream> llvm-svn: 25515	2006-01-22 23:41:00 +00:00
Chris Lattner	5bd514d7b0	Use the default impl of DYNAMIC_STACKALLOC, allowing us to delete some code. llvm-svn: 25334	2006-01-15 09:02:48 +00:00
Chris Lattner	1014b38404	these cases are autogenerated llvm-svn: 25238	2006-01-12 02:01:45 +00:00
Chris Lattner	44416f92f1	remove dead code llvm-svn: 25237	2006-01-12 01:54:15 +00:00
Chris Lattner	20c88dfd1b	Fix a compile crash building MultiSource/Applications/d with the new front-end. The PPC backend was generating random shift counts in this case, due to an uninitialized variable. llvm-svn: 25114	2006-01-05 18:32:49 +00:00
Nate Begeman	9aea6e4691	Fix one of the things in the todo file, and get a bit closer to folding constant offsets from statics into the address arithmetic. llvm-svn: 24999	2005-12-24 01:00:15 +00:00
Nate Begeman	b11b8e44fa	Pattern-match return. Includes gross hack! llvm-svn: 24874	2005-12-20 00:26:01 +00:00
Nate Begeman	c126397a69	Fix a couple of the FIXMEs, thanks to suggestion from Chris. This allows us to load and store vectors directly at a pointer (offset of zero) by using r0 as the base register. This also requires some asm printer work to satisfy the darwin assembler. For void %foo(<4 x float> * %a) { entry: %tmp1 = load <4 x float> * %a; %tmp2 = add <4 x float> %tmp1, %tmp1 store <4 x float> %tmp2, <4 x float> *%a ret void } We now produce: _foo: lvx v0, 0, r3 vaddfp v0, v0, v0 stvx v0, 0, r3 blr Instead of: _foo: li r2, 0 lvx v0, r2, r3 vaddfp v0, v0, v0 stvx v0, r2, r3 blr llvm-svn: 24872	2005-12-19 23:40:42 +00:00
Nate Begeman	8e6a8af205	Convert load/store over to being pattern matched llvm-svn: 24871	2005-12-19 23:25:09 +00:00
Chris Lattner	2f5fb6a720	This is handled by the autogen'd code llvm-svn: 24834	2005-12-18 21:06:11 +00:00
Nate Begeman	808f7a8abb	Remove a now unused statistic. llvm-svn: 24720	2005-12-14 22:56:16 +00:00
Nate Begeman	e37cb604c1	Use the new predicate support that Evan Cheng added to remove some code from the DAGToDAG cpp file. This adds pattern support for vector and scalar fma, which passes test/Regression/CodeGen/PowerPC/fma.ll, and does the right thing in the presence of -disable-excess-fp-precision. Allows us to match: void %foo(<4 x float> * %a) { entry: %tmp1 = load <4 x float> * %a; %tmp2 = mul <4 x float> %tmp1, %tmp1 %tmp3 = add <4 x float> %tmp2, %tmp1 store <4 x float> %tmp3, <4 x float> *%a ret void } As: _foo: li r2, 0 lvx v0, r2, r3 vmaddfp v0, v0, v0, v0 stvx v0, r2, r3 blr Or, with llc -disable-excess-fp-precision, _foo: li r2, 0 lvx v0, r2, r3 vxor v1, v1, v1 vmaddfp v1, v0, v0, v1 vaddfp v0, v1, v0 stvx v0, r2, r3 blr llvm-svn: 24719	2005-12-14 22:54:33 +00:00
Nate Begeman	4e56db674c	Add support for TargetConstantPool nodes to the dag isel emitter, and use them in the PPC backend, to simplify some logic out of Select and SelectAddr. llvm-svn: 24657	2005-12-10 02:36:00 +00:00
Chris Lattner	de085f0165	Silence another annoying GCC warning llvm-svn: 24627	2005-12-06 20:56:18 +00:00
Chris Lattner	efc86f5f7a	The basic fneg cases are already autogen'd llvm-svn: 24592	2005-12-04 19:04:38 +00:00
Chris Lattner	f979794717	Autogen matching code for ADJCALLSTACK[UP\|DOWN], thanks to Evan's tblgen improvements. llvm-svn: 24591	2005-12-04 19:01:59 +00:00
Chris Lattner	fd857daa0d	Finish moving uncond br over to .td file, remove from .cpp file. llvm-svn: 24590	2005-12-04 18:48:01 +00:00
Chris Lattner	df9287836e	Make sure these get added into the codegenmap when appropriate llvm-svn: 24566	2005-12-01 18:09:22 +00:00
Chris Lattner	bd099102f0	Fix a regression caused by a patch earlier today llvm-svn: 24561	2005-12-01 03:50:19 +00:00
Evan Cheng	d94aa71e1a	Use a getCopyToReg() variant to generate a flaggy CopyToReg node. llvm-svn: 24558	2005-12-01 00:41:50 +00:00
Chris Lattner	e318977940	SelectNodeTo now returns N. Use it instead of return N directly. llvm-svn: 24549	2005-11-30 22:53:06 +00:00
Nate Begeman	1064d6ec43	First chunk of actually generating vector code for packed types. These changes allow us to generate the following code: _foo: li r2, 0 lvx v0, r2, r3 vaddfp v0, v0, v0 stvx v0, r2, r3 blr for this llvm: void %foo(<4 x float>* %a) { entry: %tmp1 = load <4 x float>* %a %tmp2 = add <4 x float> %tmp1, %tmp1 store <4 x float> %tmp2, <4 x float>* %a ret void } llvm-svn: 24534	2005-11-30 08:22:07 +00:00
Chris Lattner	5aba6ae3b3	Enable global address legalization, fixing a todo and allowing the removal of some code. This exposes the implicit load from the stubs to the DAG, allowing them to be optimized by the dag combiner. It also moves darwin specific stuff out of the isel into the legalizer, and allows more to be moved to the .td file. llvm-svn: 24397	2005-11-17 18:26:56 +00:00
Chris Lattner	0fe88e3f32	Teach the selector to fold lo(g) into load instruction immediate fields llvm-svn: 24396	2005-11-17 18:02:16 +00:00
Chris Lattner	595088aa0f	Add an initial hack at legalizing GlobalAddress into the appropriate nodes on Darwin to remove smarts from the isel. This is currently disabled by default (uncomment setOperationAction(ISD::GlobalAddress to enable it). tblgen needs to become smarter about tglobaladdr nodes and bigger patterns needed to be added to the .td file. However, we can currently emit stuff like this: :) li r2, lo16(L_x$non_lazy_ptr) lis r3, ha16(L_x$non_lazy_ptr) lwzx r2, r3, r2 The obvious improvements will follow. llvm-svn: 24390	2005-11-17 07:30:41 +00:00
Chris Lattner	b7025749e1	When lowering direct calls, lower them to use a targetglobaladress directly instead of a globaladdress. This has no effect on the generated code at all. llvm-svn: 24386	2005-11-17 05:56:14 +00:00
Nate Begeman	a171f6b20c	Patch to clean up function call pseudos and support the BLA instruction, which branches to an absolute address. This is required to support objc direct dispatch. llvm-svn: 24370	2005-11-16 00:48:01 +00:00
Chris Lattner	7ca53a5783	Don't emit "32" for unordered comparison llvm-svn: 24073	2005-10-28 22:58:07 +00:00
Chris Lattner	f8899a6877	add a hack to get code with ordered comparisons working. This hack is tracked as PR642 llvm-svn: 24068	2005-10-28 20:49:47 +00:00
Chris Lattner	5d6cb604de	add support for branch on ordered/unordered. llvm-svn: 24067	2005-10-28 20:32:44 +00:00
Chris Lattner	81ff73ec46	autogen undef llvm-svn: 23991	2005-10-25 21:03:41 +00:00
Chris Lattner	261009a4df	Autogen fsel llvm-svn: 23987	2005-10-25 20:55:47 +00:00
Chris Lattner	cd7f101c9a	Autogen a few new ppc-specific nodes llvm-svn: 23985	2005-10-25 20:41:46 +00:00
Chris Lattner	26ee5953f7	The dag isel generator generates this now llvm-svn: 23984	2005-10-25 20:36:10 +00:00
Chris Lattner	c0a201c318	Be a bit more paranoid about calling SelectNodeTo llvm-svn: 23982	2005-10-25 20:26:41 +00:00
Chris Lattner	e1fd05ebde	Fix a couple of minor bugs. The first fixes povray, the second fixes things if the dag combiner isn't run llvm-svn: 23981	2005-10-25 19:32:37 +00:00
Chris Lattner	e296949fbe	Instead of aborting if not a case we can handle specially, break out and let the generic code handle it. This fixes CodeGen/Generic/2005-10-21-longlonggtu.ll on ppc. also, reindent this code llvm-svn: 23874	2005-10-21 21:17:10 +00:00
Nate Begeman	4dd383120f	Invert the TargetLowering flag that controls divide by consant expansion. Add a new flag to TargetLowering indicating if the target has really cheap signed division by powers of two, make ppc use it. This will probably go away in the future. Implement some more ISD::SDIV folds in the dag combiner Remove now dead code in the x86 backend. llvm-svn: 23853	2005-10-21 00:02:42 +00:00
Nate Begeman	c6f067a8c4	Move the target constant divide optimization up into the dag combiner, so that the nodes can be folded with other nodes, and we can not duplicate code in every backend. Alpha will probably want this too. llvm-svn: 23835	2005-10-20 02:15:44 +00:00
Nate Begeman	9f3c26c4ea	Write patterns for the various shl and srl patterns that don't involve doing something clever. llvm-svn: 23824	2005-10-19 18:42:01 +00:00
Chris Lattner	5b6f4dc623	Convert these cases to patterns llvm-svn: 23811	2005-10-19 01:38:02 +00:00
Nate Begeman	9eaa6bac06	Woo, it kinda works. We now generate this atrociously bad, but correct, code for long long foo(long long a, long long b) { return a + b; } _foo: or r2, r3, r3 or r3, r4, r4 or r4, r5, r5 or r5, r6, r6 rldicr r2, r2, 32, 31 rldicl r3, r3, 0, 32 rldicr r4, r4, 32, 31 rldicl r5, r5, 0, 32 or r2, r3, r2 or r3, r5, r4 add r4, r3, r2 rldicl r2, r4, 32, 32 or r4, r4, r4 or r3, r2, r2 blr llvm-svn: 23809	2005-10-19 01:12:32 +00:00
Nate Begeman	92e77502f3	Make a new reg class for 64 bit regs that aliases the 32 bit regs. This will have to tide us over until we get real subreg support, but it prevents the PrologEpilogInserter from spilling 8 byte GPRs on a G4 processor. Add some initial support for TRUNCATE and ANY_EXTEND, but they don't currently work due to issues with ScheduleDAG. Something wll have to be figured out. llvm-svn: 23803	2005-10-19 00:05:37 +00:00
Nate Begeman	78afac2ddd	Add the ability to lower return instructions to TargetLowering. This allows us to lower legal return types to something else, to meet ABI requirements (such as that i64 be returned in two i32 regs on Darwin/ppc). llvm-svn: 23802	2005-10-18 23:23:37 +00:00
Nate Begeman	0b71e007ef	First bits of 64 bit PowerPC stuff, currently disabled. A lot of this is purely mechanical. llvm-svn: 23778	2005-10-18 00:28:58 +00:00
Nate Begeman	6cca84e43c	More PPC32 -> PPC changes, as well as merging some classes that were redundant after the change. llvm-svn: 23759	2005-10-16 05:39:50 +00:00
Chris Lattner	d869bec4fe	Remove some dead code: the ORI/ORIS cases are autogen'd. This makes SelectIntImmediateExpr dead. llvm-svn: 23753	2005-10-15 22:06:18 +00:00
Chris Lattner	a52969c8d6	These instructions are now autogenerated llvm-svn: 23751	2005-10-15 21:44:56 +00:00
Chris Lattner	efa382616b	remove dead code llvm-svn: 23749	2005-10-15 21:40:12 +00:00
Chris Lattner	6f3b954662	Rename PPC32.h to PPC.h This completes the grand PPC file renaming llvm-svn: 23745	2005-10-14 23:59:06 +00:00
Chris Lattner	bfca1ab79d	Rename PowerPC.h to PPC.h llvm-svn: 23743	2005-10-14 23:51:18 +00:00
Chris Lattner	0921e3bfc1	Eliminate PowerPC.td and PPC32.td, consolidating them into PPC.td llvm-svn: 23738	2005-10-14 23:37:35 +00:00
Chris Lattner	7d9f719d42	These are now autogenerated llvm-svn: 23731	2005-10-14 06:26:29 +00:00
Chris Lattner	89c7fa22b1	Disable formation of rlwinm instructions from SRA bases. This fixes the 177.mesa failure from last night, and fixes the CodeGen/PowerPC/2005-10-08-ArithmeticRotate.ll regression test I added. If this code cannot be fixed, it should be removed for good, but I'll leave it to Nate to decide its fate. llvm-svn: 23670	2005-10-09 05:36:17 +00:00
Chris Lattner	dae96f8881	When preselecting, favor things that have low depth to select first. This is faster and uses less stack space. This reduces our stack requirement enough to compile sixtrack, and though it's a hack, should be enough until we switch to iterative isel llvm-svn: 23664	2005-10-07 22:10:27 +00:00
Chris Lattner	318622fb9f	Pull out Call, reducing stack frame size from 6032 bytes to 5184 bytes. llvm-svn: 23650	2005-10-06 19:07:45 +00:00
Chris Lattner	491b8294f4	Pull out setcc, this reduces stack frame size from 7520 to 6032 bytes llvm-svn: 23649	2005-10-06 19:03:35 +00:00
Chris Lattner	502a36935e	Pull two more methods out, reducing stack frame size from 8224 -> 7520 bytes llvm-svn: 23648	2005-10-06 18:56:10 +00:00
Chris Lattner	259e6c76f2	Add a recursive-iterative hybrid stage to attempt to reduce stack space, this helps but not enough. Start pulling cases out of PPC32DAGToDAGISel::Select. With GCC 4, this function required 8512 bytes of stack space for each invocation (GCC 3 required less than 700 bytes). Pulling this first function out gets us down to 8224. More to come :( llvm-svn: 23647	2005-10-06 18:45:51 +00:00
Chris Lattner	3734d204b8	another solution to the fsel issue. Instead of having 4 variants, just force the comparison to be 64-bits. This is fine because extensions from float to double are free. llvm-svn: 23589	2005-10-02 07:07:49 +00:00
Chris Lattner	9e98672962	fsel can take a different FP type for the comparison and for the result. As such split the FSEL family into 4 things instead of just two. llvm-svn: 23588	2005-10-02 06:58:23 +00:00
Chris Lattner	5ab9d42bb4	Minor tweak to the branch selector. When emitting a two-way branch, and if we're in a single-mbb loop, make sure to emit the backwards branch as the conditional branch instead of the uncond branch. For example, emit this: LBBl29_z__44: stw r9, 0(r15) stw r9, 4(r15) stw r9, 8(r15) stw r9, 12(r15) addi r15, r15, 16 addi r8, r8, 1 cmpw cr0, r8, r28 ble cr0, LBBl29_z__44 b LBBl29_z__48 * NOT PART OF LOOP Instead of: LBBl29_z__44: stw r9, 0(r15) stw r9, 4(r15) stw r9, 8(r15) stw r9, 12(r15) addi r15, r15, 16 addi r8, r8, 1 cmpw cr0, r8, r28 bgt cr0, LBBl29_z__48 * PART OF LOOP! b LBBl29_z__44 The former sequence has one fewer dispatch group for the loop body. llvm-svn: 23582	2005-10-01 23:06:26 +00:00
Chris Lattner	8713ebf37c	fix typo llvm-svn: 23578	2005-10-01 02:51:36 +00:00
Chris Lattner	d3eee1a09b	Modify the ppc backend to use two register classes for FP: F8RC and F4RC. These are used to represent float and double values, and the two regclasses contain the same physical registers. llvm-svn: 23577	2005-10-01 01:35:02 +00:00
Jim Laskey	f61232354f	Should be using flag and not chain. llvm-svn: 23572	2005-09-30 23:43:37 +00:00
Chris Lattner	1de5706e68	Remove code for patterns that are autogenerated llvm-svn: 23532	2005-09-29 23:33:31 +00:00
Chris Lattner	08c319fbdd	Never rely on ReplaceAllUsesWith when selecting, use CodeGenMap instead. ReplaceAllUsesWith does not replace scalars SDOperand floating around on the stack, permitting things to be selected multiple times. llvm-svn: 23515	2005-09-29 00:59:32 +00:00
Chris Lattner	b9b2e77295	Autogen MUL, move FP cases together llvm-svn: 23512	2005-09-28 22:53:16 +00:00
Chris Lattner	5769311c92	disentangle FP from INT versions of div/mul llvm-svn: 23511	2005-09-28 22:50:24 +00:00
Chris Lattner	585131baaf	Use the autogenerated matcher for ADD/SUB llvm-svn: 23510	2005-09-28 22:47:28 +00:00
Chris Lattner	d3ea19b51a	Add FP versions of the binary operators, keeping the int and fp worlds seperate. llvm-svn: 23506	2005-09-28 22:29:58 +00:00
Chris Lattner	fab48b3285	All (xor *) cases are autogenerated now llvm-svn: 23497	2005-09-28 18:12:37 +00:00
Chris Lattner	33f8e08c8f	Implement PowerPC/eqv-andc-orc-nor.ll:EQV3 llvm-svn: 23494	2005-09-28 18:04:52 +00:00
Chris Lattner	bb5939a436	These nodes are all autogenerated llvm-svn: 23489	2005-09-28 17:07:09 +00:00
Chris Lattner	c628f00845	Make sure to clear the CodeGenMap after each basic block is selected to avoid cross MBB pollution. llvm-svn: 23470	2005-09-27 17:45:33 +00:00
Chris Lattner	b011cb2746	we don't need this proto any longer llvm-svn: 23342	2005-09-13 22:05:21 +00:00
Chris Lattner	03e08eefc7	move the #include for the generated code into the isel class body so we can use/define class methods llvm-svn: 23339	2005-09-13 22:03:06 +00:00
Chris Lattner	4309c3a785	PowerPC cannot truncstore i1 natively llvm-svn: 23304	2005-09-10 00:21:06 +00:00
Chris Lattner	498915dafa	Remove some cases handled by the generated portion of the isel llvm-svn: 23262	2005-09-07 23:45:15 +00:00
Nate Begeman	6095214bf0	Implement i64<->fp using the fctidz/fcfid instructions on PowerPC when we are allowed to generate 64-bit-only PowerPC instructions for 32 bit hosts, such as the PowerPC 970. This speeds up 189.lucas from 81.99 to 32.64 seconds. llvm-svn: 23250	2005-09-06 22:03:27 +00:00
Chris Lattner	8ae9525bd0	include the dag isel fragment llvm-svn: 23239	2005-09-03 01:17:22 +00:00
Chris Lattner	5f12cf14be	Change the isel to not break out of the big giant switch. Instead, the switch should never be exited, so its bottom is now unreachable. llvm-svn: 23234	2005-09-03 00:53:47 +00:00
Chris Lattner	a305d28cf6	Implement dynamic allocas correctly. In particular, because we were copying directly out of R1 (without using a CopyFromReg, which uses a chain), multiple allocas were getting CSE'd together, producing bogus code. For this: int %foo(bool %X, int %A, int %B) { br bool %X, label %T, label %F F: %G = alloca int %H = alloca int store int %A, int* %G store int %B, int* %H %R = load int* %G ret int %R T: ret int 0 } We were generating: _foo: stwu r1, -16(r1) stw r31, 4(r1) or r31, r1, r1 stw r1, 12(r31) cmpwi cr0, r3, 0 bne cr0, .LBB_foo_2 ; T .LBB_foo_1: ; F li r2, 16 subf r2, r2, r1 ;; One alloca or r1, r2, r2 or r3, r1, r1 or r1, r2, r2 or r2, r1, r1 stw r4, 0(r3) stw r5, 0(r2) lwz r3, 0(r3) lwz r1, 12(r31) lwz r31, 4(r31) lwz r1, 0(r1) blr .LBB_foo_2: ; T li r3, 0 lwz r1, 12(r31) lwz r31, 4(r31) lwz r1, 0(r1) blr Now we generate: _foo: stwu r1, -16(r1) stw r31, 4(r1) or r31, r1, r1 stw r1, 12(r31) cmpwi cr0, r3, 0 bne cr0, .LBB_foo_2 ; T .LBB_foo_1: ; F or r2, r1, r1 li r3, 16 subf r2, r3, r2 ;; Alloca 1 or r1, r2, r2 or r2, r1, r1 or r6, r1, r1 subf r3, r3, r6 ;; Alloca 2 or r1, r3, r3 or r3, r1, r1 stw r4, 0(r2) stw r5, 0(r3) lwz r3, 0(r2) lwz r1, 12(r31) lwz r31, 4(r31) lwz r1, 0(r1) blr .LBB_foo_2: ; T li r3, 0 lwz r1, 12(r31) lwz r31, 4(r31) lwz r1, 0(r1) blr This fixes Povray and SPASS with the dag isel, the last two failing cases. Tommorow we will hopefully turn it on by default! :) llvm-svn: 23190	2005-09-01 21:31:30 +00:00
Chris Lattner	293b3a68e0	Fix a bug where we were useing HA to get the high part, which seems like it could cause a miscompile. Fixing this didn't fix the two programs that fail though. :( This also changes the implementation to follow the pattern selector more closely, causing us to select 0 to li instead of lis. llvm-svn: 23189	2005-09-01 19:38:28 +00:00
Chris Lattner	34182aff7f	Do not select the operands being passed into SelectCC. IT does this itself and selecting early prevents folding immediates into the cmpw* instructions llvm-svn: 23188	2005-09-01 19:20:44 +00:00
Chris Lattner	da2e04c69d	Move FCTIWZ handling out of the instruction selectors and into legalization, getting them out of the business of making stack slots. llvm-svn: 23180	2005-08-31 21:09:52 +00:00
Chris Lattner	6bad1fb19e	Remove dead code llvm-svn: 23179	2005-08-31 20:25:15 +00:00
Chris Lattner	2bd2af8ecd	add assert zext/sext to the dag isel llvm-svn: 23171	2005-08-31 18:08:46 +00:00
Chris Lattner	f4d594370b	Fix 'ret long' to return the high and lo parts in the right registers. This fixes crafty and probably others. llvm-svn: 23167	2005-08-31 01:34:29 +00:00
Chris Lattner	69e9a9a94c	now that physregs can exist in the same dag with multiple types, remove some ugly hacks llvm-svn: 23162	2005-08-30 22:59:48 +00:00
Chris Lattner	8f8d539746	Fix type mismatches when passing f32 values to calls llvm-svn: 23159	2005-08-30 21:28:19 +00:00
Chris Lattner	9f23ae226f	Fix some indentation (first hunks). Remove code (last hunk) that miscompiled immediate and's, such as and uint %tmp.30, 4294958079 into andi. r8, r8, 56319 andis. r8, r8, 65535 instead of: li r9, -9217 and r8, r8, r9 The first always generates zero. This fixes espresso. llvm-svn: 23155	2005-08-30 18:37:48 +00:00
Chris Lattner	6a41fd75cd	Fix a problem Nate found where we swapped the operands of SHL/SHR_PARTS. This fixes fourinarow llvm-svn: 23153	2005-08-30 17:42:59 +00:00
Chris Lattner	bdf3d3defb	codegen ADD_PARTS correctly: put the results in the right registers! This fixes fhourstones llvm-svn: 23152	2005-08-30 17:40:13 +00:00
Chris Lattner	45706e9fb8	add operands in the right order, fixing McCat/18-imp with the dag isel llvm-svn: 23150	2005-08-30 17:13:58 +00:00
Chris Lattner	7a59b1cf90	Make sure the selector emits register register copies with flag operands linking them to calls when appropriate, this prevents the scheduler from pulling these copies away from the call. This fixes Ptrdist/yacr2 llvm-svn: 23143	2005-08-30 01:57:02 +00:00
Chris Lattner	e413b60632	The first operand to AND does not always have more than two operands. This fixes MediaBench/toast with the dag selector llvm-svn: 23141	2005-08-30 00:59:16 +00:00
Chris Lattner	61f7c3e843	emit FMR instructions to convert f64<->f32 instructions, so things like STOREs, know the right type to store. llvm-svn: 23139	2005-08-30 00:30:43 +00:00
Chris Lattner	12357281b8	fix a crash in cfrac llvm-svn: 23137	2005-08-29 23:49:25 +00:00
Chris Lattner	1cbbe1015a	Implement DYNAMIC_STACKALLOC, wrap some long lines llvm-svn: 23136	2005-08-29 23:30:11 +00:00
Chris Lattner	b2b418509b	Fix a dumb bug of mine where we were mishandling the PPC ABI (undef handling). This fixes voronoi and bh in Olden, allowing all of olden to pass! llvm-svn: 23133	2005-08-29 22:22:57 +00:00
Chris Lattner	c429ab2fb1	Fix a bug the last patch exposed in treeadd among others llvm-svn: 23127	2005-08-29 01:07:02 +00:00
Chris Lattner	d4d683a47b	A hack to fix a problem folding immedaites. This fixes Olden/power. llvm-svn: 23126	2005-08-29 01:01:01 +00:00
Chris Lattner	3ccad3fb8c	Fix order of operands for copytoreg node when emitting calls. This fixes Olden/msFix order of operands for copytoreg node when emitting calls. This fixes Olden/mstt. llvm-svn: 23125	2005-08-29 00:26:57 +00:00
Chris Lattner	66ddc8d3bf	add operands in the correct order llvm-svn: 23123	2005-08-29 00:02:01 +00:00
Chris Lattner	dfcde88d07	Fix a bug in FP_EXTEND, implement FP_TO_SINT llvm-svn: 23121	2005-08-28 23:59:09 +00:00
Chris Lattner	38660c6666	fix an assertion failure in treeadd llvm-svn: 23120	2005-08-28 23:39:22 +00:00
Chris Lattner	9b577f108a	implement SELECT_CC fully for the DAG->DAG isel! llvm-svn: 23101	2005-08-26 21:23:58 +00:00
Chris Lattner	b2854fadda	Make fsel emission work with both the pattern and dag-dag selectors, by giving it a non-instruction opcode. The dag->dag selector used to not select the operands of the fsel, because it thought that whole tree was already selected. llvm-svn: 23091	2005-08-26 20:25:03 +00:00
Chris Lattner	bec817ce6f	implement the fold for: bool %test(int %X, int %Y) { %C = setne int %X, 0 ret bool %C } to: _test: addic r2, r3, -1 subfe r3, r2, r3 blr llvm-svn: 23089	2005-08-26 18:46:49 +00:00
Chris Lattner	a9e6a82d66	Changes to adjust to new ReplaceAllUsesWith syntax. Change FP_EXTEND to just return its input, instead of emitting an explicit copy. llvm-svn: 23088	2005-08-26 18:37:23 +00:00
Chris Lattner	c75e047245	now that fsel is formed during legalization, this code is dead llvm-svn: 23084	2005-08-26 17:40:39 +00:00

... 2 3 4 5 6 ...

390 Commits