llvm-project

Commit Graph

Author	SHA1	Message	Date
Bill Schmidt	27917785ae	Large code model support for PowerPC. Large code model is identical to medium code model except that the addis/addi sequence for "local" accesses is never used. All accesses use the addis/ld sequence. The coding changes are straightforward; most of the patch is taken up with creating variants of the medium model tests for large model. llvm-svn: 175767	2013-02-21 17:12:27 +00:00
Bill Schmidt	3822ef2c0c	Relocation enablement for PPC DAG postprocessing pass llvm-svn: 175693	2013-02-21 00:05:29 +00:00
Bill Schmidt	9f0b4ec0f5	This patch improves the 64-bit PowerPC InitialExec TLS support by providing for a wider range of GOT entries that can hold thread-relative offsets. This matches the behavior of GCC, which was not documented in the PPC64 TLS ABI. The ABI will be updated with the new code sequence. Former sequence: ld 9,x@got@tprel(2) add 9,9,x@tls New sequence: addis 9,2,x@got@tprel@ha ld 9,x@got@tprel@l(9) add 9,9,x@tls Note that a linker optimization exists to transform the new sequence into the shorter sequence when appropriate, by replacing the addis with a nop and modifying the base register and relocation type of the ld. llvm-svn: 170209	2012-12-14 17:02:38 +00:00
Bill Schmidt	24b8dd6eb7	This patch implements local-dynamic TLS model support for the 64-bit PowerPC target. This is the last of the four models, so we now have full TLS support. This is mostly a straightforward extension of the general dynamic model. I had to use an additional Chain operand to tie ADDIS_DTPREL_HA to the register copy following ADDI_TLSLD_L; otherwise everything above the ADDIS_DTPREL_HA appeared dead and was removed. As before, there are new test cases to test the assembly generation, and the relocations output during integrated assembly. The expected code gen sequence can be read in test/CodeGen/PowerPC/tls-ld.ll. There are a couple of things I think can be done more efficiently in the overall TLS code, so there will likely be a clean-up patch forthcoming; but for now I want to be sure the functionality is in place. Bill llvm-svn: 170003	2012-12-12 19:29:35 +00:00
Bill Schmidt	c56f1d34bc	This patch implements the general dynamic TLS model for 64-bit PowerPC. Given a thread-local symbol x with global-dynamic access, the generated code to obtain x's address is: Instruction Relocation Symbol addis ra,r2,x@got@tlsgd@ha R_PPC64_GOT_TLSGD16_HA x addi r3,ra,x@got@tlsgd@l R_PPC64_GOT_TLSGD16_L x bl __tls_get_addr(x@tlsgd) R_PPC64_TLSGD x R_PPC64_REL24 __tls_get_addr nop <use address in r3> The implementation borrows from the medium code model work for introducing special forms of ADDIS and ADDI into the DAG representation. This is made slightly more complicated by having to introduce a call to the external function __tls_get_addr. Using the full call machinery is overkill and, more importantly, makes it difficult to add a special relocation. So I've introduced another opcode GET_TLS_ADDR to represent the function call, and surrounded it with register copies to set up the parameter and return value. Most of the code is pretty straightforward. I ran into one peculiarity when I introduced a new PPC opcode BL8_NOP_ELF_TLSGD, which is just like BL8_NOP_ELF except that it takes another parameter to represent the symbol ("x" above) that requires a relocation on the call. Something in the TblGen machinery causes BL8_NOP_ELF and BL8_NOP_ELF_TLSGD to be treated identically during the emit phase, so this second operand was never visited to generate relocations. This is the reason for the slightly messy workaround in PPCMCCodeEmitter.cpp:getDirectBrEncoding(). Two new tests are included to demonstrate correct external assembly and correct generation of relocations using the integrated assembler. Comments welcome! Thanks, Bill llvm-svn: 169910	2012-12-11 20:30:11 +00:00
Bill Schmidt	ca4a0c9dbd	This patch introduces initial-exec model support for thread-local storage on 64-bit PowerPC ELF. The patch includes code to handle external assembly and MC output with the integrated assembler. It intentionally does not support the "old" JIT. For the initial-exec TLS model, the ABI requires the following to calculate the address of external thread-local variable x: Code sequence Relocation Symbol ld 9,x@got@tprel(2) R_PPC64_GOT_TPREL16_DS x add 9,9,x@tls R_PPC64_TLS x The register 9 is arbitrary here. The linker will replace x@got@tprel with the offset relative to the thread pointer to the generated GOT entry for symbol x. It will replace x@tls with the thread-pointer register (13). The two test cases verify correct assembly output and relocation output as just described. PowerPC-specific selection node variants are added for the two instructions above: LD_GOT_TPREL and ADD_TLS. These are inserted when an initial-exec global variable is encountered by PPCTargetLowering::LowerGlobalTLSAddress(), and later lowered to machine instructions LDgotTPREL and ADD8TLS. LDgotTPREL is a pseudo that uses the same LDrs support added for medium code model's LDtocL, with a different relocation type. The rest of the processing is straightforward. llvm-svn: 169281	2012-12-04 16:18:08 +00:00
Bill Schmidt	34627e3434	This patch implements medium code model support for 64-bit PowerPC. The default for 64-bit PowerPC is small code model, in which TOC entries must be addressable using a 16-bit offset from the TOC pointer. Additionally, only TOC entries are addressed via the TOC pointer. With medium code model, TOC entries and data sections can all be addressed via the TOC pointer using a 32-bit offset. Cooperation with the linker allows 16-bit offsets to be used when these are sufficient, reducing the number of extra instructions that need to be executed. Medium code model also does not generate explicit TOC entries in ".section toc" for variables that are wholly internal to the compilation unit. Consider a load of an external 4-byte integer. With small code model, the compiler generates: ld 3, .LC1@toc(2) lwz 4, 0(3) .section .toc,"aw",@progbits .LC1: .tc ei[TC],ei With medium model, it instead generates: addis 3, 2, .LC1@toc@ha ld 3, .LC1@toc@l(3) lwz 4, 0(3) .section .toc,"aw",@progbits .LC1: .tc ei[TC],ei Here .LC1@toc@ha is a relocation requesting the upper 16 bits of the 32-bit offset of ei's TOC entry from the TOC base pointer. Similarly, .LC1@toc@l is a relocation requesting the lower 16 bits. Note that if the linker determines that ei's TOC entry is within a 16-bit offset of the TOC base pointer, it will replace the "addis" with a "nop", and replace the "ld" with the identical "ld" instruction from the small code model example. Consider next a load of a function-scope static integer. For small code model, the compiler generates: ld 3, .LC1@toc(2) lwz 4, 0(3) .section .toc,"aw",@progbits .LC1: .tc test_fn_static.si[TC],test_fn_static.si .type test_fn_static.si,@object .local test_fn_static.si .comm test_fn_static.si,4,4 For medium code model, the compiler generates: addis 3, 2, test_fn_static.si@toc@ha addi 3, 3, test_fn_static.si@toc@l lwz 4, 0(3) .type test_fn_static.si,@object .local test_fn_static.si .comm test_fn_static.si,4,4 Again, the linker may replace the "addis" with a "nop", calculating only a 16-bit offset when this is sufficient. Note that it would be more efficient for the compiler to generate: addis 3, 2, test_fn_static.si@toc@ha lwz 4, test_fn_static.si@toc@l(3) The current patch does not perform this optimization yet. This will be addressed as a peephole optimization in a later patch. For the moment, the default code model for 64-bit PowerPC will remain the small code model. We plan to eventually change the default to medium code model, which matches current upstream GCC behavior. Note that the different code models are ABI-compatible, so code compiled with different models will be linked and execute correctly. I've tested the regression suite and the application/benchmark test suite in two ways: Once with the patch as submitted here, and once with additional logic to force medium code model as the default. The tests all compile cleanly, with one exception. The mandel-2 application test fails due to an unrelated ABI compatibility with passing complex numbers. It just so happens that small code model was incredibly lucky, in that temporary values in floating-point registers held the expected values needed by the external library routine that was called incorrectly. My current thought is to correct the ABI problems with _Complex before making medium code model the default, to avoid introducing this "regression." Here are a few comments on how the patch works, since the selection code can be difficult to follow: The existing logic for small code model defines three pseudo-instructions: LDtoc for most uses, LDtocJTI for jump table addresses, and LDtocCPT for constant pool addresses. These are expanded by SelectCodeCommon(). The pseudo-instruction approach doesn't work for medium code model, because we need to generate two instructions when we match the same pattern. Instead, new logic in PPCDAGToDAGISel::Select() intercepts the TOC_ENTRY node for medium code model, and generates an ADDIStocHA followed by either a LDtocL or an ADDItocL. These new node types correspond naturally to the sequences described above. The addis/ld sequence is generated for the following cases: * Jump table addresses * Function addresses * External global variables * Tentative definitions of global variables (common linkage) The addis/addi sequence is generated for the following cases: * Constant pool entries * File-scope static global variables * Function-scope static variables Expanding to the two-instruction sequences at select time exposes the instructions to subsequent optimization, particularly scheduling. The rest of the processing occurs at assembly time, in PPCAsmPrinter::EmitInstruction. Each of the instructions is converted to a "real" PowerPC instruction. When a TOC entry needs to be created, this is done here in the same manner as for the existing LDtoc, LDtocJTI, and LDtocCPT pseudo-instructions (I factored out a new routine to handle this). I had originally thought that if a TOC entry was needed for LDtocL or ADDItocL, it would already have been generated for the previous ADDIStocHA. However, at higher optimization levels, the ADDIStocHA may appear in a different block, which may be assembled textually following the block containing the LDtocL or ADDItocL. So it is necessary to include the possibility of creating a new TOC entry for those two instructions. Note that for LDtocL, we generate a new form of LD called LDrs. This allows specifying the @toc@l relocation for the offset field of the LD instruction (i.e., the offset is replaced by a SymbolLo relocation). When the peephole optimization described above is added, we will need to do similar things for all immediate-form load and store operations. The seven "mcm-n.ll" test cases are kept separate because otherwise the intermingling of various TOC entries and so forth makes the tests fragile and hard to understand. The above assumes use of an external assembler. For use of the integrated assembler, new relocations are added and used by PPCELFObjectWriter. Testing is done with "mcm-obj.ll", which tests for proper generation of the various relocations for the same sequences tested with the external assembler. llvm-svn: 168708	2012-11-27 17:35:46 +00:00
Ulrich Weigand	0f79500af5	Fix wrong PowerPC instruction opcodes for: - lwaux - lhzux - stbu llvm-svn: 167863	2012-11-13 19:21:31 +00:00
Ulrich Weigand	a82389b3d0	Fix wrong PowerPC instruction encodings due to operand field name mismatches in: - AForm_3 (fmul, fmuls) - XFXForm_5 (mtcrf) - XFLForm (mtfsf) llvm-svn: 167862	2012-11-13 19:19:46 +00:00
Ulrich Weigand	0117718580	Fix instruction encoding for "bd(n)z" on PowerPC, by using a new instruction format BForm_1. llvm-svn: 167861	2012-11-13 19:15:52 +00:00
Ulrich Weigand	84ee76acfe	Fix instruction encoding for "isel" on PowerPC, using a new instruction format AForm_4. llvm-svn: 167860	2012-11-13 19:14:19 +00:00
Will Schmidt	314c6c4c2b	- Mark the BCC and BLR defs as isCodeGenOnly per error output from llvm-tblgen -gen-asm-matcher. PPCInstrInfo.td \| 11 ++++++----- 1 file changed, 6 insertions(+), 5 deletions(-) llvm-svn: 165315	2012-10-05 15:16:11 +00:00
Will Schmidt	4a67f2e2a7	- add tokens to PPCInstrInfo.td and PPCInstr64Bit.td to resolve "Instruction 'foo' has no tokens" errors during llvm-tblgen -gen-asm-matcher attempts. At this time, the added tokens are "#comment" style rather than the actual mnemonic. This will be revisited once the rest of the base asmparser bits get straightened out for ppc64-elf-linux. llvm-svn: 165237	2012-10-04 18:14:28 +00:00
Hal Finkel	efe4a44106	Move the PPC TOC defs into the PPC64 InstrInfo file. Since TOC is just defined for PPC64, move its definition to PPC64 td file. Patch by Adhemerval Zanella. llvm-svn: 163234	2012-09-05 19:22:27 +00:00
Hal Finkel	679c73cb33	Split several PPC instruction classes. Slight reorganisation of PPC instruction classes for scheduling. No functionality change for existing subtargets. - Clearly separate load/store-with-update instructions from regular loads and stores. - Split IntRotateD -> IntRotateD and IntRotateDI - Split out fsub and fadd from FPGeneral -> FPAddSub - Update existing itineraries Patch by Tobias von Koch. llvm-svn: 162729	2012-08-28 02:49:14 +00:00
Hal Finkel	686f2ee226	Allow remat of LI on PPC. Allow load-immediates to be rematerialised in the register coalescer for PPC. This makes test/CodeGen/PowerPC/big-endian-formal-args.ll fail, because it relies on a register move getting emitted. The immediate load is equivalent, so change this test case. Patch by Tobias von Koch. llvm-svn: 162727	2012-08-28 02:10:33 +00:00
Hal Finkel	5ab378037f	Eliminate redundant CR moves on PPC32. The 32-bit ABI requires CR bit 6 to be set if the call has fp arguments and unset if it doesn't. The solution up to now was to insert a MachineNode to set/unset the CR bit, which produces a CR vreg. This vreg was then copied into CR bit 6. When the register allocator saw a bunch of these in the same function, it allocated the set/unset CR bit in some random CR register (1 extra instruction) and then emitted CR moves before every vararg function call, rather than just setting and unsetting CR bit 6 directly before every vararg function call. This patch instead inserts a PPCcrset/PPCcrunset instruction which are then matched by a dedicated instruction pattern. Patch by Tobias von Koch. llvm-svn: 162725	2012-08-28 02:10:27 +00:00
Jakob Stoklund Olesen	a954e92053	Add missing SDNPSideEffect flags. llvm-svn: 162557	2012-08-24 14:43:27 +00:00
Jakob Stoklund Olesen	ed6c0408fa	Remove variable_ops from call instructions in most targets. Call instructions are no longer required to be variadic, and variable_ops should only be used for instructions that encode a variable number of arguments, like the ARM stm/ldm instructions. llvm-svn: 160189	2012-07-13 20:44:29 +00:00
Hal Finkel	460e94d842	Add support for the PPC isel instruction. The isel (integer select) instruction is supported on the 440 and A2 embedded cores and on the POWER7. llvm-svn: 159045	2012-06-22 23:10:08 +00:00
Hal Finkel	0a479ae7d1	Convert the PPC backend to use the new FMA infrastructure. The existing contraction patterns are replaced with fma/fneg. Overall functionality should be the same. llvm-svn: 158955	2012-06-22 00:49:52 +00:00
Hal Finkel	ca542beffe	Add support for generating reg+reg (indexed) pre-inc loads on PPC. llvm-svn: 158823	2012-06-20 15:43:03 +00:00
Lang Hames	39fb1d08dc	Add DAG-combines for aggressive FMA formation. This patch adds DAG combines to form FMAs from pairs of FADD + FMUL or FSUB + FMUL. The combines are performed when: (a) Either AllowExcessFPPrecision option (-enable-excess-fp-precision for llc) OR UnsafeFPMath option (-enable-unsafe-fp-math) are set, and (b) TargetLoweringInfo::isFMAFasterThanMulAndAdd(VT) is true for the type of the FADD/FSUB, and (c) The FMUL only has one user (the FADD/FSUB). If your target has fast FMA instructions you can make use of these combines by overriding TargetLoweringInfo::isFMAFasterThanMulAndAdd(VT) to return true for types supported by your FMA instruction, and adding patterns to match ISD::FMA to your FMA instructions. llvm-svn: 158757	2012-06-19 22:51:23 +00:00
Hal Finkel	1cc27e44a4	Add support for generating reg+reg preinc stores on PPC. PPC will now generate STWUX and friends. llvm-svn: 158698	2012-06-19 02:34:32 +00:00
Hal Finkel	8c33dde666	Split out the PPC instruction class IntSimple from IntGeneral. On the POWER7, adds and logical operations can also be handled in the load/store pipelines. We'll call these IntSimple. llvm-svn: 158366	2012-06-12 19:01:24 +00:00
Hal Finkel	2c09058f19	Emit the two-operand form of the PPC mfcr instruction as mfocrf. This is necessary on Linux and supported on Darwin, see PR2604. llvm-svn: 158315	2012-06-11 15:43:15 +00:00
Hal Finkel	96c2d4d945	Add the PPCCTRLoops pass: a PPC machine-code-level optimization pass to form CTR-based loop branching code. This pass is derived from the Hexagon HardwareLoops pass. The only significant enhancement over the Hexagon pass is that PPCCTRLoops will also attempt to delete the replaced add and compare operations if they are no longer otherwise used. Also, invalid preheader DebugLoc is not used. llvm-svn: 158204	2012-06-08 15:38:21 +00:00
Roman Divacky	e3f15c98d1	Implement local-exec TLS on PowerPC. llvm-svn: 157935	2012-06-04 17:36:38 +00:00
Hal Finkel	322e41a914	Enable prefetch generation on PPC64. llvm-svn: 153851	2012-04-01 20:08:17 +00:00
Hal Finkel	59607e63cb	Split the LdStGeneral PPC itin. class into LdStLoad and LdStStore. Loads and stores can have different pipeline behavior, especially on embedded chips. This change allows those differences to be expressed. Except for the 440 scheduler, there are no functionality changes. On the 440, the latency adjustment is only by one cycle, and so this probably does not affect much. Nevertheless, it will make a larger difference in the future and this removes a FIXME from the 440 itin. llvm-svn: 153821	2012-04-01 04:44:16 +00:00
Hal Finkel	51861b4855	Fix dynamic linking on PPC64. Dynamic linking on PPC64 has had problems since we had to move the top-down hazard-detection logic post-ra. For dynamic linking to work there needs to be a nop placed after every call. It turns out that it is really hard to guarantee that nothing will be placed in between the call (bl) and the nop during post-ra scheduling. Previous attempts at fixing this by placing logic inside the hazard detector only partially worked. This is now fixed in a different way: call+nop codegen-only instructions. As far as CodeGen is concerned the pair is now a single instruction and cannot be split. This solution works much better than previous attempts. The scoreboard hazard detector is also renamed to be more generic, there is currently no cpu-specific logic in it. llvm-svn: 153816	2012-03-31 14:45:15 +00:00
Roman Divacky	ef21be2cda	Convert PowerPC to register mask operands. llvm-svn: 152122	2012-03-06 16:41:49 +00:00
Jia Liu	b22310fda6	Emacs-tag and some comment fix for all ARM, CellSPU, Hexagon, MBlaze, MSP430, PPC, PTX, Sparc, X86, XCore. llvm-svn: 150878	2012-02-18 12:03:15 +00:00
Hal Finkel	ac9df3d411	make CR spill and restore 64-bit clean (no functional change), and fix some other problems found with -verify-machineinstrs llvm-svn: 146024	2011-12-07 06:34:06 +00:00
Hal Finkel	abbc2529c1	set mayStore and mayLoad on CR pseudos llvm-svn: 146022	2011-12-07 06:33:57 +00:00
Hal Finkel	bde7f8ffe2	add RESTORE_CR and support CR unspills llvm-svn: 145961	2011-12-06 20:55:36 +00:00
Nick Lewycky	50f02cb21b	Move global variables in TargetMachine into new TargetOptions class. As an API change, now you need a TargetOptions object to create a TargetMachine. Clang patch to follow. One small functionality change in PTX. PTX had commented out the machine verifier parts in their copy of printAndVerify. That now calls the version in LLVMTargetMachine. Users of PTX who need verification disabled should rely on not passing the command-line flag to enable it. llvm-svn: 145714	2011-12-02 22:16:29 +00:00
Hal Finkel	6fa5697af0	Add PPC 440 scheduler and some associated tests llvm-svn: 142170	2011-10-17 04:03:49 +00:00
Roman Divacky	71038e7021	Set CR1EQ only when lowering vararg floating arguments (not any vararg arguments as before), unset CR1EQ otherwise. llvm-svn: 138802	2011-08-30 17:04:16 +00:00
Eli Friedman	26a484852e	Code generation for 'fence' instruction. llvm-svn: 136283	2011-07-27 22:21:52 +00:00
Cameron Zwarich	dadd73390f	Fix PR8828 by removing the explicit def in MovePCToLR as well as the pointless piclabel operand. The operand in the tablegen definition doesn't actually turn into an MI operand, so it just confuses anything checking the TargetInstrDesc for the number of operands. It suffices to just have an implicit def of LR. llvm-svn: 131626	2011-05-19 02:56:28 +00:00
Jakob Stoklund Olesen	86e1a65ce5	PowerPC atomic pseudos clobber CR0, they don't read it. llvm-svn: 128829	2011-04-04 17:07:09 +00:00
Chris Lattner	2a0a3b43d7	Flag -> Glue, the ongoing saga llvm-svn: 122513	2010-12-23 18:28:41 +00:00
Chris Lattner	cfedba706c	Fix a bug I introduced in the ppc refactoring, which caused long branches to be emitted as: bne cr0, 2 instead of: bne cr0, $+8 llvm-svn: 119317	2010-11-16 01:45:05 +00:00
Chris Lattner	efacb9ee42	split out an encoder for memri operands, allowing a relocation to be plopped into the immediate field. This allows us to encode stuff like this: lbz r3, lo16(__ZL4init)(r4) ; globalopt.cpp:5 ; encoding: [0x88,0x64,A,A] ; fixup A - offset: 0, value: lo16(__ZL4init), kind: fixup_ppc_lo16 stw r3, lo16(__ZL1s)(r5) ; globalopt.cpp:6 ; encoding: [0x90,0x65,A,A] ; fixup A - offset: 0, value: lo16(__ZL1s), kind: fixup_ppc_lo16 With this, we should have a completely function MCCodeEmitter for PPC, wewt. llvm-svn: 119134	2010-11-15 08:22:03 +00:00
Chris Lattner	8f4444d003	add support for encoding the lo14 forms used for a few PPC64 addressing modes. For example, we now get: ld r3, lo16(_G)(r3) ; encoding: [0xe8,0x63,A,0bAAAAAA00] ; fixup A - offset: 0, value: lo16(_G), kind: fixup_ppc_lo14 llvm-svn: 119133	2010-11-15 08:02:41 +00:00
Chris Lattner	6566112e9c	implement the start of support for lo16 and ha16, allowing us to get stuff like: lis r4, ha16(__ZL4init) ; encoding: [0x3c,0x80,A,A] ; fixup A - offset: 0, value: ha16(__ZL4init), kind: fixup_ppc_ha16 llvm-svn: 119127	2010-11-15 06:33:39 +00:00
Chris Lattner	0e3461e417	change direct branches to encode with the same encoding method as direct calls. Change conditional branches to encode with their own method, simplifying the JIT encoder and making room for adding an mc fixup. llvm-svn: 119125	2010-11-15 06:09:35 +00:00
Chris Lattner	7064198397	eliminate a now-unneeded operand printer. llvm-svn: 119124	2010-11-15 06:01:10 +00:00
Chris Lattner	79fa37152a	split call operands out to their own encoding class, simplifying code in the JIT. Use this to form the first fixup for the PPC backend, giving us stuff like this: bl L_foo$stub ; encoding: [0b010010AA,A,A,0bAAAAAA01] ; fixup A - offset: 0, value: L_foo$stub, kind: fixup_ppc_br24 llvm-svn: 119123	2010-11-15 05:57:53 +00:00
Chris Lattner	d6a07ccd10	add proper encoding for MTCRF instead of using a hack. llvm-svn: 119121	2010-11-15 05:19:25 +00:00
Chris Lattner	c877d8f44c	add basic encoding support for immediates and registers, allowing us to encode all of these instructions correctly (for example): mflr r0 ; encoding: [0x7c,0x08,0x02,0xa6] stw r0, 8(r1) ; encoding: [0x90,0x01,0x00,0x08] stwu r1, -64(r1) ; encoding: [0x94,0x21,0xff,0xc0] llvm-svn: 119118	2010-11-15 04:51:55 +00:00
Chris Lattner	aa4d03d1f5	remove asmstrings (which can never be printed) from pseudo instructions, allowing is to eliminate some dead operand printing methods from the instprinter. llvm-svn: 119113	2010-11-15 03:48:58 +00:00
Chris Lattner	2f9f63af0b	lower PPC::MFCRpseud when transforming to MC, avoiding calling the aborting printSpecial() method. This gets us to 8 failures. llvm-svn: 119084	2010-11-14 22:03:15 +00:00
Jakob Stoklund Olesen	44629eb81b	Emit COPY instead of FMR/FMSD instructions for floating point conversion on PowerPC. llvm-svn: 108555	2010-07-16 21:03:52 +00:00
Dale Johannesen	d7d6638e3e	The PPC MFCR instruction implicitly uses all 8 of the CR registers. Currently it is not so marked, which leads to VCMPEQ instructions that feed into it getting deleted. If it is so marked, local RA complains about this sequence: vreg = MCRF CR0 MFCR <kill of whatever preg got assigned to vreg> All current uses of this instruction are only interested in one of the 8 CR registers, so redefine MFCR to be a normal unary instruction with a CR input (which is emitted only as a comment). That avoids all problems. 7739628. llvm-svn: 104238	2010-05-20 17:48:26 +00:00
Dan Gohman	30e3db2ba3	Set isTerminator on TRAP instructions. llvm-svn: 103778	2010-05-14 16:46:02 +00:00
Dan Gohman	c56ca22616	Don't use isBarrier for the PowerPC sync instruction. isBarrier is for control barriers, not memory ordering barriers. llvm-svn: 103777	2010-05-14 16:42:16 +00:00
Chris Lattner	0433699ef0	set SDNPVariadic on nodes throughout the rest of the targets that need them. llvm-svn: 98937	2010-03-19 05:33:51 +00:00
Jakob Stoklund Olesen	17d54920d7	Merge PPC instructions FMRS and FMRD into a single FMR instruction. This is possible because F8RC is a subclass of F4RC. We keep FMRSD around so fextend has a pattern. Also allow folding of memory operands on FMRSD. llvm-svn: 97275	2010-02-26 21:53:24 +00:00
Chris Lattner	d17089231a	remove a bunch of dead named arguments in input patterns, though some look dubious afaict, these are all ok. llvm-svn: 96899	2010-02-23 06:54:29 +00:00
Chris Lattner	986ab3fb1d	Eliminate some uses of immAllOnes, just use -1, it does the same thing and is more efficient for the matcher. llvm-svn: 96712	2010-02-21 03:12:16 +00:00
Jakob Stoklund Olesen	aee326860c	Don't specify CR sub-registers as implicit defs of BL instructions. It is enough to give the super registers CR0, CR1, ..., and specifying the sub-registers as well causes confusion in the liveness computations. llvm-svn: 92778	2010-01-05 21:38:37 +00:00
Tilmann Scheller	79fef9349c	Add support for calls through function pointers in the 64-bit PowerPC SVR4 ABI. Patch contributed by Ken Werner of IBM! llvm-svn: 91680	2009-12-18 13:00:15 +00:00
Dan Gohman	de5dea869f	Remove ISD::DEBUG_LOC and ISD::DBG_LABEL, which are no longer used. Note that "hasDotLocAndDotFile"-style debug info was already broken; people wanting this functionality should implement it in the AsmPrinter/DwarfWriter code. llvm-svn: 89711	2009-11-23 23:20:51 +00:00
Bob Wilson	f84f7105f7	Add PowerPC codegen for indirect branches. llvm-svn: 86050	2009-11-04 21:31:18 +00:00
Dan Gohman	453d64c9f5	Rename usesCustomDAGSchedInserter to usesCustomInserter, and update a bunch of associated comments, because it doesn't have anything to do with DAGs or scheduling. This is another step in decoupling MachineInstr emitting from scheduling. llvm-svn: 85517	2009-10-29 18:10:34 +00:00
Dan Gohman	48b185d6f7	Improve MachineMemOperand handling. - Allocate MachineMemOperands and MachineMemOperand lists in MachineFunctions. This eliminates MachineInstr's std::list member and allows the data to be created by isel and live for the remainder of codegen, avoiding a lot of copying and unnecessary translation. This also shrinks MemSDNode. - Delete MemOperandSDNode. Introduce MachineSDNode which has dedicated fields for MachineMemOperands. - Change MemSDNode to have a MachineMemOperand member instead of its own fields with the same information. This introduces some redundancy, but it's more consistent with what MachineInstr will eventually want. - Ignore alignment when searching for redundant loads for CSE, but remember the greatest alignment. Target-specific code which previously used MemOperandSDNodes with generic SDNodes now use MemIntrinsicSDNodes, with opcodes in a designated range so that the SelectionDAG framework knows that MachineMemOperand information is available. llvm-svn: 82794	2009-09-25 20:36:54 +00:00
Dale Johannesen	5e9a5c3664	Model the carry bit on ppc32. Without this we could move a SUBFC (etc.) below the SUBFE (etc.) that consumed the carry bit. Add missing ADDIC8, noticed along the way. llvm-svn: 82266	2009-09-18 20:15:22 +00:00
Tilmann Scheller	d1aaa3243a	Add support for the PowerPC 64-bit SVR4 ABI. The Link Register is volatile when using the 32-bit SVR4 ABI. Make it possible to use the 64-bit SVR4 ABI. Add non-volatile registers for the 64-bit SVR4 ABI. Make sure r2 is a reserved register when using the 64-bit SVR4 ABI. Update PPCFrameInfo for the 64-bit SVR4 ABI. Add FIXME for 64-bit Darwin PPC. Insert NOP instruction after direct function calls. Emit official procedure descriptors. Create TOC entries for GlobalAddress references. Spill 64-bit non-volatile registers to the correct slots. Only custom lower VAARG when using the 32-bit SVR4 ABI. Use simple VASTART lowering for the 64-bit SVR4 ABI. llvm-svn: 79091	2009-08-15 11:54:46 +00:00
Owen Anderson	9f94459d24	Split EVT into MVT and EVT, the former representing _just_ a primitive type, while the latter is capable of representing either a primitive or an extended type. llvm-svn: 78713	2009-08-11 20:47:22 +00:00
Owen Anderson	53aa7a960c	Rename MVT to EVT, in preparation for splitting SimpleValueType out into its own struct type. llvm-svn: 78610	2009-08-10 22:56:29 +00:00
Dan Gohman	f9bbcd1afd	Major calling convention code refactoring. Instead of awkwardly encoding calling-convention information with ISD::CALL, ISD::FORMAL_ARGUMENTS, ISD::RET, and ISD::ARG_FLAGS nodes, TargetLowering provides three virtual functions for targets to override: LowerFormalArguments, LowerCall, and LowerRet, which replace the custom lowering done on the special nodes. They provide the same information, but in a more immediately usable format. This also reworks much of the target-independent tail call logic. The decision of whether or not to perform a tail call is now cleanly split between target-independent portions, and the target dependent portion in IsEligibleForTailCallOptimization. This also synchronizes all in-tree targets, to help enable future refactoring and feature work. llvm-svn: 78142	2009-08-05 01:29:28 +00:00
Tilmann Scheller	773f14c008	Refactor ABI code in the PowerPC backend. Make CalculateParameterAndLinkageAreaSize() Darwin-specific. Remove SVR4 specific code from LowerCALL_Darwin() and LowerFORMAL_ARGUMENTS_Darwin(). Rename MachoABI to DarwinABI for consistency. Rename ELF ABI to SVR4 ABI for consistency. Factor out common call return lowering between the Darwin and SVR4 ABI. Factor out common call lowering between the Darwin and SVR4 ABI. llvm-svn: 74766	2009-07-03 06:47:08 +00:00
Tilmann Scheller	b93960d779	Implement the SVR4 ABI for PowerPC. Implement LowerFORMAL_ARGUMENTS_SVR4(). Implement LowerCALL_SVR4(). Add support for split arguments. Implement by value parameter passing for aggregates. Add support for variable argument lists. Create the spill area for argument registers of variable argument functions no longer at a fixed offset. Make sure callee saved registers are spilled to the correct stack offsets. Change allocation order of non-volatile floating-point registers. Add VRSAVE to the list of callee-saved registers, add CallConvLowering for vararg calls. Add support for variable argument calls with Vector arguments. Add support for VR and VRSAVE save area, improve allocation order for non-volatile vector registers. Stop creating illegal i8 values in LowerVASTART(). Add memory access width hints. Make sure to reserve space on the stack for the frame pointer. When using the SVR4 ABI, reserve r13 for the Small Data Area pointer. Assure that the frame pointer is spilled to the correct location on the stack. Some FP registers were not marked as volatile. Make sure the i64 words from a long double are passed either both in registers or both on the stack. Only put integer arguments in registers which are not marked with the inreg flag. llvm-svn: 74765	2009-07-03 06:45:56 +00:00
Dan Gohman	69cc2cbbff	Rename isSimpleLoad to canFoldAsLoad, to better reflect its meaning. llvm-svn: 60487	2008-12-03 18:15:48 +00:00
Dan Gohman	ae3ba45eb2	Add a sanity-check to tablegen to catch the case where isSimpleLoad is set but mayLoad is not set. Fix all the problems this turned up. Change code to not use isSimpleLoad instead of mayLoad unless it really wants isSimpleLoad. llvm-svn: 60459	2008-12-03 02:30:17 +00:00
Dale Johannesen	98aa9d3e49	Add a RM pseudoreg for the rounding mode, which allows ppcf128->int conversion to work with DeadInstructionElimination. This is now turned off but RM is harmless. It does not do a complete job of modeling the rounding mode. Revert marking MFCR as using all 7 CR subregisters; while correct, this caused the problem in PR 2964, plus the local RA crash noted in the comments. This was needed to make DeadInstructionElimination, but as we are not running that, it is backed out for now. Eventually it should go back in and the other problems fixed where they're broken. llvm-svn: 58391	2008-10-29 18:26:45 +00:00
Dale Johannesen	71f361e758	Mark MFCR as reading all condition code registers. Prevents some more overzealous deletions (mostly in AltiVec code). llvm-svn: 58121	2008-10-24 22:08:01 +00:00
Dale Johannesen	e395d78657	Mark defs and uses of CTR and LR correctly. Prevents DeadMachineInstructionElim from thinking things like MTCTR are dead (fixes massive testsuite breakage at -O0). llvm-svn: 58043	2008-10-23 20:41:28 +00:00
Duncan Sands	dc84511146	Fix warnings about mb/me being potentially used uninitialized in these functions with gcc-4.3. llvm-svn: 57635	2008-10-16 13:02:33 +00:00
Chris Lattner	2753955fc0	Change CALLSEQ_BEGIN and CALLSEQ_END to take TargetConstant's as parameters instead of raw Constants. This prevents the constants from being selected by the isel pass, fixing PR2735. llvm-svn: 57385	2008-10-11 22:08:30 +00:00
Dan Gohman	effb894453	Rename ConstantSDNode::getValue to getZExtValue, for consistency with ConstantInt. This led to fixing a bug in TargetLowering.cpp using getValue instead of getAPIntValue. llvm-svn: 56159	2008-09-12 16:56:44 +00:00
Dale Johannesen	a32affb9ba	Implement partial-word binary atomics on ppc. llvm-svn: 55478	2008-08-28 17:53:09 +00:00
Dale Johannesen	d4eb0521e4	Implement 32 & 64 bit versions of PPC atomic binary primitives. llvm-svn: 55343	2008-08-25 22:34:37 +00:00
Dale Johannesen	765065c982	Remove PPC-specific lowering for atomics; the generic stuff works fine. Mark rewritten cmp-and-swap as not using CR1. llvm-svn: 55336	2008-08-25 21:09:52 +00:00
Dale Johannesen	ed86f689cb	Implement __sync_synchronize on ppc32. Patch by Gary Benson. llvm-svn: 55186	2008-08-22 17:20:54 +00:00
Dale Johannesen	dec51704ed	Rewrite ppc code generated for __sync_{bool\|val}_compare_and_swap so that lwarx and stwcx are always executed the same number of times. This is important for performance, I'm told. llvm-svn: 55163	2008-08-22 03:49:10 +00:00
Nate Begeman	f69d13b60a	Implement ISD::TRAP support on PPC llvm-svn: 54644	2008-08-11 17:36:31 +00:00
Evan Cheng	32e376f354	Implement llvm.atomic.cmp.swap.i32 on PPC. Patch by Gary Benson! llvm-svn: 53505	2008-07-12 02:23:19 +00:00
Anton Korobeynikov	c1e80a759f	Provide correct encoding for PPC LWARX instructions. Patch by Gary Benson! llvm-svn: 52828	2008-06-27 16:10:20 +00:00
Arnold Schwaighofer	be0de34ede	Tail call optimization improvements: Move platform independent code (lowering of possibly overwritten arguments, check for tail call optimization eligibility) from target X86ISelectionLowering.cpp to TargetLowering.h and SelectionDAGISel.cpp. Initial PowerPC tail call implementation: Support ppc32 implemented and tested (passes my tests and test-suite llvm-test). Support ppc64 implemented and half tested (passes my tests). On ppc tail call optimization is performed if caller and callee are fastcc call is a tail call (in tail call position, call followed by ret) no variable argument lists or byval arguments option -tailcallopt is enabled Supported: * non pic tail calls on linux/darwin * module-local tail calls on linux(PIC/GOT)/darwin(PIC) * inter-module tail calls on darwin(PIC) If constraints are not met a normal call will be emitted. A test checking the argument lowering behaviour on x86-64 was added. llvm-svn: 50477	2008-04-30 09:16:33 +00:00
Evan Cheng	5102bd9359	64-bit atomic operations. llvm-svn: 49949	2008-04-19 02:30:38 +00:00
Evan Cheng	51096affb5	PPC32 atomic operations. llvm-svn: 49947	2008-04-19 01:30:48 +00:00
Evan Cheng	0e7b00d79f	Replace all target specific implicit def instructions with a target independent one: TargetInstrInfo::IMPLICIT_DEF. llvm-svn: 48380	2008-03-15 00:03:38 +00:00
Nicolas Geoffray	b1de7a35f9	Add description of individual bits in CR. This fix PR1765. llvm-svn: 48143	2008-03-10 14:12:10 +00:00
Chris Lattner	20b5a2b037	Add support for ppc64 shifts with 7-bit (oversized) shift amount (e.g. PPCshl). llvm-svn: 48027	2008-03-07 20:18:24 +00:00
Chris Lattner	25ff7e217d	Replace SDT_PPCShiftOp in favor of SDTIntBinOps. This allows it to work with 32 or 64-bit operands/results. llvm-svn: 48026	2008-03-07 20:13:51 +00:00
Bill Wendling	632ea65072	This is the initial check-in for adding register scavenging to PPC. (Currently, PPC-64 doesn't work.) This also lowers the spilling of the CR registers so that it uses a register other than the default R0 register (the scavenger scrounges for one). A significant part of this patch fixes how kill information is handled. llvm-svn: 47863	2008-03-03 22:19:16 +00:00
Bill Wendling	97925ec704	Final de-tabification. llvm-svn: 47663	2008-02-27 06:33:05 +00:00
Nate Begeman	87abe955fc	Make register scavenging happy by not using a reg (CR0) that isn't defined llvm-svn: 47045	2008-02-13 02:58:33 +00:00
Chris Lattner	9a249b0ce5	rename SDTRet -> SDTNone. Move definition of 'trap' sdnode up from x86 instrinfo to targetselectiondag.td. llvm-svn: 46017	2008-01-15 22:02:54 +00:00
Chris Lattner	aca7ca3730	remove explicit sets of 'neverHasSideEffects' that can now be inferred from the instr patterns. llvm-svn: 45824	2008-01-10 05:45:39 +00:00
Chris Lattner	94de7bc3aa	get def use info more correct. llvm-svn: 45821	2008-01-10 05:12:37 +00:00
Chris Lattner	a4ce4f6987	rename isLoad -> isSimpleLoad due to evan's desire to have such a predicate. llvm-svn: 45667	2008-01-06 23:38:27 +00:00
Chris Lattner	10324d0175	rename isStore -> mayStore to more accurately reflect what it captures. llvm-svn: 45656	2008-01-06 08:36:04 +00:00
Chris Lattner	a348f55ec6	Change the 'isStore' inferrer to look for 'SDNPMayStore' instead of "ISD::STORE". This allows us to mark target-specific dag nodes as storing (such as ppc byteswap stores). This allows us to remove more explicit isStore flags from the .td files. Finally, add a warning for when a .td file contains an explicit isStore and tblgen is able to infer it. llvm-svn: 45654	2008-01-06 06:44:58 +00:00
Chris Lattner	e20f380fbf	remove some isStore flags that are now inferred automatically. llvm-svn: 45652	2008-01-06 05:53:26 +00:00
Chris Lattner	f3ebc3f3d2	Remove attribution from file headers, per discussion on llvmdev. llvm-svn: 45418	2007-12-29 20:36:04 +00:00
Bill Wendling	ca77ecb40a	Mark the "isRemat" instruction as never having side effects. llvm-svn: 45190	2007-12-19 06:07:48 +00:00
Evan Cheng	6e68381e02	Implicit def instructions, e.g. X86::IMPLICIT_DEF_GR32, are always re-materializable and they should not be spilled. llvm-svn: 44960	2007-12-12 23:12:09 +00:00
Bill Wendling	fb706bc52b	Initial commit of the machine code LICM pass. It successfully hoists this: _foo: li r2, 0 LBB1_1: ; bb li r5, 0 stw r5, 0(r3) addi r2, r2, 1 addi r3, r3, 4 cmplw cr0, r2, r4 bne cr0, LBB1_1 ; bb LBB1_2: ; return blr to: _foo: li r2, 0 li r5, 0 LBB1_1: ; bb stw r5, 0(r3) addi r2, r2, 1 addi r3, r3, 4 cmplw cr0, r2, r4 bne cr0, LBB1_1 ; bb LBB1_2: ; return blr ZOMG!! :-) Moar to come... llvm-svn: 44687	2007-12-07 21:42:31 +00:00
Bill Wendling	77b13af9a6	Unifacalize the CALLSEQ{START,END} stuff. llvm-svn: 44045	2007-11-13 09:19:02 +00:00
Bill Wendling	f359fed9f9	Unify CALLSEQ_{START,END}. They take 4 parameters: the chain, two stack adjustment fields, and an optional flag. If there is a "dynamic_stackalloc" in the code, make sure that it's bracketed by CALLSEQ_START and CALLSEQ_END. If not, then there is the potential for the stack to be changed while the stack's being used by another instruction (like a call). This can only result in tears... llvm-svn: 44037	2007-11-13 00:44:25 +00:00
Owen Anderson	933b5b7e62	Add a flag for indirect branch instructions. Target maintainers: please check that the instructions for your target are correctly marked. llvm-svn: 44012	2007-11-12 07:39:39 +00:00
Evan Cheng	ec271b104c	Temporary solution: added a different set of BCTRL_Macho / BCTRL_ELF with right callee-saved defs set for ppc64. llvm-svn: 43248	2007-10-23 06:42:42 +00:00
Dale Johannesen	666323eacd	Next PPC long double bits: ppcf128->i32 conversion. Surprisingly complicated. Adds getTargetNode for 2 outputs, no inputs (missing). llvm-svn: 42822	2007-10-10 01:01:31 +00:00
Evan Cheng	3e18e504ae	Remove (somewhat confusing) Imp<> helper, use let Defs = [], Uses = [] instead. llvm-svn: 41863	2007-09-11 19:55:27 +00:00
Evan Cheng	58c3c30921	Some out operands were incorrectly specified as input operands. llvm-svn: 40697	2007-08-01 23:07:38 +00:00
Evan Cheng	ac1591be42	No more noResults. llvm-svn: 40132	2007-07-21 00:34:19 +00:00
Evan Cheng	9081ab8127	Oops. These stores actually produce results. llvm-svn: 40074	2007-07-20 00:20:46 +00:00
Evan Cheng	94b5a80b93	Change instruction description to split OperandList into OutOperandList and InOperandList. This gives one piece of important information: # of results produced by an instruction. An example of the change: def ADD32rr : I<0x01, MRMDestReg, (ops GR32:$dst, GR32:$src1, GR32:$src2), "add{l} {$src2, $dst\|$dst, $src2}", [(set GR32:$dst, (add GR32:$src1, GR32:$src2))]>; => def ADD32rr : I<0x01, MRMDestReg, (outs GR32:$dst), (ins GR32:$src1, GR32:$src2), "add{l} {$src2, $dst\|$dst, $src2}", [(set GR32:$dst, (add GR32:$src1, GR32:$src2))]>; llvm-svn: 40033	2007-07-19 01:14:50 +00:00
Evan Cheng	76a97c5f8a	Do away with ImmutablePredicateOperand. llvm-svn: 37961	2007-07-06 23:22:46 +00:00
Evan Cheng	ea4a82bcfb	PPC conditional branch predicate does not change after isel. llvm-svn: 37893	2007-07-05 07:09:50 +00:00
Evan Cheng	d194a8603d	PredicateOperand can be used as a normal operand for isel. llvm-svn: 36947	2007-05-08 21:06:08 +00:00
Nicolas Geoffray	fbfc451ba9	The ELF ABI specifies F1-F8 registers as argument registers for double, not F1-F10. This affects only ELF, not MachO. llvm-svn: 35622	2007-04-03 10:27:07 +00:00
Nicolas Geoffray	89d81878d2	Differentiate between the MachO and the ELF ABI the CALL instruction. llvm-svn: 34667	2007-02-27 13:01:19 +00:00
Chris Lattner	535bd6d3ba	always lower to RETFLAG, never leave it as just ret. llvm-svn: 34639	2007-02-26 19:44:02 +00:00
Chris Lattner	84ab9a556c	one important bugfix: PPC32 didn't have both elf and macho support for external symbols and global addresses. Add the missing ones. one important workaround: PPCISD::CALL is matched by both PPCcall_ELF and PPCcall_Macho, disable the _ELF patterns for now. llvm-svn: 34601	2007-02-25 19:20:53 +00:00
Chris Lattner	43df5b335c	implement support for the linux/ppc function call ABI. Patch by Nicolas Geoffray! llvm-svn: 34574	2007-02-25 05:34:32 +00:00
Jim Laskey	f9e5445ed4	Make LABEL a builtin opcode. llvm-svn: 33537	2007-01-26 14:34:52 +00:00
Chris Lattner	542dfd5510	Rewrite the branch selector to be correct in the face of large functions. The algorithm it used before wasn't 100% correct, we now use an iterative expansion model. This fixes assembler errors when compiling 403.gcc with tail merging enabled. Change the way the branch selector works overall: Now, the isel generates PPC::BCC instructions (as it used to) directly, and these BCC instructions are emitted to the output or jitted directly if branches don't need expansion. Only if branches need expansion are instructions rewritten and created. This should make branch select faster, and eliminates the Bxx instructions from the .td file. llvm-svn: 31837	2006-11-18 00:32:03 +00:00
Chris Lattner	33fc1d45e5	add encoding for BCC, after finally wrestling strange ppc/tblgen endianness issues to the ground. llvm-svn: 31836	2006-11-17 23:53:28 +00:00
Chris Lattner	be9377a1e3	convert PPC::BCC to use the 'pred' operand instead of separate predicate value and CR reg #. This requires swapping the order of these everywhere that touches BCC and requires us to write custom matching logic for PPCcondbranch :( llvm-svn: 31835	2006-11-17 22:37:34 +00:00
Chris Lattner	e0263794f4	rename PPC::COND_BRANCH to PPC::BCC llvm-svn: 31834	2006-11-17 22:14:47 +00:00
Chris Lattner	8c6a41ea12	start using PPC predicates more consistently. llvm-svn: 31833	2006-11-17 22:10:59 +00:00
Jim Laskey	48850c10c0	This is a general clean up of the PowerPC ABI. Address several problems and bugs including making sure that the TOS links back to the previous frame, that the maximum call frame size is not included twice when using frame pointers, no longer growing the frame on calls, double storing of SP and a cleaner/faster dynamic alloca. llvm-svn: 31792	2006-11-16 22:43:37 +00:00
Chris Lattner	a7ff5162b0	fix broken encoding llvm-svn: 31778	2006-11-16 01:01:28 +00:00
Chris Lattner	6f5840c409	add patterns for ppc32 preinc stores. ppc64 next. llvm-svn: 31775	2006-11-16 00:41:37 +00:00
Chris Lattner	3a494989a6	switch these back to the 'bad old way' llvm-svn: 31774	2006-11-16 00:33:34 +00:00
Chris Lattner	5771156be0	Stop using isTwoAddress, switching to operand constraints instead. Tell the codegen emitter that specific operands are not to be encoded, fixing JIT regressions w.r.t. pre-inc loads and stores (e.g. lwzu, which we generate even when general preinc loads are not enabled). llvm-svn: 31770	2006-11-15 23:24:18 +00:00
Chris Lattner	474b5b7c95	fix ldu/stu jit encoding. Swith 64-bit preinc load instrs to use memri addrmodes. llvm-svn: 31757	2006-11-15 19:55:13 +00:00
Chris Lattner	1396961e85	Switch loads over to use memri as the operand instead of a reg/imm operand pair for cleanliness. Add instructions for PPC32 preinc-stores with commented out patterns. More improvement is needed to enable the patterns, but we're getting close. llvm-svn: 31749	2006-11-15 02:43:19 +00:00
Chris Lattner	e79a451475	group load and store instructions together. No functionality change. llvm-svn: 31736	2006-11-14 19:19:53 +00:00
Chris Lattner	44dbdbe5cf	Rework PPC64 calls. Now we have a LR8/CTR8 register which the PPC64 calls clobber. This allows LR8 to be save/restored correctly as a 64-bit quantity, instead of handling it as a 32-bit quantity. This unbreaks ppc64 codegen when the code is actually located above the 4G boundary. llvm-svn: 31734	2006-11-14 18:44:47 +00:00
Chris Lattner	2ff632c54b	Mark operands as symbol lo instead of imm32 so that they print lo(x) around globals. llvm-svn: 31672	2006-11-11 04:51:36 +00:00
Chris Lattner	6c8656a6b1	dform 8/9 are identical to dform 1 llvm-svn: 31637	2006-11-10 17:51:02 +00:00
Chris Lattner	ce6455489a	add an initial cut at preinc loads for ppc32. This is broken for ppc64 (because the 64-bit reg target versions aren't implemented yet), doesn't support r+r addr modes, and doesn't handle stores, but it works otherwise. :) This is disabled unless -enable-ppc-preinc is passed to llc for now. llvm-svn: 31621	2006-11-10 02:08:47 +00:00
Chris Lattner	6a5a4f85d3	correct the (currently unused) pattern for lwzu. llvm-svn: 31535	2006-11-08 02:13:12 +00:00
Chris Lattner	2959789c92	encode BLR predicate info for the JIT llvm-svn: 31450	2006-11-04 05:42:48 +00:00
Chris Lattner	6be726048e	Go through all kinds of trouble to mark 'blr' as having a predicate operand that takes a register and condition code. Print these pieces of BLR the right way, even though it is currently set to 'always'. Next up: get the JIT encoding right, then enhance branch folding to produce predicated blr for simple examples. llvm-svn: 31449	2006-11-04 05:27:39 +00:00
Chris Lattner	c8a68d08c3	Describe PPC predicates, which are a pair of CR# and condition. llvm-svn: 31438	2006-11-03 23:53:25 +00:00
Chris Lattner	895d199348	remove dead vars llvm-svn: 31433	2006-11-03 23:46:45 +00:00
Chris Lattner	d43e8a7429	Add intrinsics for the rest of the DCB* instructions. llvm-svn: 31148	2006-10-24 01:08:42 +00:00
Evan Cheng	ab51cf2e78	Merge ISD::TRUNCSTORE to ISD::STORE. Switch to using StoreSDNode. llvm-svn: 30945	2006-10-13 21:14:26 +00:00
Chris Lattner	cf56917053	set isBarrier correctly llvm-svn: 30936	2006-10-13 19:10:34 +00:00
Chris Lattner	7374bc0577	mark adjcallstack up/down as clobbering and using the SP llvm-svn: 30908	2006-10-12 17:56:34 +00:00
Evan Cheng	577ef7694e	Add properties to ComplexPattern. llvm-svn: 30891	2006-10-11 21:03:53 +00:00
Evan Cheng	e71fe34d75	Reflects ISD::LOAD / ISD::LOADX / LoadSDNode changes. llvm-svn: 30844	2006-10-09 20:57:25 +00:00
Chris Lattner	67f8cc51f4	Use abstract private/comment directives, to increase portability to ppc/linux llvm-svn: 30621	2006-09-27 02:55:21 +00:00
Nate Begeman	d31efd190f	Fold AND and ROTL more often llvm-svn: 30577	2006-09-22 05:01:56 +00:00
Evan Cheng	81b645a76b	CALLSEQ_* produces chain even if that's not needed. llvm-svn: 29603	2006-08-11 09:03:33 +00:00
Chris Lattner	4f8eb5ccaf	bswapped load/store instructions are only availble in indexed addressing form. As such, use xoaddr (indexed only), not xaddr for address selection. This fixes CodeGen/PowerPC/2006-07-19-stwbrx-crash.ll, a crash compiling lencod. llvm-svn: 29208	2006-07-19 17:15:36 +00:00
Chris Lattner	b00b6c2e86	Make the implicit def instructions look like other instrs. llvm-svn: 29174	2006-07-18 16:33:26 +00:00
Chris Lattner	a7976d329e	Implement Regression/CodeGen/PowerPC/bswap-load-store.ll by folding bswaps into i16/i32 load/stores. llvm-svn: 29089	2006-07-10 20:56:58 +00:00
Chris Lattner	3b5873456e	Add 64-bit MTCTR so that indirect calls work. llvm-svn: 28931	2006-06-27 18:36:44 +00:00
Chris Lattner	d48ce27532	Implement 64-bit undef, sub, shl/shr, srem/urem llvm-svn: 28929	2006-06-27 18:18:41 +00:00
Chris Lattner	97b3da1519	Implement a bunch of 64-bit cleanliness work. With this, treeadd builds (but doesn't work right). llvm-svn: 28921	2006-06-27 00:04:13 +00:00
Chris Lattner	b6a65f4661	Remove two more definitions llvm-svn: 28918	2006-06-26 22:47:37 +00:00
Chris Lattner	86e6046515	remove two unused instructions. llvm-svn: 28917	2006-06-26 22:44:13 +00:00
Chris Lattner	1f1b096142	Make these predicates correct in 64-bit mode too. llvm-svn: 28890	2006-06-20 23:21:20 +00:00
Chris Lattner	52a956da52	Rename OR4 -> OR. Move some PPC64-specific stuff to the 64-bit file llvm-svn: 28889	2006-06-20 23:18:58 +00:00
Chris Lattner	5705d4d519	remove unused flag llvm-svn: 28888	2006-06-20 23:15:07 +00:00
Chris Lattner	7a856a6d88	remove some unused patterns llvm-svn: 28886	2006-06-20 23:11:36 +00:00
Chris Lattner	7e742e46ac	Add some 64-bit logical ops. Split imm16Shifted into a sext/zext form for 64-bit support. Add some patterns for immediate formation. For example, we now compile this: static unsigned long long Y; void test3() { Y = 0xF0F00F00; } into: _test3: li r2, 3840 lis r3, ha16(_Y) xoris r2, r2, 61680 std r2, lo16(_Y)(r3) blr GCC produces: _test3: li r0,0 lis r2,ha16(_Y) ori r0,r0,61680 sldi r0,r0,16 ori r0,r0,3840 std r0,lo16(_Y)(r2) blr llvm-svn: 28883	2006-06-20 22:34:10 +00:00
Chris Lattner	d6e160d14d	64-bit bugfix: 0xFFFF0000 cannot be formed with a single lis. llvm-svn: 28880	2006-06-20 21:39:30 +00:00
Chris Lattner	868a75bec6	Remove some now-unneeded casts from instruction patterns. With the casts removed, tblgen produces identical output to with them in. llvm-svn: 28867	2006-06-20 00:39:56 +00:00
Chris Lattner	e8fe5e2bf4	In 64-bit mode, addr mode operands use G8RC instead of GPRC. llvm-svn: 28840	2006-06-16 21:29:03 +00:00
Chris Lattner	a5190ae7a9	fix some assumptions that pointers can only be 32-bits. With this, we can now compile: static unsigned long X; void test1() { X = 0; } into: _test1: lis r2, ha16(_X) li r3, 0 stw r3, lo16(_X)(r2) blr Totally amazing :) llvm-svn: 28839	2006-06-16 21:01:35 +00:00
Chris Lattner	b429983988	Split 64-bit instructions out into a separate .td file llvm-svn: 28838	2006-06-16 20:22:01 +00:00
Chris Lattner	006b2c6ab9	Fix a problem exposed by the local allocator. CALL instructions are not marked as using incoming argument registers, so the local allocator would clobber them between their set and use. To fix this, we give the call instructions a variable number of uses in the CALL MachineInstr itself, so live variables understands the live ranges of these register arguments. llvm-svn: 28744	2006-06-10 01:14:28 +00:00
Chris Lattner	c8587d4b81	Add PowerPC intrinsics to support dcbz[l] llvm-svn: 28696	2006-06-06 21:29:23 +00:00
Chris Lattner	eb755fc1b3	Make PPC call lowering more aggressive, making the isel matching code simple enough to be autogenerated. llvm-svn: 28354	2006-05-17 19:00:46 +00:00
Chris Lattner	b1e9e37c58	Switch PPC over to a call-selection model where the lowering code creates the copyto/fromregs instead of making the PPCISD::CALL selection code create them. This vastly simplifies the selection code, and moves the ABI handling parts into one place. llvm-svn: 28346	2006-05-17 06:01:33 +00:00
Nate Begeman	4ca2ea5b43	JumpTable support! What this represents is working asm and jit support for x86 and ppc for 100% dense switch statements when relocations are non-PIC. This support will be extended and enhanced in the coming days to support PIC, and less dense forms of jump tables. llvm-svn: 27947	2006-04-22 18:53:45 +00:00
Chris Lattner	34c901b50e	These are correctly encoded by the JIT. I checked :) llvm-svn: 27810	2006-04-18 19:03:38 +00:00
Chris Lattner	9754d142a4	Implement an important entry from README_ALTIVEC: If an altivec predicate compare is used immediately by a branch, don't use a (serializing) MFCR instruction to read the CR6 register, which requires a compare to get it back to CR's. Instead, just branch on CR6 directly. :) For example, for: void foo2(vector float A, vector float B) { if (!vec_any_eq(A, B)) *B = (vector float){0,0,0,0}; } We now generate: _foo2: mfspr r2, 256 oris r5, r2, 12288 mtspr 256, r5 lvx v2, 0, r4 lvx v3, 0, r3 vcmpeqfp. v2, v3, v2 bne cr6, LBB1_2 ; UnifiedReturnBlock LBB1_1: ; cond_true vxor v2, v2, v2 stvx v2, 0, r4 mtspr 256, r2 blr LBB1_2: ; UnifiedReturnBlock mtspr 256, r2 blr instead of: _foo2: mfspr r2, 256 oris r5, r2, 12288 mtspr 256, r5 lvx v2, 0, r4 lvx v3, 0, r3 vcmpeqfp. v2, v3, v2 mfcr r3, 2 rlwinm r3, r3, 27, 31, 31 cmpwi cr0, r3, 0 beq cr0, LBB1_2 ; UnifiedReturnBlock LBB1_1: ; cond_true vxor v2, v2, v2 stvx v2, 0, r4 mtspr 256, r2 blr LBB1_2: ; UnifiedReturnBlock mtspr 256, r2 blr This implements CodeGen/PowerPC/vec_br_cmp.ll. llvm-svn: 27804	2006-04-18 17:59:36 +00:00
Chris Lattner	0a3d1bbca4	Add VRRC select support llvm-svn: 27543	2006-04-08 22:45:08 +00:00
Chris Lattner	d7495ae7e9	Lower vector compares to VCMP nodes, just like we lower vector comparison predicates to VCMPo nodes. llvm-svn: 27285	2006-03-31 05:13:27 +00:00
Chris Lattner	cb5ec07cc3	Use normal lvx for scalar_to_vector instead of lve*x. They do the exact same thing and we have a dag node for the former. llvm-svn: 27205	2006-03-28 01:43:22 +00:00
Chris Lattner	6961fc76bb	Codegen vector predicate compares. llvm-svn: 27151	2006-03-26 10:06:40 +00:00
Chris Lattner	2a85fa1f79	Move all Altivec stuff out into a new PPCInstrAltivec.td file. Add a bunch of patterns for different datatypes, e.g. bit_convert, undef and zero vector support. llvm-svn: 27117	2006-03-25 07:51:43 +00:00
Chris Lattner	1cb91b3cd9	Add some basic patterns for other datatypes llvm-svn: 27116	2006-03-25 07:39:07 +00:00
Chris Lattner	f653cdd3f9	Add support for __builtin_altivec_vnmsubfp /vmaddfp llvm-svn: 27112	2006-03-25 07:05:55 +00:00
Chris Lattner	2771e2c960	Codegen things like: <int -1, int -1, int -1, int -1> and <int 65537, int 65537, int 65537, int 65537> Using things like: vspltisb v0, -1 and: vspltish v0, 1 instead of using constant pool loads. This implements CodeGen/PowerPC/vec_splat.ll:splat_imm_i{32\|16}. llvm-svn: 27106	2006-03-25 06:12:06 +00:00
Chris Lattner	d589dd1352	Fix a bad JIT encoding of VPERM. Why is VPERM D,A,B,C but vfmadd is D,A,C,B ?? llvm-svn: 27069	2006-03-24 18:24:43 +00:00
Chris Lattner	ab882abce8	add support for using vxor to build zero vectors. This implements Regression/CodeGen/PowerPC/vec_zero.ll llvm-svn: 27059	2006-03-24 07:48:08 +00:00
Chris Lattner	f5efddf80b	Gabor points out that we can't spell. :) llvm-svn: 27049	2006-03-24 07:12:19 +00:00
Chris Lattner	81137629e0	Add PPC vector bit-convert support llvm-svn: 26995	2006-03-23 19:54:27 +00:00
Chris Lattner	4a66d69433	When possible, custom lower 32-bit SINT_TO_FP to this: _foo2: extsw r2, r3 std r2, -8(r1) lfd f0, -8(r1) fcfid f0, f0 frsp f1, f0 blr instead of this: _foo2: lis r2, ha16(LCPI2_0) lis r4, 17200 xoris r3, r3, 32768 stw r3, -4(r1) stw r4, -8(r1) lfs f0, lo16(LCPI2_0)(r2) lfd f1, -8(r1) fsub f0, f1, f0 frsp f1, f0 blr This speeds up Misc/pi from 2.44s->2.09s with LLC and from 3.01->2.18s with llcbeta (16.7% and 38.1% respectively). llvm-svn: 26943	2006-03-22 05:30:33 +00:00
Chris Lattner	4e7371758f	Fix the JIT encoding of the VAForm_1 instructions, including vmaddfp llvm-svn: 26935	2006-03-22 01:44:36 +00:00
Chris Lattner	d2132f87d7	When codegen'ing vector MUL using VFMADD, add the 0, don't mul the 0. llvm-svn: 26913	2006-03-21 00:51:38 +00:00
Chris Lattner	a1bc294f0c	Fix a couple of bugs in permute/splat generate, thanks to Nate for actually figuring these out! :) llvm-svn: 26904	2006-03-20 18:26:51 +00:00
Chris Lattner	f96d523b8f	Fix the pattern for VADDUWM, add i32 splat llvm-svn: 26901	2006-03-20 17:51:58 +00:00
Evan Cheng	89f3cff0f5	Use tblgen'd VECTOR_SHUFFLE selection code. llvm-svn: 26900	2006-03-20 08:14:16 +00:00
Chris Lattner	a9a1313386	Add support for generating vspltw, instead of a vperm instruction with a constant pool load. This generates significantly nicer code for splats. When tblgen gets bugfixed, we can remove the custom selection code. llvm-svn: 26898	2006-03-20 06:51:10 +00:00
Chris Lattner	382f356bd9	Check in some intermediate code that adds a skeleton for matching vsplt* instructions llvm-svn: 26894	2006-03-20 06:15:45 +00:00
Chris Lattner	93d99f9928	fix typo llvm-svn: 26889	2006-03-20 05:05:55 +00:00
Chris Lattner	366b2514fa	add vsplat instructions, fix sched description for vperm llvm-svn: 26888	2006-03-20 04:47:33 +00:00
Chris Lattner	a8713b1ee6	Custom lower arbitrary VECTOR_SHUFFLE's to VPERM. TODO: leave specific ones as VECTOR_SHUFFLE's and turn them into specialized operations like vsplt* llvm-svn: 26887	2006-03-20 01:53:53 +00:00
Chris Lattner	e7a058de7d	add the vperm instruction llvm-svn: 26883	2006-03-20 01:00:56 +00:00
Chris Lattner	7e9440a4fc	Custom lower SCALAR_TO_VECTOR into lve*x. llvm-svn: 26868	2006-03-19 06:55:52 +00:00
Chris Lattner	5b595af956	add support for vector undef llvm-svn: 26863	2006-03-19 06:10:09 +00:00
Chris Lattner	0c9eb670bb	minor fixes llvm-svn: 26857	2006-03-19 05:43:01 +00:00
Chris Lattner	431c90c9fa	we don't use lmw/stmw. When we want them they are easy enough to add llvm-svn: 26853	2006-03-19 04:33:37 +00:00
Nate Begeman	21f87d0e4c	Fix subfic to match subc by default instead of sub so that it is correctly cost-modeled as producing a flag. This fixes the test I just added for neg llvm-svn: 26835	2006-03-17 22:41:37 +00:00
Nate Begeman	bb01d4f272	Remove BRTWOWAY* Make the PPC backend not dependent on BRTWOWAY_CC and make the branch selector smarter about the code it generates, fixing a case in the readme. llvm-svn: 26814	2006-03-17 01:40:33 +00:00
Chris Lattner	1e6dfa4c1f	Strangely, calls clobber call-clobbered vector regs. Whodathoughtit? llvm-svn: 26808	2006-03-16 22:35:59 +00:00
Chris Lattner	fd9f3e8ed3	Add support for copying registers. still needed: spilling and reloading them llvm-svn: 26800	2006-03-16 20:03:58 +00:00
Nate Begeman	2e1fde7c5c	Update scheduling info for vrsave instruction llvm-svn: 26776	2006-03-15 05:25:05 +00:00
Chris Lattner	02e2c18c9c	For functions that use vector registers, save VRSAVE, mark used registers, and update it on entry to each function, then restore it on exit. This compiles: void func(vfloat a, vfloat b, vfloat c) { a = b c + c; } to this: _func: mfspr r2, 256 oris r6, r2, 49152 mtspr 256, r6 lvx v0, 0, r5 lvx v1, 0, r4 vmaddfp v0, v1, v0, v0 stvx v0, 0, r3 mtspr 256, r2 blr GCC produces this (which has additional stack accesses): _func: mfspr r0,256 stw r0,-4(r1) oris r0,r0,0xc000 mtspr 256,r0 lvx v0,0,r5 lvx v1,0,r4 lwz r12,-4(r1) vmaddfp v0,v0,v1,v0 stvx v0,0,r3 mtspr 256,r12 blr llvm-svn: 26733	2006-03-13 21:52:10 +00:00
Chris Lattner	7579cfb1a0	Mark instructions that are cracked by the PPC970 decoder as such. llvm-svn: 26720	2006-03-13 05:15:10 +00:00
Chris Lattner	51348c5f27	Several big changes: 1. Use flags on the instructions in the .td file to indicate the PPC970 unit type instead of a table in the .cpp file. Much cleaner. 2. Change the hazard recognizer to build d-groups according to the actual algorithm used, not my flawed understanding of it. 3. Model "must be in the first slot" and "must be the only instr in a group" accurately. llvm-svn: 26719	2006-03-12 09:13:49 +00:00
Chris Lattner	ea79d9fd73	implement TII::insertNoop llvm-svn: 26562	2006-03-05 23:49:55 +00:00
Chris Lattner	27f5345b1f	Compile this: void foo(float a, int b) { b = a; } to this: _foo: fctiwz f0, f1 stfiwx f0, 0, r4 blr instead of this: _foo: fctiwz f0, f1 stfd f0, -8(r1) lwz r2, -4(r1) stw r2, 0(r4) blr This implements CodeGen/PowerPC/stfiwx.ll, and also incidentally does the right thing for GCC bugzilla 26505. llvm-svn: 26447	2006-03-01 05:50:56 +00:00
Nate Begeman	5965bd19f8	kill ADD_PARTS & SUB_PARTS and replace them with fancy new ADDC, ADDE, SUBC and SUBE nodes that actually expose what's going on and allow for significant simplifications in the targets. llvm-svn: 26255	2006-02-17 05:43:56 +00:00
Nate Begeman	bc3ec1d37b	Add missing patterns for andi. and andis., fixing test/Regression/CodeGen/ PowerPC/and-imm.ll llvm-svn: 26136	2006-02-12 09:09:52 +00:00
Chris Lattner	1240574609	PHI and INLINEASM are now built-in instructions provided by Target.td llvm-svn: 25674	2006-01-27 01:46:15 +00:00
Chris Lattner	268d3584fc	ahem :) llvm-svn: 25239	2006-01-12 02:05:36 +00:00
Nate Begeman	1b8121b227	Add bswap, rotl, and rotr nodes Add dag combiner code to recognize rotl, rotr Add ppc code to match rotl Targets should add rotl/rotr patterns if they have them llvm-svn: 25222	2006-01-11 21:21:00 +00:00
Nate Begeman	477933cfbd	Remove a comment that no longer applies. llvm-svn: 25167	2006-01-10 00:15:59 +00:00
Chris Lattner	bfb2de9030	add ret void support back llvm-svn: 25164	2006-01-09 23:20:37 +00:00
Evan Cheng	7785e5b3a4	New DAG node properties SNDPInFlag, SNDPOutFlag, and SNDPOptInFlag to replace hasInFlag, hasOutFlag. llvm-svn: 25155	2006-01-09 18:28:21 +00:00
Jim Laskey	762e9ec06c	Added initial support for DEBUG_LABEL allowing debug specific labels to be inserted in the code. llvm-svn: 25104	2006-01-05 01:25:28 +00:00
Jim Laskey	0da76a676a	Add unique id to debug location for debug label use (work in progress.) llvm-svn: 25096	2006-01-04 15:04:11 +00:00
Nate Begeman	336dba6fb1	Add support for generating v4i32 altivec code llvm-svn: 25046	2005-12-30 00:12:56 +00:00
Evan Cheng	14c53b45f5	Added field noResults to Instruction. Currently tblgen cannot tell which operands in the operand list are results so it assumes the first one is a result. This is bad. Ideally we would fix this by separating results from inputs, e.g. (res R32:$dst), (ops R32:$src1, R32:$src2). But that's a more distruptive change. Adding 'let noResults = 1' is the workaround to tell tblgen that the instruction does not produces a result. It works for now since tblgen does not support instructions which produce multiple results. llvm-svn: 25017	2005-12-26 09:11:45 +00:00
Evan Cheng	9ae486047e	* Removed the use of FLAG. Now use hasFlagIn and hasFlagOut instead. * Added a pseudo instruction (for each target) that represent "return void". This is a workaround for lack of optional flag operand (return void is not lowered so it does not have a flag operand.) llvm-svn: 24997	2005-12-23 22:14:32 +00:00
Evan Cheng	82285c55aa	Flip the meaning of FPContractions to reflect Requires<[]> change. llvm-svn: 24884	2005-12-20 20:08:53 +00:00
Nate Begeman	b11b8e44fa	Pattern-match return. Includes gross hack! llvm-svn: 24874	2005-12-20 00:26:01 +00:00
Nate Begeman	8e6a8af205	Convert load/store over to being pattern matched llvm-svn: 24871	2005-12-19 23:25:09 +00:00
Jim Laskey	7c462768ed	Added source file/line correspondence for dwarf (PowerPC only at this point.) llvm-svn: 24748	2005-12-16 22:45:29 +00:00
Nate Begeman	672578bd94	Add a second vector type to the VRRC register class, and fix some patterns so that tablegen can infer all types. llvm-svn: 24746	2005-12-16 09:19:13 +00:00
Nate Begeman	e37cb604c1	Use the new predicate support that Evan Cheng added to remove some code from the DAGToDAG cpp file. This adds pattern support for vector and scalar fma, which passes test/Regression/CodeGen/PowerPC/fma.ll, and does the right thing in the presence of -disable-excess-fp-precision. Allows us to match: void %foo(<4 x float> * %a) { entry: %tmp1 = load <4 x float> * %a; %tmp2 = mul <4 x float> %tmp1, %tmp1 %tmp3 = add <4 x float> %tmp2, %tmp1 store <4 x float> %tmp3, <4 x float> *%a ret void } As: _foo: li r2, 0 lvx v0, r2, r3 vmaddfp v0, v0, v0, v0 stvx v0, r2, r3 blr Or, with llc -disable-excess-fp-precision, _foo: li r2, 0 lvx v0, r2, r3 vxor v1, v1, v1 vmaddfp v1, v0, v0, v1 vaddfp v0, v1, v0 stvx v0, r2, r3 blr llvm-svn: 24719	2005-12-14 22:54:33 +00:00
Evan Cheng	3db275d996	Added predicate !NoExcessFPPrecision to FMADD, FMADDS, FMSUB, and FMSUBS. llvm-svn: 24716	2005-12-14 22:07:12 +00:00
Nate Begeman	40f081d8e0	Add support for fmul node of type v4f32. void %foo(<4 x float> * %a) { entry: %tmp1 = load <4 x float> * %a; %tmp2 = mul <4 x float> %tmp1, %tmp1 store <4 x float> %tmp2, <4 x float> *%a ret void } Is selected to: _foo: li r2, 0 lvx v0, r2, r3 vxor v1, v1, v1 vmaddfp v0, v0, v0, v1 stvx v0, r2, r3 blr llvm-svn: 24701	2005-12-14 00:34:09 +00:00
Nate Begeman	69caef2b78	Prepare support for AltiVec multiply, divide, and sqrt. llvm-svn: 24700	2005-12-13 22:55:22 +00:00
Chris Lattner	090eed0483	Remove type casts that are no longer needed llvm-svn: 24661	2005-12-11 07:45:47 +00:00
Nate Begeman	4e56db674c	Add support for TargetConstantPool nodes to the dag isel emitter, and use them in the PPC backend, to simplify some logic out of Select and SelectAddr. llvm-svn: 24657	2005-12-10 02:36:00 +00:00
Nate Begeman	ade6f9a255	Add support patterns to many load and store instructions which will hopefully use patterns in the near future. llvm-svn: 24651	2005-12-09 23:54:18 +00:00

... 3 4 5 6 7 ...

482 Commits