llvm-project

Commit Graph

Author	SHA1	Message	Date
Jim Grosbach	08aa534239	ARM: Fix ELF global base reg intialization. The create machine code wasn't properly in SSA, which the machine verifier properly complains about. Now that fast-isel is closer to verifier clean, errors like this show up more clearly. Additionally, the Thumb pseudo tPICADD was used for both ARM and Thumb mode functions, which is obviously wrong. Fix that along the way. Test case is part of the following commit which will finish making an additional fast-isel test verifier clean an enable it for the regression test suite. This commit is separate since its not just a verifier cleanup, but an actual correctness issue. rdar://12594152 (for the fast-isel verifier aspects) llvm-svn: 189269	2013-08-26 20:07:25 +00:00
Benjamin Kramer	309206667d	When initializing the PIC global base register on ARM/ELF add pc to fix the address. This unbreaks PIC with fast isel on ELF targets (PR16717). The output matches what GCC and SDag do for PIC but may not cover all of the many flavors of PIC that exist. llvm-svn: 188551	2013-08-16 12:52:08 +00:00
Chandler Carruth	26ad41ed6e	Create a constant pool symbol for the GOT in the ARMCGBR the same way we do in the SDag when lowering references to the GOT: use ARMConstantPoolSymbol rather than creating a dummy global variable. The computation of the alignment still feels weird (it uses IR types and datalayout) but it preserves the exact previous behavior. This change fixes the memory leak of the global variable detected on the valgrind leak checking bot. Thanks to Benjamin Kramer for pointing me at ARMConstantPoolSymbol to handle this use case. llvm-svn: 187303	2013-07-27 11:58:26 +00:00
Bill Wendling	f95178e679	Don't cache the instruction and register info from the TargetMachine, because the internals of TargetMachine could change. llvm-svn: 183488	2013-06-07 05:54:19 +00:00
Chandler Carruth	9fb823bbd4	Move all of the header files which are involved in modelling the LLVM IR into their new header subdirectory: include/llvm/IR. This matches the directory structure of lib, and begins to correct a long standing point of file layout clutter in LLVM. There are still more header files to move here, but I wanted to handle them in separate commits to make tracking what files make sense at each layer easier. The only really questionable files here are the target intrinsic tablegen files. But that's a battle I'd rather not fight today. I've updated both CMake and Makefile build systems (I think, and my tests think, but I may have missed something). I've also re-sorted the includes throughout the project. I'll be committing updates to Clang, DragonEgg, and Polly momentarily. llvm-svn: 171366	2013-01-02 11:36:10 +00:00
Micah Villmow	cdfe20b97f	Move TargetData to DataLayout. llvm-svn: 165402	2012-10-08 16:38:25 +00:00
Jush Lu	47172a064f	[arm-fast-isel] Add support for ELF PIC. This is a preliminary step towards ELF support; currently ARMFastISel hasn't been used for ELF object files yet. llvm-svn: 164759	2012-09-27 05:21:41 +00:00
Jim Grosbach	cb540f5cff	ARM: Define generic HINT instruction. The NOP, WFE, WFI, SEV and YIELD instructions are all hints w/ a different immediate value in bits [7,0]. Define a generic HINT instruction and refactor NOP, WFI, WFI, SEV and YIELD to be assembly aliases of that. rdar://11600518 llvm-svn: 158674	2012-06-18 19:45:50 +00:00
Jim Grosbach	617f84ddbd	ARM implement TargetInstrInfo::getNoopForMachoTarget() Without this hook, functions w/ a completely empty body (including no epilogue) will cause an MCEmitter assertion failure. For example, define internal fastcc void @empty_function() { unreachable } rdar://10947471 llvm-svn: 151673	2012-02-28 23:53:30 +00:00
Jia Liu	b22310fda6	Emacs-tag and some comment fix for all ARM, CellSPU, Hexagon, MBlaze, MSP430, PPC, PTX, Sparc, X86, XCore. llvm-svn: 150878	2012-02-18 12:03:15 +00:00
Owen Anderson	16d33f36d5	invalid-LDR_PRE-arm.txt was already passing, but for the wrong reasons. We were failing to specify enough fixed bits of LDR_PRE/LDRB_PRE, resulting in decoding conflicts. Separate them into immediate vs. register versions, allowing us to specify the necessary fixed bits. This in turn results in the test being decoded properly, and being rejected as UNPREDICTABLE rather than a hard failure. llvm-svn: 138653	2011-08-26 20:43:14 +00:00
Owen Anderson	2aedba6c5e	Split am2offset into register addend and immediate addend forms, necessary for allowing the fixed-length disassembler to distinguish between SBFX and STR_PRE. llvm-svn: 136141	2011-07-26 20:54:26 +00:00
Evan Cheng	a20cde31e7	Sink ARMMCExpr and ARMAddressingModes into MC layer. First step to separate ARM MC code from target. llvm-svn: 135636	2011-07-20 23:34:39 +00:00
Evan Cheng	1e210d08d8	Merge XXXGenRegisterNames.inc into XXXGenRegisterInfo.inc llvm-svn: 134024	2011-06-28 20:07:07 +00:00
Evan Cheng	f478cf9685	Eliminate ARM::MOVi2pieces. Just use MOVi32imm and expand it to either movi+orr or movw+movt depending on the subtarget. llvm-svn: 118938	2010-11-12 23:03:38 +00:00
Jim Grosbach	4a0c2d73c3	Convert ARM::MOVi2pieces to a true pseudo-instruction and expand it in the ARMExpandPseudos pass rather than during the asm lowering. llvm-svn: 117714	2010-10-29 21:35:25 +00:00
Jim Grosbach	338de3ee56	Refactor ARM STR/STRB instruction patterns into STR{B}i12 and STR{B}rs, like the LDR instructions have. This makes the literal/register forms of the instructions explicit and allows us to assign scheduling itineraries appropriately. rdar://8477752 llvm-svn: 117505	2010-10-27 23:12:14 +00:00
Jim Grosbach	5a7c715470	Split ARM::LDRB into LDRBi12 and LDRBrs. Adjust accordingly. Continuing on rdar://8477752. llvm-svn: 117419	2010-10-27 00:19:44 +00:00
Jim Grosbach	1e4d9a17c2	First part of refactoring ARM addrmode2 (load/store) instructions to be more explicit about the operands. Split out the different variants into separate instructions. This gives us the ability to, among other things, assign different scheduling itineraries to the variants. rdar://8477752. llvm-svn: 117409	2010-10-26 22:37:02 +00:00
Jakob Stoklund Olesen	a8ad97743d	Slightly change the meaning of the reMaterialize target hook when the original instruction defines subregisters. Any existing subreg indices on the original instruction are preserved or composed with the new subreg index. Also substitute multiple operands mentioning the original register by using the new MachineInstr::substituteRegister() function. This is necessary because there will soon be <imp-def> operands added to non read-modify-write partial definitions. This instruction: %reg1234:foo = FLAP %reg1234<imp-def> will reMaterialize(%reg3333, bar) like this: %reg3333:bar-foo = FLAP %reg333:bar<imp-def> Finally, replace the TargetRegisterInfo pointer argument with a reference to indicate that it cannot be NULL. llvm-svn: 105358	2010-06-02 22:47:25 +00:00
Dan Gohman	047a767d74	Remove the target hook TargetInstrInfo::BlockHasNoFallThrough in favor of MachineBasicBlock::canFallThrough(), which is target-independent and more thorough. llvm-svn: 90634	2009-12-05 00:44:40 +00:00
Evan Cheng	6ad7da96fe	- Change TargetInstrInfo::reMaterialize to pass in TargetRegisterInfo. - If destination is a physical register and it has a subreg index, use the sub-register instead. This fixes PR5423. llvm-svn: 88745	2009-11-14 02:55:43 +00:00
Evan Cheng	fe864425cb	Refactor code. llvm-svn: 86423	2009-11-08 00:15:23 +00:00
Evan Cheng	207b246650	- Add pseudo instructions tLDRpci_pic and t2LDRpci_pic which does a pc-relative load of a GV from constantpool and then add pc. It allows the code sequence to be rematerializable so it would be hoisted by machine licm. - Add a late pass to break these pseudo instructions into a number of real instructions. Also move the code in Thumb2 IT pass that breaks up t2MOVi32imm to this pass. This is done before post regalloc scheduling to allow the scheduler to proper schedule these instructions. It also allow them to be if-converted and shrunk by later passes. llvm-svn: 86304	2009-11-06 23:52:48 +00:00
Anton Korobeynikov	14635da94b	Use NEON reg-reg moves, where profitable. This reduces "domain-cross" stalls, when we used to mix vfp and neon code (the former were used for reg-reg moves) llvm-svn: 85764	2009-11-02 00:10:38 +00:00
Bob Wilson	73789b848d	Add a Thumb BRIND pattern. Change the ARM BRIND assembly to separate the opcode and operand with a tab. Check for these instructions in the usual places. llvm-svn: 85411	2009-10-28 18:26:41 +00:00
Evan Cheng	5457a96b63	Trim more includes. llvm-svn: 84832	2009-10-22 05:11:00 +00:00
Evan Cheng	83e0d481ae	Make ARM and Thumb2 32-bit immediate materialization into a single 32-bit pseudo instruction. This makes it re-materializable. Thumb2 will split it back out into two instructions so IT pass will generate the right mask. Also, this expose opportunies to optimize the movw to a 16-bit move. llvm-svn: 82982	2009-09-28 09:14:39 +00:00
Anton Korobeynikov	c30d816d7a	Fix thinko in my recent movt commit: it's not safe to remat movt, since it has input reg argument. Disable rematting of it for now. llvm-svn: 82975	2009-09-28 07:26:46 +00:00
Anton Korobeynikov	7c2b1e71c1	Use movt/movw pair to materialize 32 bit constants on ARMv6T2+. This should be better than single load from constpool. llvm-svn: 82948	2009-09-27 23:52:58 +00:00
Chris Lattner	7b26fce23e	Rename TargetAsmInfo (and its subclasses) to MCAsmInfo. llvm-svn: 79763	2009-08-22 20:48:53 +00:00
Evan Cheng	f43cf709cb	Remove ARM specific getInlineAsmLength. We'll rely on the simpler (and faster) generic algorithm for now. If more accurate computation is needed, we'll rely on the disassembler. llvm-svn: 78032	2009-08-04 01:56:09 +00:00
Chris Lattner	e98a3c3ca3	Move the getInlineAsmLength virtual method from TAI to TII, where the only real caller (GetFunctionSizeInBytes) uses it. The custom ARM implementation of this is basically reimplementing an assembler poorly for negligible gain. It should be removed IMNSHO, but I'll leave that to ARMish folks to decide. llvm-svn: 77877	2009-08-02 05:20:37 +00:00
Evan Cheng	780748d565	- More refactoring. This gets rid of all of the getOpcode calls. - This change also makes it possible to switch between ARM / Thumb on a per-function basis. - Fixed thumb2 routine which expand reg + arbitrary immediate. It was using using ARM so_imm logic. - Use movw and movt to do reg + imm when profitable. - Other code clean ups and minor optimizations. llvm-svn: 77300	2009-07-28 05:48:47 +00:00
Evan Cheng	38b7eee164	More DCE. llvm-svn: 77231	2009-07-27 18:48:45 +00:00
Evan Cheng	18688f431d	Get rid of more dead code. llvm-svn: 77227	2009-07-27 18:38:54 +00:00
Evan Cheng	056c669e93	Get rid of some more getOpcode calls. This also fixes potential problems in ARMBaseInstrInfo routines not recognizing thumb1 instructions when 32-bit and 16-bit instructions mix. llvm-svn: 77218	2009-07-27 18:20:05 +00:00
Evan Cheng	c47e109335	Use t2LDRi12 and t2STRi12 to load / store to / from stack frames. Eliminate more getOpcode calls. llvm-svn: 77181	2009-07-27 03:14:20 +00:00
Evan Cheng	f3a1fce8ae	Change Thumb2 jumptable codegen to one that uses two level jumps: Before: adr r12, #LJTI3_0_0 ldr pc, [r12, +r0, lsl #2] LJTI3_0_0: .long LBB3_24 .long LBB3_30 .long LBB3_31 .long LBB3_32 After: adr r12, #LJTI3_0_0 add pc, r12, +r0, lsl #2 LJTI3_0_0: b.w LBB3_24 b.w LBB3_30 b.w LBB3_31 b.w LBB3_32 This has several advantages. 1. This will make it easier to optimize this to a TBB / TBH instruction + (smaller) table. 2. This eliminate the need for ugly asm printer hack to force the address into thumb addresses (bit 0 is one). 3. Same codegen for pic and non-pic. 4. This eliminate the need to align the table so constantpool island pass won't have to over-estimate the size. Based on my calculation, the later is probably slightly faster as well since ldr pc with shifter address is very slow. That is, it should be a win as long as the HW implementation can do a reasonable job of branch predict the second branch. llvm-svn: 77024	2009-07-25 00:33:29 +00:00
Evan Cheng	6cfbe61361	FLDD, FLDS, FCPYD, FCPYS, FSTD, FSTS, VMOVD, VMOVQ maps to the same instructions on all sub-targets. llvm-svn: 76925	2009-07-24 00:53:56 +00:00
David Goodwin	cdd405d804	Correctly handle the Thumb-2 imm8 addrmode. Specialize frame index elimination more exactly for Thumb-2 to get better code gen. llvm-svn: 76919	2009-07-24 00:16:18 +00:00
David Goodwin	6deba28c6f	Fix frame index elimination to correctly handle thumb-2 addressing modes that don't allow negative offsets. During frame elimination convert i12 opcode to a i8 when necessary due to a negative offset. llvm-svn: 76883	2009-07-23 17:06:46 +00:00
Evan Cheng	84517443ca	Let callers decide the sub-register index on the def operand of rematerialized instructions. Avoid remat'ing instructions whose def have sub-register indices for now. It's just really really hard to get all the cases right. llvm-svn: 75900	2009-07-16 09:20:10 +00:00
David Goodwin	03ab0bbb24	Generalize opcode selection in ARMBaseRegisterInfo. llvm-svn: 75036	2009-07-08 20:28:28 +00:00
David Goodwin	af7451b674	Checkpoint Thumb2 Instr info work. Generalized base code so that it can be shared between ARM and Thumb2. Not yet activated because register information must be generalized first. llvm-svn: 75010	2009-07-08 16:09:28 +00:00
David Goodwin	ade05a37f1	Checkpoint refactoring of ThumbInstrInfo and ThumbRegisterInfo into Thumb1InstrInfo, Thumb2InstrInfo, Thumb1RegisterInfo and Thumb2RegisterInfo. Move methods from ARMInstrInfo to ARMBaseInstrInfo to prepare for sharing with Thumb2. llvm-svn: 74731	2009-07-02 22:18:33 +00:00
Evan Cheng	d379e896ff	Handle IMPLICIT_DEF with isUndef operand marker, part 2. This patch moves the code to annotate machineoperands to LiveIntervalAnalysis. It also add markers for implicit_def that define physical registers. The rest, is just a lot of details. llvm-svn: 74580	2009-07-01 01:59:31 +00:00
David Goodwin	28d6d87244	Improve Thumb-2 jump table support. llvm-svn: 74549	2009-06-30 19:50:22 +00:00
David Goodwin	27303cde82	Add conditional and unconditional thumb-2 branch. Add thumb-2 jump table. llvm-svn: 74543	2009-06-30 18:04:13 +00:00
Anton Korobeynikov	0f2158b35f	Simplify a bit llvm-svn: 74385	2009-06-27 12:59:03 +00:00

1 2 3 4

151 Commits