llvm-project

Commit Graph

Author	SHA1	Message	Date
Bob Wilson	0bfbd9b68c	Fix a crash compiling 254.gap for Thumb2. The Thumb2 add/sub with 12-bit immediate instructions cannot set the condition codes, so they do not have the extra cc_out operand. We hit an assertion during tail duplication because the instruction being duplicated had more operands that expected. llvm-svn: 98001	2010-03-08 22:56:15 +00:00
Bob Wilson	5638c36efd	Handle AddrMode6 (for NEON load/stores) in Thumb2's rewriteT2FrameIndex. Radar 7614112. llvm-svn: 95456	2010-02-06 00:24:38 +00:00
Jakob Stoklund Olesen	bdc17f6840	Remove predicates when changing an add into an unpredicable mov. Since the mov is executed unconditionally, make sure that the add didn't have any predicate. llvm-svn: 93909	2010-01-19 21:08:28 +00:00
Dan Gohman	047a767d74	Remove the target hook TargetInstrInfo::BlockHasNoFallThrough in favor of MachineBasicBlock::canFallThrough(), which is target-independent and more thorough. llvm-svn: 90634	2009-12-05 00:44:40 +00:00
Evan Cheng	fe864425cb	Refactor code. llvm-svn: 86423	2009-11-08 00:15:23 +00:00
Jim Grosbach	4e9f379554	80-column cleanup of file header comments llvm-svn: 86408	2009-11-07 22:00:39 +00:00
Evan Cheng	8b5278a466	t2ldrpci_pic can be used for blockaddress as well. llvm-svn: 86400	2009-11-07 19:40:04 +00:00
Evan Cheng	a8e8a7c976	Refactor code. Fix a potential missing check. Teach isIdentical() about tLDRpci_pic. llvm-svn: 86330	2009-11-07 04:04:34 +00:00
Evan Cheng	7ff831962a	- Add TargetInstrInfo::isIdentical(). It's similar to MachineInstr::isIdentical except it doesn't care if the definitions' virtual registers differ. This is used by machine LICM and other MI passes to perform CSE. - Teach Thumb2InstrInfo::isIdentical() to check two t2LDRpci_pic are identical. Since pc relative constantpool entries are always different, this requires it it check if the values can actually the same. llvm-svn: 86328	2009-11-07 03:52:02 +00:00
Evan Cheng	207b246650	- Add pseudo instructions tLDRpci_pic and t2LDRpci_pic which does a pc-relative load of a GV from constantpool and then add pc. It allows the code sequence to be rematerializable so it would be hoisted by machine licm. - Add a late pass to break these pseudo instructions into a number of real instructions. Also move the code in Thumb2 IT pass that breaks up t2MOVi32imm to this pass. This is done before post regalloc scheduling to allow the scheduler to proper schedule these instructions. It also allow them to be if-converted and shrunk by later passes. llvm-svn: 86304	2009-11-06 23:52:48 +00:00
Anton Korobeynikov	14635da94b	Use NEON reg-reg moves, where profitable. This reduces "domain-cross" stalls, when we used to mix vfp and neon code (the former were used for reg-reg moves) llvm-svn: 85764	2009-11-02 00:10:38 +00:00
Evan Cheng	1a4492be97	Fix a couple more places where we are creating ld / st instructions without memoperands. llvm-svn: 85746	2009-11-01 22:04:35 +00:00
Bob Wilson	73789b848d	Add a Thumb BRIND pattern. Change the ARM BRIND assembly to separate the opcode and operand with a tab. Check for these instructions in the usual places. llvm-svn: 85411	2009-10-28 18:26:41 +00:00
Bob Wilson	967bf27de2	Handle AddrMode4 for Thumb2 in rewriteT2FrameIndex. This occurs for VLDM/VSTM instructions, and without this check, the code assumes that an offset is allowed, as it would be with VLDR/VSTR. The asm printer, however, silently drops the offset, producing incorrect code. Since the address register in this case is either the stack or frame pointer, the spill location ends up conflicting with some other stack slot or with outgoing arguments on the stack. llvm-svn: 81879	2009-09-15 17:56:18 +00:00
Evan Cheng	7a37b1a2ca	Fix PR4789. Teach eliminateFrameIndex how to handle VLDRQ and VSTRQ which cannot fold any immediate offset. llvm-svn: 80191	2009-08-27 01:23:50 +00:00
Jim Grosbach	f24f9d9cb6	Whitespace cleanup. Remove trailing whitespace. llvm-svn: 78666	2009-08-11 15:33:49 +00:00
Evan Cheng	5b4c308f0c	Always use the 16-bit tMOVgpr2gpr instead of the 32-bit t2MOVr. llvm-svn: 78549	2009-08-10 02:06:53 +00:00
Evan Cheng	f0237b1aa6	Use 16-bit tMOVgpr2gpr instead of tMOVr to copy GPR registers in Thumb2 mode. llvm-svn: 78398	2009-08-07 19:34:35 +00:00
Evan Cheng	b972e5633f	It turns out most of the thumb2 instructions are not allowed to touch SP. The semantics of such instructions are unpredictable. We have just been lucky that tests have been passing. This patch takes pain to ensure all the PEI lowering code does the right thing when lowering frame indices, insert code to manipulate stack pointers, etc. It's also custom lowering dynamic stack alloc into pseudo instructions so we can insert the right instructions at scheduling time. This fixes PR4659 and PR4682. llvm-svn: 78361	2009-08-07 00:34:42 +00:00
Evan Cheng	8b9deebba3	Use the i12 variant of load / store opcodes if offset is zero. Now we pass all of multisource as well. llvm-svn: 77939	2009-08-03 02:38:06 +00:00
Chris Lattner	e98a3c3ca3	Move the getInlineAsmLength virtual method from TAI to TII, where the only real caller (GetFunctionSizeInBytes) uses it. The custom ARM implementation of this is basically reimplementing an assembler poorly for negligible gain. It should be removed IMNSHO, but I'll leave that to ARMish folks to decide. llvm-svn: 77877	2009-08-02 05:20:37 +00:00
Evan Cheng	c6d70ae063	Optimize Thumb2 jumptable to use tbb / tbh when all the offsets fit in byte / halfword. llvm-svn: 77422	2009-07-29 02:18:14 +00:00
David Goodwin	0830980141	Thumb-2: fix typo that caused incorrect stack elimination for VFP operations and very large stack frames. llvm-svn: 77401	2009-07-28 23:52:33 +00:00
Evan Cheng	780748d565	- More refactoring. This gets rid of all of the getOpcode calls. - This change also makes it possible to switch between ARM / Thumb on a per-function basis. - Fixed thumb2 routine which expand reg + arbitrary immediate. It was using using ARM so_imm logic. - Use movw and movt to do reg + imm when profitable. - Other code clean ups and minor optimizations. llvm-svn: 77300	2009-07-28 05:48:47 +00:00
Evan Cheng	38b7eee164	More DCE. llvm-svn: 77231	2009-07-27 18:48:45 +00:00
Evan Cheng	18688f431d	Get rid of more dead code. llvm-svn: 77227	2009-07-27 18:38:54 +00:00
Evan Cheng	056c669e93	Get rid of some more getOpcode calls. This also fixes potential problems in ARMBaseInstrInfo routines not recognizing thumb1 instructions when 32-bit and 16-bit instructions mix. llvm-svn: 77218	2009-07-27 18:20:05 +00:00
Evan Cheng	c47e109335	Use t2LDRi12 and t2STRi12 to load / store to / from stack frames. Eliminate more getOpcode calls. llvm-svn: 77181	2009-07-27 03:14:20 +00:00
Evan Cheng	186332f898	Use the right instructions to copy between GPR and the more strictive tGPR classes. t2MOV does not match the RC requirements. llvm-svn: 77175	2009-07-27 00:33:08 +00:00
Evan Cheng	c1a5cfa9a0	Get rid of a couple of unnecessary getOpcode calls. llvm-svn: 77035	2009-07-25 01:25:08 +00:00
Evan Cheng	f3a1fce8ae	Change Thumb2 jumptable codegen to one that uses two level jumps: Before: adr r12, #LJTI3_0_0 ldr pc, [r12, +r0, lsl #2] LJTI3_0_0: .long LBB3_24 .long LBB3_30 .long LBB3_31 .long LBB3_32 After: adr r12, #LJTI3_0_0 add pc, r12, +r0, lsl #2 LJTI3_0_0: b.w LBB3_24 b.w LBB3_30 b.w LBB3_31 b.w LBB3_32 This has several advantages. 1. This will make it easier to optimize this to a TBB / TBH instruction + (smaller) table. 2. This eliminate the need for ugly asm printer hack to force the address into thumb addresses (bit 0 is one). 3. Same codegen for pic and non-pic. 4. This eliminate the need to align the table so constantpool island pass won't have to over-estimate the size. Based on my calculation, the later is probably slightly faster as well since ldr pc with shifter address is very slow. That is, it should be a win as long as the HW implementation can do a reasonable job of branch predict the second branch. llvm-svn: 77024	2009-07-25 00:33:29 +00:00
Evan Cheng	886f303480	Clean up. llvm-svn: 76984	2009-07-24 18:20:16 +00:00
Evan Cheng	6cfbe61361	FLDD, FLDS, FCPYD, FCPYS, FSTD, FSTS, VMOVD, VMOVQ maps to the same instructions on all sub-targets. llvm-svn: 76925	2009-07-24 00:53:56 +00:00
David Goodwin	cdd405d804	Correctly handle the Thumb-2 imm8 addrmode. Specialize frame index elimination more exactly for Thumb-2 to get better code gen. llvm-svn: 76919	2009-07-24 00:16:18 +00:00
David Goodwin	6deba28c6f	Fix frame index elimination to correctly handle thumb-2 addressing modes that don't allow negative offsets. During frame elimination convert i12 opcode to a i8 when necessary due to a negative offset. llvm-svn: 76883	2009-07-23 17:06:46 +00:00
Anton Korobeynikov	c5df7e2dc1	Emit cross regclass register moves for thumb2. Minor code duplication cleanup. llvm-svn: 76124	2009-07-16 23:26:06 +00:00
Evan Cheng	cd4cdd1157	Major changes to Thumb (not Thumb2). Many 16-bit instructions either modifies CPSR when they are outside the IT blocks, or they can predicated when in Thumb2. Move the implicit def of CPSR to an optional def which defaults CPSR. This allows the 's' bit to be toggled dynamically. A side-effect of this change is asm printer is now using unified assembly. There are some minor clean ups and fixes as well. llvm-svn: 75359	2009-07-11 06:43:01 +00:00
David Goodwin	c9b1efd515	t2LDM_RET does not fall-through. llvm-svn: 75250	2009-07-10 15:33:46 +00:00
David Goodwin	22c2fba978	Use common code for both ARM and Thumb-2 instruction and register info. llvm-svn: 75067	2009-07-08 23:10:31 +00:00
David Goodwin	03ab0bbb24	Generalize opcode selection in ARMBaseRegisterInfo. llvm-svn: 75036	2009-07-08 20:28:28 +00:00
David Goodwin	af7451b674	Checkpoint Thumb2 Instr info work. Generalized base code so that it can be shared between ARM and Thumb2. Not yet activated because register information must be generalized first. llvm-svn: 75010	2009-07-08 16:09:28 +00:00
David Goodwin	ade05a37f1	Checkpoint refactoring of ThumbInstrInfo and ThumbRegisterInfo into Thumb1InstrInfo, Thumb2InstrInfo, Thumb1RegisterInfo and Thumb2RegisterInfo. Move methods from ARMInstrInfo to ARMBaseInstrInfo to prepare for sharing with Thumb2. llvm-svn: 74731	2009-07-02 22:18:33 +00:00

42 Commits