llvm-project

Commit Graph

Author	SHA1	Message	Date
Bob Wilson	9eeb890172	Set alignment operand for NEON VLD instructions. llvm-svn: 114696	2010-09-23 21:43:54 +00:00
Chris Lattner	0e023ea02a	fix a long standing wart: all the ComplexPattern's were being passed the root of the match, even though only a few patterns actually needed this (one in X86, several in ARM [which should be refactored anyway], and some in CellSPU that I don't feel like detangling). Instead of requiring all ComplexPatterns to take the dead root, have targets opt into getting the root by putting SDNPWantRoot on the ComplexPattern. llvm-svn: 114471	2010-09-21 20:31:19 +00:00
Eric Christopher	726838a3e5	Fix QOpcode assignment to Opc. llvm-svn: 113837	2010-09-14 08:31:25 +00:00
Bob Wilson	c597fd3b4a	Convert some VTBL and VTBX instructions to use pseudo instructions prior to register allocation. Remove the NEONPreAllocPass, which is no longer needed. Yeah!! llvm-svn: 113818	2010-09-13 23:55:10 +00:00
Bob Wilson	d5c57a5ed4	Switch all the NEON vld-lane and vst-lane instructions over to the new pseudo-instruction approach. Change ARMExpandPseudoInsts to use a table to record all the NEON load/store information. llvm-svn: 113812	2010-09-13 23:01:35 +00:00
Chris Lattner	f43cb302ca	remove some dead code. t2addrmode_imm8s4 is never used in a pattern, so there is no need to define a matching function. llvm-svn: 113122	2010-09-05 22:51:11 +00:00
Bob Wilson	35fafca587	Finish converting the rest of the NEON VLD instructions to use pseudo- instructions prior to regalloc. Since it's getting a little close to the 2.8 branch deadline, I'll have to leave the rest of the instructions handled by the NEONPreAllocPass for now, but I didn't want to leave half of the VLD instructions converted and the other half not. llvm-svn: 112983	2010-09-03 18:16:02 +00:00
Bob Wilson	75a6408f88	Convert VLD1 and VLD2 instructions to use pseudo-instructions until after regalloc. llvm-svn: 112825	2010-09-02 16:00:54 +00:00
Chris Lattner	39eccb4754	temporarily revert r112664, it is causing a decoding conflict, and the testcases should be merged. llvm-svn: 112711	2010-09-01 16:00:50 +00:00
Bill Wendling	6789f8b6ae	We have a chance for an optimization. Consider this code: int x(int t) { if (t & 256) return -26; return 0; } We generate this: tst.w r0, #256 mvn r0, #25 it eq moveq r0, #0 while gcc generates this: ands r0, r0, #256 it ne mvnne r0, #25 bx lr Scandalous really! During ISel time, we can look for this particular pattern. One where we have a "MOVCC" that uses the flag off of a CMPZ that itself is comparing an AND instruction to 0. Something like this (greatly simplified): %r0 = ISD::AND ... ARMISD::CMPZ %r0, 0 @ sets [CPSR] %r0 = ARMISD::MOVCC 0, -26 @ reads [CPSR] All we have to do is convert the "ISD::AND" into an "ARM::ANDS" that sets [CPSR] when it's zero. The zero value will all ready be in the %r0 register and we only need to change it if the AND wasn't zero. Easy! llvm-svn: 112664	2010-08-31 22:41:22 +00:00
Bob Wilson	950882be07	Use pseudo instructions for VST1 and VST2. llvm-svn: 112357	2010-08-28 05:12:57 +00:00
Bob Wilson	8ee9394750	We don't need to custom-select VLDMQ and VSTMQ anymore. llvm-svn: 112336	2010-08-28 00:20:11 +00:00
Bob Wilson	13ce07fa92	Change ARM VFP VLDM/VSTM instructions to use addressing mode #4 , just like all the other LDM/STM instructions. This fixes asm printer crashes when compiling with -O0. I've changed one of the NEON tests (vst3.ll) to run with -O0 to check this in the future. Prior to this change VLDM/VSTM used addressing mode #5, but not really. The offset field was used to hold a count of the number of registers being loaded or stored, and the AM5 opcode field was expanded to specify the IA or DB mode, instead of the standard ADD/SUB specifier. Much of the backend was not aware of these special cases. The crashes occured when rewriting a frameindex caused the AM5 offset field to be changed so that it did not have a valid submode. I don't know exactly what changed to expose this now. Maybe we've never done much with -O0 and NEON. Regardless, there's no longer any reason to keep a count of the VLDM/VSTM registers, so we can use addressing mode #4 and clean things up in a lot of places. llvm-svn: 112322	2010-08-27 23:18:17 +00:00
Bob Wilson	97919e9c59	Use pseudo instructions for VST3. llvm-svn: 112208	2010-08-26 18:51:29 +00:00
Bob Wilson	4cec44975e	Use pseudo instructions for VST1d64Q. llvm-svn: 112170	2010-08-26 05:33:30 +00:00
Bob Wilson	9392b0e960	Start converting NEON load/stores to use pseudo instructions, beginning here with the VST4 instructions. Until after register allocation, we want to represent sets of adjacent registers by a single super-register. These VST4 pseudo instructions have a single QQ or QQQQ source register operand. They get expanded to the real VST4 instructions with 4 separate D register operands. Once this conversion is complete, we'll be able to remove the NEONPreAllocPass and avoid some fragile and hacky code elsewhere. llvm-svn: 112108	2010-08-25 23:27:42 +00:00
Jakob Stoklund Olesen	e2cbaf6ed7	Don't call tablegen'ed Predicate_* functions in the ARM target. llvm-svn: 111277	2010-08-17 20:39:04 +00:00
Evan Cheng	59069ec784	Add -disable-shifter-op to disable isel of shifter ops. On Cortex-a9 the shifts cost extra instructions so it might be better to emit them separately to take advantage of dual-issues. llvm-svn: 109934	2010-07-30 23:33:54 +00:00
Bob Wilson	5bc8a79e7f	Also use REG_SEQUENCE for VTBX instructions. llvm-svn: 107743	2010-07-07 00:08:54 +00:00
Bob Wilson	3ed511bc6b	Use REG_SEQUENCE nodes to make the table registers for VTBL instructions be allocated to consecutive registers. llvm-svn: 107730	2010-07-06 23:36:25 +00:00
Duncan Sands	78ad27ca2b	Remove an unused and a pointless variable. llvm-svn: 107131	2010-06-29 13:00:29 +00:00
Dan Gohman	f1d8304fe3	Eliminate unnecessary uses of getZExtValue(). llvm-svn: 106279	2010-06-18 14:22:04 +00:00
Bob Wilson	01ac8f9fc0	Remove the hidden "neon-reg-sequence" option. The reg sequences are working now, so there's no need to disable them. llvm-svn: 106155	2010-06-16 21:34:01 +00:00
Bob Wilson	d8a9a04739	For NEON vectors with 32- or 64-bit elements, select BUILD_VECTORs and VECTOR_SHUFFLEs to REG_SEQUENCE instructions. The standard ISD::BUILD_VECTOR node corresponds closely to REG_SEQUENCE but I couldn't use it here because its operands do not get legalized. That is pretty awful, but I guess it makes sense for other targets. Instead, I have added an ARM-specific version of BUILD_VECTOR that will have its operands properly legalized. This fixes the rest of Radar 7872877. llvm-svn: 105439	2010-06-04 00:04:02 +00:00
Dale Johannesen	d679ff7330	Early implementation of tail call for ARM. A temporary flag -arm-tail-calls defaults to off, so there is no functional change by default. Intrepid users may try this; simple cases work but there are bugs. llvm-svn: 105413	2010-06-03 21:09:53 +00:00
Jim Grosbach	84511e1526	Clean up 80 column violations. No functional change. llvm-svn: 105350	2010-06-02 21:53:11 +00:00
Bob Wilson	b6112e8706	Add the cc_out operand for t2RSBrs instructions. I missed this when I changed the instruction class for t2RSB to add that operand in svn r104582. Radar 8033757. llvm-svn: 104907	2010-05-28 00:27:15 +00:00
Jakob Stoklund Olesen	8d042c0269	Fix a few places that depended on the numeric value of subreg indices. Add assertions in places that depend on consecutive indices. llvm-svn: 104510	2010-05-24 17:13:28 +00:00
Jakob Stoklund Olesen	6c47d6423c	Switch ARMRegisterInfo.td to use SubRegIndex and eliminate the parallel enums from ARMRegisterInfo.h llvm-svn: 104508	2010-05-24 16:54:32 +00:00
Evan Cheng	e89f5ae9d4	Target instruction selection should copy memoperands. llvm-svn: 104110	2010-05-19 06:06:09 +00:00
Evan Cheng	3d98b996ff	Turn on -neon-reg-sequence by default. Using NEON load / store multiple instructions will no longer create gobs of vmov of D registers! llvm-svn: 103960	2010-05-17 19:51:20 +00:00
Evan Cheng	298e6b82eb	Model vst lane instructions with REG_SEQUENCE. llvm-svn: 103898	2010-05-16 03:27:48 +00:00
Evan Cheng	9e688cbcc9	Model 128-bit vld lane with REG_SEQUENCE. llvm-svn: 103868	2010-05-15 07:53:37 +00:00
Evan Cheng	0cbd11dfb2	Model 64-bit lane vld with REG_SEQUENCE. llvm-svn: 103851	2010-05-15 01:36:29 +00:00
Evan Cheng	cb78e5558b	Model VST_UPD and VSToddUPD pair with REG_SEQUENCE. llvm-svn: 103833	2010-05-14 22:54:52 +00:00
Evan Cheng	cfa7d02d6e	Model VLD_UPD and VLDodd_UPD pair with REG_SEQUENCE. llvm-svn: 103790	2010-05-14 18:54:59 +00:00
Evan Cheng	ca21cc8b13	Fix comments. llvm-svn: 103749	2010-05-14 00:21:45 +00:00
Evan Cheng	e276c18385	Model some vst3 and vst4 with reg_sequence. llvm-svn: 103453	2010-05-11 01:19:40 +00:00
Evan Cheng	630063aa0d	Model some vld3 instructions with REG_SEQUENCE. llvm-svn: 103437	2010-05-10 21:26:24 +00:00
Evan Cheng	c2ae5f546f	Model vld2 / vst2 with reg_sequence. llvm-svn: 103411	2010-05-10 17:34:18 +00:00
Bob Wilson	f765e1f34a	Add a missing break statement to fix unintentional fall-through (replacing the previous patch for the same issue). llvm-svn: 103183	2010-05-06 16:05:26 +00:00
Jim Grosbach	5e3cccb1e4	Fix unintentional fallthrough. Patch by Edmund Grimley-Evans <Edmund.Grimley-Evans@arm.com> llvm-svn: 103181	2010-05-06 15:32:49 +00:00
Evan Cheng	d85631e700	Model CONCAT_VECTORS of two 64-bit values as a REG_SEQUENCE. llvm-svn: 103104	2010-05-05 18:28:36 +00:00
Evan Cheng	8e6b40a881	With -neon-reg-sequence, models forming a Q register from a pair of consecutive D registers as a REG_SEQUENCE. llvm-svn: 103047	2010-05-04 20:39:49 +00:00
Jim Grosbach	825cb299cd	Update ARM DAGtoDAG for matching UBFX instruction for unsigned bitfield extraction. This fixes PR5998. llvm-svn: 102144	2010-04-22 23:24:18 +00:00
Dan Gohman	21cea8ac2e	Use const qualifiers with TargetLowering. This eliminates several const_casts, and it reinforces the design of the Target classes being immutable. SelectionDAGISel::IsLegalToFold is now a static member function, because PIC16 uses it in an unconventional way. There is more room for API cleanup here. And PIC16's AsmPrinter no longer uses TargetLowering. llvm-svn: 101635	2010-04-17 15:26:15 +00:00
Evan Cheng	3da64f7672	Use getAL() rather than a major constant. llvm-svn: 101446	2010-04-16 05:46:06 +00:00
Evan Cheng	f7f97b4bbd	Use default lowering of DYNAMIC_STACKALLOC. As far as I can tell, ARM isle is doing the right thing and codegen looks correct for both Thumb and Thumb2. llvm-svn: 101410	2010-04-15 22:20:34 +00:00
Evan Cheng	1ba1428577	ARM SelectDYN_ALLOC should emit a copy from SP rather than referencing SP directly. In cases where there are two dyn_alloc in the same BB it would have caused the old SP value to be reused and badness ensues. rdar://7493908 llvm is generating poor code for dynamic alloca, I'll fix that later. llvm-svn: 101383	2010-04-15 18:42:28 +00:00
Bob Wilson	59f75bba24	Fix VLDMQ and VSTMQ instructions to use the correct encoding and address modes. These instructions are only needed for codegen, so I've removed all the explicit encoding bits for now; they should be set in the same way as the for VLDMD and VSTMD whenever we add encodings for VFP. The use of addrmode5 requires that the instructions be custom-selected so that the number of registers can be set in the AM5Opc value. llvm-svn: 99309	2010-03-23 18:54:46 +00:00

1 2 3 4 5 ...

321 Commits