llvm-project

Commit Graph

Author	SHA1	Message	Date
Jim Grosbach	bb0547d9c4	Pseudo-ize VMOVDcc and VMOVScc. llvm-svn: 127506	2011-03-11 23:09:50 +00:00
Bob Wilson	00d09428fe	Remove unused conditional negate operations. llvm-svn: 127090	2011-03-05 16:54:31 +00:00
Evan Cheng	04ad35b53f	VFP single precision arith instructions can go down to NEON pipeline, but on Cortex-A8 only. llvm-svn: 126238	2011-02-22 19:53:14 +00:00
Evan Cheng	4a8c43fe6d	Some single precision VFP instructions may be executed on NEON pipeline, but not double precision ones. llvm-svn: 125624	2011-02-16 00:35:02 +00:00
Bruno Cardoso Lopes	2082057b18	Create two new generic classes to represent the following VMRS/VMSR variations: vmrs reg, fpexc vmrs reg, fpsid vmsr fpexc, reg vmsr fpsid, reg llvm-svn: 123783	2011-01-18 21:58:20 +00:00
Bob Wilson	e5863d6639	Fix a comment: We now have intrinsics for vcvtr. llvm-svn: 123246	2011-01-11 17:56:41 +00:00
Chris Lattner	2a0a3b43d7	Flag -> Glue, the ongoing saga llvm-svn: 122513	2010-12-23 18:28:41 +00:00
Evan Cheng	62c7b5bf76	Making use of VFP / NEON floating point multiply-accumulate / subtraction is difficult on current ARM implementations for a few reasons. 1. Even though a single vmla has latency that is one cycle shorter than a pair of vmul + vadd, a RAW hazard during the first (4? on Cortex-a8) can cause additional pipeline stall. So it's frequently better to single codegen vmul + vadd. 2. A vmla folowed by a vmul, vmadd, or vsub causes the second fp instruction to stall for 4 cycles. We need to schedule them apart. 3. A vmla followed vmla is a special case. Obvious issuing back to back RAW vmla + vmla is very bad. But this isn't ideal either: vmul vadd vmla Instead, we want to expand the second vmla: vmla vmul vadd Even with the 4 cycle vmul stall, the second sequence is still 2 cycles faster. Up to now, isel simply avoid codegen'ing fp vmla / vmls. This works well enough but it isn't the optimial solution. This patch attempts to make it possible to use vmla / vmls in cases where it is profitable. A. Add missing isel predicates which cause vmla to be codegen'ed. B. Make sure the fmul in (fadd (fmul)) has a single use. We don't want to compute a fmul and a fmla. C. Add additional isel checks for vmla, avoid cases where vmla is feeding into fp instructions (except for the #3 exceptional case). D. Add ARM hazard recognizer to model the vmla / vmls hazards. E. Add a special pre-regalloc case to expand vmla / vmls when it's likely the vmla / vmls will trigger one of the special hazards. Work in progress, only A+B are enabled. llvm-svn: 120960	2010-12-05 22:04:16 +00:00
Bill Wendling	9898ac97fd	Proper encoding for VLDM and VSTM instructions. The register lists for these instructions have to distinguish between lists of single- and double-precision registers in order for the ASM matcher to do a proper job. In all other respects, a list of single- or double-precision registers are the same as a list of GPR registers. llvm-svn: 119460	2010-11-17 04:32:08 +00:00
Bill Wendling	02089a39a0	vldm and vstm are mnemonics for vldmia and vstmia resp. llvm-svn: 119321	2010-11-16 02:00:24 +00:00
Bill Wendling	a68e3a5397	Encode the multi-load/store instructions with their respective modes ('ia', 'db', 'ib', 'da') instead of having that mode as a separate field in the instruction. It's more convenient for the asm parser and much more readable for humans. <rdar://problem/8654088> llvm-svn: 119310	2010-11-16 01:16:36 +00:00
Bill Wendling	705ec77ab5	Add uses of the *_ldst_multi multiclasses. These aren't used yet. llvm-svn: 118999	2010-11-13 10:57:02 +00:00
Bill Wendling	c4c642832d	Convert the modes to lower case. llvm-svn: 118998	2010-11-13 10:43:34 +00:00
Bill Wendling	e69afc6bb7	Add *_ldst_mult multiclasses to the ARM back-end. These will be used in the future to separate out the ia, ib, da, db variants of the load/store multiple instructions. llvm-svn: 118995	2010-11-13 09:09:38 +00:00
Evan Cheng	2d59ee34f1	Add some missing isel predicates on def : pat patterns to avoid generating VFP vmla / vmls (they cause stalls). Disabling them in isel is properly not a right solution, I'll look into a proper solution next. llvm-svn: 118922	2010-11-12 20:32:20 +00:00
Bill Wendling	a91d02bc61	Add "write back" bit encoding. llvm-svn: 118446	2010-11-08 21:28:03 +00:00
Bill Wendling	c002463ac4	Add encoding for VSTR. llvm-svn: 118220	2010-11-04 00:59:42 +00:00
Bill Wendling	e84eb99cbb	The MC code couldn't handle ARM LDR instructions with negative offsets: vldr.64 d1, [r0, #-32] The problem was with how the addressing mode 5 encodes the offsets. This change makes sure that the way offsets are handled in addressing mode 5 is consistent throughout the MC code. It involves re-refactoring the "getAddrModeImmOpValue" method into an "Imm12" and "addressing mode 5" version. But not to worry! The majority of the duplicated code has been unified. llvm-svn: 118144	2010-11-03 01:49:29 +00:00
Jim Grosbach	c6af2b4066	Break ARM addrmode4 (load/store multiple base address) into its constituent parts. Represent the operation mode as an optional operand instead. rdar://8614429 llvm-svn: 118137	2010-11-03 01:01:43 +00:00
Bill Wendling	603bd8f54c	Rename getAddrModeImm12OpValue to getAddrModeImmOpValue and expand it to work with immediates up to 16-bits in size. The same logic is applied to other LDR encodings, e.g. VLDR, but which use a different immediate bit width (8-bits in VLDR's case). Removing the "12" allows it to be more generic. llvm-svn: 118094	2010-11-02 22:31:46 +00:00
Bill Wendling	3f37ade36e	Missed reverting this bit. llvm-svn: 117971	2010-11-01 23:17:54 +00:00
Bill Wendling	f7e176a3ec	Minor cleanup. llvm-svn: 117969	2010-11-01 23:11:22 +00:00
Bill Wendling	418bd53008	Move the machine operand MC encoding patterns to the parent classes. llvm-svn: 117956	2010-11-01 21:17:06 +00:00
Bill Wendling	2623343625	Move instruction encoding bits into the parent class and remove the temporary *_Encode classes. These instructions are the only ones which use those classes, so a subclass isn't necessary. llvm-svn: 117906	2010-11-01 06:00:39 +00:00
Chris Lattner	33fc3e095b	reapply r117858 with apparent editor malfunction fixed (somehow I got a dulicated line). llvm-svn: 117860	2010-10-31 19:10:56 +00:00
Chris Lattner	e59eef3dd1	revert r117858 while I check out a failure I missed. llvm-svn: 117859	2010-10-31 19:05:32 +00:00
Chris Lattner	9293008e90	the asm matcher can't handle operands with modifiers (like ${foo:bar}). Instead of silently ignoring these instructions, emit a hard error and force the target author to either refactor the target or mark the instruction 'isCodeGenOnly'. Mark a few instructions in ARM and MBlaze as isCodeGenOnly the are doing this. llvm-svn: 117858	2010-10-31 18:48:12 +00:00
Jim Grosbach	bbe2bbd7f7	Add FIXME. llvm-svn: 117787	2010-10-30 14:54:23 +00:00
Bill Wendling	a65f914bb0	Add encoding for moving a value between two ARM core registers and a doublework extension register. llvm-svn: 116970	2010-10-20 23:37:40 +00:00
Bill Wendling	058190507b	Add encodings for movement between ARM core registers and single-precision registers. llvm-svn: 116961	2010-10-20 22:44:54 +00:00
Bill Wendling	399add01d4	Reformatting. No functionalogicality changes. llvm-svn: 116625	2010-10-15 21:50:45 +00:00
Bill Wendling	6f52f8a87d	Add support for vmov.f64/.f32 encoding. There's a bit of a hack going on here. The f32 in FCONSTS is handled as a double instead of a float in the code. So the encoding of the immediate into the instruction isn't exactly in line with the documentation in that regard. But given that we know it's handled as a double, it doesn't cause any harm. llvm-svn: 116471	2010-10-14 02:33:26 +00:00
Bill Wendling	0441c6cba0	Add encoding for 'fmstat'. llvm-svn: 116466	2010-10-14 01:19:34 +00:00
Bill Wendling	0825f3e441	- Add encodings for multiply add/subtract instructions in all their glory. - Add missing patterns for some multiply add/subtract instructions. - Add encodings for VMRS and VMSR. llvm-svn: 116464	2010-10-14 01:02:08 +00:00
Bill Wendling	f106ecfa59	Add MC encodings for VCVT* instrunctions. llvm-svn: 116431	2010-10-13 20:58:46 +00:00
Bill Wendling	6e27b4f530	Add encodings for VNEG and VSQRT. Also add encodings for VMOV, but not a test just yet. llvm-svn: 116386	2010-10-13 01:17:33 +00:00
Bill Wendling	576fd0b110	Add encodings for VCVT instructions. llvm-svn: 116385	2010-10-13 00:56:35 +00:00
Bill Wendling	da4ddf0fcf	Add VCMPZ and VABS. llvm-svn: 116383	2010-10-13 00:38:07 +00:00
Bill Wendling	f9ca535495	Refactor VCMP instructions. llvm-svn: 116379	2010-10-13 00:04:29 +00:00
Bill Wendling	7dd8c0b991	Add encodings for VNMUL[SD]. llvm-svn: 116375	2010-10-12 23:47:37 +00:00
Bill Wendling	a06aee826c	Add encodings for VDIV and VMUL. llvm-svn: 116370	2010-10-12 23:22:27 +00:00
Bill Wendling	42200bcaea	Refactor some of the encoding logic into a base class. This keeps us from having to add 10+ lines to every instruction. It may turn out that we can move this base class into it's parent class. llvm-svn: 116362	2010-10-12 23:06:54 +00:00
Bill Wendling	646a506724	Add encoding for VSUB and VCMP. Fear not! I'm going to try a refactoring right now. :) llvm-svn: 116359	2010-10-12 22:55:35 +00:00
Bill Wendling	ac6cd00706	Encoding for VADDD. Plus a test for the VFP instructions. llvm-svn: 116348	2010-10-12 22:08:41 +00:00
Jim Grosbach	576640f0e3	Encoding for ARM-mode VADD.F32 instruction. llvm-svn: 116338	2010-10-12 21:22:40 +00:00
Evan Cheng	1958cefd69	Model operand cycles of vldm / vstm; also fixes scheduling itineraries of vldr / vstr, etc. llvm-svn: 115898	2010-10-07 01:50:48 +00:00
Eric Christopher	e68635acdb	Fix typo. llvm-svn: 114931	2010-09-28 00:35:33 +00:00
Jim Grosbach	abcbe2474d	VFP/NEON load/store multiple instructions are addrmode4, not 5. llvm-svn: 113322	2010-09-08 00:25:50 +00:00
Bob Wilson	13ce07fa92	Change ARM VFP VLDM/VSTM instructions to use addressing mode #4 , just like all the other LDM/STM instructions. This fixes asm printer crashes when compiling with -O0. I've changed one of the NEON tests (vst3.ll) to run with -O0 to check this in the future. Prior to this change VLDM/VSTM used addressing mode #5, but not really. The offset field was used to hold a count of the number of registers being loaded or stored, and the AM5 opcode field was expanded to specify the IA or DB mode, instead of the standard ADD/SUB specifier. Much of the backend was not aware of these special cases. The crashes occured when rewriting a frameindex caused the AM5 offset field to be changed so that it did not have a valid submode. I don't know exactly what changed to expose this now. Maybe we've never done much with -O0 and NEON. Regardless, there's no longer any reason to keep a count of the VLDM/VSTM registers, so we can use addressing mode #4 and clean things up in a lot of places. llvm-svn: 112322	2010-08-27 23:18:17 +00:00
Daniel Dunbar	07cc87438f	ARM: Mark some disassembler only instructions as not available for matching -- for some reason they have a very odd MCInst form where the operands overlap, but I haven't dug in to find out why yet. llvm-svn: 110781	2010-08-11 04:46:13 +00:00
Nate Begeman	b69b182191	Add support for getting & setting the FPSCR application register on ARM when VFP is enabled. Add support for using the FPSCR in conjunction with the vcvtr instruction, for controlling fp to int rounding. Add support for the FLT_ROUNDS_ node now that the FPSCR is exposed. llvm-svn: 110152	2010-08-03 21:31:55 +00:00
Jim Grosbach	84511e1526	Clean up 80 column violations. No functional change. llvm-svn: 105350	2010-06-02 21:53:11 +00:00
Evan Cheng	dd7f566597	Mark pattern-less mayLoad / mayStore instructions neverHasSideEffects. These do not have other un-modeled side effects. llvm-svn: 104111	2010-05-19 06:07:03 +00:00
Evan Cheng	79efd71962	Mark some pattern-less instructions as neverHasSideEffects. llvm-svn: 103683	2010-05-13 00:16:46 +00:00
Anton Korobeynikov	2063705d91	Define new itin classes for ARM <-> VFP reg moves to distinguish from NEON ops. Define proper scheduling itinerary for them on A9. A8 TRM does not specify latency for them at all :( llvm-svn: 100650	2010-04-07 18:20:02 +00:00
Anton Korobeynikov	c1e7a6feac	FCONST{S,D} behaves the same way as FP unary instructions. This is true for both A8 and A9. llvm-svn: 100649	2010-04-07 18:19:56 +00:00
Anton Korobeynikov	4c1da0f82a	Add new itin classes for FP16 <-> FP32 conversions and make uise of them for A9. llvm-svn: 100647	2010-04-07 18:19:46 +00:00
Jim Grosbach	34de7768bf	Make the use of the vmla and vmls VFP instructions controllable via cmd line. Preliminary testing shows significant performance wins by not using these instructions. llvm-svn: 99436	2010-03-24 22:31:46 +00:00
Bob Wilson	2497d85c9e	Revert the rest of 98679. --- Reverse-merging r98679 into 'lib/Target/ARM/ARMInstrVFP.td': U lib/Target/ARM/ARMInstrVFP.td llvm-svn: 99049	2010-03-20 06:34:02 +00:00
Bob Wilson	e4191e719b	Revert this change, since it was causing ARM performance regressions. --- Reverse-merging r98889 into '.': U lib/Target/ARM/ARMInstrNEON.td U lib/Target/ARM/ARMISelLowering.h U lib/Target/ARM/ARMInstrInfo.td U lib/Target/ARM/ARMInstrVFP.td U lib/Target/ARM/ARMISelLowering.cpp U lib/Target/ARM/ARMInstrFormats.td llvm-svn: 99010	2010-03-19 22:51:32 +00:00
Anton Korobeynikov	f11aa9e7b4	Get rid of target-specific fp <-> int nodes when still I'm here. llvm-svn: 98889	2010-03-18 22:35:45 +00:00
Anton Korobeynikov	64578d5599	Get rid of target-specific nodes for fp16 <-> fp32 conversion. llvm-svn: 98888	2010-03-18 22:35:37 +00:00
Johnny Chen	71ab18bdd5	Disambiguate the _UPD and variants by specifying the writeback flag as 1. This is for the disassembly work. There are cases where this is not possible, for example, A8.6.53 LDM Encoding T1. In such case, we'll use an adhoc approach to deduce the Opcode programmatically. llvm-svn: 98679	2010-03-16 21:25:05 +00:00
Bob Wilson	466d1e3dc2	Remove redundant writeback flag in ARM addressing mode 5. llvm-svn: 98648	2010-03-16 18:38:09 +00:00
Anton Korobeynikov	d7fece38fc	Add codegen support for FP16 on ARM llvm-svn: 98502	2010-03-14 18:42:31 +00:00
Bob Wilson	f1e8f7ff7d	Attempt to appease the arm-linux buildbot by fixing the JIT encodings for new base register updating load/store-multiple instructions. llvm-svn: 98427	2010-03-13 07:34:35 +00:00
Bob Wilson	947f04bad0	Change ARM ld/st multiple instructions to have variant instructions for writebacks to the address register. This gets rid of the hack that the first register on the list was the magic writeback register operand. There was an implicit constraint that if that operand was not reg0 it had to match the base register operand. The post-RA scheduler's antidependency breaker did not understand that constraint and sometimes changed one without the other. This also fixes Radar 7495976 and should help the verifier work better for ARM code. There are now new ld/st instructions explicit writeback operands and explicit constraints that tie those registers together. llvm-svn: 98409	2010-03-13 01:08:20 +00:00
Chris Lattner	b8a7427636	fix a bunch of partially ambiguous patterns on ARM. As an example, this: (set DPR:$dst, (fsub (fneg (fmul DPR:$a, DPR:$b)), DPR:$dstin)) is ambiguous because DPR contains both f64 and v2f32. tblgen currently accidentally picks f64 because it's first in the regclass. llvm-svn: 97955	2010-03-08 18:51:21 +00:00
Dan Gohman	8c5d683aa9	The mayHaveSideEffects flag is no longer used. llvm-svn: 97348	2010-02-27 23:47:46 +00:00
Johnny Chen	2588efd071	Added VCVT (between floating-point and fixed-point, VFP) for disassembly. A8.6.297 llvm-svn: 95885	2010-02-11 18:17:16 +00:00
Johnny Chen	b618f66c5f	Added VMRS/VMSR for disassembly only. A8.6.335 & A8.6.336 llvm-svn: 95703	2010-02-09 22:35:38 +00:00
Johnny Chen	64e0ae8dd4	Added vcvtb/vcvtt (between half-precision and single-precision, VFP). For disassembly only. A8.6.300 llvm-svn: 95669	2010-02-09 17:21:56 +00:00
Johnny Chen	9e60686a83	Add VCVTR (between floating-point and integer, VFP) for disassembly purpose. The 'R' suffix means the to-integer operations use the rounding mode specified by the FPSCR, encoded as Inst{7} = 0. A8.6.295 llvm-svn: 95584	2010-02-08 22:02:41 +00:00
Johnny Chen	beb1238a85	Add VCMP (VFP floating-point compare without 'E' bit set) for disassembly purpose. llvm-svn: 95560	2010-02-08 19:41:48 +00:00
Johnny Chen	c7e606f132	Added VMOVRRS/VMOVSRR to ARMInstrVFP.td for disassembly purpose. A8.6.331 VMOV (between two ARM core registers and two single-precision registers) llvm-svn: 95548	2010-02-08 17:26:09 +00:00
Johnny Chen	a778db9a91	VMOVRRD and VMOVDRR both have Inst{7-6} = 0b00. llvm-svn: 95397	2010-02-05 18:04:58 +00:00
Johnny Chen	34a6afc68d	Modified encoding bits specification for VFP instructions. In particular, the D bit (Inst{22}) and the M bit (Inst{5}) should be left unspecified. For binary format instructions, Inst{6} and Inst{4} need to specified for proper decodings. llvm-svn: 94855	2010-01-29 23:21:10 +00:00
Evan Cheng	ece825dc4f	Data type suffix must come after predicate. llvm-svn: 89723	2009-11-24 01:05:23 +00:00
Jim Grosbach	dbb4140f37	move fconst[sd] to UAL. <rdar://7414913> llvm-svn: 89700	2009-11-23 21:08:25 +00:00
Evan Cheng	bdb43a9d99	Remat VLDRD from constpool. Clean up some instruction property specifications. llvm-svn: 89478	2009-11-20 19:57:15 +00:00
Jim Grosbach	969910b3e8	use lower case for readability llvm-svn: 87054	2009-11-13 01:17:22 +00:00
Evan Cheng	e6548f4106	Add a comment. llvm-svn: 86706	2009-11-10 19:44:56 +00:00
Jim Grosbach	ad95414c26	Work around assembler not recognizing #0.0 form immediate for vmcp llvm-svn: 86548	2009-11-09 15:27:51 +00:00
Jim Grosbach	d7cf55cd0e	Use Unified Assembly Syntax for the ARM backend. llvm-svn: 86494	2009-11-09 00:11:35 +00:00
Evan Cheng	6203c6868f	fconsts and fconstd are obviously re-materializable. llvm-svn: 85410	2009-10-28 18:19:56 +00:00
Evan Cheng	4a609f3cef	Use fconsts and fconstd to materialize small fp constants. llvm-svn: 85362	2009-10-28 01:44:26 +00:00
Evan Cheng	538984c1c3	Now VFP instructions. llvm-svn: 85186	2009-10-27 00:20:49 +00:00
Evan Cheng	1b2b64f618	Add hasExtraSrcRegAllocReq and hasExtraDefRegAllocReq flags to ld / st multiple, ld / st pairs, etc. llvm-svn: 83197	2009-10-01 08:22:27 +00:00
Evan Cheng	3bbc6c3ae6	Change ld/st multiples to explicitly model the writeback to base register. This fixes most of the -ldstopti-before-sched2 regressions. llvm-svn: 83191	2009-10-01 01:33:39 +00:00
David Goodwin	bea6848f9d	Finish scheduling itineraries for NEON. llvm-svn: 82788	2009-09-25 18:38:29 +00:00
David Goodwin	5090273367	Add Cortex-A8 VFP model. llvm-svn: 82483	2009-09-21 20:52:17 +00:00
David Goodwin	85b5b027f7	Use NEON for single-precision int<->FP conversions. llvm-svn: 78604	2009-08-10 22:17:39 +00:00
David Goodwin	b062c236c5	Add parameter to pattern classes to enable an itinerary to be specified for instructions. For now just use the existing itineraries or NoItinerary. llvm-svn: 78321	2009-08-06 16:52:47 +00:00
David Goodwin	30bf625ac2	Add NEON single-precision FP support for fabs and fneg. llvm-svn: 78101	2009-08-04 20:39:05 +00:00
David Goodwin	a3839bc6c0	Match common pattern for FNMAC. Add NEON SP support. llvm-svn: 78085	2009-08-04 18:44:29 +00:00
David Goodwin	3b9c52c5c1	Initial support for single-precision FP using NEON. Added "neonfp" attribute to enable. Added patterns for some binary FP operations. llvm-svn: 78081	2009-08-04 17:53:06 +00:00
Evan Cheng	d214b72962	Model fpscr to prevent fcmped / fcmpezs etc from being deleted. llvm-svn: 76390	2009-07-20 02:12:31 +00:00
David Goodwin	81cdd21dcb	Predicate VFP instructions on HasVFP2 instead of IsARM. This allows VFP instructions with thumb-2. llvm-svn: 75254	2009-07-10 17:03:29 +00:00
Evan Cheng	d93b5b672f	Mark some pattern-less instructions as neverHasSideEffects. llvm-svn: 73252	2009-06-12 20:46:18 +00:00
Evan Cheng	a52c3b4b8b	Fix a 80 col. violation. llvm-svn: 60901	2008-12-11 22:02:02 +00:00

1 2 3 4

185 Commits