llvm-project

Commit Graph

Author	SHA1	Message	Date
Bob Wilson	e5863d6639	Fix a comment: We now have intrinsics for vcvtr. llvm-svn: 123246	2011-01-11 17:56:41 +00:00
Chris Lattner	2a0a3b43d7	Flag -> Glue, the ongoing saga llvm-svn: 122513	2010-12-23 18:28:41 +00:00
Evan Cheng	62c7b5bf76	Making use of VFP / NEON floating point multiply-accumulate / subtraction is difficult on current ARM implementations for a few reasons. 1. Even though a single vmla has latency that is one cycle shorter than a pair of vmul + vadd, a RAW hazard during the first (4? on Cortex-a8) can cause additional pipeline stall. So it's frequently better to single codegen vmul + vadd. 2. A vmla folowed by a vmul, vmadd, or vsub causes the second fp instruction to stall for 4 cycles. We need to schedule them apart. 3. A vmla followed vmla is a special case. Obvious issuing back to back RAW vmla + vmla is very bad. But this isn't ideal either: vmul vadd vmla Instead, we want to expand the second vmla: vmla vmul vadd Even with the 4 cycle vmul stall, the second sequence is still 2 cycles faster. Up to now, isel simply avoid codegen'ing fp vmla / vmls. This works well enough but it isn't the optimial solution. This patch attempts to make it possible to use vmla / vmls in cases where it is profitable. A. Add missing isel predicates which cause vmla to be codegen'ed. B. Make sure the fmul in (fadd (fmul)) has a single use. We don't want to compute a fmul and a fmla. C. Add additional isel checks for vmla, avoid cases where vmla is feeding into fp instructions (except for the #3 exceptional case). D. Add ARM hazard recognizer to model the vmla / vmls hazards. E. Add a special pre-regalloc case to expand vmla / vmls when it's likely the vmla / vmls will trigger one of the special hazards. Work in progress, only A+B are enabled. llvm-svn: 120960	2010-12-05 22:04:16 +00:00
Bill Wendling	9898ac97fd	Proper encoding for VLDM and VSTM instructions. The register lists for these instructions have to distinguish between lists of single- and double-precision registers in order for the ASM matcher to do a proper job. In all other respects, a list of single- or double-precision registers are the same as a list of GPR registers. llvm-svn: 119460	2010-11-17 04:32:08 +00:00
Bill Wendling	02089a39a0	vldm and vstm are mnemonics for vldmia and vstmia resp. llvm-svn: 119321	2010-11-16 02:00:24 +00:00
Bill Wendling	a68e3a5397	Encode the multi-load/store instructions with their respective modes ('ia', 'db', 'ib', 'da') instead of having that mode as a separate field in the instruction. It's more convenient for the asm parser and much more readable for humans. <rdar://problem/8654088> llvm-svn: 119310	2010-11-16 01:16:36 +00:00
Bill Wendling	705ec77ab5	Add uses of the *_ldst_multi multiclasses. These aren't used yet. llvm-svn: 118999	2010-11-13 10:57:02 +00:00
Bill Wendling	c4c642832d	Convert the modes to lower case. llvm-svn: 118998	2010-11-13 10:43:34 +00:00
Bill Wendling	e69afc6bb7	Add *_ldst_mult multiclasses to the ARM back-end. These will be used in the future to separate out the ia, ib, da, db variants of the load/store multiple instructions. llvm-svn: 118995	2010-11-13 09:09:38 +00:00
Evan Cheng	2d59ee34f1	Add some missing isel predicates on def : pat patterns to avoid generating VFP vmla / vmls (they cause stalls). Disabling them in isel is properly not a right solution, I'll look into a proper solution next. llvm-svn: 118922	2010-11-12 20:32:20 +00:00
Bill Wendling	a91d02bc61	Add "write back" bit encoding. llvm-svn: 118446	2010-11-08 21:28:03 +00:00
Bill Wendling	c002463ac4	Add encoding for VSTR. llvm-svn: 118220	2010-11-04 00:59:42 +00:00
Bill Wendling	e84eb99cbb	The MC code couldn't handle ARM LDR instructions with negative offsets: vldr.64 d1, [r0, #-32] The problem was with how the addressing mode 5 encodes the offsets. This change makes sure that the way offsets are handled in addressing mode 5 is consistent throughout the MC code. It involves re-refactoring the "getAddrModeImmOpValue" method into an "Imm12" and "addressing mode 5" version. But not to worry! The majority of the duplicated code has been unified. llvm-svn: 118144	2010-11-03 01:49:29 +00:00
Jim Grosbach	c6af2b4066	Break ARM addrmode4 (load/store multiple base address) into its constituent parts. Represent the operation mode as an optional operand instead. rdar://8614429 llvm-svn: 118137	2010-11-03 01:01:43 +00:00
Bill Wendling	603bd8f54c	Rename getAddrModeImm12OpValue to getAddrModeImmOpValue and expand it to work with immediates up to 16-bits in size. The same logic is applied to other LDR encodings, e.g. VLDR, but which use a different immediate bit width (8-bits in VLDR's case). Removing the "12" allows it to be more generic. llvm-svn: 118094	2010-11-02 22:31:46 +00:00
Bill Wendling	3f37ade36e	Missed reverting this bit. llvm-svn: 117971	2010-11-01 23:17:54 +00:00
Bill Wendling	f7e176a3ec	Minor cleanup. llvm-svn: 117969	2010-11-01 23:11:22 +00:00
Bill Wendling	418bd53008	Move the machine operand MC encoding patterns to the parent classes. llvm-svn: 117956	2010-11-01 21:17:06 +00:00
Bill Wendling	2623343625	Move instruction encoding bits into the parent class and remove the temporary *_Encode classes. These instructions are the only ones which use those classes, so a subclass isn't necessary. llvm-svn: 117906	2010-11-01 06:00:39 +00:00
Chris Lattner	33fc3e095b	reapply r117858 with apparent editor malfunction fixed (somehow I got a dulicated line). llvm-svn: 117860	2010-10-31 19:10:56 +00:00
Chris Lattner	e59eef3dd1	revert r117858 while I check out a failure I missed. llvm-svn: 117859	2010-10-31 19:05:32 +00:00
Chris Lattner	9293008e90	the asm matcher can't handle operands with modifiers (like ${foo:bar}). Instead of silently ignoring these instructions, emit a hard error and force the target author to either refactor the target or mark the instruction 'isCodeGenOnly'. Mark a few instructions in ARM and MBlaze as isCodeGenOnly the are doing this. llvm-svn: 117858	2010-10-31 18:48:12 +00:00
Jim Grosbach	bbe2bbd7f7	Add FIXME. llvm-svn: 117787	2010-10-30 14:54:23 +00:00
Bill Wendling	a65f914bb0	Add encoding for moving a value between two ARM core registers and a doublework extension register. llvm-svn: 116970	2010-10-20 23:37:40 +00:00
Bill Wendling	058190507b	Add encodings for movement between ARM core registers and single-precision registers. llvm-svn: 116961	2010-10-20 22:44:54 +00:00
Bill Wendling	399add01d4	Reformatting. No functionalogicality changes. llvm-svn: 116625	2010-10-15 21:50:45 +00:00
Bill Wendling	6f52f8a87d	Add support for vmov.f64/.f32 encoding. There's a bit of a hack going on here. The f32 in FCONSTS is handled as a double instead of a float in the code. So the encoding of the immediate into the instruction isn't exactly in line with the documentation in that regard. But given that we know it's handled as a double, it doesn't cause any harm. llvm-svn: 116471	2010-10-14 02:33:26 +00:00
Bill Wendling	0441c6cba0	Add encoding for 'fmstat'. llvm-svn: 116466	2010-10-14 01:19:34 +00:00
Bill Wendling	0825f3e441	- Add encodings for multiply add/subtract instructions in all their glory. - Add missing patterns for some multiply add/subtract instructions. - Add encodings for VMRS and VMSR. llvm-svn: 116464	2010-10-14 01:02:08 +00:00
Bill Wendling	f106ecfa59	Add MC encodings for VCVT* instrunctions. llvm-svn: 116431	2010-10-13 20:58:46 +00:00
Bill Wendling	6e27b4f530	Add encodings for VNEG and VSQRT. Also add encodings for VMOV, but not a test just yet. llvm-svn: 116386	2010-10-13 01:17:33 +00:00
Bill Wendling	576fd0b110	Add encodings for VCVT instructions. llvm-svn: 116385	2010-10-13 00:56:35 +00:00
Bill Wendling	da4ddf0fcf	Add VCMPZ and VABS. llvm-svn: 116383	2010-10-13 00:38:07 +00:00
Bill Wendling	f9ca535495	Refactor VCMP instructions. llvm-svn: 116379	2010-10-13 00:04:29 +00:00
Bill Wendling	7dd8c0b991	Add encodings for VNMUL[SD]. llvm-svn: 116375	2010-10-12 23:47:37 +00:00
Bill Wendling	a06aee826c	Add encodings for VDIV and VMUL. llvm-svn: 116370	2010-10-12 23:22:27 +00:00
Bill Wendling	42200bcaea	Refactor some of the encoding logic into a base class. This keeps us from having to add 10+ lines to every instruction. It may turn out that we can move this base class into it's parent class. llvm-svn: 116362	2010-10-12 23:06:54 +00:00
Bill Wendling	646a506724	Add encoding for VSUB and VCMP. Fear not! I'm going to try a refactoring right now. :) llvm-svn: 116359	2010-10-12 22:55:35 +00:00
Bill Wendling	ac6cd00706	Encoding for VADDD. Plus a test for the VFP instructions. llvm-svn: 116348	2010-10-12 22:08:41 +00:00
Jim Grosbach	576640f0e3	Encoding for ARM-mode VADD.F32 instruction. llvm-svn: 116338	2010-10-12 21:22:40 +00:00
Evan Cheng	1958cefd69	Model operand cycles of vldm / vstm; also fixes scheduling itineraries of vldr / vstr, etc. llvm-svn: 115898	2010-10-07 01:50:48 +00:00
Eric Christopher	e68635acdb	Fix typo. llvm-svn: 114931	2010-09-28 00:35:33 +00:00
Jim Grosbach	abcbe2474d	VFP/NEON load/store multiple instructions are addrmode4, not 5. llvm-svn: 113322	2010-09-08 00:25:50 +00:00
Bob Wilson	13ce07fa92	Change ARM VFP VLDM/VSTM instructions to use addressing mode #4 , just like all the other LDM/STM instructions. This fixes asm printer crashes when compiling with -O0. I've changed one of the NEON tests (vst3.ll) to run with -O0 to check this in the future. Prior to this change VLDM/VSTM used addressing mode #5, but not really. The offset field was used to hold a count of the number of registers being loaded or stored, and the AM5 opcode field was expanded to specify the IA or DB mode, instead of the standard ADD/SUB specifier. Much of the backend was not aware of these special cases. The crashes occured when rewriting a frameindex caused the AM5 offset field to be changed so that it did not have a valid submode. I don't know exactly what changed to expose this now. Maybe we've never done much with -O0 and NEON. Regardless, there's no longer any reason to keep a count of the VLDM/VSTM registers, so we can use addressing mode #4 and clean things up in a lot of places. llvm-svn: 112322	2010-08-27 23:18:17 +00:00
Daniel Dunbar	07cc87438f	ARM: Mark some disassembler only instructions as not available for matching -- for some reason they have a very odd MCInst form where the operands overlap, but I haven't dug in to find out why yet. llvm-svn: 110781	2010-08-11 04:46:13 +00:00
Nate Begeman	b69b182191	Add support for getting & setting the FPSCR application register on ARM when VFP is enabled. Add support for using the FPSCR in conjunction with the vcvtr instruction, for controlling fp to int rounding. Add support for the FLT_ROUNDS_ node now that the FPSCR is exposed. llvm-svn: 110152	2010-08-03 21:31:55 +00:00
Jim Grosbach	84511e1526	Clean up 80 column violations. No functional change. llvm-svn: 105350	2010-06-02 21:53:11 +00:00
Evan Cheng	dd7f566597	Mark pattern-less mayLoad / mayStore instructions neverHasSideEffects. These do not have other un-modeled side effects. llvm-svn: 104111	2010-05-19 06:07:03 +00:00
Evan Cheng	79efd71962	Mark some pattern-less instructions as neverHasSideEffects. llvm-svn: 103683	2010-05-13 00:16:46 +00:00
Anton Korobeynikov	2063705d91	Define new itin classes for ARM <-> VFP reg moves to distinguish from NEON ops. Define proper scheduling itinerary for them on A9. A8 TRM does not specify latency for them at all :( llvm-svn: 100650	2010-04-07 18:20:02 +00:00

1 2 3

130 Commits