llvm-project

Commit Graph

Author	SHA1	Message	Date
Owen Anderson	0747307049	Add support for code generation of the one register with immediate form of vorr. We could be more aggressive about making this work for a larger range of constants, but this seems like a good start. llvm-svn: 118201	2010-11-03 22:44:51 +00:00
Owen Anderson	bb81f80af6	Unlike a lot of NEON instructions, vext isn't _actually_ parameterized by element size. Instead, all of the different element sizes are pseudo instructions that map down to vext.8 underneath, with the immediate shifted left to reflect the increased element size. llvm-svn: 118183	2010-11-03 18:16:27 +00:00
Bob Wilson	7d0ac84abd	Add codegen patterns for VST1-lane instructions. Radar 8599955. llvm-svn: 118176	2010-11-03 16:24:53 +00:00
Jim Grosbach	c6af2b4066	Break ARM addrmode4 (load/store multiple base address) into its constituent parts. Represent the operation mode as an optional operand instead. rdar://8614429 llvm-svn: 118137	2010-11-03 01:01:43 +00:00
Owen Anderson	0ebd1fd594	Revert r118097 to fix buildbots. llvm-svn: 118121	2010-11-02 23:47:29 +00:00
Owen Anderson	7c30390277	Since these fields are not exactly equivalent to the encoded field, rename them to something with semantic meaning. llvm-svn: 118097	2010-11-02 22:41:42 +00:00
Owen Anderson	dec87e10fd	Provide correct encodings for the remaining vst variants that we currently generate. llvm-svn: 118087	2010-11-02 22:18:18 +00:00
Owen Anderson	adf88d4c5f	Tentative encodings for the "single element from one lane" variant of vst1. llvm-svn: 118084	2010-11-02 21:54:45 +00:00
Owen Anderson	b95618cfe0	Add correct encodings for basic variants for vst3 and vst4. llvm-svn: 118082	2010-11-02 21:47:03 +00:00
Bob Wilson	d80b29d6f7	Add NEON VST1-lane instructions. Partial fix for Radar 8599955. llvm-svn: 118069	2010-11-02 21:18:25 +00:00
Owen Anderson	fa08e1e277	Add correct encodings for the basic variants for vst2. llvm-svn: 118068	2010-11-02 21:16:58 +00:00
Owen Anderson	87c62e54e6	Add correct encodings for the basic form of vst1. llvm-svn: 118067	2010-11-02 21:06:06 +00:00
Owen Anderson	9f20daf3b4	Factor out a common encoding class for loads and stores with a lane parameter. llvm-svn: 118055	2010-11-02 20:47:39 +00:00
Owen Anderson	a83859539f	Add correct encodings for the rest of the vld instructions that we generate. llvm-svn: 118053	2010-11-02 20:40:59 +00:00
Owen Anderson	526ffd57d2	Add correct NEON encodings for vld2, vld3, and vld4 basic variants. llvm-svn: 117997	2010-11-02 01:24:55 +00:00
Owen Anderson	b3ca2060c0	Attempt to provide correct encodings for a number of other vld1 variants, which we can't test since we can neither generate nor parse them at the moment. llvm-svn: 117988	2010-11-02 00:24:52 +00:00
Owen Anderson	ad40234eff	Add correct NEON encodings for the "multiple single elements" form of vld. llvm-svn: 117984	2010-11-02 00:05:05 +00:00
Bob Wilson	dc44990c7d	Add NEON VLD1-lane instructions. Partial fix for Radar 8599955. llvm-svn: 117964	2010-11-01 22:04:05 +00:00
Owen Anderson	2ef668840a	Add correct NEON encodings for vtbl and vtbx. llvm-svn: 117513	2010-10-28 00:18:46 +00:00
Owen Anderson	14be930317	Add correct NEON encodings for vext, vtrn, vuzp, and vzip. llvm-svn: 117512	2010-10-27 23:56:39 +00:00
Owen Anderson	fadb951e5b	Provide correct encodings for NEON vcvt, which has its own special immediate encoding for specifying fractional bits for fixed point conversions. llvm-svn: 117501	2010-10-27 22:49:00 +00:00
Owen Anderson	ed9652f959	Provide correct encodings for the get_lane and set_lane variants of vmov. llvm-svn: 117495	2010-10-27 21:28:09 +00:00
Owen Anderson	40d24a4abf	Provide correct NEON encodings for vdup. llvm-svn: 117475	2010-10-27 19:25:54 +00:00
Owen Anderson	8576a42cf3	Add correct NEON encodings for vsli and vsri. llvm-svn: 117459	2010-10-27 17:40:08 +00:00
Owen Anderson	d7e8135e1e	Add correct NEON encodings for vsra and vrsra. llvm-svn: 117458	2010-10-27 17:29:29 +00:00
Owen Anderson	825b2d1946	Add correct NEON encodings for vqshl, vqshrn, vqshrun, vqrshl, vqshrn, and vqrshrun. llvm-svn: 117411	2010-10-26 22:50:46 +00:00
Owen Anderson	2888e2c7f9	Correct NEON encodings for vshrn, vrshl, vrshr, vrshrn. llvm-svn: 117402	2010-10-26 21:58:41 +00:00
Owen Anderson	e18579976f	Simplify classes for shift instructions, which are never commutable. llvm-svn: 117398	2010-10-26 21:13:59 +00:00
Owen Anderson	3665fee8de	Provide correct NEON encodings for vshl, register and immediate forms. llvm-svn: 117394	2010-10-26 20:56:57 +00:00
Owen Anderson	691ce68d3c	Add correct NEON encoding for vpadal. llvm-svn: 117380	2010-10-26 18:18:03 +00:00
Owen Anderson	284cb361d1	Add NEON encodings for vmov and vmvn of immediates. llvm-svn: 117374	2010-10-26 17:40:54 +00:00
Owen Anderson	1f6aad053d	Add correct encodings for NEON vabal. llvm-svn: 117315	2010-10-25 21:29:04 +00:00
Owen Anderson	b9c91679aa	Add correct NEON encodings for vaba. llvm-svn: 117309	2010-10-25 20:52:57 +00:00
Owen Anderson	dd001b89d7	Attempt to provide correct encodings for NEON vbit and vbif, even though we can't test them at the moment. llvm-svn: 117294	2010-10-25 20:17:22 +00:00
Owen Anderson	dea09c7564	Provide correct NEON encodings for vbsl. llvm-svn: 117293	2010-10-25 20:13:13 +00:00
Owen Anderson	2477446ee5	Add correct instruction encodings for vbic, vorn, and vmvn. llvm-svn: 117282	2010-10-25 18:43:52 +00:00
Owen Anderson	feb3ee0c93	Add NEON encoding tests for vcgt and vacgt. llvm-svn: 117276	2010-10-25 18:03:59 +00:00
Owen Anderson	e5d0677173	Add tests for NEON encodings of vcge and vacge. llvm-svn: 117274	2010-10-25 17:49:32 +00:00
Owen Anderson	c178b80f65	Add a warning about our inability to test the encoding of vceq with immediate zero. llvm-svn: 117273	2010-10-25 17:33:02 +00:00
Owen Anderson	9d0122af7d	Add correct NEON encodings for vqdmlal. llvm-svn: 117134	2010-10-22 19:35:48 +00:00
Owen Anderson	3d0264667f	Provide correct encodings for NEON vmlal. llvm-svn: 117131	2010-10-22 19:05:25 +00:00
Owen Anderson	f48719f1b5	Provide correct NEON encodings for vmla. llvm-svn: 117126	2010-10-22 18:54:37 +00:00
Owen Anderson	9e44cf2bb2	ARM encodes Q registers as 2xregno (i.e. the number of the D register that corresponds to the lower half of the Q register), rather than with just regno. This allows us to unify the encodings for a lot of different NEON instrucitons that differ only in whether they have Q or D register operands. llvm-svn: 117056	2010-10-21 20:21:49 +00:00
Owen Anderson	6b7e401049	Add correct NEON encodings for vhadd and vrhadd. llvm-svn: 117047	2010-10-21 18:55:04 +00:00
Owen Anderson	9561084188	Add correct encodings for NEON vaddw.s* and vaddw.u*. llvm-svn: 117040	2010-10-21 18:20:25 +00:00
Owen Anderson	15c97706e8	Provide correct NEON encodings for vaddl.u* and vaddl.s*. llvm-svn: 117039	2010-10-21 18:09:17 +00:00
Owen Anderson	6083502848	Implement correct encodings for NEON vadd, both integer and floating point. llvm-svn: 116981	2010-10-21 00:48:00 +00:00
Jim Grosbach	340cd5174b	A few 80 column fixes. llvm-svn: 116451	2010-10-13 23:34:31 +00:00
Evan Cheng	e790afcbe1	More ARM scheduling itinerary fixes. llvm-svn: 116266	2010-10-11 23:41:41 +00:00
Evan Cheng	94ad008beb	Proper VST scheduling itineraries. llvm-svn: 116251	2010-10-11 22:03:18 +00:00
Evan Cheng	d7a404d85f	Add VLD4 scheduling itineraries. llvm-svn: 116143	2010-10-09 04:07:58 +00:00
Evan Cheng	a762400bed	Finish vld3 and vld4. llvm-svn: 116140	2010-10-09 01:45:34 +00:00
Evan Cheng	05f13e94bf	Correct some load / store instruction itinerary mistakes: 1. Cortex-A8 load / store multiplies can only issue on ALU0. 2. Eliminate A8_Issue, A8_LSPipe will correctly limit the load / store issues. 3. Correctly model all vld1 and vld2 variants. llvm-svn: 116134	2010-10-09 01:03:04 +00:00
Evan Cheng	1958cefd69	Model operand cycles of vldm / vstm; also fixes scheduling itineraries of vldr / vstr, etc. llvm-svn: 115898	2010-10-07 01:50:48 +00:00
Jim Grosbach	2e3e2a006b	Change the NEON VDUPfdf and VDUPfqf pseudo-instructions to actually be pseudo instructions. llvm-svn: 115840	2010-10-06 21:16:16 +00:00
Jim Grosbach	233b3a2f95	Add a 'pattern' arg to the ARM PseudoNeonI class. llvm-svn: 115831	2010-10-06 20:36:55 +00:00
Jim Grosbach	fae8305e2b	Nuke the rest of the :comment references llvm-svn: 115373	2010-10-01 23:21:38 +00:00
Evan Cheng	1969887fc6	Fix scheduling infor for vmovn and vshrn which I broke accidentially. llvm-svn: 115354	2010-10-01 21:48:06 +00:00
Evan Cheng	2a5d764858	NEON scheduling info fix. vmov reg, reg are single cycle instructions. llvm-svn: 115344	2010-10-01 20:50:58 +00:00
Bob Wilson	6b853c3ce3	Change VLDMQ and VSTMQ to be pseudo instructions. They are expanded after register allocation to VLDMD and VSTMD respectively. This avoids using the dregpair operand modifier. llvm-svn: 114047	2010-09-16 00:31:02 +00:00
Bob Wilson	b1e9d4bff1	Use VLD1/VST1 pseudo instructions for loadRegFromStackSlot and storeRegToStackSlot. llvm-svn: 113918	2010-09-15 01:48:05 +00:00
Jim Grosbach	c7cf42d80b	Reapply r113875 with additional cleanups. "The register specified for a dregpair is the corresponding Q register, so to get the pair, we need to look up the sub-regs based on the qreg. Create a lookup function since we don't have access to TargetRegisterInfo here to be able to use getSubReg(ARM::dsub_[01])." Additionaly, fix the NEON VLD1* and VST1* instruction patterns not to use the dregpair modifier for the 2xdreg versions. Explicitly specifying the two registers as operands is more correct and more consistent with the other instruction patterns. This enables further cleanup of special case code in the disassembler as a nice side-effect. llvm-svn: 113903	2010-09-14 23:54:06 +00:00
Bob Wilson	dd29db5635	Make NEON ld/st pseudo instruction classes take the instruction itinerary as an argument, so that we can distinguish instructions with the same register classes but different numbers of registers (e.g., vld3 and vld4). Fix some of the non-pseudo NEON ld/st instruction itineraries to reflect the number of registers loaded or stored, not just the opcode name. llvm-svn: 113854	2010-09-14 20:59:49 +00:00
Bob Wilson	c597fd3b4a	Convert some VTBL and VTBX instructions to use pseudo instructions prior to register allocation. Remove the NEONPreAllocPass, which is no longer needed. Yeah!! llvm-svn: 113818	2010-09-13 23:55:10 +00:00
Bob Wilson	d5c57a5ed4	Switch all the NEON vld-lane and vst-lane instructions over to the new pseudo-instruction approach. Change ARMExpandPseudoInsts to use a table to record all the NEON load/store information. llvm-svn: 113812	2010-09-13 23:01:35 +00:00
Bob Wilson	4adbaf1843	Fix NEON VLD pseudo instruction itineraries that were incorrectly copied from the VST pseudos. The VLD/VST scheduling still needs work (see pr6722), but at least we shouldn't confuse the loads with the stores. llvm-svn: 113473	2010-09-09 05:40:26 +00:00
Jim Grosbach	abcbe2474d	VFP/NEON load/store multiple instructions are addrmode4, not 5. llvm-svn: 113322	2010-09-08 00:25:50 +00:00
Bob Wilson	35fafca587	Finish converting the rest of the NEON VLD instructions to use pseudo- instructions prior to regalloc. Since it's getting a little close to the 2.8 branch deadline, I'll have to leave the rest of the instructions handled by the NEONPreAllocPass for now, but I didn't want to leave half of the VLD instructions converted and the other half not. llvm-svn: 112983	2010-09-03 18:16:02 +00:00
Bob Wilson	f65c9ef720	Replace NEON vabdl, vaba, and vabal intrinsics with combinations of the vabd intrinsic and add and/or zext operations. In the case of vaba, this also avoids the need for a DAG combine pattern to combine vabd with add. Update tests. Auto-upgrade the old intrinsics. llvm-svn: 112941	2010-09-03 01:35:08 +00:00
Bob Wilson	75a6408f88	Convert VLD1 and VLD2 instructions to use pseudo-instructions until after regalloc. llvm-svn: 112825	2010-09-02 16:00:54 +00:00
Bob Wilson	38ab35a911	Remove NEON vmull, vmlal, and vmlsl intrinsics, replacing them with multiply, add, and subtract operations with zero-extended or sign-extended vectors. Update tests. Add auto-upgrade support for the old intrinsics. llvm-svn: 112773	2010-09-01 23:50:19 +00:00
Bob Wilson	4cd8a126c3	Remove NEON vmovn intrinsic, replacing it with vector truncate operations. Auto-upgrade the old intrinsic and update tests. llvm-svn: 112507	2010-08-30 20:02:30 +00:00
Bob Wilson	d0c054886c	Remove NEON vaddl, vaddw, vsubl, and vsubw intrinsics. Instead, use llvm IR add/sub operations with one or both operands sign- or zero-extended. Auto-upgrade the old intrinsics. llvm-svn: 112416	2010-08-29 05:57:34 +00:00
Bob Wilson	950882be07	Use pseudo instructions for VST1 and VST2. llvm-svn: 112357	2010-08-28 05:12:57 +00:00
Bob Wilson	8ee9394750	We don't need to custom-select VLDMQ and VSTMQ anymore. llvm-svn: 112336	2010-08-28 00:20:11 +00:00
Bob Wilson	13ce07fa92	Change ARM VFP VLDM/VSTM instructions to use addressing mode #4 , just like all the other LDM/STM instructions. This fixes asm printer crashes when compiling with -O0. I've changed one of the NEON tests (vst3.ll) to run with -O0 to check this in the future. Prior to this change VLDM/VSTM used addressing mode #5, but not really. The offset field was used to hold a count of the number of registers being loaded or stored, and the AM5 opcode field was expanded to specify the IA or DB mode, instead of the standard ADD/SUB specifier. Much of the backend was not aware of these special cases. The crashes occured when rewriting a frameindex caused the AM5 offset field to be changed so that it did not have a valid submode. I don't know exactly what changed to expose this now. Maybe we've never done much with -O0 and NEON. Regardless, there's no longer any reason to keep a count of the VLDM/VSTM registers, so we can use addressing mode #4 and clean things up in a lot of places. llvm-svn: 112322	2010-08-27 23:18:17 +00:00
Bob Wilson	97919e9c59	Use pseudo instructions for VST3. llvm-svn: 112208	2010-08-26 18:51:29 +00:00
Bob Wilson	4cec44975e	Use pseudo instructions for VST1d64Q. llvm-svn: 112170	2010-08-26 05:33:30 +00:00
Bob Wilson	9392b0e960	Start converting NEON load/stores to use pseudo instructions, beginning here with the VST4 instructions. Until after register allocation, we want to represent sets of adjacent registers by a single super-register. These VST4 pseudo instructions have a single QQ or QQQQ source register operand. They get expanded to the real VST4 instructions with 4 separate D register operands. Once this conversion is complete, we'll be able to remove the NEONPreAllocPass and avoid some fragile and hacky code elsewhere. llvm-svn: 112108	2010-08-25 23:27:42 +00:00
Bob Wilson	9a511c07e4	Replace the arm.neon.vmovls and vmovlu intrinsics with vector sign-extend and zero-extend operations. llvm-svn: 111614	2010-08-20 04:54:02 +00:00
Daniel Dunbar	727be43a3d	Silence some -Asserts uninitialized variable warnings. llvm-svn: 109956	2010-07-31 21:08:54 +00:00
Bob Wilson	bad47f62f6	Add support for NEON VMVN immediate instructions. llvm-svn: 108324	2010-07-14 06:31:50 +00:00
Bob Wilson	bd54a53628	The bits in the cmode field of 32-bit VMOV immediate instructions all depend of the value of the immediate. llvm-svn: 108323	2010-07-14 06:30:44 +00:00
Bob Wilson	a3f1901531	Use a target-specific VMOVIMM DAG node instead of BUILD_VECTOR to represent NEON VMOV-immediate instructions. This simplifies some things. llvm-svn: 108275	2010-07-13 21:16:48 +00:00
Bob Wilson	5bc8a79e7f	Also use REG_SEQUENCE for VTBX instructions. llvm-svn: 107743	2010-07-07 00:08:54 +00:00
Bob Wilson	3ed511bc6b	Use REG_SEQUENCE nodes to make the table registers for VTBL instructions be allocated to consecutive registers. llvm-svn: 107730	2010-07-06 23:36:25 +00:00
Bob Wilson	574f68f815	Fix indentation. llvm-svn: 106881	2010-06-25 20:54:44 +00:00
Bob Wilson	6d12973143	Remove a fixme comment that is no longer relevant. llvm-svn: 106382	2010-06-19 05:32:41 +00:00
Bob Wilson	f3f7a770b7	Add basic support for NEON modified immediates besides VMOV. llvm-svn: 106030	2010-06-15 19:05:35 +00:00
Bob Wilson	5b2b504038	Rename functions referring to VMOV immediates to refer to NEON "modified immediate" operands. These functions have so far only been used for VMOV but they also apply to other NEON instructions with modified immediate operands. No functional changes. llvm-svn: 105969	2010-06-14 22:19:57 +00:00
Bob Wilson	6eae520de9	Add instruction encoding for the Neon VMOV immediate instruction. This changes the machine instruction representation of the immediate value to be encoded into an integer with similar fields as the actual VMOV instruction. This makes things easier for the disassembler, since it can just stuff the bits into the immediate operand, but harder for the asm printer since it has to decode the value to be printed. Testcase for the encoding will follow later when MC has more support for ARM. llvm-svn: 105836	2010-06-11 21:34:50 +00:00
Bob Wilson	846bd7992c	Further changes for Neon vector shuffles: - change isShuffleMaskLegal to show that all shuffles with 32-bit and 64-bit elements are legal - the Neon shuffle instructions do not support 64-bit elements, but we were not checking for that before lowering shuffles to use them - remove some 64-bit element vduplane patterns that are no longer needed llvm-svn: 105586	2010-06-07 23:53:38 +00:00
Jakob Stoklund Olesen	8d042c0269	Fix a few places that depended on the numeric value of subreg indices. Add assertions in places that depend on consecutive indices. llvm-svn: 104510	2010-05-24 17:13:28 +00:00
Jakob Stoklund Olesen	6c47d6423c	Switch ARMRegisterInfo.td to use SubRegIndex and eliminate the parallel enums from ARMRegisterInfo.h llvm-svn: 104508	2010-05-24 16:54:32 +00:00
Evan Cheng	dd7f566597	Mark pattern-less mayLoad / mayStore instructions neverHasSideEffects. These do not have other un-modeled side effects. llvm-svn: 104111	2010-05-19 06:07:03 +00:00
Evan Cheng	cd04ed3533	vmov of immediates are trivially re-materializable. llvm-svn: 103982	2010-05-17 21:54:50 +00:00
Anton Korobeynikov	497d831966	Chris said that the comment char should be escaped. Fix all the occurences of "@" in *.td llvm-svn: 103903	2010-05-16 09:15:36 +00:00
Evan Cheng	cd67c21407	Added a QQQQ register file to model 4-consecutive Q registers. llvm-svn: 103760	2010-05-14 02:13:41 +00:00
Evan Cheng	9de7cfe3f4	Bring back VLD1q and VST1q and use them for reloading / spilling Q registers. This allows folding loads and stores into VMOVQ. llvm-svn: 103692	2010-05-13 01:12:06 +00:00
Evan Cheng	79efd71962	Mark some pattern-less instructions as neverHasSideEffects. llvm-svn: 103683	2010-05-13 00:16:46 +00:00
Evan Cheng	86eb22976f	Use VLD2q32 / VST2q32 to reload / spill QQ (pair of Q) registers when stack slot is sufficiently aligned. Use VLDMD / VSTMD otherwise. llvm-svn: 103235	2010-05-07 02:04:02 +00:00
Evan Cheng	ddc93c7e04	Remove VLD1q and VST1q for reloading and spilling Q registers. Just use VLD1q64 / VST1q64 and reference sub-registers. llvm-svn: 103218	2010-05-07 00:24:52 +00:00
Evan Cheng	31cdcd46d6	Re-apply 103156 and 103157. 103156 didn't break anything. 10315 exposed a coalescer bug that's fixed by 103170. llvm-svn: 103172	2010-05-06 06:36:08 +00:00
Eric Christopher	9feb1bb117	Revert r103156 since it was breaking the build bots. Reverse-merging r103156 into '.': U lib/Target/ARM/ARMInstrNEON.td U lib/Target/ARM/ARMRegisterInfo.h U lib/Target/ARM/ARMBaseRegisterInfo.cpp U lib/Target/ARM/ARMBaseInstrInfo.cpp U lib/Target/ARM/ARMRegisterInfo.td llvm-svn: 103159	2010-05-06 02:29:06 +00:00
Evan Cheng	8f99a1c6b4	Adding pseudo 256-bit registers QQ0 . . . QQ7 to represent pairs of Q registers. These will be used to model VLD2 / VST2 instructions in order to get substantially better codegen for them. llvm-svn: 103156	2010-05-06 01:52:03 +00:00
Anton Korobeynikov	4d36f8890f	More fixes for itins llvm-svn: 100662	2010-04-07 18:21:10 +00:00
Anton Korobeynikov	ceb54d5ab0	Fix invalid itins for 32-bit varians of VMLAL and friends llvm-svn: 100661	2010-04-07 18:21:04 +00:00
Anton Korobeynikov	a248becd6c	Fix itins for VABA llvm-svn: 100657	2010-04-07 18:20:42 +00:00
Anton Korobeynikov	a3e4989ad8	Correct VMVN itinerary: operand is read in the second cycle, not in the first. llvm-svn: 100656	2010-04-07 18:20:36 +00:00
Anton Korobeynikov	140a65ce0b	More A9 itineraries llvm-svn: 100655	2010-04-07 18:20:29 +00:00
Anton Korobeynikov	1a1af5a830	Correct itinerary class for VPADD llvm-svn: 100654	2010-04-07 18:20:24 +00:00
Anton Korobeynikov	4650fd5fc6	VP{MAX, MIN} are of IIC_VSUBi4D itin class as well. llvm-svn: 100653	2010-04-07 18:20:18 +00:00
Anton Korobeynikov	7d4fad5942	VHADD differs from VHSUB at least on A9 - the former reads both operands in the second cycle, while the latter reads second operand in first cycle. Introduce new itin classes to catch this behavior. Whether this is true for A8 as well is WIP. llvm-svn: 100652	2010-04-07 18:20:13 +00:00
Johnny Chen	c86256fa5d	Add NVTBLFrm to represent A8.6.406 VTBL, VTBX Vector Table Lookup Instructions. These instructions use byte index in a control vector (M:Vm) to lookup byte values in a table and generate a new vector (D:Vd). The table is specified via a list of vectors, which can be: {Dn} {Dn D<n+1>} {Dn D<n+1> D<n+2>} {Dn D<n+1> D<n+2> D<n+3>} llvm-svn: 99789	2010-03-29 01:14:22 +00:00
Chris Lattner	3dad5fbeb9	fix integer negates to use the proper type for the zero vectors, this also depends on the new "bitconvert dropping" behavior just added to tblgen. llvm-svn: 99757	2010-03-28 08:39:10 +00:00
Chris Lattner	6c223ee0e9	fix vnot matching to explicitly specify the type of the input to be v8i8 or v16i8, which buildvectors get canonicalized to. This allows the patterns that were previously using a bare 'vnot' to match, before they couldn't. llvm-svn: 99754	2010-03-28 08:08:07 +00:00
Bob Wilson	0f8a02830a	Fix indentation. llvm-svn: 99705	2010-03-27 04:01:23 +00:00
Bob Wilson	cf603fb1c5	Add a format argument to the N3V and N3VX classes, removing the N3Vf class. llvm-svn: 99704	2010-03-27 03:56:52 +00:00
Johnny Chen	6094cdab9f	Add NVMulSLFrm to represent "3-register multiply with scalar" operations and set it as the format for the appropriate N3VSL<> classes. These instructions require special handling of the M:Vm field which encodes the restricted Dm and the lane index within Dm. Examples are A8.6.325 VMLA, VMLAL, VMLS, VMLSL (by scalar): vmlal.s32 q3, d2, d10[0] llvm-svn: 99690	2010-03-27 01:03:13 +00:00
Johnny Chen	93acfbf441	Remove the duplicate multiclass N3VSh_QHSD and use N3VInt_QHSD which is modified to now take a format argument. N3VDInt<> and N3VQInt<> are modified to take a format argument as well. llvm-svn: 99676	2010-03-26 23:49:07 +00:00
Johnny Chen	0b57de3c4c	Add NVExtFrm to represent NEON Vector Extract Instructions, that uses Inst{11-8} to encode the byte location of the extracted result in the concatenation of the operands, from the least significant end. Modify VEXTd and VEXTq classes to use the format. llvm-svn: 99659	2010-03-26 22:28:56 +00:00
Johnny Chen	2cf04957c2	Add N3RegVShFrm to represent 3-Register Vector Shift Instructions, which do not follow the N3RegFrm's operand order of D:Vd N:Vn M:Vm. The operand order of N3RegVShFrm is D:Vd M:Vm N:Vn (notice that M:Vm is the first src operand). Add a parent class N3Vf which requires passing a Format argument and which the N3V class is modified to inherit from. N3V class represents the "normal" 3-Register NEON Instructions with N3RegFrm. Also add a multiclass N3VSh_QHSD to represent clusters of NEON 3-Register Shift Instructions and replace 8 invocations with it. llvm-svn: 99655	2010-03-26 21:26:28 +00:00
Johnny Chen	5d4e917d9f	Add N2RegVShLFrm and N2RegVShRFrm formats so that the disassembler can easily dispatch to the appropriate routines to handle the different interpretations of the shift amount encoded in the imm6 field. The Vd, Vm fields are interpreted the same between the two, though. See, for example, A8.6.367 VQSHL, VQSHLU (immediate) for N2RegVShLFrm format and A8.6.368 VQSHRN, VQSHRUN for N2RegVShRFrm format. llvm-svn: 99590	2010-03-26 01:07:59 +00:00
Johnny Chen	d82f9002e4	Add NVCVTFrm (NEON Convert with fractional bits immediate) and modify N2VImm to expect a Format arg. N2VCvtD/N2VCvtQ are modified to use the NVCVTFrm format. llvm-svn: 99548	2010-03-25 20:39:04 +00:00
Johnny Chen	45ab3f3ccf	Added a new instruction class NVDupLane to be inherited by VDUPLND and VDUPLNQ, instead of the current N2V. Format of NVDupLane instances are set to NEONFrm currently. llvm-svn: 99518	2010-03-25 17:01:27 +00:00
Johnny Chen	bff23ca690	Trivial formating change. llvm-svn: 99428	2010-03-24 21:25:07 +00:00
Johnny Chen	e99953ce9c	Reverted r99326 which added NVdVmVCVTFrm, and later renamed to NVCVTFrm. NVCVTFrm will later be used to describe "vcvt with fractional bits". llvm-svn: 99415	2010-03-24 19:47:14 +00:00
Johnny Chen	da44d5977f	Reverted r99376. The disassembler will deal with the 2-reg format of these two N3VX instructions using special case code. llvm-svn: 99409	2010-03-24 18:46:34 +00:00
Johnny Chen	aa9b1c81a7	Mark VMOVDneon and VMOVQ as having the N2RegFrm form to help the disassembler. llvm-svn: 99376	2010-03-24 01:29:25 +00:00
Johnny Chen	9b1f60adec	Renamed NVdVmImmFrm and NVdVmVCVTFrm to the more proper N2RegFrm and NVCVTFrm, respectively, and add some more comment. llvm-svn: 99373	2010-03-24 00:57:50 +00:00
Johnny Chen	5be6d5a6a9	Add comment. llvm-svn: 99327	2010-03-23 21:30:12 +00:00
Johnny Chen	5dbf39285d	Add New NEON Format NVdVmVCVTFrm. Converted some of the NEON vcvt instructions to this format. llvm-svn: 99326	2010-03-23 21:25:38 +00:00
Bob Wilson	59f75bba24	Fix VLDMQ and VSTMQ instructions to use the correct encoding and address modes. These instructions are only needed for codegen, so I've removed all the explicit encoding bits for now; they should be set in the same way as the for VLDMD and VSTMD whenever we add encodings for VFP. The use of addrmode5 requires that the instructions be custom-selected so that the number of registers can be set in the AM5Opc value. llvm-svn: 99309	2010-03-23 18:54:46 +00:00
Bob Wilson	9b680e21c0	Rename some instructions to match the corresponding NEON opcode. llvm-svn: 99266	2010-03-23 06:26:18 +00:00
Bob Wilson	cc0a2a75a0	Change VST1 instructions for loading Q register values to operate on pairs of D registers. Add a separate VST1q instruction with a Q register source operand for use by storeRegToStackSlot. llvm-svn: 99265	2010-03-23 06:20:33 +00:00
Bob Wilson	340861d29e	Change VLD1 instructions for loading Q register values to operate on pairs of D registers. Add a separate VLD1q instruction with a Q register destination operand for use by loadRegFromStackSlot. llvm-svn: 99261	2010-03-23 05:25:43 +00:00
Bob Wilson	e60e3ab624	Rename one more NEON instruction that I missed earlier. llvm-svn: 99201	2010-03-22 20:31:39 +00:00
Bob Wilson	c286c88db0	Regroup some instructions. No functional change. llvm-svn: 99192	2010-03-22 18:22:06 +00:00
Bob Wilson	c53a1125ff	Rename some VLD1/VST1 instructions to match the implementation, i.e., the corresponding NEON instructions, instead of operation they are currently used for. llvm-svn: 99189	2010-03-22 18:13:18 +00:00
Bob Wilson	98bf5189d7	Remove some redundant instruction classes. llvm-svn: 99187	2010-03-22 18:02:38 +00:00
Bob Wilson	debe0bdb13	Refactor instruction encoding arguments for VLDnLN/VSTnLN classes to specify encoding bits in arguments instead of "let" expressions. llvm-svn: 99185	2010-03-22 16:43:10 +00:00
Bob Wilson	ae08a736d6	Re-commit r98683 ("remove redundant writeback flag from ARM address mode 6") with changes to add a separate optional register update argument. Change all the NEON instructions with address register writeback to use it. llvm-svn: 99095	2010-03-20 22:13:40 +00:00
Bob Wilson	59e5141d44	Add instruction variants for VST2, VST3, and VST4 "store-lane" operations with address register writeback. llvm-svn: 99094	2010-03-20 21:57:36 +00:00
Bob Wilson	b18adef4ad	Add variants of VST2, VST3 and VST4 with address register writeback, and rewrite the existing VST3 and VST4 instructions to use the same classes as the others. llvm-svn: 99093	2010-03-20 21:45:18 +00:00
Bob Wilson	89ba42c4ce	Add instructions for double-spaced VST3 and VST4 without address register writeback, and refactor the existing double-spaced VST2 instructions. These are only for the disassembler since codegen doesn't use them, at least for now. llvm-svn: 99090	2010-03-20 21:15:48 +00:00
Bob Wilson	322cbff3d3	Add VST1 instructions with address register writeback. llvm-svn: 99083	2010-03-20 20:54:36 +00:00
Bob Wilson	9152d96dfb	Add instruction variants for VLD2, VLD3, and VLD4 "load-lane" operations with address register writeback. llvm-svn: 99082	2010-03-20 20:47:18 +00:00
Bob Wilson	9b1584245a	Tidy some more comments and whitespace. llvm-svn: 99081	2010-03-20 20:39:53 +00:00
Bob Wilson	cf324658f6	Add variants of VLD2, VLD3 and VLD4 with address register writeback, and rewrite the existing VLD3 and VLD4 instructions to use the same classes as the others. llvm-svn: 99080	2010-03-20 20:10:51 +00:00
Bob Wilson	7ee900da22	Tidy some comments and whitespace for consistency. llvm-svn: 99078	2010-03-20 19:57:03 +00:00
Bob Wilson	c0795f8b87	Rename some instructions for consistency and sanity: use "_UPD" suffix for load/stores with address register writeback, and use "odd" suffix to distinguish instructions to access odd numbered registers (instead of "a" and "b"). No functional changes. llvm-svn: 99066	2010-03-20 18:35:24 +00:00
Bob Wilson	d092669b48	Add instructions for double-spaced VLD3 and VLD4 without address register writeback, and refactor the existing double-spaced VLD2 instructions. These are only for the disassembler since codegen doesn't use them, at least for now. llvm-svn: 99065	2010-03-20 18:14:26 +00:00
Bob Wilson	496766cb56	Add VLD1 instructions with address register writeback. llvm-svn: 99062	2010-03-20 17:59:03 +00:00
Bob Wilson	e4191e719b	Revert this change, since it was causing ARM performance regressions. --- Reverse-merging r98889 into '.': U lib/Target/ARM/ARMInstrNEON.td U lib/Target/ARM/ARMISelLowering.h U lib/Target/ARM/ARMInstrInfo.td U lib/Target/ARM/ARMInstrVFP.td U lib/Target/ARM/ARMISelLowering.cpp U lib/Target/ARM/ARMInstrFormats.td llvm-svn: 99010	2010-03-19 22:51:32 +00:00
Anton Korobeynikov	f11aa9e7b4	Get rid of target-specific fp <-> int nodes when still I'm here. llvm-svn: 98889	2010-03-18 22:35:45 +00:00
Bob Wilson	a7f236ae3a	Refactor NEON ld/st instructions to hardcode class arguments that are constants. No functional changes. llvm-svn: 98860	2010-03-18 20:18:39 +00:00
Johnny Chen	274a0d3794	Revert 98745 with respect to the addition of NEONFrm subformats for disassembly. There is a better way coming up. llvm-svn: 98777	2010-03-17 23:26:50 +00:00
Johnny Chen	8f3004cff2	Added sub-formats to the NeonI/NeonXI instructions to further refine the NEONFrm instructions to help disassembly. We also changed the output of the addressing modes to omit the '+' from the assembler syntax #+/-<imm> or +/-<Rm>. See, for example, A8.6.57/58/60. And modified test cases to not expect '+' in +reg or #+num. For example, ; CHECK: ldr.w r9, [r7, #28] llvm-svn: 98745	2010-03-17 17:52:21 +00:00
Bob Wilson	c7ba918b84	Revert 98683. It is breaking something in the disassembler. llvm-svn: 98692	2010-03-16 23:01:13 +00:00
Bob Wilson	c953bca10b	Remove redundant writeback flag from ARM address mode 6. Also remove the optional register update argument, which is currently unused -- when we add support for that, it can just be a separate operand. llvm-svn: 98683	2010-03-16 21:44:40 +00:00
Bob Wilson	1b4e8cc69c	--- Reverse-merging r98637 into '.': U test/CodeGen/ARM/tls2.ll U test/CodeGen/ARM/arm-negative-stride.ll U test/CodeGen/ARM/2009-10-30.ll U test/CodeGen/ARM/globals.ll U test/CodeGen/ARM/str_pre-2.ll U test/CodeGen/ARM/ldrd.ll U test/CodeGen/ARM/2009-10-27-double-align.ll U test/CodeGen/Thumb2/thumb2-strb.ll U test/CodeGen/Thumb2/ldr-str-imm12.ll U test/CodeGen/Thumb2/thumb2-strh.ll U test/CodeGen/Thumb2/thumb2-ldr.ll U test/CodeGen/Thumb2/thumb2-str_pre.ll U test/CodeGen/Thumb2/thumb2-str.ll U test/CodeGen/Thumb2/thumb2-ldrh.ll U utils/TableGen/TableGen.cpp U utils/TableGen/DisassemblerEmitter.cpp D utils/TableGen/RISCDisassemblerEmitter.h D utils/TableGen/RISCDisassemblerEmitter.cpp U Makefile.rules U lib/Target/ARM/ARMInstrNEON.td U lib/Target/ARM/Makefile U lib/Target/ARM/AsmPrinter/ARMInstPrinter.cpp U lib/Target/ARM/AsmPrinter/ARMAsmPrinter.cpp U lib/Target/ARM/AsmPrinter/ARMInstPrinter.h D lib/Target/ARM/Disassembler U lib/Target/ARM/ARMInstrFormats.td U lib/Target/ARM/ARMAddressingModes.h U lib/Target/ARM/Thumb2ITBlockPass.cpp llvm-svn: 98640	2010-03-16 16:59:47 +00:00
Johnny Chen	3d9327bd06	Initial ARM/Thumb disassembler check-in. It consists of a tablgen backend (RISCDisassemblerEmitter) which emits the decoder functions for ARM and Thumb, and the disassembler core which invokes the decoder function and builds up the MCInst based on the decoded Opcode. Added sub-formats to the NeonI/NeonXI instructions to further refine the NEONFrm instructions to help disassembly. We also changed the output of the addressing modes to omit the '+' from the assembler syntax #+/-<imm> or +/-<Rm>. See, for example, A8.6.57/58/60. And modified test cases to not expect '+' in +reg or #+num. For example, ; CHECK: ldr.w r9, [r7, #28] llvm-svn: 98637	2010-03-16 16:36:54 +00:00
Chris Lattner	ce81b3c120	fix an ambiguous pattern, contrary to expectations, scalar_to_vector doesn't have a type constraint on the scalar because we don't have an 'sAny' type. llvm-svn: 98527	2010-03-15 00:52:43 +00:00
Bob Wilson	27cce1c0b6	Remove obsolete comments. VLDM is implemented in ARMInstrVFP.td. llvm-svn: 98395	2010-03-12 22:00:08 +00:00
Chris Lattner	b8a7427636	fix a bunch of partially ambiguous patterns on ARM. As an example, this: (set DPR:$dst, (fsub (fneg (fmul DPR:$a, DPR:$b)), DPR:$dstin)) is ambiguous because DPR contains both f64 and v2f32. tblgen currently accidentally picks f64 because it's first in the regclass. llvm-svn: 97955	2010-03-08 18:51:21 +00:00
Johnny Chen	86ba44a4c7	Added Vector Swap (VSWPd and VSWPq) instructions for disassembly only. A8.6.405 llvm-svn: 97052	2010-02-24 20:06:07 +00:00
Johnny Chen	03ac201ad9	Fixed typo of opcodestr, should be "vst1", not "vld1". llvm-svn: 97044	2010-02-24 18:00:40 +00:00
Johnny Chen	d5c472d811	Added for disassembly VST1 (multiple single elements) which stores elements to memory from three or four registers and VST2 (multiple two-element structures) which stores to memory from two double-spaced registers. A8.6.391 & A8.6.393 llvm-svn: 97018	2010-02-24 02:57:20 +00:00
Johnny Chen	b14a5c52bc	Added for disassembly VLD1 (multiple single elements) which loads memory into three or four registers and VLD2 (multiple two-element structures) which loads memory into two double-spaced registers. A8.6.307 & A8.6.310 llvm-svn: 96980	2010-02-23 20:51:23 +00:00
Johnny Chen	21dbd6f449	Added versions of VCGE, VCGT, VCLE, and VCLT NEON instructions which compare to (immediate #0) for disassembly only. A8.6.283, A8.6.285, A8.6.287, A8.6.290 llvm-svn: 96856	2010-02-23 01:42:58 +00:00
Johnny Chen	886915e3bb	Added VCEQ (immediate #0 ) NEON instruction for disassembly only. A8.6.281 llvm-svn: 96838	2010-02-23 00:33:12 +00:00
Bob Wilson	c6c13a3515	Use NEON vmin/vmax instructions for floating-point selects. Radar 7461718. llvm-svn: 96572	2010-02-18 06:05:53 +00:00
Bob Wilson	cb2deb2aaf	Remove the NEON N2VSInt instruction class: it's only used in one place and since it has no pattern, there's not much point in distinguishing an "N2VS" class for intrinsics anyway. llvm-svn: 96525	2010-02-17 22:42:54 +00:00
Bob Wilson	004d280d5e	More cleanup for NEON: * Use "S" abbreviation for scalar single FP registers in class and pattern names, instead of keeping the "D" (for "double") abbreviation and tacking on an "s" elsewhere in the name. * Move the scalar single FP register classes and patterns to be more consistent with other definitions in the file. * Rename "VNEGf32d" definition to "VNEGfd" for consistency. * Deleted the N2VDIntsPat pattern; N2VSPat is good enough. llvm-svn: 96521	2010-02-17 22:23:11 +00:00
Bob Wilson	9e89907ed5	Wrap lines to 80 columns and generally try to clean up whitespace and indentation. No functional changes. llvm-svn: 96418	2010-02-17 00:31:29 +00:00
Johnny Chen	1215c774f2	Add VBIF/VBIT for disassembly only. A8.6.279 llvm-svn: 95713	2010-02-09 23:05:23 +00:00
Bob Wilson	7430a98619	Emit spaces after commas in Neon register lists. This is more consistent with the rest of the assembly output, is easier to read, and matches the expected output for gcc's Neon tests. llvm-svn: 93703	2010-01-18 01:24:43 +00:00
Bob Wilson	9349437c65	The Neon "vtst" instruction takes a suffix that is the element size alone -- adding an "i" to the suffix, indicating that the elements are integers, is accepted but not part of the standard syntax. This helps us pass a few more of the Neon tests from gcc. llvm-svn: 93677	2010-01-17 06:35:17 +00:00
Johnny Chen	86fc920742	For VLDM/VSTM (Advanced SIMD), set encoding bits Inst{11-8} to 0b1011. llvm-svn: 90243	2009-12-01 17:37:06 +00:00
Johnny Chen	ee536b0ea4	For VMOV (immediate), make some of the encoding bits (cmode and op) unspecified. For VMOVv*i[16,32], op bit is don't care, and some cmode bits vary depending on the immediate values. Ref: Table A7-15 Modified immediate values for Advanced SIMD instructions. llvm-svn: 90173	2009-12-01 00:02:02 +00:00
Evan Cheng	738a97a1db	Massive refactoring of NEON instructions. Separate opcode from data size specifier suffix, move \t up stream to instruction format, and fix more 80 column violations. This fixes the NEON asm printing so the "predicate" field is printed between the opcode and the data type suffix. llvm-svn: 89706	2009-11-23 21:57:23 +00:00
Johnny Chen	b6528d3244	Partially revert r84730 by removing N2VDup from ARMInstrFormats.td and modifying VDUPLND and VDUPLNQ to derive from N2V instead of N2VDup. VDUPLND and VDUPLNQ now expect op19_18 and op17_16 as the first two args. llvm-svn: 89699	2009-11-23 21:00:43 +00:00
Johnny Chen	5ad7416260	Revert r84572 by removing N3VImm from ARMInstrFormats.td now that we can specify {?,?,?,?} as op11_8 for VEXTd and VEXTq. llvm-svn: 89693	2009-11-23 20:09:13 +00:00
Johnny Chen	e97457afbc	Partially revert r89377 by removing NLdStLN class definition from ARMInstrFormats.td and fixing VLD[234]LN* and VST[234]LN* to derive from NLdSt instead of NLdStLN. llvm-svn: 89684	2009-11-23 18:16:16 +00:00
Johnny Chen	ebc60ef80c	Make it clear that the index bit(s) of Vector Get Lane and Vector Set Lane should be left unspecified now that Bob Wilson has fixed pr5470. llvm-svn: 89676	2009-11-23 17:48:17 +00:00
Evan Cheng	a33fc86be3	Add predicate operand to NEON instructions. Fix lots (but not all) 80 col violations in ARMInstrNEON.td. llvm-svn: 89542	2009-11-21 06:21:52 +00:00
Johnny Chen	b3b8209d77	Added NLdStLN which is similar to NLdSt with the exception that op7_4 is not fully specified at this level. Subclasses of NLdStLN can specify selective bit(s) for Inst{7-4}, as is done for VLD[234]LN* and VST[234]LN* inside ARMInstrNEON.td. llvm-svn: 89377	2009-11-19 19:20:17 +00:00
Evan Cheng	e129dd311e	Use table to separate opcode from operands. llvm-svn: 86965	2009-11-12 07:16:34 +00:00
Jim Grosbach	d7cf55cd0e	Use Unified Assembly Syntax for the ARM backend. llvm-svn: 86494	2009-11-09 00:11:35 +00:00
Bob Wilson	d95ccd6c4d	Print VMOV (immediate) operands as hexadecimal values. Apple's assembler will not accept negative values for these. LLVM's default operand printing sign extends values, so that valid unsigned values appear as negative immediates. Print all VMOV immediate operands as hex values to resolve this. Radar 7372576. llvm-svn: 86301	2009-11-06 23:33:28 +00:00
Anton Korobeynikov	0f38d989bd	Do not infer the target type for COPY_TO_REGCLASS from dest regclass, this won't work if it can contain several types. Require explicit result type for the node for now. This fixes PR5364. PS: It seems that blackfin usage of copy_to_regclass is completely bogus! llvm-svn: 85766	2009-11-02 00:11:39 +00:00
Jim Grosbach	5cba8de2c8	vml[as].f32 cause stalls in following advanced SIMD instructions. Avoid using them for scalar floating point operations for now. llvm-svn: 85697	2009-10-31 22:57:36 +00:00
Bob Wilson	0db964a3a0	Fix NEON VST2LN instruction encoding. Patch by Johnny Chen. llvm-svn: 84767	2009-10-21 17:54:01 +00:00
Bob Wilson	87671da29a	Revert 84732. It was the wrong fix. llvm-svn: 84766	2009-10-21 17:52:34 +00:00
Bob Wilson	5b5cb92816	Fix some more NEON instruction encoding problems. Thanks to Johnny Chen for discovering the problem. llvm-svn: 84732	2009-10-21 02:27:20 +00:00
Bob Wilson	bd3650cc84	Leave some NEON instruction encoding bits unspecified instead of setting a default value of zero. This is important for decoding the instructions. Patch by Johnny Chen, with some changes from me, too. llvm-svn: 84730	2009-10-21 02:15:46 +00:00
Jim Grosbach	772b2f84eb	Refs: A8-598. Leave Inst{11-8}, which represents the starting byte index of the extracted result in the concatenation of the operands and is left unspecified. Patch by Johnny Chen. llvm-svn: 84572	2009-10-20 00:38:19 +00:00
Bob Wilson	01404ecec7	Fix more NEON instruction encodings. Patch by Johnny Chen. llvm-svn: 84243	2009-10-16 03:58:44 +00:00
Bob Wilson	4138b11c93	Fix encoding bits for N3VLInt3_QHS multiclass with 8-bit elements. Patch by Johnny Chen. llvm-svn: 84206	2009-10-15 21:57:47 +00:00
Bob Wilson	cfcf6bc70d	Fix instruction encoding bits for NEON VPADAL. Patch by Johnny Chen. llvm-svn: 84146	2009-10-14 21:43:17 +00:00
Jim Grosbach	94068707e1	Inst{11-8} for vshl should be 0b0101, not 0b1111. Refs: A7-17 & A8-750. Patch by Johnny Chen. llvm-svn: 84131	2009-10-14 20:31:01 +00:00
Bob Wilson	84e7967fae	Add codegen support for NEON vst4lane intrinsics with 128-bit vectors. llvm-svn: 83600	2009-10-09 00:01:36 +00:00
Bob Wilson	c409030838	Add codegen support for NEON vst3lane intrinsics with 128-bit vectors. llvm-svn: 83598	2009-10-08 23:51:31 +00:00
Bob Wilson	b851eb356a	Add codegen support for NEON vst2lane intrinsics with 128-bit vectors. llvm-svn: 83596	2009-10-08 23:38:24 +00:00
Bob Wilson	38ba47225a	Add codegen support for NEON vld4lane intrinsics with 128-bit vectors. Also fix some copy-and-paste errors in previous changes. llvm-svn: 83590	2009-10-08 22:53:57 +00:00
Bob Wilson	cf54e934f8	Add codegen support for NEON vld3lane intrinsics with 128-bit vectors. llvm-svn: 83585	2009-10-08 22:27:33 +00:00
Bob Wilson	c2728f44a9	Add codegen support for NEON vld2lane intrinsics with 128-bit vectors. llvm-svn: 83568	2009-10-08 18:56:10 +00:00
Bob Wilson	b6b0ab6117	Add codegen support for NEON vst4 intrinsics with <1 x i64> vectors. llvm-svn: 83526	2009-10-08 05:18:18 +00:00
Bob Wilson	71387b4b2f	Add codegen support for NEON vst3 intrinsics with <1 x i64> vectors. llvm-svn: 83518	2009-10-08 00:28:28 +00:00
Bob Wilson	d4f5670096	Add codegen support for NEON vst2 intrinsics with <1 x i64> vectors. llvm-svn: 83513	2009-10-08 00:21:01 +00:00
Bob Wilson	32cc4ec304	Add codegen support for NEON vld4 intrinsics with <1 x i64> vectors. llvm-svn: 83508	2009-10-07 23:54:04 +00:00
Bob Wilson	5ef3c6d9f4	Add codegen support for NEON vld3 intrinsics with <1 x i64> vectors. llvm-svn: 83506	2009-10-07 23:39:57 +00:00
Bob Wilson	763be1a248	Add codegen support for NEON vld2 intrinsics with <1 x i64> vectors. llvm-svn: 83502	2009-10-07 22:57:01 +00:00
Bob Wilson	50820a2677	Add some instruction encoding bits for NEON load/store instructions. llvm-svn: 83490	2009-10-07 21:53:04 +00:00
Bob Wilson	e7ef4a9a6b	Add codegen support for NEON vst4 intrinsics with 128-bit vectors. llvm-svn: 83486	2009-10-07 20:49:18 +00:00
Bob Wilson	23464866ad	Add codegen support for NEON vst3 intrinsics with 128-bit vectors. llvm-svn: 83484	2009-10-07 20:30:08 +00:00
Bob Wilson	3dcb5377ef	Add codegen support for NEON vst2 intrinsics with 128-bit vectors. llvm-svn: 83482	2009-10-07 18:47:39 +00:00
Bob Wilson	ab3a9474d6	Add codegen support for NEON vld4 intrinsics with 128-bit vectors. llvm-svn: 83479	2009-10-07 18:09:32 +00:00
Bob Wilson	6bbefc2f67	Add codegen support for NEON vld3 intrinsics with 128-bit vectors. llvm-svn: 83471	2009-10-07 17:24:55 +00:00
Bob Wilson	e6b778d5ff	Add codegen support for NEON vld2 operations on quad registers. llvm-svn: 83422	2009-10-06 22:01:59 +00:00
Bob Wilson	d76b9b766c	Add a comment to describe letters used in multiclass name suffixes. llvm-svn: 83257	2009-10-03 04:44:16 +00:00
Bob Wilson	a9abf57409	Fix encoding problem for VMLS instruction. Thanks to Johnny Chen for pointing this out! llvm-svn: 83256	2009-10-03 04:41:21 +00:00
Evan Cheng	1b2b64f618	Add hasExtraSrcRegAllocReq and hasExtraDefRegAllocReq flags to ld / st multiple, ld / st pairs, etc. llvm-svn: 83197	2009-10-01 08:22:27 +00:00
David Goodwin	bea6848f9d	Finish scheduling itineraries for NEON. llvm-svn: 82788	2009-09-25 18:38:29 +00:00
David Goodwin	afcaf79603	Checkpoint NEON scheduling itineraries. llvm-svn: 82657	2009-09-23 21:38:08 +00:00
Anton Korobeynikov	8d0fbebb9f	Add QPR_VFP2 regclass and add copy_to_regclass nodes, where needed to constraint the register usage. llvm-svn: 81635	2009-09-12 22:21:08 +00:00
Anton Korobeynikov	7697d37777	Unbreak getOnesVector() / getZeroVector() to use valid ARM extended imm's. llvm-svn: 81262	2009-09-08 22:51:43 +00:00
Anton Korobeynikov	59e2b8e894	Add NEON 'laned' operations. This fixes another bunch of gcc testsuite fails and makes the code faster. llvm-svn: 81220	2009-09-08 15:22:32 +00:00
Anton Korobeynikov	f0da41c3e4	More missed vdup patterns llvm-svn: 80838	2009-09-02 21:21:28 +00:00
Bob Wilson	d7797754d4	Add support for generating code for vst{234}lane intrinsics. llvm-svn: 80707	2009-09-01 18:51:56 +00:00
Bob Wilson	da9817cddd	Generate code for vld{234}_lane intrinsics. llvm-svn: 80656	2009-09-01 04:26:28 +00:00
Anton Korobeynikov	3681144bd8	Add missed pattern llvm-svn: 80502	2009-08-30 19:06:39 +00:00
Anton Korobeynikov	cd41d07f29	Add missed extract_element pattern llvm-svn: 80408	2009-08-28 23:41:26 +00:00
Anton Korobeynikov	076f105d86	Forgot about actual change :) llvm-svn: 80250	2009-08-27 16:10:17 +00:00
Anton Korobeynikov	58ebae4acd	Transform float scalar_to_vector into subreg accesses. No idea whether this is profitable or not. llvm-svn: 80245	2009-08-27 14:38:44 +00:00
Bob Wilson	f1beef9f48	Remove some unused SDNode definitions. llvm-svn: 80015	2009-08-25 17:52:39 +00:00
Bob Wilson	9129376719	Expose the instruction contraint string as an argument to the NLdSt class. llvm-svn: 80011	2009-08-25 17:46:06 +00:00
Bob Wilson	ceffeb6abd	Rename ARM "lane_cst" operands to "nohash_imm" since they are used for several things other than Neon vector lane numbers. For inline assembly operands with a "c" print code, check that they really are immediates. llvm-svn: 79676	2009-08-21 21:58:55 +00:00
Anton Korobeynikov	232b19c3d5	Fix some typos and use type-based isel for VZIP/VUZP/VTRN llvm-svn: 79625	2009-08-21 12:41:42 +00:00
Anton Korobeynikov	ce3ff1be8a	Add nodes & dummy matchers for some v{zip,uzp,trn} instructions llvm-svn: 79622	2009-08-21 12:40:50 +00:00
Anton Korobeynikov	38f284f2ae	Provide vext.{16,32} llvm-svn: 79620	2009-08-21 12:40:21 +00:00
Bob Wilson	32cd8550ce	Add support for Neon VEXT (vector extract) shuffles. This is derived from a patch by Anton Korzh. I modified it to recognize the VEXT shuffles during legalization and lower them to a target-specific DAG node. llvm-svn: 79428	2009-08-19 17:03:43 +00:00
Bob Wilson	eb54d51759	Create a new ARM-specific DAG node, VDUP, to represent a splat from a scalar_to_vector. Generate these VDUP nodes during legalization instead of trying to recognize the pattern during selection. llvm-svn: 78994	2009-08-14 05:13:08 +00:00
Bob Wilson	cce31f6831	During legalization, change Neon vdup_lane operations from shuffles to target-specific VDUPLANE nodes. This allows the subreg handling for the quad-register version to be done easily with Pats in the .td file, instead of with custom code in ARMISelDAGToDAG.cpp. llvm-svn: 78993	2009-08-14 05:08:32 +00:00
Bob Wilson	ef6e602bf4	Revert r78852 for now. I want to do this differently, but I don't have time to fix it tonight. llvm-svn: 78896	2009-08-13 05:58:56 +00:00
Bob Wilson	ff2db10211	Recognize Neon VDUP shuffles during legalization instead of selection. llvm-svn: 78852	2009-08-12 22:54:19 +00:00
Bob Wilson	ea3a402ae7	Recognize Neon VREV shuffles during legalization instead of selection. llvm-svn: 78850	2009-08-12 22:31:50 +00:00
Bob Wilson	4b35448360	Generate Neon VTBL and VTBX instructions from the corresponding intrinsics. llvm-svn: 78835	2009-08-12 20:51:55 +00:00
Bob Wilson	25cae66713	Fix TableGen warnings. This partly reverts my previous change to this file, leaving the mayLoad and mayStore settings around only the load/store instructions where those can't be inferred from the patterns. llvm-svn: 78815	2009-08-12 17:04:56 +00:00
Bob Wilson	f042eadd1e	Add missing chain operands for VLD* and VST* instructions. Set "mayLoad" and "mayStore" on the load/store instructions. llvm-svn: 78761	2009-08-12 00:49:01 +00:00
Owen Anderson	9f94459d24	Split EVT into MVT and EVT, the former representing _just_ a primitive type, while the latter is capable of representing either a primitive or an extended type. llvm-svn: 78713	2009-08-11 20:47:22 +00:00
Bob Wilson	12842f9865	Use vAny type to get rid of Neon intrinsics that differed only in whether the overloaded vector types allowed floating-point or integer vector elements. Most of these operations actually depend on the element type, so bitcasting was not an option. If you include the vpadd intrinsics that I updated earlier, this gets rid of 20 intrinsics. llvm-svn: 78646	2009-08-11 05:39:44 +00:00
Bob Wilson	741a9c7bf6	Use new EVT::vAny type to combine Neon intrinsics for VPADD. llvm-svn: 78632	2009-08-11 01:15:26 +00:00
David Goodwin	b80734bb15	Fix bug in NEON convert for single-precision FP. This also fixes the tblgen warnings. llvm-svn: 78629	2009-08-11 01:07:38 +00:00
Owen Anderson	53aa7a960c	Rename MVT to EVT, in preparation for splitting SimpleValueType out into its own struct type. llvm-svn: 78610	2009-08-10 22:56:29 +00:00
David Goodwin	85b5b027f7	Use NEON for single-precision int<->FP conversions. llvm-svn: 78604	2009-08-10 22:17:39 +00:00
Anton Korobeynikov	cfed3005e5	Use subclassing to print lane-like immediates (w/o hash) eliminating 'no_hash' modifier. Hopefully this will make Daniel happy :) llvm-svn: 78514	2009-08-08 23:10:41 +00:00
Anton Korobeynikov	7167f33872	Add insert_elt / extract_elt patterns for v4f32 stuff. Did anyone tests v4f32 ever? llvm-svn: 78470	2009-08-08 14:06:07 +00:00
Anton Korobeynikov	4218516f5d	Lane number should be printed w/o hash llvm-svn: 78469	2009-08-08 14:05:53 +00:00
Anton Korobeynikov	887d05ce9b	Use VLDM / VSTM to spill/reload 128-bit Neon registers llvm-svn: 78468	2009-08-08 13:35:48 +00:00
Bob Wilson	e2231070ff	Implement Neon VZIP and VUZP instructions. These are very similar to VTRN, so I generalized the class for VTRN in the .td file to handle all 3 of them. llvm-svn: 78460	2009-08-08 06:13:25 +00:00
Bob Wilson	db46af0461	Implement Neon VTRN instructions. For now, anyway, these are selected directly from the intrinsics produced by the frontend. If it is more convenient to have a custom DAG node for using these to implement shuffles, we can add that later. llvm-svn: 78459	2009-08-08 05:53:00 +00:00
Anton Korobeynikov	d28a26dfab	Unbreak the stuff llvm-svn: 78425	2009-08-07 22:51:13 +00:00
Anton Korobeynikov	23b28cb824	2 more vdup.32 cases llvm-svn: 78419	2009-08-07 22:36:50 +00:00
Evan Cheng	4c3b1ca5a0	Fix support to use NEON for single precision fp math. llvm-svn: 78397	2009-08-07 19:30:41 +00:00
Bob Wilson	0127031c20	Implement Neon VST[234] operations. llvm-svn: 78330	2009-08-06 18:47:44 +00:00
David Goodwin	b062c236c5	Add parameter to pattern classes to enable an itinerary to be specified for instructions. For now just use the existing itineraries or NoItinerary. llvm-svn: 78321	2009-08-06 16:52:47 +00:00
Bob Wilson	488db94e7b	Neon does not actually have VLD{234}.64 instructions. These operations will have to be synthesized from other instructions. llvm-svn: 78263	2009-08-06 00:24:27 +00:00
David Goodwin	e5b5d8fbb3	When using NEON for single-precision FP, the NEON result must be placed in D0-D15 as these are the only D registers with S subregs. Introduce a new regclass to represent D0-D15 and use it in the NEON single-precision FP patterns. llvm-svn: 78244	2009-08-05 21:02:22 +00:00
Evan Cheng	e219be7346	80 col violations. llvm-svn: 78175	2009-08-05 06:41:25 +00:00
Bob Wilson	20f79e321e	Change DAG nodes for Neon VLD2/3/4 operations to return multiple results. Get rid of yesterday's code to fix the register usage during isel. Select the new DAG nodes to machine instructions. The new pre-alloc pass to choose adjacent registers for these results is not done, so the results of this will generally not assemble yet. llvm-svn: 78136	2009-08-05 00:49:09 +00:00
Bob Wilson	a8720101b5	Replace dregsingle operand modifier with explicit escaped curly brackets. For other VLDn and VSTn operations, we need to list the multiple registers explicitly anyway, so there's no point in special-casing this one usage. llvm-svn: 78109	2009-08-04 21:39:33 +00:00
David Goodwin	30bf625ac2	Add NEON single-precision FP support for fabs and fneg. llvm-svn: 78101	2009-08-04 20:39:05 +00:00
David Goodwin	a3839bc6c0	Match common pattern for FNMAC. Add NEON SP support. llvm-svn: 78085	2009-08-04 18:44:29 +00:00
David Goodwin	3b9c52c5c1	Initial support for single-precision FP using NEON. Added "neonfp" attribute to enable. Added patterns for some binary FP operations. llvm-svn: 78081	2009-08-04 17:53:06 +00:00
Bob Wilson	f45dee3ad2	Lower Neon VLD* intrinsics to custom DAG nodes, and manually allocate the results to fixed registers. llvm-svn: 78025	2009-08-04 00:36:16 +00:00
Bob Wilson	cf19885a32	Change Neon VLDn intrinsics to return multiple values instead of really wide vectors. Likewise, change VSTn intrinsics to take separate arguments for each vector in a multi-vector struct. Adjust tests accordingly. llvm-svn: 77468	2009-07-29 16:39:22 +00:00
Bob Wilson	8a37bbebfd	Add support for ARM Neon VREV instructions. Patch by Anton Korzh, with some modifications from me. llvm-svn: 77101	2009-07-26 00:39:34 +00:00
Evan Cheng	5edd90cbbc	- Add some NEON ld / st instruction static encoding. - Make bits 25-27 for ldrh, etc. explicitly zero. Previously only the JIT uses the encoding information and it's assuming anything not specified to be zero. Making them explicit so the disassembler is happy. Patch by Sean Callanan. llvm-svn: 75065	2009-07-08 22:51:32 +00:00
Bob Wilson	1d298fd75b	Implement NEON vst1 instruction. llvm-svn: 75037	2009-07-08 20:32:02 +00:00
Bob Wilson	f731a2df6b	Implement NEON vld1 instructions. llvm-svn: 75019	2009-07-08 18:11:30 +00:00
Bob Wilson	2e076c4e02	Add support for ARM's Advanced SIMD (NEON) instruction set. This is still a work in progress but most of the NEON instruction set is supported. llvm-svn: 73919	2009-06-22 23:27:02 +00:00

... 5 6 7 8 9 ...

582 Commits