llvm-project

Commit Graph

Author	SHA1	Message	Date
Johnny Chen	86ba44a4c7	Added Vector Swap (VSWPd and VSWPq) instructions for disassembly only. A8.6.405 llvm-svn: 97052	2010-02-24 20:06:07 +00:00
Johnny Chen	03ac201ad9	Fixed typo of opcodestr, should be "vst1", not "vld1". llvm-svn: 97044	2010-02-24 18:00:40 +00:00
Johnny Chen	d5c472d811	Added for disassembly VST1 (multiple single elements) which stores elements to memory from three or four registers and VST2 (multiple two-element structures) which stores to memory from two double-spaced registers. A8.6.391 & A8.6.393 llvm-svn: 97018	2010-02-24 02:57:20 +00:00
Johnny Chen	b14a5c52bc	Added for disassembly VLD1 (multiple single elements) which loads memory into three or four registers and VLD2 (multiple two-element structures) which loads memory into two double-spaced registers. A8.6.307 & A8.6.310 llvm-svn: 96980	2010-02-23 20:51:23 +00:00
Johnny Chen	21dbd6f449	Added versions of VCGE, VCGT, VCLE, and VCLT NEON instructions which compare to (immediate #0) for disassembly only. A8.6.283, A8.6.285, A8.6.287, A8.6.290 llvm-svn: 96856	2010-02-23 01:42:58 +00:00
Johnny Chen	886915e3bb	Added VCEQ (immediate #0 ) NEON instruction for disassembly only. A8.6.281 llvm-svn: 96838	2010-02-23 00:33:12 +00:00
Bob Wilson	c6c13a3515	Use NEON vmin/vmax instructions for floating-point selects. Radar 7461718. llvm-svn: 96572	2010-02-18 06:05:53 +00:00
Bob Wilson	cb2deb2aaf	Remove the NEON N2VSInt instruction class: it's only used in one place and since it has no pattern, there's not much point in distinguishing an "N2VS" class for intrinsics anyway. llvm-svn: 96525	2010-02-17 22:42:54 +00:00
Bob Wilson	004d280d5e	More cleanup for NEON: * Use "S" abbreviation for scalar single FP registers in class and pattern names, instead of keeping the "D" (for "double") abbreviation and tacking on an "s" elsewhere in the name. * Move the scalar single FP register classes and patterns to be more consistent with other definitions in the file. * Rename "VNEGf32d" definition to "VNEGfd" for consistency. * Deleted the N2VDIntsPat pattern; N2VSPat is good enough. llvm-svn: 96521	2010-02-17 22:23:11 +00:00
Bob Wilson	9e89907ed5	Wrap lines to 80 columns and generally try to clean up whitespace and indentation. No functional changes. llvm-svn: 96418	2010-02-17 00:31:29 +00:00
Johnny Chen	1215c774f2	Add VBIF/VBIT for disassembly only. A8.6.279 llvm-svn: 95713	2010-02-09 23:05:23 +00:00
Bob Wilson	7430a98619	Emit spaces after commas in Neon register lists. This is more consistent with the rest of the assembly output, is easier to read, and matches the expected output for gcc's Neon tests. llvm-svn: 93703	2010-01-18 01:24:43 +00:00
Bob Wilson	9349437c65	The Neon "vtst" instruction takes a suffix that is the element size alone -- adding an "i" to the suffix, indicating that the elements are integers, is accepted but not part of the standard syntax. This helps us pass a few more of the Neon tests from gcc. llvm-svn: 93677	2010-01-17 06:35:17 +00:00
Johnny Chen	86fc920742	For VLDM/VSTM (Advanced SIMD), set encoding bits Inst{11-8} to 0b1011. llvm-svn: 90243	2009-12-01 17:37:06 +00:00
Johnny Chen	ee536b0ea4	For VMOV (immediate), make some of the encoding bits (cmode and op) unspecified. For VMOVv*i[16,32], op bit is don't care, and some cmode bits vary depending on the immediate values. Ref: Table A7-15 Modified immediate values for Advanced SIMD instructions. llvm-svn: 90173	2009-12-01 00:02:02 +00:00
Evan Cheng	738a97a1db	Massive refactoring of NEON instructions. Separate opcode from data size specifier suffix, move \t up stream to instruction format, and fix more 80 column violations. This fixes the NEON asm printing so the "predicate" field is printed between the opcode and the data type suffix. llvm-svn: 89706	2009-11-23 21:57:23 +00:00
Johnny Chen	b6528d3244	Partially revert r84730 by removing N2VDup from ARMInstrFormats.td and modifying VDUPLND and VDUPLNQ to derive from N2V instead of N2VDup. VDUPLND and VDUPLNQ now expect op19_18 and op17_16 as the first two args. llvm-svn: 89699	2009-11-23 21:00:43 +00:00
Johnny Chen	5ad7416260	Revert r84572 by removing N3VImm from ARMInstrFormats.td now that we can specify {?,?,?,?} as op11_8 for VEXTd and VEXTq. llvm-svn: 89693	2009-11-23 20:09:13 +00:00
Johnny Chen	e97457afbc	Partially revert r89377 by removing NLdStLN class definition from ARMInstrFormats.td and fixing VLD[234]LN* and VST[234]LN* to derive from NLdSt instead of NLdStLN. llvm-svn: 89684	2009-11-23 18:16:16 +00:00
Johnny Chen	ebc60ef80c	Make it clear that the index bit(s) of Vector Get Lane and Vector Set Lane should be left unspecified now that Bob Wilson has fixed pr5470. llvm-svn: 89676	2009-11-23 17:48:17 +00:00
Evan Cheng	a33fc86be3	Add predicate operand to NEON instructions. Fix lots (but not all) 80 col violations in ARMInstrNEON.td. llvm-svn: 89542	2009-11-21 06:21:52 +00:00
Johnny Chen	b3b8209d77	Added NLdStLN which is similar to NLdSt with the exception that op7_4 is not fully specified at this level. Subclasses of NLdStLN can specify selective bit(s) for Inst{7-4}, as is done for VLD[234]LN* and VST[234]LN* inside ARMInstrNEON.td. llvm-svn: 89377	2009-11-19 19:20:17 +00:00
Evan Cheng	e129dd311e	Use table to separate opcode from operands. llvm-svn: 86965	2009-11-12 07:16:34 +00:00
Jim Grosbach	d7cf55cd0e	Use Unified Assembly Syntax for the ARM backend. llvm-svn: 86494	2009-11-09 00:11:35 +00:00
Bob Wilson	d95ccd6c4d	Print VMOV (immediate) operands as hexadecimal values. Apple's assembler will not accept negative values for these. LLVM's default operand printing sign extends values, so that valid unsigned values appear as negative immediates. Print all VMOV immediate operands as hex values to resolve this. Radar 7372576. llvm-svn: 86301	2009-11-06 23:33:28 +00:00
Anton Korobeynikov	0f38d989bd	Do not infer the target type for COPY_TO_REGCLASS from dest regclass, this won't work if it can contain several types. Require explicit result type for the node for now. This fixes PR5364. PS: It seems that blackfin usage of copy_to_regclass is completely bogus! llvm-svn: 85766	2009-11-02 00:11:39 +00:00
Jim Grosbach	5cba8de2c8	vml[as].f32 cause stalls in following advanced SIMD instructions. Avoid using them for scalar floating point operations for now. llvm-svn: 85697	2009-10-31 22:57:36 +00:00
Bob Wilson	0db964a3a0	Fix NEON VST2LN instruction encoding. Patch by Johnny Chen. llvm-svn: 84767	2009-10-21 17:54:01 +00:00
Bob Wilson	87671da29a	Revert 84732. It was the wrong fix. llvm-svn: 84766	2009-10-21 17:52:34 +00:00
Bob Wilson	5b5cb92816	Fix some more NEON instruction encoding problems. Thanks to Johnny Chen for discovering the problem. llvm-svn: 84732	2009-10-21 02:27:20 +00:00
Bob Wilson	bd3650cc84	Leave some NEON instruction encoding bits unspecified instead of setting a default value of zero. This is important for decoding the instructions. Patch by Johnny Chen, with some changes from me, too. llvm-svn: 84730	2009-10-21 02:15:46 +00:00
Jim Grosbach	772b2f84eb	Refs: A8-598. Leave Inst{11-8}, which represents the starting byte index of the extracted result in the concatenation of the operands and is left unspecified. Patch by Johnny Chen. llvm-svn: 84572	2009-10-20 00:38:19 +00:00
Bob Wilson	01404ecec7	Fix more NEON instruction encodings. Patch by Johnny Chen. llvm-svn: 84243	2009-10-16 03:58:44 +00:00
Bob Wilson	4138b11c93	Fix encoding bits for N3VLInt3_QHS multiclass with 8-bit elements. Patch by Johnny Chen. llvm-svn: 84206	2009-10-15 21:57:47 +00:00
Bob Wilson	cfcf6bc70d	Fix instruction encoding bits for NEON VPADAL. Patch by Johnny Chen. llvm-svn: 84146	2009-10-14 21:43:17 +00:00
Jim Grosbach	94068707e1	Inst{11-8} for vshl should be 0b0101, not 0b1111. Refs: A7-17 & A8-750. Patch by Johnny Chen. llvm-svn: 84131	2009-10-14 20:31:01 +00:00
Bob Wilson	84e7967fae	Add codegen support for NEON vst4lane intrinsics with 128-bit vectors. llvm-svn: 83600	2009-10-09 00:01:36 +00:00
Bob Wilson	c409030838	Add codegen support for NEON vst3lane intrinsics with 128-bit vectors. llvm-svn: 83598	2009-10-08 23:51:31 +00:00
Bob Wilson	b851eb356a	Add codegen support for NEON vst2lane intrinsics with 128-bit vectors. llvm-svn: 83596	2009-10-08 23:38:24 +00:00
Bob Wilson	38ba47225a	Add codegen support for NEON vld4lane intrinsics with 128-bit vectors. Also fix some copy-and-paste errors in previous changes. llvm-svn: 83590	2009-10-08 22:53:57 +00:00
Bob Wilson	cf54e934f8	Add codegen support for NEON vld3lane intrinsics with 128-bit vectors. llvm-svn: 83585	2009-10-08 22:27:33 +00:00
Bob Wilson	c2728f44a9	Add codegen support for NEON vld2lane intrinsics with 128-bit vectors. llvm-svn: 83568	2009-10-08 18:56:10 +00:00
Bob Wilson	b6b0ab6117	Add codegen support for NEON vst4 intrinsics with <1 x i64> vectors. llvm-svn: 83526	2009-10-08 05:18:18 +00:00
Bob Wilson	71387b4b2f	Add codegen support for NEON vst3 intrinsics with <1 x i64> vectors. llvm-svn: 83518	2009-10-08 00:28:28 +00:00
Bob Wilson	d4f5670096	Add codegen support for NEON vst2 intrinsics with <1 x i64> vectors. llvm-svn: 83513	2009-10-08 00:21:01 +00:00
Bob Wilson	32cc4ec304	Add codegen support for NEON vld4 intrinsics with <1 x i64> vectors. llvm-svn: 83508	2009-10-07 23:54:04 +00:00
Bob Wilson	5ef3c6d9f4	Add codegen support for NEON vld3 intrinsics with <1 x i64> vectors. llvm-svn: 83506	2009-10-07 23:39:57 +00:00
Bob Wilson	763be1a248	Add codegen support for NEON vld2 intrinsics with <1 x i64> vectors. llvm-svn: 83502	2009-10-07 22:57:01 +00:00
Bob Wilson	50820a2677	Add some instruction encoding bits for NEON load/store instructions. llvm-svn: 83490	2009-10-07 21:53:04 +00:00
Bob Wilson	e7ef4a9a6b	Add codegen support for NEON vst4 intrinsics with 128-bit vectors. llvm-svn: 83486	2009-10-07 20:49:18 +00:00
Bob Wilson	23464866ad	Add codegen support for NEON vst3 intrinsics with 128-bit vectors. llvm-svn: 83484	2009-10-07 20:30:08 +00:00
Bob Wilson	3dcb5377ef	Add codegen support for NEON vst2 intrinsics with 128-bit vectors. llvm-svn: 83482	2009-10-07 18:47:39 +00:00
Bob Wilson	ab3a9474d6	Add codegen support for NEON vld4 intrinsics with 128-bit vectors. llvm-svn: 83479	2009-10-07 18:09:32 +00:00
Bob Wilson	6bbefc2f67	Add codegen support for NEON vld3 intrinsics with 128-bit vectors. llvm-svn: 83471	2009-10-07 17:24:55 +00:00
Bob Wilson	e6b778d5ff	Add codegen support for NEON vld2 operations on quad registers. llvm-svn: 83422	2009-10-06 22:01:59 +00:00
Bob Wilson	d76b9b766c	Add a comment to describe letters used in multiclass name suffixes. llvm-svn: 83257	2009-10-03 04:44:16 +00:00
Bob Wilson	a9abf57409	Fix encoding problem for VMLS instruction. Thanks to Johnny Chen for pointing this out! llvm-svn: 83256	2009-10-03 04:41:21 +00:00
Evan Cheng	1b2b64f618	Add hasExtraSrcRegAllocReq and hasExtraDefRegAllocReq flags to ld / st multiple, ld / st pairs, etc. llvm-svn: 83197	2009-10-01 08:22:27 +00:00
David Goodwin	bea6848f9d	Finish scheduling itineraries for NEON. llvm-svn: 82788	2009-09-25 18:38:29 +00:00
David Goodwin	afcaf79603	Checkpoint NEON scheduling itineraries. llvm-svn: 82657	2009-09-23 21:38:08 +00:00
Anton Korobeynikov	8d0fbebb9f	Add QPR_VFP2 regclass and add copy_to_regclass nodes, where needed to constraint the register usage. llvm-svn: 81635	2009-09-12 22:21:08 +00:00
Anton Korobeynikov	7697d37777	Unbreak getOnesVector() / getZeroVector() to use valid ARM extended imm's. llvm-svn: 81262	2009-09-08 22:51:43 +00:00
Anton Korobeynikov	59e2b8e894	Add NEON 'laned' operations. This fixes another bunch of gcc testsuite fails and makes the code faster. llvm-svn: 81220	2009-09-08 15:22:32 +00:00
Anton Korobeynikov	f0da41c3e4	More missed vdup patterns llvm-svn: 80838	2009-09-02 21:21:28 +00:00
Bob Wilson	d7797754d4	Add support for generating code for vst{234}lane intrinsics. llvm-svn: 80707	2009-09-01 18:51:56 +00:00
Bob Wilson	da9817cddd	Generate code for vld{234}_lane intrinsics. llvm-svn: 80656	2009-09-01 04:26:28 +00:00
Anton Korobeynikov	3681144bd8	Add missed pattern llvm-svn: 80502	2009-08-30 19:06:39 +00:00
Anton Korobeynikov	cd41d07f29	Add missed extract_element pattern llvm-svn: 80408	2009-08-28 23:41:26 +00:00
Anton Korobeynikov	076f105d86	Forgot about actual change :) llvm-svn: 80250	2009-08-27 16:10:17 +00:00
Anton Korobeynikov	58ebae4acd	Transform float scalar_to_vector into subreg accesses. No idea whether this is profitable or not. llvm-svn: 80245	2009-08-27 14:38:44 +00:00
Bob Wilson	f1beef9f48	Remove some unused SDNode definitions. llvm-svn: 80015	2009-08-25 17:52:39 +00:00
Bob Wilson	9129376719	Expose the instruction contraint string as an argument to the NLdSt class. llvm-svn: 80011	2009-08-25 17:46:06 +00:00
Bob Wilson	ceffeb6abd	Rename ARM "lane_cst" operands to "nohash_imm" since they are used for several things other than Neon vector lane numbers. For inline assembly operands with a "c" print code, check that they really are immediates. llvm-svn: 79676	2009-08-21 21:58:55 +00:00
Anton Korobeynikov	232b19c3d5	Fix some typos and use type-based isel for VZIP/VUZP/VTRN llvm-svn: 79625	2009-08-21 12:41:42 +00:00
Anton Korobeynikov	ce3ff1be8a	Add nodes & dummy matchers for some v{zip,uzp,trn} instructions llvm-svn: 79622	2009-08-21 12:40:50 +00:00
Anton Korobeynikov	38f284f2ae	Provide vext.{16,32} llvm-svn: 79620	2009-08-21 12:40:21 +00:00
Bob Wilson	32cd8550ce	Add support for Neon VEXT (vector extract) shuffles. This is derived from a patch by Anton Korzh. I modified it to recognize the VEXT shuffles during legalization and lower them to a target-specific DAG node. llvm-svn: 79428	2009-08-19 17:03:43 +00:00
Bob Wilson	eb54d51759	Create a new ARM-specific DAG node, VDUP, to represent a splat from a scalar_to_vector. Generate these VDUP nodes during legalization instead of trying to recognize the pattern during selection. llvm-svn: 78994	2009-08-14 05:13:08 +00:00
Bob Wilson	cce31f6831	During legalization, change Neon vdup_lane operations from shuffles to target-specific VDUPLANE nodes. This allows the subreg handling for the quad-register version to be done easily with Pats in the .td file, instead of with custom code in ARMISelDAGToDAG.cpp. llvm-svn: 78993	2009-08-14 05:08:32 +00:00
Bob Wilson	ef6e602bf4	Revert r78852 for now. I want to do this differently, but I don't have time to fix it tonight. llvm-svn: 78896	2009-08-13 05:58:56 +00:00
Bob Wilson	ff2db10211	Recognize Neon VDUP shuffles during legalization instead of selection. llvm-svn: 78852	2009-08-12 22:54:19 +00:00
Bob Wilson	ea3a402ae7	Recognize Neon VREV shuffles during legalization instead of selection. llvm-svn: 78850	2009-08-12 22:31:50 +00:00
Bob Wilson	4b35448360	Generate Neon VTBL and VTBX instructions from the corresponding intrinsics. llvm-svn: 78835	2009-08-12 20:51:55 +00:00
Bob Wilson	25cae66713	Fix TableGen warnings. This partly reverts my previous change to this file, leaving the mayLoad and mayStore settings around only the load/store instructions where those can't be inferred from the patterns. llvm-svn: 78815	2009-08-12 17:04:56 +00:00
Bob Wilson	f042eadd1e	Add missing chain operands for VLD* and VST* instructions. Set "mayLoad" and "mayStore" on the load/store instructions. llvm-svn: 78761	2009-08-12 00:49:01 +00:00
Owen Anderson	9f94459d24	Split EVT into MVT and EVT, the former representing _just_ a primitive type, while the latter is capable of representing either a primitive or an extended type. llvm-svn: 78713	2009-08-11 20:47:22 +00:00
Bob Wilson	12842f9865	Use vAny type to get rid of Neon intrinsics that differed only in whether the overloaded vector types allowed floating-point or integer vector elements. Most of these operations actually depend on the element type, so bitcasting was not an option. If you include the vpadd intrinsics that I updated earlier, this gets rid of 20 intrinsics. llvm-svn: 78646	2009-08-11 05:39:44 +00:00
Bob Wilson	741a9c7bf6	Use new EVT::vAny type to combine Neon intrinsics for VPADD. llvm-svn: 78632	2009-08-11 01:15:26 +00:00
David Goodwin	b80734bb15	Fix bug in NEON convert for single-precision FP. This also fixes the tblgen warnings. llvm-svn: 78629	2009-08-11 01:07:38 +00:00
Owen Anderson	53aa7a960c	Rename MVT to EVT, in preparation for splitting SimpleValueType out into its own struct type. llvm-svn: 78610	2009-08-10 22:56:29 +00:00
David Goodwin	85b5b027f7	Use NEON for single-precision int<->FP conversions. llvm-svn: 78604	2009-08-10 22:17:39 +00:00
Anton Korobeynikov	cfed3005e5	Use subclassing to print lane-like immediates (w/o hash) eliminating 'no_hash' modifier. Hopefully this will make Daniel happy :) llvm-svn: 78514	2009-08-08 23:10:41 +00:00
Anton Korobeynikov	7167f33872	Add insert_elt / extract_elt patterns for v4f32 stuff. Did anyone tests v4f32 ever? llvm-svn: 78470	2009-08-08 14:06:07 +00:00
Anton Korobeynikov	4218516f5d	Lane number should be printed w/o hash llvm-svn: 78469	2009-08-08 14:05:53 +00:00
Anton Korobeynikov	887d05ce9b	Use VLDM / VSTM to spill/reload 128-bit Neon registers llvm-svn: 78468	2009-08-08 13:35:48 +00:00
Bob Wilson	e2231070ff	Implement Neon VZIP and VUZP instructions. These are very similar to VTRN, so I generalized the class for VTRN in the .td file to handle all 3 of them. llvm-svn: 78460	2009-08-08 06:13:25 +00:00
Bob Wilson	db46af0461	Implement Neon VTRN instructions. For now, anyway, these are selected directly from the intrinsics produced by the frontend. If it is more convenient to have a custom DAG node for using these to implement shuffles, we can add that later. llvm-svn: 78459	2009-08-08 05:53:00 +00:00
Anton Korobeynikov	d28a26dfab	Unbreak the stuff llvm-svn: 78425	2009-08-07 22:51:13 +00:00
Anton Korobeynikov	23b28cb824	2 more vdup.32 cases llvm-svn: 78419	2009-08-07 22:36:50 +00:00
Evan Cheng	4c3b1ca5a0	Fix support to use NEON for single precision fp math. llvm-svn: 78397	2009-08-07 19:30:41 +00:00

167 Commits