llvm-project

Commit Graph

Author	SHA1	Message	Date
Anton Korobeynikov	887d05ce9b	Use VLDM / VSTM to spill/reload 128-bit Neon registers llvm-svn: 78468	2009-08-08 13:35:48 +00:00
Bob Wilson	e2231070ff	Implement Neon VZIP and VUZP instructions. These are very similar to VTRN, so I generalized the class for VTRN in the .td file to handle all 3 of them. llvm-svn: 78460	2009-08-08 06:13:25 +00:00
Bob Wilson	db46af0461	Implement Neon VTRN instructions. For now, anyway, these are selected directly from the intrinsics produced by the frontend. If it is more convenient to have a custom DAG node for using these to implement shuffles, we can add that later. llvm-svn: 78459	2009-08-08 05:53:00 +00:00
Evan Cheng	1be453b462	Add a skeleton Thumb2 instruction size reduction pass. llvm-svn: 78456	2009-08-08 03:21:23 +00:00
Evan Cheng	2aa91cc2be	Code refactoring. No functionality change. llvm-svn: 78455	2009-08-08 03:20:32 +00:00
Evan Cheng	274fcbe43e	tADDhirr should target GPR, not tGPR. llvm-svn: 78454	2009-08-08 03:19:44 +00:00
Evan Cheng	4dc201eb64	I can type. llvm-svn: 78453	2009-08-08 02:54:37 +00:00
Chris Lattner	b94284b5e2	make printInstruction return void since its result is omitted. Make the error condition get trapped with an assert. llvm-svn: 78449	2009-08-08 01:32:19 +00:00
David Goodwin	742db6a6d4	Make NEON single-precision FP support the default for cortex-a8 (again). llvm-svn: 78430	2009-08-07 23:32:33 +00:00
Anton Korobeynikov	d28a26dfab	Unbreak the stuff llvm-svn: 78425	2009-08-07 22:51:13 +00:00
Anton Korobeynikov	23b28cb824	2 more vdup.32 cases llvm-svn: 78419	2009-08-07 22:36:50 +00:00
Evan Cheng	fb93be2b6f	A big oops. Thumb1 default CC is a def of CPSR, not a use of CPSR. llvm-svn: 78418	2009-08-07 22:36:37 +00:00
Evan Cheng	6e130db3b7	Thumb2 32-bit ldm / stm needs .w suffix if submode is ia. llvm-svn: 78410	2009-08-07 21:19:10 +00:00
Evan Cheng	b64ec07ea6	This is done. llvm-svn: 78399	2009-08-07 19:34:52 +00:00
Evan Cheng	f0237b1aa6	Use 16-bit tMOVgpr2gpr instead of tMOVr to copy GPR registers in Thumb2 mode. llvm-svn: 78398	2009-08-07 19:34:35 +00:00
Evan Cheng	4c3b1ca5a0	Fix support to use NEON for single precision fp math. llvm-svn: 78397	2009-08-07 19:30:41 +00:00
Evan Cheng	82ff022ed2	Error out, rather than infinite looping, if constant island pass can't converge. llvm-svn: 78377	2009-08-07 07:35:21 +00:00
Evan Cheng	317bd7aab2	tBfar is bl, which clobbers LR. llvm-svn: 78370	2009-08-07 05:45:07 +00:00
Dan Gohman	a6d0afcb74	Fix a bunch of namespace pollution. llvm-svn: 78363	2009-08-07 01:32:21 +00:00
Evan Cheng	b972e5633f	It turns out most of the thumb2 instructions are not allowed to touch SP. The semantics of such instructions are unpredictable. We have just been lucky that tests have been passing. This patch takes pain to ensure all the PEI lowering code does the right thing when lowering frame indices, insert code to manipulate stack pointers, etc. It's also custom lowering dynamic stack alloc into pseudo instructions so we can insert the right instructions at scheduling time. This fixes PR4659 and PR4682. llvm-svn: 78361	2009-08-07 00:34:42 +00:00
Bob Wilson	0127031c20	Implement Neon VST[234] operations. llvm-svn: 78330	2009-08-06 18:47:44 +00:00
David Goodwin	b062c236c5	Add parameter to pattern classes to enable an itinerary to be specified for instructions. For now just use the existing itineraries or NoItinerary. llvm-svn: 78321	2009-08-06 16:52:47 +00:00
Bob Wilson	488db94e7b	Neon does not actually have VLD{234}.64 instructions. These operations will have to be synthesized from other instructions. llvm-svn: 78263	2009-08-06 00:24:27 +00:00
Bob Wilson	e148ceaf65	Add a new pre-allocation pass to assign adjacent registers for Neon instructions that have that constraint. This is currently just assigning a fixed set of registers, and it only handles VLDn for n=2,3,4 with DPR registers. I'm going to expand it to handle more operations next; we can make it smarter once everything is working correctly. llvm-svn: 78256	2009-08-05 23:12:45 +00:00
David Goodwin	e5b5d8fbb3	When using NEON for single-precision FP, the NEON result must be placed in D0-D15 as these are the only D registers with S subregs. Introduce a new regclass to represent D0-D15 and use it in the NEON single-precision FP patterns. llvm-svn: 78244	2009-08-05 21:02:22 +00:00
Anton Korobeynikov	ef98dbe3de	Remove redundand checks: the only way to have, e.g. f32 RegVT is exactly hardfloat case. llvm-svn: 78237	2009-08-05 20:15:19 +00:00
Anton Korobeynikov	ef42862ef5	Unbreak the stuff, this is ugly, but we cannot do better for now with 'plain' C calling conv. llvm-svn: 78232	2009-08-05 19:40:16 +00:00
Anton Korobeynikov	22ef75155e	Missed pieces for ARM HardFP ABI. Patch by Sandeep Patel! llvm-svn: 78225	2009-08-05 19:04:42 +00:00
Daniel Dunbar	4cc1feff4f	Remove some dead code. llvm-svn: 78219	2009-08-05 18:12:37 +00:00
Bob Wilson	9ede773c4e	Remove a redundant declaration. llvm-svn: 78216	2009-08-05 17:39:44 +00:00
David Goodwin	21788bef7c	Disable NEON single-precision FP support for Cortex-A8, for now... llvm-svn: 78209	2009-08-05 16:40:57 +00:00
Devang Patel	44c4417812	Remove dead code. MDNode and MDString are not Constant anymore. llvm-svn: 78207	2009-08-05 16:40:02 +00:00
David Goodwin	a307edbdd5	By default, for cortex-a8 use NEON for single-precision FP. llvm-svn: 78200	2009-08-05 16:01:19 +00:00
Evan Cheng	e219be7346	80 col violations. llvm-svn: 78175	2009-08-05 06:41:25 +00:00
Bob Wilson	85f60cc5a8	Oops. I didn't mean to commit this piece yet. llvm-svn: 78146	2009-08-05 02:47:13 +00:00
Dan Gohman	f9bbcd1afd	Major calling convention code refactoring. Instead of awkwardly encoding calling-convention information with ISD::CALL, ISD::FORMAL_ARGUMENTS, ISD::RET, and ISD::ARG_FLAGS nodes, TargetLowering provides three virtual functions for targets to override: LowerFormalArguments, LowerCall, and LowerRet, which replace the custom lowering done on the special nodes. They provide the same information, but in a more immediately usable format. This also reworks much of the target-independent tail call logic. The decision of whether or not to perform a tail call is now cleanly split between target-independent portions, and the target dependent portion in IsEligibleForTailCallOptimization. This also synchronizes all in-tree targets, to help enable future refactoring and feature work. llvm-svn: 78142	2009-08-05 01:29:28 +00:00
Dan Gohman	c6b5e8a5c5	Don't flush the raw_ostream between each MachineFunction. These flush calls were originally put in place because errs() at one time was not unbuffered, and these print routines are commonly used with errs() for debugging. However, errs() is now properly unbuffered, so the flush calls are no longer needed. This significantly reduces the number of write(2) calls for regular asm printing when there are many small functions. llvm-svn: 78137	2009-08-05 00:49:25 +00:00
Bob Wilson	20f79e321e	Change DAG nodes for Neon VLD2/3/4 operations to return multiple results. Get rid of yesterday's code to fix the register usage during isel. Select the new DAG nodes to machine instructions. The new pre-alloc pass to choose adjacent registers for these results is not done, so the results of this will generally not assemble yet. llvm-svn: 78136	2009-08-05 00:49:09 +00:00
Evan Cheng	7cc6aca1e6	Fix part 1 of pr4682. PICADD is a 16-bit instruction even in thumb2 mode. llvm-svn: 78126	2009-08-04 23:47:55 +00:00
Bob Wilson	a8720101b5	Replace dregsingle operand modifier with explicit escaped curly brackets. For other VLDn and VSTn operations, we need to list the multiple registers explicitly anyway, so there's no point in special-casing this one usage. llvm-svn: 78109	2009-08-04 21:39:33 +00:00
Evan Cheng	783b65b546	Enable load / store multiple pass for Thumb2. It's not using ldrd / strd yet. llvm-svn: 78104	2009-08-04 21:12:13 +00:00
David Goodwin	30bf625ac2	Add NEON single-precision FP support for fabs and fneg. llvm-svn: 78101	2009-08-04 20:39:05 +00:00
Evan Cheng	a3abe2a7ce	In thumb mode, r7 is used as frame register. This fixes pr4681. llvm-svn: 78086	2009-08-04 18:46:17 +00:00
David Goodwin	a3839bc6c0	Match common pattern for FNMAC. Add NEON SP support. llvm-svn: 78085	2009-08-04 18:44:29 +00:00
David Goodwin	3b9c52c5c1	Initial support for single-precision FP using NEON. Added "neonfp" attribute to enable. Added patterns for some binary FP operations. llvm-svn: 78081	2009-08-04 17:53:06 +00:00
Anton Korobeynikov	d0a53d380a	Ooops, I was too fast to commit the wrong fix :( llvm-svn: 78060	2009-08-04 11:18:31 +00:00
Anton Korobeynikov	3c5b68e2a7	Fix a typo - this unbreaks llvm-gcc build on arm llvm-svn: 78059	2009-08-04 11:12:51 +00:00
Evan Cheng	3870fbb561	Thumb2 does not have ib (increment before) and da (decrement after) forms of ldm / stm. llvm-svn: 78057	2009-08-04 08:34:18 +00:00
Evan Cheng	f43cf709cb	Remove ARM specific getInlineAsmLength. We'll rely on the simpler (and faster) generic algorithm for now. If more accurate computation is needed, we'll rely on the disassembler. llvm-svn: 78032	2009-08-04 01:56:09 +00:00
Evan Cheng	71756e789b	Load / store multiple pass fixes for Thumb2. Not enabled yet. llvm-svn: 78031	2009-08-04 01:43:45 +00:00

1 2 3 4 5 ...

1372 Commits