llvm-project

Commit Graph

Author	SHA1	Message	Date
Ulrich Weigand	136ac22eaa	PowerPC: Use RegisterOperand instead of RegisterClass operands In the default PowerPC assembler syntax, registers are specified simply by number, so they cannot be distinguished from immediate values (without looking at the opcode). This means that the default operand matching logic for the asm parser does not work, and we need to specify custom matchers. Since those can only be specified with RegisterOperand classes and not directly on the RegisterClass, all instructions patterns used by the asm parser need to use a RegisterOperand (instead of a RegisterClass) for all their register operands. This patch adds one RegisterOperand for each RegisterClass, using the same name as the class, just in lower case, and updates all instruction patterns to use RegisterOperand instead of RegisterClass operands. llvm-svn: 180611	2013-04-26 16:53:15 +00:00
Ulrich Weigand	551b085d55	PowerPC: Fix encoding of vsubcuw and vsum4sbs instructions When testing the asm parser, I noticed wrong encodings for the above instructions (wrong sub-opcodes). Tests will be added together with the asm parser. llvm-svn: 180608	2013-04-26 15:39:57 +00:00
Hal Finkel	0c6d21933a	PPC: Add a FIXME regarding the non-working fma+fneg Altivec pattern llvm-svn: 178658	2013-04-03 14:40:16 +00:00
Ulrich Weigand	084ff8e891	More direct types in PowerPC AltiVec intrinsics. This patch follows up on work done by Bill Schmidt in r178277, and replaces most of the remaining uses of VRRC in ISEL DAG patterns. The resulting .inc files are identical except for comments, so no change in code generation is expected. llvm-svn: 178656	2013-04-03 14:08:13 +00:00
Hal Finkel	2e10331057	Use PPC reciprocal estimates with Newton iteration in fast-math mode When unsafe FP math operations are enabled, we can use the fre[s] and frsqrte[s] instructions, which generate reciprocal (sqrt) estimates, together with some Newton iteration, in order to quickly generate floating-point division and sqrt results. All of these instructions are separately optional, and so each has its own feature flag (except for the Altivec instructions, which are covered under the existing Altivec flag). Doing this is not only faster than using the IEEE-compliant fdiv/fsqrt instructions, but allows these computations to be pipelined with other computations in order to hide their overall latency. I've also added a couple of missing fnmsub patterns which turned out to be missing (but are necessary for good code generation of the Newton iterations). Altivec needs a similar fix, but that will probably be more complicated because fneg is expanded for Altivec's v4f32. llvm-svn: 178617	2013-04-03 04:01:11 +00:00
Bill Schmidt	74b2e72ab3	Use direct types in most PowerPC Altivec instructions and patterns. This follows up Ulrich Weigand's work in PPCInstrInfo.td and PPCInstr64Bit.td by doing the corresponding work for most of the Altivec patterns. I have not been able to do anything for the following classes of instructions: (1) Vector logicals. These don't have corresponding intrinsics and don't have a single obvious vector type. So far as I can tell I need to leave these as VRRC. Affected instructions are: VAND, VANDC, VNOR, VOR, VXOR, V_SET0. (2) Instructions that make use of vector shuffle. The selection code promotes all shuffles to v16i8, so any pattern that matches on a shuffle is constrained. I haven't found any way to make the patterns match on their natural types, so I plan to leave these as VRRC. Affected instructions are: VMRG*, VSPLTB, VSPLTH, VSPLTW, VPKUHUM, VPKUWUM. No change in behavior is anticipated. llvm-svn: 178277	2013-03-28 19:27:24 +00:00
Ulrich Weigand	bbfb0c55c8	PowerPC: Mark patterns as isCodeGenOnly. There remain a number of patterns that cannot (and should not) be handled by the asm parser, in particular all the Pseudo patterns. This commit marks those patterns as isCodeGenOnly. No change in generated code. llvm-svn: 178008	2013-03-26 10:57:16 +00:00
Hal Finkel	b0fac42987	Protect PPC Altivec patterns with a predicate In preparation for the addition of other SIMD ISA extensions (such as QPX) we need to make sure that all Altivec patterns are properly predicated on having Altivec support. No functionality change intended (one test case needed to be updated b/c it assumed that Altivec intrinsics would be supported without enabling Altivec support). llvm-svn: 177152	2013-03-15 13:21:21 +00:00
Adhemerval Zanella	812410f2d1	This patch fixes the Altivec addend construction for the fused multiply-add instruction (vmaddfp) to conform with IEEE to ensure the sign of a zero result when resulting product is -0.0. The -0.0 vector addend to vmaddfp is generated by a creating a vector with full bits sets and then shifting each elements by 31-bits to the left, resulting in a vector of 0x80000000 (or -0.0 as float). The 'buildvec_canonicalize.ll' was adjusted to reflect this change and the 'vec_mul.ll' was complemented with the float vector multiplication test. llvm-svn: 168998	2012-11-30 13:05:44 +00:00
Adhemerval Zanella	bdface5699	PowerPC: Lowering floor intrinsic for Altivec This patch lowers the llvm.floor, llvm.ceil, llvm.trunc, and llvm.nearbyint to Altivec instruction when using 4 single-precision float vectors. llvm-svn: 168086	2012-11-15 20:56:03 +00:00
Adhemerval Zanella	5c6e08435e	Add floating-point to and from integer conversion This patch add altivec support for v4i32 to v4f32 and for v4f32 to v4i32 vector rounding conversion. llvm-svn: 165409	2012-10-08 17:27:24 +00:00
Hal Finkel	0a479ae7d1	Convert the PPC backend to use the new FMA infrastructure. The existing contraction patterns are replaced with fma/fneg. Overall functionality should be the same. llvm-svn: 158955	2012-06-22 00:49:52 +00:00
Hal Finkel	59607e63cb	Split the LdStGeneral PPC itin. class into LdStLoad and LdStStore. Loads and stores can have different pipeline behavior, especially on embedded chips. This change allows those differences to be expressed. Except for the 440 scheduler, there are no functionality changes. On the 440, the latency adjustment is only by one cycle, and so this probably does not affect much. Nevertheless, it will make a larger difference in the future and this removes a FIXME from the 440 itin. llvm-svn: 153821	2012-04-01 04:44:16 +00:00
Jia Liu	b22310fda6	Emacs-tag and some comment fix for all ARM, CellSPU, Hexagon, MBlaze, MSP430, PPC, PTX, Sparc, X86, XCore. llvm-svn: 150878	2012-02-18 12:03:15 +00:00
Chris Lattner	1c85e3476d	fix up vnot matching, eliminating a dead pattern, correcting a couple of patterns that would never match because of bitcast, and eliminating use of vnot_conv. llvm-svn: 99753	2010-03-28 08:00:23 +00:00
Chris Lattner	dac58bd094	Fix a bunch of ambiguous patterns which tblgen happens to infer types for, due to a bug. llvm-svn: 97953	2010-03-08 18:44:04 +00:00
Eli Friedman	be1bb0f8b1	PR3628: Add patterns to match SHL/SRL/SRA to the corresponding Altivec instructions. llvm-svn: 73009	2009-06-07 01:07:55 +00:00
Nate Begeman	8d6d4b9289	2nd attempt, fixing SSE4.1 issues and implementing feedback from duncan. PR2957 ISD::VECTOR_SHUFFLE now stores an array of integers representing the shuffle mask internal to the node, rather than taking a BUILD_VECTOR of ConstantSDNodes as the shuffle mask. A value of -1 represents UNDEF. In addition to eliminating the creation of illegal BUILD_VECTORS just to represent shuffle masks, we are better about canonicalizing the shuffle mask, resulting in substantially better code for some classes of shuffles. llvm-svn: 70225	2009-04-27 18:41:29 +00:00
Rafael Espindola	b93db668b3	Revert 69952. Causes testsuite failures on linux x86-64. llvm-svn: 69967	2009-04-24 12:40:33 +00:00
Nate Begeman	bb881d66f4	PR2957 ISD::VECTOR_SHUFFLE now stores an array of integers representing the shuffle mask internal to the node, rather than taking a BUILD_VECTOR of ConstantSDNodes as the shuffle mask. A value of -1 represents UNDEF. In addition to eliminating the creation of illegal BUILD_VECTORS just to represent shuffle masks, we are better about canonicalizing the shuffle mask, resulting in substantially better code for some classes of shuffles. A clean up of x86 shuffle code, and some canonicalizing in DAGCombiner is next. llvm-svn: 69952	2009-04-24 03:42:54 +00:00
Dan Gohman	69cc2cbbff	Rename isSimpleLoad to canFoldAsLoad, to better reflect its meaning. llvm-svn: 60487	2008-12-03 18:15:48 +00:00
Gabor Greif	f304a7aa4d	erect abstraction boundaries for accessing SDValue members, rename Val -> Node to reflect semantics llvm-svn: 55504	2008-08-28 21:40:38 +00:00
Evan Cheng	0e7b00d79f	Replace all target specific implicit def instructions with a target independent one: TargetInstrInfo::IMPLICIT_DEF. llvm-svn: 48380	2008-03-15 00:03:38 +00:00
Chris Lattner	a4ce4f6987	rename isLoad -> isSimpleLoad due to evan's desire to have such a predicate. llvm-svn: 45667	2008-01-06 23:38:27 +00:00
Chris Lattner	e20f380fbf	remove some isStore flags that are now inferred automatically. llvm-svn: 45652	2008-01-06 05:53:26 +00:00
Chris Lattner	f3ebc3f3d2	Remove attribution from file headers, per discussion on llvmdev. llvm-svn: 45418	2007-12-29 20:36:04 +00:00
Bill Wendling	b9bf812ba5	Add the 64-bit versions of the DS* Altivec instructions. llvm-svn: 41717	2007-09-05 04:05:20 +00:00
Dale Johannesen	f5124b36e4	Fix arguments for some Altivec instructions. From SWB. llvm-svn: 40957	2007-08-09 00:49:19 +00:00
Dale Johannesen	4e7ff3593c	Fix spelling of mtvscr and mfvscr. llvm-svn: 40908	2007-08-07 23:08:00 +00:00
Evan Cheng	581d2795dc	Vector fneg must be expanded into fsub -0.0, X. llvm-svn: 40586	2007-07-30 07:51:22 +00:00
Evan Cheng	ac1591be42	No more noResults. llvm-svn: 40132	2007-07-21 00:34:19 +00:00
Evan Cheng	94b5a80b93	Change instruction description to split OperandList into OutOperandList and InOperandList. This gives one piece of important information: # of results produced by an instruction. An example of the change: def ADD32rr : I<0x01, MRMDestReg, (ops GR32:$dst, GR32:$src1, GR32:$src2), "add{l} {$src2, $dst\|$dst, $src2}", [(set GR32:$dst, (add GR32:$src1, GR32:$src2))]>; => def ADD32rr : I<0x01, MRMDestReg, (outs GR32:$dst), (ins GR32:$src1, GR32:$src2), "add{l} {$src2, $dst\|$dst, $src2}", [(set GR32:$dst, (add GR32:$src1, GR32:$src2))]>; llvm-svn: 40033	2007-07-19 01:14:50 +00:00
Chris Lattner	8091f28d1a	fix incorrect encoding of vminsw. llvm-svn: 34351	2007-02-16 21:20:09 +00:00
Chris Lattner	b00b6c2e86	Make the implicit def instructions look like other instrs. llvm-svn: 29174	2006-07-18 16:33:26 +00:00
Chris Lattner	868a75bec6	Remove some now-unneeded casts from instruction patterns. With the casts removed, tblgen produces identical output to with them in. llvm-svn: 28867	2006-06-20 00:39:56 +00:00
Chris Lattner	99d3da9d2c	Fix the CodeGen/PowerPC/buildvec_canonicalize.ll regression last night. llvm-svn: 27908	2006-04-20 19:01:30 +00:00
Chris Lattner	0cd0065c58	Make sure that the new instructions selected have the right type. This fixes CodeGen/PowerPC/2006-04-19-vmaddfp-crash.ll llvm-svn: 27868	2006-04-20 05:58:10 +00:00
Chris Lattner	06a21ba96b	Implement a TODO: have the legalizer canonicalize a bunch of operations to one type (v4i32) so that we don't have to write patterns for each type, and so that more CSE opportunities are exposed. llvm-svn: 27731	2006-04-16 01:37:57 +00:00
Chris Lattner	873202fabd	Add patterns for matching vnots with bit converted inputs. Most of these will go away when I start using evan's binop type canonicalizer llvm-svn: 27725	2006-04-15 23:45:24 +00:00
Chris Lattner	74cf9ff761	Rename get_VSPLI_elt -> get_VSPLTI_elt Canonicalize BUILD_VECTOR's that match VSPLTI's into a single type for each form, eliminating a bunch of Pat patterns in the .td file and allowing us to CSE stuff more aggressively. This implements PowerPC/buildvec_canonicalize.ll:VSPLTI llvm-svn: 27614	2006-04-12 17:37:20 +00:00
Chris Lattner	e318a7574e	Ensure that zero vectors are always v4i32, which forces them to CSE with each other. This implements CodeGen/PowerPC/vxor-canonicalize.ll llvm-svn: 27609	2006-04-12 16:53:28 +00:00
Chris Lattner	d71a1f946d	Change the interface to the predicate that determines if vsplti* can be used. No functionality changes. llvm-svn: 27536	2006-04-08 06:46:53 +00:00
Chris Lattner	a4bbfaed5c	Match vpku[hw]um(x,x). Convert vsldoi(x,x) to work the same way other (x,x) cases work. llvm-svn: 27467	2006-04-06 22:28:36 +00:00
Chris Lattner	f38e033270	Add support for matching vmrg(x,x) patterns llvm-svn: 27463	2006-04-06 22:02:42 +00:00
Chris Lattner	d1dcb52093	Pattern match vmrg* instructions, which are now lowered by the CFE into shuffles. llvm-svn: 27457	2006-04-06 21:11:54 +00:00
Chris Lattner	1d33819194	Support pattern matching vsldoi(x,y) and vsldoi(x,x), which allows the f.e. to lower it and LLVM to have one fewer intrinsic. This implements CodeGen/PowerPC/vec_shuffle.ll llvm-svn: 27450	2006-04-06 18:26:28 +00:00
Chris Lattner	e8b83b4206	Compile the vpkuhum/vpkuwum intrinsics into vpkuhum/vpkuwum instead of into vperm with a perm mask lvx'd from the constant pool. llvm-svn: 27448	2006-04-06 17:23:16 +00:00
Chris Lattner	c94d932447	Add all of the data stream intrinsics and instructions. woo llvm-svn: 27442	2006-04-05 22:27:14 +00:00
Chris Lattner	39dc64c955	Fix a typo llvm-svn: 27440	2006-04-05 20:15:25 +00:00
Chris Lattner	2f8e2b2895	add vsl llvm-svn: 27425	2006-04-05 01:16:22 +00:00

1 2

88 Commits