llvm-project

Commit Graph

Author	SHA1	Message	Date
Chris Lattner	79c50d96c9	Mapping of physregs can make it so that the designated and input physregs are the same. In this case, don't emit a noop copy. llvm-svn: 28008	2006-04-28 04:43:18 +00:00
Chris Lattner	e63d808b6e	Fix Transforms/Reassociate/2006-04-27-ReassociateVector.ll llvm-svn: 28007	2006-04-28 04:14:49 +00:00
Chris Lattner	7abfb81e30	new testcase llvm-svn: 28006	2006-04-28 04:14:29 +00:00
Evan Cheng	f0157cb0bc	Use movaps instead of movapd for spill / restore. llvm-svn: 28005	2006-04-28 02:23:35 +00:00
Evan Cheng	51ab4498e7	Added a temporary option -no-isel-fold-inflight to control whether a "inflight" node can be folded. llvm-svn: 28003	2006-04-28 02:09:19 +00:00
Evan Cheng	54acf6eddc	When isel'ing a node, mark its operands "InFlight" before selecting them. These nodes should not be folded into other nodes. This fixes the miscompilation of PR 749. Temporarily under flag control. llvm-svn: 28002	2006-04-28 02:08:10 +00:00
Chris Lattner	84e95d00b5	When we have a two-address instruction where the input cannot be clobbered and is already available, instead of falling back to emitting a load, fall back to emitting a reg-reg copy. This generates significantly better code for some SSE testcases, as SSE has lots of two-address instructions and none of them are read/modify/write. As one example, this change does: pshufd %XMM5, XMMWORD PTR [%ESP + 84], 255 xorps %XMM2, %XMM5 cmpltps %XMM1, %XMM0 - movaps XMMWORD PTR [%ESP + 52], %XMM0 - movapd %XMM6, XMMWORD PTR [%ESP + 52] + movaps %XMM6, %XMM0 cmpltps %XMM6, XMMWORD PTR [%ESP + 68] movapd XMMWORD PTR [%ESP + 52], %XMM6 movaps %XMM6, %XMM0 cmpltps %XMM6, XMMWORD PTR [%ESP + 36] cmpltps %XMM3, %XMM0 - movaps XMMWORD PTR [%ESP + 20], %XMM0 - movapd %XMM7, XMMWORD PTR [%ESP + 20] + movaps %XMM7, %XMM0 cmpltps %XMM7, XMMWORD PTR [%ESP + 4] movapd XMMWORD PTR [%ESP + 20], %XMM7 cmpltps %XMM4, %XMM0 ... which is far better than a store followed by a load! llvm-svn: 28001	2006-04-28 01:46:50 +00:00
Evan Cheng	c4d77c46b8	Test case for PR748 llvm-svn: 28000	2006-04-28 01:21:37 +00:00
Chris Lattner	a4c2c4a276	Add a note llvm-svn: 27999	2006-04-28 00:04:05 +00:00
Chris Lattner	b209131b56	Add a note llvm-svn: 27998	2006-04-27 21:40:57 +00:00
Chris Lattner	b6cb64b7e6	Add support for inserting undef into a vector. This implements Transforms/InstCombine/vec_insert_to_shuffle.ll llvm-svn: 27997	2006-04-27 21:14:21 +00:00
Chris Lattner	51fecaa8b3	This should turn into one vector shuffle instruction. llvm-svn: 27996	2006-04-27 21:13:58 +00:00
Evan Cheng	f4f3f0d25f	Make x86 isel lowering produce tailcall nodes. They are match to normal calls for now. Patch contributed by Alexander Friedman. llvm-svn: 27994	2006-04-27 08:40:39 +00:00
Evan Cheng	ec04a37edd	A couple of new entries. llvm-svn: 27993	2006-04-27 08:31:33 +00:00
Evan Cheng	89001ad729	Support for passing 128-bit vector arguments via XMM registers. llvm-svn: 27992	2006-04-27 08:31:10 +00:00
Evan Cheng	3784f3c57c	Insert a VBIT_CONVERT between a FORMAL_ARGUMENT node and its vector uses (VAND, VADD, etc.). Legalizer will assert otherwise. llvm-svn: 27991	2006-04-27 08:29:42 +00:00
Reid Spencer	bb0be98fed	For PR747: If we fail to find a required program, simply set that program to echo out something that tells the user the situation. That is, instead of just "true runtest" we now get "echo 'Skipped: runtest not found'". llvm-svn: 27990	2006-04-27 07:49:24 +00:00
Evan Cheng	a0374e1bed	Oops llvm-svn: 27989	2006-04-27 05:44:50 +00:00
Evan Cheng	24eb3f4765	Bug fix: not updating NumIntRegs. llvm-svn: 27988	2006-04-27 05:35:28 +00:00
Chris Lattner	393d96a56c	Fix Regression/CodeGen/Generic/2006-04-26-SetCCAnd.ll and PR748. llvm-svn: 27987	2006-04-27 05:01:07 +00:00
Chris Lattner	fb1ab10337	new testcase llvm-svn: 27986	2006-04-27 05:00:43 +00:00
Evan Cheng	48940d16b2	- Clean up formal argument lowering code. Prepare for vector pass by value work. - Fixed vararg support. llvm-svn: 27985	2006-04-27 01:32:22 +00:00
Chris Lattner	f98b4aa2e7	Fix some nondeterminstic behavior in the mem2reg pass that (in addition to nondeterminism being bad) could cause some trivial missed optimizations (dead phi nodes being left around for later passes to clean up). With this, llvm-gcc4 now bootstraps and correctly compares. I don't know why I never tried to do it before... :) llvm-svn: 27984	2006-04-27 01:14:43 +00:00
Jeff Cohen	027fbc2a6a	Actually, semantical doesn't appear to be a word. llvm-svn: 27983	2006-04-26 21:03:17 +00:00
Chris Lattner	e8cbdbf314	Implement Transforms/IndVarsSimplify/complex-scev.ll, a case where we didn't recognize some simple affine IV's. llvm-svn: 27982	2006-04-26 18:34:07 +00:00
Chris Lattner	684a8546f4	new testcase llvm-svn: 27981	2006-04-26 18:32:59 +00:00
Evan Cheng	1c39903297	Fix fastcc failures. llvm-svn: 27980	2006-04-26 18:21:31 +00:00
Chris Lattner	6b764e7625	Unfortunately this really isn't the place for advertisement. llvm-svn: 27979	2006-04-26 18:10:59 +00:00
Jeff Cohen	e42f3ba44c	Fix typo. llvm-svn: 27978	2006-04-26 18:05:25 +00:00
Reid Spencer	4d3171fc41	Fix some anchors. llvm-svn: 27977	2006-04-26 15:46:53 +00:00
Reid Spencer	9aa244ed16	Add some notes about the current state of source (front end) languages so we can point to them on llvm-dev. llvm-svn: 27976	2006-04-26 14:52:19 +00:00
Evan Cheng	e0bcfbe811	Switching over FORMAL_ARGUMENTS mechanism to lower call arguments. llvm-svn: 27975	2006-04-26 01:20:17 +00:00
Evan Cheng	9618df1190	Don't forget return void. llvm-svn: 27974	2006-04-25 23:03:35 +00:00
Nate Begeman	4530327c04	Keep the stack from on darwin 16-byte aligned. This fixes many JIT failres. llvm-svn: 27973	2006-04-25 20:54:26 +00:00
Evan Cheng	a9467aab0a	Separate LowerOperation() into multiple functions, one per opcode. llvm-svn: 27972	2006-04-25 20:13:52 +00:00
Andrew Lenharth	3c775bcd86	slightly more useful error message llvm-svn: 27971	2006-04-25 19:33:41 +00:00
Andrew Lenharth	f5a713d273	better c99 struct handling llvm-svn: 27970	2006-04-25 19:33:23 +00:00
Andrew Lenharth	a3f7583408	another c99 style problem llvm-svn: 27969	2006-04-25 19:27:56 +00:00
Evan Cheng	4cc3e0b05f	Fix a typo. llvm-svn: 27968	2006-04-25 17:48:41 +00:00
Nate Begeman	48ccd3f826	Fix a warning llvm-svn: 27967	2006-04-25 17:46:32 +00:00
Nate Begeman	318bb96f9e	No functionality changes, but cleaner code with correct comments. llvm-svn: 27966	2006-04-25 04:45:59 +00:00
Evan Cheng	fb46b2bf5d	Explicitly specify result type for def : Pat<> patterns (if it produces a vector result). Otherwise tblgen will pick the default (v16i8 for 128-bit vector). llvm-svn: 27965	2006-04-25 00:50:01 +00:00
Evan Cheng	25b09295f8	Added X86 SSE2 intrinsics which can be represented as vector_shuffles. This is a temporary workaround for the 2-wide vector_shuffle problem (i.e. its mask would have type v2i32 which is not legal). llvm-svn: 27964	2006-04-24 23:34:56 +00:00
Evan Cheng	d03631ee76	Add a new entry. llvm-svn: 27963	2006-04-24 23:30:10 +00:00
Evan Cheng	347a5a7afa	Added a movq test case. llvm-svn: 27962	2006-04-24 23:03:22 +00:00
Evan Cheng	5c2bfb069e	Special case handling two wide build_vector(0, x). llvm-svn: 27961	2006-04-24 22:58:52 +00:00
Evan Cheng	63bd4d3730	Some missing movlps, movhps, movlpd, and movhpd patterns. llvm-svn: 27960	2006-04-24 21:58:20 +00:00
Evan Cheng	b0461080e4	A little bit more build_vector enhancement for v8i16 cases. llvm-svn: 27959	2006-04-24 18:01:45 +00:00
Evan Cheng	2f9b0bcbd5	Remove a completed entry. llvm-svn: 27958	2006-04-24 17:38:16 +00:00
Chris Lattner	3046d54df2	Suggest looking at the SPARC backend. How weird is that?? :) llvm-svn: 27957	2006-04-24 16:34:45 +00:00
Evan Cheng	60bb622e4d	Added addJumpTableIndex llvm-svn: 27956	2006-04-24 06:42:15 +00:00
Evan Cheng	ab0ee6340c	MakeMIInst() should handle jump table index operands. llvm-svn: 27955	2006-04-24 05:37:35 +00:00
Chris Lattner	f110527a29	Add a note llvm-svn: 27954	2006-04-23 19:47:09 +00:00
Evan Cheng	b4f31dd1a8	MOVL shuffle (i.e. movd or movss / movsd from memory) of undef, V2 == V2 llvm-svn: 27953	2006-04-23 06:35:19 +00:00
Nate Begeman	866b4b4d45	Fix the updating of the machine CFG when a PHI node was in a successor of the jump table's range check block. This re-enables 100% dense jump tables by default on PPC & x86 llvm-svn: 27952	2006-04-23 06:26:20 +00:00
Jeff Cohen	7a2a972ba9	Keep Visual Studio informed. llvm-svn: 27951	2006-04-23 04:37:08 +00:00
Nate Begeman	3e04bb482b	Code cleanup associated with jump tables, thanks to Chris for noticing these. llvm-svn: 27950	2006-04-22 23:52:35 +00:00
Nate Begeman	ecb1dafd3d	Turn of jump tables for a bit, there are still some issues to work out with updating the machine CFG. llvm-svn: 27949	2006-04-22 23:51:56 +00:00
Nate Begeman	9f0b13c885	Optimized stores to the constant pool, while cool, are unnecessary. llvm-svn: 27948	2006-04-22 22:31:45 +00:00
Nate Begeman	4ca2ea5b43	JumpTable support! What this represents is working asm and jit support for x86 and ppc for 100% dense switch statements when relocations are non-PIC. This support will be extended and enhanced in the coming days to support PIC, and less dense forms of jump tables. llvm-svn: 27947	2006-04-22 18:53:45 +00:00
Evan Cheng	e728efdfce	Don't do all the lowering stuff for 2-wide build_vector's. Also, minor optimization for shuffle of undef. llvm-svn: 27946	2006-04-22 08:34:05 +00:00
Evan Cheng	16ef94f4e8	Fix a performance regression. Use {p}shuf* when there are only two distinct elements in a build_vector. llvm-svn: 27945	2006-04-22 06:21:46 +00:00
Evan Cheng	40ff2928ed	Two more build_vector tests. llvm-svn: 27944	2006-04-22 06:19:11 +00:00
Chris Lattner	c8afdfec52	Teach the JIT how to relocate LI, this fixes the JIT on Prolangs-C/TimberWolfMC llvm-svn: 27943	2006-04-22 06:17:56 +00:00
Chris Lattner	17aae4c516	This no longer needs the JIT. llvm-svn: 27942	2006-04-22 05:04:23 +00:00
Chris Lattner	fe36eaebda	Fix JIT support for static ctors, which was apparently completely broken! This allows Prolangs-C++/city and probably a bunch of other stuff to work well with the new front-end llvm-svn: 27941	2006-04-22 05:02:46 +00:00
Evan Cheng	14215c36b6	Revamp build_vector lowering to take advantage of movss and movd instructions. movd always clear the top 96 bits and movss does so when it's loading the value from memory. The net result is codegen for 4-wide shuffles is much improved. It is near optimal if one or more elements is a zero. e.g. __m128i test(int a, int b) { return _mm_set_epi32(0, 0, b, a); } compiles to _test: movd 8(%esp), %xmm1 movd 4(%esp), %xmm0 punpckldq %xmm1, %xmm0 ret compare to gcc: _test: subl $12, %esp movd 20(%esp), %xmm0 movd 16(%esp), %xmm1 punpckldq %xmm0, %xmm1 movq %xmm1, %xmm0 movhps LC0, %xmm0 addl $12, %esp ret or icc: _test: movd 4(%esp), %xmm0 #5.10 movd 8(%esp), %xmm3 #5.10 xorl %eax, %eax #5.10 movd %eax, %xmm1 #5.10 punpckldq %xmm1, %xmm0 #5.10 movd %eax, %xmm2 #5.10 punpckldq %xmm2, %xmm3 #5.10 punpckldq %xmm3, %xmm0 #5.10 ret #5.10 There are still room for improvement, for example the FP variant of the above example: __m128 test(float a, float b) { return _mm_set_ps(0.0, 0.0, b, a); } _test: movss 8(%esp), %xmm1 movss 4(%esp), %xmm0 unpcklps %xmm1, %xmm0 xorps %xmm1, %xmm1 movlhps %xmm1, %xmm0 ret The xorps and movlhps are unnecessary. This will require post legalizer optimization to handle. llvm-svn: 27939	2006-04-21 23:03:30 +00:00
Nate Begeman	57a32f0bc1	Fix the comment llvm-svn: 27938	2006-04-21 22:11:27 +00:00
Nate Begeman	516b393992	Change the PPC JIT to use a Static relocation model llvm-svn: 27937	2006-04-21 22:04:15 +00:00
Chris Lattner	905bd17dc7	typo llvm-svn: 27936	2006-04-21 21:37:40 +00:00
Chris Lattner	3e62d4b289	fix thinko llvm-svn: 27935	2006-04-21 21:05:22 +00:00
Chris Lattner	e1f9ab7d53	add some low-prio notes llvm-svn: 27934	2006-04-21 21:03:21 +00:00
Chris Lattner	b21d3bfd1f	The BFS scheduler is apparently nondeterminstic (causes many llvmgcc bootstrap miscompares). Switch RISC targets to use the list-td scheduler, which isn't. llvm-svn: 27933	2006-04-21 17:16:16 +00:00
Evan Cheng	236d23b88e	movddup is a SSE3 instruction. llvm-svn: 27932	2006-04-21 16:42:47 +00:00
Chris Lattner	28ead23d1c	Remove a hack required by V9. llvm-svn: 27931	2006-04-21 15:33:35 +00:00
Chris Lattner	662e940f73	Fix a couple more memory issues llvm-svn: 27930	2006-04-21 15:32:26 +00:00
Reid Spencer	d835ea4c1c	Remove the extraneous --defined-only option to nm. This is the default and some versions of nm don't recognize it (its a gnu option). llvm-svn: 27928	2006-04-21 05:29:25 +00:00
Evan Cheng	9c2707ad5a	Add && to each RUN: line (except the last one). llvm-svn: 27927	2006-04-21 04:58:23 +00:00
Evan Cheng	8ef22ae8af	More build_vector tests. llvm-svn: 27926	2006-04-21 01:22:41 +00:00
Evan Cheng	5ddb6f991f	Check for llc crash. llvm-svn: 27925	2006-04-21 01:21:23 +00:00
Evan Cheng	b3864a0ce4	To be replaced with another test. llvm-svn: 27924	2006-04-21 01:12:32 +00:00
Evan Cheng	e8b5180044	Now generating perfect (I think) code for "vector set" with a single non-zero scalar value. e.g. _mm_set_epi32(0, a, 0, 0); ==> movd 4(%esp), %xmm0 pshufd $69, %xmm0, %xmm0 _mm_set_epi8(0, 0, 0, 0, 0, a, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0); ==> movzbw 4(%esp), %ax movzwl %ax, %eax pxor %xmm0, %xmm0 pinsrw $5, %eax, %xmm0 llvm-svn: 27923	2006-04-21 01:05:10 +00:00
Chris Lattner	cc47ab3305	Fix a really subtle and obnoxious memory bug that caused issues with an llvm-gcc4 boostrap. Whenever a node is deleted by the dag combiner, it must be returned by the visit function, or the dag combiner will not know that the node has been processed (and will, e.g., send it to the target dag combine xforms). llvm-svn: 27922	2006-04-20 23:55:59 +00:00
Reid Spencer	5e6f97b7c9	Don't require the "dot" program if -flat option is given, and don't produce any of the fancy graphs or other output. Just produce the flat, makefile style output on stdout. llvm-svn: 27921	2006-04-20 23:09:57 +00:00
Jeff Cohen	9c0ba827f7	Keep Visual Studio informed. llvm-svn: 27920	2006-04-20 22:19:06 +00:00
Reid Spencer	aff245875b	Configure llvm-config in tools, not utils. llvm-svn: 27919	2006-04-20 22:15:30 +00:00
Reid Spencer	0a1de79808	Remove the llvm-config directory. Its now in tools. llvm-svn: 27918	2006-04-20 21:16:32 +00:00
Reid Spencer	6826031c70	llvm-config doesn't live here anymore ==> tools llvm-svn: 27917	2006-04-20 21:15:41 +00:00
Reid Spencer	2b1041cf13	Okay, llvm-config is good to go now. llvm-svn: 27916	2006-04-20 21:14:39 +00:00
Reid Spencer	5952201b67	Several Changes To Support Building llvm-config: 1. If Perl is not available, don't run Perl dependent targets. 2. Check in FinalLibDeps.txt for build environments that can't build it because there's no Perl processor. 3. Ensure that FinalLibDeps.txt depends on LibDeps.txt so it is automatically regenerated. 4. Support objdir != srcdir building. llvm-svn: 27915	2006-04-20 21:13:58 +00:00
Reid Spencer	62b9b71ff7	Don't build llvm-config until issues are resolved. llvm-svn: 27914	2006-04-20 21:00:24 +00:00
Reid Spencer	57d891afe1	Build the llvm-config directory as a tool. llvm-svn: 27913	2006-04-20 20:53:23 +00:00
Chris Lattner	dae49df407	Fix Transforms/ScalarRepl/2006-04-20-PromoteCrash.ll llvm-svn: 27912	2006-04-20 20:48:50 +00:00
Chris Lattner	c11f4f4214	new testcase llvm-svn: 27911	2006-04-20 20:48:32 +00:00
Reid Spencer	8794317843	Update the library dependencies for 1.8 with SparcV9 now gone. llvm-svn: 27910	2006-04-20 19:50:53 +00:00
Evan Cheng	968db7ffb7	Don't know what I was thinking... llvm-svn: 27909	2006-04-20 19:47:28 +00:00
Chris Lattner	99d3da9d2c	Fix the CodeGen/PowerPC/buildvec_canonicalize.ll regression last night. llvm-svn: 27908	2006-04-20 19:01:30 +00:00
Chris Lattner	d1c3a067ee	add a note llvm-svn: 27907	2006-04-20 18:49:28 +00:00
Reid Spencer	53345be5e2	Remove Burg as a directory to be considered. llvm-svn: 27906	2006-04-20 18:47:13 +00:00
Reid Spencer	a278128a59	Remove more references to Burg. llvm-svn: 27905	2006-04-20 18:46:45 +00:00
Reid Spencer	355913aa7a	Remove reference to BURG variable. It no longer exists. llvm-svn: 27904	2006-04-20 18:44:24 +00:00
Chris Lattner	32150b90b7	remove some stuff llvm-svn: 27903	2006-04-20 18:43:59 +00:00
Reid Spencer	9597684ba7	Remove traces of Burg utility now that its gone and not needed. llvm-svn: 27902	2006-04-20 18:42:24 +00:00
Reid Spencer	06502b6135	Burg not needed any more now that SparcV9 is gone. llvm-svn: 27901	2006-04-20 18:39:19 +00:00
Chris Lattner	3e5521799c	remove some v9 specific code llvm-svn: 27900	2006-04-20 18:33:11 +00:00
Chris Lattner	dcc1f995eb	This field no longer exists llvm-svn: 27899	2006-04-20 18:32:41 +00:00
Chris Lattner	778509c844	Don't fill in fields that no longer exist. llvm-svn: 27898	2006-04-20 18:32:22 +00:00
Chris Lattner	f2a5922fa9	Remove a bunch of dead stuff, shrinkifying TargetInstrDescriptor significantly. llvm-svn: 27897	2006-04-20 18:32:02 +00:00
Chris Lattner	7d7ed24b96	Remove some obsolete interfaces llvm-svn: 27896	2006-04-20 18:17:21 +00:00
Chris Lattner	2a875285f7	Remove this obsolete file llvm-svn: 27895	2006-04-20 18:16:45 +00:00
Chris Lattner	862755b95b	Remove some of the obvious v9-specific cruft llvm-svn: 27894	2006-04-20 18:09:13 +00:00
Chris Lattner	a38c3580bd	Remove some of the obvious V9-specific cruft llvm-svn: 27893	2006-04-20 18:08:53 +00:00
Evan Cheng	73b12f2f53	Vector extract element test case. llvm-svn: 27892	2006-04-20 17:59:30 +00:00
Chris Lattner	d5737be0f0	Remove V9 jit support llvm-svn: 27891	2006-04-20 17:52:00 +00:00
Evan Cheng	aecd41384f	Vector insert test case. llvm-svn: 27890	2006-04-20 17:50:10 +00:00
Chris Lattner	ec86eace63	allow this dir to get pruned llvm-svn: 27889	2006-04-20 17:45:33 +00:00
Chris Lattner	1798687dba	Remove this target's reg tests llvm-svn: 27888	2006-04-20 17:44:51 +00:00
Chris Lattner	5197dfadf2	Fails with all sparcs llvm-svn: 27887	2006-04-20 17:43:41 +00:00
Chris Lattner	7991e85b2e	Remove V9 llvm-svn: 27886	2006-04-20 17:42:23 +00:00
Chris Lattner	ac61195539	This target is no longer built. The ,v files now live in the reoptimizer. llvm-svn: 27885	2006-04-20 17:15:44 +00:00
Chris Lattner	53f4499b22	Never link in sparcv9 llvm-svn: 27884	2006-04-20 17:07:46 +00:00
Chris Lattner	8fe3dbceb0	Never build SparcV9 llvm-svn: 27883	2006-04-20 17:01:19 +00:00
Chris Lattner	d0a3a32eae	remove a dead prototype llvm-svn: 27882	2006-04-20 15:45:54 +00:00
Andrew Lenharth	f89e630b2f	Make code match cvs commit message :) llvm-svn: 27881	2006-04-20 15:41:37 +00:00
Andrew Lenharth	61eae29ad6	If we can convert the return pointer type into an integer that IntPtrType can be converted to losslessly, we can continue the conversion to a direct call. llvm-svn: 27880	2006-04-20 14:56:47 +00:00
Andrew Lenharth	b950dbea0b	can we cast between pointers and IntPtrType llvm-svn: 27879	2006-04-20 14:54:17 +00:00
Reid Spencer	8923c09997	Add a missing =back to eliminate error. llvm-svn: 27878	2006-04-20 14:17:47 +00:00
Evan Cheng	3ee104c852	v16i8 splat with 2 punpcklbw and a single pshufd. llvm-svn: 27877	2006-04-20 09:05:16 +00:00
Evan Cheng	f2c5fe9139	Another shuffle test. For 4-wide shuffle, no more than 3 {p}shuf*. llvm-svn: 27876	2006-04-20 09:01:54 +00:00
Evan Cheng	60f0b8998e	- Added support to turn "vector clear elements", e.g. pand V, <-1, -1, 0, -1> to a vector shuffle. - VECTOR_SHUFFLE lowering change in preparation for more efficient codegen of vector shuffle with zero (or any splat) vector. llvm-svn: 27875	2006-04-20 08:58:49 +00:00
Evan Cheng	a320abc494	Turn a VAND into a VECTOR_SHUFFLE is applicable. DAG combiner can turn a VAND V, <-1, 0, -1, -1>, i.e. vector clear elements, into a vector shuffle with a zero vector. It only does so when TLI tells it the xform is profitable. llvm-svn: 27874	2006-04-20 08:56:16 +00:00
Evan Cheng	8d6c229f8c	Added a virtual method isVectorClearMaskLegal to TLI. It is similar to isShuffleMaskLegal, used to determine if it makes sense to turn a "vector clear" (e.g. pand V, <0, -1, 0, -1> to a shuffle of the vector and a zero vector. llvm-svn: 27873	2006-04-20 08:54:13 +00:00
Evan Cheng	2bd632a02a	Added a test case for , e.g. xform pand <0, 0, -1, -1> to a shuffle. llvm-svn: 27872	2006-04-20 08:51:03 +00:00
Evan Cheng	059676f77b	Added a movhlps, movlhps test case. llvm-svn: 27871	2006-04-20 08:47:47 +00:00
Chris Lattner	171db236e5	Don't hardcode in 1.5 for the website, just use 'CVS'. llvm-svn: 27870	2006-04-20 06:24:16 +00:00
Chris Lattner	a23665ee78	This is old, out of date, and isn't linked to by anything. llvm-svn: 27869	2006-04-20 06:15:48 +00:00
Chris Lattner	0cd0065c58	Make sure that the new instructions selected have the right type. This fixes CodeGen/PowerPC/2006-04-19-vmaddfp-crash.ll llvm-svn: 27868	2006-04-20 05:58:10 +00:00
Chris Lattner	4ae41a3556	New testcase for a codegen crash llvm-svn: 27867	2006-04-20 05:57:43 +00:00
Tanya Lattner	cf46848098	Changing domain name llvm-svn: 27865	2006-04-20 05:51:53 +00:00
Chris Lattner	bc1b262725	Implement folding of a bunch of binops with undef llvm-svn: 27863	2006-04-20 05:39:12 +00:00
Chris Lattner	499abb5695	Update llvmgcc4 tarball names llvm-svn: 27861	2006-04-20 05:08:23 +00:00
Tanya Lattner	3687d08f8e	Removed listing of llvm releases after 1.4, and said "1.4 and newer" llvm-svn: 27860	2006-04-20 05:05:12 +00:00
Tanya Lattner	5a423b3c4f	Made warning red. llvm-svn: 27859	2006-04-20 04:57:19 +00:00
Tanya Lattner	011f7359d6	Document is out of date.. added warning and link to llvm-config. llvm-svn: 27858	2006-04-20 04:55:50 +00:00
Tanya Lattner	1a96c7158c	Fixed up comment on xfail for llvmgcc version. llvm-svn: 27857	2006-04-20 04:47:55 +00:00
Tanya Lattner	79674968b6	Added note about being able to XFAIL based on llvmgcc version. llvm-svn: 27856	2006-04-20 04:45:59 +00:00
Tanya Lattner	3962e5965a	Removed cvs mirror comment llvm-svn: 27855	2006-04-20 04:38:16 +00:00
Tanya Lattner	fb76291234	Minor fixes for the release. llvm-svn: 27854	2006-04-20 04:35:34 +00:00
Chris Lattner	6400606664	This has been fixed! Thanks Reid. llvm-svn: 27853	2006-04-20 04:24:28 +00:00
Chris Lattner	d05559828a	Yeah that's right! llvm-svn: 27852	2006-04-20 04:22:06 +00:00
Chris Lattner	70dfe24866	Fixes from Tanya llvm-svn: 27851	2006-04-20 04:01:31 +00:00
Reid Spencer	df65ba121b	Add in missing #defines for _OpenBSD_ systems. llvm-svn: 27850	2006-04-20 00:18:39 +00:00
Evan Cheng	15c264b753	Handle v2i64 BUILD_VECTOR custom lowering correctly. v2i64 is a legal type, but i64 is not. If possible, change a i64 op to a f64 (e.g. load, constant) and then cast it back. llvm-svn: 27849	2006-04-20 00:11:39 +00:00
Reid Spencer	48b9203a40	Allow OpenBSD to be recognized as a UNIX platform. llvm-svn: 27848	2006-04-19 23:47:16 +00:00
Evan Cheng	4a1b0d3292	isSplatMask() bug: first element can be an undef. llvm-svn: 27847	2006-04-19 23:28:59 +00:00
Chris Lattner	73eb58e1a2	Simplify some code llvm-svn: 27846	2006-04-19 23:17:50 +00:00
Evan Cheng	a3caaee503	- Added support to do aribitrary 4 wide shuffle with no more than three instructions. - Fixed a commute vector_shuff bug. llvm-svn: 27845	2006-04-19 22:48:17 +00:00
Evan Cheng	6d5297dac3	Prefer {p}unpack* and movdup over {p}shuf as well. llvm-svn: 27844	2006-04-19 21:15:24 +00:00
Evan Cheng	52df74000a	Renamed AddedCost to AddedComplexity. llvm-svn: 27843	2006-04-19 20:38:28 +00:00
Evan Cheng	b416a25174	- Renamed AddedCost to AddedComplexity. - Added more movhlps and movlhps patterns. llvm-svn: 27842	2006-04-19 20:37:34 +00:00
Evan Cheng	9235d848b7	Rename AddedCost to AddedComplexity. llvm-svn: 27841	2006-04-19 20:36:09 +00:00
Evan Cheng	7855e4d032	Commute vector_shuffle to match more movlhps, movlp{s\|d} cases. llvm-svn: 27840	2006-04-19 20:35:22 +00:00
Chris Lattner	f2f4aedc6e	Final piece to get relinked .o files buildable universal on Darwin. llvm-svn: 27839	2006-04-19 18:45:29 +00:00
Chris Lattner	7d17a77d5e	Regenerate llvm-svn: 27838	2006-04-19 18:38:19 +00:00
Chris Lattner	b3305fb203	When on darwin, compiler_flags need to be percolated down to the 'gcc -r' command line so that relinked .o files can be built universal. llvm-svn: 27837	2006-04-19 18:34:41 +00:00
Evan Cheng	cc7abc6c38	More mov{h\|l}p{d\|s} patterns. llvm-svn: 27836	2006-04-19 18:20:17 +00:00
Evan Cheng	aeb09ccdd3	- More mov{h\|l}ps patterns. - Increase cost (complexity) of patterns which match mov{h\|l}ps ops. These are preferred over shufps in most cases. llvm-svn: 27835	2006-04-19 18:11:52 +00:00
Evan Cheng	aa3325e925	Allow "let AddedCost = n in" to increase pattern complexity. llvm-svn: 27834	2006-04-19 18:07:24 +00:00
Chris Lattner	7b96902d35	Alpha too! llvm-svn: 27833	2006-04-19 17:20:48 +00:00
Chris Lattner	05bbec5020	add a note llvm-svn: 27832	2006-04-19 16:22:38 +00:00
Andrew Lenharth	02f9df3b7b	Another simple case type merge case to try llvm-svn: 27831	2006-04-19 15:34:34 +00:00
Andrew Lenharth	edf349aba6	deal with memchr llvm-svn: 27830	2006-04-19 15:34:02 +00:00
Andrew Lenharth	7f2cee3d3e	friendlier error message llvm-svn: 27829	2006-04-19 15:33:35 +00:00
Chris Lattner	a922a516b0	add a note llvm-svn: 27828	2006-04-19 05:55:06 +00:00
Chris Lattner	bfab82817a	Add a note. llvm-svn: 27827	2006-04-19 05:53:27 +00:00
Chris Lattner	263fc2a642	grammaro llvm-svn: 27826	2006-04-19 04:21:57 +00:00
Chris Lattner	cb2170fb37	Fix a bug owen noticed llvm-svn: 27825	2006-04-19 04:21:16 +00:00
Chris Lattner	c3e92b56a3	Change wording llvm-svn: 27824	2006-04-19 04:12:01 +00:00
Chris Lattner	e9e46a746d	add a note llvm-svn: 27823	2006-04-19 04:05:21 +00:00
Chris Lattner	0b7fd71e1c	add some more notes llvm-svn: 27822	2006-04-19 04:02:47 +00:00
Andrew Lenharth	7c8be502e9	stupid stuff llvm-svn: 27821	2006-04-19 03:45:25 +00:00
Andrew Lenharth	2bdd6fe9ef	fix printing call graphs llvm-svn: 27820	2006-04-18 23:45:19 +00:00
Andrew Lenharth	3e642d012a	I understand now. Shoot. llvm-svn: 27819	2006-04-18 22:36:11 +00:00
Evan Cheng	3823aa1d0f	- PEXTRW cannot take a memory location as its first source operand. - PINSRWrmi encoding bug. llvm-svn: 27818	2006-04-18 21:59:43 +00:00
Evan Cheng	43f4ef4ffb	SHUFP{S\|D}, PSHUF* encoding bugs. Left out the mask immediate operand. llvm-svn: 27817	2006-04-18 21:56:36 +00:00
Evan Cheng	a179ea631d	Name change for clarity sake llvm-svn: 27816	2006-04-18 21:55:35 +00:00
Evan Cheng	09e36ef710	Encoding bug: CMPPSrmi, CMPPDrmi dropped operand 2 (condtion immediate). llvm-svn: 27815	2006-04-18 21:31:08 +00:00
Evan Cheng	d799d680f4	Name change for clarity sake llvm-svn: 27814	2006-04-18 21:29:50 +00:00
Evan Cheng	0ee281f37c	Left a pattern out llvm-svn: 27813	2006-04-18 21:29:08 +00:00
Andrew Lenharth	f70cb84083	llvm.memc* improvements. helps PA a lot in some specmarks llvm-svn: 27812	2006-04-18 20:59:52 +00:00
Andrew Lenharth	49e188d7f7	llvm.memc* improvements. helps PA a lot in some specmarks llvm-svn: 27811	2006-04-18 19:54:11 +00:00
Chris Lattner	34c901b50e	These are correctly encoded by the JIT. I checked :) llvm-svn: 27810	2006-04-18 19:03:38 +00:00
Chris Lattner	197d762232	add a note llvm-svn: 27809	2006-04-18 18:30:19 +00:00
Chris Lattner	518834c67e	Fix a crash on: void foo2(vector float A, vector float B) { vector float C = (vector float)vec_cmpeq(A, B); if (!vec_any_eq(A, B)) B = (vector float){0,0,0,0}; A = C; } llvm-svn: 27808	2006-04-18 18:28:22 +00:00
Evan Cheng	e2d25a1a50	Fixed an encoding bug: movd from XMM to R32. llvm-svn: 27807	2006-04-18 18:19:00 +00:00
Chris Lattner	1e174c87c3	pretty print node name llvm-svn: 27806	2006-04-18 18:05:58 +00:00
Chris Lattner	9754d142a4	Implement an important entry from README_ALTIVEC: If an altivec predicate compare is used immediately by a branch, don't use a (serializing) MFCR instruction to read the CR6 register, which requires a compare to get it back to CR's. Instead, just branch on CR6 directly. :) For example, for: void foo2(vector float A, vector float B) { if (!vec_any_eq(A, B)) *B = (vector float){0,0,0,0}; } We now generate: _foo2: mfspr r2, 256 oris r5, r2, 12288 mtspr 256, r5 lvx v2, 0, r4 lvx v3, 0, r3 vcmpeqfp. v2, v3, v2 bne cr6, LBB1_2 ; UnifiedReturnBlock LBB1_1: ; cond_true vxor v2, v2, v2 stvx v2, 0, r4 mtspr 256, r2 blr LBB1_2: ; UnifiedReturnBlock mtspr 256, r2 blr instead of: _foo2: mfspr r2, 256 oris r5, r2, 12288 mtspr 256, r5 lvx v2, 0, r4 lvx v3, 0, r3 vcmpeqfp. v2, v3, v2 mfcr r3, 2 rlwinm r3, r3, 27, 31, 31 cmpwi cr0, r3, 0 beq cr0, LBB1_2 ; UnifiedReturnBlock LBB1_1: ; cond_true vxor v2, v2, v2 stvx v2, 0, r4 mtspr 256, r2 blr LBB1_2: ; UnifiedReturnBlock mtspr 256, r2 blr This implements CodeGen/PowerPC/vec_br_cmp.ll. llvm-svn: 27804	2006-04-18 17:59:36 +00:00
Chris Lattner	11a9ac51e8	new testcase llvm-svn: 27803	2006-04-18 17:56:30 +00:00
Chris Lattner	68c16a201e	move some stuff around, clean things up llvm-svn: 27802	2006-04-18 17:52:36 +00:00
Chris Lattner	bfc2c68386	Teach the codegen about instructions used for SSE spill code, allowing it to optimize cases where it has to spill a lot llvm-svn: 27801	2006-04-18 16:44:51 +00:00
Nate Begeman	f776fc2c98	Fix a copy & paste error from long ago. llvm-svn: 27800	2006-04-18 16:03:18 +00:00
Chris Lattner	89e761c19d	Add some more notes, many still missing llvm-svn: 27799	2006-04-18 06:32:08 +00:00
Reid Spencer	b687ce80cd	Have the AutoRegen.sh script prompt the user for the LLVM src and obj directories if it can't find them. Then, replace those values into the configure.ac script and pass them to the LLVM_CONFIG_PROJECT so that the values become the default for llvm_src and llvm_obj variables. In this way the user is required to input this exactly once, and the scripts take it from there. llvm-svn: 27798	2006-04-18 06:27:47 +00:00
Reid Spencer	c81081ab5e	Make it possible to default the llvm_src and llvm_obj variables based on the arguments to the macro. This better supports the AutoRegen.sh script in projects/sample/autoconf. llvm-svn: 27797	2006-04-18 06:25:37 +00:00
Chris Lattner	9f87173df3	add a bunch of stuff, pieces still missing llvm-svn: 27796	2006-04-18 06:18:36 +00:00
Chris Lattner	9232c8c1c5	Add a warning. llvm-svn: 27795	2006-04-18 05:31:20 +00:00
Chris Lattner	3af67456dd	Add a warning llvm-svn: 27794	2006-04-18 05:26:10 +00:00
Chris Lattner	96d50487c9	Use vmladduhm to do v8i16 multiplies which is faster and simpler than doing even/odd halves. Thanks to Nate telling me what's what. llvm-svn: 27793	2006-04-18 04:28:57 +00:00
Chris Lattner	d6d82aa889	Implement v16i8 multiply with this code: vmuloub v5, v3, v2 vmuleub v2, v3, v2 vperm v2, v2, v5, v4 This implements CodeGen/PowerPC/vec_mul.ll. With this, v16i8 multiplies are 6.79x faster than before. Overall, UnitTests/Vector/multiplies.c is now 2.45x faster with LLVM than with GCC. Remove the 'integer multiplies' todo from the README file. llvm-svn: 27792	2006-04-18 03:57:35 +00:00
Chris Lattner	48786e4887	Add tests for v8i16 and v16i8 llvm-svn: 27791	2006-04-18 03:54:50 +00:00
Evan Cheng	4d36a36900	Correct comments llvm-svn: 27790	2006-04-18 03:45:01 +00:00
Chris Lattner	7e439874cb	Lower v8i16 multiply into this code: li r5, lo16(LCPI1_0) lis r6, ha16(LCPI1_0) lvx v4, r6, r5 vmulouh v5, v3, v2 vmuleuh v2, v3, v2 vperm v2, v2, v5, v4 where v4 is: LCPI1_0: ; <16 x ubyte> .byte 2 .byte 3 .byte 18 .byte 19 .byte 6 .byte 7 .byte 22 .byte 23 .byte 10 .byte 11 .byte 26 .byte 27 .byte 14 .byte 15 .byte 30 .byte 31 This is 5.07x faster on the G5 (measured) than lowering to scalar code + loads/stores. llvm-svn: 27789	2006-04-18 03:43:48 +00:00
Chris Lattner	a2cae1bb10	Custom lower v4i32 multiplies into a cute sequence, instead of having legalize scalarize the sequence into 4 mullw's and a bunch of load/store traffic. This speeds up v4i32 multiplies 4.1x (measured) on a G5. This implements PowerPC/vec_mul.ll llvm-svn: 27788	2006-04-18 03:24:30 +00:00
Chris Lattner	2dea154035	new testcase llvm-svn: 27787	2006-04-18 03:22:16 +00:00
Evan Cheng	0ef233509b	Another entry llvm-svn: 27786	2006-04-18 01:22:57 +00:00
Chris Lattner	3db2056315	Fix a build failure on Vladimir's tester. llvm-svn: 27785	2006-04-18 00:21:25 +00:00
Evan Cheng	e008bd3d27	Another entry. llvm-svn: 27784	2006-04-18 00:21:01 +00:00
Evan Cheng	5421206c4b	Use movss to insert_vector_elt(v, s, 0). llvm-svn: 27782	2006-04-17 22:45:49 +00:00
Chris Lattner	36dd7c98d1	Turn x86 unaligned load/store intrinsics into aligned load/store instructions if the pointer is known aligned. llvm-svn: 27781	2006-04-17 22:26:56 +00:00
Chris Lattner	916ae0775e	Fix handling of calls in functions that use vectors. This fixes a crash on the code in GCC PR26546. llvm-svn: 27780	2006-04-17 22:10:08 +00:00
Evan Cheng	6e5e205841	Use two pinsrw to insert an element into v4i32 / v4f32 vector. llvm-svn: 27779	2006-04-17 22:04:06 +00:00
Chris Lattner	63a5cdc423	remove done item llvm-svn: 27778	2006-04-17 21:52:03 +00:00
Chris Lattner	6bd68ae81e	Don't diddle VRSAVE if no registers need to be added/removed from it. This allows us to codegen functions as: _test_rol: vspltisw v2, -12 vrlw v2, v2, v2 blr instead of: _test_rol: mfvrsave r2, 256 mr r3, r2 mtvrsave r3 vspltisw v2, -12 vrlw v2, v2, v2 mtvrsave r2 blr Testcase here: CodeGen/PowerPC/vec_vrsave.ll llvm-svn: 27777	2006-04-17 21:48:13 +00:00
Chris Lattner	efe2b3f2fc	New testcase, shouldn't touch vrsave llvm-svn: 27776	2006-04-17 21:48:03 +00:00
Chris Lattner	bec79b4a59	Add a MachineInstr::eraseFromParent convenience method. llvm-svn: 27775	2006-04-17 21:35:41 +00:00
Chris Lattner	9fcad09b1b	Add some convenience methods. llvm-svn: 27774	2006-04-17 21:35:08 +00:00
Evan Cheng	22c06f054b	Encoding bug llvm-svn: 27773	2006-04-17 21:33:57 +00:00
Chris Lattner	72d7c27069	Vectors that are known live-in and live-out are clearly already marked in the vrsave register for the caller. This allows us to codegen a function as: _test_rol: mfspr r2, 256 mr r3, r2 mtspr 256, r3 vspltisw v2, -12 vrlw v2, v2, v2 mtspr 256, r2 blr instead of: _test_rol: mfspr r2, 256 oris r3, r2, 40960 mtspr 256, r3 vspltisw v0, -12 vrlw v2, v0, v0 mtspr 256, r2 blr llvm-svn: 27772	2006-04-17 21:22:06 +00:00
Chris Lattner	14c4972b6d	Prefer to allocate V2-V5 before V0,V1. This lets us generate code like this: vspltisw v2, -12 vrlw v2, v2, v2 instead of: vspltisw v0, -12 vrlw v2, v0, v0 when a function is returning a value. llvm-svn: 27771	2006-04-17 21:19:12 +00:00
Chris Lattner	6df094b4ab	Move some knowledge about registers out of the code emitter into the register info. llvm-svn: 27770	2006-04-17 21:07:20 +00:00
Chris Lattner	0f28d48da2	Use a small table instead of macros to do this conversion. llvm-svn: 27769	2006-04-17 20:59:25 +00:00
Evan Cheng	5022b3426e	Implement v8i16, v16i8 splat using unpckl + pshufd. llvm-svn: 27768	2006-04-17 20:43:08 +00:00
Chris Lattner	c070c621ac	implement returns of a vector, testcase here: CodeGen/X86/vec_return.ll llvm-svn: 27767	2006-04-17 20:32:50 +00:00
Chris Lattner	e757ae6534	New testcase llvm-svn: 27766	2006-04-17 20:32:27 +00:00
Chris Lattner	326870b40b	Codegen insertelement with constant insertion points as scalar_to_vector and a shuffle. For this: void %test2(<4 x float>* %F, float %f) { %tmp = load <4 x float>* %F ; <<4 x float>> [#uses=2] %tmp3 = add <4 x float> %tmp, %tmp ; <<4 x float>> [#uses=1] %tmp2 = insertelement <4 x float> %tmp3, float %f, uint 2 ; <<4 x float>> [#uses=2] %tmp6 = add <4 x float> %tmp2, %tmp2 ; <<4 x float>> [#uses=1] store <4 x float> %tmp6, <4 x float>* %F ret void } we now get this on X86 (which will get better): _test2: movl 4(%esp), %eax movaps (%eax), %xmm0 addps %xmm0, %xmm0 movaps %xmm0, %xmm1 shufps $3, %xmm1, %xmm1 movaps %xmm0, %xmm2 shufps $1, %xmm2, %xmm2 unpcklps %xmm1, %xmm2 movss 8(%esp), %xmm1 unpcklps %xmm1, %xmm0 unpcklps %xmm2, %xmm0 addps %xmm0, %xmm0 movaps %xmm0, (%eax) ret instead of: _test2: subl $28, %esp movl 32(%esp), %eax movaps (%eax), %xmm0 addps %xmm0, %xmm0 movaps %xmm0, (%esp) movss 36(%esp), %xmm0 movss %xmm0, 8(%esp) movaps (%esp), %xmm0 addps %xmm0, %xmm0 movaps %xmm0, (%eax) addl $28, %esp ret llvm-svn: 27765	2006-04-17 19:21:01 +00:00
Chris Lattner	e54133cfba	Make sure to check splats of every constant we can, handle splat(31) by being a bit more clever, add support for odd splats from -31 to -17. llvm-svn: 27764	2006-04-17 18:09:22 +00:00
Evan Cheng	bf0d13c54f	Incorrect foldMemoryOperand entries llvm-svn: 27763	2006-04-17 18:06:12 +00:00
Evan Cheng	5112b5c544	Errors in patterns preventing load folding llvm-svn: 27762	2006-04-17 18:05:01 +00:00
Jeff Cohen	e3955a05e4	Add checks for __OpenBSD__. llvm-svn: 27761	2006-04-17 17:55:41 +00:00
Chris Lattner	264c908e3a	Teach the ppc backend to use rol and vsldoi to generate splatted constants. This implements vec_constants.ll:test_vsldoi and test_rol llvm-svn: 27760	2006-04-17 17:55:10 +00:00
Chris Lattner	8cdba16d5e	Some more cases that can be generated with two instructions llvm-svn: 27759	2006-04-17 17:54:18 +00:00
Chris Lattner	26fb8d9393	add a note llvm-svn: 27758	2006-04-17 17:29:41 +00:00
Evan Cheng	b3b41c4f3d	FP SETOLT, SETOLT, SETUGE, SETUGT conditions were implemented incorrectly llvm-svn: 27755	2006-04-17 07:24:10 +00:00
Chris Lattner	1b3806ace5	Make some code more general, adding support for constant formation of several new patterns. llvm-svn: 27754	2006-04-17 06:58:41 +00:00
Chris Lattner	9a3859b339	New testcases llvm-svn: 27753	2006-04-17 06:58:16 +00:00
Chris Lattner	f8dd76df5b	Learn how to make odd splatted constants in range [17,29]. This implements PowerPC/vec_constants.ll:test_29. llvm-svn: 27752	2006-04-17 06:07:44 +00:00
Chris Lattner	02440a996b	new testcase llvm-svn: 27751	2006-04-17 06:06:50 +00:00
Chris Lattner	2a099c04c1	Pull some code out into a helper function. Effeciently codegen even splats in the range [-32,30]. This allows us to codegen <30,30,30,30> as: vspltisw v0, 15 vadduwm v2, v0, v0 instead of as a cp load. llvm-svn: 27750	2006-04-17 06:00:21 +00:00
Chris Lattner	31b7d89e66	New testcase llvm-svn: 27749	2006-04-17 05:58:22 +00:00
Chris Lattner	071ad01ceb	Implement a TODO: for any shuffle that can be viewed as a v4[if]32 shuffle, if it can be implemented in 3 or fewer discrete altivec instructions, codegen it as such. This implements Regression/CodeGen/PowerPC/vec_perf_shuffle.ll llvm-svn: 27748	2006-04-17 05:28:54 +00:00
Chris Lattner	6e98b49b54	new testcase, these shuffles can be implemented with discrete instructions, and shouldn't be lowered to vperm. llvm-svn: 27747	2006-04-17 05:27:31 +00:00
Chris Lattner	85bfa3c2bc	Regenerate with adjusted costs llvm-svn: 27746	2006-04-17 05:26:20 +00:00
Chris Lattner	e2e2cc5b28	Encode a cost of zero as a cost of 1. llvm-svn: 27745	2006-04-17 05:25:16 +00:00
Chris Lattner	aac2a200cd	Regenerate with correct offset llvm-svn: 27744	2006-04-17 05:08:46 +00:00
Chris Lattner	3dcfef6310	Really, I can count! llvm-svn: 27743	2006-04-17 05:05:52 +00:00
Chris Lattner	311b1a6e23	Increase the opcodes by one each to disambiguate COPY from VMRGHW. llvm-svn: 27742	2006-04-17 00:47:48 +00:00
Chris Lattner	895dba9714	assign stable opcodes to the various altivec ops. llvm-svn: 27741	2006-04-17 00:47:18 +00:00
Chris Lattner	77a3ca6357	PPCPerfectShuffle.h is autogenerated, don't include it in the LOC counts. llvm-svn: 27740	2006-04-17 00:46:09 +00:00
Chris Lattner	07a3d01a91	Check in a table, generated by llvm-PerfectShuffle, of optimal shuffles of various 4-element vectors. llvm-svn: 27739	2006-04-17 00:37:02 +00:00
Chris Lattner	ac66b4a11d	Rename BuildShuffleTable -> PerfectShuffle llvm-svn: 27738	2006-04-17 00:35:34 +00:00
Chris Lattner	156c85ea42	rename the table llvm-svn: 27737	2006-04-17 00:33:35 +00:00
Chris Lattner	2951955da9	Initial checking of a perfect shuffle generation program for 4-element Altivec vectors. llvm-svn: 27736	2006-04-17 00:30:41 +00:00
Evan Cheng	c6f665f5f3	Better way to splat v2f64 llvm-svn: 27735	2006-04-16 18:16:43 +00:00
Evan Cheng	20712deecb	movduprm, movshduprm bugs llvm-svn: 27734	2006-04-16 18:11:28 +00:00
Evan Cheng	3064f9aaa6	Encoding bugs llvm-svn: 27733	2006-04-16 07:02:22 +00:00
Evan Cheng	685ddd8152	Can't fold loads into alias vector SSE ops used for scalar operation. The load address has to be 16-byte aligned but the values aren't spilled to 128-bit locations. llvm-svn: 27732	2006-04-16 06:58:19 +00:00
Chris Lattner	06a21ba96b	Implement a TODO: have the legalizer canonicalize a bunch of operations to one type (v4i32) so that we don't have to write patterns for each type, and so that more CSE opportunities are exposed. llvm-svn: 27731	2006-04-16 01:37:57 +00:00
Chris Lattner	91226e5799	Add support for promoting stores from one legal type to another, allowing us to write one pattern for vector stores instead of 4. llvm-svn: 27730	2006-04-16 01:36:45 +00:00
Chris Lattner	fa5aa396c2	Make the BUILD_VECTOR lowering code much more aggressive w.r.t constant vectors. Remove some done items from the todo list. llvm-svn: 27729	2006-04-16 01:01:29 +00:00
Chris Lattner	9095186deb	Fix a bug in the 'shuffle(undef,x,mask) -> shuffle(x, undef,mask')' xform Make the insert/extract elt -> shuffle code more aggressive. This fixes CodeGen/PowerPC/vec_shuffle.ll llvm-svn: 27728	2006-04-16 00:51:47 +00:00
Chris Lattner	34cebe785d	Canonicalize shuffle(undef,x,mask) -> shuffle(x, undef,mask'). llvm-svn: 27727	2006-04-16 00:03:56 +00:00
Chris Lattner	24acbe46c0	Fix a crash when faced with a shuffle vector that has an undef in its mask. llvm-svn: 27726	2006-04-15 23:48:05 +00:00
Chris Lattner	873202fabd	Add patterns for matching vnots with bit converted inputs. Most of these will go away when I start using evan's binop type canonicalizer llvm-svn: 27725	2006-04-15 23:45:24 +00:00
Chris Lattner	41df12ff4c	Add a new vnot_conv predicate for matching vnot's where the allones vector is bitconverted from some other type. llvm-svn: 27724	2006-04-15 23:39:14 +00:00
Chris Lattner	7e7ad593cc	Make these predicates return true for bit_convert(buildvector)'s as well as buildvectors. llvm-svn: 27723	2006-04-15 23:38:00 +00:00
Evan Cheng	8f1d801389	More encoding bugs llvm-svn: 27722	2006-04-15 06:10:09 +00:00
Evan Cheng	91944e8699	pslldrm, psrawrm, etc. encoding bug llvm-svn: 27721	2006-04-15 05:59:08 +00:00
Evan Cheng	1220b31a31	hsubp{s\|d} encoding bug llvm-svn: 27720	2006-04-15 05:52:42 +00:00
Evan Cheng	6222cf2a36	Silly bug llvm-svn: 27719	2006-04-15 05:37:34 +00:00
Evan Cheng	65bb720a8b	Do not use movs{h\|l}dup for a shuffle with a single non-undef node. llvm-svn: 27718	2006-04-15 03:13:24 +00:00
Chris Lattner	39fac448d6	significant cleanups to code that uses insert/extractelt heavily. This builds maximal shuffles out of them where possible. llvm-svn: 27717	2006-04-15 01:39:45 +00:00
Evan Cheng	0ba896c75b	Added SSE (and other) entries to foldMemoryOperand(). llvm-svn: 27716	2006-04-14 23:33:27 +00:00
Evan Cheng	00a5b3d9d3	Some clean up llvm-svn: 27715	2006-04-14 23:32:40 +00:00
Chris Lattner	559c8ba466	Allow undef in a shuffle mask llvm-svn: 27714	2006-04-14 23:19:08 +00:00
Chris Lattner	0875d94567	Move these ctors out of line llvm-svn: 27713	2006-04-14 22:20:32 +00:00
Chris Lattner	5876571afc	These instructions always return a packed vector. Improve the class definitions to expose this fact. llvm-svn: 27712	2006-04-14 22:20:07 +00:00
Evan Cheng	5d247f81c1	Last few SSE3 intrinsics. llvm-svn: 27711	2006-04-14 21:59:03 +00:00
Chris Lattner	3323ce165d	Teach scalarrepl to promote unions of vectors and floats, producing insert/extractelement operations. This implements Transforms/ScalarRepl/vector_promote.ll llvm-svn: 27710	2006-04-14 21:42:41 +00:00
Chris Lattner	1c0ef4385d	New testcase, checking to see we can turn this code: void test(vector float F, float f) { vector float G = F + F; ((float)&G) = f; F = G + G; } void test2(vector float F, float f) { vector float G = F + F; ((float)&G)[2] = f; F = G + G; } void test3(vector float F, float f) { vector float G = F + F; f = ((float)&G)[2]; } void test4(vector float F, float f) { vector float G = F + F; f = ((float)&G); } into insert/extract element operations with no memory traffic. llvm-svn: 27709	2006-04-14 21:41:54 +00:00
Tanya Lattner	b138bbe85b	Adding back vector instructions to keep in mainline. llvm-svn: 27704	2006-04-14 19:24:33 +00:00
Tanya Lattner	8333c7d736	Removing unimplemented vector instructions from language referrence. llvm-svn: 27703	2006-04-14 19:07:42 +00:00
Reid Spencer	83ee73fdb8	Correct the Superclasses list for GlobalVariable and Function to indicate that they are "Constant" as they derive from GlobalValue. Also, fix some of the wording where it mentions this. Patch inspired by Nai Xia. llvm-svn: 27701	2006-04-14 14:11:48 +00:00
Reid Spencer	9a20587c27	Allow an option, -with-externals, to be specified that provides the location of the external tests. This turns into --with-externals option to configure. llvm-svn: 27700	2006-04-14 13:53:56 +00:00
Evan Cheng	3bd605397b	Misc. SSE2 intrinsics: clflush, lfench, mfence llvm-svn: 27699	2006-04-14 07:43:12 +00:00
Evan Cheng	e349d01acf	We were not adjusting the frame size to ensure proper alignment when alloca / vla are present in the function. This causes a crash when a leaf function allocates space on the stack used to store / load with 128-bit SSE instructions. llvm-svn: 27698	2006-04-14 07:26:43 +00:00
Evan Cheng	8d76f3922b	New entry llvm-svn: 27697	2006-04-14 07:24:04 +00:00
Reid Spencer	ef56d92d6c	Don't print out the install command for Intrinsics.gen unless VERBOSE mode. llvm-svn: 27696	2006-04-14 06:32:31 +00:00
Chris Lattner	086e986e94	Make this assertion better llvm-svn: 27695	2006-04-14 06:08:35 +00:00
Chris Lattner	174bccb676	Force a specific config, because this test fails in certain configs otherwise. llvm-svn: 27694	2006-04-14 06:06:51 +00:00
Chris Lattner	4211ca9108	Move the rest of the PPCTargetLowering::LowerOperation cases out into separate functions, for simplicity and code clarity. llvm-svn: 27693	2006-04-14 06:01:58 +00:00

... 4 5 6 7 8 ...

24641 Commits