llvm-project

Commit Graph

Author	SHA1	Message	Date
Dan Gohman	fd3e3003f3	Whitespace cleanups. llvm-svn: 51089	2008-05-14 00:43:10 +00:00
Evan Cheng	1120279ae6	Instead of a vector load, shuffle and then extract an element. Load the element from address with an offset. pshufd $1, (%rdi), %xmm0 movd %xmm0, %eax => movl 4(%rdi), %eax llvm-svn: 51026	2008-05-13 08:35:03 +00:00
Dan Gohman	d78c400b5b	Clean up the use of static and anonymous namespaces. This turned up several things that were neither in an anonymous namespace nor static but not intended to be global. llvm-svn: 51017	2008-05-13 00:00:25 +00:00
Nate Begeman	b87e63a730	Teach Legalize how to scalarize VSETCC Teach X86 a few more vsetcc patterns. Custom lowering for unsupported ones is next. llvm-svn: 51009	2008-05-12 23:09:43 +00:00
Evan Cheng	b980f6fb3d	Xform bitconvert(build_pair(load a, load b)) to a single load if the load locations are at the right offset from each other. llvm-svn: 51008	2008-05-12 23:04:07 +00:00
Evan Cheng	2609d5e779	Refactor isConsecutiveLoad from X86 to TargetLowering so DAG combiner can make use of it. llvm-svn: 50991	2008-05-12 19:56:52 +00:00
Nate Begeman	cfcb56091b	Add support for vicmp/vfcmp codegen, more legalize support coming. This is necessary to unbreak the build. llvm-svn: 50988	2008-05-12 19:40:03 +00:00
Dan Gohman	ecb77385ab	Fix a missing break in the ISD::FLT_ROUNDS_ handling. Patch by giuma! llvm-svn: 50967	2008-05-12 16:07:15 +00:00
Anton Korobeynikov	fc2edad4ae	Turn StripPointerCast() into a method llvm-svn: 50836	2008-05-07 22:54:15 +00:00
Dan Gohman	5a3eecdfd8	Fix a bug in the ComputeMaskedBits logic for multiply. llvm-svn: 50793	2008-05-07 00:35:55 +00:00
Anton Korobeynikov	82c02b28f3	Make StripPointerCast a common function (should we mak it method of Value instead?) llvm-svn: 50775	2008-05-06 22:52:30 +00:00
Dan Gohman	6a2da37c0e	Make several variable declarations static. llvm-svn: 50696	2008-05-06 01:53:16 +00:00
Dan Gohman	38dc08f36f	Instead of enumerating each opcode that isn't handled that ComputeMaskedBits handles, just use a 'default:'. This avoids TargetLowering's list getting out of date with SelectionDAG's. llvm-svn: 50693	2008-05-06 00:53:29 +00:00
Dan Gohman	cf0e3acf16	Correct the value of LowBits in srem and urem handling in ComputeMaskedBits. llvm-svn: 50692	2008-05-06 00:51:48 +00:00
Mon P Wang	3e58393c3d	Added addition atomic instrinsics and, or, xor, min, and max. llvm-svn: 50663	2008-05-05 19:05:59 +00:00
Dan Gohman	1962c2be6a	Fix a mistake in the computation of leading zeros for udiv. llvm-svn: 50591	2008-05-02 21:30:02 +00:00
Dan Gohman	2f83b47863	Fix a typo in a comment. llvm-svn: 50562	2008-05-02 00:05:03 +00:00
Dan Gohman	ea6357828b	Use push_back(...) instead of resize(1, ...), per review feedback. llvm-svn: 50561	2008-05-02 00:03:54 +00:00
Dan Gohman	752ce50b2d	Fix uninitialized uses of the FPC variable. llvm-svn: 50558	2008-05-01 23:40:44 +00:00
Chris Lattner	d4b2a67cf3	don't randomly miscompile seto/setuo just because we are in ffastmath mode. This fixes rdar://5902801, a miscompilation of gcc.dg/builtins-8.c. Bill, please pull this into Tak. llvm-svn: 50523	2008-05-01 07:26:11 +00:00
Arnold Schwaighofer	be0de34ede	Tail call optimization improvements: Move platform independent code (lowering of possibly overwritten arguments, check for tail call optimization eligibility) from target X86ISelectionLowering.cpp to TargetLowering.h and SelectionDAGISel.cpp. Initial PowerPC tail call implementation: Support ppc32 implemented and tested (passes my tests and test-suite llvm-test). Support ppc64 implemented and half tested (passes my tests). On ppc tail call optimization is performed if caller and callee are fastcc call is a tail call (in tail call position, call followed by ret) no variable argument lists or byval arguments option -tailcallopt is enabled Supported: * non pic tail calls on linux/darwin * module-local tail calls on linux(PIC/GOT)/darwin(PIC) * inter-module tail calls on darwin(PIC) If constraints are not met a normal call will be emitted. A test checking the argument lowering behaviour on x86-64 was added. llvm-svn: 50477	2008-04-30 09:16:33 +00:00
Scott Michel	be940424b3	Fix custom target lowering for zero/any/sign_extend: make sure that DAG.UpdateNodeOperands() is called before (not after) the call to TLI.LowerOperation(). llvm-svn: 50461	2008-04-30 00:26:38 +00:00
Roman Levenstein	6b37114590	Use std::set instead of std::priority_queue for the RegReductionPriorityQueue. This removes the existing bottleneck related to the removal of elements from the middle of the queue. Also fixes a subtle bug in ScheduleDAGRRList::CapturePred: It was updating the state of the SUnit before removing it. As a result, the comparison operators were working incorrectly and this SUnit could not be removed from the queue properly. Reviewed by Evan and Dan. Approved by Dan. llvm-svn: 50412	2008-04-29 09:07:59 +00:00
Chris Lattner	5c88f7b1ad	make the vector conversion magic handle multiple results. We now compile test2/test3 to: _test2: ## InlineAsm Start set %xmm0, %xmm1 ## InlineAsm End addps %xmm1, %xmm0 ret _test3: ## InlineAsm Start set %xmm0, %xmm1 ## InlineAsm End paddd %xmm1, %xmm0 ret as expected. llvm-svn: 50389	2008-04-29 04:48:56 +00:00
Chris Lattner	f9a49c4322	add support for multiple return values in inline asm. This is a step towards PR2094. It now compiles the attached .ll file to: _sad16_sse2: movslq %ecx, %rax ## InlineAsm Start %ecx %rdx %rax %rax %r8d %rdx %rsi ## InlineAsm End ## InlineAsm Start set %eax ## InlineAsm End ret which is pretty decent for a 3 output, 4 input asm. llvm-svn: 50386	2008-04-29 04:29:54 +00:00
Evan Cheng	b96782ecbd	Fix a bug in RegsForValue::getCopyToRegs() that causes cyclical scheduling units. If it's creating multiple CopyToReg nodes that are "flagged" together, it should not create a TokenFactor for it's chain outputs: c1, f1 = CopyToReg c2, f2 = CopyToReg c3 = TokenFactor c1, c2 ... = user c3, ..., f2 Now that the two CopyToReg's and the user are "flagged" together. They effectively forms a single scheduling unit. The TokenFactor is now both an operand and a successor of the Flagged nodes. llvm-svn: 50376	2008-04-28 22:07:13 +00:00
Dan Gohman	c968c1f592	Evan pointed out that folding sext to zext may not be correct if the zext is not legal. llvm-svn: 50368	2008-04-28 18:47:17 +00:00
Dan Gohman	77ce6da378	Delete an unused constructor. llvm-svn: 50367	2008-04-28 18:28:49 +00:00
Dan Gohman	d961d30b7f	Add a comment to CreateRegForValue that clarifies the handling of aggregate types. llvm-svn: 50366	2008-04-28 18:19:43 +00:00
Dan Gohman	80c692d439	Rewrite the comments for RegsForValue and its members, and reorder some of the members for clarity. llvm-svn: 50365	2008-04-28 18:10:39 +00:00
Dan Gohman	14a05df97b	Don't call size() on each iteration of the loop. llvm-svn: 50361	2008-04-28 17:42:03 +00:00
Dan Gohman	da44054867	Fix the SVOffset values for loads and stores produced by memcpy/memset expansion. It was a bug for the SVOffset value to be used in the actual address calculations. llvm-svn: 50359	2008-04-28 17:15:20 +00:00
Dan Gohman	72ec3f4562	Teach InstCombine's ComputeMaskedBits what SelectionDAG's ComputeMaskedBits knows about cttz, ctlz, and ctpop. Teach SelectionDAG's ComputeMaskedBits what InstCombine's knows about SRem. And teach them both some things about high bits in Mul, UDiv, URem, and Sub. This allows instcombine and dagcombine to eliminate sign-extension operations in several new cases. llvm-svn: 50358	2008-04-28 17:02:21 +00:00
Dan Gohman	3eb10f758e	Teach DAGCombine to convert (sext x) to (zext x) when the sign-bit of x is known to be zero. llvm-svn: 50357	2008-04-28 16:58:24 +00:00
Chris Lattner	c9e280c78a	Another collection of random cleanups. No functionality change. llvm-svn: 50341	2008-04-28 07:16:35 +00:00
Chris Lattner	52504e78fb	Remove the SmallVector ctor that converts from a SmallVectorImpl. This conversion open the door for many nasty implicit conversion issues, and can be easily solved by initializing with (V.begin(), V.end()) when needed. This patch includes many small cleanups for sdisel also. llvm-svn: 50340	2008-04-28 06:44:42 +00:00
Chris Lattner	8c7f5ad968	switch RegsForValue::Regs to be a SmallVector to avoid heap thrash on tiny (usually single-element) vectors. llvm-svn: 50335	2008-04-28 06:02:19 +00:00
Chris Lattner	d04b818a91	move static function out of anon namespace, no functionality change. llvm-svn: 50330	2008-04-27 23:48:12 +00:00
Chris Lattner	122721843b	Another step to getting multiple result inline asm to work. llvm-svn: 50329	2008-04-27 23:44:28 +00:00
Chris Lattner	58b9ece38d	typo llvm-svn: 50316	2008-04-27 01:49:46 +00:00
Chris Lattner	2237973438	Implement a signficant optimization for inline asm: When choosing between constraints with multiple options, like "ir", test to see if we can use the 'i' constraint and go with that if possible. This produces more optimal ASM in all cases (sparing a register and an instruction to load it), and fixes inline asm like this: void test () { asm volatile (" %c0 %1 " : : "imr" (42), "imr"(14)); } Previously we would dump "42" into a memory location (which is ok for the 'm' constraint) which would cause a problem because the 'c' modifier is not valid on memory operands. Isn't it great how inline asm turns 'missed optimization' into 'compile failed'?? Incidentally, this was the todo in PowerPC/2007-04-24-InlineAsm-I-Modifier.ll Please do NOT pull this into Tak. llvm-svn: 50315	2008-04-27 00:37:18 +00:00
Chris Lattner	a937baeb9b	isa+cast -> dyn_cast llvm-svn: 50314	2008-04-27 00:16:18 +00:00
Chris Lattner	4793515a9c	Move a bunch of inline asm code out of line. llvm-svn: 50313	2008-04-27 00:09:47 +00:00
Chris Lattner	724539c001	A few inline asm cleanups: - Make targetlowering.h fit in 80 cols. - Make LowerAsmOperandForConstraint const. - Make lowerXConstraint -> LowerXConstraint - Make LowerXConstraint return a const char* instead of taking a string byref. llvm-svn: 50312	2008-04-26 23:02:14 +00:00
Dan Gohman	ca95a5f49f	Remove the code from CodeGenPrepare that moved getresult instructions to the block that defines their operands. This doesn't work in the case that the operand is an invoke, because invoke is a terminator and must be the last instruction in a block. Replace it with support in SelectionDAGISel for copying struct values into sequences of virtual registers. llvm-svn: 50279	2008-04-25 18:27:55 +00:00
Nate Begeman	6f94f61317	Pull the code to perform an INSERT_VECTOR_ELT in memory out into its own function, and then use it to fix a bug in SplitVectorOp that expected inserts to always have constant insertion indices. llvm-svn: 50273	2008-04-25 18:07:40 +00:00
Dan Gohman	e9e3891c09	Use isa instead of dyn_cast. llvm-svn: 50181	2008-04-23 20:25:16 +00:00
Dan Gohman	b418aafabf	Add support to codegen for getresult instructions with undef operands. llvm-svn: 50180	2008-04-23 20:21:29 +00:00
Dan Gohman	dc90919d2b	Fix an out-of-bounds access in -view-sunit-dags in the case of an empty ScheduleDAG. llvm-svn: 50054	2008-04-21 20:07:30 +00:00
Dale Johannesen	aac27592f0	Check we aren't trying to convert PPC long double. This fixes the testsuite failure on ppcf128-4.ll. llvm-svn: 49994	2008-04-20 18:23:46 +00:00

1 2 3 4 5 ...

2364 Commits