llvm-project

Commit Graph

Author	SHA1	Message	Date
Evan Cheng	b8a662f0d1	Remove an unused variable. llvm-svn: 120964	2010-12-05 23:03:35 +00:00
Cameron Zwarich	c7223a3e37	Some cleanup before I start committing some incremental progress on StrongPHIElimination. llvm-svn: 120961	2010-12-05 22:34:08 +00:00
Evan Cheng	62c7b5bf76	Making use of VFP / NEON floating point multiply-accumulate / subtraction is difficult on current ARM implementations for a few reasons. 1. Even though a single vmla has latency that is one cycle shorter than a pair of vmul + vadd, a RAW hazard during the first (4? on Cortex-a8) can cause additional pipeline stall. So it's frequently better to single codegen vmul + vadd. 2. A vmla folowed by a vmul, vmadd, or vsub causes the second fp instruction to stall for 4 cycles. We need to schedule them apart. 3. A vmla followed vmla is a special case. Obvious issuing back to back RAW vmla + vmla is very bad. But this isn't ideal either: vmul vadd vmla Instead, we want to expand the second vmla: vmla vmul vadd Even with the 4 cycle vmul stall, the second sequence is still 2 cycles faster. Up to now, isel simply avoid codegen'ing fp vmla / vmls. This works well enough but it isn't the optimial solution. This patch attempts to make it possible to use vmla / vmls in cases where it is profitable. A. Add missing isel predicates which cause vmla to be codegen'ed. B. Make sure the fmul in (fadd (fmul)) has a single use. We don't want to compute a fmul and a fmla. C. Add additional isel checks for vmla, avoid cases where vmla is feeding into fp instructions (except for the #3 exceptional case). D. Add ARM hazard recognizer to model the vmla / vmls hazards. E. Add a special pre-regalloc case to expand vmla / vmls when it's likely the vmla / vmls will trigger one of the special hazards. Work in progress, only A+B are enabled. llvm-svn: 120960	2010-12-05 22:04:16 +00:00
Cameron Zwarich	a3fb8cb3d4	Remove the PHIElimination.h header, as it is no longer needed. llvm-svn: 120959	2010-12-05 21:39:42 +00:00
Frits van Bommel	7cf63ace18	Clarify some of the differences between indexing with getelementptr and indexing with insertvalue/extractvalue. llvm-svn: 120957	2010-12-05 20:54:38 +00:00
Frits van Bommel	16ebe77be0	Fix PR 4170 by having ExtractValueInst::getIndexedType() reject out-of-bounds indexing. Also add asserts that the indices are valid in InsertValueInst::init(). ExtractValueInst already asserts when constructed with invalid indices. llvm-svn: 120956	2010-12-05 20:50:26 +00:00
Greg Clayton	c5f5783044	Fixed an issue where SBProcess::LoadImage(...) was not returning the image token. llvm-svn: 120954	2010-12-05 20:38:01 +00:00
Cameron Zwarich	6766c420a2	I forgot to actually remove the FindCopyInsertPoint() declaration from PHIElimination.h. llvm-svn: 120953	2010-12-05 19:58:57 +00:00
Cameron Zwarich	8d1695589c	Remove the SplitCriticalEdge() method declaration from PHIElimination.h. At one time, this method existed, but now PHIElimination uses the method of the same name on MachineBasicBlock. llvm-svn: 120952	2010-12-05 19:54:23 +00:00
Cameron Zwarich	da592a9e41	Move the FindCopyInsertPoint method of PHIElimination to a new standalone function so that it can be shared with StrongPHIElimination. llvm-svn: 120951	2010-12-05 19:51:05 +00:00
Greg Clayton	1c2f283864	Added "void SBBroadcaster::Clear ();" method to SBBroadcaster. llvm-svn: 120949	2010-12-05 19:36:39 +00:00
Greg Clayton	920c696c54	Fixed a crasher when trying to get event data flavors on events that don't have event data. llvm-svn: 120948	2010-12-05 19:21:02 +00:00
Greg Clayton	a9ff306151	Make sure that STDOUT and STDERR events in lldb_private::Process carry along a ProcessEventData so clients can get the process from these events. llvm-svn: 120947	2010-12-05 19:16:56 +00:00
Frits van Bommel	76244867cf	Refactor jump threading. Should have no functional change other than the order of two transformations that are mutually-exclusive and the exact formatting of debug output. Internally, it now stores the ConstantInts as Constants, and actual undef values instead of nulls. llvm-svn: 120946	2010-12-05 19:06:41 +00:00
Frits van Bommel	5e75ef4a8e	Remove trailing whitespace. llvm-svn: 120945	2010-12-05 19:02:47 +00:00
Frits van Bommel	8fb69ee805	Teach SimplifyCFG to turn (indirectbr (select cond, blockaddress(@fn, BlockA), blockaddress(@fn, BlockB))) into (br cond, BlockA, BlockB). llvm-svn: 120943	2010-12-05 18:29:03 +00:00
Chris Lattner	6886171792	Teach X86ISelLowering that the second result of X86ISD::UMUL is a flags result. This allows us to compile: void *test12(long count) { return new int[count]; } into: test12: movl $4, %ecx movq %rdi, %rax mulq %rcx movq $-1, %rdi cmovnoq %rax, %rdi jmp __Znam ## TAILCALL instead of: test12: movl $4, %ecx movq %rdi, %rax mulq %rcx seto %cl testb %cl, %cl movq $-1, %rdi cmoveq %rax, %rdi jmp __Znam Of course it would be even better if the regalloc inverted the cmov to 'cmovoq', which would eliminate the need for the 'movq %rdi, %rax'. llvm-svn: 120936	2010-12-05 07:49:54 +00:00
Chris Lattner	364bb0a081	it turns out that when ".with.overflow" intrinsics were added to the X86 backend that they were all implemented except umul. This one fell back to the default implementation that did a hi/lo multiply and compared the top. Fix this to check the overflow flag that the 'mul' instruction sets, so we can avoid an explicit test. Now we compile: void *func(long count) { return new int[count]; } into: __Z4funcl: ## @_Z4funcl movl $4, %ecx ## encoding: [0xb9,0x04,0x00,0x00,0x00] movq %rdi, %rax ## encoding: [0x48,0x89,0xf8] mulq %rcx ## encoding: [0x48,0xf7,0xe1] seto %cl ## encoding: [0x0f,0x90,0xc1] testb %cl, %cl ## encoding: [0x84,0xc9] movq $-1, %rdi ## encoding: [0x48,0xc7,0xc7,0xff,0xff,0xff,0xff] cmoveq %rax, %rdi ## encoding: [0x48,0x0f,0x44,0xf8] jmp __Znam ## TAILCALL instead of: __Z4funcl: ## @_Z4funcl movl $4, %ecx ## encoding: [0xb9,0x04,0x00,0x00,0x00] movq %rdi, %rax ## encoding: [0x48,0x89,0xf8] mulq %rcx ## encoding: [0x48,0xf7,0xe1] testq %rdx, %rdx ## encoding: [0x48,0x85,0xd2] movq $-1, %rdi ## encoding: [0x48,0xc7,0xc7,0xff,0xff,0xff,0xff] cmoveq %rax, %rdi ## encoding: [0x48,0x0f,0x44,0xf8] jmp __Znam ## TAILCALL Other than the silly seto+test, this is using the o bit directly, so it's going in the right direction. llvm-svn: 120935	2010-12-05 07:30:36 +00:00
Chris Lattner	183ddd8ed3	fix the rest of the linux miscompares :) llvm-svn: 120933	2010-12-05 02:08:07 +00:00
Chris Lattner	116580a11c	generalize the previous check to handle -1 on either side of the select, inserting a not to compensate. Add a missing isZero check that I lost somehow. This improves codegen of: void *func(long count) { return new int[count]; } from: __Z4funcl: ## @_Z4funcl movl $4, %ecx ## encoding: [0xb9,0x04,0x00,0x00,0x00] movq %rdi, %rax ## encoding: [0x48,0x89,0xf8] mulq %rcx ## encoding: [0x48,0xf7,0xe1] testq %rdx, %rdx ## encoding: [0x48,0x85,0xd2] movq $-1, %rdi ## encoding: [0x48,0xc7,0xc7,0xff,0xff,0xff,0xff] cmoveq %rax, %rdi ## encoding: [0x48,0x0f,0x44,0xf8] jmp __Znam ## TAILCALL ## encoding: [0xeb,A] to: __Z4funcl: ## @_Z4funcl movl $4, %ecx ## encoding: [0xb9,0x04,0x00,0x00,0x00] movq %rdi, %rax ## encoding: [0x48,0x89,0xf8] mulq %rcx ## encoding: [0x48,0xf7,0xe1] cmpq $1, %rdx ## encoding: [0x48,0x83,0xfa,0x01] sbbq %rdi, %rdi ## encoding: [0x48,0x19,0xff] notq %rdi ## encoding: [0x48,0xf7,0xd7] orq %rax, %rdi ## encoding: [0x48,0x09,0xc7] jmp __Znam ## TAILCALL ## encoding: [0xeb,A] llvm-svn: 120932	2010-12-05 02:00:51 +00:00
John McCall	a2342eb857	Fix a bug in the emission of __real/__imag l-values on scalar operands. Fix a bug in the emission of complex compound assignment l-values. Introduce a method to emit an expression whose value isn't relevant. Make that method evaluate its operand as an l-value if it is one. Fixes our volatile compliance in C++. llvm-svn: 120931	2010-12-05 02:00:02 +00:00
Chris Lattner	77a11c6174	relax this to handle linux defaulting to -static. llvm-svn: 120930	2010-12-05 01:31:13 +00:00
Chris Lattner	342e6ea5f9	Improve an integer select optimization in two ways: 1. generalize (select (x == 0), -1, 0) -> (sign_bit (x - 1)) to: (select (x == 0), -1, y) -> (sign_bit (x - 1)) \| y 2. Handle the identical pattern that happens with !=: (select (x != 0), y, -1) -> (sign_bit (x - 1)) \| y cmov is often high latency and can't fold immediates or memory operands. For example for (x == 0) ? -1 : 1, before we got: < testb %sil, %sil < movl $-1, %ecx < movl $1, %eax < cmovel %ecx, %eax now we get: > cmpb $1, %sil > sbbl %eax, %eax > orl $1, %eax llvm-svn: 120929	2010-12-05 01:23:24 +00:00
Chris Lattner	0523388d60	merge some tests into select.ll and make them more specific. llvm-svn: 120928	2010-12-05 01:13:58 +00:00
Chris Lattner	b89b6f17da	rename test llvm-svn: 120927	2010-12-05 01:02:23 +00:00
Chris Lattner	d4f8c9641a	remove two tests that aren't really testing anything. llvm-svn: 120926	2010-12-05 01:02:13 +00:00
Anders Carlsson	0febb8acdf	Put each test in class-layout.cpp into a separate namespace. llvm-svn: 120925	2010-12-05 00:08:52 +00:00
Anders Carlsson	a518b2a5a1	Add a LayoutBase member function. No functionality change. llvm-svn: 120924	2010-12-04 23:59:48 +00:00
Bill Wendling	2bce78e8fc	Initialize HasPOPCNT. llvm-svn: 120923	2010-12-04 23:57:24 +00:00
Anders Carlsson	d74cad80b0	Replace calls to AppendBytes with calls to AppendPadding when the bytes appended are padding. llvm-svn: 120922	2010-12-04 23:53:18 +00:00
Rafael Espindola	8867390cf2	Once the layout is done we don't need to keep updating which fragments are valid. Addresses will not change. llvm-svn: 120921	2010-12-04 22:47:22 +00:00
Rafael Espindola	99e026dbca	Remember the contents of leb and dwarfline fragments when relaxing. This avoids having to evaluate the expression again when writing. llvm-svn: 120920	2010-12-04 21:58:52 +00:00
Fariborz Jahanian	83e7d5a90a	Fix rewriter to match recent changes in property ref AST. llvm-svn: 120919	2010-12-04 21:22:13 +00:00
Cameron Zwarich	fbd47dcc55	Remove PHIElimination's private copy of SkipPHIsAndLabels. llvm-svn: 120918	2010-12-04 20:40:15 +00:00
Benjamin Kramer	2f489236ab	Add patterns for the x86 popcnt instruction. - Also adds a new POPCNT subtarget feature that is currently enabled if the target supports SSE4.2 (nehalem) or SSE4A (barcelona). llvm-svn: 120917	2010-12-04 20:32:23 +00:00
Bill Wendling	3336f748a5	Silence 'may be used uninitialized in this function' warnings. Static analysis may determine that they cannot be used uninitialized. But that might be a bit too much for the compiler to determine. llvm-svn: 120916	2010-12-04 20:20:34 +00:00
Howard Hinnant	75357bcd39	oops, forgot std:: llvm-svn: 120915	2010-12-04 19:56:43 +00:00
Howard Hinnant	816cb8975d	Fix up uses of new/terminate/unexpected handlers to use the new getters. llvm-svn: 120914	2010-12-04 19:54:11 +00:00
Michael J. Spencer	66a1f86f7a	Support/PathV2: Remove redundant calls to make_error_code. llvm-svn: 120913	2010-12-04 18:45:32 +00:00
Benjamin Kramer	f1a04edb42	APInt: microoptimize a few methods. llvm-svn: 120912	2010-12-04 18:05:36 +00:00
Benjamin Kramer	5d75f0bddd	Simplify APInt::getAllOnesValue. llvm-svn: 120911	2010-12-04 16:37:47 +00:00
Benjamin Kramer	31920b0a2a	Remove unneeded zero arrays. llvm-svn: 120910	2010-12-04 15:28:22 +00:00
Benjamin Kramer	6f88fcb16b	Apparently APFloat::getZero doesn't like PPCDoubleDoubles. llvm-svn: 120909	2010-12-04 14:43:08 +00:00
Francois Pichet	82f3b5f945	Disable C++ exception handling on MSVC. Total size of bin\Release on disk goes from 82.9 MB to 74.2 MB. (~10% saving) llvm-svn: 120908	2010-12-04 14:30:22 +00:00
Benjamin Kramer	8ceebfaa04	Simplify code. No functionality change. llvm-svn: 120907	2010-12-04 14:22:24 +00:00
John McCall	594827281c	Silly special case: never load when dereferencing void*. llvm-svn: 120905	2010-12-04 12:43:24 +00:00
John McCall	ca61b6567b	First pass at implementing the intent of ANSI C DR106. llvm-svn: 120904	2010-12-04 12:29:11 +00:00
John McCall	a03eddad58	dyn_cast else unreachable -> cast llvm-svn: 120902	2010-12-04 09:57:16 +00:00
Francois Pichet	916fae2a34	Disable RTTI on Windows. Total size of bin\Release on disk goes from 83.6 MB to 81.8MB. (~2% saving) llvm-svn: 120901	2010-12-04 09:42:30 +00:00
Francois Pichet	d583da04d0	More anonymous struct/union redesign. This one deals with anonymous field used in a constructor initializer list: struct X { X() : au_i1(123) {} union { int au_i1; float au_f1; }; }; clang will now deal with au_i1 explicitly as an IndirectFieldDecl. llvm-svn: 120900	2010-12-04 09:14:42 +00:00

1 2 3 4 5 ...

97016 Commits All Branches Search

97016 Commits

All Branches