llvm-project

Commit Graph

Author	SHA1	Message	Date
Nick Lewycky	b4d9f7a9b3	Simplify trunc(extend(x)) in SCEVs, just for completeness. Also fix some odd whitespace in the same file. llvm-svn: 69870	2009-04-23 05:15:08 +00:00
Dan Gohman	14efb90fcf	Add support for printing MO_ExternalSymbol operands in memory operand tuples. This doesn't ever come up in normal code however. llvm-svn: 69848	2009-04-23 00:57:37 +00:00
Evan Cheng	d8174d3d09	Make sure both operands of binary instructions have the same type. llvm-svn: 69844	2009-04-22 23:39:28 +00:00
Evan Cheng	59ca33053b	A few more places where the check of use_empty is needed. llvm-svn: 69842	2009-04-22 23:09:16 +00:00
Evan Cheng	cbfe9df096	Avoid dereferencing use_begin() if value does not have a use. llvm-svn: 69836	2009-04-22 22:45:37 +00:00
Devang Patel	51c3619018	Fix cut-n-pasto. llvm-svn: 69816	2009-04-22 18:51:05 +00:00
Dan Gohman	79af854f55	Simplify trivial cast-of-cast SCEVs. llvm-svn: 69809	2009-04-22 16:20:48 +00:00
Dan Gohman	54fd2869a3	SCEVExpander's InsertCastOfTo knows how to move existing cast instructions in order to avoid inserting new ones. However, if the cast instruction is the SCEVExpander's InsertPt, this causes subsequently emitted instructions to be inserted near the cast, and not at the location of the original insert point. Fix this by adjusting the insert point in such cases. This fixes PR4009. llvm-svn: 69808	2009-04-22 16:11:16 +00:00
Dan Gohman	9dd9a45e12	Use BasicBlock::iterator instead of Instruction* for insert points, to better handle inserting instructions at the end of a block. llvm-svn: 69807	2009-04-22 16:05:50 +00:00
Sanjiv Gupta	107b2818ce	Make the function begin label start after the data pointer. The address of the data frame for a function can be obtained by subtracting 2 from the function begin label. llvm-svn: 69801	2009-04-22 12:02:36 +00:00
Owen Anderson	6cbf5bb9bb	Real fix for PR3549, by using caching for predecessor counts in addition to the predecessors themselves. This halves the time to optimize the testcase, beyond what my previous patch did. llvm-svn: 69792	2009-04-22 08:50:12 +00:00
Owen Anderson	bb754826c9	Use PredIteratorCache in LCSSA, which gives a 37% overall speedup on the testcase from PR3549. More improvements to come. llvm-svn: 69788	2009-04-22 08:09:13 +00:00
Chris Lattner	58be2d4413	use predicate instead of hand-rolled loop llvm-svn: 69752	2009-04-21 23:37:18 +00:00
Dan Gohman	c8e236278e	De-pImpl-ify ScalarEvolution. The pImpl pattern doesn't provide much practical benefit in the case of ScalarEvolution, and it's otherwise a nuisance. llvm-svn: 69749	2009-04-21 23:15:49 +00:00
Evan Cheng	1a99a5f501	It has finally happened. Spiller is now using live interval info. This fixes a very subtle bug. vr defined by an implicit_def is allowed overlap with any register since it doesn't actually modify anything. However, if it's used as a two-address use, its live range can be extended and it can be spilled. The spiller must take care not to emit a reload for the vn number that's defined by the implicit_def. This is both a correctness and performance issue. llvm-svn: 69743	2009-04-21 22:46:52 +00:00
Dan Gohman	e14efcc9f4	When turning (ashr(shl(x, n), n)) into sext(trunc(x)), the width of the type to truncate to should be the number of bits of the value that are preserved, not the number that are clobbered with sign-extension. This fixes regressions in ldecod. llvm-svn: 69704	2009-04-21 20:18:36 +00:00
Chris Lattner	69223bb7f5	fix a crash on a pointless but valid zero-length memset, rdar://6808691 llvm-svn: 69680	2009-04-21 16:52:12 +00:00
Anton Korobeynikov	f7d0c1a5e6	Silence warnings. Patch by Jay Foad! llvm-svn: 69679	2009-04-21 16:04:56 +00:00
Anton Korobeynikov	5a4b52dc0c	Drop obsolete reference to __eprintf. Patch by Jay Foad! llvm-svn: 69678	2009-04-21 16:04:41 +00:00
Duncan Sands	7ce5cc6bd1	Get rid of what looks like a copy-and-pasted typo. Spotted by gcc-4.5. llvm-svn: 69673	2009-04-21 09:44:39 +00:00
Rafael Espindola	47ed1f5293	TLS_addr64 and TLS_addr32 define RDI and EAX. They don't use them. This fixes PR4002. llvm-svn: 69672	2009-04-21 08:22:09 +00:00
Sanjiv Gupta	3e3ef7c4d9	Handle direct aggregate type arguments. llvm-svn: 69665	2009-04-21 05:54:51 +00:00
Dan Gohman	0ec0537403	Teach ScalarEvolution how to recognize zext-inreg and sext-inreg, as they appear in LLVM IR. This isn't particularly interesting on its own; this is just setting up some infrastructure. llvm-svn: 69655	2009-04-21 02:26:00 +00:00
Dan Gohman	c4938497b3	This FIXME is fixed, now that SCEV understands pointers. llvm-svn: 69651	2009-04-21 01:41:18 +00:00
Dan Gohman	4860db61be	Factor out a common base class from SCEVTruncateExpr, SCEVZeroExtendExpr, and SCEVSignExtendExpr. llvm-svn: 69649	2009-04-21 01:25:57 +00:00
Dan Gohman	49e062fcc1	Use getAnalysisToUpdate for TargetData, per PR760. llvm-svn: 69645	2009-04-21 01:11:19 +00:00
Dan Gohman	b397e1a7a2	Introduce encapsulation for ScalarEvolution's TargetData object, and refactor the code to minimize dependencies on TargetData. llvm-svn: 69644	2009-04-21 01:07:12 +00:00
Dan Gohman	413e91f440	Move some assertion checks so they can do more complete checking. llvm-svn: 69643	2009-04-21 00:55:22 +00:00
Dan Gohman	e20f824565	Convert ScalarEvolution to use raw_ostream instead of OStream. llvm-svn: 69640	2009-04-21 00:47:46 +00:00
Devang Patel	7f413cb05f	Fix Visual Studio 2008 build failure. Patch by Marius Wachtler llvm-svn: 69637	2009-04-21 00:08:56 +00:00
Dan Gohman	1addf64735	Make X86's copyRegToReg able to handle copies to and from subclasses. This makes the extra copyRegToReg calls in ScheduleDAGSDNodesEmit.cpp unnecessary. Derived from a patch by Jakob Stoklund Olesen. llvm-svn: 69635	2009-04-20 22:54:34 +00:00
Dan Gohman	e014b69919	Simplify this code. getConstant knows how to make broadcasted vector constants. llvm-svn: 69634	2009-04-20 22:51:43 +00:00
Dale Johannesen	1238220473	Adjust loop size estimate for full unrolling; GEP's don't usually become instructions. llvm-svn: 69631	2009-04-20 22:19:33 +00:00
Daniel Dunbar	3222b9be04	Make Unix.h:MakeErrMsg separate the prefix and errno string, so we get: clang: error: unable to make temporary file: /etc/cc: can't make unique filename: Permission denied instead of clang: error: unable to make temporary file: /etc/cc: can't make unique filenamePermission denied for example. Also, audited the uses of MakeErrMsg to make the prefix strings consistent (not end with newline/punctuation/space/": "). llvm-svn: 69626	2009-04-20 20:50:13 +00:00
Daniel Dunbar	2c441c6afe	Remove unused variable. llvm-svn: 69624	2009-04-20 20:34:38 +00:00
Bob Wilson	f8b85477ae	Move duplicated AddLiveIn function from X86 and ARM backends to be a method in the MachineFunction class, renaming it to addLiveIn for consistency with the same method in MachineBasicBlock. Thanks to Anton for suggesting this. llvm-svn: 69615	2009-04-20 18:36:57 +00:00
Devang Patel	958d5eb032	Match C backend only if it is explicitly requested. llvm-svn: 69613	2009-04-20 18:07:22 +00:00
Bob Wilson	da188ebbbd	Revise my previous change 68996 as suggested by Duncan. llvm-svn: 69607	2009-04-20 17:27:09 +00:00
Evan Cheng	5dd2e29b67	- Remove an arbitrary spill weight tweak that should not have been there. - Find more reloads from SS. llvm-svn: 69606	2009-04-20 17:23:48 +00:00
Sanjiv Gupta	0fcc019d36	Emit the auto variables of a function into a different section than parameters. llvm-svn: 69605	2009-04-20 16:59:35 +00:00
Dan Gohman	f7db87f2dc	It's not necessary for PrintModulePass to flush the output streams now that errs() is properly non-buffered. llvm-svn: 69602	2009-04-20 16:26:25 +00:00
Dan Gohman	6c7a485c19	Don't discard an AssemblyAnnotationWriter when writing GlobalValues, which include Functions, where it can be quite useful to use an AssemblyAnnotationWriter. llvm-svn: 69598	2009-04-20 16:10:33 +00:00
Dan Gohman	5fe6b530a5	Implement operator<<(raw_ostream &OS, const Type &T). llvm-svn: 69596	2009-04-20 15:55:38 +00:00
Evan Cheng	d67efaa847	Added a linearscan register allocation optimization. When the register allocator spills an interval with multiple uses in the same basic block, it creates a different virtual register for each of the reloads. e.g. %reg1498<def> = MOV32rm %reg1024, 1, %reg0, 12, %reg0, Mem:LD(4,4) [sunkaddr39 + 0] %reg1506<def> = MOV32rm %reg1024, 1, %reg0, 8, %reg0, Mem:LD(4,4) [sunkaddr42 + 0] %reg1486<def> = MOV32rr %reg1506 %reg1486<def> = XOR32rr %reg1486, %reg1498, %EFLAGS<imp-def,dead> %reg1510<def> = MOV32rm %reg1024, 1, %reg0, 4, %reg0, Mem:LD(4,4) [sunkaddr45 + 0] => %reg1498<def> = MOV32rm %reg2036, 1, %reg0, 12, %reg0, Mem:LD(4,4) [sunkaddr39 + 0] %reg1506<def> = MOV32rm %reg2037, 1, %reg0, 8, %reg0, Mem:LD(4,4) [sunkaddr42 + 0] %reg1486<def> = MOV32rr %reg1506 %reg1486<def> = XOR32rr %reg1486, %reg1498, %EFLAGS<imp-def,dead> %reg1510<def> = MOV32rm %reg2038, 1, %reg0, 4, %reg0, Mem:LD(4,4) [sunkaddr45 + 0] From linearscan's point of view, each of reg2036, 2037, and 2038 are separate registers, each is "killed" after a single use. The reloaded register is available and it's often clobbered right away. e.g. In this case reg1498 is allocated EAX while reg2036 is allocated RAX. This means we end up with multiple reloads from the same stack slot in the same basic block. Now linearscan recognizes there are other reloads from same SS in the same BB. So it'll "downgrade" RAX (and its aliases) after reg2036 is allocated until the next reload (reg2037) is done. This greatly increases the likelihood reloads from SS are reused. This speeds up sha1 from OpenSSL by 5.8%. It is also an across the board win for SPEC2000 and 2006. llvm-svn: 69585	2009-04-20 08:01:12 +00:00
Douglas Gregor	b231e5791e	Make all raw_ostreams support the tell() function. llvm-svn: 69583	2009-04-20 07:34:17 +00:00
Sanjiv Gupta	428d490332	Before trying to introduce/eliminate cast/ext/trunc to make indices type as pointer type, make sure that the pointer size is a valid sequential index type. llvm-svn: 69574	2009-04-20 06:05:54 +00:00
Nick Lewycky	0575dbb692	Use an AssertingVH to detect the case where the Function was deleted but freeMachineCodeForFunction was never called. llvm-svn: 69531	2009-04-19 18:32:03 +00:00
Duncan Sands	f2e7133d34	Now that BUILD_VECTOR operands are allowed to be bigger than the vector element type, turn checking of the operand type back on again, appropriately adjusted. llvm-svn: 69516	2009-04-19 06:40:30 +00:00
Chris Lattner	7b01e66443	Fix PR3898, which manifests as failures on are an Xcore, patch by Jakob Stoklund Olesen! llvm-svn: 69472	2009-04-18 20:48:07 +00:00
Duncan Sands	e4ff21ba4b	Don't try to make BUILD_VECTOR operands have the same type as the vector element type: allow them to be of a wider integer type than the element type all the way through the system, and not just as far as LegalizeDAG. This should be safe because it used to be this way (the old type legalizer would produce such nodes), so backends should be able to handle it. In fact only targets which have legal vector types with an illegal promoted element type will ever see this (eg: <4 x i16> on ppc). This fixes a regression with the new type legalizer (vec_splat.ll). Also, treat SCALAR_TO_VECTOR the same as BUILD_VECTOR. After all, it is just a special case of BUILD_VECTOR. llvm-svn: 69467	2009-04-18 20:16:54 +00:00
Dan Gohman	31efa3098f	Add a ScalarEvolution::getCouldNotCompute() function, and use it instead of allocating and leaking new SCEVCouldNotCompute objects. llvm-svn: 69452	2009-04-18 17:58:19 +00:00
Dan Gohman	927e90c716	More const qualifiers. llvm-svn: 69451	2009-04-18 17:57:20 +00:00
Dan Gohman	056857aa21	Use more const qualifiers with SCEV interfaces. llvm-svn: 69450	2009-04-18 17:56:28 +00:00
Evan Cheng	b685be0c1e	Add a new LiveInterval::overlaps(). It checks if the live interval overlaps a range specified by [Start, End). llvm-svn: 69434	2009-04-18 08:52:15 +00:00
Mon P Wang	6c8bcf9da1	Fixed a few 64 bit cases in X86InstrInfo::commuteInstruction llvm-svn: 69417	2009-04-18 05:16:01 +00:00
Dale Johannesen	ad968ee286	Inline asm's were still introducing bogus dependencies; my earlier patch to this code only fixed half of it. llvm-svn: 69408	2009-04-18 00:09:40 +00:00
Jim Grosbach	8d62763779	remove trailing whitespace llvm-svn: 69402	2009-04-17 23:30:55 +00:00
Bill Wendling	06684350c4	Recommit r69335 and r69336. These were not causing problems. llvm-svn: 69394	2009-04-17 22:40:38 +00:00
Bob Wilson	b0b10f8bf6	Move the AddLiveIn function definition closer to its uses. llvm-svn: 69382	2009-04-17 20:42:34 +00:00
Bob Wilson	deeaf70dad	Rearrange code to reduce indentation. llvm-svn: 69381	2009-04-17 20:40:45 +00:00
Bob Wilson	ea09d4aca8	Clean up formatting, remove trailing whitespace, fix comment typos and punctuation. No functional changes. llvm-svn: 69378	2009-04-17 20:35:10 +00:00
Bob Wilson	a4c2290e5f	Use CallConvLower.h and TableGen descriptions of the calling conventions for ARM. Patch by Sandeep Patel. llvm-svn: 69371	2009-04-17 19:07:39 +00:00
David Greene	22fa407ed7	Use a safer iterator interface and get rid of std C++ library misuse. This fixes a --enable-expensive-checks problem. llvm-svn: 69353	2009-04-17 14:56:18 +00:00
Rafael Espindola	355fe12c82	For general dynamic TLS access we must use leaq foo@TLSGD(%rip), %rdi as part of the instruction sequence. Using a register other than %rdi and then copying it to %rdi is not valid. llvm-svn: 69350	2009-04-17 14:35:58 +00:00
Bill Wendling	30527b1114	Revert r69335 and r69336. They were causing build failures. llvm-svn: 69347	2009-04-17 04:19:22 +00:00
Dan Gohman	d2d6fd806c	Don't create ConstantInts with pointer type. This fixes a regression in 403.gcc in PIC_CODEGEN=1 and DISABLE_LTO=1 mode. llvm-svn: 69344	2009-04-17 02:02:52 +00:00
Evan Cheng	b96a1082a9	Teach spiller to unfold instructions which modref spill slot when a scratch register is available and when it's profitable. e.g. xorq %r12<kill>, %r13 addq %rax, -184(%rbp) addq %r13, -184(%rbp) ==> xorq %r12<kill>, %r13 movq -184(%rbp), %r12 addq %rax, %r12 addq %r13, %r12 movq %r12, -184(%rbp) Two more instructions, but fewer memory accesses. It can also open up opportunities for more optimizations. llvm-svn: 69341	2009-04-17 01:29:40 +00:00
Dan Gohman	09dbb0b5e0	MOV8rr_NOREX is a "Move" instruction. This doesn't currently matter, because this instruction isn't generated until after things that care. llvm-svn: 69336	2009-04-17 00:45:17 +00:00
Dan Gohman	74835ce1cb	Don't use MOV8rr_NOREX on x86-32. It doesn't actually hurt anything at present, but it's inconsistent. llvm-svn: 69335	2009-04-17 00:43:09 +00:00
Chris Lattner	a8919d0a35	Fix some failures in targets on available_externally functions, this fixes a crash on CodeGen/Generic/externally_available.ll on ppc hosts. Thanks to Nicholas L for pointing this out. llvm-svn: 69333	2009-04-17 00:26:12 +00:00
Dan Gohman	fec1d086e0	Use TargetData::getTypeSizeInBits instead of getPrimitiveSizeInBits() to get the correct answer for pointer types. llvm-svn: 69321	2009-04-16 22:35:57 +00:00
Chris Lattner	11ceb38df0	make sure to unlock keymgr if the JIT is created and destroyed, all locks must be matched with unlocks. Also, use calloc to allocate the block so that it is properly zero'd. Thanks to Nick Kledzik for tracking this down. llvm-svn: 69314	2009-04-16 21:47:59 +00:00
Eli Friedman	929207fd1d	Fix for PR3944: make mem2reg O(N) instead of O(N^2) in the number of incoming edges for a block with many predecessors. llvm-svn: 69312	2009-04-16 21:40:28 +00:00
Dan Gohman	72dc8452a3	Handle a pointer type correctly in SCEVExpander::visitAddRecExpr. llvm-svn: 69310	2009-04-16 21:34:54 +00:00
Dan Gohman	eefba6bbe0	In the list-burr's pseudo two-addr dependency heuristics, don't add dependencies on nodes with exactly one successor which is a COPY_TO_REGCLASS node. In the case that the copy is coalesced away, the dependence should be on the user of the copy, rather than the copy itself. llvm-svn: 69309	2009-04-16 20:59:02 +00:00
Dan Gohman	3027bb6953	Handle SUBREG_TO_REG instructions with the same heuristics as INSERT_SUBREG instructions in the list-burr scheduler. llvm-svn: 69308	2009-04-16 20:57:10 +00:00
Dan Gohman	c1c2ba7a72	Fix a bug with inttoptr/ptrtoint casts where the pointer has a different size from the integer, requiring zero extension or truncation. Don't create ZExtInsts with pointer types. This fixes a regression in consumer-jpeg. llvm-svn: 69307	2009-04-16 19:25:55 +00:00
Devang Patel	dab01f3fd6	Do not treat beginning of inlined scope as beginning of normal function scope if the location info is missing. Instead of doing ... if (inlined_subroutine && known_location) DW_TAG_inline_subroutine else DW_TAG_subprogram do if (inlined_subroutine) { if (known_location) DW_TAG_inline_subroutine } else { DW_TAG_subprogram } llvm-svn: 69300	2009-04-16 17:55:30 +00:00
Dan Gohman	8b6ebb1112	Minor code simplifications. Don't attempt LSR on theoretical targets with pointers larger than 64 bits, due to the code not yet being APInt clean. llvm-svn: 69296	2009-04-16 16:49:48 +00:00
Dan Gohman	e2ead2c328	LSR is no longer a GEP optimizer. It is now an IV expression optimizer, which just happen to frequently involve optimizing GEPs. llvm-svn: 69295	2009-04-16 16:46:01 +00:00
Dan Gohman	e98ead45e2	Fix SCEVExpander::visitSMaxExpr and SCEVExpander::visitUMaxExpr to not create ICmpInsts with operands of different types. This fixes a regression in Applications/d/make_dparser. llvm-svn: 69294	2009-04-16 16:15:25 +00:00
Dan Gohman	66e038a3e3	Teach SCEVExpander::InsertCastOfTo to avoid creating inttoptr-of-ptrtoint and ptrtoint-of-inttoptr expressions. This fixes a regression in 300.twolf. llvm-svn: 69293	2009-04-16 15:52:57 +00:00
Dan Gohman	a8be04b2db	Use ConstantExpr::getIntToPtr instead of SCEVExpander::InsertCastOfTo, since the operand is always a constant. llvm-svn: 69291	2009-04-16 15:48:38 +00:00
Dan Gohman	71bccd3e0e	Use a SCEV expression cast instead of immediately inserting a new instruction with SCEVExpander::InsertCastOfTo. llvm-svn: 69290	2009-04-16 15:47:35 +00:00
Devang Patel	9ac4390bf4	Record line number at the beginning of a func.start. This line was accidentally lost yesterday. llvm-svn: 69286	2009-04-16 15:07:09 +00:00
Rafael Espindola	5e42177a0f	fix PR3995. A scale must be 1, 2, 4 or 8. llvm-svn: 69284	2009-04-16 12:34:53 +00:00
Dan Gohman	0a40ad93a9	Expand GEPs in ScalarEvolution expressions. SCEV expressions can now have pointer types, though in contrast to C pointer types, SCEV addition is never implicitly scaled. This not only eliminates the need for special code like IndVars' EliminatePointerRecurrence and LSR's own GEP expansion code, it also does a better job because it lets the normal optimizations handle pointer expressions just like integer expressions. Also, since LLVM IR GEPs can't directly index into multi-dimensional VLAs, moving the GEP analysis out of client code and into the SCEV framework makes it easier for clients to handle multi-dimensional VLAs the same way as other arrays. Some existing regression tests show improved optimization. test/CodeGen/ARM/2007-03-13-InstrSched.ll in particular improved to the point where if-conversion started kicking in; I turned it off for this test to preserve the intent of the test. llvm-svn: 69258	2009-04-16 03:18:22 +00:00
Devang Patel	653dee0884	In -fast mode do what FastISel does. This code could use some refactoring help! llvm-svn: 69254	2009-04-16 02:33:41 +00:00
Devang Patel	46b04e4d06	If FastISel is run and it has known DebugLoc then use it. llvm-svn: 69253	2009-04-16 01:33:10 +00:00
Devang Patel	43fc7e481b	If location where the function was inlined is not known then do not emit debug info describing inlined region. llvm-svn: 69252	2009-04-16 01:31:54 +00:00
Dale Johannesen	a71daa83c6	Eliminate zext over (iv | const) or (signed iv), and sext over (iv | const), if a longer iv is available. Allow expressions to have more than one zext/sext parent. All from OpenSSL. llvm-svn: 69241	2009-04-15 23:31:51 +00:00
Devang Patel	31043aa200	s/RootDbgScope/FunctionDbgScope/g llvm-svn: 69216	2009-04-15 20:41:31 +00:00
Dale Johannesen	82230b5b17	Eliminate zext over (iv & const) or ((iv+const)&const) if a longer iv is available. These subscript forms are not common; they're a bottleneck in OpenSSL. llvm-svn: 69215	2009-04-15 20:41:02 +00:00
Chris Lattner	baf9535284	teach EscapeString and UnescapeString to handle ". llvm-svn: 69211	2009-04-15 20:12:52 +00:00
Devang Patel	2738d7312a	Add DISubprogram is not null check. This fixes test/CodeGen//2009-01-21-invalid-debug-info.m test case. llvm-svn: 69210	2009-04-15 20:11:08 +00:00
Dan Gohman	8aa28b9c34	Generalize one of the SelectionDAG::ReplaceAllUsesWith overloads to support replacing a node with another that has a superset of the result types. Use this instead of calling ReplaceAllUsesOfValueWith for each value. llvm-svn: 69209	2009-04-15 20:06:30 +00:00
Dan Gohman	de7b3e74be	Fix 80-column violations. llvm-svn: 69204	2009-04-15 19:48:57 +00:00
Dan Gohman	6711216e84	Add a folding table entry for MOV8rr_NOREX. llvm-svn: 69203	2009-04-15 19:48:28 +00:00
Devang Patel	70307db0d5	Check isInlinedSubroutine() before creating DW_TAG_inlined_subroutine. llvm-svn: 69202	2009-04-15 19:42:57 +00:00
Dan Gohman	37608532c4	Fix MachineInstr::getNumExplicitOperands to count variadic operands correctly. Patch by Jakob Stoklund Olesen! llvm-svn: 69190	2009-04-15 17:59:11 +00:00
Nick Lewycky	4a7bcf6410	Limit the number of times we're willing to chase pointers. Removes an O(n^2) problem from instcombine. llvm-svn: 69151	2009-04-15 06:23:41 +00:00
Dan Gohman	6f873b446a	Fix X86MachineFunctionInfo's doxygen comment. llvm-svn: 69127	2009-04-15 01:20:18 +00:00
Dan Gohman	210448c233	Move MachineRegisterInfo::setRegClass out of line. llvm-svn: 69126	2009-04-15 01:19:35 +00:00
Dan Gohman	505065cdd0	Move MachineJumpTableInfo::ReplaceMBBInJumpTables out of line. llvm-svn: 69125	2009-04-15 01:18:49 +00:00
Dan Gohman	89892b05c7	Give RemoveRegOperandFromRegInfo a comment and move the code out of line. llvm-svn: 69124	2009-04-15 01:17:37 +00:00
Dale Johannesen	7ffb7d5728	Enhance induction variable code to remove the sext around sext(shorter IV + constant), using a longer IV instead, when it can figure out the add can't overflow. This comes up a lot in subscripting; mainly affects 64 bit. llvm-svn: 69123	2009-04-15 01:10:12 +00:00
Evan Cheng	ffb83a155e	Avoid making the transformation enabled by my last patch if the new destinations have phi nodes. llvm-svn: 69121	2009-04-15 00:43:54 +00:00
Devang Patel	046bf624b9	While inlining, clone llvm.dbg.func.start intrinsic and adjust llvm.dbg.region.end intrinsic. This nested llvm.dbg.func.start/llvm.dbg.region.end pair now enables DW_TAG_inlined_subroutine support in code generator. llvm-svn: 69118	2009-04-15 00:17:06 +00:00
Devang Patel	32d17a1a29	Construct and emit DW_TAG_inlined_subroutine DIEs for inlined subroutine scopes (only in FastISel mode). llvm-svn: 69116	2009-04-15 00:10:26 +00:00
Dan Gohman	dd07f638f5	Do for GR16_NOREX what r69049 did for GR8_NOREX, to avoid trouble with the local register allocator. llvm-svn: 69115	2009-04-15 00:10:16 +00:00
Devang Patel	1be3b53d3c	Add a method to check that the subprogram holds debug info for the given Function or not. llvm-svn: 69113	2009-04-15 00:06:07 +00:00
Dan Gohman	7913ea5e4a	Add a new MOV8rr_NOREX, and make X86's copyRegToReg use it when either the source or destination is a physical h register. This fixes sqlite3 with the post-RA scheduler enabled. llvm-svn: 69111	2009-04-15 00:04:23 +00:00
Dan Gohman	821e13a8f4	GR8_NOREX can contain the H registers, since they don't require REX prefixes. llvm-svn: 69108	2009-04-15 00:00:48 +00:00
Evan Cheng	5ebf2acd84	Optimize conditional branch on i1 phis with non-constant inputs. This turns: eq: %3 = icmp eq i32 %1, %2 br label %join ne: %4 = icmp ne i32 %1, %2 br label %join join: %5 = phi i1 [%3, %eq], [%4, %ne] br i1 %5, label %yes, label %no => eq: %3 = icmp eq i32 %1, %2 br i1 %3, label %yes, label %no ne: %4 = icmp ne i32 %1, %2 br i1 %4, label %yes, label %no llvm-svn: 69102	2009-04-14 23:40:03 +00:00
Dan Gohman	62f4498646	For the h-register addressing-mode trick, use the correct value for any non-address uses of the address value. This fixes 186.crafty. llvm-svn: 69094	2009-04-14 22:45:05 +00:00
Evan Cheng	3f1a3008f4	Mac OS X 10.6 and above do not use key manager to register EH frames. llvm-svn: 69090	2009-04-14 22:31:59 +00:00
Dan Gohman	e5cd1fcdb9	When the result of an EXTRACT_SUBREG, INSERT_SUBREG, or SUBREG_TO_REG operator is used by a CopyToReg to export the value to a different block, don't reuse the CopyToReg's register for the subreg operation result if the register isn't precisely the right class for the subreg operation. Also, rename the h-registers.ll test, now that there are more than one. llvm-svn: 69087	2009-04-14 22:17:14 +00:00
Mikhail Glushenkov	60cde5ba7c	Call CreateProcess with bInheritHandles = TRUE. Makes llvmc show error messages printed by child processes when run from the Cygwin/MSYS shell. Since ExecuteAndWait does not return until the child program has finished execution, this change should be harmless. llvm-svn: 69082	2009-04-14 21:31:36 +00:00
Mikhail Glushenkov	3a62efb732	Delete trailing whitespace. llvm-svn: 69081	2009-04-14 21:31:14 +00:00
Evan Cheng	dfbbf5c043	Some of GR8_NOREX registers are only available in 64-bit mode. llvm-svn: 69049	2009-04-14 16:57:43 +00:00
Sanjiv Gupta	92bb846e2b	Handle aggregate type arguments to direct and indirect calls. llvm-svn: 69022	2009-04-14 02:49:52 +00:00
Owen Anderson	a1902318e3	LoopIndexSplit needs to inform the loop pass manager of the instructions it is deleting, not just the basic block. llvm-svn: 69011	2009-04-14 01:04:19 +00:00
Dale Johannesen	83593f4167	Do not force asm's to be chained if they don't touch memory and aren't volatile. This was interfering with good scheduling. llvm-svn: 69008	2009-04-14 00:56:56 +00:00
Evan Cheng	9787183b9b	Fix PR3934 part 2. findOnlyInterestingUse() was not setting IsCopy and IsDstPhys which are returned by value and used by callee. This happened to work on the earlier test cases because of a logic error in the caller side. llvm-svn: 69006	2009-04-14 00:32:25 +00:00
Daniel Dunbar	097f630dad	Make these errors more noticeable in build logs. llvm-svn: 68998	2009-04-13 22:26:09 +00:00
Bob Wilson	59dbbb2bb4	Change SelectionDAG type legalization to allow BUILD_VECTOR operands to be promoted to legal types without changing the type of the vector. This is following a suggestion from Duncan (http://lists.cs.uiuc.edu/pipermail/llvmdev/2009-February/019923.html). The transformation that used to be done during type legalization is now postponed to DAG legalization. This allows the BUILD_VECTORs to be optimized and potentially handled specially by target-specific code. It turns out that this is also consistent with an optimization done by the DAG combiner: a BUILD_VECTOR and INSERT_VECTOR_ELT may be combined by replacing one of the BUILD_VECTOR operands with the newly inserted element; but INSERT_VECTOR_ELT allows its scalar operand to be larger than the element type, with any extra high bits being implicitly truncated. The result is a BUILD_VECTOR where one of the operands has a type larger than the vector element type. Any code that operates on BUILD_VECTORs may now need to be aware of the potential type discrepancy between the vector element type and the BUILD_VECTOR operands. This patch updates all of the places that I could find to handle that case. llvm-svn: 68996	2009-04-13 22:05:19 +00:00
Dan Gohman	6c1426308c	Rename COPY_TO_SUBCLASS to COPY_TO_REGCLASS, and generalize it accordingly. Thanks to Jakob Stoklund Olesen for pointing out how this might be useful. llvm-svn: 68986	2009-04-13 21:06:25 +00:00
Bob Wilson	f6c2195383	Refactor some code in SelectionDAGLegalize::ExpandBUILD_VECTOR. llvm-svn: 68981	2009-04-13 20:20:30 +00:00
Evan Cheng	f0843803a0	PR3934: Fix a bogus two-address pass assertion. llvm-svn: 68979	2009-04-13 20:04:24 +00:00
Devang Patel	0431504fb2	Right now, Debugging information to encode scopes (DW_TAG_lexical_block) relies on DBG_LABEL. Unfortunately this interferes with the quality of optimized code. This patch updates dwarf writer to encode scoping information in DWARF only in FastISel mode. llvm-svn: 68973	2009-04-13 18:13:16 +00:00
Devang Patel	80be3511ed	Reapply 68847. Now debug_inlined section is covered by TAI->doesDwarfUsesInlineInfoSection(), which is false by default. llvm-svn: 68964	2009-04-13 17:02:03 +00:00
Dan Gohman	57d6bd36b2	Implement x86 h-register extract support. - Add patterns for h-register extract, which avoids a shift and mask, and in some cases a temporary register. - Add address-mode matching for turning (X>>(8-n))&(255<<n), where n is a valid address-mode scale value, into an h-register extract and a scaled-offset address. - Replace X86's MOV32to32_ and related instructions with the new target-independent COPY_TO_SUBREG instruction. On x86-64 there are complicated constraints on h registers, and CodeGen doesn't currently provide a high-level way to express all of them, so they are handled with a bunch of special code. This code currently only supports extracts where the result is used by a zero-extend or a store, though these are fairly common. These transformations are not always beneficial; since there are only 4 h registers, they sometimes require extra move instructions, and this sometimes increases register pressure because it can force out values that would otherwise be in one of those registers. However, this appears to be relatively uncommon. llvm-svn: 68962	2009-04-13 16:09:41 +00:00
Dan Gohman	60a446ab02	Add a new TargetInstrInfo MachineInstr opcode, COPY_TO_SUBCLASS. This will be used to replace things like X86's MOV32to32_. Enhance ScheduleDAGSDNodesEmit to be more flexible and robust in the presence of subregister superclasses and subclasses. It can now cope with the definition of a virtual register being in a subclass of a use. Re-introduce the code for recording register superreg classes and subreg classes. This is needed because when subreg extracts and inserts get coalesced away, the virtual registers are left in the correct subclass. llvm-svn: 68961	2009-04-13 15:38:05 +00:00
Dan Gohman	f20462c217	Remove x86's special-case handling for ISD::TRUNCATE and ISD::SIGN_EXTEND_INREG. Tablegen-generated code can handle these cases, and the scheduling issues observed earlier appear to be resolved now. llvm-svn: 68959	2009-04-13 15:29:31 +00:00
Dan Gohman	e9432dcc0d	Fix copy+pastos in comments. llvm-svn: 68958	2009-04-13 15:28:29 +00:00
Dan Gohman	1d50407932	Don't abort on an aliasing physical register that does not have a live interval. This is needed for some upcoming subreg changes. llvm-svn: 68956	2009-04-13 15:22:29 +00:00
Dan Gohman	4d62ff1483	When assigning a physical register to a MachineOperand, set the subreg field to 0, since the subreg field is only used for virtual register subregs. This doesn't change current functionality; it just eliminates bogus noise from debug output. llvm-svn: 68955	2009-04-13 15:21:32 +00:00
Dan Gohman	9e3eb7bcd9	List the l registers before h registers, for consistency. llvm-svn: 68954	2009-04-13 15:18:42 +00:00
Dan Gohman	85abd983f2	Add an assertion to verify that a copy was actually emitted. llvm-svn: 68953	2009-04-13 15:16:56 +00:00
Dan Gohman	092b8b6fdb	Use X86::SUBREG_8BIT instead of hard-coding the equivalent constant. llvm-svn: 68951	2009-04-13 15:14:03 +00:00
Dan Gohman	c5c2fc45ae	Add a comment about MOVSX64rr8. llvm-svn: 68950	2009-04-13 15:13:28 +00:00
Dan Gohman	39aa13a401	Fix another hard-coded constant to use X86AddrNumOperands. This unbreaks the JIT on x86-64. llvm-svn: 68948	2009-04-13 15:04:25 +00:00
Rafael Espindola	6d6c6043ea	X86-64 TLS support for local exec and initial exec. llvm-svn: 68947	2009-04-13 13:02:49 +00:00
Sanjiv Gupta	0368bc4703	While passing arg of types larger than char only one byte at lower end was getting passed. We couldn't catch this as we did not have tests that were passing an int value larger than 256. llvm-svn: 68946	2009-04-13 09:38:38 +00:00
Nick Lewycky	8019af6b2c	Fix warning in .ll parser, detect and reject available_externally on function declarations. llvm-svn: 68944	2009-04-13 07:02:02 +00:00
Chris Lattner	184f1be4a8	Add a new "available_externally" linkage type. This is intended to support C99 inline, GNU extern inline, etc. Related bugzilla's include PR3517, PR3100, & PR2933. Nothing uses this yet, but it appears to work. llvm-svn: 68940	2009-04-13 05:44:34 +00:00
Chris Lattner	836e77d161	eliminate unneeded parens. llvm-svn: 68939	2009-04-13 05:38:23 +00:00
Nick Lewycky	e54da991d1	Link against libffi if available, fall back to "no external calls from interpreter mode" when it's not. llvm-svn: 68937	2009-04-13 04:26:06 +00:00
Chris Lattner	ce8b7d2e24	fix PR3965:SIGINT handler not restored after calling ParseAST(), patch by Alexei Svitkine! llvm-svn: 68929	2009-04-12 23:33:13 +00:00
Rafael Espindola	7186f20a1b	In X86DAGToDAGISel::MatchWrapper, if base or index are set, avoid matching only if symbolic addresses are RIP relatives. llvm-svn: 68924	2009-04-12 23:00:38 +00:00
Rafael Espindola	6688b0a5da	refactor some code into X86DAGToDAGISel::MatchWrapper llvm-svn: 68915	2009-04-12 21:55:03 +00:00
Chris Lattner	6cd82fb430	"There was a typo in my previous patch which leads to miscompilation of strncat :( strncat(foo, "bar", 99) would be optimized to memcpy(foo+strlen(foo), "bar", 100, 1) instead of memcpy(foo+strlen(foo), "bar", 4, 1)" Patch by Benjamin Kramer! llvm-svn: 68905	2009-04-12 18:22:33 +00:00
Chris Lattner	ce6bcf0847	fix a cross-block fastisel crash handling overflow intrinsics. See comment for details. This fixes rdar://6772169 llvm-svn: 68890	2009-04-12 07:51:14 +00:00
Chris Lattner	a101f6f8d3	make UpdateValueMap handle the possibility that we could be copying into the right register, avoiding a copy. llvm-svn: 68889	2009-04-12 07:46:30 +00:00
Chris Lattner	ada5d6c37e	optimize FastISel::UpdateValueMap to avoid duplicate map lookups, and make it return the assigned register. llvm-svn: 68888	2009-04-12 07:45:01 +00:00
Chris Lattner	99a8cb627d	simplify code by using IntrinsicInst. llvm-svn: 68887	2009-04-12 07:36:01 +00:00
Chris Lattner	24ac95abc1	Add new TargetInstrDesc::hasImplicitUseOfPhysReg and hasImplicitDefOfPhysReg methods. Use them to remove a loop in X86 fast isel. llvm-svn: 68886	2009-04-12 07:26:51 +00:00
Chris Lattner	91b6af24ac	add some optimizations for strncpy/strncat and factor some code. Patch by Benjamin Kramer! llvm-svn: 68885	2009-04-12 05:06:39 +00:00
Dan Gohman	825236b116	Revert r68847. It breaks the build on non-Darwin targets, with this message from the assembler: Error: unknown pseudo-op: `.debug_inlined' llvm-svn: 68863	2009-04-11 15:57:04 +00:00
Devang Patel	790e60999e	Keep track of inlined functions and their locations. This information is collected when nested llvm.dbg.func.start intrinsics are seen. (Right now, inliner removes nested llvm.dbg.func.start intrinsics during inlining.) Create debug_inlined dwarf section using this information. This info is used by gdb, at least on Darwin, to enable better experience debugging inlined functions. See DwarfWriter.cpp for more information on structure of debug_inlined section. llvm-svn: 68847	2009-04-11 00:16:47 +00:00
Devang Patel	61b35cd15e	DebugLabelFolder ruthlessly deletes redundant labels. However, sometimes a redundant label is referenced by debug info somewhere else. This patch provides a way so that dwarf writer can mark labels as used. llvm-svn: 68813	2009-04-10 18:58:59 +00:00
Bob Wilson	f074ca7454	Clean up a bunch of whitespace issues and fix a comment typo. No functional changes. llvm-svn: 68808	2009-04-10 18:48:47 +00:00
Chris Lattner	30c3de6461	fix two problems with machine sinking: 1. Sinking would crash when the first instruction of a block was sunk due to iterator problems. 2. Instructions could be sunk to their current block, causing an infinite loop. This fixes PR3968 llvm-svn: 68787	2009-04-10 16:38:36 +00:00
Dan Gohman	e517ae4211	Now that register classes have names, include the name in debug output. llvm-svn: 68786	2009-04-10 15:59:38 +00:00
Sanjiv Gupta	638f400a3c	Added code to handle spilling and reloading of FSRs. llvm-svn: 68783	2009-04-10 15:10:14 +00:00
Rafael Espindola	bb834f0929	Don't fold a load if the other operand is a TLS address. With this we generate movl %gs:0, %eax leal i@NTPOFF(%eax), %eax instead of movl $i@NTPOFF, %eax addl %gs:0, %eax llvm-svn: 68778	2009-04-10 10:09:34 +00:00
Chris Lattner	bc8848f76f	Add a new Type::getPointerTo method, which is shorthand for llvm::PointerType::get(). Patch by Anders Johnsen! llvm-svn: 68772	2009-04-10 06:42:02 +00:00
Chris Lattner	428f71623b	a few fixes to "addrspace(256) is reference offset of GS segment register". It turns out that there are still several problems with this, will file a bugzilla. llvm-svn: 68749	2009-04-10 00:16:23 +00:00
Bill Wendling	992f846dc7	Pass in the std::string parameter instead of returning it by value. llvm-svn: 68747	2009-04-10 00:12:49 +00:00
Bill Wendling	fe56c25065	Constify getter methods. llvm-svn: 68745	2009-04-10 00:00:25 +00:00
Dan Gohman	de912e2475	Remove the obsolete SelectionDAG::getNodeValueTypes and simplify code that uses it by using SelectionDAG::getVTList instead. llvm-svn: 68744	2009-04-09 23:54:40 +00:00
Bill Wendling	16abfc97fb	StringMap<DIE>::iterator::first() returns a pointer to the first character of the key. This will cause it to create a new std::string, which isn't wanted. Instead, pass back the "const char". Modify the EmitString() method to take a "const char*". llvm-svn: 68741	2009-04-09 23:51:31 +00:00
Devang Patel	a68bdef482	Silence unused variable warning. llvm-svn: 68735	2009-04-09 23:45:17 +00:00
Chris Lattner	66ab904f5a	ignore register zero in isRegTiedToUseOperand, following the example of isRegTiedToDefOperand. Thanks to Bob for pointing this out! llvm-svn: 68734	2009-04-09 23:33:34 +00:00
Owen Anderson	5eb8d26f19	Give register alias checking the hash table treatment too. llvm-svn: 68730	2009-04-09 22:19:30 +00:00
Bill Wendling	e65fd1a302	Use a StringMap instead of std::map for storing std::string->DIE* maps. This gives a micro speedup in the Dwarf writer. llvm-svn: 68728	2009-04-09 21:49:15 +00:00
Devang Patel	a2c2b85df4	llvm.dbg.func_start also defines beginning of function scope. llvm-svn: 68727	2009-04-09 21:42:11 +00:00
Bob Wilson	51856173c8	Fix pr3954. The register scavenger asserts for inline assembly with register destinations that are tied to source operands. The TargetInstrDescr::findTiedToSrcOperand method silently fails for inline assembly. The existing MachineInstr::isRegReDefinedByTwoAddr was very close to doing what is needed, so this revision makes a few changes to that method and also renames it to isRegTiedToUseOperand (for consistency with the very similar isRegTiedToDefOperand and because it handles both two-address instructions and inline assembly with tied registers). llvm-svn: 68714	2009-04-09 17:16:43 +00:00
Sanjiv Gupta	38533d48dd	The way we are trying to figure out banksel immediate operand may yield different results for different type of insns. This will eventually need to be changed but currently let us prevent the crash in cases of incorrect detection of banksel operand. llvm-svn: 68713	2009-04-09 17:06:24 +00:00
Chris Lattner	a725028d41	reg0 references are not real registers. This fixes a crash on the attached testcase. llvm-svn: 68712	2009-04-09 16:50:43 +00:00
Sanjiv Gupta	60fd2cd503	Arguments to indirect calls were being passed incorrectly. They are not fixed to start after return value. llvm-svn: 68705	2009-04-09 10:29:32 +00:00
Chris Lattner	bcd2632638	Fix code size computation on x86-64, patch by Zoltan Varga! llvm-svn: 68690	2009-04-09 06:10:51 +00:00
Sanjiv Gupta	cdfdc79b70	r68576 uncovered a bug in PIC16 port (Thanks to Dan Gohman) where we were custom lowering an ADD to ADDC. llvm-svn: 68671	2009-04-09 04:03:43 +00:00
Dan Gohman	0e8d199f91	Generalize ExtendUsesToFormExtLoad to be usable for ANY_EXTEND, in addition to ZERO_EXTEND and SIGN_EXTEND. Fix a bug in the way it checked for live-out values, and simplify the way it find users by using SDNode::use_iterator's (relatively) new features. Also, make it slightly more permissive on targets with free truncates. In SelectionDAGBuild, avoid creating ANY_EXTEND nodes that are larger than necessary. If the target's SwitchAmountTy has enough bits, use it. This exposes the truncate to optimization early, enabling more optimizations. llvm-svn: 68670	2009-04-09 03:51:29 +00:00
Owen Anderson	3a888f639e	Convert TargetRegisterInfo's super-register checking to use a pre-computed hash table just like subregister checking does. llvm-svn: 68669	2009-04-09 03:50:16 +00:00
Dan Gohman	e6db8ca5eb	Don't copy the operand of a SwitchInst into virtual registers as eagerly. This helps avoid CopyToReg nodes in some cases where they aren't needed, and also helps subsequent optimizer heuristics in cases where the extra nodes would cause the node to appear to have multiple results. This doesn't have a significant impact currently; it'll help an upcoming change. llvm-svn: 68667	2009-04-09 02:33:36 +00:00
Dan Gohman	f15454866c	Fix grammaros in comments. llvm-svn: 68666	2009-04-09 02:06:09 +00:00
Daniel Dunbar	3f48987069	Add sys::Path::makeAbsolute(). llvm-svn: 68663	2009-04-09 00:33:08 +00:00
Devang Patel	3396d7ef9d	If subprogram type is not tagged as DW_TAG_subroutine_type then use it directly as a return value type. llvm-svn: 68647	2009-04-08 22:18:45 +00:00
Rafael Espindola	3b2df10c9e	Re-apply 68552. Tested by bootstrapping llvm-gcc and using that to build llvm. llvm-svn: 68645	2009-04-08 21:14:34 +00:00
Bob Wilson	866c174f79	Fix PR3795: Apply Dan's suggested fix for ARMTargetLowering::isLegalAddressingMode. llvm-svn: 68619	2009-04-08 17:55:28 +00:00
Duncan Sands	5a82613db0	Soft float support for FREM. llvm-svn: 68614	2009-04-08 16:20:57 +00:00
Duncan Sands	fb438caac6	Soft float support for undef. Reported by Xerxes Rånby. llvm-svn: 68607	2009-04-08 13:33:37 +00:00
Rafael Espindola	d173f4237d	Avoid a hard coded constant. llvm-svn: 68603	2009-04-08 08:09:33 +00:00
Sanjiv Gupta	44ea053a49	Emit .line debug directives for stoppoints. The debug location is retrieved by the MachineInstr itself, rather than by custom handling the DBG_STOPPOINT nodes. llvm-svn: 68602	2009-04-08 06:24:04 +00:00
Chris Lattner	eb510d6b3d	Instcombine should not promote whole computation trees to "strange" integer types, unless they are already strange. This prevents it from turning the code produced by SROA into crazy libcalls and stuff that the code generator can't handle. In the attached example, the result was an i96 multiply that caused the x86 backend to assert. Note that if TargetData had an idea of what the legal types are for a target that this could be used to stop instcombine from introducing i64 muls, as Scott wanted. llvm-svn: 68598	2009-04-08 05:41:03 +00:00
Sanjiv Gupta	4b489c75c2	Handle indirect function calls. Every function has the address of its frame in the beginning of code section. The frame address is retrieved and used to pass arguments. llvm-svn: 68597	2009-04-08 05:38:48 +00:00
Chris Lattner	a72576e92b	disable this code for now, re-breaking PR2975, but fixing a testcase I'm about to attach to that pr. llvm-svn: 68592	2009-04-08 04:36:59 +00:00
Chris Lattner	3875195865	Remove AllowInverse: it leaks memory and is not the right abstraction for CommandLine. llvm-svn: 68588	2009-04-08 03:43:51 +00:00
Chris Lattner	69b586e547	change printStringChar to emit characters as unsigned char instead of char, avoiding sign extension for the top octet. For "negative" chars, we'd print stuff like: .asciz "\702... now we print: .asciz "\302... llvm-svn: 68577	2009-04-08 00:28:38 +00:00

28154 Commits