llvm-project

Commit Graph

Author	SHA1	Message	Date
Gabor Greif	e4eed709d4	use ArgOperand API llvm-svn: 106828	2010-06-25 08:24:59 +00:00
Gabor Greif	f6207e0a80	prune an include llvm-svn: 106827	2010-06-25 08:16:50 +00:00
Dale Johannesen	e9eaaa91d8	Fix a case where an earlyclobber operand of an asm is reused as an input. PR 4118. Testcase is too big, as usual with bugs in this area, but there's one in the PR. llvm-svn: 106816	2010-06-25 00:49:43 +00:00
Jakob Stoklund Olesen	889ab7d158	Make sure all eliminated kills are removed from VNInfo lists. This fixes PR7479 and PR7485. The test cases from those PRs are big, so not included. However, PR7485 comes from self hosting on FreeBSD, so we will surely hear about any regression. llvm-svn: 106811	2010-06-24 23:57:35 +00:00
Dan Gohman	5f0bf64c0c	Add some comments. llvm-svn: 106809	2010-06-24 23:41:59 +00:00
Dan Gohman	9a2f0473b2	Teach EmitLiveInCopies to omit copies for unused virtual registers, and to clean up unused incoming physregs from the live-in list. llvm-svn: 106805	2010-06-24 22:23:02 +00:00
Bill Wendling	2d3c490026	It's possible that a flag is added to the SDNode that points back to the original SDNode. This is badness. Also, this function allows one SDNode to point multiple flags to another SDNode. Badness as well. llvm-svn: 106793	2010-06-24 22:00:37 +00:00
Devang Patel	c657c621b7	DBG_VALUE machine instruction pointing to undefined register for a variable justify a separate scope if the variable is inlined function's argument. Radar 8122864. llvm-svn: 106792	2010-06-24 21:51:19 +00:00
Jakob Stoklund Olesen	2b87d44c5d	Don't return a std::vector in the Spiller interface, but take a reference to a vector instead. This avoids needless copying and allocation. Add documentation. llvm-svn: 106788	2010-06-24 20:54:29 +00:00
Jakob Stoklund Olesen	9b659142a6	Remove the now unused LiveIntervals::getVNInfoSourceReg(). This method was always a bit too simplistic for the real world. It didn't really deal with subregisters and such. llvm-svn: 106781	2010-06-24 20:18:15 +00:00
Jakob Stoklund Olesen	487ed997d0	Teach AdjustCopiesBackFrom to also use CoalescerPair to identify compatible copies. llvm-svn: 106780	2010-06-24 20:16:00 +00:00
Jakob Stoklund Olesen	7f894d8fdc	Remove the -fast-spill option. This code path has never really been used, and we are going to be handling spilling through the Spiller interface in the future. llvm-svn: 106777	2010-06-24 19:56:08 +00:00
Bill Wendling	3f0e992af1	Loosen up the requirements in the Horrible Hack(tm) to include all selectors which don't have a catch-all associated with them not just clean-ups. This fixes the SingleSource/Benchmarks/Shootout-C++/except.cpp testcase that broke because of my change r105902. llvm-svn: 106772	2010-06-24 18:49:10 +00:00
Jakob Stoklund Olesen	45230239e4	Replace a big gob of old coalescer logic with the new CoalescerPair class. CoalescerPair can determine if a copy can be coalesced, and which register gets merged away. The old logic in SimpleRegisterCoalescing had evolved into something a bit too convoluted. This second attempt fixes some crashes that only occurred Linux. llvm-svn: 106769	2010-06-24 18:15:01 +00:00
Jakob Stoklund Olesen	a612d7c012	Print the LSBs of a SlotIndex symbolically using letters referring to the [L]oad, [u]se, [d]ef, or [S]tore slots. This makes it easier to see if two indices refer to the same instruction, avoiding mental mod 4 calculations. llvm-svn: 106766	2010-06-24 17:31:07 +00:00
Dan Gohman	8a84cd57ae	Simplify this code; switch lowering shouldn't produce cases which trivially fold away. llvm-svn: 106765	2010-06-24 17:08:31 +00:00
Jakob Stoklund Olesen	3b2b46a700	Be more strict about subreg-to-subreg copies in CoalescerPair. Also keep track of the original DstREg before subregister adjustments. llvm-svn: 106753	2010-06-24 16:19:28 +00:00
Jakob Stoklund Olesen	53ccab7d1c	Verify that VNI kills are pointing to existing instructions. In this case it is essential that the kill is real because the spiller will decide to omit a spill if it thinks there is a later kill. llvm-svn: 106751	2010-06-24 15:56:59 +00:00
Dan Gohman	463f26b4be	Eliminate the other half of the BRCOND optimization, and update as many tests as possible. llvm-svn: 106749	2010-06-24 15:24:03 +00:00
Dan Gohman	df6b33e778	Eliminate the first have of the optimization which eliminates BRCOND when the condition is constant. This optimization shouldn't be necessary, because codegen shouldn't be able to find dead control paths that the IR-level optimizer can't find. And it's undesirable, because it encourages bugpoint to leave "br i1 false" branches in its output. And it wasn't updating the CFG. I updated all the tests I could, but some tests are too reduced and I wasn't able to meaningfully preserve them. llvm-svn: 106748	2010-06-24 15:04:11 +00:00
Dan Gohman	600f62b3ba	Reapply r106634, now that the bug it exposed is fixed. llvm-svn: 106746	2010-06-24 14:30:44 +00:00
Dan Gohman	0695e09b09	Optimize the "bit test" code path for switch lowering in the case where the bit mask has exactly one bit. llvm-svn: 106716	2010-06-24 02:06:24 +00:00
Jakob Stoklund Olesen	dbb58d2974	Revert "Replace a big gob of old coalescer logic with the new CoalescerPair class." Whiny buildbots. llvm-svn: 106710	2010-06-24 00:52:22 +00:00
Jakob Stoklund Olesen	f38e6720cc	Replace a big gob of old coalescer logic with the new CoalescerPair class. CoalescerPair can determine if a copy can be coalesced, and which register gets merged away. The old logic in SimpleRegisterCoalescing had evolved into something a bit too convoluted. llvm-svn: 106701	2010-06-24 00:12:39 +00:00
Bill Wendling	a136521a17	MorphNodeTo doesn't preserve the memory operands. Because we're morphing a node into the same node, but with different non-memory operands, we need to replace the memory operands after it's finished morphing. llvm-svn: 106643	2010-06-23 18:16:24 +00:00
Daniel Dunbar	4df321b7ad	Revert r106263, "Fold the ShrinkDemandedOps pass into the regular DAGCombiner pass,"... it was causing both 'file' (with clang) and 176.gcc (with llvm-gcc) to be miscompiled. llvm-svn: 106634	2010-06-23 17:09:26 +00:00
Jim Grosbach	b58c08b0ba	Some targets don't require the fencing MEMBARRIER instructions surrounding atomic intrinsics, either because the use locking instructions for the atomics, or because they perform the locking directly. Add support in the DAG combiner to fold away the fences. llvm-svn: 106630	2010-06-23 16:07:42 +00:00
Jakob Stoklund Olesen	731ea71f59	Add a few VNInfo data structure checks. llvm-svn: 106627	2010-06-23 15:34:36 +00:00
Daniel Dunbar	ef5a4383ad	Revert r106066, "Create a more targeted fix for not sinking instructions into a range where it"... it causes bzip2 to be miscompiled by Clang. Conflicts: lib/CodeGen/MachineSink.cpp llvm-svn: 106614	2010-06-23 00:48:25 +00:00
Jakob Stoklund Olesen	1023f6bd98	Also convert SUBREG_TO_REG to a KILL when relevant, like the other subreg instructions. This does not affect codegen much because SUBREG_TO_REG is only used by X86 and X86 does not use the register scavenger, but it prevents verifier errors. llvm-svn: 106583	2010-06-22 22:11:07 +00:00
Dan Gohman	3570f81b1e	Move PHIElimination's SplitCriticalEdge for MachineBasicBlocks out into a utility routine, teach it how to update MachineLoopInfo, and make use of it in MachineLICM to split critical edges on demand. llvm-svn: 106555	2010-06-22 17:25:57 +00:00
Jakob Stoklund Olesen	9c47dac677	Remove the SimpleJoin optimization from SimpleRegisterCoalescing. Measurements show that it does not speed up coalescing, so there is no reason the keep the added complexity around. Also clean out some unused methods and static functions. llvm-svn: 106548	2010-06-22 16:13:57 +00:00
Dan Gohman	d2d1ae105d	Use pre-increment instead of post-increment when the result is not used. llvm-svn: 106542	2010-06-22 15:08:57 +00:00
Dan Gohman	2370e2fe0f	When unfolding a load, avoid assuming which instruction that kill and dead flags will end up on. llvm-svn: 106520	2010-06-22 02:07:21 +00:00
Devang Patel	b6e058da18	Use single interface, using twine, to get named metadata. getNamedMetadata(). llvm-svn: 106518	2010-06-22 01:19:38 +00:00
Evan Cheng	37bb617f8a	Tail merging pass shall not break up IT blocks. rdar://8115404 llvm-svn: 106517	2010-06-22 01:18:16 +00:00
Devang Patel	cbc6fd8493	Discard special LLVM prefix from linkage name. llvm-svn: 106516	2010-06-22 01:06:05 +00:00
Devang Patel	ad51735794	Do not rely on Twine temporaries to survive. llvm-svn: 106515	2010-06-22 01:01:58 +00:00
Dan Gohman	851e478e6b	Fix the new load-unfolding code to update LiveVariable's dead flags, in addition to the kill flags. llvm-svn: 106512	2010-06-22 00:32:04 +00:00
Dan Gohman	3c1b3c61e9	Teach two-address lowering how to unfold a load to open up commuting opportunities. For example, this lets it emit this: movq (%rax), %rcx addq %rdx, %rcx instead of this: movq %rdx, %rcx addq (%rax), %rcx in the case where %rdx has subsequent uses. It's the same number of instructions, and usually the same encoding size on x86, but it appears faster, and in general, it may allow better scheduling for the load. llvm-svn: 106493	2010-06-21 22:17:20 +00:00
Dan Gohman	dd41bba517	Use A.append(...) instead of A.insert(A.end(), ...) when A is a SmallVector, and other SmallVector simplifications. llvm-svn: 106452	2010-06-21 19:47:52 +00:00
Dan Gohman	bbc29ea821	Revert r106422, which is breaking the non-fast-isel path. llvm-svn: 106423	2010-06-21 16:02:28 +00:00
Dan Gohman	f64fdd69d0	More changes for non-top-down fast-isel. Split the code for materializing a value out of SelectionDAGBuilder::getValue into a helper function, so that it can be used in other ways. Add a new getNonRegisterValue function which uses it, for use in code which doesn't want a CopyFromReg even when FuncMap.ValueMap already has an entry for it. llvm-svn: 106422	2010-06-21 15:13:54 +00:00
Dan Gohman	f91aff5f13	Do one lookup instead of two. llvm-svn: 106415	2010-06-21 14:21:47 +00:00
Dan Gohman	7c58cf75fa	Generalize this to look in the regular ValueMap in addition to the LocalValueMap, to make it more flexible when fast-isel isn't proceding straight top-down. llvm-svn: 106414	2010-06-21 14:17:46 +00:00
Bob Wilson	4581434c27	Tidy. llvm-svn: 106383	2010-06-19 05:33:57 +00:00
Dan Gohman	8693650422	Teach regular and fast isel to set dead flags on unused implicit defs on calls and similar instructions. llvm-svn: 106353	2010-06-18 23:28:01 +00:00
Jakob Stoklund Olesen	678927e0b1	Only run CoalesceExtSubRegs when we can expect LiveIntervalAnalysis to clean up the inserted INSERT_SUBREGs after us. llvm-svn: 106345	2010-06-18 23:10:20 +00:00
Evan Cheng	2d51c7c592	Allow ARM if-converter to be run after post allocation scheduling. - This fixed a number of bugs in if-converter, tail merging, and post-allocation scheduler. If-converter now runs branch folding / tail merging first to maximize if-conversion opportunities. - Also changed the t2IT instruction slightly. It now defines the ITSTATE register which is read by instructions in the IT block. - Added Thumb2 specific hazard recognizer to ensure the scheduler doesn't change the instruction ordering in the IT block (since IT mask has been finalized). It also ensures no other instructions can be scheduled between instructions in the IT block. This is not yet enabled. llvm-svn: 106344	2010-06-18 23:09:54 +00:00
Jim Grosbach	a57c2885cf	back-end libcall handling for ATOMIC_SWAP (__sync_lock_test_and_set) llvm-svn: 106342	2010-06-18 23:03:10 +00:00
Jakob Stoklund Olesen	07f4fa8198	TwoAddressInstructionPass::CoalesceExtSubRegs can insert INSERT_SUBREG instructions, but it doesn't really understand live ranges, so the first INSERT_SUBREG uses an implicitly defined register. Fix it in LiveVariableAnalysis by adding the <undef> flag. llvm-svn: 106333	2010-06-18 22:29:44 +00:00
Evan Cheng	cf9e8a987f	Fix an inverted condition. llvm-svn: 106330	2010-06-18 22:17:13 +00:00
Evan Cheng	f5d62535a5	Fix cross initialization compilation error. llvm-svn: 106324	2010-06-18 22:01:37 +00:00
Evan Cheng	c0e0d85b18	Teach iff-converter to properly count # of dups. It was not skipping over dbg_value's which resulted in non-duplicated instructions being deleted. rdar://8104384. llvm-svn: 106323	2010-06-18 21:52:57 +00:00
Jim Grosbach	d64dfc1568	Add Expand-to-libcall support for additional atomics. This covers the usual entries used by llvm-gcc. *_[U]MIN and such can be added later if needed. This enables the front ends to simplify handling of the atomic intrinsics by removing the target-specific decision about which targets can handle the intrinsics. llvm-svn: 106321	2010-06-18 21:43:38 +00:00
Dan Gohman	e5457c275d	Don't leak RegClass2VRegMap, which is now a new[] array instead of a std::vector. llvm-svn: 106298	2010-06-18 18:54:05 +00:00
Dan Gohman	882bb2984e	Start TargetRegisterClass indices at 0 instead of 1, so that MachineRegisterInfo doesn't have to confusingly allocate an extra entry. llvm-svn: 106296	2010-06-18 18:13:55 +00:00
Bob Wilson	f82c8fcc58	Fix PR7372: Conditional branches (at least on ARM) are treated as predicated, so when IfConverter::CopyAndPredicateBlock checks to see if it should ignore an instruction because it is a branch, it should not check if the branch is predicated. This case (when IgnoreBr is true) is only relevant from IfConvertTriangle, where new branches are inserted after the block has been copied and predicated. If the original branch is not removed, we end up with multiple conditional branches (possibly conflicting) at the end of the block. Aside from any immediate errors resulting from that, this confuses the AnalyzeBranch functions so that the branches are not analyzable. That in turn causes the IfConverter to think that the "Simple" pattern can be applied, and things go downhill fast because the "Simple" pattern does _not_ apply if the block can fall through. This is pretty fragile. If there are other degenerate cases where AnalyzeBranch fails, but where the block may still fall through, the IfConverter should not perform its "Simple" if-conversion. But, I don't know how to do that with the current AnalyzeBranch interface, so for now, the best thing seems to be to avoid creating branches that AnalyzeBranch cannot handle. Evan, please review! llvm-svn: 106291	2010-06-18 17:07:23 +00:00
Dan Gohman	9f58b3e106	Don't bother calling releaseMemory before destroying the DominatorTreeBase. llvm-svn: 106287	2010-06-18 16:09:11 +00:00
Dan Gohman	7edb39cc6b	Minor code simplifications. llvm-svn: 106286	2010-06-18 16:00:29 +00:00
Dan Gohman	6e681a5fbe	Give NamedRegionTimer an Enabled flag, allowing all its clients to switch from this: if (TimePassesIsEnabled) { NamedRegionTimer T(Name, GroupName); do_something(); } else { do_something(); // duplicate the code, this time without a timer! } to this: { NamedRegionTimer T(Name, GroupName, TimePassesIsEnabled); do_something(); } llvm-svn: 106285	2010-06-18 15:56:31 +00:00
Dan Gohman	96ca25eba5	Don't replace the old Ordering object with a new one; just clear() the old one. llvm-svn: 106284	2010-06-18 15:40:58 +00:00
Dan Gohman	a4f46b3ef8	Don't call clear() on DbgInfo when it's going to be deleted anyway. Don't replace the old DbgInfo with a new one when clear() on the old one is sufficient. llvm-svn: 106283	2010-06-18 15:36:18 +00:00
Dan Gohman	92c11acdb8	Change UpdateNodeOperands' operand and return value from SDValue to SDNode *, since it doesn't care about the ResNo value. llvm-svn: 106282	2010-06-18 15:30:29 +00:00
Dan Gohman	f1d8304fe3	Eliminate unnecessary uses of getZExtValue(). llvm-svn: 106279	2010-06-18 14:22:04 +00:00
Dan Gohman	35b6f9a929	isValueValidForType can be a static member function. llvm-svn: 106278	2010-06-18 14:01:07 +00:00
Dan Gohman	b92156d5e4	Fold the ShrinkDemandedOps pass into the regular DAGCombiner pass, which is faster, simpler, and less surprising. llvm-svn: 106263	2010-06-18 01:05:21 +00:00
Dan Gohman	0883789ec4	Handle ext(ext(x)) -> ext(x) immediately, since it's simple. llvm-svn: 106256	2010-06-18 00:08:30 +00:00
Stuart Hastings	0125b6410a	Add a DebugLoc parameter to TargetInstrInfo::InsertBranch(). This addresses a longstanding deficiency noted in many FIXMEs scattered across all the targets. This effectively moves the problem up one level, replacing eleven FIXMEs in the targets with eight FIXMEs in CodeGen, plus one path through FastISel where we actually supply a DebugLoc, fixing Radar 7421831. llvm-svn: 106243	2010-06-17 22:43:56 +00:00
Jim Grosbach	0ed5b460dc	add missing break. inconsequential as the code shouldn't be reached, but for correctness' sake, it should be there. llvm-svn: 106229	2010-06-17 17:58:54 +00:00
Jim Grosbach	3aeae8aeeb	Add entries for Expanding atomic intrinsics to libcalls. Just a placeholder for the moment. The implementation of the libcall will follow. Currently, the llvm-gcc knows when the intrinsics can be correctly handled by the back end and only generates them in those cases, issuing libcalls directly otherwise. That's too much coupling. The intrinsics should always be generated and the back end decide how to handle them, be it with a libcall, inline code, or whatever. This patch is a step in that direction. rdar://8097623 llvm-svn: 106227	2010-06-17 17:50:54 +00:00
Jim Grosbach	ba451e80dc	ISD::MEMBARRIER should lower to a libcall (__sync_synchronize) if the target sets the legalize action to Expand. llvm-svn: 106203	2010-06-17 02:00:53 +00:00
Jakob Stoklund Olesen	207cd4bbd7	Allow a register to be redefined multiple times in a basic block. LiveVariableAnalysis was a bit picky about a register only being redefined once, but that really isn't necessary. Here is an example of chained INSERT_SUBREGs that we can handle now: 68 %reg1040<def> = INSERT_SUBREG %reg1040, %reg1028<kill>, 14 register: %reg1040 +[70,134:0) 76 %reg1040<def> = INSERT_SUBREG %reg1040, %reg1029<kill>, 13 register: %reg1040 replace range with [70,78:1) RESULT: %reg1040,0.000000e+00 = [70,78:1)[78,134:0) 0@78-(134) 1@70-(78) 84 %reg1040<def> = INSERT_SUBREG %reg1040, %reg1030<kill>, 12 register: %reg1040 replace range with [78,86:2) RESULT: %reg1040,0.000000e+00 = [70,78:1)[78,86:2)[86,134:0) 0@86-(134) 1@70-(78) 2@78-(86) 92 %reg1040<def> = INSERT_SUBREG %reg1040, %reg1031<kill>, 11 register: %reg1040 replace range with [86,94:3) RESULT: %reg1040,0.000000e+00 = [70,78:1)[78,86:2)[86,94:3)[94,134:0) 0@94-(134) 1@70-(78) 2@78-(86) 3@86-(94) rdar://problem/8096390 llvm-svn: 106152	2010-06-16 21:29:40 +00:00
Jim Grosbach	6c0da25129	add FIXME llvm-svn: 106126	2010-06-16 18:45:08 +00:00
Bill Wendling	d71bd63600	Improve comment to include that the use of a preg is also verboten in this situation. llvm-svn: 106119	2010-06-16 18:01:31 +00:00
Evan Cheng	f128bdcb55	Make post-ra scheduling, anti-dep breaking, and register scavenger (conservatively) aware of predicated instructions. This enables ARM to move if-conversion before post-ra scheduler. llvm-svn: 106091	2010-06-16 07:35:02 +00:00
Devang Patel	a6d20f446f	Use separate named MDNode to hold each function's local variable info. This speeds up local variable handling in DwarfDebug. llvm-svn: 106075	2010-06-16 00:53:55 +00:00
Eric Christopher	b672ab9b53	Don't emit the linkage for initializer label for mach-o tls. llvm-svn: 106073	2010-06-16 00:27:30 +00:00
Bill Wendling	8c0cf0994d	Create a more targeted fix for not sinking instructions into a range where it will conflict with another live range. The place which creates this scenerio is the code in X86 that lowers a select instruction by splitting the MBBs. This eliminates the need to check from the bottom up in an MBB for live pregs. llvm-svn: 106066	2010-06-15 23:46:31 +00:00
Stuart Hastings	9b5005cd4b	Added a comment. llvm-svn: 106063	2010-06-15 23:06:30 +00:00
Bob Wilson	8105144fcd	Fix 80col violations, remove trailing whitespace, and clarify a comment. llvm-svn: 106057	2010-06-15 22:18:54 +00:00
Jakob Stoklund Olesen	ec2e964fd6	Remove the local register allocator. Please use the fast allocator instead. llvm-svn: 106051	2010-06-15 21:58:33 +00:00
Mon P Wang	7a84689cc5	Fixed vector widening of binary instructions that can trap. Patch by Visa Putkinen! llvm-svn: 106038	2010-06-15 20:29:05 +00:00
Bob Wilson	fc7d739422	IfConversion's AnalyzeBlocks method always returns false; clean it up. llvm-svn: 106027	2010-06-15 18:57:15 +00:00
Jim Grosbach	c964585ff8	fix naming llvm-svn: 106024	2010-06-15 18:53:34 +00:00
Jakob Stoklund Olesen	6e54c908e0	Fix an exotic bug that only showed up in an internal test case. SimpleRegisterCoalescing::JoinIntervals() uses CoalescerPair to determine if a copy is coalescable, and in very rare cases it can return true where LHS is not live - the coalescable copy can come from an alias of the physreg in LHS. llvm-svn: 106021	2010-06-15 18:49:14 +00:00
Bob Wilson	5947573f39	Fix a comment typo. llvm-svn: 106015	2010-06-15 18:19:27 +00:00
Bob Wilson	de94e66234	Add some missing checks for the case where the extract_subregs are combined to an insert_subreg, i.e., where the destination register is larger than the source. We need to check that the subregs can be composed for that case in a symmetrical way to the case when the destination is smaller. llvm-svn: 106004	2010-06-15 17:27:54 +00:00
Jakob Stoklund Olesen	246e9a07a2	Avoid processing early clobbers twice in RegAllocFast. Early clobbers defining a virtual register were first alocated to a physreg and then processed as a physreg EC, spilling the virtreg. This fixes PR7382. llvm-svn: 105998	2010-06-15 16:20:57 +00:00
Jakob Stoklund Olesen	82eca35b3e	Add CoalescerPair helper class. Given a copy instruction, CoalescerPair can determine which registers to coalesce in order to eliminate the copy. It deals with all the subreg fun to determine a tuple (DstReg, SrcReg, SubIdx) such that: - SrcReg is a virtual register that will disappear after coalescing. - DstReg is a virtual or physical register whose live range will be extended. - SubIdx is 0 when DstReg is a physical register. - SrcReg can be joined with DstReg:SubIdx. CoalescerPair::isCoalescable() determines if another copy instruction is compatible with the same tuple. This fixes some NEON miscompilations where shuffles are getting coalesced as if they were copies. The CoalescerPair class will replace a lot of the spaghetti logic in JoinCopy later. llvm-svn: 105997	2010-06-15 16:04:21 +00:00
Bob Wilson	a55b8877e6	Generalize the pre-coalescing of extract_subregs feeding reg_sequences, replacing the overly conservative checks that I had introduced recently to deal with correctness issues. This makes a pretty noticable difference in our testcases where reg_sequences are used. I've updated one test to check that we no longer emit the unnecessary subreg moves. llvm-svn: 105991	2010-06-15 05:56:31 +00:00
Ted Kremenek	d52caa5244	Update CMake build. llvm-svn: 105987	2010-06-15 04:08:14 +00:00
Jim Grosbach	412800d346	More dbg_value cleanup so the presence of debug info doesn't affect code-gen. Make sure to skip the dbg_value instructions when moving dups out of the diamond. rdar://7797940 llvm-svn: 105965	2010-06-14 21:30:32 +00:00
Evan Cheng	078f4cec21	- Do away with SimpleHazardRecognizer.h. It's not used and offers little value. - Rename ExactHazardRecognizer to PostRAHazardRecognizer and move its header to include to allow targets to extend it. llvm-svn: 105959	2010-06-14 21:06:53 +00:00
Evan Cheng	a397ada078	Avoid uncessary array copying. llvm-svn: 105955	2010-06-14 20:18:40 +00:00
Chris Lattner	0fc88efda3	fix a -Wbool-conversions warning from clang. llvm-svn: 105942	2010-06-14 18:28:34 +00:00
Bill Wendling	5d6103318a	When performing the Horrible Hack(tm-Duncan) on the EH code to convert a clean-up to a catch-all after inlining, take into account that there could be filter IDs as well. The presence of filters don't mean that the selector catches anything. It's just metadata information. llvm-svn: 105872	2010-06-12 02:34:29 +00:00
Evan Cheng	e60273fd70	Allow target to provide its own hazard recognizer to post-ra scheduler. llvm-svn: 105862	2010-06-12 00:12:18 +00:00
Evan Cheng	cb1fe56fd9	Code formatting. llvm-svn: 105861	2010-06-12 00:11:53 +00:00
Stuart Hastings	afe54f1625	Support for nested functions/classes in debug output. (Again.) Radar 7424645. llvm-svn: 105828	2010-06-11 20:08:44 +00:00
Evan Cheng	38f6560461	Code refactoring, no functionality changes. llvm-svn: 105775	2010-06-10 02:09:31 +00:00
Jakob Stoklund Olesen	8bc5eca331	Mark physregs defined by inline asm as implicit. This is a bit of a hack to make inline asm look more like call instructions. It would be better to produce correct dead flags during isel. llvm-svn: 105749	2010-06-09 20:05:00 +00:00
Evan Cheng	a0746bd50a	Allow target to place 2-address pass inserted copies in better spots. Thumb2 will use this to try to avoid breaking up IT blocks. llvm-svn: 105745	2010-06-09 19:26:01 +00:00
Bill Wendling	5ac1d23d3d	It's an error to translate this: %reg1025 = <sext> %reg1024 ... %reg1026 = SUBREG_TO_REG 0, %reg1024, 4 into this: %reg1025 = <sext> %reg1024 ... %reg1027 = EXTRACT_SUBREG %reg1025, 4 %reg1026 = SUBREG_TO_REG 0, %reg1027, 4 The problem here is that SUBREG_TO_REG is there to assert that an implicit zext occurs. It doesn't insert a zext instruction. If we allow the EXTRACT_SUBREG here, it will give us the value after the <sext>, not the original value of %reg1024 before <sext>. llvm-svn: 105741	2010-06-09 19:00:55 +00:00
Jakob Stoklund Olesen	a13b1c29b0	Add argument name comments. llvm-svn: 105665	2010-06-09 00:40:31 +00:00
Bob Wilson	7149cfcda3	Fix a mistake in my previous change r105437: don't access operand 2 and assume that it is an immediate before checking that the instruction is an EXTRACT_SUBREG. llvm-svn: 105585	2010-06-07 23:48:46 +00:00
Dan Gohman	7398758719	Add some basic debug output. llvm-svn: 105561	2010-06-07 22:32:10 +00:00
Jim Grosbach	6201b991a2	Cleanup. Process the dbg_values separately llvm-svn: 105554	2010-06-07 21:28:55 +00:00
Jim Grosbach	0f445f328e	Move exit check where it really belongs. llvm-svn: 105541	2010-06-07 19:12:21 +00:00
Stuart Hastings	3ca391027f	Revert 105492 & 105493 due to a testcase regression. Radar 7424645. llvm-svn: 105511	2010-06-05 00:39:29 +00:00
Dale Johannesen	df1a7f83bf	Fix some liveout handling related to tail calls, see comments. I don't think this ever resulted in problems on x86, but it would on ARM. llvm-svn: 105509	2010-06-05 00:30:45 +00:00
Evan Cheng	a03e6f85fe	Re-apply 105308 with fix. llvm-svn: 105502	2010-06-04 23:28:13 +00:00
Jim Grosbach	a1e08fb256	Make if-conversion ignore dbg_value instructions in its analysis. rdar://7797940 llvm-svn: 105498	2010-06-04 23:01:26 +00:00
Stuart Hastings	7c015988fe	Support for nested functions/classes in debug output. Radar 7424645. llvm-svn: 105492	2010-06-04 22:36:03 +00:00
Jim Grosbach	50d229e6b3	Skip dbg_value instructions when scanning instructions in register scavenging. llvm-svn: 105481	2010-06-04 20:18:30 +00:00
Jakob Stoklund Olesen	864827afb0	Keep track of the call instructions whose clobber lists were skipped during fast register allocation. Process all of the clobber lists at the end of the function, marking the registers as used in MachineRegisterInfo. This is necessary in case the calls clobber callee-saved registers (sic). llvm-svn: 105473	2010-06-04 18:08:29 +00:00
Mon P Wang	622cdd2297	Fixed a bug during widening where we would avoid legalizing a node. When we replace an OpA with a widened OpB, it is possible to get new uses of OpA due to CSE when recursively updating nodes. Since OpA has been processed, the new uses are not examined again. The patch checks if this occurred and it it did, updates the new uses of OpA to use OpB. llvm-svn: 105453	2010-06-04 01:20:10 +00:00
Bob Wilson	a733daf18c	Add some missing checks in TwoAddressInstructionPass::CoalesceExtSubRegs. Check that all the instructions are in the same basic block, that the EXTRACT_SUBREGs write to the same subregs that are being extracted, and that the source and destination registers are in the same regclass. Some of these constraints can be relaxed with a bit more work. Jakob suggested that the loop that checks for subregs when NewSubIdx != 0 should use the "nodbg" iterator, so I made that change here, too. llvm-svn: 105437	2010-06-03 23:53:58 +00:00
Jim Grosbach	01edd68225	Cleanup 80-column and trim trailing whitespace llvm-svn: 105435	2010-06-03 23:49:57 +00:00
Dan Gohman	d83e3e7750	Fix SimplifyDemandedBits' AssertZext logic to demand all the bits. It needs to demand the high bits because it's asserting that they're zero. llvm-svn: 105406	2010-06-03 20:21:33 +00:00
Bob Wilson	30093b5d8b	Revert 105308. llvm-svn: 105399	2010-06-03 18:28:31 +00:00
Bill Wendling	f82aea634c	Machine sink could potentially sink instructions into a block where the physical registers it defines then interfere with an existing preg live range. For instance, if we had something like these machine instructions: BB#0 ... = imul ... EFLAGS<imp-def,dead> test ..., EFLAGS<imp-def> jcc BB#2 EFLAGS<imp-use> BB#1 ... ; fallthrough to BB#2 BB#2 ... ; No code that defines EFLAGS jcc ... EFLAGS<imp-use> Machine sink will come along, see that imul implicitly defines EFLAGS, but because it's "dead", it assumes that it can move imul into BB#2. But when it does, imul's "dead" imp-def of EFLAGS is raised from the dead (a zombie) and messes up the condition code for the jump (and pretty much anything else which relies upon it being correct). The solution is to know which pregs are live going into a basic block. However, that information isn't calculated at this point. Nor does the LiveVariables pass take into account non-allocatable physical registers. In lieu of this, we do a very conservative pass through the basic block to determine if a preg is live coming out of it. llvm-svn: 105387	2010-06-03 07:54:20 +00:00
Eric Christopher	f67fe3b1e8	One underscore, not two. llvm-svn: 105379	2010-06-03 04:02:59 +00:00
Eli Friedman	dbbbf73c96	Implement expansion in type legalization for add/sub with overflow. The expansion is the same as that used by LegalizeDAG. The resulting code sucks in terms of performance/codesize on x86-32 for a 64-bit operation; I haven't looked into whether different expansions might be better in general. llvm-svn: 105378	2010-06-03 03:49:50 +00:00
Jakob Stoklund Olesen	4029596f93	Use the fast register allocator by default for -O0 builds. This affects both llvm-gcc and clang. llvm-svn: 105372	2010-06-03 00:39:06 +00:00
Jakob Stoklund Olesen	818e4df2b4	Use readsWritesVirtualRegister instead of counting uses and defs when inserting spills and reloads. This means that a partial define of a register causes a reload so the other parts of the register are preserved. The reload can be prevented by adding an <imp-def> operand for the full register. This is already done by the coalescer and live interval analysis where relevant. llvm-svn: 105369	2010-06-03 00:07:47 +00:00
Jakob Stoklund Olesen	42c642cd24	Add full register <imp-def> operands when the coalescer is creating partial register updates. These operands tell the spiller that the other parts of the partially defined register are don't-care, and a reload is not necessary. llvm-svn: 105361	2010-06-02 23:22:11 +00:00
Bill Wendling	7ee730eb40	Compulsive reformating. No functionalitical changes. llvm-svn: 105359	2010-06-02 23:04:26 +00:00
Jakob Stoklund Olesen	a8ad97743d	Slightly change the meaning of the reMaterialize target hook when the original instruction defines subregisters. Any existing subreg indices on the original instruction are preserved or composed with the new subreg index. Also substitute multiple operands mentioning the original register by using the new MachineInstr::substituteRegister() function. This is necessary because there will soon be <imp-def> operands added to non read-modify-write partial definitions. This instruction: %reg1234:foo = FLAP %reg1234<imp-def> will reMaterialize(%reg3333, bar) like this: %reg3333:bar-foo = FLAP %reg333:bar<imp-def> Finally, replace the TargetRegisterInfo pointer argument with a reference to indicate that it cannot be NULL. llvm-svn: 105358	2010-06-02 22:47:25 +00:00
Rafael Espindola	f2dffcef82	Remove the TargetRegisterClass member from CalleeSavedInfo llvm-svn: 105344	2010-06-02 20:02:30 +00:00
Devang Patel	c2254f6b98	Skip identical instruction while calculating DBG_VALUE range. llvm-svn: 105340	2010-06-02 19:05:13 +00:00
Bob Wilson	2d35a9e810	Rename canCombinedSubRegIndex method to something more grammatically correct and tidy up the comment describing it. llvm-svn: 105339	2010-06-02 18:54:47 +00:00
Devang Patel	21ccf05b4c	Use local small vector. llvm-svn: 105332	2010-06-02 16:42:51 +00:00
Jim Grosbach	848548300d	Not all entries in the range will have an SUnit. Check for that when looking for debug information. llvm-svn: 105324	2010-06-02 15:29:36 +00:00
Rafael Espindola	c08ecba597	Remove uses of getCalleeSavedRegClasses from outside the backends and removes the virtual declaration. With that out of the way I should be able to cleanup one backend at a time. llvm-svn: 105321	2010-06-02 12:39:06 +00:00
Evan Cheng	a2da22734f	Enable machine cse of instructions which define physical registers. llvm-svn: 105308	2010-06-02 01:08:27 +00:00
Bob Wilson	f4a34b97b8	Fix an obvious mistake: don't change the operands until all of them have been checked and it is safe to proceed with the changes. llvm-svn: 105304	2010-06-02 00:16:08 +00:00
Jim Grosbach	12ac8f0352	Update debug information when breaking anti-dependencies. rdar://7759363 llvm-svn: 105300	2010-06-01 23:48:44 +00:00
Jakob Stoklund Olesen	7b0ac865a4	Properly compose subregister indices when coalescing. The comment about ordering of subreg indices is no longer true. This exposed a bug in the new substVirtReg method that is also fixed. llvm-svn: 105294	2010-06-01 22:39:25 +00:00
Devang Patel	d43e0ca916	Ignore line number of debug value in undefined register. llvm-svn: 105292	2010-06-01 21:43:09 +00:00
Devang Patel	b0c76394a3	Keep track of incoming debug value of unused argument. Radar 7927666. llvm-svn: 105285	2010-06-01 19:59:01 +00:00
Dan Gohman	b782caa393	Fill in missing support for ISD::FEXP, ISD::FPOWI, and friends. llvm-svn: 105283	2010-06-01 18:35:14 +00:00
Jim Grosbach	b24d5c6ce2	Add a FIXME llvm-svn: 105282	2010-06-01 18:06:35 +00:00
Jim Grosbach	74d8345512	When processing function arguments when splitting live ranges across invokes, handle structs passed by value via an extract/insert pair, as a bitcast won't work on a struct. rdar://7742824 llvm-svn: 105280	2010-06-01 18:04:09 +00:00
Chris Lattner	14c46517b5	fix PR6623: when optimizing for size, don't inline memcpy/memsets that are too large. This causes the freebsd bootloader to be too large apparently. It's unclear if this should be an -Os or -Oz thing. Thoughts welcome. llvm-svn: 105228	2010-05-31 17:30:14 +00:00
Chris Lattner	b4a773b452	the 'limit' argument to FindOptimalMemOpLowering is unsigned, not uint64_t. llvm-svn: 105226	2010-05-31 17:12:23 +00:00
Oscar Fuentes	a97311f152	Use `llvm::next' instead of `next' to make VC++ 2010 happy. llvm-svn: 105168	2010-05-30 13:14:21 +00:00
Dan Gohman	4db93c9700	Reorder some code in SelectionDAGBuilder. llvm-svn: 105105	2010-05-29 17:53:24 +00:00
Dan Gohman	d16aa541af	SelectionDAG shouldn't have a FunctionLoweringInfo member. RegsForValue shouldn't have a TargetLoweringInfo member. And FunctionLoweringInfo::set doesn't needs its EnableFastISel argument. llvm-svn: 105101	2010-05-29 17:03:36 +00:00
Benjamin Kramer	c488e92f0b	Remove unused function. llvm-svn: 105100	2010-05-29 14:03:51 +00:00
Evan Cheng	707b7cc429	Remove schedule-livein-copies. It's not being used. llvm-svn: 105095	2010-05-29 02:23:39 +00:00
Jakob Stoklund Olesen	ab6223949e	Handle composed subreg indices when processing REQ_SEQUENCE instructions. llvm-svn: 105066	2010-05-29 00:14:14 +00:00
Evan Cheng	032f3261a2	Doh. Machine LICM is re-initializing the CSE map over and over. Patch by Anna Zaks. rdar://8037934. llvm-svn: 105065	2010-05-29 00:06:36 +00:00
Evan Cheng	cc2efe11db	Fix some latency computation bugs: if the use is not a machine opcode do not just return zero. llvm-svn: 105061	2010-05-28 23:26:21 +00:00
Jakob Stoklund Olesen	64824ea99f	Add a TargetRegisterInfo::composeSubRegIndices hook with a default implementation that is correct for most targets. Tablegen will override where needed. Add MachineOperand::subst{Virt,Phys}Reg methods that correctly handle existing subreg indices when sustituting registers. llvm-svn: 104985	2010-05-28 18:18:53 +00:00
Stuart Hastings	c1e216583f	Revert 104841, 104842, 104876 due to buildbot failures. Radar 7424645. llvm-svn: 104953	2010-05-28 16:41:07 +00:00
Dan Gohman	2140a74979	Eliminate the restriction that the array size in an alloca must be i32. This will help reduce the amount of casting required on 64-bit targets. llvm-svn: 104911	2010-05-28 01:14:11 +00:00
Jakob Stoklund Olesen	b613ae2c89	Add a -regalloc=default option that chooses a register allocator based on the -O optimization level. This only really affects llc for now because both the llvm-gcc and clang front ends override the default register allocator. I intend to remove that code later. llvm-svn: 104904	2010-05-27 23:57:25 +00:00
Jim Grosbach	faa3abbe39	Update the saved stack pointer in the sjlj function context following either an alloca() or an llvm.stackrestore(). rdar://8031573 llvm-svn: 104900	2010-05-27 23:49:24 +00:00
Jim Grosbach	c9f532dddc	back out 104862/104869. Can reuse stacksave after all. Very cool. llvm-svn: 104897	2010-05-27 23:11:57 +00:00
Devang Patel	7a9dedf0ab	Do not drop location info for inlined function args. llvm-svn: 104884	2010-05-27 20:25:04 +00:00
Jim Grosbach	b68dfb45f5	hook ISD::STACKADDR to an intrinsic llvm-svn: 104869	2010-05-27 18:52:11 +00:00
Devang Patel	5e6b71ce34	inlined function's arguments need a label to mark the start point because they are not directly attached to current function. llvm-svn: 104848	2010-05-27 16:47:30 +00:00
Stuart Hastings	8e99e50d08	Support for nested functions/classes in debug output. Radar 7424645. llvm-svn: 104841	2010-05-27 16:16:54 +00:00
Devang Patel	6b9a9fe207	Simplify. Eliminate unneeded debug_loc entry. llvm-svn: 104785	2010-05-26 23:55:23 +00:00
Bill Wendling	ddee3cb163	Add FIXME comment to remove this. llvm-svn: 104749	2010-05-26 21:53:50 +00:00
Daniel Dunbar	b33dfbcba4	MC: Add TargetMachine support for setting the value of MCRelaxAll with -filetype=obj. llvm-svn: 104747	2010-05-26 21:48:55 +00:00
Devang Patel	acc32a5c19	There is no need to force an line number entry (using previous location) for a temp label at unknown location. llvm-svn: 104740	2010-05-26 21:23:46 +00:00
Bill Wendling	27311269cb	Add "setjmp_syscall", "savectx", "qsetjmp", "vfork", "getcontext" to the list of usual suspects that could "return twice". llvm-svn: 104737	2010-05-26 20:39:00 +00:00
Jim Grosbach	c98892fdaa	Adjust eh.sjlj.setjmp to properly have a chain and to have an opcode entry in ISD::. No functional change. llvm-svn: 104734	2010-05-26 20:22:18 +00:00
Devang Patel	1b08572a66	Update debug info when live-in reg is copied into a vreg. llvm-svn: 104732	2010-05-26 20:18:50 +00:00
Bill Wendling	0c3bfd3fb0	Move the check for "calls setjmp" to SelectionDAGISel so that it can be used by more than just the stack slot coloring algorithm. llvm-svn: 104722	2010-05-26 19:46:12 +00:00
Devang Patel	002d54ddc9	Identify instructions, that needs a label to mark debug info entity, in advance. This simplifies beginScope(). llvm-svn: 104720	2010-05-26 19:37:24 +00:00
Dan Gohman	52c2738324	Eliminate the use of PriorityQueue and just use a std::vector, implementing pop with a linear search for a "best" element. The priority queue was a neat idea, but in practice the comparison functions depend on dynamic information. llvm-svn: 104718	2010-05-26 18:52:00 +00:00
Dan Gohman	1e5d0b0456	Delete an unused function. llvm-svn: 104716	2010-05-26 18:34:12 +00:00
Devang Patel	95fcc96752	Remove dead code. llvm-svn: 104706	2010-05-26 17:42:50 +00:00
Devang Patel	5a5e0bc3b5	Do not construct location list backword! llvm-svn: 104705	2010-05-26 17:29:32 +00:00
Eric Christopher	e805ea9e39	Temporarily revert r104655 as it's breaking the bots. llvm-svn: 104664	2010-05-26 01:59:55 +00:00
Dan Gohman	7c00576a62	Change push_all to a non-virtual function and implement it in the base class, since all the implementations are the same. llvm-svn: 104659	2010-05-26 01:10:55 +00:00
Dan Gohman	3701b3928e	Trim #include. llvm-svn: 104657	2010-05-26 00:55:59 +00:00
Bill Wendling	c5222d6c38	Dale and Evan suggested putting the "check for setjmp" much earlier in the machine code generation. That's a good idea, so I made it so. llvm-svn: 104655	2010-05-26 00:32:40 +00:00
Devang Patel	9fc11706e3	First cut at supporting .debug_loc section. This is used to track variable information. llvm-svn: 104649	2010-05-25 23:40:22 +00:00
Bill Wendling	388f638511	Constify function. llvm-svn: 104646	2010-05-25 22:02:22 +00:00
Dan Gohman	ce3269b815	Do one map lookup instead of two. llvm-svn: 104645	2010-05-25 21:59:42 +00:00
Eric Christopher	f3925438e5	Move the verbose asm output up a bit so it can be used in the special cases as well. llvm-svn: 104642	2010-05-25 21:49:43 +00:00
Bill Wendling	b04ef0cfbc	Okay, bear with me here... If you have a setjmp/longjmp situation, it's possible for stack slot coloring to reuse a stack slot before it's really dead. For instance, if we have something like this: 1: y = g; x = sigsetjmp(env, 0); switch (x) { case 1: /* ... / goto run; case 0: run: do_run(); / marked as "no return" / break; case 3: if (...) { / ... / goto run; } / ... */ break; } 2: g = y; "y" may be put onto the stack, so the expression "g = y" is relying upon the fact that the stack slot containing "y" isn't modified between (1) and (2). But it can be, because of the "no return" calls in there. A longjmp might come back with 3, modify the stack slot, and then go to case 0. And it's perfectly acceptable to reuse the stack slot there because there's no CFG flow from case 3 to (2). The fix is to disable certain optimizations in these situations. Ideally, we'd disable them for all "returns twice" functions. But we don't support that attribute. Check for "setjmp" and "sigsetjmp" instead. llvm-svn: 104640	2010-05-25 21:44:26 +00:00
Eric Christopher	19a4b843cc	Add support for initialized global data for darwin tls. Update comments and testcases accordingly. llvm-svn: 104635	2010-05-25 21:28:50 +00:00
Jakob Stoklund Olesen	1ad0d5e25b	Print symbolic SubRegIndex names on machine operands. llvm-svn: 104628	2010-05-25 19:49:38 +00:00
Dale Johannesen	60fe2cdc4f	Fix another variant of PR 7191. Also add a testcase Mon Ping provided; unfortunately bugpoint failed to reduce it, but I think it's important to have a test for this in the suite. 8023512. llvm-svn: 104624	2010-05-25 18:47:23 +00:00
Dale Johannesen	ff384ad981	Fix PR 7191. I have been unable to create a .ll file that fails, sorry. (oye, a word which should be better known to people writing tree traversals, means grandchild.) llvm-svn: 104619	2010-05-25 17:50:03 +00:00
Jakob Stoklund Olesen	adff18518a	Disable invalid coalescer assertion. llvm-svn: 104574	2010-05-25 00:15:18 +00:00
Bill Wendling	0b7488e8d5	Print out the name of the function during SSC. llvm-svn: 104572	2010-05-24 23:16:04 +00:00
Evan Cheng	1b79babdec	Avoid adding duplicate function live-in's. llvm-svn: 104560	2010-05-24 21:33:37 +00:00
Devang Patel	51b37e0bd8	Do not emit line number entries for unknown debug values. This fixes recent regression in store.exp from gdb testsuite. llvm-svn: 104524	2010-05-24 18:26:49 +00:00
Nicolas Geoffray	c5327226e4	Encode the Caml frametable by following what the comment says: the number of descriptors is first emitted, and StackOffsets are emitted in 16 bits. llvm-svn: 104488	2010-05-24 12:24:11 +00:00
Daniel Dunbar	3ff1a06de6	MC: Add an MCLoggingStreamer, for use in debugging integrated-as mismatches. llvm-svn: 104463	2010-05-23 17:44:06 +00:00
Evan Cheng	168ced94d8	Implement @llvm.returnaddress. rdar://8015977. llvm-svn: 104421	2010-05-22 01:47:14 +00:00
Jim Grosbach	bd9485db63	Implement eh.sjlj.longjmp for ARM. Clean up the intrinsic a bit. Followups: docs patch for the builtin and eh.sjlj.setjmp cleanup to match longjmp. llvm-svn: 104419	2010-05-22 01:06:18 +00:00
Eric Christopher	6fdea1bda8	Add full bss data support for darwin tls variables. llvm-svn: 104414	2010-05-22 00:10:22 +00:00
Devang Patel	4a8e6e83dc	Collect variable information during endFunction() instead of beginFunction(). llvm-svn: 104412	2010-05-22 00:04:14 +00:00
Bob Wilson	61438fe064	Clean up extra whitespace. llvm-svn: 104410	2010-05-21 23:53:55 +00:00
Eric Christopher	53ff992dde	Make this LookAheadLimit, not the uninitialized LookAheadLeft. Evan please verify! llvm-svn: 104408	2010-05-21 23:40:03 +00:00
Evan Cheng	2c8bdead9e	Allow machine cse to cse instructions which define physical registers. Controlled by option -machine-cse-phys-defs. llvm-svn: 104385	2010-05-21 21:22:19 +00:00
Bob Wilson	51d9ee3ff6	Change CodeGen/ARM/2009-11-02-NegativeLane.ll to use 16-bit vector elements so that it will continue to test what it was meant to test when I commit a separate change for better support of BUILD_VECTOR and VECTOR_SHUFFLE for Neon. Fix a DAG combiner crash exposed by this test change. llvm-svn: 104380	2010-05-21 21:05:32 +00:00
Evan Cheng	3858451e09	- Change MachineInstr::findRegisterDefOperandIdx so it can also look for defs that are aliases of the specified register. - Rename modifiesRegister to definesRegister since it's looking a def of the specific register or one of its super-registers. It's not looking for def of a sub-register or alias that could change the specified register. - Added modifiesRegister to look for defs of aliases. llvm-svn: 104377	2010-05-21 20:53:24 +00:00
Jakob Stoklund Olesen	7d7f604321	Add MachineInstr::readsWritesVirtualRegister() to determine if an instruction reads or writes a register. This takes partial redefines and undef uses into account. Don't actually use it yet. That caused miscompiles. llvm-svn: 104372	2010-05-21 20:02:01 +00:00
Devang Patel	1782aae355	Simplify llvm-svn: 104338	2010-05-21 18:49:09 +00:00
Chris Lattner	a81e1cab04	constify accessor. llvm-svn: 104325	2010-05-21 17:47:50 +00:00
Jakob Stoklund Olesen	b4e1687270	Revert "Use MachineInstr::readsWritesVirtualRegister to determine if a register is read." This reverts r104322. I think it was causing miscompilations. llvm-svn: 104323	2010-05-21 17:36:32 +00:00
Jakob Stoklund Olesen	8e8e090301	Use MachineInstr::readsWritesVirtualRegister to determine if a register is read. This correctly handles partial redefines and undef uses. llvm-svn: 104322	2010-05-21 16:42:30 +00:00
Jakob Stoklund Olesen	a648c6a757	Teach VirtRegRewriter to handle spilling in instructions that have multiple definitions of the virtual register. This happens when spilling the registers produced by REG_SEQUENCE: %reg1047:5<def>, %reg1047:6<def>, %reg1047:7<def> = VLD3d8 %reg1033, 0, pred:14, pred:%reg0 The rewriter would spill the register multiple times, dead store elimination tried to keep up, but ended up cutting the branch it was sitting on. llvm-svn: 104321	2010-05-21 16:36:13 +00:00
Jakob Stoklund Olesen	1f3801062d	If the first definition of a virtual register is a partial redef, add an <imp-def> operand for the full register. This ensures that the full physical register is marked live after register allocation. llvm-svn: 104320	2010-05-21 16:32:16 +00:00
Evan Cheng	725211e948	Rename -pre-RA-sched=hybrid to -pre-RA-sched=list-hybrid. llvm-svn: 104306	2010-05-21 00:42:32 +00:00
Devang Patel	fbd6c45e06	Simplify. llvm-svn: 104302	2010-05-21 00:10:20 +00:00
Evan Cheng	4401f8873c	Allow targets more controls on what nodes are scheduled by reg pressure, what for latency in hybrid mode. llvm-svn: 104293	2010-05-20 23:26:43 +00:00
Devang Patel	490c8ab76d	Refactor. llvm-svn: 104265	2010-05-20 19:57:06 +00:00
Jim Grosbach	63d4f68df4	Remove dbg_value workaround and associated command line option llvm-svn: 104254	2010-05-20 18:34:01 +00:00
Devang Patel	e1c53f29d3	Split DbgVariable. Eventually, variable info will be communicated through frame index, or DBG_VALUE instruction, or collection of DBG_VALUE instructions. Plus each DbgVariable may not need a label. llvm-svn: 104233	2010-05-20 16:36:41 +00:00
Evan Cheng	bdd062dae0	Add a hybrid bottom up scheduler that reduce register usage while avoiding pipeline stall. It's useful for targets like ARM cortex-a8. NEON has a lot of long latency instructions so a strict register pressure reduction scheduler does not work well. Early experiments show this speeds up some NEON loops by over 30%. llvm-svn: 104216	2010-05-20 06:13:19 +00:00
Nick Lewycky	c53cc4f8bf	Fix typo in comment. llvm-svn: 104209	2010-05-20 03:30:09 +00:00
Eric Christopher	27e7ffc7d4	Partial code for emitting thread local bss data. llvm-svn: 104197	2010-05-20 00:49:07 +00:00
Bob Wilson	42603958fb	Optimize away insertelement of an undef value. This shows up in test/Codegen/ARM/reg_sequence.ll but it doesn't affect the generated code because the coalescer cleans it up. Radar 7998853. llvm-svn: 104185	2010-05-19 23:42:58 +00:00
Jim Grosbach	f98511473e	Enable preserving debug information through post-RA scheduling llvm-svn: 104175	2010-05-19 22:57:47 +00:00
Jim Grosbach	604560c5fe	Fix the post-RA instruction scheduler to handle instructions referenced by more than one dbg_value instruction. rdar://7759363 llvm-svn: 104174	2010-05-19 22:57:06 +00:00
Evan Cheng	70e506e18a	Code clean up. llvm-svn: 104173	2010-05-19 22:42:23 +00:00
Devang Patel	a08130864e	Revert r104165. llvm-svn: 104172	2010-05-19 21:58:28 +00:00
Jakob Stoklund Olesen	e0eddb21f5	Add support for partial redefs to the fast register allocator. A partial redef now triggers a reload if required. Also don't add <imp-def,dead> operands for physical superregisters. Kill flags are still treated as full register kills, and <imp-use,kill> operands are added for physical superregisters as before. llvm-svn: 104167	2010-05-19 21:36:05 +00:00
Devang Patel	0fe341e2e2	There is no need to maintain InsnsBeginScopeSet separately. llvm-svn: 104165	2010-05-19 21:26:53 +00:00
Jakob Stoklund Olesen	5d4c134a94	Add MachineInstr::readsVirtualRegister() in preparation for proper handling of partial redefines. We are going to treat a partial redefine of a virtual register as a read-modify-write: %reg1024:6 = OP Unless the register is fully clobbered: %reg1024:6 = OP, %reg1024<imp-def> MachineInstr::readsVirtualRegister() knows the difference. The first case is a read, the second isn't. llvm-svn: 104149	2010-05-19 20:36:22 +00:00
Evan Cheng	738e920edf	Code refactoring: pull SchedPreference enum from TargetLowering.h to TargetMachine.h and put it in its own namespace. llvm-svn: 104147	2010-05-19 20:19:50 +00:00
Jakob Stoklund Olesen	e11cdf8cc8	TwoAddressInstructionPass doesn't really know how to merge live intervals when lowering REG_SEQUENCE instructions. Insert copies for REG_SEQUENCE sources not killed to avoid breaking later passes. llvm-svn: 104146	2010-05-19 20:08:00 +00:00
Bob Wilson	6a1bfd282b	When expanding a vector_shuffle, the element type may not be legal and may need to be promoted. The BUILD_VECTOR and EXTRACT_VECTOR_ELT nodes generated here already allow the promoted type to be used without further changes, so just do the promotion. This fixes part of pr7167. llvm-svn: 104141	2010-05-19 18:48:32 +00:00
Evan Cheng	abd0ad54a4	Intrinsics which do a vector compare (results are all zero or all ones) are modeled as icmp / fcmp + sext. This is turned into a vsetcc by dag combine (yes, not a good long term solution). The targets can then isel the vsetcc to the appropriate instruction. The trouble arises when the result of a vector cmp + sext is then and'ed with all ones. Instcombine will turn it into a vector cmp + zext, dag combiner will miss turning it into a vsetcc and hell breaks loose after that. Teach dag combine to turn a vector cpm + zest into a vsetcc + and 1. This fixes rdar://7923010. llvm-svn: 104094	2010-05-19 01:08:17 +00:00
Bob Wilson	055c01d9dc	Fix a crash when debugging the coalescer. DebugValue instructions are not in the coalescer's instruction map. llvm-svn: 104086	2010-05-18 23:19:42 +00:00
Jakob Stoklund Olesen	430b6e40ab	Remember to update VirtRegLastUse when spilling without killing before a call. llvm-svn: 104074	2010-05-18 22:20:09 +00:00
Evan Cheng	f19384d54a	Sink dag combine's post index load / store code that swap base ptr and index into the target hook. Only the target knows whether the swap is safe. In Thumb2 mode, the offset must be an immediate. rdar://7998649 llvm-svn: 104060	2010-05-18 21:31:17 +00:00
Jakob Stoklund Olesen	663543b4d7	Properly handle multiple definitions of a virtual register in the same instruction. This can happen on ARM: >> %reg1035:5<def>, %reg1035:6<def> = VLD1q16 %reg1028, 0, pred:14, pred:%reg0 Regs: Q0=%reg1032* R0=%reg1028* R1=%reg1029* R2 R3=%reg1031* Killing last use: %reg1028 Allocating %reg1035 from QPR Assigning %reg1035 to Q1 << %D2<def>, %D3<def> = VLD1q16 %R0<kill>, 0, pred:14, pred:%reg0, %Q1<imp-def> llvm-svn: 104056	2010-05-18 21:10:50 +00:00
Evan Cheng	45b3f702ab	Continuously refine the register class of REG_SEQUENCE def with all the source registers and sub-register indices. llvm-svn: 104051	2010-05-18 20:07:47 +00:00
Evan Cheng	e7fc64a5c9	Fix PR7162: Use source register classes and sub-indices to determine the correct register class of the definitions of REG_SEQUENCE. llvm-svn: 104050	2010-05-18 20:03:28 +00:00
Jakob Stoklund Olesen	4843178d6b	Teach the machine code verifier to use getSubRegisterRegClass(). The old approach was wrong. It had an off-by-one error. llvm-svn: 104034	2010-05-18 17:31:12 +00:00
Daniel Dunbar	62bc96a1a5	llc (et al): Add support for --show-encoding and --show-inst. llvm-svn: 104029	2010-05-18 17:22:19 +00:00
Evan Cheng	48f0de96d6	FIX PR7158. SimplifyVBinOp was asserting when it fails to constant fold (op (build_vector), (build_vector)). llvm-svn: 104004	2010-05-18 00:03:40 +00:00
Evan Cheng	1e4f55200d	Fix PR7175. Insert copies of a REG_SEQUENCE source if it is used by other REG_SEQUENCE instructions. llvm-svn: 103994	2010-05-17 23:24:12 +00:00
Bill Wendling	02d3368831	- Set the "HasCalls" flag after instruction selection is finished. - Change the logic DisableFramePointerElim() to check for the -disable-non-leaf-fp-elim before -disable-fp-elim. llvm-svn: 103990	2010-05-17 23:09:50 +00:00
Eric Christopher	9635b3da6b	More data/parsing support for tls directives. Add a few more testcases and cleanup comments as well. llvm-svn: 103985	2010-05-17 22:53:55 +00:00
Evan Cheng	f2c9a96f3c	Fix PR7156. If the sources of a REG_SEQUENCE are all IMPLICIT_DEF's. Replace it with an IMPLICIT_DEF rather than deleting it or else it would be left without a def. llvm-svn: 103984	2010-05-17 22:09:49 +00:00
Jakob Stoklund Olesen	585792738b	Pull the UsedInInstr.test() calls into calcSpillCost() and remember aliases. This fixes the miscompilations of MultiSource/Applications/JM/l{en,de}cod. Clang now successfully self hosts in a debug build with the fast register allocator. llvm-svn: 103975	2010-05-17 21:02:08 +00:00
Eric Christopher	bf79238599	Add some section and constant support for darwin TLS. llvm-svn: 103974	2010-05-17 21:02:07 +00:00
Evan Cheng	29c463862e	Careful with reg_sequence coalescing to not to overwrite sub-register indices. llvm-svn: 103971	2010-05-17 20:57:12 +00:00
Jakob Stoklund Olesen	70563bbba5	Remove debug option. Add comment on spill order determinism. llvm-svn: 103961	2010-05-17 20:01:22 +00:00
Jakob Stoklund Olesen	176a9c4272	Avoid allocating the same physreg to multiple virtregs in one instruction. While that approach works wonders for register pressure, it tends to break everything. This should unbreak the arm-linux builder and fix a number of miscompilations. llvm-svn: 103946	2010-05-17 17:18:59 +00:00
Jakob Stoklund Olesen	f5e8c86424	Minor optimizations. DenseMap::begin() is surprisingly slow on an empty map. llvm-svn: 103940	2010-05-17 15:30:37 +00:00
Jakob Stoklund Olesen	6649cdaa23	Extract spill cost calculation to a new method, and use definePhysReg() to clear out aliases when allocating. Clean up allocVirtReg(). Use calcSpillCost() to allow more aggressive hinting. Now the hint is always taken unless blocked by a reserved register. This leads to more coalescing, lower register pressure, and less spilling. llvm-svn: 103939	2010-05-17 15:30:32 +00:00
Zhongxing Xu	188855abef	Remove unused member variable. llvm-svn: 103936	2010-05-17 09:47:55 +00:00
Jakob Stoklund Olesen	7d22a81b61	Only use clairvoyance when defining a register, and then only if it has one use. This makes allocation independent on the ordering of use-def chains. llvm-svn: 103935	2010-05-17 04:50:57 +00:00
Jakob Stoklund Olesen	f915d14955	Eliminate a hash table probe when killing virtual registers. llvm-svn: 103934	2010-05-17 03:26:09 +00:00
Jakob Stoklund Olesen	edd3d9db13	Execute virtreg kills immediately instead of after processing all uses. This is safe to do because the physreg has been marked UsedInInstr and the kill flag will be set on the last operand using the virtreg if there are more then one. llvm-svn: 103933	2010-05-17 03:26:06 +00:00
Jakob Stoklund Olesen	e07a408afc	Sprinkle superregister <imp-def> and <imp-kill> operands when dealing with subregister indices. llvm-svn: 103931	2010-05-17 02:49:21 +00:00
Jakob Stoklund Olesen	1069a09691	Now that we don't keep live registers across calls, there is not reason to go through the very long list of call-clobbered registers. We just assume all registers are clobbered. llvm-svn: 103930	2010-05-17 02:49:18 +00:00
Jakob Stoklund Olesen	397068de06	Boldly attempt consistent capitalization. Functional changes unintended. llvm-svn: 103929	2010-05-17 02:49:15 +00:00
Jakob Stoklund Olesen	8044c989d1	Spill and kill all virtual registers across a call. Debug code doesn't use callee saved registers anyway, and the code is simpler this way. Now spillVirtReg always kills, and the isKill parameter is not needed. llvm-svn: 103927	2010-05-17 02:07:32 +00:00
Jakob Stoklund Olesen	d2ef1fbc82	Reduce hashtable probes by using DenseMap::insert() for lookup. llvm-svn: 103926	2010-05-17 02:07:29 +00:00
Jakob Stoklund Olesen	fb43e065a4	Make MBB a class member instead of passing it around everywhere. llvm-svn: 103925	2010-05-17 02:07:22 +00:00
Evan Cheng	166a7993ba	Yes, if the redef is a copy, update the old val# with the copy. But make sure to clear the copy field if the redef is not a copy. llvm-svn: 103922	2010-05-17 01:47:47 +00:00
Dale Johannesen	3a366a88f2	Fix uint64->{float, double} conversion to do rounding correctly in 32-bit. The implementation in LegalizeIntegerTypes to handle this as sint64->float + appropriate power of 2 is subject to double rounding, considered incorrect by numerics people. Use this implementation only when it is safe. This leads to using library calls in some cases that produced inline code before, but it's correct now. (EVTToAPFloatSemantics belongs somewhere else, any suggestions?) Add a correctly rounding (though not particularly fast) conversion that uses X87 80-bit computations for x86-32. 7885399, 5901940. This shows up in gcc.c-torture/execute/ieee/rbug.c in the gcc testsuite on some platforms. llvm-svn: 103883	2010-05-15 18:51:12 +00:00
Dale Johannesen	bb4656c05e	Improve assertion messages. llvm-svn: 103882	2010-05-15 18:38:02 +00:00
Chris Lattner	93cd0f1c89	improve portability to systems that don't have powf/modf (e.g. solaris 9) patch by Evzen Muller! llvm-svn: 103876	2010-05-15 17:10:24 +00:00
Chandler Carruth	75142e6bfc	Fix an GCC warning that seems to have actually caught a bug (!!!) in a condition's grouping. Every other use of Allocatable.test(Hint) groups it the same way as it is indented, so move the parentheses to agree with that grouping. llvm-svn: 103869	2010-05-15 10:23:23 +00:00
Jakob Stoklund Olesen	84ce290822	Calculate liveness on the fly for local registers. When working top-down in a basic block, substituting physregs for virtregs, the use-def chains are kept up to date. That means we can recognize a virtreg kill by the use-def chain becoming empty. This makes the fast allocator independent of incoming kill flags. llvm-svn: 103866	2010-05-15 06:09:08 +00:00
Evan Cheng	e26e56e72b	A partial re-def instruction may be a copy. llvm-svn: 103850	2010-05-15 01:35:44 +00:00
Evan Cheng	8c2d062ea6	Teach two-address pass to do some coalescing while eliminating REG_SEQUENCE instructions. e.g. %reg1026<def> = VLDMQ %reg1025<kill>, 260, pred:14, pred:%reg0 %reg1027<def> = EXTRACT_SUBREG %reg1026, 6 %reg1028<def> = EXTRACT_SUBREG %reg1026<kill>, 5 ... %reg1029<def> = REG_SEQUENCE %reg1028<kill>, 5, %reg1027<kill>, 6, %reg1028, 7, %reg1027, 8, %reg1028, 9, %reg1027, 10, %reg1030<kill>, 11, %reg1032<kill>, 12 After REG_SEQUENCE is eliminated, we are left with: %reg1026<def> = VLDMQ %reg1025<kill>, 260, pred:14, pred:%reg0 %reg1029:6<def> = EXTRACT_SUBREG %reg1026, 6 %reg1029:5<def> = EXTRACT_SUBREG %reg1026<kill>, 5 The regular coalescer will not be able to coalesce reg1026 and reg1029 because it doesn't know how to combine sub-register indices 5 and 6. Now 2-address pass will consult the target whether sub-registers 5 and 6 of reg1026 can be combined to into a larger sub-register (or combined to be reg1026 itself as is the case here). If it is possible, it will be able to replace references of reg1026 with reg1029 + the larger sub-register index. llvm-svn: 103835	2010-05-14 23:21:14 +00:00
Dan Gohman	88fb253562	Fast ISel trivially coalesces away no-op casts, so check for this when setting kill flags. llvm-svn: 103832	2010-05-14 22:53:18 +00:00
Jakob Stoklund Olesen	089e9421d2	Don't bother spilling before a return llvm-svn: 103831	2010-05-14 22:40:43 +00:00
Jakob Stoklund Olesen	cdef6bc8de	RegAllocLocal can count copies too llvm-svn: 103830	2010-05-14 22:40:40 +00:00
Jakob Stoklund Olesen	b16013936b	Track allocatable instead of reserved regs, and never take an unallocatable hint. llvm-svn: 103828	2010-05-14 22:02:56 +00:00
Dan Gohman	2f277c866d	Don't set kill flags for instructions which the scheduler has cloned. llvm-svn: 103827	2010-05-14 22:01:14 +00:00
Jakob Stoklund Olesen	e68b814c8c	Avoid scanning the long tail of physreg operands on calls llvm-svn: 103823	2010-05-14 21:55:52 +00:00
Devang Patel	36debf8046	Do not forget to mark prcessed arguments. llvm-svn: 103822	2010-05-14 21:55:50 +00:00
Jakob Stoklund Olesen	6c038e33e9	Count coalesced copies llvm-svn: 103821	2010-05-14 21:55:50 +00:00
Jakob Stoklund Olesen	33af4fcdea	Allow virtreg redefines when verifying for RegAllocFast llvm-svn: 103820	2010-05-14 21:55:44 +00:00
Jim Grosbach	866b74ba8b	Remove trailing whitespace llvm-svn: 103807	2010-05-14 21:20:46 +00:00
Jim Grosbach	d772bdeb7e	80 column and trailing whitespace cleanup llvm-svn: 103806	2010-05-14 21:19:48 +00:00
Jim Grosbach	25749ad5c2	add cmd line option to leave dbgvalues in during post-RA sceduling. Useful while debugging what's mishandled about them in the post-RA pass. llvm-svn: 103805	2010-05-14 21:18:04 +00:00
Bill Wendling	95f6ebcb37	Rename "HasCalls" in MachineFrameInfo to "AdjustsStack" to better describe what the variable actually tracks. N.B., several back-ends are using "HasCalls" as being synonymous for something that adjusts the stack. This isn't 100% correct and should be looked into. llvm-svn: 103802	2010-05-14 21:14:32 +00:00
Devang Patel	e0a94bfe9f	Add support to preserve type info for the variables that are removed by the optimizer. llvm-svn: 103798	2010-05-14 21:01:35 +00:00
Jakob Stoklund Olesen	670492c8ee	When verifying two-address instructions, check the following: - Kill is implicit when use and def registers are identical. - Only virtual registers can differ. Add a -verify-fast-regalloc to run the verifier before the fast allocator. llvm-svn: 103797	2010-05-14 20:28:32 +00:00
Jakob Stoklund Olesen	4d5c1061e3	Simplify the handling of physreg defs and uses in RegAllocFast. This adds extra security against using clobbered physregs, and it adds kill markers to physreg uses. llvm-svn: 103784	2010-05-14 18:03:25 +00:00
Daniel Dunbar	148e876ac2	XFAIL the test I added with vg_leak, apparently it is the first and only llc -filetype=obj test, and -filetype=obj leaks a few objects. Added a FIXME, we need to sort out the ownership model for the various MC objects. llvm-svn: 103769	2010-05-14 07:47:51 +00:00
Daniel Dunbar	3439ed6324	Inline Asm: Ensure buffer is newline terminated to match how the text is printed. - This is a hack, but I can't decide the best place to handle this. Chris? llvm-svn: 103765	2010-05-14 04:31:50 +00:00
Jakob Stoklund Olesen	ceb5a7ada2	Enable opportunistic coalescing llvm-svn: 103764	2010-05-14 04:30:51 +00:00
Jakob Stoklund Olesen	68c235bd4d	Trust kill flags from isel and later passes. llvm-svn: 103748	2010-05-14 00:02:23 +00:00
Jakob Stoklund Olesen	41f8dc897e	Fix an embarrassing runtime regression for RegAllocFast. This loop is quadratic in the capacity for a DenseMap: while(!map.empty()) map.erase(map.begin()); Instead we now do a normal begin() - end() iteration followed by map.clear(). That also has the nice sideeffect of shrinking the map capacity on demand. llvm-svn: 103747	2010-05-14 00:02:20 +00:00
Dale Johannesen	1ae94b9394	Implement a correct ui64->f32 conversion. The old one was subject to double rounding in extreme cases. llvm-svn: 103744	2010-05-13 23:50:42 +00:00
Jakob Stoklund Olesen	d74a564feb	Clean up RegAllocFast debug output llvm-svn: 103739	2010-05-13 20:43:17 +00:00
Dan Gohman	c90f51c00b	Teach MachineLICM and MachineSink how to clear kill flags conservatively when they move instructions. llvm-svn: 103737	2010-05-13 20:34:42 +00:00
Dan Gohman	7767d2747b	Add a utility function for conservatively clearing kill flags, and make use of it in MachineCSE. llvm-svn: 103726	2010-05-13 19:24:00 +00:00
Dan Gohman	5b510c1474	An Instruction has a trivial kill only if its use is in the same basic block. llvm-svn: 103725	2010-05-13 19:19:32 +00:00
Jakob Stoklund Olesen	0ba2e2a568	Take allocation hints from copy instructions to/from physregs. This causes way more identity copies to be generated, ripe for coalescing. llvm-svn: 103686	2010-05-13 00:19:43 +00:00
Jakob Stoklund Olesen	680b74941f	More asserts around physreg uses llvm-svn: 103685	2010-05-13 00:19:39 +00:00
Evan Cheng	4aab8b5425	If REG_SEQUENCE source is livein, copy it first. Also, update livevariables information when a copy is introduced. llvm-svn: 103680	2010-05-13 00:00:35 +00:00

... 4 5 6 7 8 ...

10276 Commits