llvm-project

Commit Graph

Author	SHA1	Message	Date
Bill Wendling	2730a0099a	Clear out the landing pad to call site map for each function. This isn't put into the 'clear()' method because the information needs to stick around (at least for a little bit) after the selection DAG is built. llvm-svn: 142032	2011-10-15 01:00:26 +00:00
Evan Cheng	06fdaeb5d9	A few 80-col violations. llvm-svn: 141988	2011-10-14 20:36:23 +00:00
Jakob Stoklund Olesen	06b6ccfe90	Update live-in lists when splitting critical edges. Fixes PR10814. Patch by Jan Sjödin! llvm-svn: 141960	2011-10-14 17:25:46 +00:00
Jim Grosbach	400907cc41	Fix typo. "__sync_fetch_and-xor_4" should be "__sync_fetch_and_xor_4". Pointed out by George Russell. llvm-svn: 141956	2011-10-14 15:53:48 +00:00
Jakob Stoklund Olesen	7fb5632e73	Add value numbers when spilling dead defs. When spilling around an instruction with a dead def, remember to add a value number for the def. The missing value number wouldn't normally create problems since there would be an incoming live range as well. However, due to another bug we could spill a dead V_SET0 instruction which doesn't read any values. The missing value number caused an empty live range to be created which is dangerous since it doesn't interfere with anything. This fixes part of PR11125. llvm-svn: 141923	2011-10-14 00:34:31 +00:00
Eric Christopher	76933f4c0b	Don't forget to reconstruct D after changing the scope that we're looking at. llvm-svn: 141892	2011-10-13 21:43:44 +00:00
Cameron Zwarich	86f7d3556c	Use an existing method. llvm-svn: 141855	2011-10-13 07:36:41 +00:00
Nick Lewycky	594a545821	If MI is deleted then remove it from the set. If a new MI is created, it could have the same address as the one we deleted, and we don't want that in the set yet. Noticed by inspection. llvm-svn: 141849	2011-10-13 02:16:18 +00:00
Nick Lewycky	404feb9973	Tabs to spaces. llvm-svn: 141844	2011-10-13 01:09:50 +00:00
Nick Lewycky	8488225984	Add missing braces to pacify GCC's -Wparentheses. llvm-svn: 141842	2011-10-13 00:54:59 +00:00
Jakob Stoklund Olesen	068dc91de9	Also inflate register classes around inline asm. Now that MI->getRegClassConstraint() can also handle inline assembly, don't bail when recomputing the register class of a virtual register used by inline asm. This fixes PR11078. llvm-svn: 141836	2011-10-12 23:37:40 +00:00
Jakob Stoklund Olesen	35b362fab2	Add MachineInstr::getRegClassConstraint(). Most instructions have some requirements for their register operands. Usually, this is expressed as register class constraints in the MCInstrDesc, but for inline assembly the constraints are encoded in the flag words. llvm-svn: 141835	2011-10-12 23:37:36 +00:00
Jakob Stoklund Olesen	1e73716eae	Extract a method for finding the inline asm flag operand. llvm-svn: 141834	2011-10-12 23:37:33 +00:00
Jakob Stoklund Olesen	24abd9d9b6	Encode register class constreaints in inline asm instructions. The inline asm operand constraint is initially encoded in the virtual register for the operand, but that register class may change during coalescing, and the original constraint is lost. Encode the original register class as part of the flag word for each inline asm operand. This makes it possible to recover the actual constraint required by inline asm, just like we can for normal instructions. llvm-svn: 141833	2011-10-12 23:37:29 +00:00
Bill Wendling	3e5409df77	We need to verify that the machine instruction we're using as a replacement for our current machine instruction defines a register with the same register class as what's being replaced. This showed up in the SPEC 403.gcc benchmark, where it would ICE because a tail call was expecting one register class but was given another. (The machine instruction verifier catches this situation.) <rdar://problem/10270968> llvm-svn: 141830	2011-10-12 23:03:40 +00:00
Eli Friedman	979009ea61	Use a utility from MathExtras to clarify a check and avoid undefined behavior. Based on patch by Ahmed Charles. llvm-svn: 141829	2011-10-12 22:46:45 +00:00
Evan Cheng	b35afcaa56	Disable machine LICM speculation check (for profitability) until I have time to investigate the regressions. llvm-svn: 141813	2011-10-12 21:33:49 +00:00
Cameron Zwarich	2dffcebf77	To find the exiting VN of a LiveInterval from a block, use the previous slot rather than the previous index. If a block has a single instruction, the previous index may be in a different basic block. I have no clue how this used to work on all of test-suite, because now this failure is seen quite often when trying to compile code with -strong-phi-elim. This fixes PR10252. llvm-svn: 141812	2011-10-12 21:24:54 +00:00
Dan Gohman	de239d2647	Fix a thinko that Nick noticed. The previous code actually worked as intended, but only by accident. llvm-svn: 141779	2011-10-12 15:56:56 +00:00
Bill Wendling	918cea2c27	Expand the check for a landing pad so that it looks at the basic block's containing loop's header to see if that's a landing pad. If it is, then we don't want to hoist instructions out of the loop and above the header. llvm-svn: 141767	2011-10-12 02:58:01 +00:00
Jakob Stoklund Olesen	35163e21dc	Use an existing function. llvm-svn: 141763	2011-10-12 01:24:51 +00:00
Evan Cheng	af1389546e	Fix r141744. 1. The speculation check may not have been performed if the BB hasn't had a load LICM candidate. 2. If the candidate would be CSE'ed, then go ahead and speculatively LICM the instruction even if it's in high register pressure situation. llvm-svn: 141747	2011-10-12 00:09:14 +00:00
Evan Cheng	f192ca0761	Refine r141689 with a tri-state variable. Also teach MachineLICM to avoid "speculation" when register pressure is high. llvm-svn: 141744	2011-10-11 23:48:44 +00:00
Eric Christopher	6647b83087	Add a new wrapper node for a DILexicalBlock that encapsulates it and a file. Since it should only be used when necessary propagate it through the backend code generation and tweak testcases accordingly. This helps with code like in clang's test/CodeGen/debug-info-line.c where we have multiple #line directives within a single lexical block and want to generate only a single block that contains each file change. Part of rdar://10246360 llvm-svn: 141729	2011-10-11 22:59:11 +00:00
Eric Christopher	57d1692750	Formatting. llvm-svn: 141728	2011-10-11 22:59:04 +00:00
Bill Wendling	579ff6c39c	N.B. This is with the new EH scheme: The blocks with invokes have branches to the dispatch block, because that more correctly models the behavior of the CFG. The dispatch of course has edges to the landing pads. Those landing pads could contain invokes, which then have branches back to the dispatch. This creates a loop. The machine LICM pass looks at this loop and thinks it can hoist elements out of it. But because the dispatch is an alternate entry point into the program, the hoisted instructions won't be executed. I wasn't able to get a testcase which was small and could reproduce all of the time. The function_try_block.cpp in llvm-test was where this showed up. llvm-svn: 141726	2011-10-11 22:42:31 +00:00
Devang Patel	453d401a51	Add dominance check for the instruction being hoisted. For example, MachineLICM should not hoist a load that is not guaranteed to be executed. Radar 10254254. llvm-svn: 141689	2011-10-11 18:09:58 +00:00
Nadav Rotem	3283793c9a	Add support for legalization of vector SHL/SRA/SRL instructions llvm-svn: 141667	2011-10-11 14:36:35 +00:00
Nadav Rotem	198fe81571	Add support for legalization of vector trunc-store where the saved scalar type is illegal (for example, v2i16 on systems where the smallest store size is i32) llvm-svn: 141661	2011-10-11 11:25:16 +00:00
Nadav Rotem	b521b6037b	Cleanup the trunc-store legalization code and add asserts. llvm-svn: 141659	2011-10-11 10:04:25 +00:00
Devang Patel	478d5bc0d0	Revert r141569 and r141576. llvm-svn: 141594	2011-10-10 23:18:02 +00:00
Jakob Stoklund Olesen	add0c43ebb	Give targets a chance to expand even standard pseudos. Allow targets to expand COPY and other standard pseudo-instructions before they are expanded with copyPhysReg(). This allows the target to examine the COPY instruction for extra operands indicating it can be widened to a preferable super-register copy. See the ARM -widen-vmovs option. llvm-svn: 141578	2011-10-10 20:34:28 +00:00
Devang Patel	2689f95875	If loop header is also loop exiting block then it may not be safe to hoist instructions. llvm-svn: 141576	2011-10-10 20:32:03 +00:00
Devang Patel	e554d5995b	Add dominance check for the instruction being hoisted. For example, MachineLICM should not hoist a load that is not guaranteed to be executed. Radar 10254254. llvm-svn: 141569	2011-10-10 19:09:20 +00:00
Bill Wendling	e9574be6a3	Use the code that lowers the arguments and spills any values which are alive across unwind edges. This is for the back-end which expects such things. The code is from the original SjLj EH pass. llvm-svn: 141463	2011-10-08 00:56:47 +00:00
Bill Wendling	7ecfbd90ef	Thread the chain through the eh.sjlj.setjmp intrinsic, like it's documented to do. This will be useful later on with the new SJLJ stuff. llvm-svn: 141416	2011-10-07 21:25:38 +00:00
Andrew Trick	35c9e51219	PostRA scheduler fix. Clear stale loop dependencies. Fixes <rdar://problem/10235725> llvm-svn: 141357	2011-10-07 06:33:09 +00:00
Andrew Trick	4ef158335b	whitespace llvm-svn: 141356	2011-10-07 06:27:02 +00:00
Eli Friedman	1456cd20b4	Remove the old atomic instrinsics. autoupgrade functionality is included with this patch. llvm-svn: 141333	2011-10-06 23:20:49 +00:00
Bill Wendling	267f323d28	Modify the mapping from landing pad to call sites to accept more than one call site. llvm-svn: 141226	2011-10-05 22:24:35 +00:00
Bill Wendling	c2d55b6e50	Add an ivar that maps a landing pad's EH symbol to the call sites that may jump to the landing pad. This will be used by the back-end to generate the jump tables for dispatching the arriving longjmp in sjlj eh. llvm-svn: 141224	2011-10-05 22:20:38 +00:00
Bill Wendling	e61c62533e	Small refactoring. Cache the FunctionInfo->MBB into a local variable. llvm-svn: 141221	2011-10-05 22:16:11 +00:00
Jakob Stoklund Olesen	eb38bd8ced	Fix sub-register operand verification. PhysReg operands are not allowed to have sub-register indices at all. For virtual registers with sub-reg indices, check that all registers in the register class support the sub-reg index. llvm-svn: 141220	2011-10-05 22:12:57 +00:00
Bill Wendling	db1633530a	Fix comment to reflect the new EH stuff. llvm-svn: 141218	2011-10-05 22:04:08 +00:00
Jakob Stoklund Olesen	3abead76ea	Remove unused DstSubIdx argument. llvm-svn: 141214	2011-10-05 21:22:53 +00:00
Jakob Stoklund Olesen	f7957a9819	Simplify EXTRACT_SUBREG emission. EXTRACT_SUBREG is emitted as %dst = COPY %src:sub, so there is no need to constrain the %dst register class. RegisterCoalescer will apply the necessary constraints if it decides to eliminate the COPY. The %src register class does need to be constrained to something with the right sub-registers, though. This is currently done manually with COPY_TO_REGCLASS nodes. They can possibly be removed after this patch. llvm-svn: 141207	2011-10-05 20:26:40 +00:00
Jakob Stoklund Olesen	8ff52c4135	Simplify INSERT_SUBREG emission. The register class created by INSERT_SUBREG and SUBREG_TO_REG must be legal and support the SubIdx sub-registers. The new getSubClassWithSubReg() hook can compute that. This may create INSERT_SUBREG instructions defining a larger register class than the sub-register being inserted. That is OK, RegisterCoalescer will constrain the register class as needed when it eliminates the INSERT_SUBREG instructions. llvm-svn: 141198	2011-10-05 18:31:00 +00:00
Jakob Stoklund Olesen	ccdfbfb5e5	Add a FIXME. TwoAddressInstructionPass should annotate instructions with <undef> flags when it lower REG_SEQUENCE instructions. LiveIntervals should not be in the business of modifying code (except for kill flags, perhaps). llvm-svn: 141187	2011-10-05 16:51:21 +00:00
Jakob Stoklund Olesen	d5d39bb098	Also add <imp-use,kill> flags for redefined super-registers. For example: %vreg10:dsub_0<def,undef> = COPY %vreg1 %vreg10:dsub_1<def> = COPY %vreg2 is rewritten as: %D2<def> = COPY %D0, %Q1<imp-def> %D3<def> = COPY %D1, %Q1<imp-use,kill>, %Q1<imp-def> The first COPY doesn't care about the previous value of %Q1, so it doesn't read that register. The second COPY is a partial redefinition of %Q1, so it implicitly kills and redefines that register. This makes it possible to recognize instructions that can harmlessly clobber the full super-register. The write and don't read the super-register. llvm-svn: 141139	2011-10-05 00:01:48 +00:00
Jakob Stoklund Olesen	9d5bda9be1	Also add <def,undef> flags when coalescing sub-registers. RegisterCoalescer can create sub-register defs when it is joining a register with a sub-register. Add <undef> flags to these new sub-register defs where appropriate. llvm-svn: 141138	2011-10-05 00:01:46 +00:00
Owen Anderson	0ca562ec4c	Teach the MC to output code/data region marker labels in MachO and ELF modes. These are used by disassemblers to provide better disassembly, particularly on targets like ARM Thumb that like to intermingle data in the TEXT segment. llvm-svn: 141135	2011-10-04 23:26:17 +00:00
Bill Wendling	3d11aa7e75	Create a mapping between the landing pad basic block and the call site index for later use. llvm-svn: 141125	2011-10-04 22:00:35 +00:00
Jakob Stoklund Olesen	10f2de3261	Allow <undef> flags on def operands as well as uses. The <undef> flag says that a MachineOperand doesn't read its register, or doesn't depend on the previous value of its register. A full register def never depends on the previous register value. A partial register def may depend on the previous value if it is intended to update part of a register. For example: %vreg10:dsub_0<def,undef> = COPY %vreg1 %vreg10:dsub_1<def> = COPY %vreg2 The first copy instruction defines the full %vreg10 register with the bits not covered by dsub_0 defined as <undef>. It is not considered a read of %vreg10. The second copy modifies part of %vreg10 while preserving the rest. It has an implicit read of %vreg10. This patch adds a MachineOperand::readsReg() method to determine if an operand reads its register. Previously, this was modelled by adding a full-register <imp-def> operand to the instruction. This approach makes it possible to determine directly from a MachineOperand if it reads its register. No scanning of MI operands is required. llvm-svn: 141124	2011-10-04 21:49:33 +00:00
Bill Wendling	ac3fb4c078	Generic cleanup. llvm-svn: 141050	2011-10-04 00:16:40 +00:00
Bill Wendling	97a8695fff	Don't carry over the dispatchsetup hack from the old system. llvm-svn: 141040	2011-10-03 22:42:40 +00:00
Bill Wendling	6f3e73d6ad	Move the grabbing of the jump buffer into the caller function, eliminating the need for returning a std::pair. llvm-svn: 141026	2011-10-03 21:15:28 +00:00
Eric Christopher	cead033ced	Whitespace. llvm-svn: 141005	2011-10-03 15:49:20 +00:00
Eric Christopher	f84354bfb1	Typo. llvm-svn: 141004	2011-10-03 15:49:16 +00:00
Nadav Rotem	52e8ed9214	Moved type construction out of the loop and added an assert on the legality of the type. Formatted lines to the 80 char limit. llvm-svn: 140952	2011-10-01 18:39:28 +00:00
Bill Wendling	9925f197cc	When inferring the pointer alignment, if the global doesn't have an initializer and the alignment is 0 (i.e., it's defined globally in one file and declared in another file) it could get an alignment which is larger than the ABI allows for that type, resulting in aligned moves being used for unaligned loads. For instance, in file A.c: struct S s; In file B.c: struct { // something long }; extern S s; void foo() { struct S p = s; // ... } this copy is a 'memcpy' which is turned into a series of 'movaps' instructions on X86. But this is wrong, because 'struct S' has alignment of 4, not 16. llvm-svn: 140902	2011-09-30 23:19:55 +00:00
Nick Lewycky	f40df1d46c	Promote comment to doxycomment. Adjust whitespace. No functionality change. llvm-svn: 140899	2011-09-30 22:19:53 +00:00
Jakob Stoklund Olesen	1352be2bd3	Move getCommonSubClass() into TRI. It will soon need the context. llvm-svn: 140896	2011-09-30 22:18:51 +00:00
Torok Edwin	be5020eb95	Comment grammar fixes. thanks to Duncan. llvm-svn: 140850	2011-09-30 13:07:47 +00:00
Torok Edwin	319a1415b8	Instead of crashing when MCAsmInfo is NULL, add an assert. This helps with porting code from 2.9 to 3.0 as TargetSelect.h changed location, and if you include the old one by accident you will trigger this assert. llvm-svn: 140848	2011-09-30 12:31:57 +00:00
Eli Friedman	95031ed837	Clean up uses of switch instructions so they are not dependent on the operand ordering. Patch by Stepan Dyatkovskiy. llvm-svn: 140803	2011-09-29 20:21:17 +00:00
Duncan Sands	cac86805bf	Place this bracket according to the LLVM style. llvm-svn: 140784	2011-09-29 16:01:46 +00:00
Jakob Stoklund Olesen	463b05a2d0	Remove NumImplicitOps which is now unused. llvm-svn: 140767	2011-09-29 01:47:36 +00:00
Eric Christopher	d299dccf91	Use the local we already set up. llvm-svn: 140745	2011-09-29 00:50:59 +00:00
Jakob Stoklund Olesen	2318d1e0e9	Rewrite MachineInstr::addOperand() to avoid NumImplicitOps. The function needs to scan the implicit operands anyway, so no performance is won by caching the number of implicit operands added to an instruction. This also fixes a bug when adding operands after an implicit operand has been added manually. The NumImplicitOps count wasn't kept up to date. MachineInstr::addOperand() will now consistently place all explicit operands before all the implicit operands, regardless of the order they are added. It is possible to change an MI opcode and add additional explicit operands. They will be inserted before any existing implicit operands. The only exception is inline asm instructions where operands are never reordered. This is because of a hack that marks explicit clobber regs on inline asm as <implicit-def> to please the fast register allocator. This hack can go away when InstrEmitter and FastIsel can add exact <dead> flags to physreg defs. llvm-svn: 140744	2011-09-29 00:40:51 +00:00
Bill Wendling	899da52d60	Have the SjLjEHPrepare pass do some more heavy lifting. Upon further review, most of the EH code should remain written at the IR level. The part which breaks SSA form is the dispatch table, so that part will be moved to the back-end. llvm-svn: 140730	2011-09-28 21:56:53 +00:00
Duncan Sands	2e67937f76	A typeid of zero means a cleanup, not a catch. This case occurs when there is both a catch and a cleanup. Correct the comment. llvm-svn: 140686	2011-09-28 09:13:02 +00:00
Bill Wendling	baf3941fde	Strip off pointer casts when looking at the eh.sjlj.functioncontext's argument. llvm-svn: 140678	2011-09-28 03:52:41 +00:00
Bill Wendling	225e8481b0	Bitcast the alloca to an i8* to match the intrinsic's signature. llvm-svn: 140677	2011-09-28 03:47:11 +00:00
Bill Wendling	66b110f571	Create and use an llvm.eh.sjlj.functioncontext intrinsic. This intrinsic is used to pass the index of the function context to the back-end for further processing. The back-end is in charge of filling in the rest of the entries. llvm-svn: 140676	2011-09-28 03:36:43 +00:00
Bill Wendling	2e76ca9d9a	In the new EH model, setup the function context and the call site info. The DWARF exception pass uses the call site information, which is set up here. A pre-RA pass is too late for it to use this information. So create and setup the function context here, and then insert the call site values here (and map the call sites for the DWARF EH pass). This is simpler than the original pass, and doesn't make the CFG lose its SSA-ness. It's a win-win-win-win-lose-win-win situation. llvm-svn: 140675	2011-09-28 03:14:05 +00:00
Bill Wendling	e6138e3ad1	Don't conditionalize execution of the SjLj EH prepare pass. We may need an SjLj EH preparation pass for some call site information, at least in the short term. llvm-svn: 140674	2011-09-28 03:07:34 +00:00
Jakob Stoklund Olesen	bd5109f14d	Rename class and clean up source. No functional change intended. llvm-svn: 140664	2011-09-28 00:01:56 +00:00
Jakob Stoklund Olesen	934b7d7645	Rename SSEDomainFix -> lib/CodeGen/ExecutionDepsFix. I'll clean up the source in the next commit. llvm-svn: 140663	2011-09-28 00:01:54 +00:00
Bill Wendling	354ff9e348	This is the start of the new SjLj EH preparation pass, which will replace the current IR-level pass. The old SjLj EH pass has some problems, especially with the new EH model. Most significantly, it violates some of the new restrictions the new model has. For instance, the 'dispatch' table wants to jump to the landing pad, but we cannot allow that because only an invoke's unwind edge can jump to a landing pad. This requires us to mangle the code something awful. In addition, we need to keep the now dead landingpad instructions around instead of CSE'ing them because the DWARF emitter uses that information (they are dead because no control flow edge will execute them - the control flow edge from an invoke's unwind is superceded by the edge coming from the dispatch). Basically, this pass belongs not at the IR level where SSA is king, but at the code-gen level, where we have more flexibility. llvm-svn: 140646	2011-09-27 22:14:12 +00:00
Cameron Zwarich	7a6e8f2c5d	Remove an invalid assert that is really just asserting when the scheduler emits a suboptimal schedule. llvm-svn: 140643	2011-09-27 21:59:16 +00:00
Jim Grosbach	af136f71ec	Rename AddSelectionDAGCSEId() to addSelectionDAGCSEId(). Naming conventions consistency. No functional change. llvm-svn: 140636	2011-09-27 20:59:33 +00:00
Nadav Rotem	38b3b83362	Cleanup PromoteIntOp_EXTRACT_VECTOR_ELT and PromoteIntRes_SETCC. Add a new method: getAnyExtOrTrunc and use it to replace the manual check. llvm-svn: 140603	2011-09-27 11:16:47 +00:00
Nadav Rotem	1b857d2762	Revert r140463; The patch assumes that <4 x i1> is saved to memory as 4 x i8, while the decision is to bit-pack small values. llvm-svn: 140601	2011-09-27 10:48:29 +00:00
James Molloy	0ceb8cadd2	Fix emission of debug data for global variables. getContext() on DIGlobalVariables is not valid any more. llvm-svn: 140539	2011-09-26 17:40:42 +00:00
Jakob Stoklund Olesen	df977fedb6	Add target hook for pseudo instruction expansion. Many targets use pseudo instructions to help register allocation. Like the COPY instruction, these pseudos can be expanded after register allocation. The early expansion can make life easier for PEI and the post-ra scheduler. This patch adds a hook that is called for all remaining pseudo instructions from the ExpandPostRAPseudos pass. llvm-svn: 140472	2011-09-25 19:21:35 +00:00
Nadav Rotem	2279949129	[vector-select] Address one of the issues in pr10902. EXTRACT_VECTOR_ELEMENT SDNodes may return values which are wider than the incoming element types. In this patch we fix the integer promotion of these nodes. Fixes spill-q.ll when running -promote-elements. llvm-svn: 140471	2011-09-25 18:59:42 +00:00
Jakob Stoklund Olesen	fd719d184e	Clean up code after renaming LowerSubregs -> ExpandPostRAPseudos. No functional change intended. llvm-svn: 140470	2011-09-25 16:46:08 +00:00
Jakob Stoklund Olesen	f152df1e6b	Rename LowerSubregs to ExpandPostRAPseudos. I'll fix the file contents in the next commit. This pass is currently expanding the COPY and SUBREG_TO_REG pseudos. I am going to add a hook so targets can expand more pseudo-instructions after register allocation. Many targets have pseudo-instructions that assist the register allocator. They can be expanded after register allocation, before PEI and PostRA scheduling. llvm-svn: 140469	2011-09-25 16:46:00 +00:00
Nadav Rotem	c2deabd202	Implement Duncan's suggestion to use the result of getSetCCResultType if it is legal (this is always the case for scalars), otherwise use the promoted result type. Fix test/CodeGen/X86/vsplit-and.ll when promote-elements is enabled. llvm-svn: 140464	2011-09-24 19:48:19 +00:00
Nadav Rotem	77426a754b	[Vector-Select] Address one of the problems in 10902. When generating the trunc-store of i1's, we need to use the vector type and not the scalar type. This patch fixes the assertion in CodeGen/Generic/bool-vector.ll when running with -promote-elements. llvm-svn: 140463	2011-09-24 18:32:19 +00:00
Jakob Stoklund Olesen	3bb99bc957	Verify that terminators follow non-terminators. This exposes a -segmented-stacks bug. llvm-svn: 140429	2011-09-23 22:45:39 +00:00
Eli Friedman	8a15a5aa93	PR10998: It is not legal to sink an instruction past the terminator of a block; make sure we don't do that. llvm-svn: 140428	2011-09-23 22:41:57 +00:00
Duncan Sands	b461176cfb	Tweak the handling of MERGE_VALUES nodes: remove the need for DecomposeMERGE_VALUES to "know" that results are legalized in a particular order, by passing it the number of the result being legalized (the type legalization core provides this, it just needs to be passed on). llvm-svn: 140373	2011-09-23 13:59:22 +00:00
Nadav Rotem	57e30726ad	Vector-Select: Address one of the problems in pr10902. Add handling for the integer-promotion of CONCAT_VECTORS. Test: test/CodeGen/X86/widen_shuffle-1.ll This patch fixes the above tests (when running in with -promote-elements). llvm-svn: 140372	2011-09-23 09:33:24 +00:00
Dan Gohman	e83e1b2d2c	Fix SimplifySelectCC to add newly created nodes to the DAGCombiner worklist, as it may be possible to perform further optimization on them. llvm-svn: 140349	2011-09-22 23:01:29 +00:00
Jakob Stoklund Olesen	e92e5ee81f	Constrain register classes instead of emitting copies. Sometimes register class constraints are trivial, like GR32->GR32_NOSP, or GPR->rGPR. Teach InstrEmitter to simply constrain the virtual register instead of emitting a copy in these cases. Normally, these copies are handled by the coalescer. This saves some coalescer work. llvm-svn: 140340	2011-09-22 21:39:34 +00:00
Jakob Stoklund Olesen	0f36544c08	Add a MinNumRegs argument to MRI::constrainRegClass(). The function will refuse to use a register class with fewer registers than MinNumRegs. This can be used by clients to avoid accidentally increase register pressure too much. The default value of MinNumRegs=0 doesn't affect how constrainRegClass() works. llvm-svn: 140339	2011-09-22 21:39:31 +00:00
Bill Wendling	a58fde665a	Use the C personality function instead of the C++ personality function. llvm-svn: 140318	2011-09-22 17:56:40 +00:00
Devang Patel	5e6b65cf0d	Do not unnecessarily use AT_specification DIE because it does not add any value. Few weeks ago, llvm completely inverted the debug info graph. Earlier each debug info node used to keep track of its compile unit, now compile unit keeps track of important nodes. One impact of this change is that the global variable's do not have any context, which should be checked before deciding to use AT_specification DIE. llvm-svn: 140282	2011-09-21 23:41:11 +00:00
Bill Wendling	7b3fc8ee38	Attempt to update the shadow stack GC pass to the new EH model. This inserts a cleanup landingpad instruction and a resume to mimic the old unwind instruction. llvm-svn: 140277	2011-09-21 22:14:28 +00:00
Jim Grosbach	098f5a2911	Tidy up. Whitepsace. llvm-svn: 140275	2011-09-21 21:36:53 +00:00
Nadav Rotem	bc9ba30158	[VECTOR-SELECT] Address one of the bugs in pr10902. Vector SetCC result types need to be type-legalized. This code worked before because scalar result types are known to be legal. llvm-svn: 140249	2011-09-21 14:34:38 +00:00
Andrew Trick	924123acb3	Lower ARM adds/subs to add/sub after adding optional CPSR operand. This is still a hack until we can teach tblgen to generate the optional CPSR operand rather than an implicit CPSR def. But the strangeness is now limited to the selection DAG. ADD/SUB MI's no longer have implicit CPSR defs, nor do we allow flag setting variants of these opcodes in machine code. There are several corner cases to consider, and getting one wrong would previously lead to nasty miscompilation. It's not the first time I've debugged one, so this time I added enough verification to ensure it won't happen again. llvm-svn: 140228	2011-09-21 02:20:46 +00:00
Bruno Cardoso Lopes	6cb23f6e7f	Add a DAGCombine for subvector extracts to remove useless chains of subvector inserts and extracts. Initial patch by Rackover, Zvi with some tweak done by me. llvm-svn: 140204	2011-09-20 23:19:33 +00:00
Andrew Trick	52363bdbeb	Restore hasPostISelHook tblgen flag. No functionality change. The hook makes it explicit which patterns require "special" handling. i.e. it self-documents tblgen deficiencies. I plan to add verification in ExpandISelPseudos and Thumb2SizeReduce to catch any missing hasPostISelHooks. Otherwise it's too fragile. llvm-svn: 140160	2011-09-20 18:22:31 +00:00
Andrew Trick	8586e62d91	ARM isel bug fix for adds/subs operands. Modified ARMISelLowering::AdjustInstrPostInstrSelection to handle the full gamut of CPSR defs/uses including instructins whose "optional" cc_out operand is not really optional. This allowed removal of the hasPostISelHook to simplify the .td files and make the implementation more robust. Fixes rdar://10137436: sqlite3 miscompile llvm-svn: 140134	2011-09-20 03:17:40 +00:00
Andrew Trick	53df4b6dfa	whitespace llvm-svn: 140133	2011-09-20 03:06:13 +00:00
Nadav Rotem	7aaa0aa7a7	white space cleanups llvm-svn: 139994	2011-09-18 10:29:29 +00:00
Benjamin Kramer	67b014b2c2	Namespacify. llvm-svn: 139892	2011-09-16 00:35:06 +00:00
Jakob Stoklund Olesen	e2c92a3112	Spill mode: Hoist back-copies locally. The leaveIntvAfter() function normally inserts a back-copy after the requested instruction, making the back-copy kill the live range. In spill mode, try to insert the back-copy before the last use instead. That means the last use becomes the kill instead of the back-copy. This lowers the register pressure because the last use can now redefine the same register it was reading. This will also improve compile time: The back-copy isn't a kill, so hoisting it in hoistCopiesForSize() won't force a recomputation of the source live range. Similarly, if the back-copy isn't hoisted by the splitter, the spiller will not attempt hoisting it locally. llvm-svn: 139883	2011-09-16 00:03:35 +00:00
Jakob Stoklund Olesen	e8339b2e63	Disable local spill hoisting for non-killing copies. If the source register is live after the copy being spilled, there is no point to hoisting it. Hoisting inside a basic block only serves to resolve interferences by shortening the live range of the source. llvm-svn: 139882	2011-09-16 00:03:33 +00:00
Eli Friedman	ee8f14a799	Some legalization fixes for atomic load and store. llvm-svn: 139851	2011-09-15 21:20:49 +00:00
Jakob Stoklund Olesen	bceb9e5c05	Add an option to disable spill hoisting. When -split-spill-mode is enabled, spill hoisting is performed by SplitKit instead of by InlineSpiller. This hidden command line option is for testing the splitter spill mode. llvm-svn: 139845	2011-09-15 21:06:00 +00:00
Jakob Stoklund Olesen	53e2e48de7	VirtRegMap is counting spill slots, not register spills. Fix the stats counters to reflect that. llvm-svn: 139819	2011-09-15 18:31:13 +00:00
Jakob Stoklund Olesen	c94c967656	Count correctly when a COPY turns into a spill or reload. The number of spills could go negative since a folded COPY is just a spill, and it may be eliminated. llvm-svn: 139815	2011-09-15 18:22:52 +00:00
Jakob Stoklund Olesen	37eb6962c6	Count inserted spills and reloads more accurately. Adjust counters when removing spill and reload instructions. We still don't account for reloads being removed by eliminateDeadDefs(). llvm-svn: 139806	2011-09-15 17:54:28 +00:00
Jakob Stoklund Olesen	07b3503f8b	Trace through sibling PHIs in bulk. When traceSiblingValue() encounters a PHI-def value created by live range splitting, don't look at all the predecessor blocks. That can be very expensive in a complicated CFG. Instead, consider that all the non-PHI defs jointly dominate all the PHI-defs. Tracing directly to all the non-PHI defs is much faster that zipping around in the CFG when there are many PHIs with many predecessors. This significantly improves compile time for indirectbr interpreters. llvm-svn: 139797	2011-09-15 16:41:12 +00:00
Jakob Stoklund Olesen	b8b1d4c435	Speed up LiveIntervals::shrinkToUse with some caching. Blocks with multiple PHI successors only need to go on the worklist once. Use a SmallPtrSet to track the live-out blocks that have already been handled. This is a lot faster than the two live range check we would otherwise do. Also stop recomputing hasPHIKill flags. Like RenumberValues(), it is conservatively correct to leave them in, and they are not used for anything important. llvm-svn: 139792	2011-09-15 15:24:16 +00:00
Jakob Stoklund Olesen	fb75d78d33	Revert r139782, "RemoveCopyByCommutingDef doesn't need hasPHIKill()." It does, after all. RemoveCopyByCommutingDef rewrites the uses of one particular value number in A. It doesn't know how to rewrite phi uses, so there can't be any. llvm-svn: 139787	2011-09-15 06:27:32 +00:00
Jakob Stoklund Olesen	4c099551f9	Stop verifying hasPHIKill() flags. There is only one legitimate use remaining, in addIntervalsForSpills(). All other calls to hasPHIKill() are only used to update PHIKill flags. The addIntervalsForSpills() function is part of the old spilling framework, only used by linearscan. llvm-svn: 139783	2011-09-15 05:16:30 +00:00
Jakob Stoklund Olesen	0499e7bbd0	RemoveCopyByCommutingDef doesn't need hasPHIKill(). Instead, let HasOtherReachingDefs() test for defs in B that overlap any phi-defs in A as well. This test is slightly different, but almost identical. A perfectly precise test would only check those phi-defs in A that are reachable from AValNo. llvm-svn: 139782	2011-09-15 05:03:50 +00:00
Jakob Stoklund Olesen	dca022e377	It is safe to remat a value killed by phis. The source live range is recomputed using shrinkToUses() which does handle phis correctly. The hasPHIKill() condition was relevant in the old days when ReMaterializeTrivialDef() tried to recompute the live range itself. The shrinkToUses() function will mark the original def as dead when no more uses and phi kills remain. It is then removed by runOnMachineFunction(). llvm-svn: 139781	2011-09-15 04:52:06 +00:00
Jakob Stoklund Olesen	e7ca8ecd92	Leave hasPHIKill flags alone in LiveInterval::RenumberValues. It is conservatively correct to keep the hasPHIKill flags, even after deleting PHI-defs. The calculation can be very expensive after taildup has created a quadratic number of indirectbr edges in the CFG, and the hasPHIKill flag isn't used for anything after RenumberValues(). llvm-svn: 139780	2011-09-15 04:37:18 +00:00
Andrew Trick	76a86d3d4c	[regcoalescing] bug fix for RegistersDefinedFromSameValue. An improper SlotIndex->VNInfo lookup was leading to unsafe copy removal. Fixes PR10920 401.bzip2 miscompile with no IV rewrite. llvm-svn: 139765	2011-09-15 01:09:33 +00:00
Devang Patel	04d6d47865	Add support to emit debug info for C++0x nullptr type. llvm-svn: 139751	2011-09-14 23:13:28 +00:00
Jakob Stoklund Olesen	811b9c475d	Ignore the cloning of unknown registers. THe LRE_DidCloneVirtReg callback may be called with vitual registers that RAGreedy doesn't even know about yet. In that case, there are no data structures to update. llvm-svn: 139702	2011-09-14 17:34:37 +00:00
Jakob Stoklund Olesen	a98af39856	Hoist back-copies to the least busy dominator. When a back-copy is hoisted to the nearest common dominator, keep looking up the dominator tree for a less loopy dominator, and place the back-copy there instead. Don't do this when a single existing back-copy dominates all the others. Assume the client knows what he is doing, and keep the dominating back-copy. This prevents us from hoisting back-copies into loops in most cases. If a value is defined in a loop with multiple exits, we may still hoist back-copies into that loop. That is the speed/size tradeoff. llvm-svn: 139698	2011-09-14 16:45:39 +00:00
Nadav Rotem	d748dbacb0	Add integer promotion support for vselect llvm-svn: 139692	2011-09-14 14:42:15 +00:00
Jakob Stoklund Olesen	5d4277ddfa	Distinguish complex mapped values from forced recomputation. When a ParentVNI maps to multiple defs in a new interval, its live range may still be derived directly from RegAssign by transferValues(). On the other hand, when instructions have been rematerialized or hoisted, it may be necessary to completely recompute live ranges using LiveRangeCalc::extend() to all uses. Use a bit in the value map to indicate that a live range must be recomputed. Rename markComplexMapped() to forceRecompute(). This fixes some live range verification errors when -split-spill-mode=size hoists back-copies by recomputing source ranges when RegAssign kills can't be moved. llvm-svn: 139660	2011-09-13 23:09:04 +00:00
Jakob Stoklund Olesen	a25330f0d7	Implement -split-spill-mode=size. Whenever the complement interval is defined by multiple copies of the same value, hoist those back-copies to the nearest common dominator. This ensures that at most one copy is inserted per value in the complement inteval, and no phi-defs are needed. llvm-svn: 139651	2011-09-13 22:22:39 +00:00
Eli Friedman	f78c6a83ee	Fix check for unaligned load/store so it doesn't catch over-aligned load/store. llvm-svn: 139649	2011-09-13 22:19:59 +00:00
Eli Friedman	f1518216fd	Error out on CodeGen of unaligned load/store. Fix test so it isn't accidentally testing that case. llvm-svn: 139641	2011-09-13 20:50:54 +00:00
Nadav Rotem	66dc9ae08d	Fix the assertion which checks the size of the input operand. llvm-svn: 139633	2011-09-13 20:03:38 +00:00
Nadav Rotem	52202fbf2d	Add vselect target support for targets that do not support blend but do support xor/and/or (For example SSE2). llvm-svn: 139623	2011-09-13 19:17:42 +00:00
Devang Patel	f9e2ae9b05	Use a cache to maintain list of machine basic blocks for a given UserValue. llvm-svn: 139616	2011-09-13 18:40:53 +00:00
Jakob Stoklund Olesen	4484f99175	Add SplitEditor::markOverlappedComplement(). This function is used to flag values where the complement interval may overlap other intervals. Call it from overlapIntv, and use the flag to fully recompute those live ranges in transferValues(). llvm-svn: 139612	2011-09-13 18:05:29 +00:00
Jakob Stoklund Olesen	820c8fd0db	Eliminate the extendRange() wrapper. llvm-svn: 139608	2011-09-13 17:38:57 +00:00
Jakob Stoklund Olesen	0494c5c35d	Switch extendInBlock() to take a kill slot instead of the last use slot. Three out of four clients prefer this interface which is consistent with extendIntervalEndTo() and LiveRangeCalc::extend(). llvm-svn: 139604	2011-09-13 16:47:56 +00:00
Jakob Stoklund Olesen	054984d75b	Use a separate LiveRangeCalc for the complement in spill modes. The complement interval may overlap the other intervals created, so use a separate LiveRangeCalc instance to compute its live range. A LiveRangeCalc instance can only be shared among non-overlapping intervals. llvm-svn: 139603	2011-09-13 16:47:53 +00:00
NAKAMURA Takumi	cac923b556	Unbreak msvc. llvm-svn: 139581	2011-09-13 03:58:34 +00:00
Jakob Stoklund Olesen	487f2a37bf	Extract live range calculations from SplitKit. SplitKit will soon need two copies of these data structures, and the algorithms will also be useful when LiveIntervalAnalysis becomes independent of LiveVariables. llvm-svn: 139572	2011-09-13 01:34:21 +00:00
Bill Wendling	ac5a883624	Introduce a bit of a hack. Splitting a landing pad takes considerable care because of PHIs and other nasties. The problem is that the jump table needs to jump to the landing pad block. However, the landing pad block can be jumped to only by an invoke instruction. So we clone the landingpad instruction into its own basic block, have the invoke jump to there. The landingpad instruction's basic block's successor is now the target for the jump table. But because of PHI nodes, we need to create another basic block for the jump table to jump to. This is definitely a hack, because the values for the PHI nodes may not be defined on the edge from the jump table. But that's okay, because the jump table is simply a construct to mimic what is happening in the CFG. So the values are mysteriously there, even though there is no value for the PHI from the jump table's edge (hence calling this a hack). llvm-svn: 139545	2011-09-12 21:56:59 +00:00
Jakob Stoklund Olesen	45df7e0f22	Remove the -compact-regions flag. It has been enabled by default for a while, it was only there to allow performance comparisons. llvm-svn: 139501	2011-09-12 16:54:42 +00:00
Jakob Stoklund Olesen	eecb2fb183	Add an interface for SplitKit complement spill modes. SplitKit always computes a complement live range to cover the places where the original live range was live, but no explicit region has been allocated. Currently, the complement live range is created to be as small as possible - it never overlaps any of the regions. This minimizes register pressure, but if the complement is going to be spilled anyway, that is not very important. The spiller will eliminate redundant spills, and hoist others by making the spill slot live range overlap some of the regions created by splitting. Stack slots are cheap. This patch adds the interface to enable spill modes in SplitKit. In spill mode, SplitKit will assume that the complement is going to spill, so it will allow it to overlap regions in order to avoid back-copies. By doing some of the spiller's work early, the complement live range becomes simpler. In some cases, it can become much simpler because no extra PHI-defs are required. This will speed up both splitting and spilling. This is only the interface to enable spill modes, no implementation yet. llvm-svn: 139500	2011-09-12 16:49:21 +00:00
Jakob Stoklund Olesen	72c0ddfbc4	Update comments to reflect some (not so) recent changes. llvm-svn: 139498	2011-09-12 16:03:26 +00:00
Richard Trieu	78a812bf2d	Fix asserts in CodeGen from: assert("error"); to: assert(0 && "error"); llvm-svn: 139449	2011-09-10 01:07:54 +00:00
Chris Lattner	e74e0c8020	tidy up a bit llvm-svn: 139419	2011-09-09 22:06:59 +00:00
Eli Friedman	b7910b79f5	Make the SelectionDAG verify that all the operands of BUILD_VECTOR have the same type. Teach DAGCombiner::visitINSERT_VECTOR_ELT not to make invalid BUILD_VECTORs. Fixes PR10897. llvm-svn: 139407	2011-09-09 21:04:06 +00:00
Jakob Stoklund Olesen	278bf02581	Reapply r139247: Cache intermediate results during traceSiblingValue. In some cases such as interpreters using indirectbr, the CFG can be very complicated, and live range splitting may be forced to insert a large number of phi-defs. When that happens, traceSiblingValue can spend a lot of time zipping around in the CFG looking for defs and reloads. This patch causes more information to be cached in SibValues, and the cached values are used to terminate searches early. This speeds up spilling by 20x in one interpreter test case. For more typical code, this is just a 10% speedup of spilling. The previous version had bugs that caused miscompilations. They have been fixed. llvm-svn: 139378	2011-09-09 18:11:41 +00:00
Devang Patel	9d904e1a97	Directly point debug info to the stack slot of the arugment, instead of trying to keep track of vreg in which it the arugment is copied. The LiveDebugVariable can keep track of variable's ranges. llvm-svn: 139330	2011-09-08 22:59:09 +00:00

1 2 3 4 5 ...

12604 Commits