The <undef> flag says that a MachineOperand doesn't read its register;
that is, it doesn't depend on the register's previous value.
A full register def never depends on the previous register value. A
partial register def may depend on the previous value if it is intended
to update part of a register.
For example:
%vreg10:dsub_0<def,undef> = COPY %vreg1
%vreg10:dsub_1<def> = COPY %vreg2
The first copy instruction defines the full %vreg10 register with the
bits not covered by dsub_0 defined as <undef>. It is not considered a
read of %vreg10.
The second copy modifies part of %vreg10 while preserving the rest. It
has an implicit read of %vreg10.
This patch adds a MachineOperand::readsReg() method to determine if an
operand reads its register.
Previously, this was modelled by adding a full-register <imp-def>
operand to the instruction. With readsReg(), it is possible to
determine directly from a MachineOperand whether it reads its register;
no scanning of MI operands is required.
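A minimal sketch of the new method, assuming the existing MachineOperand
flag accessors (isUndef, isUse, getSubReg):
bool MachineOperand::readsReg() const {
  assert(isReg() && "Wrong MachineOperand accessor");
  // A use always reads unless marked <undef>. A def reads only if it
  // is a partial (sub-register) def, which preserves the other bits.
  return !isUndef() && (isUse() || getSubReg());
}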
llvm-svn: 141124
and the alignment is 0 (i.e., it's defined globally in one file and declared in
another file) it could get an alignment which is larger than the ABI allows for
that type, resulting in aligned moves being used for unaligned loads.
For instance, in file A.c:
struct S s;
In file B.c:
struct S {
// something long
};
extern struct S s;
void foo() {
struct S p = s;
// ...
}
This copy is a 'memcpy' which is turned into a series of 'movaps' instructions
on X86. But this is wrong, because 'struct S' has alignment of 4, not 16.
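A sketch of the kind of guard involved (helper name hypothetical,
TargetData-style queries assumed): only raise a global's alignment
above the ABI minimum when this module owns the definition.
unsigned getGlobalAlignment(const GlobalVariable *GV, const TargetData &TD) {
  Type *Ty = GV->getType()->getElementType();
  unsigned ABIAlign = TD.getABITypeAlignment(Ty);
  // The definition may live in another translation unit, so we cannot
  // assume more than the ABI guarantees for the type.
  if (GV->isDeclaration() || GV->mayBeOverridden())
    return ABIAlign;
  return std::max(ABIAlign, TD.getPreferredAlignment(GV));
}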
llvm-svn: 140902
This helps with porting code from 2.9 to 3.0 as TargetSelect.h changed location,
and if you include the old one by accident you will trigger this assert.
llvm-svn: 140848
The function needs to scan the implicit operands anyway, so no
performance is won by caching the number of implicit operands added to
an instruction.
This also fixes a bug when adding operands after an implicit operand has
been added manually. The NumImplicitOps count wasn't kept up to date.
MachineInstr::addOperand() will now consistently place all explicit
operands before all the implicit operands, regardless of the order they
are added. It is possible to change an MI opcode and add additional
explicit operands. They will be inserted before any existing implicit
operands.
The only exception is inline asm instructions where operands are never
reordered. This is because of a hack that marks explicit clobber regs
on inline asm as <implicit-def> to please the fast register allocator.
This hack can go away when InstrEmitter and FastIsel can add exact
<dead> flags to physreg defs.
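A sketch of the ordering invariant (not the exact implementation): a
new explicit operand is inserted before any trailing implicit register
operands rather than appended.
unsigned InsertPt = MI->getNumOperands();
if (!(Op.isReg() && Op.isImplicit()))   // Op is the operand being added
  while (InsertPt &&
         MI->getOperand(InsertPt - 1).isReg() &&
         MI->getOperand(InsertPt - 1).isImplicit())
    --InsertPt;                         // step back past implicit operands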
llvm-svn: 140744
Upon further review, most of the EH code should remain written at the IR
level. The part which breaks SSA form is the dispatch table, so that part will
be moved to the back-end.
llvm-svn: 140730
This intrinsic is used to pass the index of the function context to the back-end
for further processing. The back-end is in charge of filling in the rest of the
entries.
llvm-svn: 140676
The DWARF exception pass uses the call site information, which is set up here. A
pre-RA pass is too late for it to use this information. So create and setup the
function context here, and then insert the call site values here (and map the
call sites for the DWARF EH pass). This is simpler than the original pass, and
doesn't make the CFG lose its SSA-ness.
It's a win-win-win-win-lose-win-win situation.
llvm-svn: 140675
current IR-level pass.
The old SjLj EH pass has some problems, especially with the new EH model. Most
significantly, it violates some of the new restrictions the new model has. For
instance, the 'dispatch' table wants to jump to the landing pad, but we cannot
allow that because only an invoke's unwind edge can jump to a landing pad. This
requires us to mangle the code something awful. In addition, we need to keep the
now dead landingpad instructions around instead of CSE'ing them because the
DWARF emitter uses that information (they are dead because no control flow edge
will execute them - the control flow edge from an invoke's unwind is superseded
by the edge coming from the dispatch).
Basically, this pass belongs not at the IR level where SSA is king, but at the
code-gen level, where we have more flexibility.
llvm-svn: 140646
Many targets use pseudo instructions to help register allocation. Like
the COPY instruction, these pseudos can be expanded after register
allocation. The early expansion can make life easier for PEI and the
post-ra scheduler.
This patch adds a hook that is called for all remaining pseudo
instructions from the ExpandPostRAPseudos pass.
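The hook might look roughly like this (target name hypothetical): a
target returns true when it expanded the pseudo, and false to indicate
the instruction was not one of its pseudos.
bool MyTargetInstrInfo::expandPostRAPseudo(
    MachineBasicBlock::iterator MI) const {
  switch (MI->getOpcode()) {
  default:
    return false;                 // not one of our pseudos
  case MyTarget::LOADIMM_PSEUDO:  // hypothetical pseudo-instruction
    // Rewrite *MI into one or more real instructions here.
    return true;
  }
}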
llvm-svn: 140472
SDNodes may return values which are wider than the incoming element types. In
this patch we fix the integer promotion of these nodes.
Fixes spill-q.ll when running -promote-elements.
llvm-svn: 140471
I'll fix the file contents in the next commit.
This pass is currently expanding the COPY and SUBREG_TO_REG pseudos. I
am going to add a hook so targets can expand more pseudo-instructions
after register allocation.
Many targets have pseudo-instructions that assist the register
allocator. They can be expanded after register allocation, before PEI
and PostRA scheduling.
llvm-svn: 140469
(this is always the case for scalars), otherwise use the promoted result type.
Fix test/CodeGen/X86/vsplit-and.ll when promote-elements is enabled.
llvm-svn: 140464
When generating the trunc-store of i1's, we need to use the vector type and not
the scalar type.
This patch fixes the assertion in CodeGen/Generic/bool-vector.ll when
running with -promote-elements.
llvm-svn: 140463
DecomposeMERGE_VALUES to "know" that results are legalized in
a particular order, by passing it the number of the result
being legalized (the type legalization core provides this, it
just needs to be passed on).
llvm-svn: 140373
integer-promotion of CONCAT_VECTORS.
Test: test/CodeGen/X86/widen_shuffle-1.ll
This patch fixes the above test (when running with -promote-elements).
llvm-svn: 140372
Sometimes register class constraints are trivial, like GR32->GR32_NOSP,
or GPR->rGPR. Teach InstrEmitter to simply constrain the virtual
register instead of emitting a copy in these cases.
Normally, these copies are handled by the coalescer. This saves some
coalescer work.
llvm-svn: 140340
The function will refuse to use a register class with fewer registers
than MinNumRegs. This can be used by clients to avoid accidentally
increasing register pressure too much.
The default value of MinNumRegs=0 doesn't affect how constrainRegClass()
works.
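A usage sketch (register class names illustrative): a client that would
rather keep the old class than drop below four allocatable registers
can write:
const TargetRegisterClass *NewRC =
    MRI->constrainRegClass(VReg, &X86::GR32_ABCDRegClass, /*MinNumRegs=*/4);
if (!NewRC) {
  // Constraint refused: insert a COPY instead of constraining.
}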
llvm-svn: 140339
A few weeks ago, llvm completely inverted the debug info graph: earlier,
each debug info node kept track of its compile unit; now the compile
unit keeps track of the important nodes. One impact of this change is
that global variables no longer have any context, which should be
checked before deciding to use an AT_specification DIE.
llvm-svn: 140282
This is still a hack until we can teach tblgen to generate the
optional CPSR operand rather than an implicit CPSR def. But the
strangeness is now limited to the selection DAG. ADD/SUB MI's no
longer have implicit CPSR defs, nor do we allow flag setting variants
of these opcodes in machine code. There are several corner cases to
consider, and getting one wrong would previously lead to nasty
miscompilation. It's not the first time I've debugged one, so this
time I added enough verification to ensure it won't happen again.
llvm-svn: 140228
No functionality change. The hook makes it explicit which patterns
require "special" handling. i.e. it self-documents tblgen
deficiencies. I plan to add verification in ExpandISelPseudos and
Thumb2SizeReduce to catch any missing hasPostISelHooks. Otherwise it's
too fragile.
llvm-svn: 140160
Modified ARMISelLowering::AdjustInstrPostInstrSelection to handle the
full gamut of CPSR defs/uses, including instructions whose "optional"
cc_out operand is not really optional. This allowed removal of the
hasPostISelHook to simplify the .td files and make the implementation
more robust.
Fixes rdar://10137436: sqlite3 miscompile
llvm-svn: 140134
The leaveIntvAfter() function normally inserts a back-copy after the
requested instruction, making the back-copy kill the live range.
In spill mode, try to insert the back-copy before the last use instead.
That means the last use becomes the kill instead of the back-copy. This
lowers the register pressure because the last use can now redefine the
same register it was reading.
This will also improve compile time: The back-copy isn't a kill, so
hoisting it in hoistCopiesForSize() won't force a recomputation of the
source live range. Similarly, if the back-copy isn't hoisted by the
splitter, the spiller will not attempt hoisting it locally.
llvm-svn: 139883
If the source register is live after the copy being spilled, there is no
point in hoisting it. Hoisting inside a basic block only serves to
resolve interferences by shortening the live range of the source.
llvm-svn: 139882
When -split-spill-mode is enabled, spill hoisting is performed by
SplitKit instead of by InlineSpiller. This hidden command line option
is for testing the splitter spill mode.
llvm-svn: 139845
When traceSiblingValue() encounters a PHI-def value created by live
range splitting, don't look at all the predecessor blocks. That can be
very expensive in a complicated CFG.
Instead, consider that all the non-PHI defs jointly dominate all the
PHI-defs. Tracing directly to all the non-PHI defs is much faster than
zipping around in the CFG when there are many PHIs with many
predecessors.
This significantly improves compile time for indirectbr interpreters.
llvm-svn: 139797
Blocks with multiple PHI successors only need to go on the worklist
once. Use a SmallPtrSet to track the live-out blocks that have already
been handled. This is a lot faster than the two live range checks we
would otherwise do.
Also stop recomputing hasPHIKill flags. Like RenumberValues(), it is
conservatively correct to leave them in, and they are not used for
anything important.
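A sketch of the dedup, with the surrounding names assumed:
SmallPtrSet<MachineBasicBlock*, 16> LiveOut; // live-out blocks already handled
SmallVector<MachineBasicBlock*, 16> WorkList;
// For each block Pred where the value is live out:
if (!LiveOut.count(Pred)) {
  LiveOut.insert(Pred);          // never visit Pred a second time
  WorkList.push_back(Pred);
}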
llvm-svn: 139792
It does, after all.
RemoveCopyByCommutingDef rewrites the uses of one particular value
number in A. It doesn't know how to rewrite phi uses, so there can't be
any.
llvm-svn: 139787
There is only one legitimate use remaining, in addIntervalsForSpills().
All other calls to hasPHIKill() are only used to update PHIKill flags.
The addIntervalsForSpills() function is part of the old spilling
framework, only used by linearscan.
llvm-svn: 139783
Instead, let HasOtherReachingDefs() test for defs in B that overlap any
phi-defs in A as well. This test is slightly different, but almost
identical.
A perfectly precise test would only check those phi-defs in A that are
reachable from AValNo.
llvm-svn: 139782
The source live range is recomputed using shrinkToUses() which does
handle phis correctly. The hasPHIKill() condition was relevant in the
old days when ReMaterializeTrivialDef() tried to recompute the live
range itself.
The shrinkToUses() function will mark the original def as dead when no
more uses and phi kills remain. It is then removed by
runOnMachineFunction().
llvm-svn: 139781
It is conservatively correct to keep the hasPHIKill flags, even after
deleting PHI-defs.
The calculation can be very expensive after taildup has created a
quadratic number of indirectbr edges in the CFG, and the hasPHIKill flag
isn't used for anything after RenumberValues().
llvm-svn: 139780
The LRE_DidCloneVirtReg callback may be called with virtual registers
that RAGreedy doesn't even know about yet. In that case, there are no
data structures to update.
llvm-svn: 139702
When a back-copy is hoisted to the nearest common dominator, keep
looking up the dominator tree for a less loopy dominator, and place the
back-copy there instead.
Don't do this when a single existing back-copy dominates all the others.
Assume the client knows what it is doing, and keep the dominating
back-copy.
This prevents us from hoisting back-copies into loops in most cases. If
a value is defined in a loop with multiple exits, we may still hoist
back-copies into that loop. That is the speed/size tradeoff.
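A sketch of the placement loop, assuming MachineDominatorTree and
MachineLoopInfo style queries:
MachineBasicBlock *Best = CommonDom;       // nearest common dominator
unsigned BestDepth = Loops->getLoopDepth(Best);
for (MachineDomTreeNode *N = DomTree->getNode(CommonDom)->getIDom(); N;
     N = N->getIDom()) {
  unsigned Depth = Loops->getLoopDepth(N->getBlock());
  if (Depth < BestDepth) {
    Best = N->getBlock();                  // a less loopy dominator
    BestDepth = Depth;
  }
}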
llvm-svn: 139698
When a ParentVNI maps to multiple defs in a new interval, its live range
may still be derived directly from RegAssign by transferValues().
On the other hand, when instructions have been rematerialized or
hoisted, it may be necessary to completely recompute live ranges using
LiveRangeCalc::extend() to all uses.
Use a bit in the value map to indicate that a live range must be
recomputed. Rename markComplexMapped() to forceRecompute().
This fixes some live range verification errors when
-split-spill-mode=size hoists back-copies by recomputing source ranges
when RegAssign kills can't be moved.
llvm-svn: 139660
Whenever the complement interval is defined by multiple copies of the
same value, hoist those back-copies to the nearest common dominator.
This ensures that at most one copy is inserted per value in the
complement interval, and no phi-defs are needed.
llvm-svn: 139651
This function is used to flag values where the complement interval may
overlap other intervals. Call it from overlapIntv, and use the flag to
fully recompute those live ranges in transferValues().
llvm-svn: 139612
The complement interval may overlap the other intervals created, so use
a separate LiveRangeCalc instance to compute its live range.
A LiveRangeCalc instance can only be shared among non-overlapping
intervals.
llvm-svn: 139603
SplitKit will soon need two copies of these data structures, and the
algorithms will also be useful when LiveIntervalAnalysis becomes
independent of LiveVariables.
llvm-svn: 139572
Splitting a landing pad takes considerable care because of PHIs and other
nasties. The problem is that the jump table needs to jump to the landing pad
block. However, the landing pad block can be jumped to only by an invoke
instruction. So we clone the landingpad instruction into its own basic block
and have the invoke jump to it. The landingpad instruction's basic block's
successor is now the target for the jump table.
But because of PHI nodes, we need to create another basic block for the jump
table to jump to. This is definitely a hack, because the values for the PHI
nodes may not be defined on the edge from the jump table. But that's okay,
because the jump table is simply a construct to mimic what is happening in the
CFG. So the values are mysteriously there, even though there is no value for the
PHI from the jump table's edge (hence calling this a hack).
llvm-svn: 139545
SplitKit always computes a complement live range to cover the places
where the original live range was live, but no explicit region has been
allocated.
Currently, the complement live range is created to be as small as
possible - it never overlaps any of the regions. This minimizes
register pressure, but if the complement is going to be spilled anyway,
that is not very important. The spiller will eliminate redundant
spills, and hoist others by making the spill slot live range overlap
some of the regions created by splitting. Stack slots are cheap.
This patch adds the interface to enable spill modes in SplitKit. In
spill mode, SplitKit will assume that the complement is going to spill,
so it will allow it to overlap regions in order to avoid back-copies.
By doing some of the spiller's work early, the complement live range
becomes simpler. In some cases, it can become much simpler because no
extra PHI-defs are required. This will speed up both splitting and
spilling.
This is only the interface to enable spill modes, no implementation yet.
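Roughly, the interface is a mode selected by the spiller when it
initializes SplitKit (names as sketched here):
enum ComplementSpillMode {
  SM_Partition, // Default: the complement never overlaps the regions.
  SM_Size,      // Complement will be spilled; minimize back-copies.
  SM_Speed      // Complement will be spilled; also avoid back-copies
                // on hot paths, at the cost of extra spill code.
};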
llvm-svn: 139500
In some cases such as interpreters using indirectbr, the CFG can be very
complicated, and live range splitting may be forced to insert a large
number of phi-defs. When that happens, traceSiblingValue can spend a
lot of time zipping around in the CFG looking for defs and reloads.
This patch causes more information to be cached in SibValues, and the
cached values are used to terminate searches early. This speeds up
spilling by 20x in one interpreter test case. For more typical code,
this is just a 10% speedup of spilling.
The previous version had bugs that caused miscompilations. They have
been fixed.
llvm-svn: 139378
In some cases such as interpreters using indirectbr, the CFG can be very
complicated, and live range splitting may be forced to insert a large
number of phi-defs. When that happens, traceSiblingValue can spend a
lot of time zipping around in the CFG looking for defs and reloads.
This patch causes more information to be cached in SibValues, and the
cached values are used to terminate searches early. This speeds up
spilling by 20x in one interpreter test case. For more typical code,
this is just a 10% speedup of spilling.
llvm-svn: 139247
(The fix for the related failures on x86 is going to be nastier because we actually need Acquire memoperands attached to the atomic load instrs, etc.)
llvm-svn: 139221
with a vector condition); such selects become VSELECT codegen nodes.
This patch also removes VSETCC codegen nodes, unifying them with SETCC
nodes (codegen was actually often using SETCC for vector SETCC already).
This ensures that various DAG combiner optimizations kick in for vector
comparisons. Passes dragonegg bootstrap with no testsuite regressions
(nightly testsuite as well as "make check-all"). Patch mostly by
Nadav Rotem.
llvm-svn: 139159
init.trampoline and adjust.trampoline intrinsics, into two intrinsics
like in GCC. While having one combined intrinsic is tempting, it is
not natural because typically the trampoline initialization needs to
be done in one function, and the result of adjust.trampoline is needed
in a different (nested) function. To get around this llvm-gcc hacks the
nested function lowering code to insert an additional parent variable
holding the adjust.trampoline result that can be accessed from the child
function. Dragonegg doesn't have the luxury of tweaking GCC code, so it
stored the result of adjust.trampoline in the memory GCC set aside for
the trampoline itself (this is always available in the child function),
and set up some new memory (using an alloca) to hold the trampoline.
Unfortunately this breaks Go which allocates trampoline memory on the
heap and wants to use it even after the parent has exited (!). Rather
than doing even more hacks to get Go working, it seemed best to just use
two intrinsics like in GCC. Patch mostly by Sanjoy Das.
llvm-svn: 139140
If we have a chain of zext -> assert_zext -> zext -> use, the first zext
would get simplified away because of the later zext, and then the later
zext would get simplified away because of the assert. The solution is to
teach SimplifyDemandedBits that assert_zext demands all of the high bits
of its input, rather than only those demanded by its users.
No testcase because the only example I have manifests as llvm-gcc
miscompiling LLVM, and I haven't found a smaller case that reproduces
this problem.
Fixes <rdar://problem/10063365>.
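A sketch of the fix in SimplifyDemandedBits terms, assuming AssertZext
carries the asserted narrow type as operand 1: the bits above the
asserted width are always demanded from the input.
case ISD::AssertZext: {
  EVT VT = cast<VTSDNode>(Op.getOperand(1))->getVT();
  APInt InMask = APInt::getLowBitsSet(BitWidth, VT.getSizeInBits());
  // Demand the asserted-zero high bits (~InMask) no matter what the
  // users of this node demand.
  if (SimplifyDemandedBits(Op.getOperand(0), ~InMask | NewMask,
                           KnownZero, KnownOne, TLO, Depth + 1))
    return true;
  break;
}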
llvm-svn: 139059
to be unreliable on platforms which require memcpy calls, and it is
complicating broader legalize cleanups. It is hoped that these cleanups
will make memcpy byval easier to implement in the future.
llvm-svn: 138977
- On COFF the .lcomm directive has an alignment argument.
- On ELF we fall back to .local + .comm
Based on a patch by NAKAMURA Takumi.
Fixes PR9337, PR9483 and PR10128.
llvm-svn: 138976
An instruction may define part of a register where the other bits are
undefined. In that case, it is safe to rematerialize the instruction.
For example:
%vreg2:ssub_0<def> = VLDRS <cp#0>, 0, pred:14, pred:%noreg, %vreg2<imp-def>
The extra <imp-def> operand indicates that the instruction does not read
the other parts of the virtual register, so a remat is safe.
This patch simply allows multiple def operands for the virtual register.
It is MI->readsVirtualRegister() that determines whether we depend on a
previous value, making remat impossible.
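In sketch form (surrounding names assumed), the safety condition is
simply:
// A partial def is safe to remat as long as the instruction does not
// also read the virtual register, i.e. the other parts are undef.
bool isRematSafe(const MachineInstr *MI, unsigned Reg) {
  return !MI->readsVirtualRegister(Reg);
}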
llvm-svn: 138953
An instruction that redefines only part of a larger register can never
be rematerialized since the virtual register value depends on the old
value in other parts of the register.
This was fixed for the inline spiller in r138794. This patch fixes the
problem for all register allocators, and includes a small test case.
<rdar://problem/10032939>
llvm-svn: 138944
Added canClobberReachingPhysRegUse() to handle a particular pattern in
which a two-address instruction could be forced to interfere with
EFLAGS, causing a compare to be unnecessarily cloned.
Fixes rdar://problem/5875261
llvm-svn: 138924
X86. Modify the pass added in the previous patch to call this new
code.
The new prologue generated will call a libgcc routine (__morestack)
to allocate more stack space from the heap when required.
Patch by Sanjoy Das.
llvm-svn: 138812
Add an instruction flag, hasPostISelHook, which tells the pre-RA scheduler to
call a target hook to adjust the instruction. For ARM, this is used to
adjust instructions which may be setting the 's' flag. ADC, SBC, RSB, and RSC
instructions have implicit def of CPSR (required since it now uses CPSR physical
register dependency rather than "glue"). If the carry flag is used, then the
target hook will *fill in* the optional operand with CPSR. Otherwise, the hook
will remove the CPSR implicit def from the MachineInstr.
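In outline, the hook behaves like this sketch (the carry-use test and
operand positions are assumptions, not the actual code):
void ARMTargetLowering::AdjustInstrPostInstrSelection(MachineInstr *MI,
                                                      SDNode *Node) const {
  // tblgen gave the instruction a trailing implicit CPSR def; the
  // real cc_out slot is the last explicit operand.
  unsigned CcOutIdx = MI->getDesc().getNumOperands() - 1;
  if (carryIsUsed(Node))                        // hypothetical helper
    MI->getOperand(CcOutIdx).setReg(ARM::CPSR); // fill in optional operand
  // Either way, the trailing implicit CPSR def is no longer wanted.
  MI->RemoveOperand(MI->getNumOperands() - 1);
}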
llvm-svn: 138810
I don't really like the patterns, but I'm having trouble coming up with a
better way to handle them.
I plan on making other targets use the same legalization
ARM-without-memory-barriers is using... it's not especially efficient, but
if anyone cares, it's not that hard to fix for a given target if there's
some better lowering.
llvm-svn: 138621
A value of -1 at a call site tells the personality function that this call isn't
handled by the current function. Since the ResumeInsts are converted to calls to
_Unwind_SjLj_Resume, add a (volatile) store of -1 to its 'call site'.
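In IRBuilder terms the store looks roughly like this (the context
struct layout and field index are assumptions):
// func_ctx->call_site = -1: this frame doesn't handle the exception.
Value *CallSite = Builder.CreateStructGEP(FuncCtx, 1, "call_site");
StoreInst *SI = Builder.CreateStore(
    ConstantInt::getSigned(Type::getInt32Ty(Ctx), -1), CallSite);
SI->setVolatile(true);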
llvm-svn: 138416
This is not necessarily the first or dominating use of the EH values. The IR
breaks if it's not. So replace the specific value in the instruction with the
new value.
llvm-svn: 138406
The invoke could be at the end of the entry block. If it's the only one, then we
won't process all of the landingpad instructions correctly. This code is
currently ugly, but should be made much nicer once the new EH switch is thrown.
llvm-svn: 138397
value, we insert a load of the exception object and selector object from memory,
which is where it actually resides. If it's used by a PHI node, we follow that
to where it is being used. Eventually, all landingpad instructions should have
no uses. Any PHI nodes that were associated with those landingpads should be
removed.
llvm-svn: 138302
the intent seems to be to terminate even in Release builds, just use abort()
directly.
If program flow ever reaches a __builtin_unreachable (which llvm_unreachable is
#define'd to on newer GCCs) then the program is undefined.
llvm-svn: 138068
Normally, a partial register def is treated as reading the
super-register unless it also defines the full register like this:
%vreg110:sub_32bit<def> = COPY %vreg77:sub_32bit, %vreg110<imp-def>
This patch also uses the <undef> flag on partial defs to recognize
non-reading operands:
%vreg110:sub_32bit<def,undef> = COPY %vreg77:sub_32bit
This fixes a subtle bug in RegisterCoalescer where LIS->shrinkToUses
would treat a coalesced copy as still reading the register, extending
the live range artificially.
My test case only works when I disable DCE, leaving a dead copy for
RegisterCoalescer, so I am not including it.
<rdar://problem/9967101>
llvm-svn: 138018
The landingpad instruction is lowered into the EXCEPTIONADDR and EHSELECTION
SDNodes. The information from the landingpad instruction is harvested by the
'AddLandingPadInfo' function. The new EH uses the current EH scheme in the
back-end. This will change once we switch over to the new scheme. (Reviewed by
Jakob!)
llvm-svn: 137880
This generates the SDNodes for the new exception handling scheme. It takes the
two values coming from the landingpad instruction and assigns them to the
EXCEPTIONADDR and EHSELECTION nodes.
llvm-svn: 137873
Things are much saner now. We no longer need to modify the landing pads, because
of the invariants we impose upon them. The only thing DwarfEHPrepare needs to do
is convert the 'resume' instruction into a call to '_Unwind_Resume'.
llvm-svn: 137855
MDNodes graph structure such that the compile unit keeps track of
important MDNodes, and update the dwarf writer to process MDNodes
top-down instead of bottom-up.
llvm-svn: 137778
When a variable is inlined in multiple places, the abstract variable
keeps the name, location, type, etc., and all other concrete instances
of the variable refer directly to the abstract variable.
llvm-svn: 137637
This implements the 'landingpad' instruction. It's used to indicate that a basic
block is a landing pad. There are several restrictions on its use (see
LangRef.html for more detail). These restrictions allow the exception handling
code to gather the information it needs in a much more sane way.
This patch has the definition, implementation, C interface, parsing, and bitcode
support in it.
llvm-svn: 137501
The Query class now holds two iterators instead of an InterferenceResult
instance. The iterators are used as bookmarks for repeated
collectInterferingVRegs calls.
llvm-svn: 137380
The InterferenceResult iterator turned out to be less important than we
thought it would be. LiveIntervalUnion clients want higher level
information, like the list of interfering virtual registers.
llvm-svn: 137346
Coalescing can remove copy-like instructions with sub-register operands
that constrained the register class. Examples are:
x86: GR32_ABCD:sub_8bit_hi -> GR32
arm: DPR_VFP2:ssub0 -> DPR
Recompute the register class of any virtual registers that are used by
fewer instructions after coalescing.
This affects code generation for the Cortex-A8, where we use NEON
instructions for f32 operations, cf. fp_convert.ll:
vadd.f32 d16, d1, d0
vcvt.s32.f32 d0, d16
The register allocator is now free to use d16 for the temporary, and
that comes first in the allocation order because it doesn't interfere
with any s-registers.
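In use, the recomputation is a single call per register (API shape as
sketched here):
// After erasing the coalesced copy, relax the class of each register
// the copy had constrained, based on the instructions that remain.
if (MRI->recomputeRegClass(Reg, *TM))
  ++NumInflated;  // e.g. DPR_VFP2 inflated back to DPR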
llvm-svn: 137133