llvm-project

Commit Graph

Author	SHA1	Message	Date
Andrew Trick	1eb4a0da55	SparseSet: Add support for key-derived indexes and arbitrary key types. This nicely handles the most common case of virtual register sets, but also handles anticipated cases where we will map pointers to IDs. The goal is not to develop a completely generic SparseSet template. Instead we want to handle the expected uses within llvm without any template antics in the client code. I'm adding a bit of template nastiness here, and some assumption about expected usage in order to make the client code very clean. The expected common uses cases I'm designing for: - integer keys that need to be reindexed, and may map to additional data - densely numbered objects where we want pointer keys because no number->object map exists. llvm-svn: 155227	2012-04-20 20:05:28 +00:00
Andrew Trick	7405c6d57a	misched: initialize BB llvm-svn: 155226	2012-04-20 20:05:21 +00:00
Andrew Trick	97d5b9cca6	misched: Added CanHandleTerminators. This is a special flag for targets that really want their block terminators in the DAG. The default scheduler cannot handle this correctly, so it becomes the specialized scheduler's responsibility to schedule terminators. llvm-svn: 154712	2012-04-13 23:29:54 +00:00
Benjamin Kramer	411d5a2026	ScheduleDAGInstrs: When adding uses we add them into a set that's empty at the beginning, no need to maintain another set for the added regs. llvm-svn: 152934	2012-03-16 17:38:19 +00:00
Andrew Trick	e6913c7245	misched: add DAG edges from vreg defs to ExitSU. These edges are not really necessary, but it is consistent with the way we currently create physreg edges. Scheduler heuristics that expect a DAG edge to the block terminator could benefit from this change. Although in the future I hope we have a better mechanism for modeling latency across scheduling regions. llvm-svn: 152895	2012-03-16 05:04:25 +00:00
Andrew Trick	8823decdd4	misched: implemented a framework for top-down or bottom-up scheduling. New flags: -misched-topdown, -misched-bottomup. They can be used with the default scheduler or with -misched=shuffle. Without either topdown/bottomup flag -misched=shuffle now alternates scheduling direction. LiveIntervals update is unimplemented with bottom-up scheduling, so only -misched-topdown currently works. Capped the ScheduleDAG hierarchy with a concrete ScheduleDAGMI class. ScheduleDAGMI is aware of the top and bottom of the unscheduled zone within the current region. Scheduling policy can be plugged into the ScheduleDAGMI driver by implementing MachineSchedStrategy. ConvergingScheduler is now the default scheduling algorithm. It exercises the new driver but still does no reordering. llvm-svn: 152700	2012-03-14 04:00:41 +00:00
Andrew Trick	8c207e47c1	misched interface: rename Begin/End to RegionBegin/RegionEnd since they are not private. llvm-svn: 152382	2012-03-09 04:29:02 +00:00
Andrew Trick	9a0c583954	misched prep: Expose the ScheduleDAGInstrs interface so targets may implement their own MachineScheduler. llvm-svn: 152261	2012-03-07 23:01:06 +00:00
Andrew Trick	9b9dea5d07	misched prep: Comment the ScheduleDAGInstrs interface. llvm-svn: 152259	2012-03-07 23:00:59 +00:00
Andrew Trick	926d4736ed	misched prep: Cleanup ScheduleDAGInstrs interface. ScheduleDAGInstrs will be the main interface for MI-level schedulers. Make sure it's readable: one page of protected fields, one page of public methods. llvm-svn: 152258	2012-03-07 23:00:57 +00:00
Andrew Trick	a316faabec	misched prep: rename InsertPos to End. ScheduleDAGInstrs knows nothing about how instructions will be moved or inserted. llvm-svn: 152256	2012-03-07 23:00:52 +00:00
Andrew Trick	52226d409b	misched preparation: rename core scheduler methods for consistency. We had half the API with one convention, half with another. Now was a good time to clean it up. llvm-svn: 152255	2012-03-07 23:00:49 +00:00
Andrew Trick	60cf03e772	misched preparation: clarify ScheduleDAG and ScheduleDAGInstrs roles. ScheduleDAG is responsible for the DAG: SUnits and SDeps. It provides target hooks for latency computation. ScheduleDAGInstrs extends ScheduleDAG and defines the current scheduling region in terms of MachineInstr iterators. It has access to the target's scheduling itinerary data. ScheduleDAGInstrs provides the logic for building the ScheduleDAG for the sequence of MachineInstrs in the current region. Targets can implement highly custom schedulers by extending this class. ScheduleDAGPostRATDList provides the driver and diagnostics for current postRA scheduling. It maintains a current Sequence of scheduled machine instructions and logic for splicing them into the block. During scheduling, it uses the ScheduleHazardRecognizer provided by the target. Specific changes: - Removed driver code from ScheduleDAG. clearDAG is the only interface needed. - Added enterRegion/exitRegion hooks to ScheduleDAGInstrs to delimit the scope of each scheduling region and associated DAG. They should be used to set up and clean up any region-specific state in addition to the DAG itself. This is necessary because we reuse the same ScheduleDAG object for the entire function. The target may extend these hooks to do things at region boundaries, like bundle terminators. The hooks are called even if we decide not to schedule the region. So all instructions in a block are "covered" by these calls. - Added ScheduleDAGInstrs::begin()/end() public API. - Moved Sequence into the driver layer, which is specific to the scheduling algorithm. llvm-svn: 152208	2012-03-07 05:21:52 +00:00
Andrew Trick	e932bb77b5	misched preparation: modularize schedule emission. ScheduleDAG has nothing to do with how the instructions are scheduled. llvm-svn: 152206	2012-03-07 05:21:44 +00:00
Andrew Trick	1b2324d0e8	Cleanup in preparation for misched: Move DAG visualization logic. Soon, ScheduleDAG will not refer to the BB. llvm-svn: 152177	2012-03-07 00:18:22 +00:00
Craig Topper	1d32658877	Use uint16_t to store register overlaps to reduce static data. llvm-svn: 152001	2012-03-04 10:43:23 +00:00
Andrew Trick	9dbbd3e553	PostRA sched: speed up physreg tracking by not abusing SparseSet. llvm-svn: 151348	2012-02-24 07:04:55 +00:00
Andrew Trick	da6a15d90d	misched: cleanup reaching def computation Ignore undef uses completely. Use a more explicit SlotIndex API. Add more explicit comments. llvm-svn: 151233	2012-02-23 03:16:24 +00:00
Andrew Trick	d675a4cec0	PostRASched: Convert physreg def/use tracking to Jakob's SparseSet. Added array subscript to SparseSet for convenience. Slight reorg to make it easier to manage the def/use sets. llvm-svn: 151228	2012-02-23 01:52:38 +00:00
Jakob Stoklund Olesen	033b9add40	Don't compute latencies for regmask operands. llvm-svn: 151211	2012-02-22 22:52:52 +00:00
Andrew Trick	d458e2df8d	misched: Use SparseSet for VRegDefs for constant time clear(). llvm-svn: 151205	2012-02-22 21:59:00 +00:00
Andrew Trick	64ca16e9b8	Comment from code review llvm-svn: 151178	2012-02-22 18:34:49 +00:00
Andrew Trick	db42c6faa4	misched: DAG builder should not track dependencies for SSA defs. The vast majority of virtual register definitions don't need an entry in the DAG builder's VRegDefs set. llvm-svn: 151136	2012-02-22 06:08:13 +00:00
Andrew Trick	46cc9a4aaa	Initialize SUnits before DAG building. Effect on SD scheduling and postRA scheduling: Printing the DAG will display the nodes in top-down topological order. This matches the order within the MBB and makes my life much easier in general. Effect on misched: We don't need to track virtual register uses at all. This is awesome. I also intend to rely on the SUnit ID as a topo-sort index. So if A < B then we cannot have an edge B -> A. llvm-svn: 151135	2012-02-22 06:08:11 +00:00
Andrew Trick	da84e64683	Clear virtual registers after they are no longer referenced. Passes after RegAlloc should be able to rely on MRI->getNumVirtRegs() == 0. This makes sharing code for pre/postRA passes more robust. Now, to check if a pass is running before the RA pipeline begins, use MRI->isSSA(). To check if a pass is running after the RA pipeline ends, use !MRI->getNumVirtRegs(). PEI resets virtual regs when it's done scavenging. PTX will either have to provide its own PEI pass or assign physregs. llvm-svn: 151032	2012-02-21 04:51:23 +00:00
Andrew Trick	59ac4fb706	misched: Initial code for building an MI level scheduling DAG llvm-svn: 148174	2012-01-14 02:17:18 +00:00
Andrew Trick	dbee9d8900	Move physreg dependency generation into aptly named addPhysRegDeps. llvm-svn: 148173	2012-01-14 02:17:15 +00:00
Andrew Trick	1d028a364d	misched: Added ScheduleDAGInstrs::IsPostRA llvm-svn: 148172	2012-01-14 02:17:12 +00:00
Evan Cheng	00b1a3cd7e	Added a late machine instruction copy propagation pass. This catches opportunities that only present themselves after late optimizations such as tail duplication, e.g. ## BB#1: movl %eax, %ecx movl %ecx, %eax ret The register allocator also leaves some of them around (due to false dep between copies from phi-elimination, etc.) This required some changes in codegen passes. Post-ra scheduler and the pseudo-instruction expansion passes have been moved after branch folding and tail merging. They were before branch folding before because it did not always update block livein's. That's fixed now. The pass change makes independently since we want to properly schedule instructions after branch folding / tail duplication. rdar://10428165 rdar://10640363 llvm-svn: 147716	2012-01-07 03:02:36 +00:00
Chandler Carruth	eab5029964	Remove an unused variable. llvm-svn: 147605	2012-01-05 11:25:47 +00:00
Andrew Trick	100af0adf7	Minor postra scheduler cleanup. It could result in more precise antidependence latency on ARM in exceedingly rare cases. llvm-svn: 147594	2012-01-05 02:52:11 +00:00
Evan Cheng	da103bf9ec	Model ARM predicated write as read-mod-write. e.g. r0 = mov #0 r0 = moveq #1 Then the second instruction has an implicit data dependency on the first instruction. Sadly I have yet to come up with a small test case that demonstrates the post-ra scheduler taking advantage of this. llvm-svn: 146583	2011-12-14 20:00:08 +00:00
Evan Cheng	87975df580	Allow target to specify register output dependency. Still default to one. llvm-svn: 146547	2011-12-14 02:28:53 +00:00
Evan Cheng	7fae11b231	- Add MachineInstrBundle.h and MachineInstrBundle.cpp. This includes a function to finalize MI bundles (i.e. add BUNDLE instruction and computing register def and use lists of the BUNDLE instruction) and a pass to unpack bundles. - Teach more of MachineBasic and MachineInstr methods to be bundle aware. - Switch Thumb2 IT block to MI bundles and delete the hazard recognizer hack to prevent IT blocks from being broken apart. llvm-svn: 146542	2011-12-14 02:11:42 +00:00
Evan Cheng	7f8e563a69	Add bundle aware API for querying instruction properties and switch the code generator to it. For non-bundle instructions, these behave exactly the same as the MC layer API. For properties like mayLoad / mayStore, look into the bundle and if any of the bundled instructions has the property it would return true. For properties like isPredicable, only return true if all of the bundled instructions have the property. For properties like canFoldAsLoad, isCompare, conservatively return false for bundles. llvm-svn: 146026	2011-12-07 07:15:52 +00:00
Evan Cheng	2a81dd4a3c	First chunk of MachineInstr bundle support. 1. Added opcode BUNDLE 2. Taught MachineInstr class to deal with bundled MIs 3. Changed MachineBasicBlock iterator to skip over bundled MIs; added an iterator to walk all the MIs 4. Taught MachineBasicBlock methods about bundled MIs llvm-svn: 145975	2011-12-06 22:12:01 +00:00
Hal Finkel	4201820275	make sure ScheduleDAGInstrs::EmitSchedule does not crash when the first instruction in Sequence is a Noop llvm-svn: 145677	2011-12-02 04:58:07 +00:00
Andrew Trick	35c9e51219	PostRA scheduler fix. Clear stale loop dependencies. Fixes <rdar://problem/10235725> llvm-svn: 141357	2011-10-07 06:33:09 +00:00
Andrew Trick	4ef158335b	whitespace llvm-svn: 141356	2011-10-07 06:27:02 +00:00
Evan Cheng	0d639a28aa	Rename TargetSubtarget to TargetSubtargetInfo for consistency. llvm-svn: 134259	2011-07-01 21:01:15 +00:00
Evan Cheng	8264e272a9	Sink SubtargetFeature and TargetInstrItineraries (renamed MCInstrItineraries) into MC. llvm-svn: 134049	2011-06-29 01:14:12 +00:00
Evan Cheng	6cc775f905	- Rename TargetInstrDesc, TargetOperandInfo to MCInstrDesc and MCOperandInfo and sink them into MC layer. - Added MCInstrInfo, which captures the tablegen generated static data. Change TargetInstrInfo so it's based off MCInstrInfo. llvm-svn: 134021	2011-06-28 19:10:37 +00:00
Devang Patel	5ca0837397	Remove dead code. llvm-svn: 132488	2011-06-02 21:31:00 +00:00
Devang Patel	f02a376fbc	Update DBG_VALUEs while breaking anti dependencies. llvm-svn: 132487	2011-06-02 21:26:52 +00:00
Devang Patel	e5feef0fe1	During post RA scheduling, do not try to chase reg defs. to preserve DBG_VALUEs. This approach has several downsides, for example, it does not work when dbg value is a constant integer, it does not work if reg is defined more than once, it places end of debug value range markers in the wrong place. It even causes misleading incorrect debug info when duplicate DBG_VALUE instructions point to same reg def. Instead, use simpler approach and let DBG_VALUE follow its predecessor instruction. After live debug value analysis pass, all DBG_VALUE instructions are placed at the right place. Thanks Jakob for the hint! llvm-svn: 132483	2011-06-02 20:07:12 +00:00
Andrew Trick	2e116a4491	Added an assertion, and updated a comment. llvm-svn: 131022	2011-05-06 21:52:52 +00:00
Andrew Trick	3dc73aae5e	ARM post RA scheduler compile time fix. BuildSchedGraph was quadratic in the number of calls in the basic block. After this fix, it keeps only a single call at the top of the DefList so compile time doesn't blow up on large blocks. This reduces postRA sched time on an external test case from 81s to 0.3s. Although r130800 (reduced ARM register alias defs) also partially fixes the issue by reducing the constant overhead of checking call interference by an order of magnitude. Fixes <rdar://problem/7662664> very poor compile time with post RA scheduling. llvm-svn: 130943	2011-05-05 19:32:21 +00:00
Andrew Trick	24b1c48514	whitespace llvm-svn: 130942	2011-05-05 19:24:06 +00:00
Chris Lattner	0ab5e2cded	Fix a ton of comment typos found by codespell. Patch by Luis Felipe Strano Moraes! llvm-svn: 129558	2011-04-15 05:18:47 +00:00
Evan Cheng	6eb516dbea	Do not model all INLINEASM instructions as having unmodelled side effects. Instead encode llvm IR level property "HasSideEffects" in an operand (shared with IsAlignStack). Added MachineInstrs::hasUnmodeledSideEffects() to check the operand when the instruction is an INLINEASM. This allows memory instructions to be moved around INLINEASM instructions. llvm-svn: 123044	2011-01-07 23:50:32 +00:00
Dan Gohman	a4fcd2418d	Move Value::getUnderlyingObject to be a standalone function so that it can live in Analysis instead of VMCore. llvm-svn: 121885	2010-12-15 20:02:24 +00:00
Evan Cheng	debf9c502a	Two sets of changes. Sorry they are intermingled. 1. Fix pre-ra scheduler so it doesn't try to push instructions above calls to "optimize for latency". Call instructions don't have the right latency and this is more likely to introduce spills. 2. Fix if-converter cost function. For ARM, it should use instruction latencies, not # of micro-ops since multi-latency instructions are completely executed even when the predicate is false. Also, some instructions will be "slower" when they are predicated due to the register def becoming implicit input. rdar://8598427 llvm-svn: 118135	2010-11-03 00:45:17 +00:00
Evan Cheng	cbdf7e874a	Putting r117193 back except for the compile time cost. Rather than assuming fallthroughs uses all registers, just gather the union of all successor liveins. llvm-svn: 117506	2010-10-27 23:17:17 +00:00
Evan Cheng	43d6f34e9f	Neuter r117193 as it causes significant post-ra scheduler compile time regression. llvm-svn: 117329	2010-10-25 23:56:21 +00:00
Evan Cheng	15459b695f	Properly model the latency of register defs which are 1) function returns or 2) live-outs. Previously the post-RA schedulers completely ignored these dependencies since returns, branches, etc. are all scheduling barriers. This patch models the latencies between instructions being scheduled and the barriers. It also handles calls by marking their register uses. llvm-svn: 117193	2010-10-23 02:10:46 +00:00
Evan Cheng	df2aae0c5a	Avoid compiler warning: comparison between signed and unsigned integer. llvm-svn: 116119	2010-10-08 23:01:57 +00:00
Evan Cheng	8c5e7e51bd	Fix operand latency computation in cases where the definition operand is implicit. e.g. %D6<def>, %D7<def> = VLD1q16 %R2<kill>, 0, ..., %Q3<imp-def> %Q1<def> = VMULv8i16 %Q1<kill>, %Q3<kill>, ... The real definition indices are 0,1. llvm-svn: 116080	2010-10-08 18:42:25 +00:00
Nick Lewycky	ec0da969fb	Remove unused variables. llvm-svn: 115802	2010-10-06 18:11:50 +00:00
Evan Cheng	49d4c0bd18	- Add TargetInstrInfo::getOperandLatency() to compute operand latencies. This allows targets to correctly compute latency for cases where static scheduling itineraries aren't sufficient. e.g. variable_ops instructions such as ARM::ldm. This also allows targets without scheduling itineraries to compute operand latencies. e.g. X86 can return (approximated) latencies for high latency instructions such as division. - Compute operand latencies for those defined by load multiple instructions, e.g. ldm and those used by store multiple instructions, e.g. stm. llvm-svn: 115755	2010-10-06 06:27:31 +00:00
Evan Cheng	4a010fd1ea	Model Cortex-a9 load to SUB, RSB, ADD, ADC, SBC, RSC, CMN, MVN, or CMP pipeline forwarding path. llvm-svn: 115098	2010-09-29 22:42:35 +00:00
Evan Cheng	bf4070756f	Teach if-converter to be more careful with predicating instructions that would take multiple cycles to decode. For the current if-converter clients (actually only ARM), the instructions that are predicated on false are not nops. They would still take machine cycles to decode. Micro-coded instructions such as LDM / STM can potentially take multiple cycles to decode. If-converter should take treat them as non-micro-coded simple instructions. llvm-svn: 113570	2010-09-10 01:29:16 +00:00
Bob Wilson	56c006561c	Change ScheduleDAGInstrs::Defs and ::Uses to be variable-size vectors instead of fixed size arrays, so that increasing FirstVirtualRegister to 16K won't cause a compile time performance regression. llvm-svn: 109330	2010-07-24 06:01:53 +00:00
Bill Wendling	2da75ef315	Use std::vector instead of TargetRegisterInfo::FirstVirtualRegister. llvm-svn: 108452	2010-07-15 20:04:36 +00:00
Jim Grosbach	604560c5fe	Fix the post-RA instruction scheduler to handle instructions referenced by more than one dbg_value instruction. rdar://7759363 llvm-svn: 104174	2010-05-19 22:57:06 +00:00
Dan Gohman	25c1653700	Get rid of the EdgeMapping map. Instead, just check for BasicBlock changes before doing phi lowering for switches. llvm-svn: 102809	2010-05-01 00:01:06 +00:00
Dan Gohman	1f0f2142cc	Fix -Wcast-qual warnings. llvm-svn: 101655	2010-04-17 17:42:52 +00:00
Evan Cheng	14694d3666	Reduce indentation. llvm-svn: 99214	2010-03-22 21:24:33 +00:00
Evan Cheng	760dc65d59	80 col violation. llvm-svn: 99195	2010-03-22 18:40:50 +00:00
Dale Johannesen	49de0607a8	Progress towards shepherding debug info through SelectionDAG. No functional effect yet. This is still evolving and should not be viewed as final. llvm-svn: 98195	2010-03-10 22:13:47 +00:00
Duncan Sands	19d0b47b1f	There are two ways of checking for a given type, for example isa<PointerType>(T) and T->isPointerTy(). Convert most instances of the first form to the second form. Requested by Chris. llvm-svn: 96344	2010-02-16 11:11:14 +00:00
David Goodwin	d2f9c044c0	Fix dependencies added to model memory aliasing for post-RA scheduling. The dependencies were overly conservative for memory access that are known not to alias. llvm-svn: 86580	2009-11-09 19:22:17 +00:00
David Goodwin	28ba4f27d1	Correctly add chain dependencies around calls and unknown-side-effect instructions. llvm-svn: 86080	2009-11-05 00:16:44 +00:00
David Goodwin	a86f919763	<rdar://problem/7352605>. When building schedule graph use mayAlias information to avoid chaining loads/stores of spill slots with non-aliased memory ops. llvm-svn: 85934	2009-11-03 20:15:00 +00:00
David Goodwin	00822aabf6	Chain dependencies used to enforce memory order should have latency of 0 (except for true dependency of Store followed by aliased Load... we estimate that case with a single cycle of latency assuming the hardware will bypass) llvm-svn: 85807	2009-11-02 17:06:28 +00:00
Dan Gohman	9aba0d9988	When checking whether a def of an aliased register is dead, ask the machineinstr whether the aliased register is dead, rather than the original register is dead. This allows it to get the correct answer when examining an instruction like this: CALLpcrel32 <ga:foo>, %AL<imp-def>, %EAX<imp-def,dead> where EAX is dead but a subregister of it is still live. This fixes PR5294. llvm-svn: 85135	2009-10-26 18:26:18 +00:00
Evan Cheng	f0236e011e	Spill slots cannot alias. llvm-svn: 84432	2009-10-18 19:58:47 +00:00
Evan Cheng	0e9d9ca855	- Revert parts of 84326 and 84411. Distinguishing between fixed and non-fixed stack slots and giving them different PseudoSourceValue's did not fix the problem of post-alloc scheduling miscompiling llvm itself. - Apply Dan's conservative workaround by assuming any non fixed stack slots can alias other memory locations. This means a load from spill slot #1 cannot move above a store of spill slot #2. - Enable post-alloc scheduling for x86 at optimization level Default and above. llvm-svn: 84424	2009-10-18 18:16:27 +00:00
Dan Gohman	87b02d5bbc	Factor out LiveIntervalAnalysis' code to determine whether an instruction is trivially rematerializable and integrate it into TargetInstrInfo::isTriviallyReMaterializable. This way, all places that need to know whether an instruction is rematerializable will get the same answer. This enables the useful parts of the aggressive-remat option by default -- using AliasAnalysis to determine whether a memory location is invariant, and removes the questionable parts -- rematting operations with virtual register inputs that may not be live everywhere. llvm-svn: 83687	2009-10-09 23:27:56 +00:00
Dan Gohman	be8137b0b4	Replace TargetInstrInfo::isInvariantLoad and its target-specific implementations with a new MachineInstr::isInvariantLoad, which uses MachineMemOperands and is target-independent. This brings MachineLICM and other functionality to targets which previously lacked an isInvariantLoad implementation. llvm-svn: 83475	2009-10-07 17:38:06 +00:00
Dan Gohman	48b185d6f7	Improve MachineMemOperand handling. - Allocate MachineMemOperands and MachineMemOperand lists in MachineFunctions. This eliminates MachineInstr's std::list member and allows the data to be created by isel and live for the remainder of codegen, avoiding a lot of copying and unnecessary translation. This also shrinks MemSDNode. - Delete MemOperandSDNode. Introduce MachineSDNode which has dedicated fields for MachineMemOperands. - Change MemSDNode to have a MachineMemOperand member instead of its own fields with the same information. This introduces some redundancy, but it's more consistent with what MachineInstr will eventually want. - Ignore alignment when searching for redundant loads for CSE, but remember the greatest alignment. Target-specific code which previously used MemOperandSDNodes with generic SDNodes now use MemIntrinsicSDNodes, with opcodes in a designated range so that the SelectionDAG framework knows that MachineMemOperand information is available. llvm-svn: 82794	2009-09-25 20:36:54 +00:00
Evan Cheng	270d0f986f	Enhance EmitInstrWithCustomInserter() so target can specify CFG changes that sdisel will use to properly complete phi nodes. Not functionality change yet. llvm-svn: 82273	2009-09-18 21:02:19 +00:00
David Goodwin	9b48cd4899	Use the schedule itinerary operand use/def cycle information to adjust dependence edge latency for post-RA scheduling. llvm-svn: 79425	2009-08-19 16:08:58 +00:00
David Goodwin	90e6b8b708	Add callback to allow target to adjust latency of schedule dependency edge. llvm-svn: 78910	2009-08-13 16:05:04 +00:00
David Goodwin	6021b4dccc	Post RA scheduler changes. Introduce a hazard recognizer that uses the target schedule information to accurately model the pipeline. Update the scheduler to correctly handle multi-issue targets. llvm-svn: 78563	2009-08-10 15:55:25 +00:00
Dan Gohman	6c0c21954c	Fix a typo in a comment. llvm-svn: 78362	2009-08-07 01:26:06 +00:00
Dan Gohman	58b0e71886	Eliminate yet another copy of getOpcode. llvm-svn: 76236	2009-07-17 20:58:59 +00:00
Dan Gohman	80a9942593	Move isLCSSAForm, isLoopInvariant, getCanonicalInductionVariable, and related functions out of LoopBase and into Loop, since they are specific to BasicBlock-based loops. This also allows the code to be moved out-of-line. llvm-svn: 75523	2009-07-13 22:02:44 +00:00
Dan Gohman	dfaf646c34	When scheduling a block in parts, keep track of the overall instruction index across each part. Instruction indices are used to make live range queries, and live ranges can extend beyond scheduling region boundaries. Refactor the ScheduleDAGSDNodes class some more so that it doesn't have to worry about this additional information. llvm-svn: 64288	2009-02-11 04:27:20 +00:00
Dan Gohman	b95434356c	Factor out more code for computing register live-range information for scheduling, and generalize it so that it preserves state across scheduling regions. This fixes incorrect live-range information around terminators and labels, which are effective region boundaries. In place of looking for terminators to anchor inter-block dependencies, introduce special entry and exit scheduling units for this purpose. llvm-svn: 64254	2009-02-10 23:27:53 +00:00
Dan Gohman	f4b08b4f6c	Move ScheduleDAGInstrs.h to be a private header. Front-ends that used this header to select a scheduling policy should use SchedulerRegistry.h instead (llvm-gcc and clang were updated a while ago). llvm-svn: 63934	2009-02-06 17:12:10 +00:00
Dan Gohman	1ee0d41ef8	Fix a post-RA scheduling dependency bug. If a MachineInstr doesn't have a memoperand but has an opcode that is known to load or store, assume its memory reference may alias anything, including stack slots which the compiler completely controls. To partially compensate for this, teach the ScheduleDAG building code to do basic getUnderlyingValue analysis. This greatly reduces the number of instructions that require restrictive dependencies. This code will need to be revisited when we start doing real alias analysis, but it should suffice for now. llvm-svn: 63370	2009-01-30 02:49:14 +00:00
Dan Gohman	5f8a2598b2	Instead of adding dependence edges between terminator instructions and every other instruction in their blocks to keep the terminator instructions at the end, teach the post-RA scheduler how to operate on ranges of instructions, and exclude terminators from the range of instructions that get scheduled. Also, exclude mid-block labels, such as EH_LABEL instructions, and schedule code before them separately from code after them. This fixes problems with the post-RA scheduler moving code past EH_LABELs. llvm-svn: 62366	2009-01-16 22:10:20 +00:00
Dan Gohman	619ef48a52	Move a few containers out of ScheduleDAGInstrs::BuildSchedGraph and into the ScheduleDAGInstrs class, so that they don't get destructed and re-constructed for each block. This fixes a compile-time hot spot in the post-pass scheduler. To help facilitate this, tidy and do some minor reorganization in the scheduler constructor functions. llvm-svn: 62275	2009-01-15 19:20:50 +00:00
Dan Gohman	12f2490489	Clean up the atomic opcodes in SelectionDAG. This removes all the _8, _16, _32, and _64 opcodes and replaces each group with an unsuffixed opcode. The MemoryVT field of the AtomicSDNode is now used to carry the size information. In tablegen, the size-specific opcodes are replaced by size-independent opcodes that utilize the ability to compose them with predicates. This shrinks the per-opcode tables and makes the code that handles atomics much more concise. llvm-svn: 61389	2008-12-23 21:37:04 +00:00
Dan Gohman	04543e719e	Rename BuildSchedUnits to BuildSchedGraph, and refactor the code in ScheduleDAGSDNodes' BuildSchedGraph into separate functions. llvm-svn: 61376	2008-12-23 18:36:58 +00:00
Dan Gohman	072e52f170	Use isTerminator() instead of isBranch()||isReturn() in several places. isTerminator() returns true for a superset of cases, and includes things like FP_REG_KILL, which are neither return nor branch but aren't safe to move/remat/etc. llvm-svn: 61373	2008-12-23 17:28:50 +00:00
Dan Gohman	b9a012156b	Add initial support for back-scheduling address computations, especially in the case of addresses computed from loop induction variables. llvm-svn: 61075	2008-12-16 03:35:01 +00:00
Dan Gohman	dddc1ac7ea	Fix some register-alias-related bugs in the post-RA scheduler liveness computation code. Also, avoid adding output-dependency edges when both defs are dead, which frequently happens with EFLAGS defs. Compute Depth and Height lazily, and always in terms of edge latency values. For the schedulers that don't care about latency, edge latencies are set to 1. Eliminate Cycle and CycleBound, and LatencyPriorityQueue's Latencies array. These are all subsumed by the Depth and Height fields. llvm-svn: 61073	2008-12-16 03:25:46 +00:00
Dan Gohman	8f782bbb28	Add a simple target-independent heuristic to allow targets with no instruction itinerary data to back-schedule loads. llvm-svn: 61070	2008-12-16 02:38:22 +00:00
Dan Gohman	2d170896ee	Rewrite the SDep class, and simplify some of the related code. The Cost field is removed. It was only being used in a very limited way, to indicate when the scheduler should attempt to protect a live register, and it isn't really needed to do that. If we ever want the scheduler to start inserting copies in non-prohibitive situations, we'll have to rethink some things anyway. A Latency field is added. Instead of giving each node a single fixed latency, each edge can have its own latency. This will eventually be used to model various micro-architecture properties more accurately. The PointerIntPair class and an internal union are now used, which reduce the overall size. llvm-svn: 60806	2008-12-09 22:54:47 +00:00
Dan Gohman	f90d3b096a	Fix the top-level comments, and fix some 80-column violations. llvm-svn: 60707	2008-12-08 17:50:35 +00:00
Dan Gohman	3aab10b932	Add minimal support for disambiguating memory references. Currently the main thing this covers is spills to distinct spill slots. llvm-svn: 60517	2008-12-04 01:35:46 +00:00
Dan Gohman	d2b10368ed	Pass the isAntiDep argument. llvm-svn: 59968	2008-11-24 17:24:27 +00:00
Dan Gohman	57d0b88830	Correctly set the isCtrl flag for chain dependencies. llvm-svn: 59837	2008-11-21 19:17:25 +00:00
Dan Gohman	546bcfe8d6	Update comments. llvm-svn: 59836	2008-11-21 19:16:58 +00:00
Dan Gohman	d7d1fd7eb7	Set the isAntiDep flag in the MachineInstr scheduler. llvm-svn: 59787	2008-11-21 02:38:21 +00:00
Dan Gohman	d1f33e2397	Use ComputeLatency in the MachineInstr scheduler. llvm-svn: 59777	2008-11-21 01:44:51 +00:00
Dan Gohman	7b7ca502fa	Implement ComputeLatency for MachineInstr ScheduleDAGs. Factor some of the latency computation logic out of the SDNode ScheduleDAG code into a TargetInstrItineraries helper method to help with this. llvm-svn: 59761	2008-11-21 00:12:10 +00:00
Dan Gohman	22e9677a5e	Treat mid-block labels the same as terminators when building the MachineInstr scheduling DAG, meaning they implicitly depend on all preceding defs. This fixes Benchmarks/Shootout-C++/except and Regression/C++/EH/simple_rethrow in -relocation-model=pic -disable-post-RA-scheduler=false mode. llvm-svn: 59747	2008-11-20 19:58:35 +00:00
Dan Gohman	60cb69e665	Experimental post-pass scheduling support. Post-pass scheduling is currently off by default, and can be enabled with -disable-post-RA-scheduler=false. This doesn't have a significant impact on most code yet because it doesn't yet do anything to address anti-dependencies and it doesn't attempt to disambiguate memory references. Also, several popular targets don't have pipeline descriptions yet. The majority of the changes here are splitting the SelectionDAG-specific code out of ScheduleDAG, so that ScheduleDAG can be moved to libLLVMCodeGen.a. The interface between ScheduleDAG-using code and the rest of the scheduling code is somewhat rough and will evolve. llvm-svn: 59676	2008-11-19 23:18:57 +00:00


260 Commits