llvm-project

Commit Graph

Author	SHA1	Message	Date
Benjamin Kramer	079b96e6f7	Revert "Give internal classes hidden visibility." It works with clang, but GCC has different rules so we can't make all of those hidden. This reverts commit r190534. llvm-svn: 190536	2013-09-11 18:05:11 +00:00
Benjamin Kramer	6a44af3629	Give internal classes hidden visibility. Worth 100k on a linux/x86_64 Release+Asserts clang. llvm-svn: 190534	2013-09-11 17:42:27 +00:00
Benjamin Kramer	e2a1d89e14	Switch spill weights from a basic loop depth estimation to BlockFrequencyInfo. The main advantages here are way better heuristics, taking into account not just loop depth but also __builtin_expect and other static heuristics and will eventually learn how to use profile info. Most of the work in this patch is pushing the MachineBlockFrequencyInfo analysis into the right places. This is good for a 5% speedup on zlib's deflate (x86_64), there were some very unfortunate spilling decisions in its hottest loop in longest_match(). Other benchmarks I tried were mostly neutral. This changes register allocation in subtle ways, update the tests for it. 2012-02-20-MachineCPBug.ll was deleted as it's very fragile and the instruction it looked for was gone already (but the FileCheck pattern picked up unrelated stuff). llvm-svn: 184105	2013-06-17 19:00:36 +00:00
Jakob Stoklund Olesen	994fed689f	Make SplitAnalysis::UseSlots private. llvm-svn: 148031	2012-01-12 17:53:44 +00:00
Jakob Stoklund Olesen	67aec12409	Exclusively use SplitAnalysis::getLastSplitPoint(). Delete the alternative implementation in LiveIntervalAnalysis. These functions computed the same thing, but SplitAnalysis caches the result. llvm-svn: 147911	2012-01-11 02:07:00 +00:00
Jakob Stoklund Olesen	a98af39856	Hoist back-copies to the least busy dominator. When a back-copy is hoisted to the nearest common dominator, keep looking up the dominator tree for a less loopy dominator, and place the back-copy there instead. Don't do this when a single existing back-copy dominates all the others. Assume the client knows what he is doing, and keep the dominating back-copy. This prevents us from hoisting back-copies into loops in most cases. If a value is defined in a loop with multiple exits, we may still hoist back-copies into that loop. That is the speed/size tradeoff. llvm-svn: 139698	2011-09-14 16:45:39 +00:00
Jakob Stoklund Olesen	5d4277ddfa	Distinguish complex mapped values from forced recomputation. When a ParentVNI maps to multiple defs in a new interval, its live range may still be derived directly from RegAssign by transferValues(). On the other hand, when instructions have been rematerialized or hoisted, it may be necessary to completely recompute live ranges using LiveRangeCalc::extend() to all uses. Use a bit in the value map to indicate that a live range must be recomputed. Rename markComplexMapped() to forceRecompute(). This fixes some live range verification errors when -split-spill-mode=size hoists back-copies by recomputing source ranges when RegAssign kills can't be moved. llvm-svn: 139660	2011-09-13 23:09:04 +00:00
Jakob Stoklund Olesen	a25330f0d7	Implement -split-spill-mode=size. Whenever the complement interval is defined by multiple copies of the same value, hoist those back-copies to the nearest common dominator. This ensures that at most one copy is inserted per value in the complement inteval, and no phi-defs are needed. llvm-svn: 139651	2011-09-13 22:22:39 +00:00
Jakob Stoklund Olesen	4484f99175	Add SplitEditor::markOverlappedComplement(). This function is used to flag values where the complement interval may overlap other intervals. Call it from overlapIntv, and use the flag to fully recompute those live ranges in transferValues(). llvm-svn: 139612	2011-09-13 18:05:29 +00:00
Jakob Stoklund Olesen	820c8fd0db	Eliminate the extendRange() wrapper. llvm-svn: 139608	2011-09-13 17:38:57 +00:00
Jakob Stoklund Olesen	054984d75b	Use a separate LiveRangeCalc for the complement in spill modes. The complement interval may overlap the other intervals created, so use a separate LiveRangeCalc instance to compute its live range. A LiveRangeCalc instance can only be shared among non-overlapping intervals. llvm-svn: 139603	2011-09-13 16:47:53 +00:00
Jakob Stoklund Olesen	487f2a37bf	Extract live range calculations from SplitKit. SplitKit will soon need two copies of these data structures, and the algorithms will also be useful when LiveIntervalAnalysis becomes independent of LiveVariables. llvm-svn: 139572	2011-09-13 01:34:21 +00:00
Jakob Stoklund Olesen	eecb2fb183	Add an interface for SplitKit complement spill modes. SplitKit always computes a complement live range to cover the places where the original live range was live, but no explicit region has been allocated. Currently, the complement live range is created to be as small as possible - it never overlaps any of the regions. This minimizes register pressure, but if the complement is going to be spilled anyway, that is not very important. The spiller will eliminate redundant spills, and hoist others by making the spill slot live range overlap some of the regions created by splitting. Stack slots are cheap. This patch adds the interface to enable spill modes in SplitKit. In spill mode, SplitKit will assume that the complement is going to spill, so it will allow it to overlap regions in order to avoid back-copies. By doing some of the spiller's work early, the complement live range becomes simpler. In some cases, it can become much simpler because no extra PHI-defs are required. This will speed up both splitting and spilling. This is only the interface to enable spill modes, no implementation yet. llvm-svn: 139500	2011-09-12 16:49:21 +00:00
Jakob Stoklund Olesen	72c0ddfbc4	Update comments to reflect some (not so) recent changes. llvm-svn: 139498	2011-09-12 16:03:26 +00:00
Jakob Stoklund Olesen	cdf9ad9107	Delete getMultiUseBlocks and splitSingleBlocks. These functions are no longer used, and they are easily replaced with a loop calling shouldSplitSingleBlock and splitSingleBlock. llvm-svn: 136993	2011-08-05 22:52:17 +00:00
Jakob Stoklund Olesen	8627ea91cb	Split around single instructions to enable register class inflation. Normally, we don't create a live range for a single instruction in a basic block, the spiller does that anyway. However, when splitting a live range that belongs to a proper register sub-class, inserting these extra COPY instructions completely remove the constraints from the remainder interval, and it may be allocated from the larger super-class. The spiller will mop up these small live ranges if we end up spilling anyway. It calls them snippets. llvm-svn: 136989	2011-08-05 22:20:45 +00:00
Jakob Stoklund Olesen	43859a6ad2	Rename {First,Last}Use to {First,Last}Instr. With a 'FirstDef' field right there, it is very confusing that FirstUse refers to an instruction that may be a def. llvm-svn: 136739	2011-08-02 22:54:14 +00:00
Jakob Stoklund Olesen	ae8027cc95	Add a BlockInfo::FirstDef field. This is either an invalid SlotIndex, or valno->def for the first value defined inside the block. PHI values are not counted as defined inside the block. The FirstDef field will be used when estimating the cost of spilling around a block. llvm-svn: 136736	2011-08-02 22:37:22 +00:00
Jakob Stoklund Olesen	f047ff4fe1	Delete BlockInfo::LiveThrough. It wasn't used any more. llvm-svn: 136735	2011-08-02 22:37:20 +00:00
Jakob Stoklund Olesen	795da1c108	Extract parts of RAGreedy::splitAroundRegion as SplitKit methods. This gets rid of some of the gory splitting details in RAGreedy and makes them available to future SplitKit clients. Slightly generalize the functionality to support multi-way splitting. Specifically, SplitEditor::splitLiveThroughBlock() supports switching between different register intervals in a block. llvm-svn: 135307	2011-07-15 21:47:57 +00:00
Jakob Stoklund Olesen	adc6a4ca5d	Reapply r134047 now that the world is ready for it. This patch will sometimes choose live range split points next to interference instead of always splitting next to a register point. That means spill code can now appear almost anywhere, and it was necessary to fix code that didn't expect that. The difficult places were: - Between a CALL returning a value on the x87 stack and the corresponding FpPOP_RETVAL (was FpGET_ST0). Probably also near x87 inline assembly, but that didn't actually show up in testing. - Between a CALL popping arguments off the stack and the corresponding ADJCALLSTACKUP. Both are fixed now. The only place spill code can't appear is after terminators, see SplitAnalysis::getLastSplitPoint. Original commit message: Rewrite RAGreedy::splitAroundRegion, now with cool ASCII art. This function has to deal with a lot of special cases, and the old version got it wrong sometimes. In particular, it would sometimes leave multiple uses in the stack interval in a single block. That causes bad code with multiple reloads in the same basic block. The new version handles block entry and exit in a single pass. It first eliminates all the easy cases, and then goes on to create a local interval for the blocks with difficult interference. Previously, we would only create the local interval for completely isolated blocks. It can happen that the stack interval becomes completely empty because we could allocate a register in all edge bundles, and the new local intervals deal with the interference. The empty stack interval is harmless, but we need to remove a SplitKit assertion that checks for empty intervals. llvm-svn: 134125	2011-06-30 01:30:39 +00:00
Jakob Stoklund Olesen	8628435c06	Revert r134047 while investigating a llvm-gcc-i386-linux-selfhost miscompile. llvm-svn: 134053	2011-06-29 02:03:36 +00:00
Jakob Stoklund Olesen	ffbc05b715	Rewrite RAGreedy::splitAroundRegion, now with cool ASCII art. This function has to deal with a lot of special cases, and the old version got it wrong sometimes. In particular, it would sometimes leave multiple uses in the stack interval in a single block. That causes bad code with multiple reloads in the same basic block. The new version handles block entry and exit in a single pass. It first eliminates all the easy cases, and then goes on to create a local interval for the blocks with difficult interference. Previously, we would only create the local interval for completely isolated blocks. It can happen that the stack interval becomes completely empty because we could allocate a register in all edge bundles, and the new local intervals deal with the interference. The empty stack interval is harmless, but we need to remove a SplitKit assertion that checks for empty intervals. llvm-svn: 134047	2011-06-29 00:24:24 +00:00
Jakob Stoklund Olesen	ec43d5d780	Reapply r132245 with a fix for the bug that broke the darwin9/i386 build. llvm-svn: 132309	2011-05-30 01:33:26 +00:00
Jakob Stoklund Olesen	ca6a4d8940	Revert r132245, "Create two BlockInfo entries when a live range is discontinuous through a block." This commit seems to have broken a darwin 9 tester. llvm-svn: 132299	2011-05-29 21:24:39 +00:00
Jakob Stoklund Olesen	fd3f71ef3a	Create two BlockInfo entries when a live range is discontinuous through a block. Delete the Kill and Def markers in BlockInfo. They are no longer necessary when BlockInfo describes a continuous live range. This only affects the relatively rare kind of basic block where a live range looks like this: \|---x o---\| Now live range splitting can pretend that it is looking at two blocks: \|---x o---\| This allows the code to be simplified a bit. llvm-svn: 132245	2011-05-28 02:33:00 +00:00
Jakob Stoklund Olesen	5cc91b2611	Add SplitAnalysis::getNumLiveBlocks(). It is important that this function returns the same number of live blocks as countLiveBlocks(CurLI) because live range splitting uses the number of live blocks to ensure it is making progress. This is in preparation of supporting duplicate UseBlock entries for basic blocks that have a virtual register live-in and live-out, but not live-though. llvm-svn: 132244	2011-05-28 02:32:57 +00:00
Jakob Stoklund Olesen	eaa6ed1ad8	Gracefully handle invalid live ranges. Fix PR9831. Register coalescing can sometimes create live ranges that end in the middle of a basic block without any killing instruction. When SplitKit detects this, it will repair the live range by shrinking it to its uses. Live range splitting also needs to know about this. When the range shrinks so much that it becomes allocatable, live range splitting fails because it can't find a good split point. It is paranoid about making progress, so an allocatable range is considered an error. The coalescer should really not be creating these bad live ranges. They appear when coalescing dead copies. llvm-svn: 130787	2011-05-03 20:42:13 +00:00
Jakob Stoklund Olesen	eef2327360	Add a safe-guard against repeated splitting for some rare cases. The number of blocks covered by a live range must be strictly decreasing when splitting, otherwise we can't allow repeated splitting. llvm-svn: 130249	2011-04-26 22:33:12 +00:00
Sebastian Redl	b8a62aa3c9	Give SplitKit.h a header guard. llvm-svn: 130095	2011-04-24 15:46:51 +00:00
Jakob Stoklund Olesen	6a663b8dc8	Allow allocatable ranges from global live range splitting to be split again. These intervals are allocatable immediately after splitting, but they may be evicted because of later splitting. This is rare, but when it happens they should be split again. The remainder intervals that cannot be allocated after splitting still move directly to spilling. SplitEditor::finish can optionally provide a mapping from new live intervals back to the original interval indexes returned by openIntv(). Each original interval index can map to multiple new intervals after connected components have been separated. Dead code elimination may also add existing intervals to the list. The reverse mapping allows the SplitEditor client to treat the new intervals differently depending on the split region they came from. llvm-svn: 129925	2011-04-21 18:38:15 +00:00
Jakob Stoklund Olesen	1af8b4dc92	Teach the SplitKit blitter to handle multiply defined values as well. The transferValues() function can now handle both singly and multiply defined values, as long as the resulting live range is known. Only rematerialized values have their live range recomputed by extendRange(). The updateSSA() function can now insert PHI values in bulk across multiple values in multiple target registers in one pass. The list of blocks received from transferValues() is in layout order which seems to work well for the iterative algorithm. Blocks from extendRange() are still in reverse BFS order, but this function is used so rarely now that it doesn't matter. llvm-svn: 129580	2011-04-15 17:24:49 +00:00
Jakob Stoklund Olesen	cda53febec	Stop using dead function. llvm-svn: 129442	2011-04-13 15:00:11 +00:00
Jakob Stoklund Olesen	c49df2c05a	SparseBitVector is SLOW. Use a Bitvector instead, we didn't need the smaller memory footprint anyway. This makes the greedy register allocator 10% faster. llvm-svn: 129390	2011-04-12 21:30:53 +00:00
Jakob Stoklund Olesen	c70b697a40	Create new intervals for isolated blocks during region splitting. This merges the behavior of splitSingleBlocks into splitAroundRegion, so the RS_Region and RS_Block register stages can be coalesced. That means the leftover intervals after region splitting go directly to spilling instead of a second pass of per-block splitting. llvm-svn: 129379	2011-04-12 19:32:53 +00:00
Jakob Stoklund Olesen	0840f50b76	Add SplitKit API to query and select the current interval being worked on. This makes it possible to target multiple registers in one pass. llvm-svn: 129374	2011-04-12 18:11:31 +00:00
Jakob Stoklund Olesen	ed47ed4e80	Build the Hopfield network incrementally when splitting global live ranges. It is common for large live ranges to have few basic blocks with register uses and many live-through blocks without any uses. This approach grows the Hopfield network incrementally around the use blocks, completely avoiding checking interference for some through blocks. llvm-svn: 129188	2011-04-09 02:59:09 +00:00
Jakob Stoklund Olesen	bf91c4e85e	Analyze blocks with uses separately from live-through blocks without uses. About 90% of the relevant blocks are live-through without uses, and the only information required about them is their number. This saves memory and enables later optimizations that need to look at only the use-blocks. llvm-svn: 128985	2011-04-06 03:57:00 +00:00
Jakob Stoklund Olesen	fe6e07fd8a	Use std::unique instead of a SmallPtrSet to ensure unique instructions in UseSlots. This allows us to always keep the smaller slot for an instruction which is what we want when a register has early clobber defines. Drop the UsingInstrs set and the UsingBlocks map. They are no longer needed. llvm-svn: 128886	2011-04-05 15:18:18 +00:00
Jakob Stoklund Olesen	d93b0e3ced	Stop precomputing last split points, query the SplitAnalysis cache on demand. llvm-svn: 128875	2011-04-05 04:20:29 +00:00
Jakob Stoklund Olesen	50b2db8a02	Cache the fairly expensive last split point computation and provide a fast inlined path for the common case. Most basic blocks don't contain a call that may throw, so the last split point os simply the first terminator. llvm-svn: 128874	2011-04-05 04:20:27 +00:00
Jakob Stoklund Olesen	8933907b51	Stop caching basic block index ranges now that SlotIndexes can keep up. llvm-svn: 128821	2011-04-04 15:32:15 +00:00
Jakob Stoklund Olesen	956ae3da41	Delete leftover data members. llvm-svn: 128820	2011-04-04 15:32:11 +00:00
Jakob Stoklund Olesen	315b42c354	Rewrite instructions as part of ConnectedVNInfoEqClasses::Distribute. llvm-svn: 127779	2011-03-17 00:23:45 +00:00
Jakob Stoklund Olesen	ea5ebfed15	Delete dead code after rematerializing. LiveRangeEdit::eliminateDeadDefs() will eventually be used by coalescing, splitting, and spilling for dead code elimination. It can delete chains of dead instructions as long as there are no dependency loops. llvm-svn: 127287	2011-03-08 22:46:11 +00:00
Jakob Stoklund Olesen	27e0a4ab86	Work around a coalescer bug. The coalescer can in very rare cases leave too large live intervals around after rematerializing cheap-as-a-move instructions. Linear scan doesn't really care, but live range splitting gets very confused when a live range is killed by a ghost instruction. I will fix this properly in the coalescer after 2.9 branches. llvm-svn: 127096	2011-03-05 18:33:49 +00:00
Jakob Stoklund Olesen	1a69e23300	Use an IndexedMap instead of a DenseMap for the live-out cache. This speeds up updateSSA() so it only accounts for 5% of the live range splitting time. llvm-svn: 126972	2011-03-04 00:15:36 +00:00
Jakob Stoklund Olesen	9a6382fc81	Cache basic block bounds instead of asking SlotIndexes::getMBBRange all the time. This speeds up the greedy register allocator by 15%. DenseMap is not as fast as one might hope. llvm-svn: 126921	2011-03-03 03:41:29 +00:00
Jakob Stoklund Olesen	c96019886c	Change the SplitEditor interface to a single instance can be shared for multiple splits. llvm-svn: 126912	2011-03-03 01:29:13 +00:00
Jakob Stoklund Olesen	815196ca19	Turn the Edit member into a pointer so it can change dynamically. No functional change. llvm-svn: 126898	2011-03-02 23:31:50 +00:00

1 2 3

110 Commits