llvm-project

Commit Graph

Author	SHA1	Message	Date
Tim Northover	7fdd4857f7	Revert "ReMat: fix overly cavalier attitude to sub-register indices" Very sorry, this was a premature patch that I still need to investigate and finish off (for some reason beyond me at the moment it doesn't actually fix the issue in all cases). This reverts commit r199091. llvm-svn: 199093	2014-01-13 10:49:11 +00:00
Tim Northover	59f8d4b4ee	ReMat: fix overly cavalier attitude to sub-register indices There are two attempted optimisations in reMaterializeTrivialDef, trying to avoid promoting the size of a register too much when rematerializing. Unfortunately, both appear to be flawed. First, we see if the original register would have worked, but this is inadequate. Consider: v1 = SOMETHING (v1 is QQ) v2:Q0 = COPY v1:Q1 (v1, v2 are QQ) ... uses of v2 In this case even though v2 could be used directly as the output of SOMETHING, this would set the wrong bits of the QQ register involved. The correct rematerialization must be: v2:Q0_Q1 = SOMETHING (v2 promoted to QQQ) ... uses of v2:Q1_Q2 For the second optimisation, if the correct remat is "v2:idx = SOMETHING" then we can't necessarily expect v2 itself to be valid for SOMETHING, but we do try to hunt for a class between v1 and v2 that works. Unfortunately, this is also wrong: v1 = SOMETHING (v1 is QQ) v2:Q0_Q1 = COPY v1 (v1 is QQ, v2 is QQQ) ... uses of v2 as a QQQ The canonical rematerialization here is "v2:Q0_Q1 = SOMETHING". However current logic would decide that v2 could be a QQ (no interest is taken in later uses). This patch, therefore, always accepts the widened register class without trying to be clever. Generally there is no penalty to this (e.g. in the common GR32 < GR64 case, expanding the width doesn't matter because it's not like you were going to do anything else with the high bits of a GR32 register). It can increase register pressure in cases like the ARM VFP regs though (multiple non-overlapping but equivalent subregisters). Hopefully this situation is rare enough that it won't matter. Unfortunately, no in-tree targets actually expose this as far as I can tell (there are so few isAsCheapAsAMove instructions for it to trigger on) so I've been unable to produce a test. It was exposed in our ARM64 SPEC tests though, and I will be adding a test there that we should be able to contribute soon(TM). llvm-svn: 199091	2014-01-13 10:47:01 +00:00
Matthias Braun	f6fe6bfffe	Print register in LiveInterval::print() llvm-svn: 192398	2013-10-10 21:29:05 +00:00
Matthias Braun	34e1be9451	Represent RegUnit liveness with LiveRange instance Previously LiveInterval has been used, but having a spill weight and register number is unnecessary for a register unit. llvm-svn: 192397	2013-10-10 21:29:02 +00:00
Matthias Braun	2d5c32b3b5	Work on LiveRange instead of LiveInterval where possible Also change some pointer arguments to references at some places where 0-pointers are not allowed. llvm-svn: 192396	2013-10-10 21:28:57 +00:00
Matthias Braun	88dd0abd2d	Pass LiveQueryResult by value This makes the API a bit more natural to use and makes it easier to make LiveRanges implementation details private. llvm-svn: 192394	2013-10-10 21:28:52 +00:00
Matthias Braun	13ddb7cd65	Rename LiveRange to LiveInterval::Segment The Segment struct contains a single interval; multiple instances of this struct are used to construct a live range, but the struct is not a live range by itself. llvm-svn: 192392	2013-10-10 21:28:43 +00:00
Matthias Braun	caff764739	Fix comment llvm-svn: 191966	2013-10-04 16:53:02 +00:00
Andrew Trick	71e8bb6d1d	Added temp flag -misched-bench for staging in default changes. llvm-svn: 191423	2013-09-26 05:53:35 +00:00
Benjamin Kramer	8817cca5ce	Provide basic type safety for array_pod_sort comparators. This makes using array_pod_sort significantly safer. The implementation relies on function pointer casting but that should be safe as we're dealing with void* here. llvm-svn: 191175	2013-09-22 14:09:50 +00:00
Matthias Braun	305ef7f5b0	avoid unnecessary direct access to LiveInterval::ranges llvm-svn: 190170	2013-09-06 16:44:32 +00:00
Matthias Braun	90e0d3c03a	remove unused argument from LiveRanges::join() llvm-svn: 190169	2013-09-06 16:44:29 +00:00
Matthias Braun	c0ad7bfa62	remove pointless assert The if above it ensures the property anyway. llvm-svn: 190168	2013-09-06 16:44:27 +00:00
Matthias Braun	b348d9703c	fix comment There's no 'B3' in the example. llvm-svn: 190167	2013-09-06 16:44:25 +00:00
Mark Lacey	f9ea88546f	Track new virtual registers by register number. Track new virtual registers by register number, rather than by the live interval created for them. This is the first step in separating the creation of new virtual registers and new live intervals. Eventually live intervals will be created and populated on demand after the virtual registers have been created and used in instructions. llvm-svn: 188434	2013-08-14 23:50:04 +00:00
Jakob Stoklund Olesen	e6abacfb8b	Use modern API to avoid exposing LiveInterval internals. No functional change intended. llvm-svn: 185733	2013-07-05 23:48:07 +00:00
Andrew Trick	714aec021d	Fix a -join-globalcopies bug; handle undef operands. llvm-svn: 184569	2013-06-21 18:33:11 +00:00
Andrew Trick	75961ecc1a	Modify the -join-globalcopies option (off by default). Always coalesce in forward order to propagate rematerialization. I'm fixing this option so I can enable it by default soon. llvm-svn: 184568	2013-06-21 18:33:09 +00:00
Andrew Trick	3a851a27b8	Make rematerialization in the coalescer less sensitive to LRG order. llvm-svn: 184567	2013-06-21 18:33:06 +00:00
Tim Northover	059cead5ed	Mark rematerialized super/sub registers as dead. When we're rematerializing into a not-quite-right register we already add the real definition as an imp-def, but we should also be marking the "official" register as dead, since nothing else is going to use it as a result of this remat. Not doing this can affect pressure tracking. rdar://problem/14158833 llvm-svn: 184002	2013-06-14 20:22:21 +00:00
Tim Northover	69cd121dd9	Fix rematerialization into physical registers. r182872 introduced a bug in how the register-coalescer's rematerialization handled defining a physical register. It relied on the output of the coalescer's setRegisters method to determine whether the replacement instruction needed an implicit-def. However, this value isn't necessarily the same as the CopyMI's actual destination register which is what the rest of the basic-block expects us to be defining. The commit changes the rematerializer to use the actual register attached to CopyMI in its decision. This will be tested soon by an X86 patch which moves everything to using MOV32r0 instead of other sizes. llvm-svn: 182925	2013-05-30 12:30:50 +00:00
Tim Northover	b65f6b0820	Teach ReMaterialization to be more cunning about subregisters This allows rematerialization during register coalescing to handle more cases involving operations like SUBREG_TO_REG which might need to be rematerialized using sub-register indices. For example, code like: v1(GPR64):sub_32 = MOVZ something v2(GPR64) = COPY v1(GPR64) should be convertable to: v2(GPR64):sub_32 = MOVZ something but previously we just gave up in places like this llvm-svn: 182872	2013-05-29 19:32:06 +00:00
Bill Wendling	a69d0aaa71	Remove unused #includes. llvm-svn: 176467	2013-03-05 01:00:45 +00:00
Cameron Zwarich	8f55064a06	RegisterCoalescer::reMaterializeTrivialDef() can constrain the destination register class to match the defining instruction. llvm-svn: 175130	2013-02-14 03:25:24 +00:00
Cameron Zwarich	48ab445621	Fix RegisterCoalescer::rematerializeTrivialDef() so that it works on flipped CoalescerPairs. Also, make it take a CoalescerPair directly like other methods of RegisterCoalescer. llvm-svn: 175123	2013-02-14 02:51:05 +00:00
Cameron Zwarich	1195e819bb	Fix some issues with rematerialization in RegisterCoalescer when the destination of the copy is a subregister def. The current code assumes that it can do a full def of the destination register, but it is not checking that the def operand is read-undef. It also doesn't clear the subregister index of the destination in the new instruction to reflect the full subregister def. These issues were found running 'make check' with my next commit that enables rematerialization in more cases. llvm-svn: 175122	2013-02-14 02:51:03 +00:00
Manman Ren	f019cd62da	Debug Info: LiveDebugVarible can remove DBG_VALUEs, make sure we emit them back. RegisterCoalescer used to depend on LiveDebugVariable. LDV removes DBG_VALUEs without emitting them at the end. We fix this by removing LDV from RegisterCoalescer. Also add an assertion to make sure we call emitDebugValues if DBG_VALUEs are removed at runOnMachineFunction. rdar://problem/13183203 Reviewed by Andy & Jakob llvm-svn: 175023	2013-02-13 01:14:49 +00:00
Jakob Stoklund Olesen	725d57682b	Fix PR14732 by handling all kinds of IMPLICIT_DEF live ranges. Most IMPLICIT_DEF instructions are removed by the ProcessImplicitDefs pass, and a few are reinserted by PHIElimination when a PHI argument is <undef>. RegisterCoalescer was assuming that all IMPLICIT_DEF live ranges look like those created by PHIElimination, and that their live range never leaves the basic block. The PR14732 test case does tricks with PHI nodes that causes a longer IMPLICIT_DEF live range to appear. This happens very rarely, but RegisterCoalescer should be able to handle it. llvm-svn: 171435	2013-01-03 00:47:51 +00:00
Chandler Carruth	9fb823bbd4	Move all of the header files which are involved in modelling the LLVM IR into their new header subdirectory: include/llvm/IR. This matches the directory structure of lib, and begins to correct a long standing point of file layout clutter in LLVM. There are still more header files to move here, but I wanted to handle them in separate commits to make tracking what files make sense at each layer easier. The only really questionable files here are the target intrinsic tablegen files. But that's a battle I'd rather not fight today. I've updated both CMake and Makefile build systems (I think, and my tests think, but I may have missed something). I've also re-sorted the includes throughout the project. I'll be committing updates to Clang, DragonEgg, and Polly momentarily. llvm-svn: 171366	2013-01-02 11:36:10 +00:00
Chandler Carruth	ed0881b2a6	Use the new script to sort the includes of every file under lib. Sooooo many of these had incorrect or strange main module includes. I have manually inspected all of these, and fixed the main module include to be the nearest plausible thing I could find. If you own or care about any of these source files, I encourage you to take some time and check that these edits were sensible. I can't have broken anything (I strictly added headers, and reordered them, never removed), but they may not be the headers you'd really like to identify as containing the API being implemented. Many forward declarations and missing includes were added to a header files to allow them to parse cleanly when included first. The main module rule does in fact have its merits. =] llvm-svn: 169131	2012-12-03 16:50:05 +00:00
Jakob Stoklund Olesen	546e9e85f1	Avoid rewriting instructions twice. This could cause miscompilations in targets where sub-register composition is not always idempotent (ARM). <rdar://problem/12758887> llvm-svn: 168837	2012-11-29 00:26:11 +00:00
Jakob Stoklund Olesen	26c9d70d28	Make the LiveRegMatrix analysis available to targets. No functional change, just moved header files. Targets can inject custom passes between register allocation and rewriting. This makes it possible to tweak the register allocation before rewriting, using the full global interference checking available from LiveRegMatrix. llvm-svn: 168806	2012-11-28 19:13:06 +00:00
Jakub Staszak	38e2f52e85	Remove duplicated #includes. llvm-svn: 168712	2012-11-27 18:27:14 +00:00
Andrew Trick	9d0a1ae946	Use array_pod_sort instead of std::sort. llvm-svn: 168203	2012-11-16 21:33:38 +00:00
Andrew Trick	449eb3f3be	Fix an obvious merge bug in -join-globalcopies (disabled). Jakub Staszak spotted this in review. I don't notice these things until I manually rerun benchmarks. But reducing unit tests is a very high priority. llvm-svn: 168021	2012-11-15 02:32:22 +00:00
Jakub Staszak	ab0139cb90	Use reserve() to avoid vector reallocation. llvm-svn: 167991	2012-11-14 22:42:17 +00:00
Jakub Staszak	542db4a0bc	canJoinPhys method doesn't modify CoalescerPair. Make it const. llvm-svn: 167972	2012-11-14 20:31:04 +00:00
Andrew Trick	459d891a43	Revert -join-splitedges to a boolean cmd line option. llvm-svn: 167880	2012-11-13 22:19:48 +00:00
Andrew Trick	47d58ce0df	The MachineScheduler does not currently require JoinSplitEdges. This option will eventually either be enabled unconditionally or replaced by a more general live range splitting optimization. llvm-svn: 167879	2012-11-13 22:15:40 +00:00
Andrew Trick	449c7ad7d7	Fix -join-splitedges: my previous "cleanup" broke it. Working on reducing unit tests. This won't be enabled unless a subtarget enables misched. llvm-svn: 167851	2012-11-13 17:37:46 +00:00
Andrew Trick	108c88c5b7	misched: Allow subtargets to enable misched and dependent options. This allows me to begin enabling (or backing out) misched by default for one subtarget at a time. To run misched we typically want to: - Disable SelectionDAG scheduling (use the source order scheduler) - Enable more aggressive coalescing (until we decide to always run the coalescer this way) - Enable MachineScheduler pass itself. Disabling PostRA sched may follow for some subtargets. llvm-svn: 167826	2012-11-13 08:47:29 +00:00
Andrew Trick	40534fe9a5	Added RegisterCoalescer support for joining global copies first. This adds the -join-globalcopies option which can be enabled by default once misched is also enabled. Ideally, the register coalescer would be able to split local live ranges in a way that produces copies that can be easily resolved by the scheduler. Until then, this heuristic should be good enough to at least allow the scheduler to run after coalescing. llvm-svn: 167825	2012-11-13 08:47:25 +00:00
Andrew Trick	edac22a9f3	Cleanup the main RegisterCoalescer loop. Block priorities still apply outside loops. llvm-svn: 167793	2012-11-13 00:34:44 +00:00
Andrew Trick	c25d3fe71e	Cleanup -join-splitedges. Make the loop more obvious. llvm-svn: 167785	2012-11-12 23:59:48 +00:00
Andrew Trick	22d688a29c	Added a temporary option to avoid critical edges splitting. This teaches the register coalescer to be less prone to split critical edges. I am currently benchmarking this with the new (post-coalescer) scheduler. I plan to enable this by default and remove the option as soon as misched is enabled. llvm-svn: 167758	2012-11-12 21:42:40 +00:00
Jakob Stoklund Olesen	9892a4b794	Exploit the new identity composition in composeSubRegIndices(). The static compose() function in RegisterCoalescer was doing the exact same thing. llvm-svn: 167198	2012-11-01 01:15:43 +00:00
Jakob Stoklund Olesen	9a06696a77	Completely disallow partial copies in adjustCopiesBackFrom(). Partial copies can show up even when CoalescerPair.isPartial() returns false. For example: %vreg24:dsub_0<def> = COPY %vreg31:dsub_0; QPR:%vreg24,%vreg31 Such a partial-partial copy is not good enough for the transformation adjustCopiesBackFrom() needs to do. llvm-svn: 166944	2012-10-29 17:51:52 +00:00
Jakob Stoklund Olesen	57143f7e78	Never attempt to join an early-clobber def with a regular kill. This fixes PR14194. llvm-svn: 166880	2012-10-27 17:41:27 +00:00
Jakob Stoklund Olesen	fd4ced2c52	Don't crash when the Assignments vector is empty. Reported by Vincent Lejeune using an out-of-tree target. llvm-svn: 166398	2012-10-21 19:05:03 +00:00
Jakob Stoklund Olesen	2043329e67	Revert r166046 "Switch back to the old coalescer for now to fix the 32 bit bit" A fix for PR14098, including the test case is in the next commit. llvm-svn: 166067	2012-10-16 22:51:55 +00:00

185 Commits