llvm-project

Commit Graph

Author	SHA1	Message	Date
Andrew Trick	40534fe9a5	Added RegisterCoalescer support for joining global copies first. This adds the -join-globalcopies option which can be enabled by default once misched is also enabled. Ideally, the register coalescer would be able to split local live ranges in a way that produces copies that can be easily resolved by the scheduler. Until then, this heuristic should be good enough to at least allow the scheduler to run after coalescing. llvm-svn: 167825	2012-11-13 08:47:25 +00:00
Andrew Trick	4b1f9e3bac	misched: Don't consider artificial edges weak edges. For now be more conservative in case other out-of-tree schedulers rely on the old behavior of artificial edges. llvm-svn: 167808	2012-11-13 02:35:06 +00:00
Bill Wendling	f454dfb6b5	Use the 'count' attribute instead of the 'upper_bound' attribute. If we have a type 'int a[1]' and a type 'int b[0]', the generated DWARF is the same for both of them because we use the 'upper_bound' attribute. Instead use the 'count' attrbute, which gives the correct number of elements in the array. <rdar://problem/12566646> llvm-svn: 167806	2012-11-13 02:31:47 +00:00
Andrew Trick	edac22a9f3	Cleanup the main RegisterCoalescer loop. Block priorities still apply outside loops. llvm-svn: 167793	2012-11-13 00:34:44 +00:00
Andrew Trick	c25d3fe71e	Cleanup -join-splitedges. Make the loop more obvious. llvm-svn: 167785	2012-11-12 23:59:48 +00:00
Eric Christopher	2942431175	Add an option to enable prototype "fission" capabilities and debug changes. llvm-svn: 167765	2012-11-12 22:22:20 +00:00
Andrew Trick	22d688a29c	Added a temporary option to avoid critical edges splitting. This teaches the register coalescer to be less prone to split critical edges. I am currently benchmarking this with the new (post-coalescer) scheduler. I plan to enable this by default and remove the option as soon as misched is enabled. llvm-svn: 167758	2012-11-12 21:42:40 +00:00
Andrew Trick	ec369d5316	misched: rename interfaceto avoid gcc warnings llvm-svn: 167753	2012-11-12 21:28:10 +00:00
Andrew Trick	263280248a	misched: Target-independent support for MacroFusion. Uses the infrastructure from r167742 to support clustering instructure that the target processor can "fuse". e.g. cmp+jmp. Next step: target hook implementations with test cases, and enable. llvm-svn: 167744	2012-11-12 19:52:20 +00:00
Andrew Trick	a7714a0ff9	misched: Target-independent support for load/store clustering. This infrastructure is generally useful for any target that wants to strongly prefer two instructions to be adjacent after scheduling. A following checkin will add target-specific hooks with unit tests. Then this feature will be enabled by default with misched. llvm-svn: 167742	2012-11-12 19:40:10 +00:00
Andrew Trick	f1ff84c64e	misched: Infrastructure for weak DAG edges. This adds support for weak DAG edges to the general scheduling infrastructure in preparation for MachineScheduler support for heuristics based on weak edges. llvm-svn: 167738	2012-11-12 19:28:57 +00:00
Jakob Stoklund Olesen	13d5562963	Fix assertions in updateRegMaskSlots(). The RegMaskSlots contains 'r' slots while NewIdx and OldIdx are 'B' slots. This broke the checks in the assertions. This fixes PR14302. llvm-svn: 167625	2012-11-09 19:18:49 +00:00
Benjamin Kramer	c280f41864	Silence GCC warning about falling off the end of a non-void function. llvm-svn: 167618	2012-11-09 15:45:22 +00:00
Andrew Trick	3ca33acb95	misched: Heuristics based on the machine model. misched is disabled by default. With -enable-misched, these heuristics balance the schedule to simultaneously avoid saturating processor resources, expose ILP, and minimize register pressure. I've been analyzing the performance of these heuristics on everything in the llvm test suite in addition to a few other benchmarks. I would like each heuristic check to be verified by a unit test, but I'm still trying to figure out the best way to do that. The heuristics are still in considerable flux, but as they are refined we should be rigorous about unit testing the improvements. llvm-svn: 167527	2012-11-07 07:05:09 +00:00
Andrew Trick	e145559b70	misched: handle on-the-fly regpressure queries better for 2-addr instructions without relying on liveintervals. llvm-svn: 167526	2012-11-07 07:05:05 +00:00
Bill Wendling	f720bf64d4	Add comment describing what's going on here. llvm-svn: 167525	2012-11-07 05:19:04 +00:00
Bill Wendling	d9bb9b611b	When we're updating the subprogram scope DIE, we want to determine if we're updating an abstract DIE or not. If we are, then we use that. Its children will be added on later, as well as the object pointer attribute. Otherwise, this function may be called with a concrete DIE twice and adding the children and object pointer attribute to it twice. <rdar://problem/12401423&12600340> llvm-svn: 167524	2012-11-07 04:42:18 +00:00
Chad Rosier	8d2c229006	[regallocfast] Make sure the MachineRegisterInfo is aware of clobbers from a register masks. This is an obvious and necessary fix for a soon to be committed patch. No test case possible at this time. Reviewed by Jakob. llvm-svn: 167498	2012-11-06 22:52:42 +00:00
Andrew Trick	e96390ea96	misched: TargetSchedule interface for machine resources. Expose the processor resources defined by the machine model to the scheduler and other clients through the TargetSchedule interface. Normalize each resource count with respect to other kinds of resources. This allows scheduling heuristics to balance resources against other kinds of resources and latency. llvm-svn: 167444	2012-11-06 07:10:38 +00:00
Andrew Trick	4d1fa712ac	misched: Rename RemainingCount to avoid confusion with remaining resources. llvm-svn: 167443	2012-11-06 07:10:34 +00:00
Andrew Trick	baeaabb2d0	ScheduleDAG interface. Added OrderKind to distinguish nonregister dependencies. This is in preparation for adding "weak" DAG edges, but generally simplifies the design. llvm-svn: 167435	2012-11-06 03:13:46 +00:00
Owen Anderson	15fd6ac4ba	Be careful not to optimize a SELECT_CC into a SETCC post-legalization if the SETCC node would be illegal. llvm-svn: 167344	2012-11-03 00:17:26 +00:00
Manman Ren	3d5af279b1	OutputArg: added an index of the original argument to match the change to InputArg in r165616. This will enable us to get the actual type for both InputArg and OutputArg. rdar://9932559 llvm-svn: 167265	2012-11-01 23:49:58 +00:00
Chandler Carruth	5da3f0512e	Revert the majority of the next patch in the address space series: r165941: Resubmit the changes to llvm core to update the functions to support different pointer sizes on a per address space basis. Despite this commit log, this change primarily changed stuff outside of VMCore, and those changes do not carry any tests for correctness (or even plausibility), and we have consistently found questionable or flat out incorrect cases in these changes. Most of them are probably correct, but we need to devise a system that makes it more clear when we have handled the address space concerns correctly, and ideally each pass that gets updated would receive an accompanying test case that exercises that pass specificaly w.r.t. alternate address spaces. However, from this commit, I have retained the new C API entry points. Those were an orthogonal change that probably should have been split apart, but they seem entirely good. In several places the changes were very obvious cleanups with no actual multiple address space code added; these I have not reverted when I spotted them. In a few other places there were merge conflicts due to a cleaner solution being implemented later, often not using address spaces at all. In those cases, I've preserved the new code which isn't address space dependent. This is part of my ongoing effort to clean out the partial address space code which carries high risk and low test coverage, and not likely to be finished before the 3.2 release looms closer. Duncan and I would both like to see the above issues addressed before we return to these changes. llvm-svn: 167222	2012-11-01 09:14:31 +00:00
Chandler Carruth	7ec5085e01	Revert the series of commits starting with r166578 which introduced the getIntPtrType support for multiple address spaces via a pointer type, and also introduced a crasher bug in the constant folder reported in PR14233. These commits also contained several problems that should really be addressed before they are re-committed. I have avoided reverting various cleanups to the DataLayout APIs that are reasonable to have moving forward in order to reduce the amount of churn, and minimize the number of commits that were reverted. I've also manually updated merge conflicts and manually arranged for the getIntPtrType function to stay in DataLayout and to be defined in a plausible way after this revert. Thanks to Duncan for working through this exact strategy with me, and Nick Lewycky for tracking down the really annoying crasher this triggered. (Test case to follow in its own commit.) After discussing with Duncan extensively, and based on a note from Micah, I'm going to continue to back out some more of the more problematic patches in this series in order to ensure we go into the LLVM 3.2 branch with a reasonable story here. I'll send a note to llvmdev explaining what's going on and why. Summary of reverted revisions: r166634: Fix a compiler warning with an unused variable. r166607: Add some cleanup to the DataLayout changes requested by Chandler. r166596: Revert "Back out r166591, not sure why this made it through since I cancelled the command. Bleh, sorry about this! r166591: Delete a directory that wasn't supposed to be checked in yet. r166578: Add in support for getIntPtrType to get the pointer type based on the address space. llvm-svn: 167221	2012-11-01 08:07:29 +00:00
Owen Anderson	b351c8d692	Add a few more simple fast-math constant propagations and cancellations. llvm-svn: 167200	2012-11-01 02:00:53 +00:00
Jakob Stoklund Olesen	9892a4b794	Exploit the new identity composition in composeSubRegIndices(). The static compose() function in RegisterCoalescer was doing the exact same thing. llvm-svn: 167198	2012-11-01 01:15:43 +00:00
Benjamin Kramer	1559127f6f	Replace some instances of UniqueVector with SetVector, which is slightly cheaper. No functionality change. llvm-svn: 167116	2012-10-31 13:45:49 +00:00
Akira Hatanaka	d837be780d	Change signature of function RAFast::spillAll to avoid conversion between type MachineInstr* and MachineBasicBlock::iterator. llvm-svn: 167088	2012-10-31 00:56:01 +00:00
Akira Hatanaka	ebb31e9c42	Check that iterator I is not the end iterator. llvm-svn: 167086	2012-10-31 00:50:52 +00:00
Chad Rosier	909f6a035f	[inline asm] Get the mayLoad/mayStore directly from the MIOp_ExtraInfo operand. llvm-svn: 167050	2012-10-30 20:39:19 +00:00
Chad Rosier	86f6050c54	Add a comment for r167040. llvm-svn: 167046	2012-10-30 20:01:12 +00:00
Chad Rosier	9e1274fb48	[inline asm] Implement mayLoad and mayStore for inline assembly. In general, the MachineInstr MayLoad/MayLoad flags are based on the tablegen implementation. For inline assembly, however, we need to compute these based on the constraints. Revert r166929 as this is no longer needed, but leave the test case in place. rdar://12033048 and PR13504 llvm-svn: 167040	2012-10-30 19:11:54 +00:00
Bill Wendling	10e0e2ec49	Fix grammar. llvm-svn: 167029	2012-10-30 17:51:02 +00:00
Ulrich Weigand	3abb34389d	In various places throughout the code generator, there were special checks to avoid performing compile-time arithmetic on PPCDoubleDouble. Now that APFloat supports arithmetic on PPCDoubleDouble, those checks are no longer needed, and we can treat the type like any other. llvm-svn: 166958	2012-10-29 18:35:49 +00:00
Jakob Stoklund Olesen	9a06696a77	Completely disallow partial copies in adjustCopiesBackFrom(). Partial copies can show up even when CoalescerPair.isPartial() returns false. For example: %vreg24:dsub_0<def> = COPY %vreg31:dsub_0; QPR:%vreg24,%vreg31 Such a partial-partial copy is not good enough for the transformation adjustCopiesBackFrom() needs to do. llvm-svn: 166944	2012-10-29 17:51:52 +00:00
Duncan Sands	5bdd9dda48	Remove a wrapper around getIntPtrType added to GVN by Hal in commit 166624 (the wrapper returns a vector of integers when passed a vector of pointers) by having getIntPtrType itself return a vector of integers in this case. Outside of this wrapper, I didn't find anywhere in the codebase that was relying on the old behaviour for vectors of pointers, so give this a whirl through the buildbots. llvm-svn: 166939	2012-10-29 17:31:46 +00:00
Preston Gurd	52dacca977	This patch addresses a problem with the Post RA scheduler generating an incorrect instruction sequence due to it not being aware that an inline assembly instruction may reference memory. This patch fixes the problem by causing the scheduler to always assume that any inline assembly code instruction could access memory. This is necessary because the internal representation of the inline instruction does not include any information about memory accesses. This should fix PR13504. llvm-svn: 166929	2012-10-29 15:01:23 +00:00
Lang Hames	ee6142c36b	Remove unused typedef. llvm-svn: 166910	2012-10-29 04:57:52 +00:00
Jakob Stoklund Olesen	57143f7e78	Never attempt to join an early-clobber def with a regular kill. This fixes PR14194. llvm-svn: 166880	2012-10-27 17:41:27 +00:00
Jakob Stoklund Olesen	1dfe4fc60c	Reduce indentation with early exit. No functional change. llvm-svn: 166829	2012-10-26 23:05:13 +00:00
Jakob Stoklund Olesen	7fa17d4bc8	Also make the current basic block a class member. Don't pass it around everywhere as a function argument. llvm-svn: 166828	2012-10-26 23:05:10 +00:00
Jakob Stoklund Olesen	d788e32bf5	Make the Processed set a class member. Don't pass it everywhere as an argument. llvm-svn: 166820	2012-10-26 22:06:00 +00:00
Jakob Stoklund Olesen	112a44d9af	Fix whitespace and function names to be coding standardy. No functional change. llvm-svn: 166814	2012-10-26 21:12:49 +00:00
Jakob Stoklund Olesen	09d69f5b0f	Remove the canCombineSubRegIndices() target hook. The new coalescer can already do all of this, so there is no need to duplicate the efforts. llvm-svn: 166813	2012-10-26 20:38:19 +00:00
Akira Hatanaka	6fe7acab9d	Make sure I is not the end iterator when isInsideBundle is called. llvm-svn: 166784	2012-10-26 17:11:42 +00:00
Nicolas Geoffray	457b356f3a	Remove GC roots that reference dead objects. llvm-svn: 166763	2012-10-26 09:15:55 +00:00
Nick Lewycky	1a32954279	Fix typo in comment. llvm-svn: 166750	2012-10-26 04:27:49 +00:00
Jakob Stoklund Olesen	9004798da8	Stop running the machine code verifier unconditionally. llvm-svn: 166646	2012-10-25 00:05:39 +00:00
Micah Villmow	bf3eeb2dfc	Add some cleanup to the DataLayout changes requested by Chandler. llvm-svn: 166607	2012-10-24 18:36:13 +00:00
Micah Villmow	51e7246cb4	Back out r166591, not sure why this made it through since I cancelled the command. Bleh, sorry about this! llvm-svn: 166596	2012-10-24 17:25:11 +00:00
Micah Villmow	6a8f3f9e20	Delete a directory that wasn't supposed to be checked in yet. llvm-svn: 166591	2012-10-24 17:20:04 +00:00
Micah Villmow	12d9127833	Add in support for getIntPtrType to get the pointer type based on the address space. This checkin also adds in some tests that utilize these paths and updates some of the clients. llvm-svn: 166578	2012-10-24 15:52:52 +00:00
Michael Liao	5922979e49	Teach DAG combine to fold (buildvec (Xint2fp x)) to (Xint2fp (buildvec x)) - If more than 1 elemennts are defined and target supports the vectorized conversion, use the vectorized one instead to reduce the strength on conversion operation. llvm-svn: 166546	2012-10-24 04:14:18 +00:00
Jakub Staszak	a6addc2741	Keep coding standard. Don't evaluate getNumOperands() every time. llvm-svn: 166531	2012-10-24 00:38:25 +00:00
Michael Liao	6d106b7bfd	Clean up code and put transformation on (build_vec (ext x)) into a helper func llvm-svn: 166519	2012-10-23 23:06:52 +00:00
Nadav Rotem	33e034a4b3	Make the indirect branch optimization deterministic. No functionality change. Patch by Daniel Reynaud. llvm-svn: 166501	2012-10-23 21:05:33 +00:00
Richard Smith	6289a4e85e	Per the C++ standard, we need to include the definition of llvm::Calculate in every TU where it's implicitly instantiated, even if there's an implicit instantiation for the same types available in another TU. llvm-svn: 166470	2012-10-23 06:19:46 +00:00
Jakob Stoklund Olesen	fd4ced2c52	Don't crash when the Assignments vector is empty. Reported by Vincent Lejeune using an out-of-tree target. llvm-svn: 166398	2012-10-21 19:05:03 +00:00
Benjamin Kramer	a74129adad	Symbol hygiene: Make sure declarations and definitions match, make helper functions static. llvm-svn: 166376	2012-10-20 12:53:26 +00:00
Shuxin Yang	1479fcdef1	1. Remove noreturn attribute from __builtin_debugtrap(). (The change at Clang side was committed in r166345) 2. Cosmetic change in order to conform to coding standards. llvm-svn: 166350	2012-10-19 23:00:20 +00:00
Nadav Rotem	4dc976fbcb	revert r166264 because the LTO build is still failing llvm-svn: 166340	2012-10-19 21:28:43 +00:00
Shuxin Yang	cdde059a34	This patch is to fix radar://8426430. It is about llvm support of __builtin_debugtrap() which is supposed to consistently raise SIGTRAP across all systems. In contrast, __builtin_trap() behave differently on different systems. e.g. it raises SIGTRAP on ARM, and SIGILL on X86. The purpose of __builtin_debugtrap() is to consistently provide "trap" functionality, in the mean time preserve the compatibility with on gcc on __builtin_trap(). The X86 backend is already able to handle debugtrap(). This patch is to: 1) make front-end recognize "__builtin_debugtrap()" (emboddied in the one-line change to Clang). 2) In DAG legalization phase, by default, "debugtrap" will be replaced with "trap", which make the __builtin_debugtrap() "available" to all existing ports without the hassle of changing their code. 3) If trap-function is specified (via -trap-func=xyz to llc), both __builtin_debugtrap() and __builtin_trap() will be expanded into the function call of the specified trap function. This behavior may need change in the future. The provided testing-case is to make sure 2) and 3) are working for ARM port, and we already have a testing case for x86. llvm-svn: 166300	2012-10-19 20:11:16 +00:00
Nadav Rotem	4985ddc5e0	recommit the patch that makes LSR and LowerInvoke use the TargetTransform interface. llvm-svn: 166264	2012-10-19 04:27:49 +00:00
Michael Liao	2c2358036d	Simplify condition checking as CONCAT assume all inputs of the same type. llvm-svn: 166260	2012-10-19 03:17:00 +00:00
Sebastian Pop	127777d686	Clear unknown mem ops when merging stack slots (pr14090) When merging stack slots, if StackColoring::remapInstructions gets a value back from GetUnderlyingObject that it does not know about or is not itself a stack slot, clear the memory operand in case it aliases the merged slot. This prevents the introduction of incorrect aliasing information. Author: Matthew Curtis <mcurtis@codeaurora.org> llvm-svn: 166216	2012-10-18 19:53:48 +00:00
Sebastian Pop	fdd94d4955	Change MachineFrameInfo::StackObject::Alloca from Value* to AllocaInst* This more accurately reflects what is actually being stored in the field. No functionality change intended. Author: Matthew Curtis <mcurtis@codeaurora.org> llvm-svn: 166215	2012-10-18 19:53:45 +00:00
Nadav Rotem	d5f8859672	In SimplifySelectOps we pulled two loads through a select node despite the fact that one was dependent on the other. rdar://12513091 llvm-svn: 166196	2012-10-18 18:06:48 +00:00
Bob Wilson	d6d9ccca38	Temporarily revert the TargetTransform changes. The TargetTransform changes are breaking LTO bootstraps of clang. I am working with Nadav to figure out the problem, but I am reverting it for now to get our buildbots working. This reverts svn commits: 165665 165669 165670 165786 165787 165997 and I have also reverted clang svn 165741 llvm-svn: 166168	2012-10-18 05:43:52 +00:00
Michael Liao	3ac8201ea4	Revert part of r166049 back and enable test case in r166125. - Folding (trunc (concat ... X )) to (concat ... (trunc X) ...) is valid when '...' are all 'undef's. - r166125 relies on this transformation. llvm-svn: 166155	2012-10-17 23:45:54 +00:00
Michael Liao	c87d98dbc8	Revert r166049 - In general, it's unsafe for this transformation. llvm-svn: 166135	2012-10-17 22:41:15 +00:00
Michael Liao	7a442c8031	Teach DAG combine to fold (extract_subvec (concat v1, ..) i) to v_i - If the extracted vector has the same type of all vectored being concatenated together, it should be simplified directly into v_i, where i is the index of the element being extracted. llvm-svn: 166125	2012-10-17 20:48:33 +00:00
Jakob Stoklund Olesen	7a9f0c09de	Switch MRI::UsedPhysRegs to a register unit bit vector. This is a more compact, less redundant representation, and it avoids scanning long lists of aliases for ARM D-registers, for example. llvm-svn: 166124	2012-10-17 20:26:33 +00:00
Evan Cheng	839fb650b2	Add a really faster pre-RA scheduler (-pre-RA-sched=linearize). It doesn't use any scheduling heuristics nor does it build up any scheduling data structure that other heuristics use. It essentially linearize by doing a DFA walk but it does handle glues correctly. IMPORTANT: it probably can't handle all the physical register dependencies so it's not suitable for x86. It also doesn't deal with dbg_value nodes right now so it's definitely is still WIP. rdar://12474515 llvm-svn: 166122	2012-10-17 19:39:36 +00:00
Jakob Stoklund Olesen	0736442683	Merge MRI::isPhysRegOrOverlapUsed() into isPhysRegUsed(). All callers of these functions really want the isPhysRegOrOverlapUsed() functionality which also checks aliases. For historical reasons, targets without register aliases were calling isPhysRegUsed() instead. Change isPhysRegUsed() to also check aliases, and switch all isPhysRegOrOverlapUsed() callers to isPhysRegUsed(). llvm-svn: 166117	2012-10-17 18:44:18 +00:00
Andrew Trick	0b1d8d04b9	misched: Better handling of invalid latencies in the machine model llvm-svn: 166107	2012-10-17 17:27:10 +00:00
Jakob Stoklund Olesen	a2136be107	Use a SparseSet instead of a BitVector for UsedInInstr in RAFast. This is just as fast, and it makes it possible to avoid leaking the UsedPhysRegs BitVector implementation through MachineRegisterInfo::addPhysRegsUsed(). llvm-svn: 166083	2012-10-17 01:37:59 +00:00
Jakob Stoklund Olesen	4df59a9ff8	Avoid rematerializing a redef immediately after the old def. PR14098 contains an example where we would rematerialize a MOV8ri immediately after the original instruction: %vreg7:sub_8bit<def> = MOV8ri 9; GR32_ABCD:%vreg7 %vreg22:sub_8bit<def> = MOV8ri 9; GR32_ABCD:%vreg7 Besides being pointless, it is also wrong since the original instruction only redefines part of the register, and the value read by the new instruction is wrong. The problem was the LiveRangeEdit::allUsesAvailableAt() didn't special-case OrigIdx == UseIdx and found the wrong SSA value. llvm-svn: 166068	2012-10-16 22:51:58 +00:00
Jakob Stoklund Olesen	2043329e67	Revert r166046 "Switch back to the old coalescer for now to fix the 32 bit bit" A fix for PR14098, including the test case is in the next commit. llvm-svn: 166067	2012-10-16 22:51:55 +00:00
Michael Liao	19006206a1	Teach DAG combine to fold (trunc (fptoXi x)) to (fptoXi x) llvm-svn: 166049	2012-10-16 19:38:35 +00:00
Rafael Espindola	b58be2c593	Switch back to the old coalescer for now to fix the 32 bit bit llvm+clang+compiler-rt bootstrap. llvm-svn: 166046	2012-10-16 19:34:06 +00:00
Stepan Dyatkovskiy	e59a920b0c	Issue: Stack is formed improperly for long structures passed as byval arguments for EABI mode. If we took AAPCS reference, we can found the next statements: A: "If the argument requires double-word alignment (8-byte), the NCRN (Next Core Register Number) is rounded up to the next even register number." (5.5 Parameter Passing, Stage C, C.3). B: "The alignment of an aggregate shall be the alignment of its most-aligned component." (4.3 Composite Types, 4.3.1 Aggregates). So if we have structure with doubles (9 double fields) and 3 Core unused registers (r1, r2, r3): caller should use r2 and r3 registers only. Currently r1,r2,r3 set is used, but it is invalid. Callee VA routine should also use r2 and r3 regs only. All is ok here. This behaviour is guessed by rounding up SP address with ADD+BFC operations. Fix: Main fix is in ARMTargetLowering::HandleByVal. If we detected AAPCS mode and 8 byte alignment, we waste odd registers then. P.S.: I also improved LDRB_POST_IMM regression test. Since ldrb instruction will not generated by current regression test after this patch. llvm-svn: 166018	2012-10-16 07:16:47 +00:00
Andrew Trick	d9d4be0d57	misched: Added handleMove support for updating all kill flags, not just for allocatable regs. This is a medium term workaround until we have a more robust solution in the form of a register liveness utility for postRA passes. llvm-svn: 166001	2012-10-16 00:22:51 +00:00
Jakob Stoklund Olesen	244beb42ce	Remove unused BitVectors from getAllocatableSet(). llvm-svn: 165999	2012-10-16 00:05:06 +00:00
Jakob Stoklund Olesen	f67bf3e0ea	Remove RegisterClassInfo::isReserved() and isAllocatable(). Clients can use the equivalent functions in MRI. llvm-svn: 165990	2012-10-15 22:41:03 +00:00
Jakob Stoklund Olesen	cea596acf7	Remove LIS::isAllocatable() and isReserved() helpers. All callers can simply use the corresponding MRI functions. llvm-svn: 165985	2012-10-15 22:14:34 +00:00
Jakob Stoklund Olesen	c30a9af2d7	Switch most getReservedRegs() clients to the MRI equivalent. Using the cached bit vector in MRI avoids comstantly allocating and recomputing the reserved register bit vector. llvm-svn: 165983	2012-10-15 21:57:41 +00:00
Jakob Stoklund Olesen	57e310613c	Freeze the reserved registers as soon as isel is complete. Also provide an MRI::getReservedRegs() function to access the frozen register set, and isReserved() and isAllocatable() methods to test individual registers. The various implementations of TRI::getReservedRegs() are quite complicated, and many passes need to look at the reserved register set. This patch makes it possible for these passes to use the cached copy in MRI, avoiding a lot of malloc traffic and repeated calculations. llvm-svn: 165982	2012-10-15 21:33:06 +00:00
Bill Wendling	50d27849f6	Move the Attributes::Builder outside of the Attributes class and into its own class named AttrBuilder. No functionality change. llvm-svn: 165960	2012-10-15 20:35:56 +00:00
Rafael Espindola	048405f510	Make sure we iterate over newly created instructions. Fixes pr13625. Testcase to follow in one sec. llvm-svn: 165951	2012-10-15 18:21:07 +00:00
Andrew Trick	90f711da9a	misched: ILP scheduler for experimental heuristics. llvm-svn: 165950	2012-10-15 18:02:27 +00:00
Micah Villmow	4bb926d91d	Resubmit the changes to llvm core to update the functions to support different pointer sizes on a per address space basis. llvm-svn: 165941	2012-10-15 16:24:29 +00:00
Bill Wendling	a05b043c4a	Remove the bitwise XOR operator from the Attributes class. Replace it with the equivalent from the builder class. llvm-svn: 165893	2012-10-14 06:56:13 +00:00
Jakob Stoklund Olesen	ea82bd7f0d	Drop <def,dead> flags when merging into an unused lane. The new coalescer can merge a dead def into an unused lane of an otherwise live vector register. Clear the <dead> flag when that happens since the flag refers to the full virtual register which is still live after the partial dead def. This fixes PR14079. llvm-svn: 165877	2012-10-13 17:26:47 +00:00
Jakob Stoklund Olesen	2f6dfc7d0b	Allow for loops in LiveIntervals::pruneValue(). It is possible that the live range of the value being pruned loops back into the kill MBB where the search started. When that happens, make sure that the beginning of KillMBB is also pruned. Instead of starting a DFS at KillMBB and skipping the root of the search, start a DFS at each KillMBB successor, and allow the search to loop back to KillMBB. This fixes PR14078. llvm-svn: 165872	2012-10-13 16:15:31 +00:00
Jakob Stoklund Olesen	1a87a29d08	Use a transposed algorithm for handleMove(). Completely update one interval at a time instead of collecting live range fragments to be updated. This avoids building data structures, except for a single SmallPtrSet of updated intervals. Also share code between handleMove() and handleMoveIntoBundle(). Add support for moving dead defs across other live values in the interval. The MI scheduler can do that. llvm-svn: 165824	2012-10-12 21:31:57 +00:00
Jakob Stoklund Olesen	1a3eb878f6	Fix coalescing with IMPLICIT_DEF values. PHIElimination inserts IMPLICIT_DEF instructions to guarantee that all PHI predecessors have a live-out value. These IMPLICIT_DEF values are not considered to be real interference when coalescing virtual registers: %vreg1 = IMPLICIT_DEF %vreg2 = MOV32r0 When joining %vreg1 and %vreg2, the IMPLICIT_DEF instruction and its value number should simply be erased since the %vreg2 value number now provides a live-out value for the PHI predecesor block. llvm-svn: 165813	2012-10-12 18:03:04 +00:00
Ulrich Weigand	9aa51d1a2c	Fix big-endian codegen bug in DAGTypeLegalizer::ExpandRes_BITCAST On PowerPC, a bitcast of <16 x i8> to i128 may run through a code path in ExpandRes_BITCAST that attempts to do an intermediate bitcast to a <4 x i32> vector, and then construct the Hi and Lo parts of the resulting i128 by pairing up two of those i32 vector elements each. The code already recognizes that on a big-endian system, the first two vector elements form the Hi part, and the final two vector elements form the Lo part (vice-versa from the little-endian situation). However, we also need to take endianness into account when forming each of those separate pairs: on a big-endian system, vector element 0 is the high part of the pair making up the Hi part of the result, and vector element 1 is the low part of the pair. The code currently always uses vector element 0 as the low part and vector element 1 as the high part, as is appropriate for little-endian platforms only. This patch fixes this by swapping the vector elements as they are paired up as appropriate. llvm-svn: 165802	2012-10-12 15:42:58 +00:00
Evan Cheng	21c4adcdd8	Legalizer optimize a pair of div / mod to a call to divrem libcall if they are not legal. However, it should use a div instruction + mul + sub if divide is legal. The rem legalization code was missing a check and incorrectly uses a divrem libcall even when div is legal. rdar://12481395 llvm-svn: 165778	2012-10-12 01:15:47 +00:00
Sean Silva	506a1c5a58	Remove unnecessary classof()'s isa<> et al. automatically infer when the cast is an upcast (including a self-cast), so these are no longer necessary. llvm-svn: 165767	2012-10-11 23:30:49 +00:00
Micah Villmow	0c61134d8d	Revert 165732 for further review. llvm-svn: 165747	2012-10-11 21:27:41 +00:00
Micah Villmow	083189730e	Add in the first iteration of support for llvm/clang/lldb to allow variable per address space pointer sizes to be optimized correctly. llvm-svn: 165726	2012-10-11 17:21:41 +00:00
Jakob Stoklund Olesen	d0d7860f40	Pass an explicit operand number to addLiveIns. Not all instructions define a virtual register in their first operand. Specifically, INLINEASM has a different format. <rdar://problem/12472811> llvm-svn: 165721	2012-10-11 16:46:07 +00:00
Michael Liao	6b49c2f69c	Follow the same routine to add target float expansion hook llvm-svn: 165707	2012-10-11 07:22:01 +00:00
Andrew Trick	5f35afb0f1	misched: Handle "transient" non-instructions. llvm-svn: 165701	2012-10-11 05:37:06 +00:00
Nadav Rotem	e10328737d	Add a new interface to allow IR-level passes to access codegen-specific information. llvm-svn: 165665	2012-10-10 22:04:55 +00:00
Micah Villmow	0242b9b543	Add in support for expansion of all of the comparison operations to the absolute minimum required set. This allows a backend to expand any arbitrary set of comparisons as long as a minimum set is supported. The minimum set of required instructions is ISD::AND, ISD::OR, ISD::SETO(or ISD::SETOEQ) and ISD::SETUO(or ISD::SETUNE). Everything is expanded into one of two patterns: Pattern 1: (LHS CC1 RHS) Opc (LHS CC2 RHS) Pattern 2: (LHS CC1 LHS) Opc (RHS CC2 RHS) llvm-svn: 165655	2012-10-10 20:50:51 +00:00
Michael Liao	effae0c8e1	Add alternative support for FP_ROUND from v2f32 to v2f64 - Due to the current matching vector elements constraints in ISD::FP_EXTEND, rounding from v2f32 to v2f64 is scalarized. Add a customized v2f32 widening to convert it into a target-specific X86ISD::VFPEXT to work around this constraints. This patch also reverts a previous attempt to fix this issue by recovering the scalarized ISD::FP_EXTEND pattern and thus significantly reduces the overhead of supporting non-power-2 vector FP extend. llvm-svn: 165625	2012-10-10 16:32:15 +00:00
Stepan Dyatkovskiy	f13dbb8e24	Issue description: SchedulerDAGInstrs::buildSchedGraph ignores dependencies between FixedStack objects and byval parameters. So loading byval parameters from stack may be inserted before it will be stored, since these operations are treated as independent. Fix: Currently ARMTargetLowering::LowerFormalArguments saves byval registers with FixedStack MachinePointerInfo. To fix the problem we need to store byval registers with MachinePointerInfo referenced to first the "byval" parameter. Also commit adds two new fields to the InputArg structure: Function's argument index and InputArg's part offset in bytes relative to the start position of Function's argument. E.g.: If function's argument is 128 bit width and it was splitted onto 32 bit regs, then we got 4 InputArg structs with same arg index, but different offset values. llvm-svn: 165616	2012-10-10 11:37:36 +00:00
Bill Wendling	bbcdf4e2a5	Remove the final bits of Attributes being declared in the Attribute namespace. Use the attribute's enum value instead. No functionality change intended. llvm-svn: 165610	2012-10-10 07:36:45 +00:00
Lang Hames	05fee08dfa	My earlier "fix" for PBQP (see r165201) was incorrect. The real issue was that checkRegMaskInterference only initializes the bitmask on the first interference. This fixes PR14027 and (re)fixes PR13945. llvm-svn: 165608	2012-10-10 06:39:48 +00:00
Andrew Trick	c334bd4577	misched: fall-back to a target hook for instr bundles. llvm-svn: 165606	2012-10-10 05:43:18 +00:00
Andrew Trick	dd79f0fcea	misched: Use the TargetSchedModel interface wherever possible. Allows the new machine model to be used for NumMicroOps and OutputLatency. Allows the HazardRecognizer to be disabled along with itineraries. llvm-svn: 165603	2012-10-10 05:43:09 +00:00
Andrew Trick	780fae8cd6	misched: Add computeInstrLatency to TargetSchedModel. llvm-svn: 165566	2012-10-09 23:44:32 +00:00
Andrew Trick	cfcf5202a1	misched: Allow flags to disable hasInstrSchedModel/hasInstrItineraries for external users of TargetSchedule. llvm-svn: 165564	2012-10-09 23:44:26 +00:00
Andrew Trick	caf1dc7867	misched: Remove LoopDependencies heuristic. This wasn't contributing anything significant to postRA heuristics except compile time (by my measurements) and will be replaced by a more general heuristic for cross-region dependencies within the scheduler itself. llvm-svn: 165563	2012-10-09 23:44:23 +00:00
Bill Wendling	8ccd6ca199	Use the attribute enums to query if a parameter has an attribute. llvm-svn: 165550	2012-10-09 21:38:14 +00:00
Micah Villmow	89021e4740	Add in the first step of the multiple pointer support. This adds in support to the data layout for specifying a per address space pointer size. The next step is to update the optimizers to allow them to optimize the different address spaces with this information. llvm-svn: 165505	2012-10-09 16:06:12 +00:00
Bill Wendling	c9b22d735a	Create enums for the different attributes. We use the enums to query whether an Attributes object has that attribute. The opaque layer is responsible for knowing where that specific attribute is stored. llvm-svn: 165488	2012-10-09 07:45:08 +00:00
Eric Christopher	286113687a	Fix up comment to be more clear. llvm-svn: 165463	2012-10-08 23:53:45 +00:00
Nadav Rotem	35315fea70	Refactor the AddrMode class out of TLI to its own header file. This class is used by LSR and a number of places in the codegen. This is the first step in de-coupling LSR from TLI, and creating a new interface in between them. llvm-svn: 165455	2012-10-08 23:06:34 +00:00
Jakob Stoklund Olesen	9d1173a86e	Don't crash on extra evil irreducible control flow. When the CFG contains a loop with multiple entry blocks, the traces computed by MachineTraceMetrics don't always have the same nice properties. Loop back-edges are normally excluded from traces, but MachineLoopInfo doesn't recognize loops with multiple entry blocks, so those back-edges may be included. Avoid asserting when that happens by adding an isEarlierInSameTrace() function that accurately determines if a dominating block is part of the same trace AND is above the currrent block in the trace. llvm-svn: 165434	2012-10-08 22:06:44 +00:00
Eric Christopher	cc10d20a17	Fixup comment. llvm-svn: 165427	2012-10-08 20:48:54 +00:00
Eric Christopher	85a495e9a7	Fixup comments. llvm-svn: 165426	2012-10-08 20:48:49 +00:00
Andrew Trick	07dced627e	misched: remove the unused getSpecialAddressLatency hook. llvm-svn: 165418	2012-10-08 18:54:00 +00:00
Andrew Trick	09650df562	misched: remove forceUnitLatencies. Defaults are handled by the default SchedModel llvm-svn: 165417	2012-10-08 18:53:57 +00:00
Andrew Trick	984d98bf6a	misched: avoid scheduling an instruction twice. llvm-svn: 165416	2012-10-08 18:53:53 +00:00
Micah Villmow	cdfe20b97f	Move TargetData to DataLayout. llvm-svn: 165402	2012-10-08 16:38:25 +00:00
Craig Topper	bc3a602929	Remove unused MachineInstr constructors that don't take a DebugLoc argument. llvm-svn: 165382	2012-10-07 23:03:22 +00:00
Craig Topper	2f6031c643	Fix indentation. Remove 'else' after return. No functional change. llvm-svn: 165381	2012-10-07 20:31:05 +00:00
Benjamin Kramer	db5fb3bfe8	Remove unused but set variable flagged by GCC. llvm-svn: 165331	2012-10-05 20:08:45 +00:00
Benjamin Kramer	62f7fb977c	Simplify code, don't or a bool with an uint64_t. No functionality change. llvm-svn: 165321	2012-10-05 18:19:44 +00:00
Nadav Rotem	b27777ff02	When merging connsecutive stores, use vectors to store the constant zero. llvm-svn: 165267	2012-10-04 22:35:15 +00:00
Eric Christopher	13319578ea	Update this a bit more to represent how the prologue should work: a) frame setup instructions define the prologue b) we shouldn't change our location mid-stream Add a test to make sure that the stack adjustment stays within the prologue. llvm-svn: 165250	2012-10-04 20:46:14 +00:00
Jakob Stoklund Olesen	878d386b9a	Get MCSchedModel directly from the subtarget. Not all targets have itineraries, but the subtarget always has an MCSchedModel. llvm-svn: 165236	2012-10-04 17:30:43 +00:00
Jakob Stoklund Olesen	8982222917	Switch MachineTraceMetrics to the new TargetSchedModel interface. llvm-svn: 165235	2012-10-04 17:30:40 +00:00
Lang Hames	8ce99f296b	Fix reg mask slot test, and preserve LiveIntervals and VirtRegMap in the PBQP allocator. Fixes PR13945. llvm-svn: 165201	2012-10-04 04:50:53 +00:00
Andrew Trick	8abcf4df68	Enable -schedmodel, but prefer itineraries until we have more benchmark data. llvm-svn: 165188	2012-10-04 00:24:34 +00:00
Bill Wendling	71ad78b24b	Update to use the predicate methods to query if an attribute exists. llvm-svn: 165163	2012-10-03 21:17:09 +00:00
Nadav Rotem	ac92066b0c	Fix a cycle in the DAG. In this code we replace multiple loads with a single load and multiple stores with a single load. We create the wide loads and stores (and their chains) before we remove the scalar loads and stores and fix the DAG chain. We attempted to merge loads with a different chain. When that happened, the assumption that it is safe to RAUW broke and a cycle was introduced. llvm-svn: 165148	2012-10-03 19:30:31 +00:00
Nadav Rotem	7cbc12a41d	A DAGCombine optimization for mergeing consecutive stores to memory. The optimization is not profitable in many cases because modern processors perform multiple stores in parallel and merging stores prior to merging requires extra work. We handle two main cases: 1. Store of multiple consecutive constants: q->a = 3; q->4 = 5; In this case we store a single legal wide integer. 2. Store of multiple consecutive loads: int a = p->a; int b = p->b; q->a = a; q->b = b; In this case we load/store either ilegal vector registers or legal wide integer registers. llvm-svn: 165125	2012-10-03 16:11:15 +00:00
Silviu Baranga	3c314990e6	Fixed a bug in the ExecutionDependencyFix pass that caused dependencies to not propagate through implicit defs. llvm-svn: 165102	2012-10-03 08:29:36 +00:00
Eric Christopher	f4fba5cf7a	Revert 165051-165049 while looking into the foreach.m failure in more detail. llvm-svn: 165099	2012-10-03 08:10:01 +00:00
Jakob Stoklund Olesen	0f6e8bb5e0	The early if conversion pass is ready to be used as an opt-in. Enable the pass by default for targets that request it, and change the -enable-early-ifcvt to the opposite -disable-early-ifcvt. There are still some x86 regressions when enabling early if-conversion because of the missing machine models. Disable the pass for x86 until machine models are added. llvm-svn: 165075	2012-10-03 00:51:32 +00:00
Eric Christopher	d7e9a450eb	Revert "Don't use a debug location for frame setup instructions in the" This reverts 165055 and 165052 temporarily while I look at debugger failures. llvm-svn: 165071	2012-10-02 23:43:11 +00:00
Jakob Stoklund Olesen	dd4d8dfea8	Remove the old coalescer algorithm. The new algorithm has been enabled by default for almost a week now and seems to be stable. llvm-svn: 165062	2012-10-02 22:45:03 +00:00
Jakob Stoklund Olesen	c8e25d98c0	Handle reserved registers more accurately in handleMove(). Reserved register live ranges look like a set of dead defs - any uses of reserved registers are ignored. Instead of skipping the updating of reserved register operands entirely, just ignore the use operands and treat the def operands normally. No test case, handleMove() is not commonly used yet. llvm-svn: 165060	2012-10-02 22:08:36 +00:00
Jakob Stoklund Olesen	bb999c2f72	Make sure the whole live range is covered when values are pruned twice. JoinVals::pruneValues() calls LIS->pruneValue() to avoid conflicts when overlapping two different values. This produces a set of live range end points that are used to reconstruct the live range (with SSA update) after joining the two registers. When a value is pruned twice, the set of end points was insufficient: v1 = DEF v1 = REPLACE1 v1 = REPLACE2 KILL v1 The end point at KILL would only reconstruct the live range from REPLACE2 to KILL, leaving the range REPLACE1-REPLACE2 dead. Add REPLACE2 as an end point in this case so the full live range is reconstructed. This fixes PR13999. llvm-svn: 165056	2012-10-02 21:46:39 +00:00
Eric Christopher	a55b1d5b99	80-col. llvm-svn: 165054	2012-10-02 21:44:12 +00:00
Eric Christopher	f01b02b7cf	Don't use a debug location for frame setup instructions in the prologue. Also skip frame setup instructions when looking for the first location. llvm-svn: 165052	2012-10-02 21:17:00 +00:00
Eric Christopher	d40ce7a43d	Remove the SavePoint infrastructure from fast isel, replace with just an insert point from the MachineBasicBlock and let the location be updated as we access it. llvm-svn: 165049	2012-10-02 21:16:50 +00:00
Duncan Sands	f97cb15aee	Fix PR13991: legalizing an overflowing multiplication operation is harder than the add/sub case since in the case of multiplication you also have to check that the operation in the larger type did not overflow. llvm-svn: 165017	2012-10-02 15:03:49 +00:00
Jakub Staszak	ec5a2f248f	Use dyn_cast instead of isa and cast. No functionality change. llvm-svn: 164924	2012-09-30 21:24:57 +00:00
Nadav Rotem	abbe665154	Revert r164910 because it causes failures to several phase2 builds. llvm-svn: 164911	2012-09-30 07:17:56 +00:00
Nadav Rotem	45715b25f7	A DAGCombine optimization for merging consecutive stores. This optimization is not profitable in many cases because moden processos can store multiple values in parallel, and preparing the consecutive store requires some work. We only handle these cases: 1. Consecutive stores where the values and consecutive loads. For example: int a = p->a; int b = p->b; q->a = a; q->b = b; 2. Consecutive stores where the values are constants. Foe example: q->a = 4; q->b = 5; llvm-svn: 164910	2012-09-30 06:24:14 +00:00
Duncan Sands	fb9d30dd64	Speculatively revert commit 164885 (nadav) in the hope of ressurecting a pile of buildbots. Original commit message: A DAGCombine optimization for merging consecutive stores. This optimization is not profitable in many cases because moden processos can store multiple values in parallel, and preparing the consecutive store requires some work. We only handle these cases: 1. Consecutive stores where the values and consecutive loads. For example: int a = p->a; int b = p->b; q->a = a; q->b = b; 2. Consecutive stores where the values are constants. Foe example: q->a = 4; q->b = 5; llvm-svn: 164890	2012-09-29 10:25:35 +00:00
Craig Topper	5f9791fd2f	Tidy up to match coding standards. Remove 'else' after 'return' and moving operators to end of preceding line. No functional change intended. llvm-svn: 164887	2012-09-29 07:18:53 +00:00
Craig Topper	65161fa493	Replace a couple if/elses around similar calls with conditional operators on the varying arguments. No functional change. llvm-svn: 164886	2012-09-29 06:54:22 +00:00
Nadav Rotem	a2e7ea2f18	A DAGCombine optimization for merging consecutive stores. This optimization is not profitable in many cases because moden processos can store multiple values in parallel, and preparing the consecutive store requires some work. We only handle these cases: 1. Consecutive stores where the values and consecutive loads. For example: int a = p->a; int b = p->b; q->a = a; q->b = b; 2. Consecutive stores where the values are constants. Foe example: q->a = 4; q->b = 5; llvm-svn: 164885	2012-09-29 06:33:25 +00:00
Jakob Stoklund Olesen	31af8bf1cc	Remove <def,read-undef> flags from partial redefinitions. The new coalescer can turn a full virtual register definition into a partial redef by merging another value into an unused vector lane. Make sure to clear the <read-undef> flag on such defs. llvm-svn: 164807	2012-09-27 23:31:32 +00:00
Jakob Stoklund Olesen	8919aa508d	Enable the new coalescer algorithm by default. The new coalescer is better at merging values into unused vector lanes, improving NEON code. llvm-svn: 164794	2012-09-27 21:06:02 +00:00
Jakob Stoklund Olesen	4976d0df41	Don't dereference begin() on an empty vector. The fix is obvious and the only test case I have is horrible, so I am not including it. The problem shows up when self-hosting clang on i386 with -new-coalescer enabled. llvm-svn: 164793	2012-09-27 21:05:59 +00:00
Jakob Stoklund Olesen	1d19582a8f	Avoid dereferencing a NULL pointer. Fixes PR13943. llvm-svn: 164778	2012-09-27 16:34:19 +00:00
Sylvestre Ledru	91ce36c986	Revert 'Fix a typo 'iff' => 'if''. iff is an abreviation of if and only if. See: http://en.wikipedia.org/wiki/If_and_only_if Commit 164767 llvm-svn: 164768	2012-09-27 10:14:43 +00:00
Sylvestre Ledru	721cffd53a	Fix a typo 'iff' => 'if' llvm-svn: 164767	2012-09-27 09:59:43 +00:00
Bill Wendling	863bab689a	Remove the `hasFnAttr' method from Function. The hasFnAttr method has been replaced by querying the Attributes explicitly. No intended functionality change. llvm-svn: 164725	2012-09-26 21:48:26 +00:00
Craig Topper	2a6a08b1cd	Rename virtual table anchors from Anchor() to anchor() for consistency with the rest of the tree. llvm-svn: 164666	2012-09-26 06:36:36 +00:00
Bill Wendling	5def891396	Generate an error message instead of asserting or segfaulting when we have a scalar-to-vector conversion that we cannot handle. For instance, when an invalid constraint is used in an inline asm statement. <rdar://problem/12284092> llvm-svn: 164662	2012-09-26 06:16:18 +00:00
Bill Wendling	81406f692f	Generate an error message instead of asserting or segfaulting when we have a scalar-to-vector conversion that we cannot handle. For instance, when an invalid constraint is used in an inline asm statement. <rdar://problem/12284092> llvm-svn: 164657	2012-09-26 04:04:19 +00:00
Sebastian Pop	edb31faf92	TargetLowering interface to set/get minimum block entries for jump tables. Provide interface in TargetLowering to set or get the minimum number of basic blocks whereby jump tables are generated for switch statements rather than an if sequence. getMinimumJumpTableEntries() defaults to 4. setMinimumJumpTableEntries() allows target configuration. This patch changes the default for the Hexagon architecture to 5 as it improves performance on some benchmarks. llvm-svn: 164628	2012-09-25 20:35:36 +00:00
Jim Grosbach	361ca34270	Mark jump tables in code sections with DataRegion directives. Even out-of-line jump tables can be in the code section, so mark them as data-regions for those targets which support the directives. rdar://12362871&12362974 llvm-svn: 164571	2012-09-24 23:06:27 +00:00
Eric Christopher	c1c8a1bb6a	Have the DbgVariable "isArtificial" and "isObjectPointer" not care about it being an argument variable so that we can decide that captured block and lambda vars that don't happen to be arguments could be an argument pointer. Add the object pointer for one case onto the subprogram die. rdar://12001329 llvm-svn: 164419	2012-09-21 22:18:52 +00:00
Evan Cheng	b53825b82b	Fix a significant recent(?) regression. StackSlotColoring no longer did anything because LiveStackAnalysis was not preserved by VirtRegWriter. This caused big stack usage regression in some cases. rdar://12340383 llvm-svn: 164408	2012-09-21 20:04:28 +00:00
Bill Wendling	9be7759ee1	Make the 'get*AlignmentFromAttr' functions into member functions within the Attributes class. Now with fix. llvm-svn: 164370	2012-09-21 15:26:31 +00:00
Jakob Stoklund Olesen	b8707faba3	Ignore PHI-defs for -new-coalescer interference checks. A PHI can't create interference on its own. If two live ranges interfere at a PHI, they must also interfere when leaving one of the PHI predecessors. llvm-svn: 164330	2012-09-20 23:08:42 +00:00
Jakob Stoklund Olesen	09cd303655	Extend -new-coalescer SSA update to handle mapped values as well. The old-fashioned many-to-one value mapping doesn't always work when merging vector lanes. A value can map to multiple different values, and it can even be necessary to insert new PHIs. When a value number is defined by a copy from a value number that required SSa update, include the live range of the copied value number in the SSA update as well. It is not necessarily a copy of the original value number any longer. llvm-svn: 164329	2012-09-20 23:08:39 +00:00
Eric Christopher	3a3d529e0d	Only emit DW_AT_object_pointer if this is a definition. llvm-svn: 164326	2012-09-20 22:51:57 +00:00
Bill Wendling	c727bacb38	Revert r164308 to fix buildbots. llvm-svn: 164309	2012-09-20 16:59:57 +00:00
Bill Wendling	abac66150c	Make the 'get*AlignmentFromAttr' functions into member functions within the Attributes class. llvm-svn: 164308	2012-09-20 16:27:05 +00:00
Nadav Rotem	841c9a84d0	Fix 80-col violations. llvm-svn: 164297	2012-09-20 08:53:31 +00:00
Bill Wendling	3bef2dd5f9	Convert some attribute existence queries over to use the predicate methods. llvm-svn: 164268	2012-09-19 23:54:18 +00:00
Bill Wendling	d6b2688130	Add predicates for queries on whether an attribute exists. llvm-svn: 164264	2012-09-19 23:35:21 +00:00
Jakob Stoklund Olesen	7d3c9c0a2a	Resolve conflicts involving dead vector lanes for -new-coalescer. A common coalescing conflict in vector code is lane insertion: %dst = FOO %src = BAR %dst:ssub0 = COPY %src The live range of %src interferes with the ssub0 lane of %dst, but that lane is never read after %src would have clobbered it. That makes it safe to merge the live ranges and eliminate the COPY: %dst = FOO %dst:ssub0 = BAR This patch teaches the new coalescer to resolve conflicts where dead vector lanes would be clobbered, at least as long as the clobbered vector lanes don't escape the basic block. llvm-svn: 164250	2012-09-19 21:29:18 +00:00
Andrew Trick	6a35f197a7	comment typo llvm-svn: 164180	2012-09-18 22:57:42 +00:00
Andrew Trick	f2b70d9f3a	TargetSchedule: cleanup computeOperandLatency logic & diagnostics. llvm-svn: 164154	2012-09-18 18:20:02 +00:00
Andrew Trick	9b63513ac6	misched: Make ScheduleDAGInstrs use the TargetSchedule interface. llvm-svn: 164153	2012-09-18 18:20:00 +00:00
Roman Divacky	5dd4ccb402	When creating MCAsmBackend pass the CPU string as well. In X86AsmBackend store this and use it to not emit long nops when the CPU is geode which doesnt support them. Fixes PR11212. llvm-svn: 164132	2012-09-18 16:08:49 +00:00
Andrew Trick	6e6d597b1c	TargetSchedModel API. Implement latency lookup, disabled. llvm-svn: 164098	2012-09-18 04:03:34 +00:00
Craig Topper	b1d83e8c72	Mark unimplemented copy constructors and copy assignment operators as LLVM_DELETED_FUNCTION. llvm-svn: 164090	2012-09-18 02:01:41 +00:00
Evan Cheng	c573599137	Fix some funky indentation. llvm-svn: 164087	2012-09-18 01:34:40 +00:00
Jakob Stoklund Olesen	0bb3dd78c4	Merge into undefined lanes under -new-coalescer. Add LIS::pruneValue() and extendToIndices(). These two functions are used by the register coalescer when merging two live ranges requires more than a trivial value mapping as supported by LiveInterval::join(). The pruneValue() function can remove the part of a value number that is going to conflict in join(). Afterwards, extendToIndices can restore the live range, using any new dominating value numbers and updating the SSA form. Use this complex value mapping to support merging a register into a vector lane that has a conflicting value, but the clobbered lane is undef. llvm-svn: 164074	2012-09-17 23:03:25 +00:00
Jakob Stoklund Olesen	af50f17df4	Stop adding <imp-def> operands when expanding REG_SEQUENCE. These extra operands are not needed by register allocators using VirtRegRewriter, and RAFast don't need them any longer. By omitting the <imp-def> operands, it becomes possible for the new register coalescer to track which lanes are valid and which are undef. llvm-svn: 164073	2012-09-17 23:03:21 +00:00
Andrew Trick	8e7f202e32	Revert r164061-r164067. Most of the new subtarget emitter. I have to work out the Target/CodeGen header dependencies before putting this back. llvm-svn: 164072	2012-09-17 23:00:42 +00:00
Andrew Trick	f403ee7937	TargetSchedModel API. Implement latency lookup, disabled. llvm-svn: 164065	2012-09-17 22:19:08 +00:00
Michael Ilseman	4f0e00a5b8	Increase the static sizes of some SmallSets. finalizeBundle() is very frequently called for some backends, and growing into an std::set is overkill for these numbers. llvm-svn: 164044	2012-09-17 18:31:15 +00:00
Michael Ilseman	3a8336379c	whitespace llvm-svn: 164043	2012-09-17 18:25:23 +00:00
Michael Liao	b503b323f3	Fix PR13859 - Preserve the original NOutVT during casting from vector to integer by extracting vector elements. llvm-svn: 164042	2012-09-17 18:05:20 +00:00
Tom Stellard	86af62c1ad	Add a MachinePostDominator pass This is used in the AMDIL and R600 backends. llvm-svn: 164029	2012-09-17 14:08:37 +00:00
Nadav Rotem	2ae810a51f	Disable the protection from escaped allocas in an attempt to find violating passes. This may break the buildbots. I plan to revert it in a few hours. llvm-svn: 164024	2012-09-17 10:21:55 +00:00
Craig Topper	04b4e83cf7	Fix bad comment. No functional change. llvm-svn: 164000	2012-09-16 16:48:25 +00:00
Jakob Stoklund Olesen	17e2185543	Add alternative coalescing algorithm under a flag. The live range of an SSA value forms a sub-tree of the dominator tree. That means the live ranges of two values overlap if and only if the def of one value lies within the live range of the other. This can be used to simplify the interference checking a bit: Visit each def in the two registers about to be joined. Check for interference against the value that is live in the other register at the def point only. It is not necessary to scan the set of overlapping live ranges, this interference check can be done while computing the value mapping required for the final live range join. The new algorithm is prepared to handle more complicated conflict resolution - We can allow overlapping live ranges with different values as long as the differing lanes are undef or unused in the other register. The implementation in this patch doesn't do that yet, it creates code that is nearly identical to the old algorithm's, except: - The new stripCopies() function sees through multiple copies while the old RegistersDefinedFromSameValue() only can handle one. - There are a few rare cases where the new algorithm can erase an IMPLICIT_DEF instuction that RegistersDefinedFromSameValue() couldn't handle. llvm-svn: 163991	2012-09-16 02:15:36 +00:00
Craig Topper	a60c0f1163	Use LLVM_DELETED_FUNCTION in place of 'DO NOT IMPLEMENT' comments. llvm-svn: 163974	2012-09-15 17:09:36 +00:00
Jakob Stoklund Olesen	b7d27a3dd7	Don't depend on kill flags in removeCopyByCommutingDef(). Kill flags are removed more and more aggressively during the register allocation passes, it is better to get information from LiveIntervals. llvm-svn: 163972	2012-09-15 16:32:11 +00:00
Andrew Trick	d2a19da1b8	TargetSchedModel interface. To be implemented... llvm-svn: 163934	2012-09-14 20:26:46 +00:00
Andrew Trick	a2733e9549	misched: add a hook for custom DAG postprocessing. llvm-svn: 163915	2012-09-14 17:22:42 +00:00
Duncan Sands	291d47efdf	Remove silly dead store. Patch by Ettl Martin. llvm-svn: 163882	2012-09-14 09:00:11 +00:00
Eric Christopher	b83dba2b84	Fix both the test for zero and what we do if we have a zero for umulo legalization. Fixes PR13839 llvm-svn: 163856	2012-09-13 23:24:02 +00:00
Eric Christopher	3bc248176c	Reformat, remove a couple unused variables and move some variables closer to where they're needed. llvm-svn: 163855	2012-09-13 23:23:58 +00:00
Michael Liao	460fc46e0f	Enhance type legalization on bitcast from vector to integer - Find a legal vector type before casting and extracting element from it. - As the new vector type may have more than 2 elements, build the final hi/lo pair by BFS pairing them from bottom to top. llvm-svn: 163830	2012-09-13 19:58:21 +00:00
Nadav Rotem	77a09ebbeb	Rename the flag which protects from escaped allocas, which may come from bugs in user code or in the compiler. Also, dont assert if the protection is not enabled. llvm-svn: 163807	2012-09-13 15:46:30 +00:00
Nadav Rotem	24a822a5cb	Fix a dagcombine optimization. The optimization attempts to optimize a bitcast of fneg to integers by xoring the high-bit. This fails if the source operand is a vector because we need to negate each of the elements in the vector. Fix rdar://12281066 PR13813. llvm-svn: 163802	2012-09-13 14:54:28 +00:00
Nadav Rotem	2bd25fed29	Fix a typo. llvm-svn: 163801	2012-09-13 14:51:00 +00:00
Nadav Rotem	4e9ad06617	Stack Coloring: We have code that checks that all of the uses of allocas are within the lifetime zone. Sometime legitimate usages of allocas are hoisted outside of the lifetime zone. For example, GEPS may calculate the address of a member of an allocated struct. This commit makes sure that we only check (abort regions or assert) for instructions that read and write memory using stack frames directly. Notice that by allowing legitimate usages outside the lifetime zone we also stop checking for instructions which use derivatives of allocas. We will catch less bugs in user code and in the compiler itself. llvm-svn: 163791	2012-09-13 12:38:37 +00:00
Eric Christopher	e341776c1e	Recommit, with fixes: Add some support for dealing with an object pointer on arguments. Part of rdar://9797999 which now supports adding the object pointer attribute to the subprogram as it should. llvm-svn: 163754	2012-09-12 23:36:19 +00:00
Michael Liao	abb87d4857	Fix PR11985 - BlockAddress has no support of BA + offset form and there is no way to propagate that offset into machine operand; - Add BA + offset support and a new interface 'getTargetBlockAddress' to simplify target block address forming; - All targets are modified to use new interface and X86 backend is enhanced to support BA + offset addressing. llvm-svn: 163743	2012-09-12 21:43:09 +00:00
Owen Anderson	6f9dace01c	Remove an overly-aggressive assertion. The code following this assertion already knows how to handle the case where DstRC was NULL, so it's not actually protecting us from anything, and this pattern can come up when using unknown_class operands in the SelectionDAG. llvm-svn: 163736	2012-09-12 20:09:19 +00:00
Jakob Stoklund Olesen	5a3db551a8	Delete dead code. llvm-svn: 163735	2012-09-12 20:04:17 +00:00
Eric Christopher	c44e973a36	Revert "Add some support for dealing with an object pointer on arguments." This should be done on the subprogram, not the variable itself. llvm-svn: 163734	2012-09-12 18:42:31 +00:00
Dmitri Gribenko	881929c1b6	Fix a couple of Doxygen comment issues pointed out by -Wdocumentation. llvm-svn: 163721	2012-09-12 16:59:47 +00:00
Kristof Beyls	e6b876f4e5	Fix constant folding through bitcasts by no longer relying on undefined behaviour (converting NaN values between float and double). SelectionDAG::getConstantFP(double Val, EVT VT, bool isTarget); should not be used when Val is not a simple constant (as the comment in SelectionDAG.h indicates). This patch avoids using this function when folding an unknown constant through a bitcast, where it cannot be guaranteed that Val will be a simple constant. llvm-svn: 163703	2012-09-12 11:25:02 +00:00
Nadav Rotem	9566ca9af8	Add a flag to disable the code that looks for allocas which escaped the lifetime regions. This is useful for debugging. No testcase because without this check we fail on assertions when finding escaped allocas. llvm-svn: 163702	2012-09-12 11:06:26 +00:00
James Molloy	c747cdae24	Add a function computeRegisterLiveness() to MachineBasicBlock. This uses analyzePhysReg() from r163694 to heuristically try and determine the liveness state of a physical register upon arrival at a particular instruction in a block. The search for liveness is clipped to a specific number of instructions around the target MachineInstr, in order to avoid degenerating into an O(N^2) algorithm. It tries to use various clues about how instructions around (both before and after) a given MachineInstr use that register, to determine its state at the MachineInstr. llvm-svn: 163695	2012-09-12 10:18:23 +00:00
James Molloy	381fab93d5	Add an analyzePhysReg() function to MachineOperandIteratorBase that analyses an instruction's use of a physical register, analogous to analyzeVirtReg. Rename RegInfo to VirtRegInfo so as not to be confused with the new PhysRegInfo. llvm-svn: 163694	2012-09-12 10:03:31 +00:00
Nadav Rotem	b9e2202049	Enable stack-coloring, in hope that the recent fixes will enable correct dragonegg self-hosting. llvm-svn: 163687	2012-09-12 07:58:35 +00:00
Lang Hames	c3d9a3d881	Make findLastUseBefore handle reg-unit liveness. findLastUseBefore was previous considering virtreg liveness only, leading to incorrect live intervals for reg units when instrs with physreg operands were moved up. llvm-svn: 163685	2012-09-12 06:56:16 +00:00
Nadav Rotem	8ff00989fc	Stack coloring: remove lifetime intervals which contain escaped allocas. The input program may contain intructions which are not inside lifetime markers. This can happen due to a bug in the compiler or due to a bug in user code (for example, returning a reference to a local variable). This commit adds checks that all of the instructions in the function and invalidates lifetime ranges which do not contain all of the instructions. llvm-svn: 163678	2012-09-12 04:57:37 +00:00
Eric Christopher	97c0fdd116	Add some support for dealing with an object pointer on arguments. Part of rdar://9797999 llvm-svn: 163667	2012-09-12 00:26:55 +00:00
Manman Ren	19f49ac624	Release build: guard dump functions with "#if !defined(NDEBUG) \|\| defined(LLVM_ENABLE_DUMP)" No functional change. Update r163339. llvm-svn: 163653	2012-09-11 22:23:19 +00:00
Chad Rosier	1778831a3d	[ms-inline asm] Split the parsing of IR asm strings into GCC and MS variants. Add support in the EmitMSInlineAsmStr() function for handling integer consts. llvm-svn: 163645	2012-09-11 19:09:56 +00:00
Nadav Rotem	42b641c879	Dragonegg selfhost exposed additional cases where alloca usage moved outside of lifetime markers. Disabling the pass for now. llvm-svn: 163623	2012-09-11 15:40:27 +00:00
Nadav Rotem	4464613ceb	Enable stack coloring. llvm-svn: 163617	2012-09-11 13:48:35 +00:00
Nadav Rotem	65ba95ebf9	Stack Coloring: Dont crash on dbg values which use stack frames. llvm-svn: 163616	2012-09-11 12:34:27 +00:00
Craig Topper	8238461211	Teach DAG combiner to constant fold FABS of a BUILD_VECTOR of ConstantFPs. Factor similar code out of FNEG DAG combiner. llvm-svn: 163587	2012-09-11 01:45:21 +00:00
Andrew Trick	7a8e10042f	Reorganize MachineScheduler interfaces and publish them in the header. The Hexagon target decided to use a lot of functionality from the target-independent scheduler. That's fine, and other targets should be able to do the same. This reorg and API update makes that easy. For the record, ScheduleDAGMI was not meant to be subclassed. Instead, new scheduling algorithms should be able to implement MachineSchedStrategy and be done. But if need be, it's nice to be able to extend ScheduleDAGMI, so I also made that easier. The target scheduler is somewhat more apt to break that way though. llvm-svn: 163580	2012-09-11 00:39:15 +00:00
Eric Christopher	9fd70c8fb3	Revert r160148 it seems to cause more problems than it should right now. We'll fix PR13303 a different way. llvm-svn: 163570	2012-09-10 23:34:06 +00:00
Eric Christopher	e8a7b1b741	80-col fixup. llvm-svn: 163569	2012-09-10 23:34:03 +00:00
Eric Christopher	abb4d9ed34	80-col fixup. llvm-svn: 163568	2012-09-10 23:34:00 +00:00
Eric Christopher	a47d096125	No reason to construct this twice. llvm-svn: 163567	2012-09-10 23:33:57 +00:00
Chad Rosier	7641f58784	[ms-inline asm] Properly emit the asm directives when the AsmPrinterVariant and InlineAsmVariant don't match. llvm-svn: 163550	2012-09-10 21:36:05 +00:00
Dmitri Gribenko	ca1e27be0d	Remove redundant semicolons which are null statements. llvm-svn: 163547	2012-09-10 21:26:47 +00:00
Nadav Rotem	5a72a23a70	Disable stack coloring because it makes dragonegg fail bootstrapping. llvm-svn: 163545	2012-09-10 21:17:58 +00:00
Chad Rosier	db20a41d99	[ms-inline asm] Pass the correct AsmVariant to the PrintAsmOperand() function and update the printOperand() function accordingly. llvm-svn: 163544	2012-09-10 21:10:49 +00:00
Nadav Rotem	107faf853b	Enable stack coloring. llvm-svn: 163539	2012-09-10 20:15:49 +00:00
Nadav Rotem	3c86b78ae4	Stack Coloring: Handle the case where END markers come before BEGIN markers properly. llvm-svn: 163530	2012-09-10 18:51:09 +00:00
Michael Ilseman	0666f0580c	Fold multiply by 0 or 1 when in UnsafeFPMath mode in SelectionDAG::getNode(). This folding happens as early as possible for performance reasons, and to make sure it isn't foiled by other transforms (e.g. forming FMAs). llvm-svn: 163519	2012-09-10 17:00:37 +00:00
Michael Ilseman	d5f91515f3	whitespace llvm-svn: 163518	2012-09-10 16:56:31 +00:00
James Molloy	1e5c611815	Fix an assertion failure when optimising a shufflevector incorrectly into concat_vectors, and a followup bug with SelectionDAG::getNode() creating nodes with invalid types. llvm-svn: 163511	2012-09-10 14:01:21 +00:00
Nadav Rotem	ba9a03f279	Minor cleanup. No functional change. llvm-svn: 163510	2012-09-10 13:20:00 +00:00
Nadav Rotem	d62287dc91	Stack Coloring: Debug prints to print the slot number and not the array index. llvm-svn: 163509	2012-09-10 13:17:58 +00:00
Nadav Rotem	ed242a0f1c	Stack Coloring: When searching for disjoint regions, do not compare intervals twice or to theirself. llvm-svn: 163508	2012-09-10 12:47:38 +00:00

... 3 4 5 6 7 ...

14466 Commits