llvm-project

Commit Graph

Author	SHA1	Message	Date
Jakob Stoklund Olesen	da96006975	Move CalculateRegClass to MRI::recomputeRegClass. This function doesn't have anything to do with spill weights, and MRI already has functions for manipulating the register class of a virtual register. llvm-svn: 137123	2011-08-09 16:46:27 +00:00
Devang Patel	6c1ed31b3b	Print variable's inline location in debug output. llvm-svn: 137096	2011-08-09 01:03:35 +00:00
Jakob Stoklund Olesen	e7dddfd7f6	Rename member variables to follow coding standards. No functional change. llvm-svn: 137094	2011-08-09 01:01:27 +00:00
Jakob Stoklund Olesen	e1f5313bc7	Move the RegisterCoalescer private to its implementation file. RegisterCoalescer.h still has the CoalescerPair class interface. llvm-svn: 137088	2011-08-09 00:43:37 +00:00
Jakob Stoklund Olesen	4c9a2fb044	Refer to the RegisterCoalescer pass by ID. A public interface is no longer needed since RegisterCoalescer is not an analysis any more. llvm-svn: 137082	2011-08-09 00:29:53 +00:00
Jakob Stoklund Olesen	daa2cad723	Hoist hasLoadFromStackSlot and hasStoreToStackSlot. These the methods are target-independent since they simply scan the memory operands. They can live in TargetInstrInfoImpl. llvm-svn: 137063	2011-08-08 20:53:24 +00:00
Devang Patel	fee7cedbc9	Simplify by creating parent first. llvm-svn: 137056	2011-08-08 18:22:10 +00:00
Jakob Stoklund Olesen	22f37a1eb1	Fix typo. Thanks, Andy! llvm-svn: 137023	2011-08-06 18:20:24 +00:00
Jakob Stoklund Olesen	d4bb1d43e8	Reject RS_Spill ranges from local splitting as well. All new local ranges are marked as RS_New now, so there is no need to attempt splitting of RS_Spill ranges any more. llvm-svn: 137002	2011-08-05 23:50:33 +00:00
Jakob Stoklund Olesen	02cf10bdfd	Only mark remainder intervals as RS_Spill after per-block splitting. The local ranges created get to stay in the RS_New stage, just like for local and region splitting. This gives tryLocalSplit a bit more freedom the first time it sees one of these new local ranges. llvm-svn: 137001	2011-08-05 23:50:31 +00:00
Jakob Stoklund Olesen	0de95ef7f5	Remember to update LiveDebugVariables after per-block splitting. llvm-svn: 136996	2011-08-05 23:10:40 +00:00
Jakob Stoklund Olesen	cef5d8ff77	Extract per-block splitting into its own method. No functional change. llvm-svn: 136994	2011-08-05 23:04:18 +00:00
Jakob Stoklund Olesen	cdf9ad9107	Delete getMultiUseBlocks and splitSingleBlocks. These functions are no longer used, and they are easily replaced with a loop calling shouldSplitSingleBlock and splitSingleBlock. llvm-svn: 136993	2011-08-05 22:52:17 +00:00
Jakob Stoklund Olesen	58995bc551	Also use shouldSplitSingleBlock() in the fallback splitting mode. Drop the use of SplitAnalysis::getMultiUseBlocks, there is no need to go through a SmallPtrSet any more. llvm-svn: 136992	2011-08-05 22:43:23 +00:00
Jakob Stoklund Olesen	8627ea91cb	Split around single instructions to enable register class inflation. Normally, we don't create a live range for a single instruction in a basic block, the spiller does that anyway. However, when splitting a live range that belongs to a proper register sub-class, inserting these extra COPY instructions completely remove the constraints from the remainder interval, and it may be allocated from the larger super-class. The spiller will mop up these small live ranges if we end up spilling anyway. It calls them snippets. llvm-svn: 136989	2011-08-05 22:20:45 +00:00
Jakob Stoklund Olesen	5122467b38	Detect proper register sub-classes. Some instructions require restricted register classes, but most of the time that doesn't affect register allocation. For example, some instructions don't work with the stack pointer, but that is a reserved register anyway. Sometimes it matters, GR32_ABCD only has 4 allocatable registers. For such a proper sub-class, the register allocator should try to enable register class inflation since that makes more registers available for allocation. Make sure only legal super-classes are considered. For example, tGPR is not a proper sub-class in Thumb mode, but in ARM mode it is. llvm-svn: 136981	2011-08-05 21:28:14 +00:00
Jakob Stoklund Olesen	d633abebf6	Fix liveness computations in BranchFolding. The old code would look at kills and defs in one pass over the instruction operands, causing problems with this code: %R0<def>, %CPSR<def,dead> = tLSLri %R5<kill>, 2, pred:14, pred:%noreg %R0<def>, %CPSR<def,dead> = tADDrr %R4<kill>, %R0<kill>, pred:14, %pred:%noreg The last instruction kills and redefines %R0, so it is still live after the instruction. This caused a register scavenger crash when compiling 483.xalancbmk for armv6. I am not including a test case because it requires too much bad luck to expose this old bug. First you need to convince the register allocator to use %R0 twice on the tADDrr instruction, then you have to convince BranchFolding to do something that causes it to run the register scavenger on he bad block. <rdar://problem/9898200> llvm-svn: 136973	2011-08-05 18:47:07 +00:00
Chandler Carruth	81b7e11c89	Temporarily revert r135528 which distinguishes between two copies of one inlined variable, based on the discussion in PR10542. This explodes the runtime of several passes down the pipeline due to a large number of "copies" remaining live across a large function. This only shows up with both debug and opt, but when it does it creates a many-minute compile when self-hosting LLVM+Clang. There are several other cases that show these types of regressions. All of this is tracked in PR10542, and progress is being made on fixing the issue. Once its addressed, the re-instated, but until then this restores the performance for self-hosting and other opt+debug builds. Devang, let me know if this causes any trouble, or impedes fixing it in any way, and thanks for working on this! llvm-svn: 136953	2011-08-05 00:51:31 +00:00
Jakob Stoklund Olesen	63e3dec9ad	Count the total amount of stack space used in compiled functions. Patch by Ivan Krasin! llvm-svn: 136921	2011-08-04 21:06:09 +00:00
Devang Patel	d61b1d505c	Print DBG_VALUE variable's location info as a comment. llvm-svn: 136916	2011-08-04 20:44:26 +00:00
Devang Patel	eabc3cea33	Increment counter inside insertDebugValue(). llvm-svn: 136915	2011-08-04 20:42:11 +00:00
Devang Patel	b456866b7b	Add counter. llvm-svn: 136901	2011-08-04 18:45:38 +00:00
Jakob Stoklund Olesen	2539af600a	Correctly handle multiple DBG_VALUE instructions at the same SlotIndex. It is possible to have multiple DBG_VALUEs for the same variable: 32L TEST32rr %vreg0<kill>, %vreg0, %EFLAGS<imp-def>; GR32:%vreg0 DBG_VALUE 2, 0, !"i" DBG_VALUE %noreg, %0, !"i" When that happens, keep the last one instead of the first. llvm-svn: 136842	2011-08-03 23:44:31 +00:00
Jakob Stoklund Olesen	11b788d5be	Enable compact region splitting by default. This helps generate better code in functions with high register pressure. The previous version of compact region splitting caused regressions because the regions were a bit too large. A stronger negative bias applied in r136832 fixed this problem. llvm-svn: 136836	2011-08-03 23:16:09 +00:00
Devang Patel	aab841cf63	Do not drop undef debug values. These are used as range termination marker by live debug variable pass. llvm-svn: 136834	2011-08-03 23:13:55 +00:00
Jakob Stoklund Olesen	869545203b	Be more conservative when forming compact regions. Apply twice the negative bias on transparent blocks when computing the compact regions. This excludes loop backedges from the region when only one of the loop blocks uses the register. Previously, we would include the backedge in the region if the loop preheader and the loop latch both used the register, but the loop header didn't. When both the header and latch blocks use the register, we still keep it live on the backedge. llvm-svn: 136832	2011-08-03 23:09:38 +00:00
Chandler Carruth	77eb5a0a37	Fix some warnings from Clang in release builds: lib/CodeGen/RegAllocGreedy.cpp:1176:18: warning: unused variable 'B' [-Wunused-variable] if (unsigned B = Cand.getBundles(BundleCand, BestCand)) { ^ lib/CodeGen/RegAllocGreedy.cpp:1188:18: warning: unused variable 'B' [-Wunused-variable] if (unsigned B = Cand.getBundles(BundleCand, 0)) { ^ llvm-svn: 136831	2011-08-03 23:07:27 +00:00
Jakub Staszak	3ef20e35f9	Fix typo in #include which revealed in the case-sensitive filesystem. llvm-svn: 136828	2011-08-03 22:53:41 +00:00
Jakub Staszak	15e5b742ad	Use MachineBranchProbabilityInfo in If-Conversion instead of its own heuristics. llvm-svn: 136826	2011-08-03 22:34:43 +00:00
Jakub Staszak	a60d130f26	Add more constantness in BlockFrequencyInfo. llvm-svn: 136816	2011-08-03 21:30:57 +00:00
Eli Friedman	30a49e93e3	New approach to r136737: insert the necessary fences for atomic ops in platform-independent code, since a bunch of platforms (ARM, Mips, PPC, Alpha are the relevant targets here) need to do essentially the same thing. I think this completes the basic CodeGen for atomicrmw and cmpxchg. llvm-svn: 136813	2011-08-03 21:06:02 +00:00
Bob Wilson	0a8d5c6047	Some revisions to Devang's change r136759 for merged global debug info. llvm-svn: 136802	2011-08-03 19:42:51 +00:00
Devang Patel	dc9cbaaf23	Use byte offset, instead of element number, to access merged global. llvm-svn: 136759	2011-08-03 01:25:46 +00:00
Jakob Stoklund Olesen	3c14505164	Use the precomputed def presence in RAGreedy::calcSpillCost. llvm-svn: 136742	2011-08-02 23:04:08 +00:00
Jakob Stoklund Olesen	057f9b68de	Inform SpillPlacement about blocks with defs. This information is not used for anything yet. llvm-svn: 136741	2011-08-02 23:04:06 +00:00
Jakob Stoklund Olesen	43859a6ad2	Rename {First,Last}Use to {First,Last}Instr. With a 'FirstDef' field right there, it is very confusing that FirstUse refers to an instruction that may be a def. llvm-svn: 136739	2011-08-02 22:54:14 +00:00
Jakob Stoklund Olesen	ae8027cc95	Add a BlockInfo::FirstDef field. This is either an invalid SlotIndex, or valno->def for the first value defined inside the block. PHI values are not counted as defined inside the block. The FirstDef field will be used when estimating the cost of spilling around a block. llvm-svn: 136736	2011-08-02 22:37:22 +00:00
Jakob Stoklund Olesen	f047ff4fe1	Delete BlockInfo::LiveThrough. It wasn't used any more. llvm-svn: 136735	2011-08-02 22:37:20 +00:00
Jakob Stoklund Olesen	d2a7d1ed97	Extend the SpillPlacement interface with two new features. The PrefBoth constraint is used for blocks that ideally want a live-in value both on the stack and in a register. This would be used by a block that has a use before interference forces a spill. Secondly, add the ChangesValue flag to BlockConstraint. This tells SpillPlacement if a live-in value on the stack can be reused as a live-out stack value for free. If the block redefines the virtual register, a spill would be required for that. This extra information will be used by SpillPlacement to more accurately calculate spill costs when a value can exist both on the stack and in a register. The simplest example is a basic block that reads the virtual register, but doesn't change its value. Spilling around such a block requires a reload, but no spill in the block. The spiller already knows this, but the spill placer doesn't. That can sometimes lead to suboptimal regions. llvm-svn: 136731	2011-08-02 21:53:03 +00:00
Eli Friedman	04c5025cd5	Don't create a ridiculous EXTRACT_ELEMENT. PR10563. The testcase looks extremely fragile, so I'm adding an assertion which should catch any cases like this. llvm-svn: 136711	2011-08-02 18:38:35 +00:00
Jay Foad	8dfee5f6bf	Remove an unnecessary cast. llvm-svn: 136609	2011-08-01 12:27:15 +00:00
Bill Wendling	f891bf8b30	Add the 'resume' instruction for the new EH rewrite. This adds the 'resume' instruction class, IR parsing, and bitcode reading and writing. The 'resume' instruction resumes propagation of an existing (in-flight) exception whose unwinding was interrupted with a 'landingpad' instruction (to be added later). llvm-svn: 136589	2011-07-31 06:30:59 +00:00
Jakob Stoklund Olesen	163e7a73f1	Time the emission of debug values. llvm-svn: 136584	2011-07-31 03:53:42 +00:00
Jakob Stoklund Olesen	eb5ea833ed	Revert r136528 "Enable compact region splitting by default." While this generally helped x86-64, there was some large regressions for i386. llvm-svn: 136571	2011-07-30 17:19:14 +00:00
Bill Wendling	ad088e6724	Revert r136253, r136263, r136269, r136313, r136325, r136326, r136329, r136338, r136339, r136341, r136369, r136387, r136392, r136396, r136429, r136430, r136444, r136445, r136446, r136253 pending review. llvm-svn: 136556	2011-07-30 05:42:50 +00:00
Jakob Stoklund Olesen	5670f850c6	Revert "Don't check liveness of unallocatable registers." The ARM target depends on CPSR liveness being tracked after register allocation. llvm-svn: 136548	2011-07-30 00:57:25 +00:00
Jakob Stoklund Olesen	95cc5440e9	Don't check liveness of unallocatable registers. This includes registers like EFLAGS and ST0-ST7. We don't check for liveness issues in the verifier and scavenger because registers will never be allocated from these classes. While in SSA form, we do care about the liveness of unallocatable unreserved registers. Liveness of EFLAGS and ST0 neds to be correct for MachineDCE and MachineSinking. llvm-svn: 136541	2011-07-29 23:36:21 +00:00
Jakob Stoklund Olesen	9dd184151b	Check for multiple defs in the machine code verifier. llvm-svn: 136535	2011-07-29 23:02:48 +00:00
Jakob Stoklund Olesen	9760f04ef9	Add an isSSA() flag to MachineRegisterInfo. This flag is true from isel to register allocation when the machine function is required to be in SSA form. The TwoAddressInstructionPass and PHIElimination passes clear the flag. The SSA flag wil be used by the machine code verifier to check for SSA form, and eventually an assertion can enforce it in +Asserts builds. This will catch the common target error of creating machine code with multiple defs of a virtual register. llvm-svn: 136532	2011-07-29 22:51:22 +00:00
Jakub Staszak	0480a8fbbb	Do not lose branch weights when lowering SwitchInst. llvm-svn: 136529	2011-07-29 22:25:21 +00:00
Jakob Stoklund Olesen	b5c2d3210c	Enable compact region splitting by default. This helps generate better code in functions with high register pressure. llvm-svn: 136528	2011-07-29 22:10:27 +00:00
Jakub Staszak	539db98987	Remove unneeded const_cast. llvm-svn: 136506	2011-07-29 20:05:36 +00:00
Nick Lewycky	019d255d3e	Fix a lot of typos, improve (but not necessarily fix) grammaros and reflow some lines. No functionality change. llvm-svn: 136458	2011-07-29 03:49:23 +00:00
Eli Friedman	adec587d5c	Misc optimizer+codegen work for 'cmpxchg' and 'atomicrmw'. They appear to be working on x86 (at least for trivial testcases); other architectures will need more work so that they actually emit the appropriate instructions for orderings stricter than 'monotonic'. (As far as I can tell, the ARM, PPC, Mips, and Alpha backends need such changes.) llvm-svn: 136457	2011-07-29 03:05:32 +00:00
Bill Wendling	7eadbeaf62	Use the pointer type size. With this, we can now compile a simple EH program. llvm-svn: 136446	2011-07-29 01:15:29 +00:00
Bill Wendling	6a8cac735a	And now something that compiles... llvm-svn: 136445	2011-07-29 01:11:33 +00:00
Bill Wendling	4b0a365beb	Make sure to sext or trunc the result from the register. llvm-svn: 136444	2011-07-29 01:11:14 +00:00
Chandler Carruth	9d7feab3e0	Rewrite the CMake build to use explicit dependencies between libraries, specified in the same file that the library itself is created. This is more idiomatic for CMake builds, and also allows us to correctly specify dependencies that are missed due to bugs in the GenLibDeps perl script, or change from compiler to compiler. On Linux, this returns CMake to a place where it can relably rebuild several targets of LLVM. I have tried not to change the dependencies from the ones in the current auto-generated file. The only places I've really diverged are in places where I was seeing link failures, and added a dependency. The goal of this patch is not to start changing the dependencies, merely to move them into the correct location, and an explicit form that we can control and change when necessary. This also removes a serialization point in the build because we don't have to scan all the libraries before we begin building various tools. We no longer have a step of the build that regenerates a file inside the source tree. A few other associated cleanups fall out of this. This isn't really finished yet though. After talking to dgregor he urged switching to a single CMake macro to construct libraries with both sources and dependencies in the arguments. Migrating from the two macros to that style will be a follow-up patch. Also, llvm-config is still generated with GenLibDeps.pl, which means it still has slightly buggy dependencies. The internal CMake 'llvm-config-like' macro uses the correct explicitly specified dependencies however. A future patch will switch llvm-config generation (when using CMake) to be based on these deps as well. This may well break Windows. I'm getting a machine set up now to dig into any failures there. If anyone can chime in with problems they see or ideas of how to solve them for Windows, much appreciated. llvm-svn: 136433	2011-07-29 00:14:25 +00:00
Bill Wendling	3cc87682e1	Visit the landingpad instruction. This generates the correct SDNodes for the landingpad instruction. It makes an assumption that the result of the landingpad instruction has at least two values. And that the first value is a pointer to the exception object and the second value is the "selector." llvm-svn: 136430	2011-07-28 23:44:58 +00:00
Bill Wendling	7fa7fe6b58	Add the AddLandingPadInfo function. AddLandingPadInfo takes a landingpad instruction and grabs all of the information from it that it needs for EH table generation. llvm-svn: 136429	2011-07-28 23:42:57 +00:00
Eli Friedman	c9a551ebed	LangRef and basic memory-representation/reading/writing for 'cmpxchg' and 'atomicrmw' instructions, which allow representing all the current atomic rmw intrinsics. The allowed operands for these instructions are heavily restricted at the moment; we can probably loosen it a bit, but supporting general first-class types (where it makes sense) might get a bit complicated, given how SelectionDAG works. As an initial cut, these operations do not support specifying an alignment, but it would be possible to add if we think it's useful. Specifying an alignment lower than the natural alignment would be essentially impossible to support on anything other than x86, but specifying a greater alignment would be possible. I can't think of any useful optimizations which would use that information, but maybe someone else has ideas. Optimizer/codegen support coming soon. llvm-svn: 136404	2011-07-28 21:48:00 +00:00
Jakob Stoklund Olesen	b16081ce8c	Handle REG_SEQUENCE with implicitly defined operands. Code like that would only be produced by bugpoint, but we should still handle it correctly. When a register is defined by a REG_SEQUENCE of undefs, the register itself is undef. Previously, we would create a register with uses but no defs. Fixes part of PR10520. llvm-svn: 136401	2011-07-28 21:38:51 +00:00
Bill Wendling	f8d95bc4c6	Use ArrayRef instead of requiring an std::vector. llvm-svn: 136396	2011-07-28 21:25:33 +00:00
Bill Wendling	4f027233d2	The personality function should be a Function* and not just a Value*. llvm-svn: 136392	2011-07-28 21:14:13 +00:00
Jakob Stoklund Olesen	cad845f4c0	Reverse order of RS_Split live ranges under -compact-regions. There are two conflicting strategies in play: - Under high register pressure, we want to assign large live ranges first. Smaller live ranges are easier to place afterwards. - Live range splitting is guided by interference, so splitting should be deferred until interference is as realistic as possible. With the recent changes to the live range stages, and with compact regions enabled, it is less traumatic to split a live range too early. If some of the split products were too big, they can often be split again. By reversing the RS_Split order, we get this queue order: 1. Normal live ranges, large to small. 2. RS_Split live ranges, large to small. The large-to-small order improves RAGreedy's puzzle solving skills under high register pressure. It may cause a bit more iterated splitting, but we handle that better now. With this change, -compact-regions is mostly an improvement on SPEC. llvm-svn: 136388	2011-07-28 20:48:23 +00:00
Bill Wendling	7b563cde19	Initial code to convert ResumeInsts into calls to _Unwind_Resume. This should be the only code necessary for DWARF EH prepare. llvm-svn: 136387	2011-07-28 20:48:05 +00:00
Nadav Rotem	9708aef2dc	CR fix: The ANY_EXTEND can be removed because the input and putput type must be identical. llvm-svn: 136355	2011-07-28 14:38:46 +00:00
Eli Friedman	26a484852e	Code generation for 'fence' instruction. llvm-svn: 136283	2011-07-27 22:21:52 +00:00
Jakub Staszak	da3df4302a	Use BlockFrequency instead of uint32_t in BlockFrequencyInfo. llvm-svn: 136278	2011-07-27 22:05:51 +00:00
Devang Patel	53dc616170	Remove outdated FIXME comment. llvm-svn: 136275	2011-07-27 22:00:01 +00:00
Bill Wendling	6c923bb8d9	Merge the contents from exception-handling-rewrite to the mainline. This adds the new instructions 'landingpad' and 'resume'. llvm-svn: 136253	2011-07-27 20:18:04 +00:00
Jeffrey Yasskin	6381c0100b	Explicitly cast narrowing conversions inside {}s that will become errors in C++0x. llvm-svn: 136211	2011-07-27 06:22:51 +00:00
Dan Gohman	456b1edd0d	Revert r136156, which broke several buildbots. llvm-svn: 136206	2011-07-27 01:10:27 +00:00
Devang Patel	f098ce2757	It is quiet possible that inlined function body is split into multiple chunks of consequtive instructions. But, there is not any way to describe this in .debug_inline accelerator table used by gdb. However, describe non contiguous ranges of inlined function body appropriately using AT_range of DW_TAG_inlined_subroutine debug info entry. llvm-svn: 136196	2011-07-27 00:34:13 +00:00
Jakob Stoklund Olesen	dab4b9a4b2	Add support for multi-way live range splitting. When splitting global live ranges, it is now possible to split for multiple destination intervals at once. Previously, we only had the main and stack intervals. Each edge bundle is assigned to a split candidate, and splitAroundRegion will insert copies between the candidate intervals and the stack interval as needed. The multi-way splitting is used to split around compact regions when enabled with -compact-regions. The best candidate register still gets all the bundles it wants, but everything outside the main interval is first split around compact regions before we create single-block intervals. Compact region splitting still causes some regressions, so it is not enabled by default. llvm-svn: 136186	2011-07-26 23:41:46 +00:00
Jakob Stoklund Olesen	b1459dbc25	Print out the MBB live-in registers. llvm-svn: 136178	2011-07-26 23:12:08 +00:00
Jakob Stoklund Olesen	c3bcb02154	Eliminate copies of undefined values during coalescing. These copies would coalesce easily, but the resulting value would be defined by a deleted instruction. Now we also remove the undefined value number from the destination register. This fixes PR10503. llvm-svn: 136174	2011-07-26 23:00:24 +00:00
Dan Gohman	9eb62cd159	Delete unnecessarily cautious LastCALLSEQ code. llvm-svn: 136156	2011-07-26 22:00:59 +00:00
Eli Friedman	06b8b571b2	Add obvious missing case to switch. PR10497. llvm-svn: 136130	2011-07-26 20:38:49 +00:00
Devang Patel	613958c82c	While extracting lexical scopes from machine instruction stream, work on one machine basic block at a time. llvm-svn: 136106	2011-07-26 18:09:53 +00:00
Duncan Sands	3ac1836540	SrcDef is only written and never read. Remove it. llvm-svn: 136080	2011-07-26 15:05:06 +00:00
Jakob Stoklund Olesen	5387bd340b	Revert to RA_Assign when a virtreg separates into components. When dead code elimination deletes a PHI value, the virtual register may split into multiple connected components. In that case, revert each component to the RS_Assign stage. The new components are guaranteed to be smaller (the original value numbers are distributed among the components), so this will always be making progress. The components are now allowed to evict other live ranges or be split again. llvm-svn: 136034	2011-07-26 00:54:56 +00:00
Evan Cheng	3a79225b4c	Rename createCodeEmitter to createMCCodeEmitter; createObjectStreamer to createMCObjectStreamer. llvm-svn: 136031	2011-07-26 00:42:34 +00:00
Evan Cheng	1142444565	Rename TargetAsmParser to MCTargetAsmParser and TargetAsmLexer to MCTargetAsmLexer; rename createAsmLexer to createMCAsmLexer and createAsmParser to createMCAsmParser. llvm-svn: 136027	2011-07-26 00:24:13 +00:00
Evan Cheng	5928e69d20	Rename TargetAsmBackend to MCAsmBackend; rename createAsmBackend to createMCAsmBackend. llvm-svn: 136010	2011-07-25 23:24:55 +00:00
Eli Friedman	fee02c6c13	Initial implementation of 'fence' instruction, the new C++0x-style replacement for llvm.memory.barrier. This is just a LangRef entry and reading/writing/memory representation; optimizer+codegen support coming soon. llvm-svn: 136009	2011-07-25 23:16:38 +00:00
Eli Friedman	cbd3ba91b7	Make sure this DAGCombine actually returns an UNDEF of the correct type; PR10476. llvm-svn: 135993	2011-07-25 22:25:42 +00:00
Jakub Staszak	875ebd5f5d	Rename BlockFrequency to BlockFrequencyInfo and MachineBlockFrequency to MachineBlockFrequencyInfo. llvm-svn: 135937	2011-07-25 19:25:40 +00:00
Jakob Stoklund Olesen	450111718c	Add an RS_Split2 stage used for loop prevention. This mechanism already exists, but the RS_Split2 stage makes it clearer. When live range splitting creates ranges that may not be making progress, they are marked RS_Split2 instead of RS_New. These ranges may be split again, but only in a way that can be proven to make progress. For local ranges, that means they must be split into ranges used by strictly fewer instructions. For global ranges, region splitting is bypassed and the RS_Split2 ranges go straight to per-block splitting. llvm-svn: 135912	2011-07-25 15:25:43 +00:00
Jakob Stoklund Olesen	3ef8cf1370	Rename live range stages to better reflect how they are used. The stage is used to control where a live range is going, not where it is coming from. Live ranges created by splitting will usually be marked RS_New, but some are marked RS_Spill to avoid wasting time trying to split them again. The old RS_Global and RS_Local stages are merged - they are really the same thing for local and global live ranges. llvm-svn: 135911	2011-07-25 15:25:41 +00:00
Jay Foad	d1b7849d49	Convert GetElementPtrInst to use ArrayRef. llvm-svn: 135904	2011-07-25 09:48:08 +00:00
Jakob Stoklund Olesen	73a9eb9f81	Never extend live ranges for <undef> uses. llvm-svn: 135886	2011-07-24 20:33:23 +00:00
Jakob Stoklund Olesen	56a56eb80e	Correctly handle <undef> tied uses when rewriting after a split. This fixes PR10463. A two-address instruction with an <undef> use operand was incorrectly rewritten so the def and use no longer used the same register, violating the tie constraint. Fix this by always rewriting <undef> operands with the register a def operand would use. llvm-svn: 135885	2011-07-24 20:23:50 +00:00
Jakob Stoklund Olesen	ecad62f909	Add RAGreedy::calcCompactRegion. This method computes the edge bundles that should be live when splitting around a compact region. This is independent of interference. The function returns false if the live range was already a compact region, or the compact region doesn't have any live bundles - it would be the same as splitting around basic blocks. Compact regions are computed using the normal spill placement code. We pretend there is interference in all live-through blocks that don't use the live range. This removes all edges from the Hopfield network used for spill placement, so it converges instantly. llvm-svn: 135847	2011-07-23 03:41:57 +00:00
Jakob Stoklund Olesen	f500ccece7	Fix bug in SplitEditor::splitLiveThroughBlock when switching registers. If there is no interference and no last split point, we cannot enterIntvBefore(Stop) - that function needs a real instruction. Use enterIntvAtEnd instead for that very easy case. This code doesn't currently run, it is needed by multi-way splitting. llvm-svn: 135846	2011-07-23 03:32:26 +00:00
Jakob Stoklund Olesen	a953bf135f	Prepare RAGreedy::growRegion for compact regions. A split candidate can have a null PhysReg which means that it doesn't map to a real interference pattern. Instead, pretend that all through blocks have interference. This makes it possible to generate compact regions where the live range doesn't go through blocks that don't use it. The live range will still be live between directly connected blocks with uses. Splitting around a compact region tends to produce a live range with a high spill weight, so it may evict a less dense live range. llvm-svn: 135845	2011-07-23 03:22:33 +00:00
Jakob Stoklund Olesen	0ab5d0ee5b	Add a simple method for marking blocks with interference in and out. This method matches addLinks - All the listed blocks are considered to have interference, so they add a negative bias to their bundles. This could also be done by addConstraints, but that requires building a separate BlockConstraint array. llvm-svn: 135844	2011-07-23 03:10:19 +00:00
Jakob Stoklund Olesen	cacefc7dca	Allow null interference cursors to be queried. They always report 'no interference'. llvm-svn: 135843	2011-07-23 03:10:17 +00:00
Evan Cheng	f2596bc62a	Move TargetAsmParser.h TargetAsmBackend.h and TargetAsmLexer.h to MC where they belong. llvm-svn: 135833	2011-07-23 00:45:41 +00:00
Jay Foad	17bab44308	Fix more MSVC warnings caused by a cases I missed when converting ConstantExpr::getGetElementPtr to use ArrayRef. llvm-svn: 135762	2011-07-22 08:52:50 +00:00
Jay Foad	040dd82f44	Convert IRBuilder::CreateGEP and IRBuilder::CreateInBoundsGEP to use ArrayRef. llvm-svn: 135761	2011-07-22 08:16:57 +00:00
Jakub Staszak	b82bbf40bb	Allow getBlockFreq to return 0. llvm-svn: 135742	2011-07-22 02:24:57 +00:00
Jakub Staszak	7987ea7460	Revert patch which broke some IfConversion tests. llvm-svn: 135738	2011-07-22 00:55:15 +00:00
Jakub Staszak	76d711582c	Fix typo in #include which revealed in the case-sensitive filesystem. llvm-svn: 135734	2011-07-22 00:39:00 +00:00
Jakub Staszak	44860314d2	Use MachineBranchProbabilityInfo instead of MachineLoopInfo in IfConversion. llvm-svn: 135724	2011-07-21 23:48:55 +00:00
Jakub Staszak	cb7c0a4927	Add missing getAnalysisUsage in MachineBlockFrequency. llvm-svn: 135714	2011-07-21 22:59:09 +00:00
Devang Patel	ddfe66e948	Refactor. llvm-svn: 135633	2011-07-20 23:00:27 +00:00
Devang Patel	8fb9fd6769	There are two ways to map a variable to its lexical scope. Lexical scope information is embedded in MDNode describing the variable. It is also available as a part of DebugLoc attached with DBG_VALUE instruction. DebugLoc attached with an instruction is less reliable in optimized code so use information embedded in the MDNode. llvm-svn: 135629	2011-07-20 22:18:50 +00:00
Devang Patel	bcd50a10d5	While emitting constant value, look through derived type and use underlying basic type to determine size and signness of the constant value. llvm-svn: 135627	2011-07-20 21:57:04 +00:00
Evan Cheng	bbf3b0de8b	Goodbye TargetAsmInfo. This eliminate last bit of CodeGen and Target in llvm-mc. There is still a bit more refactoring left to do in Targets. But we are now very close to fixing all the layering issues in MC. llvm-svn: 135611	2011-07-20 19:50:42 +00:00
Eli Friedman	6ed783228d	PR10421: Fix a straightforward bug in the widening logic for CONCAT_VECTORS. llvm-svn: 135595	2011-07-20 18:14:33 +00:00
Evan Cheng	efd9b4240f	- Move CodeModel from a TargetMachine global option to MCCodeGenInfo. - Introduce JITDefault code model. This tells targets to set different default code model for JIT. This eliminates the ugly hack in TargetMachine where code model is changed after construction. llvm-svn: 135580	2011-07-20 07:51:56 +00:00
Evan Cheng	76792992d6	Add MCObjectFileInfo and sink the MCSections initialization code from TargetLoweringObjectFileImpl down to MCObjectFileInfo. TargetAsmInfo is done to one last method. It's almost gone! llvm-svn: 135569	2011-07-20 05:58:47 +00:00
Evan Cheng	ccf243d56b	Fix an obvious typo that's preventing x86 (32-bit) from using .literal16. llvm-svn: 135535	2011-07-19 23:14:32 +00:00
Devang Patel	a59b24b090	Distinguish between two copies of one inlined variable. llvm-svn: 135528	2011-07-19 22:31:15 +00:00
Jay Foad	bf904773bb	Convert TargetData::getIndexedOffset to use ArrayRef. llvm-svn: 135478	2011-07-19 14:01:37 +00:00
Evan Cheng	2129f59637	Introduce MCCodeGenInfo, which keeps information that can affect codegen (including compilation, assembly). Move relocation model Reloc::Model from TargetMachine to MCCodeGenInfo so it's accessible even without TargetMachine. llvm-svn: 135468	2011-07-19 06:37:02 +00:00
Devang Patel	9ab3cac694	Revert r135423. llvm-svn: 135454	2011-07-19 00:28:24 +00:00
Bill Wendling	b20453faae	Add a frame with the compact unwind encoding if it exists. llvm-svn: 135450	2011-07-19 00:02:51 +00:00
Bill Wendling	6969ed6286	Rename CompactEncoding to CompactUnwindEncoding. llvm-svn: 135448	2011-07-19 00:00:58 +00:00
Bill Wendling	353404d924	Move the compact encoding from the target-specific library to the code-gen library. llvm-svn: 135443	2011-07-18 23:38:40 +00:00
Evan Cheng	67c033e6b8	Move getInitialFrameState from TargetFrameInfo to MCAsmInfo (suggestions for better location welcome). llvm-svn: 135438	2011-07-18 22:29:13 +00:00
Jeffrey Yasskin	7a16288157	Add APInt(numBits, ArrayRef<uint64_t> bigVal) constructor to prevent future ambiguity errors like the one corrected by r135261. Migrate all LLVM callers of the old constructor to the new one. llvm-svn: 135431	2011-07-18 21:45:40 +00:00
Evan Cheng	d60fa58ba1	Sink getDwarfRegNum, getLLVMRegNum, getSEHRegNum from TargetRegisterInfo down to MCRegisterInfo. Also initialize the mapping at construction time. This patch eliminate TargetRegisterInfo from TargetAsmInfo. It's another step towards fixing the layering violation. llvm-svn: 135424	2011-07-18 20:57:22 +00:00
Devang Patel	4dc76f2438	During bottom up fast-isel, instructions emitted to materalize registers are at top of basic block and do not have debug location. This may misguide debugger while entering the basic block and sometimes debugger provides semi useful view of current location to developer by picking up previous known location as current location. Assign a sensible location to the first instruction in a basic block, if it does not have one location derived from source file, so that debugger can provide meaningful user experience to developers in edge cases. [take 2] llvm-svn: 135423	2011-07-18 20:55:23 +00:00
Jakob Stoklund Olesen	c45d38e14a	Fix a crash when building 177.mesa for armv6. When splitting a live range immediately before an LDR_POST instruction that redefines the address register, make sure to use the correct value number in leaveIntvBefore. We need the value number entering the instruction. <rdar://problem/9793765> llvm-svn: 135413	2011-07-18 18:47:13 +00:00
Frits van Bommel	717d7edd3e	Migrate LLVM and Clang to use the new makeArrayRef(...) functions where previously explicit non-default constructors were used. Mostly mechanical with some manual reformatting. llvm-svn: 135390	2011-07-18 12:00:32 +00:00
Jakob Stoklund Olesen	c0dd3da9c5	Fix PR10387. When trying to rematerialize a value before an instruction that has an early-clobber redefine of the virtual register, make sure to look up the correct value number. Early-clobber defs are moved one slot back, so getBaseIndex is needed to find the used value number. Bugpoint was unable to reduce the test case for this, see PR10388. llvm-svn: 135378	2011-07-18 05:31:59 +00:00
Chris Lattner	229907cd11	land David Blaikie's patch to de-constify Type, with a few tweaks. llvm-svn: 135375	2011-07-18 04:54:35 +00:00
Nadav Rotem	76d51c6c89	Minor code cleanups llvm-svn: 135362	2011-07-17 19:05:00 +00:00
Jakub Staszak	6063549470	Remove unused LoopRanges from RegAllocGreedy. llvm-svn: 135354	2011-07-16 20:43:00 +00:00
Jakub Staszak	2713117135	Add MachineBlockFrequency analysis. llvm-svn: 135352	2011-07-16 20:23:20 +00:00
Matt Beaumont-Gay	26909d8c61	Silence unused variable warning llvm-svn: 135339	2011-07-16 04:18:47 +00:00
Jakob Stoklund Olesen	37e3a13931	He said before the last split point. This should unbreak the build-self-4-mingw32 tester. I have a very complicated test case that I will try to clean up. llvm-svn: 135329	2011-07-16 00:13:30 +00:00
Dan Gohman	945864d6dc	LegalizeDAG doesn't need its own copy of this enum. llvm-svn: 135320	2011-07-15 22:51:43 +00:00
Dan Gohman	e49e74261a	Delete LegalizeDAG's own version of isTypeLegal and getTypeAction and just use the ones from TargetLowering directly. llvm-svn: 135318	2011-07-15 22:39:09 +00:00
Dan Gohman	8c5ca645ce	Delete an unused variable and a redundant assert. llvm-svn: 135311	2011-07-15 22:19:02 +00:00
Jakob Stoklund Olesen	795da1c108	Extract parts of RAGreedy::splitAroundRegion as SplitKit methods. This gets rid of some of the gory splitting details in RAGreedy and makes them available to future SplitKit clients. Slightly generalize the functionality to support multi-way splitting. Specifically, SplitEditor::splitLiveThroughBlock() supports switching between different register intervals in a block. llvm-svn: 135307	2011-07-15 21:47:57 +00:00
Dan Gohman	ad94608b1f	Modernize comments. llvm-svn: 135305	2011-07-15 21:42:20 +00:00
Devang Patel	b7cc06366d	Use DebugLoc directly to map inlined functions' instructions to respective lexical scope. llvm-svn: 135302	2011-07-15 21:25:44 +00:00
Devang Patel	f5f352dda5	Eliminate redundant map. llvm-svn: 135278	2011-07-15 16:38:42 +00:00
Jay Foad	5bd375a6cc	Convert CallInst and InvokeInst APIs to use ArrayRef. llvm-svn: 135265	2011-07-15 08:37:34 +00:00
Evan Cheng	b46f3e24ba	Reverting r135232. It's causing infinite looping in DbgScope::openInsnRange. llvm-svn: 135254	2011-07-15 06:26:35 +00:00
Devang Patel	001c4f3ff0	Do not get confused by multiple empty lexical scopes inlined at one location. llvm-svn: 135232	2011-07-15 00:30:39 +00:00
Evan Cheng	1705ab00ab	Rename createAsmInfo to createMCAsmInfo and move registration code to MCTargetDesc to prepare for next round of changes. llvm-svn: 135219	2011-07-14 23:50:31 +00:00
Devang Patel	4771159f9f	Refactor. llvm-svn: 135212	2011-07-14 23:17:49 +00:00
Devang Patel	1f9913fdb2	Eliminate redundant LLVMContext argument. Improve DbgScope->dump() output. llvm-svn: 135207	2011-07-14 22:30:56 +00:00
Eric Christopher	92464be28c	Check register class matching instead of width of type matching when determining validity of matching constraint. Allow i1 types access to the GR8 reg class for x86. Fixes PR10352 and rdar://9777108 llvm-svn: 135180	2011-07-14 20:13:52 +00:00
Benjamin Kramer	e6e1933f31	Change Intrinsic::getDeclaration and friends to take an ArrayRef. llvm-svn: 135154	2011-07-14 17:45:39 +00:00
Nadav Rotem	771f29677f	[VECTOR-SELECT] During type legalization we often use the SIGN_EXTEND_INREG SDNode. When this SDNode is legalized during the LegalizeVector phase, it is scalarized because non-simple types are automatically marked to be expanded. In this patch we add support for lowering SIGN_EXTEND_INREG manually. This fixes CodeGen/X86/vec_sext.ll when running with the '-promote-elements' flag. llvm-svn: 135144	2011-07-14 11:11:14 +00:00
Nadav Rotem	db213c0400	Add assertion for the chain value type llvm-svn: 135143	2011-07-14 10:37:54 +00:00
Jakob Stoklund Olesen	a153ca5885	Reapply r135121 with a fixed copy constructor. Original commit message: Count references to interference cache entries. Each InterferenceCache::Cursor instance references a cache entry. A non-zero reference count guarantees that the entry won't be reused for a new register. This makes it possible to have multiple live cursors examining interference for different physregs. The total number of live cursors into a cache must be kept below InterferenceCache::getMaxCursors(). Code generation should be unaffected by this change, and it doesn't seem to affect the cache replacement strategy either. llvm-svn: 135130	2011-07-14 05:35:11 +00:00
Devang Patel	d5234bbced	Simplify. llvm-svn: 135127	2011-07-14 01:52:45 +00:00
Benjamin Kramer	15cd5a3f12	Don't emit a bit test if there is only one case the test can yield false. A simple SETNE is sufficient. llvm-svn: 135126	2011-07-14 01:38:42 +00:00
Devang Patel	07d61edc30	Simplify and delay extracting DebugLoc elements, scope and InlinedAt, as much as possible. llvm-svn: 135124	2011-07-14 01:14:57 +00:00
Eric Christopher	d6300d2956	Add a dag combine pattern for folding C2-(A+C1) -> (C2-C1)-A Fixes rdar://9761830 llvm-svn: 135123	2011-07-14 01:12:15 +00:00
Jakob Stoklund Olesen	1d4badae74	Revert r135121 which broke a gcc-4.2 builder. llvm-svn: 135122	2011-07-14 00:58:38 +00:00
Jakob Stoklund Olesen	c270cb6e94	Count references to interference cache entries. Each InterferenceCache::Cursor instance references a cache entry. A non-zero reference count guarantees that the entry won't be reused for a new register. This makes it possible to have multiple live cursors examining interference for different physregs. The total number of live cursors into a cache must be kept below InterferenceCache::getMaxCursors(). Code generation should be unaffected by this change, and it doesn't seem to affect the cache replacement strategy either. llvm-svn: 135121	2011-07-14 00:31:14 +00:00
Devang Patel	e07ebe32bf	Simplify. Compile unit check inside hasValidLocation() did not add any value. llvm-svn: 135118	2011-07-14 00:20:24 +00:00
Jakob Stoklund Olesen	d7e9937175	Reapply r135074 and r135080 with a fix. The cache entry referenced by the best split candidate could become clobbered by an unsuccessful candidate. The correct fix here is to use reference counts on the cache entries. Coming up. llvm-svn: 135113	2011-07-14 00:17:10 +00:00
Devang Patel	a9195bcff0	Fix typo in DEBUG message. llvm-svn: 135111	2011-07-14 00:04:53 +00:00
Devang Patel	2cce0d103d	Add DEBUG messages. llvm-svn: 135110	2011-07-14 00:03:58 +00:00
Jakob Stoklund Olesen	fae30b240b	Revert r135074 and r135080. They broke clamscan. llvm-svn: 135096	2011-07-13 22:20:09 +00:00
Jakob Stoklund Olesen	5fba5b8eb9	Only keep the global split candidates that work out. Some pysical registers create split solutions that would spill anywhere. They should not even be considered in future multi-way global splits. This does not affect code generation (yet). llvm-svn: 135080	2011-07-13 20:49:46 +00:00
Jakob Stoklund Olesen	7bb72e2824	Move the InterferenceCache cursor into the GlobalSplitCand struct. This is in preparation of supporting multiple global split candidates in a single live range split operation. llvm-svn: 135074	2011-07-13 20:14:52 +00:00
Evan Cheng	2d7faa5e3e	Fix up TargetLoweringObjectFile ctors to properly initialize fields. llvm-svn: 135068	2011-07-13 19:54:59 +00:00
Jay Foad	57aa636794	Convert InsertValueInst and ExtractValueInst APIs to use ArrayRef. llvm-svn: 135040	2011-07-13 10:26:04 +00:00
Jay Foad	b804a2b751	Second attempt at de-constifying LLVM Types in FunctionType::get(), StructType::get() and TargetData::getIntPtrType(). llvm-svn: 134982	2011-07-12 14:06:48 +00:00
Bill Wendling	a78cd228c2	Revert r134893 and r134888 (and related patches in other trees). It was causing an assert on Darwin llvm-gcc builds. Assertion failed: (castIsValid(op, S, Ty) && "Invalid cast!"), function Create, file /Users/buildslave/zorg/buildbot/smooshlab/slave-0.8/build.llvm-gcc-i386-darwin9-RA/llvm.src/lib/VMCore/Instructions.cpp, li\ ne 2067. etc. http://smooshlab.apple.com:8013/builders/llvm-gcc-i386-darwin9-RA/builds/2354 --- Reverse-merging r134893 into '.': U include/llvm/Target/TargetData.h U include/llvm/DerivedTypes.h U tools/bugpoint/ExtractFunction.cpp U unittests/Support/TypeBuilderTest.cpp U lib/Target/ARM/ARMGlobalMerge.cpp U lib/Target/TargetData.cpp U lib/VMCore/Constants.cpp U lib/VMCore/Type.cpp U lib/VMCore/Core.cpp U lib/Transforms/Utils/CodeExtractor.cpp U lib/Transforms/Instrumentation/ProfilingUtils.cpp U lib/Transforms/IPO/DeadArgumentElimination.cpp U lib/CodeGen/SjLjEHPrepare.cpp --- Reverse-merging r134888 into '.': G include/llvm/DerivedTypes.h U include/llvm/Support/TypeBuilder.h U include/llvm/Intrinsics.h U unittests/Analysis/ScalarEvolutionTest.cpp U unittests/ExecutionEngine/JIT/JITTest.cpp U unittests/ExecutionEngine/JIT/JITMemoryManagerTest.cpp U unittests/VMCore/PassManagerTest.cpp G unittests/Support/TypeBuilderTest.cpp U lib/Target/MBlaze/MBlazeIntrinsicInfo.cpp U lib/Target/Blackfin/BlackfinIntrinsicInfo.cpp U lib/VMCore/IRBuilder.cpp G lib/VMCore/Type.cpp U lib/VMCore/Function.cpp G lib/VMCore/Core.cpp U lib/VMCore/Module.cpp U lib/AsmParser/LLParser.cpp U lib/Transforms/Utils/CloneFunction.cpp G lib/Transforms/Utils/CodeExtractor.cpp U lib/Transforms/Utils/InlineFunction.cpp U lib/Transforms/Instrumentation/GCOVProfiling.cpp U lib/Transforms/Scalar/ObjCARC.cpp U lib/Transforms/Scalar/SimplifyLibCalls.cpp U lib/Transforms/Scalar/MemCpyOptimizer.cpp G lib/Transforms/IPO/DeadArgumentElimination.cpp U lib/Transforms/IPO/ArgumentPromotion.cpp U lib/Transforms/InstCombine/InstCombineCompares.cpp U lib/Transforms/InstCombine/InstCombineAndOrXor.cpp U lib/Transforms/InstCombine/InstCombineCalls.cpp U lib/CodeGen/DwarfEHPrepare.cpp U lib/CodeGen/IntrinsicLowering.cpp U lib/Bitcode/Reader/BitcodeReader.cpp llvm-svn: 134949	2011-07-12 01:15:52 +00:00
Jay Foad	7c57be3e2b	De-constify Types in StructType::get() and TargetData::getIntPtrType(). llvm-svn: 134893	2011-07-11 09:56:20 +00:00
Jay Foad	56cc1530ee	De-constify Types in FunctionType::get(). llvm-svn: 134888	2011-07-11 07:56:41 +00:00
Evan Cheng	c5e6d2f519	- Eliminate MCCodeEmitter's dependency on TargetMachine. It now uses MCInstrInfo and MCSubtargetInfo. - Added methods to update subtarget features (used when targets automatically detect subtarget features or switch modes). - Teach X86Subtarget to update MCSubtargetInfo features bits since the MCSubtargetInfo layer can be shared with other modules. - These fixes .code 16 / .code 32 support since mode switch is updated in MCSubtargetInfo so MC code emitter can do the right thing. llvm-svn: 134884	2011-07-11 03:57:24 +00:00
Jakub Staszak	9b07c0ab6b	Use BranchProbability instead of floating points in IfConverter. llvm-svn: 134858	2011-07-10 02:58:07 +00:00
Jakub Staszak	a4a18f092c	Don't analyze block if it's not considered for ifcvt anymore. llvm-svn: 134856	2011-07-10 02:00:16 +00:00
Chris Lattner	b1ed91f397	Land the long talked about "type system rewrite" patch. This patch brings numerous advantages to LLVM. One way to look at it is through diffstat: 109 files changed, 3005 insertions(+), 5906 deletions(-) Removing almost 3K lines of code is a good thing. Other advantages include: 1. Value::getType() is a simple load that can be CSE'd, not a mutating union-find operation. 2. Types a uniqued and never move once created, defining away PATypeHolder. 3. Structs can be "named" now, and their name is part of the identity that uniques them. This means that the compiler doesn't merge them structurally which makes the IR much less confusing. 4. Now that there is no way to get a cycle in a type graph without a named struct type, "upreferences" go away. 5. Type refinement is completely gone, which should make LTO much MUCH faster in some common cases with C++ code. 6. Types are now generally immutable, so we can use "Type " instead "const Type " everywhere. Downsides of this patch are that it removes some functions from the C API, so people using those will have to upgrade to (not yet added) new API. "LLVM 3.0" is the right time to do this. There are still some cleanups pending after this, this patch is large enough as-is. llvm-svn: 134829	2011-07-09 17:41:24 +00:00
Evan Cheng	91111d2706	Change createAsmParser to take a MCSubtargetInfo instead of triple, CPU, and feature string. Parsing some asm directives can change subtarget state (e.g. .code 16) and it must be reflected in other modules (e.g. MCCodeEmitter). That is, the MCSubtargetInfo instance must be shared. llvm-svn: 134795	2011-07-09 05:47:46 +00:00
Jakob Stoklund Olesen	780db902f7	Oops, didn't mean to commit that. Spills should be hoisted out of loops, but we don't want to hoist them to dominating blocks at the same loop depth. That could cause the spills to be executed more often. llvm-svn: 134782	2011-07-09 01:02:44 +00:00
Jakob Stoklund Olesen	bf6afec312	Hoist spills within a basic block. Try to move spills as early as possible in their basic block. This can help eliminate interferences by shortening the live range being spilled. This fixes PR10221. llvm-svn: 134776	2011-07-09 00:25:03 +00:00
Cameron Zwarich	f03fa189ca	Add an intrinsic and codegen support for fused multiply-accumulate. The intent is to use this for architectures that have a native FMA instruction. llvm-svn: 134742	2011-07-08 21:39:21 +00:00
Jakob Stoklund Olesen	4931bbc671	Be more aggressive about following hints. RAGreedy::tryAssign will now evict interference from the preferred register even when another register is free. To support this, add the EvictionCost struct that counts how many hints are broken by an eviction. We don't want to break one hint just to satisfy another. Rename canEvict to shouldEvict, and add the first bit of eviction policy that doesn't depend on spill weights: Always make room in the preferred register as long as the evictees can be split and aren't already assigned to their preferred register. Also make the CSR avoidance more accurate. When looking for a cheaper register it is OK to use a new volatile register. Only CSR aliases that have never been used before should be avoided. llvm-svn: 134735	2011-07-08 20:46:18 +00:00
Devang Patel	2442a89eb9	Refactor. llvm-svn: 134703	2011-07-08 17:09:57 +00:00
Devang Patel	ed9fd45740	Make provision to have floating point constants in .debug_loc expressions. llvm-svn: 134702	2011-07-08 16:49:43 +00:00
Benjamin Kramer	2bb8b26aa8	Apparently we can't expect a BinaryOperator here. Should fix llvm-gcc selfhost. llvm-svn: 134699	2011-07-08 12:08:24 +00:00
Benjamin Kramer	9960a25006	Emit a more efficient magic number multiplication for exact sdivs. We have to do this in DAGBuilder instead of DAGCombiner, because the exact bit is lost after building. struct foo { char x[24]; }; long bar(struct foo a, struct foo b) { return a-b; } is now compiled into movl 4(%esp), %eax subl 8(%esp), %eax sarl $3, %eax imull $-1431655765, %eax, %eax instead of movl 4(%esp), %eax subl 8(%esp), %eax movl $715827883, %ecx imull %ecx movl %edx, %eax shrl $31, %eax sarl $2, %edx addl %eax, %edx movl %edx, %eax llvm-svn: 134695	2011-07-08 10:31:30 +00:00
Evan Cheng	4d1ca96bfc	Eliminate asm parser's dependency on TargetMachine: - Each target asm parser now creates its own MCSubtatgetInfo (if needed). - Changed AssemblerPredicate to take subtarget features which tablegen uses to generate asm matcher subtarget feature queries. e.g. "ModeThumb,FeatureThumb2" is translated to "(Bits & ModeThumb) != 0 && (Bits & FeatureThumb2) != 0". llvm-svn: 134678	2011-07-08 01:53:10 +00:00
Eric Christopher	6a6d8fc7fd	Remove a FIXME. All of the standard ones are in the list. llvm-svn: 134647	2011-07-07 22:29:03 +00:00
Devang Patel	53b050aec6	Add DEBUG message. llvm-svn: 134643	2011-07-07 21:44:42 +00:00
Devang Patel	bf8cc60d1b	If known DebugLocs do not match then two DBG_VALUE machine instructions are not identical. For example, DBG_VALUE 3.310000e+02, 0, !"ds"; dbg:sse.stepfft.c:138:18 @[ sse.stepfft.c:32:10 ] DBG_VALUE 3.310000e+02, 0, !"ds"; dbg:sse.stepfft.c:138:18 @[ sse.stepfft.c:31:10 ] These two MIs represent identical value, 3.31..., for one variable, ds, but they are not identical because the represent two separate instances of inlined variable "ds". llvm-svn: 134620	2011-07-07 17:45:33 +00:00
Lang Hames	5a00499e87	Add functions 'hasPredecessor' and 'hasPredecessorHelper' to SDNode. The hasPredecessorHelper function allows predecessors to be cached to speed up repeated invocations. This fixes PR10186. X.isPredecessorOf(Y) now just calls Y.hasPredecessor(X) Y.hasPredecessor(X) calls Y.hasPredecessorHelper(X, Visited, Worklist) with empty Visited and Worklist sets (i.e. no caching over invocations). Y.hasPredecessorHelper(X, Visited, Worklist) caches search state in Visited and Worklist to speed up repeated calls. The Visited set is searched for X before going to the worklist to further search the DAG if necessary. llvm-svn: 134592	2011-07-07 04:31:51 +00:00
Devang Patel	b7a328ed27	Add DEBUG messages. llvm-svn: 134572	2011-07-07 00:14:27 +00:00
Eli Friedman	bf007364bf	When tail-merging multiple blocks, make sure to correctly update the live-in list on the merged block to correctly account for the live-outs of all the predecessors. They might not be the same in all cases (the testcase I have involves a PHI node where one of the operands is an IMPLICIT_DEF). Unfortunately, the testcase I have is large and confidential, so I don't have a test to commit at the moment; I'll see if I can come up with something smaller where this issue reproduces. <rdar://problem/9716278> llvm-svn: 134565	2011-07-06 23:41:48 +00:00
Devang Patel	92ca8fc927	Remove dead code. llvm-svn: 134561	2011-07-06 23:26:18 +00:00
Devang Patel	338e43268c	Typo. llvm-svn: 134559	2011-07-06 23:09:51 +00:00
Eric Christopher	ea336c797c	Grammar and 80-col. llvm-svn: 134555	2011-07-06 22:41:18 +00:00
Evan Cheng	ab37af9af3	createMCInstPrinter doesn't need TargetMachine anymore. llvm-svn: 134525	2011-07-06 19:45:42 +00:00
Jakub Staszak	3f158fdf6e	Introduce "expect" intrinsic instructions. llvm-svn: 134516	2011-07-06 18:22:43 +00:00
Dan Gohman	024bb8fa07	Remove the ObjC ARC passes from the default optimization list, and add extension points to be used by clang. llvm-svn: 134444	2011-07-05 22:01:44 +00:00
Jakob Stoklund Olesen	91f3a30921	Break infinite loop when the Hopfield network oscillates. This is impossible in theory, I can prove it. In practice, our near-zero threshold can cause the network to oscillate between equally good solutions. <rdar://problem/9720596> llvm-svn: 134428	2011-07-05 18:46:42 +00:00
Jakob Stoklund Olesen	bbad3bceb7	Fix PR10277. Remat during spilling triggers dead code elimination. If a phi-def becomes unused, that may also cause live ranges to split into separate connected components. This type of splitting is different from normal live range splitting. In particular, there may not be a common original interval. When the split range is its own original, make sure that the new siblings are also their own originals. The range being split cannot be used as an original since it doesn't cover the new siblings. llvm-svn: 134413	2011-07-05 15:38:41 +00:00
Jakob Stoklund Olesen	b2090ecbf2	Tweak comment and debug output. llvm-svn: 134412	2011-07-05 15:38:37 +00:00
Rafael Espindola	c74d9378e1	Move early tail duplication earlier. This fixes the issue noted in PR10251 where early tail dup of bbs with indirectbr would cause a bb to be duplicated into a loop preheader and then into its predecessors, creating phi nodes with identical operands just before register allocation. This helps with jsinterp.o size (__TEXT goes from 163568 to 126656) and a bit with performance 1.005x faster on sunspider (jits still enabled). The result on webkit with the jit disabled is more significant: 1.021x faster. llvm-svn: 134372	2011-07-04 04:54:22 +00:00
Rafael Espindola	f9f012ea88	Move most of the pre BB code to TailDuplicateAndUpdate. Change the HasIndirectbr variable to be just that. No functionality change. llvm-svn: 134371	2011-07-04 01:21:42 +00:00
Rafael Espindola	79dc4e7709	Reduce indentation and fix the count of how many PHIs we have inserted. llvm-svn: 134370	2011-07-04 00:13:36 +00:00
Jakob Stoklund Olesen	71a3a003dd	Fix PR10244. A split point inserted in a block with a landing pad successor may be hoisted above the call to ensure that it dominates all successors. The code that handles the rest of the basic block must take this into account. I am not including a test case, it would be very fragile. PR10244 comes from building clang with exceptions enabled. llvm-svn: 134369	2011-07-04 00:05:28 +00:00
Rafael Espindola	de8fa9e1f1	Fix an easy fixme. llvm-svn: 134364	2011-07-03 05:26:42 +00:00
Rafael Espindola	ed33752769	Use getVNInfoAt. llvm-svn: 134312	2011-07-02 07:50:27 +00:00
Jakob Stoklund Olesen	54f7c59c1a	Better diagnostics when inline asm fails to allocate. asm.c:2:7: error: ran out of registers during register allocation asm(""::"r"(0), "r"(1), "r"(2), "r"(3), "r"(4), "r"(5), "r"(6), "r"(7), "r"(8), "r"(9)); ^ llvm-svn: 134310	2011-07-02 07:17:37 +00:00
Rafael Espindola	36e11ff819	Check the VN of the src register at the two copies, not just the register number. llvm-svn: 134309	2011-07-02 05:34:02 +00:00
Jakob Stoklund Olesen	25a404eb81	Include a source location when complaining about bad inline assembly. Add a MI->emitError() method that the backend can use to report errors related to inline assembly. Call it from X86FloatingPoint.cpp when the constraints are wrong. This enables proper clang diagnostics from the backend: $ clang -c pr30848.c pr30848.c:5:12: error: Inline asm output regs must be last on the x87 stack __asm__ ("" : "=u" (d)); /* { dg-error "output regs" } */ ^ 1 error generated. llvm-svn: 134307	2011-07-02 03:53:34 +00:00
Jakob Stoklund Olesen	30a8563a61	Use a new strategy for preventing eviction loops in RAGreedy. Every live range is assigned a cascade number the first time it is involved in an eviction. As the evictor, it gets a new cascade number. Every evictee is assigned the same cascade number as the evictor. Eviction is prohibited if the evictor has a lower assigned cascade number than the evictee. This means that assigned cascade numbers are monotonically increasing with every eviction, yet they are bounded by NextCascade which can only be incremented by new live ranges. Thus, infinite loops cannot happen, but eviction cascades can still be triggered by new live ranges as we want. Thanks to Andy for explaining this to me. llvm-svn: 134303	2011-07-02 01:37:09 +00:00
Cameron Zwarich	7da0f9a58e	Take a stab at fixing the llvm-x86_64-linux-checks failure. llvm-svn: 134287	2011-07-01 23:45:21 +00:00
Evan Cheng	0d639a28aa	Rename TargetSubtarget to TargetSubtargetInfo for consistency. llvm-svn: 134259	2011-07-01 21:01:15 +00:00
Duncan Sands	bc9e523421	Disable commit 134216 ("Add 134199 back, but disable the optimization when the second copy is a kill") to see if it fixes the i386 dragonegg buildbot, which is timing out because gcc built with dragonegg is going into an infinite loop. llvm-svn: 134237	2011-07-01 12:01:00 +00:00
Rafael Espindola	760e51079a	Avoid DenseMap lookup. llvm-svn: 134231	2011-07-01 04:15:02 +00:00
Rafael Espindola	475cd405b0	Fix off by one error. I misunderstood the comment about killedAt. llvm-svn: 134229	2011-07-01 03:31:29 +00:00
Rafael Espindola	59066f0da0	Check the liveinterval, not the kill flag. llvm-svn: 134228	2011-07-01 02:35:06 +00:00
Jakob Stoklund Olesen	39af582c57	Don't inflate register classes used by inline asm. The constraints are represented by the register class of the original virtual register created for the inline asm. If the register class were included in the operand descriptor, we might be able to do this. For now, just give up on regclass inflation when inline asm is involved. No test case, this bug hasn't happened yet. llvm-svn: 134226	2011-07-01 01:24:25 +00:00
Rafael Espindola	4b522de5c0	Add 134199 back, but disable the optimization when the second copy is a kill. llvm-svn: 134216	2011-07-01 00:16:54 +00:00
Rafael Espindola	abe5f97634	Revert my previous patch while I debug llvm-gcc bootstrap. llvm-svn: 134201	2011-06-30 22:58:17 +00:00
Rafael Espindola	027cb82657	Don't give up on coalescing A and B when we find A = X B = X Instead, proceed as if we had found A = X B = A llvm-svn: 134199	2011-06-30 22:24:13 +00:00
Rafael Espindola	070f96c567	Create a isFullCopy predicate. llvm-svn: 134189	2011-06-30 21:15:52 +00:00
Rafael Espindola	79fd2e7a95	Remove dead code. llvm-svn: 134148	2011-06-30 13:17:24 +00:00
Jakob Stoklund Olesen	adc6a4ca5d	Reapply r134047 now that the world is ready for it. This patch will sometimes choose live range split points next to interference instead of always splitting next to a register point. That means spill code can now appear almost anywhere, and it was necessary to fix code that didn't expect that. The difficult places were: - Between a CALL returning a value on the x87 stack and the corresponding FpPOP_RETVAL (was FpGET_ST0). Probably also near x87 inline assembly, but that didn't actually show up in testing. - Between a CALL popping arguments off the stack and the corresponding ADJCALLSTACKUP. Both are fixed now. The only place spill code can't appear is after terminators, see SplitAnalysis::getLastSplitPoint. Original commit message: Rewrite RAGreedy::splitAroundRegion, now with cool ASCII art. This function has to deal with a lot of special cases, and the old version got it wrong sometimes. In particular, it would sometimes leave multiple uses in the stack interval in a single block. That causes bad code with multiple reloads in the same basic block. The new version handles block entry and exit in a single pass. It first eliminates all the easy cases, and then goes on to create a local interval for the blocks with difficult interference. Previously, we would only create the local interval for completely isolated blocks. It can happen that the stack interval becomes completely empty because we could allocate a register in all edge bundles, and the new local intervals deal with the interference. The empty stack interval is harmless, but we need to remove a SplitKit assertion that checks for empty intervals. llvm-svn: 134125	2011-06-30 01:30:39 +00:00
Eric Christopher	f81292ba3b	Remove getRegClassForInlineAsmConstraint and all dependencies. Fixes rdar://9643582 llvm-svn: 134123	2011-06-30 01:20:03 +00:00
Devang Patel	0eada03216	Revert r133953 for now. llvm-svn: 134116	2011-06-29 23:50:13 +00:00
Rafael Espindola	ff218bd3fd	make compose and isMoveInstr static functions. llvm-svn: 134093	2011-06-29 20:55:48 +00:00
Benjamin Kramer	8665f8d916	Revert a part of r126557 which could create unschedulable DAGs. llvm-svn: 134067	2011-06-29 13:47:25 +00:00
Jakob Stoklund Olesen	8628435c06	Revert r134047 while investigating a llvm-gcc-i386-linux-selfhost miscompile. llvm-svn: 134053	2011-06-29 02:03:36 +00:00
Evan Cheng	8264e272a9	Sink SubtargetFeature and TargetInstrItineraries (renamed MCInstrItineraries) into MC. llvm-svn: 134049	2011-06-29 01:14:12 +00:00
Jakob Stoklund Olesen	ffbc05b715	Rewrite RAGreedy::splitAroundRegion, now with cool ASCII art. This function has to deal with a lot of special cases, and the old version got it wrong sometimes. In particular, it would sometimes leave multiple uses in the stack interval in a single block. That causes bad code with multiple reloads in the same basic block. The new version handles block entry and exit in a single pass. It first eliminates all the easy cases, and then goes on to create a local interval for the blocks with difficult interference. Previously, we would only create the local interval for completely isolated blocks. It can happen that the stack interval becomes completely empty because we could allocate a register in all edge bundles, and the new local intervals deal with the interference. The empty stack interval is harmless, but we need to remove a SplitKit assertion that checks for empty intervals. llvm-svn: 134047	2011-06-29 00:24:24 +00:00
Evan Cheng	194c3dc01f	Move CallFrameSetupOpcode and CallFrameDestroyOpcode to TargetInstrInfo. llvm-svn: 134030	2011-06-28 21:14:33 +00:00
Evan Cheng	6cc775f905	- Rename TargetInstrDesc, TargetOperandInfo to MCInstrDesc and MCOperandInfo and sink them into MC layer. - Added MCInstrInfo, which captures the tablegen generated static data. Chang TargetInstrInfo so it's based off MCInstrInfo. llvm-svn: 134021	2011-06-28 19:10:37 +00:00
Jakob Stoklund Olesen	a1dceb0e3c	Print registers by name instead of by number. llvm-svn: 134013	2011-06-28 17:24:32 +00:00
Chandler Carruth	137c7ead2e	Fix CMake build by removing this now dead file. llvm-svn: 133981	2011-06-28 02:03:12 +00:00
Jakob Stoklund Olesen	040d659206	Fix a bad iterator dereference that Evan uncovered. llvm-svn: 133978	2011-06-28 01:18:58 +00:00
Evan Cheng	21afabe73d	Remove RegClass2VRegMap from MachineRegisterInfo. llvm-svn: 133967	2011-06-27 23:54:40 +00:00
Evan Cheng	b7d00313dc	Remove the experimental (and unused) pre-ra splitting pass. Greedy regalloc can split live ranges. llvm-svn: 133962	2011-06-27 23:40:45 +00:00
Devang Patel	4dc034df1d	During bottom up fast-isel, instructions emitted to materalize registers are at top of basic block and do not have debug location. This may misguide debugger while entering the basic block and sometimes debugger provides semi useful view of current location to developer by picking up previous known location as current location. Assign a sensible location to the first instruction in a basic block, if it does not have one location derived from source file, so that debugger can provide meaningful user experience to developers in edge cases. llvm-svn: 133953	2011-06-27 22:32:04 +00:00
Evan Cheng	8d71a75777	More refactoring. Move getRegClass from TargetOperandInfo to TargetInstrInfo. llvm-svn: 133944	2011-06-27 21:26:13 +00:00
Owen Anderson	b0a5a1ee29	The index stored in the RegDefIter is one after the current index. When getting the index, decrement it so that it points to the current element. Fixes an off-by-one bug encountered when trying to make use of MVT::untyped. llvm-svn: 133923	2011-06-27 18:34:12 +00:00
Andrew Trick	31f25bc66f	pre-RA-sched: Cleanup register pressure tracking. Removed the check that peeks past EXTRA_SUBREG, which I don't think makes sense any more. Intead treat it as a normal register def. No significant affect on x86 or ARM benchmarks. llvm-svn: 133917	2011-06-27 18:01:20 +00:00
Jakob Stoklund Olesen	79f1b714a2	Track live-out physical registers in MachineDCE. Patch by Sanjoy Das! llvm-svn: 133910	2011-06-27 15:00:36 +00:00
Jakob Stoklund Olesen	537a302d1a	Distinguish early clobber output operands from clobbered registers. Both become <earlyclobber> defs on the INLINEASM MachineInstr, but we now use two different asm operand kinds. The new Kind_Clobber is treated identically to the old Kind_RegDefEarlyClobber for now, but x87 floating point stack inline assembly does care about the difference. This will pop a register off the stack: asm("fstp %st" : : "t"(x) : "st"); While this will pop the input and push an output: asm("fst %st" : "=&t"(r) : "t"(x)); We need to know if ST0 was a clobber or an output operand, and we can't depend on <dead> flags for that. llvm-svn: 133902	2011-06-27 04:08:33 +00:00
Jakob Stoklund Olesen	6b356b18b4	Decode and pretty print inline asm operand descriptors. The INLINEASM MachineInstrs have an immediate operand describing each original inline asm operand. Decode the bits in MachineInstr::print() so it is easier to read: INLINEASM <es:rorq $1,$0>, $0:[regdef], %vreg0<def>, %vreg1<def>, $1:[imm], 1, $2:[reguse] [tiedto:$0], %vreg2, %vreg3, $3:[regdef-ec], %EFLAGS<earlyclobber,imp-def> llvm-svn: 133901	2011-06-27 04:08:29 +00:00
Rafael Espindola	2cf9489cf6	Remove unused methods. llvm-svn: 133900	2011-06-26 22:44:34 +00:00
Rafael Espindola	676c405acb	There is only one register coalescer. Merge it into the base class and remove the analysis group. llvm-svn: 133899	2011-06-26 22:34:10 +00:00
Rafael Espindola	ea1a9c342d	Merge SimpleRegisterCoalescing.cpp into RegisterCoalescer.cpp. llvm-svn: 133897	2011-06-26 22:06:36 +00:00
Rafael Espindola	14a314b1c6	merge SimpleRegisterCoalescing.h into RegisterCoalescer.h. llvm-svn: 133896	2011-06-26 21:54:28 +00:00
Rafael Espindola	fef3c64a1f	Move RegisterCoalescer.h to lib/CodeGen. llvm-svn: 133895	2011-06-26 21:41:06 +00:00
Rafael Espindola	4c9613c5e5	Remove unnecessary wrapper. llvm-svn: 133886	2011-06-26 19:47:36 +00:00
Owen Anderson	99adfec0b1	The scheduler needs to be aware on the existence of untyped nodes when it performs type propagation for EXTRACT_SUBREG. llvm-svn: 133838	2011-06-24 23:02:22 +00:00
Devang Patel	f071d72c44	Handle debug info for i128 constants. llvm-svn: 133821	2011-06-24 20:46:11 +00:00
Rafael Espindola	5135ae2383	Simplify llvm-svn: 133798	2011-06-24 15:50:56 +00:00
Rafael Espindola	cb0213bda6	Now that bb with phis are not considered simple, duplicate them even if we cannot duplicate to every predecessor. llvm-svn: 133797	2011-06-24 15:47:41 +00:00
Rafael Espindola	ad0cdd5606	Simplify now that blocks with phis are not considered simple. llvm-svn: 133793	2011-06-24 14:04:13 +00:00
Evan Cheng	247533179a	Starting to refactor Target to separate out code that's needed to fully describe target machine from those that are only needed by codegen. The goal is to sink the essential target description into MC layer so we can start building MC based tools without needing to link in the entire codegen. First step is to refactor TargetRegisterInfo. This patch added a base class MCRegisterInfo which TargetRegisterInfo is derived from. Changed TableGen to separate register description from the rest of the stuff. llvm-svn: 133782	2011-06-24 01:44:41 +00:00
Bill Wendling	9af2fa9d1b	Use the presence of the __compact_unwind section to indicate that a target supports compact unwind info instead of having a separate flag indicating this. llvm-svn: 133685	2011-06-23 05:13:28 +00:00
Rafael Espindola	e25a8710e5	Move more logic to shouldTailDuplicate and only duplicate regular bb before register allocation if it has a indirectbr or if we can duplicate it to every predecessor. This fixes the SingleSource/Benchmarks/Shootout-C++/matrix.cpp regression but keeps the previous improvements to sunspider. llvm-svn: 133682	2011-06-23 03:41:29 +00:00
Bill Wendling	f942585dae	Add a flag that indicates whether a target supports compact unwind info or not. llvm-svn: 133662	2011-06-22 23:16:51 +00:00
Rafael Espindola	2496c1f1f8	Reenable tail duplication of bb with just an unconditional jump, but don't remove blocks that have their address taken. llvm-svn: 133659	2011-06-22 22:31:57 +00:00
Bill Wendling	d346304373	Add a __LD,__compact_unwind section. If the linker supports it, this will hold the CIE and FDE information in a compact format. The implementation of the compact unwinding emission is coming soon. llvm-svn: 133658	2011-06-22 22:22:24 +00:00
Chad Rosier	cb7cfa4954	Revert r133607. This is causing failures in the Clang gccTestSuite. Specifically, gcc.c-torture/compile/pr21356.c. llvm-svn: 133646	2011-06-22 21:13:23 +00:00
Nick Lewycky	6208a2fd66	Emit trailing padding on constant vectors when TargetData says that the vector is larger than the sum of the elements (including per-element padding). llvm-svn: 133631	2011-06-22 18:55:03 +00:00
Jay Foad	83be361b8a	Replace the existing forms of ConstantArray::get() with a single form that takes an ArrayRef. llvm-svn: 133615	2011-06-22 09:24:39 +00:00
Rafael Espindola	0850f709de	Reenable the optimization added in 133415, but change the definition of a "simple" bb to be one with only one unconditional branch and no phis. Duplicating the phis in this case is possible, but requeres liveness analysis or breaking edges. llvm-svn: 133607	2011-06-22 04:01:58 +00:00
Devang Patel	d88b8babe0	After register is spilled there should not be any DBG_VALUE referring the same register. llvm-svn: 133569	2011-06-21 23:02:36 +00:00
Owen Anderson	d1955e78b4	Fix some trailing issues from my introduction of MVT::untyped and its use for REGISTER_SEQUENCE. llvm-svn: 133567	2011-06-21 22:54:23 +00:00
Bill Wendling	ddec6838a9	Add verbose EH table printing to SjLj exception tables. llvm-svn: 133561	2011-06-21 22:40:24 +00:00
Devang Patel	0ab7767b37	There could be more than one DBG_VALUE instructions for variables where all of them have offset based on one register. llvm-svn: 133560	2011-06-21 22:36:03 +00:00
Bill Wendling	a8339eb0d0	Improve the comment printing for the EH table. This gives a much more detailed explanation of what the EH table describes. llvm-svn: 133559	2011-06-21 22:30:20 +00:00
Evan Cheng	4c0bd9629d	Teach dag combine to match halfword byteswap patterns. 1. (((x) & 0xFF00) >> 8) \| (((x) & 0x00FF) << 8) => (bswap x) >> 16 2. ((x&0xff)<<8)\|((x&0xff00)>>8)\|((x&0xff000000)>>8)\|((x&0x00ff0000)<<8)) => (rotl (bswap x) 16) This allows us to eliminate most of the def : Pat patterns for ARM rev16 revsh instructions. It catches many more cases for ARM and x86. rdar://9609108 llvm-svn: 133503	2011-06-21 06:01:08 +00:00
Rafael Espindola	02f262e942	Disable again. llvm-svn: 133446	2011-06-20 17:04:08 +00:00
Rafael Espindola	336e10236f	Re enable 133415 with two fixes * Don't introduce a duplicated bb in the CFG * When making a branch unconditional, clear the PredCond array so that it is really unconditional. llvm-svn: 133432	2011-06-20 14:11:42 +00:00
Duncan Sands	406b9be057	Disable the logic added by rafael in commit 133415 to see if it brings the dragonegg buildbots back to life. Original commit message: Teach early dup how to duplicate basic blocks with one successor and only phi instructions into more complex blocks. llvm-svn: 133430	2011-06-20 09:26:23 +00:00
Nadav Rotem	d34ce4344b	Fix PromoteIntRes_TRUNCATE: Add support for cases where the source vector type is to be split while the target vector is to be promoted. (eg: <4 x i64> -> <4 x i8> ) llvm-svn: 133424	2011-06-20 07:15:58 +00:00
Francois Pichet	3f60acade6	Fix MSVC build. next() function already exists in the MSVC headers. This create a overload conflict. Make sure we pick up the llvm one. llvm-svn: 133416	2011-06-20 05:19:37 +00:00
Rafael Espindola	ef636bffb5	Teach early dup how to duplicate basic blocks with one successor and only phi instructions into more complex blocks. llvm-svn: 133415	2011-06-20 04:16:35 +00:00
Chris Lattner	cc19efaa97	Revamp the "ConstantStruct::get" methods. Previously, these were scattered all over the place in different styles and variants. Standardize on two preferred entrypoints: one that takes a StructType and ArrayRef, and one that takes StructType and varargs. In cases where there isn't a struct type convenient, we now add a ConstantStruct::getAnon method (whose name will make more sense after a few more patches land). It would be "really really nice" if the ConstantStruct::get and ConstantVector::get methods didn't make temporary std::vectors. llvm-svn: 133412	2011-06-20 04:01:31 +00:00
Jay Foad	6002068c13	Fix a FIXME by making GlobalVariable::getInitializer() return a const Constant *. llvm-svn: 133400	2011-06-19 18:37:11 +00:00
Nadav Rotem	94d67a02e0	Code cleanups: Remove duplicated logic in PromotInteRes_BITCAST, reserve vector space, reuse types. llvm-svn: 133389	2011-06-19 10:49:57 +00:00
Nadav Rotem	35d600d9f4	Calls to AssertZext and getZeroExtendInReg must be made using scalar types. llvm-svn: 133388	2011-06-19 10:22:39 +00:00
Nadav Rotem	36896bfd0c	When promoting the vector elements in CopyToParts, use vector trunc instead of scalarizing, and doing an element-by-element truncat. llvm-svn: 133382	2011-06-19 08:49:38 +00:00
Chris Lattner	f3f545ea8a	fix the varargs version of StructType::get to not require an LLVMContext, making usage much cleaner. llvm-svn: 133364	2011-06-18 22:48:56 +00:00
Benjamin Kramer	0fb6db6442	Simplify code. No change in functionality. llvm-svn: 133350	2011-06-18 13:53:47 +00:00
Benjamin Kramer	e1fc29b6ac	Don't allocate empty read-only SmallVectors during SelectionDAG deallocation. llvm-svn: 133348	2011-06-18 13:13:44 +00:00
Benjamin Kramer	25e17b0f89	Remove unused but set variables. llvm-svn: 133347	2011-06-18 11:09:41 +00:00
Eric Christopher	e4a1266a9a	Fix UMULO support for 2x register width to allow the full range without a libcall to a new mulo<mode> libcall that we'd have to create. Finishes the rest of rdar://9090077 and rdar://9210061 llvm-svn: 133318	2011-06-18 00:09:57 +00:00
Jakob Stoklund Olesen	becf3d3f29	Only call TRI::getRawAllocationOrder to resolve a target-dependent hint. llvm-svn: 133313	2011-06-17 23:26:52 +00:00
Eric Christopher	232431c389	Fix comment. llvm-svn: 133307	2011-06-17 22:35:59 +00:00
Bill Wendling	b74b9de151	Use the verbose asm flag instead of a new flag for decoding the LSDA. llvm-svn: 133292	2011-06-17 20:55:01 +00:00
Eric Christopher	5bbb2bdb46	Lower multiply with overflow checking to __mulo<mode> calls if we haven't been able to lower them any other way. Fixes rdar://9090077 and rdar://9210061 llvm-svn: 133288	2011-06-17 20:41:29 +00:00
Bill Wendling	e303114b3c	Add an option that allows one to "decode" the LSDA. The LSDA is a bit difficult for the non-initiated to read. Even with comments, it's not always clear what's going on. This wraps the ASM streamer in a class that retains the LSDA and then emits a human-readable description of what's going on in it. So instead of having to make sense of: Lexception1: .byte 255 .byte 155 .byte 168 .space 1 .byte 3 .byte 26 Lset0 = Ltmp7-Leh_func_begin1 .long Lset0 Lset1 = Ltmp812-Ltmp7 .long Lset1 Lset2 = Ltmp913-Leh_func_begin1 .long Lset2 .byte 3 Lset3 = Ltmp812-Leh_func_begin1 .long Lset3 Lset4 = Leh_func_end1-Ltmp812 .long Lset4 .long 0 .byte 0 .byte 1 .byte 0 .byte 2 .byte 125 .long __ZTIi@GOTPCREL+4 .long __ZTIPKc@GOTPCREL+4 you can read this instead: ## Exception Handling Table: Lexception1 ## @LPStart Encoding: omit ## @TType Encoding: indirect pcrel sdata4 ## @TType Base: 40 bytes ## @CallSite Encoding: udata4 ## @Action Table Size: 26 bytes ## Action 1: ## A throw between Ltmp7 and Ltmp812 jumps to Ltmp913 on an exception. ## For type(s): __ZTIi@GOTPCREL+4 __ZTIPKc@GOTPCREL+4 ## Action 2: ## A throw between Ltmp812 and Leh_func_end1 does not have a landing pad. llvm-svn: 133286	2011-06-17 20:35:21 +00:00
Jakub Staszak	5f45dc7636	getSuccWeight returns now default 0 if Weights vector is empty. llvm-svn: 133271	2011-06-17 18:00:21 +00:00
Jakub Staszak	2ce8399a2d	Allow empty Weights vector. llvm-svn: 133265	2011-06-17 17:30:10 +00:00
Rafael Espindola	e0304d1df9	Two fixes relating to debug value: * We should change the generated code because of a debug use. * Avoid creating debug uses of undef, as they become a kill. Test to follow. llvm-svn: 133255	2011-06-17 13:59:43 +00:00
Lang Hames	934625efc1	Add a hook for PBQP clients to run a custom pre-alloc pass to run prior to PBQP allocation. Patch by Arnaud Allard de Grandmaison. llvm-svn: 133249	2011-06-17 07:09:01 +00:00
Rafael Espindola	79a4b7e55c	Enable early duplication of small blocks. There are still improvements to be made, but this is already a win. llvm-svn: 133240	2011-06-17 05:54:50 +00:00
Jakob Stoklund Olesen	801f7ab321	Rename TRI::getAllocationOrder() to getRawAllocationOrder(). Also switch the return type to ArrayRef<unsigned> which works out nicely for ARM's implementation of this function because of the clever ArrayRef constructors. The name change indicates that the returned allocation order may contain reserved registers as has been the case for a while. llvm-svn: 133216	2011-06-16 23:31:16 +00:00
Jakob Stoklund Olesen	c826df9506	Don't use register classes larger than TLI->getRegClassFor(VT). In Thumb mode we cannot handle GPR virtual registers, even though some instructions can. When isel is lowering a CopyFromReg, it should limit itself to subclasses of getRegClassFor(VT). <rdar://problem/9624323> llvm-svn: 133210	2011-06-16 22:50:38 +00:00
Jakob Stoklund Olesen	4f5f84c7e7	Teach antidependency breakers to use RegisterClassInfo. No functional change was intended. llvm-svn: 133202	2011-06-16 21:56:21 +00:00
Jakob Stoklund Olesen	08322b7dc3	Move PBQP off allocation_order_begin. No functional change intended. I think PBQP could use RegisterClassInfo, but it didn't fit neatly with the external interfaces that PBQP uses, so I'll leave that to Lang. llvm-svn: 133186	2011-06-16 20:37:45 +00:00
Jakub Staszak	12a43bdde5	Introduce MachineBranchProbabilityInfo class, which has similar API to BranchProbabilityInfo (expect setEdgeWeight which is not available here). Branch Weights are kept in MachineBasicBlocks. To turn off this analysis set -use-mbpi=false. llvm-svn: 133184	2011-06-16 20:22:37 +00:00
Owen Anderson	5fc8b77f83	Change the REG_SEQUENCE SDNode to take an explict register class ID as its first operand. This operand is lowered away by the time we reach MachineInstrs, so the actual register-allocation handling of them doesn't need to change. This is intended to support using REG_SEQUENCE SDNode's with type MVT::untyped, and is part of the long road to eliminating some of the hacks we currently use to support register pairs and other strange constraints, particularly on ARM NEON. llvm-svn: 133178	2011-06-16 18:17:13 +00:00
Jakob Stoklund Olesen	89a7e5ad45	Switch linear scan to using RegisterClassInfo. This avoids the manual filtering of reserved registers and removes the dependency on allocation_order_begin(). Palliative care... llvm-svn: 133177	2011-06-16 18:17:00 +00:00
Jakub Staszak	feadd435c1	Test commit. llvm-svn: 133174	2011-06-16 18:01:17 +00:00
Jakob Stoklund Olesen	1f641d577e	Add TargetRegisterInfo::getRawAllocationOrder(). This virtual function will replace allocation_order_begin/end as the one to override when implementing custom allocation orders. It is simpler to have one function return an ArrayRef than having two virtual functions computing different ends of the same array. Use getRawAllocationOrder() in place of allocation_order_begin() where it makes sense, but leave some clients that look like they really want the filtered allocation orders from RegisterClassInfo. llvm-svn: 133170	2011-06-16 17:42:25 +00:00
Nick Lewycky	6d677cfdd8	Add a DAGCombine for (ext (binop (load x), cst)). llvm-svn: 133124	2011-06-16 01:15:49 +00:00
Anna Zaks	2c2aa9a9be	Function::getNumBlockIDs() should be used instead of Function::size() to set the upper limit on the block IDs since basic blocks might get removed (simplified away) after being initially numbered. Plus the test case, in which SelectionDAGBuilder::visitBr() calls llvm::MachineFunction::removeFromMBBNumbering(), which introduces the hole in numbering leading to an assert in llc (prior to the fix). llvm-svn: 133113	2011-06-16 00:03:21 +00:00
John McCall	d935e9c359	The ARC language-specific optimizer. Credit to Dan Gohman. llvm-svn: 133108	2011-06-15 23:37:01 +00:00
Owen Anderson	96adc4a540	Add a new MVT::untyped. This will be used in future work for modelling ISA features like register pairs and lists with "interesting" constraints (such as ARM NEON contiguous register lists or even-odd paired registers). We need to be able to generate these instructions (often from intrinsics), but don't want to have to assign a legal type to them. Instead, we'll use an "untyped" edge to bypass the type-checking and simply ensure that the register classes match. llvm-svn: 133106	2011-06-15 23:35:18 +00:00
Rafael Espindola	ab20567227	Handle jump tables. Test to follow soon. llvm-svn: 133083	2011-06-15 21:00:28 +00:00
Andrew Trick	3013b6ae4a	Added -stress-sched flag in the Asserts build. Added a test case for handling physreg aliases during pre-RA-sched. llvm-svn: 133063	2011-06-15 17:16:12 +00:00
Nadav Rotem	13cb7736a7	getZeroExtendInReg needs to get a scalar type llvm-svn: 133057	2011-06-15 14:37:18 +00:00
Nadav Rotem	d2d9bdb2b0	Enable the simplification of truncating-store after fixing the usage of GetDemandBits (which must operate on the vector element type). Fix the a usage of getZeroExtendInReg which must also be done on scalar types. llvm-svn: 133052	2011-06-15 11:19:12 +00:00
Chad Rosier	818e116723	When pattern matching during instruction selection make sure shl x,1 is not converted to add x,x if x is a undef. add undef, undef does not guarantee that the resulting low order bit is zero. Fixes <rdar://problem/9453156> and <rdar://problem/9487392>. llvm-svn: 133022	2011-06-14 22:29:10 +00:00
Eli Friedman	8a3264ad48	Revert r133004 ; it's breaking nightly tests. llvm-svn: 133007	2011-06-14 19:30:33 +00:00
Rafael Espindola	5e85158321	Partial revert of 132882. Dan noted that this would work on the case shown on the commit message. I think the case that was failing was a bb ending with a redundant conditional jump: ... jne foo foo: ... I was unable to find any such case in the tests or in a debug build of clang, so I will revert this part of the patch and watch the bots. llvm-svn: 133004	2011-06-14 18:12:31 +00:00
Rafael Espindola	3aeaf9e4c1	Add 132986 back, but avoid non-determinism if a bb address gets reused. llvm-svn: 132995	2011-06-14 15:31:54 +00:00
Rafael Espindola	06ba7a68de	revert 132986 to see if the bots go green. llvm-svn: 132988	2011-06-14 12:48:26 +00:00
Nadav Rotem	10193c830b	Add a testcase for checking the integer-promotion of many different vector types (with power of two types such as 8,16,32 .. 512). Fix a bug in the integer promotion of bitcast nodes. Enable integer expanding only if the target of the conversion is an integer (when the type action is scalarize). Add handling to the legalization of vector load/store in cases where the saved vector is integer-promoted. llvm-svn: 132985	2011-06-14 08:11:52 +00:00
Nadav Rotem	571ae19af7	Disable trunc-store simplification on vectors. llvm-svn: 132984	2011-06-14 07:18:26 +00:00
Rafael Espindola	844485af13	Implement Jakob's suggestion on how to detect fall thought without calling AnalyzeBranch. llvm-svn: 132981	2011-06-14 06:08:32 +00:00
Bruno Cardoso Lopes	dc9ff3a4b1	Add one more argument to the prefetch intrinsic to indicate whether it's a data or instruction cache access. Update the targets to match it and also teach autoupgrade. llvm-svn: 132976	2011-06-14 04:58:37 +00:00
Rafael Espindola	da24f2f8e1	Make the threshold used by branch folding softer. Before we would get a sharp all or nothing transition when one extra predecessor was added. Now we still test first ones for merging. llvm-svn: 132974	2011-06-14 04:41:17 +00:00
Nadav Rotem	573ee374a2	Fix a bug in FindMemType. When widening vector loads, use a wider memory type only if the number of packed elements is a power of two. Bug found in Duncan's testcase. llvm-svn: 132923	2011-06-13 18:13:24 +00:00
Jakob Stoklund Olesen	fb03a92c33	Be less aggressive about hinting in RAFast. In particular, don't spill dirty registers only to satisfy a hint. It is not worth it. The attached test case provides an example where the fast allocator would spill a register when other registers are available. llvm-svn: 132900	2011-06-13 03:26:46 +00:00
Jakob Stoklund Olesen	f4f66f36c7	Include callee-saved registers in debug output. llvm-svn: 132899	2011-06-13 03:26:42 +00:00
Rafael Espindola	51d2d7aabc	Fix invalid uses of Twine. Hopefully this fixes the problem that Takumi is having. llvm-svn: 132898	2011-06-13 03:09:13 +00:00
Nadav Rotem	504cf0cde2	Fix a bug in the calculation of the vectorTypeBreakdown into registers. Odd types such as i33 were rounded to i32. Originated from Duncan's testcase. llvm-svn: 132893	2011-06-12 14:56:55 +00:00
Nadav Rotem	083837e729	Improve the generated code by getCopyFromPartsVector for promoted integer types. Instead of scalarizing, and doing an element-by-element truncat, use vector truncate. Add support for scalarization of vectors: i8 -> <1 x i1> (from Duncan's testcase). llvm-svn: 132892	2011-06-12 14:49:38 +00:00
Rafael Espindola	2f3c2fe7c5	Really fix the fall-through logic. Add a triple to the tests. llvm-svn: 132885	2011-06-12 05:57:01 +00:00
Rafael Espindola	653a07206d	Fix silly bug I introduce in the previous commit. Fixes debug builds. llvm-svn: 132883	2011-06-12 05:26:32 +00:00
Rafael Espindola	defd4b0875	AnalyzeBranch doesn't change which successors a bb has, just the order we try to branch to them. Before we were creating successor lists with duplicated entries. Fixing that found a bug in isBlockOnlyReachableByFallthrough that would causes it to return the wrong answer for ----------- ... jne foo jmp bar foo: ---------- llvm-svn: 132882	2011-06-12 03:20:32 +00:00
Chad Rosier	79044dbebf	Revert r132871. llvm-svn: 132872	2011-06-11 02:27:46 +00:00
Chad Rosier	5793b53027	Typo. llvm-svn: 132871	2011-06-11 02:16:36 +00:00
Eric Christopher	eb964516c3	80-col cleanups. llvm-svn: 132863	2011-06-10 23:05:08 +00:00
Rafael Espindola	0f62e4c428	Removed tabs. Also fixed my editor... llvm-svn: 132857	2011-06-10 21:01:53 +00:00
Cameron Zwarich	8b58a83889	Rename the ParmContext enum values to make a bit more sense and add a small comment on their meaning. llvm-svn: 132854	2011-06-10 20:37:36 +00:00
Cameron Zwarich	6221139453	Remove tabs. llvm-svn: 132853	2011-06-10 20:31:39 +00:00
Cameron Zwarich	86ceec1b42	Remove a pointless const_cast. llvm-svn: 132852	2011-06-10 20:30:08 +00:00
Rafael Espindola	1ffadd7809	Remove duplicated test. Thanks Bob Wilson for noticing it! llvm-svn: 132851	2011-06-10 20:08:23 +00:00
Chad Rosier	b90a43d266	Ensure that EmitGlobalVariable is correctly differentiating between declarations and definitions when emitting global variables. This was causing global declarations to be emitted as if they were definitions. Fixes <rdar://problem/9429892>. llvm-svn: 132825	2011-06-10 00:53:15 +00:00
Rafael Espindola	9e97a895f3	Make the optional verification step more strict. llvm-svn: 132822	2011-06-09 23:55:56 +00:00
Rafael Espindola	c9e93a44be	Avoid a gcc warning about multiline comments. llvm-svn: 132821	2011-06-09 23:51:45 +00:00
Rafael Espindola	c735f13368	On last fix to the early tail duplication. With this I am able to bootstrap clang with early tail duplication enabled for any small bb and setting tail-dup-size to a relatively large value(8) to stress this code. llvm-svn: 132816	2011-06-09 23:22:56 +00:00
Rafael Espindola	81512fc1bb	Also consider phi nodes when deciding if a register is live out. llvm-svn: 132814	2011-06-09 22:53:47 +00:00
Eli Friedman	1877ac9937	Change this DAGCombine to build AND of SHR instead of SHR of AND; this matches the ordering we prefer in instcombine. Part of rdar://9562809. The potential DAGCombine which enforces this more generally messes up some other very fragile patterns, so I'm leaving that alone, at least for now. llvm-svn: 132809	2011-06-09 22:14:44 +00:00
Rafael Espindola	c90a32a4e6	AnalyzeBranch modifies the bb, but we don't want to modify a bb with eh edges. Swap the order of the checks to avoid it. llvm-svn: 132806	2011-06-09 21:43:25 +00:00
Rafael Espindola	887fc1bdeb	A PHI in this basic block is a use in another basic block. llvm-svn: 132805	2011-06-09 20:55:41 +00:00
Rafael Espindola	73f93930e0	Refactor some checks into shouldTailDuplicate. Update comments. No functionality change. llvm-svn: 132798	2011-06-09 19:54:42 +00:00
Eric Christopher	cafa08cbf3	Recommit r132764 since it didn't cause the windows buildbot failures. llvm-svn: 132776	2011-06-09 15:39:01 +00:00
Eric Christopher	76fd742d16	Temporarily revert 132764 to see if it fixes the Windows buildbot. llvm-svn: 132771	2011-06-09 06:29:54 +00:00
Eric Christopher	11edab6a46	If the alignment of the byval argument is greater than the alignment of the frame then increase the maximum alignment of the frame to match. Fixes PR6965 llvm-svn: 132764	2011-06-09 00:15:19 +00:00
Eric Christopher	0713a9d8fc	Add a parameter to CCState so that it can access the MachineFunction. No functional change. Part of PR6965 llvm-svn: 132763	2011-06-08 23:55:35 +00:00
Andrew Trick	6ed0c63559	Remove a temporary test case probe in CheckForLiveRegDef. llvm-svn: 132751	2011-06-08 15:19:49 +00:00
Rafael Espindola	eabd18b931	Fix count. llvm-svn: 132749	2011-06-08 14:23:19 +00:00
Rafael Espindola	dfbf6de747	Count how many phis we are creating. llvm-svn: 132748	2011-06-08 14:13:31 +00:00
Cameron Zwarich	2e252de512	Fix an issue where the two-address conversion pass incorrectly rewrites untied operands to an early clobber register. This fixes <rdar://problem/9566076>. llvm-svn: 132738	2011-06-07 23:54:00 +00:00
Rafael Espindola	c85e0d81e4	Fix a silly error I introduce in r131951. Fixes PR10095. llvm-svn: 132735	2011-06-07 23:26:45 +00:00
Andrew Trick	0af2e47310	Fix a merge bug in preRAsched for handling physreg aliases. I've been sitting on this long enough trying to find a test case. I think the fix should go in now, but I'll keep working on the test case. llvm-svn: 132701	2011-06-07 00:38:12 +00:00
Jakob Stoklund Olesen	df476270eb	Simplify local live range splitting's safeguard to fix PR10070. When local live range splitting creates a live range with the same number of instructions as the old range, mark it as RS_Local. When such a range is seen again, require that it be split in a way that reduces the number of instructions. That guarantees we are making progress while still being able to perform 3 -> 2+3 splits as required by PR10070. This also means that the PrevSlot map is no longer needed. This was also used to estimate new spill weights, but that is no longer necessary after slotIndexes::insertMachineInstrInMaps() got the extra Late insertion argument. llvm-svn: 132697	2011-06-06 23:55:20 +00:00
Jakob Stoklund Olesen	0cde8eb9e2	Get allocation orders from RegisterClassInfo when possible. Only target-dependent hints require callbacks. The RCI allocation order has CSR aliases last according to their order of appearance in the getCalleeSavedRegs list. This can depend on the calling convention. This way, AllocationOrder::next doesn't have to check for reserved registers, and CSRs are always allocated last, even with weird calling conventions. llvm-svn: 132690	2011-06-06 21:02:04 +00:00
Nadav Rotem	c807fa5687	Add methods to support the integer-promotion of vector types. Methods to legalize SDNodes such as BUILD_VECTOR, EXTRACT_VECTOR_ELT, etc. llvm-svn: 132689	2011-06-06 20:55:56 +00:00
Stuart Hastings	bee6fcc5aa	Avoid FGETSIGN of 80-bit types. Fixes PR10085. llvm-svn: 132681	2011-06-06 16:44:31 +00:00
Jakob Stoklund Olesen	b7657d0225	Don't try to be clever, just preserve the target's allocation order. The order of registers returned by getCalleeSavedRegs is used to lay out the fixed stack slots for CSRs. Some targets like their CSRs used from one end, and some targets want them used from the other end. When computing an allocation order, simply preserve the relative ordering of CSRs that the target specifies in its allocation order. Reordering CSRs would break some targets, ARM in particular. We still place volatiles before the CSRs, providing slightly better results with different calling conventions. llvm-svn: 132680	2011-06-06 16:36:30 +00:00
Eli Friedman	bd375f1a3f	PR10077: fix fast-isel of extractvalue of aggregate constants. llvm-svn: 132676	2011-06-06 05:46:34 +00:00
Benjamin Kramer	440c3b7306	Use path API for path concatenation. llvm-svn: 132668	2011-06-05 14:36:47 +00:00
Nadav Rotem	06bd6d304e	TypeLegalizer: Add support for passing of vector-promoted types in registers (copyFromParts/copyToParts). llvm-svn: 132649	2011-06-04 20:58:08 +00:00
Nadav Rotem	78d19bebe6	TypeLegalizer: Fix a bug in the promotion of elements of integer vectors. (only happens when using the -promote-elements option). The correct legalization order is to first try to promote element. Next, we try to widen vectors. llvm-svn: 132648	2011-06-04 20:32:01 +00:00
Jakob Stoklund Olesen	b8bf3c0f8b	Switch AllocationOrder to using RegisterClassInfo instead of a BitVector of reserved registers. Use RegisterClassInfo in RABasic as well. This slightly changes som allocation orders because RegisterClassInfo puts CSR aliases last. llvm-svn: 132581	2011-06-03 20:34:53 +00:00
Jakob Stoklund Olesen	3460ae88b2	Preserve the original ordering when a CSR has multiple aliases. Previously, these aliases would be ordered alphabetically. (BH, BL) Print out the computed allocation orders. llvm-svn: 132580	2011-06-03 20:34:50 +00:00
Eric Christopher	fbff0e4f26	Add a TODO about memory operands. llvm-svn: 132559	2011-06-03 17:21:23 +00:00
Jakob Stoklund Olesen	4b0bb8396a	Avoid calling TRI->getAllocatableSet in RAFast. When compiling a program with lots of small functions like 483.xalancbmk, this makes RAFast 11% faster. Add some comments to clarify the difference between unallocatable and reserved registers. It's quite subtle. The fast register allocator depends on EFLAGS' not being allocatable on x86. That way it can completely avoid tracking liveness, and it won't mind when there are multiple uses of a single def. llvm-svn: 132514	2011-06-02 23:41:40 +00:00
Eric Christopher	de9399bf76	Have LowerOperandForConstraint handle multiple character constraints. Part of rdar://9119939 llvm-svn: 132510	2011-06-02 23:16:42 +00:00
Jakob Stoklund Olesen	75703ca76f	Make it possible to have unallocatable register classes. Some register classes are only used for instruction operand constraints. They should never be used for virtual registers. Previously, those register classes were given an empty allocation order, but now you can say 'let isAllocatable=0' in the register class definition. TableGen calculates if a register is part of any allocatable register class, and makes that information available in TargetRegisterDesc::inAllocatableClass. The goal here is to eliminate use cases for overriding allocation_order_* methods. llvm-svn: 132508	2011-06-02 23:07:20 +00:00
Jakob Stoklund Olesen	e242ebea50	Just use a SmallVector. I was confused whether new uint8_t[] would zero-initialize the returned array, and it seems that so is gcc-4.0. This should fix the test failures on darwin 9. llvm-svn: 132500	2011-06-02 22:22:43 +00:00
Devang Patel	5ca0837397	Remove dead code. llvm-svn: 132488	2011-06-02 21:31:00 +00:00
Devang Patel	f02a376fbc	Update DBG_VALUEs while breaking anti dependencies. llvm-svn: 132487	2011-06-02 21:26:52 +00:00
Devang Patel	e5feef0fe1	During post RA scheduling, do not try to chase reg defs. to preserve DBG_VALUEs. This approach has several downsides, for example, it does not work when dbg value is a constant integer, it does not work if reg is defined more than once, it places end of debug value range markers in the wrong place. It even causes misleading incorrect debug info when duplicate DBG_VALUE instructions point to same reg def. Instead, use simpler approach and let DBG_VALUE follow its predecessor instruction. After live debug value analysis pass, all DBG_VALUE instruction are placed at the right place. Thanks Jakob for the hint! llvm-svn: 132483	2011-06-02 20:07:12 +00:00
Rafael Espindola	aa318ae495	Revert 132424 to fix PR10068. llvm-svn: 132479	2011-06-02 19:57:47 +00:00
Jakob Stoklund Olesen	50663b7485	Use RegisterClassInfo::getOrder in RAFast. This saves two virtual function calls and an Allocatable BitVector test, making RAFast run 2% faster. llvm-svn: 132471	2011-06-02 18:35:30 +00:00
Benjamin Kramer	c8c4f7640a	Start with a zeroed CSRNum map. Found by valgrind. llvm-svn: 132457	2011-06-02 12:07:44 +00:00
Jakob Stoklund Olesen	09e6667531	Initialize members to fix problem found by valgrind. llvm-svn: 132456	2011-06-02 05:43:49 +00:00
Jakob Stoklund Olesen	aff1060207	Use TRI::has{Sub,Super}ClassEq() where possible. No functional change. llvm-svn: 132455	2011-06-02 05:43:46 +00:00
Jakob Stoklund Olesen	c58894bc36	Add a RegisterClassInfo class that lazily caches information about register classes. It provides information for each register class that cannot be determined statically, like: - The number of allocatable registers in a class after filtering out the reserved and invalid registers. - The preferred allocation order with registers that overlap callee-saved registers last. - The last callee-saved register that overlaps a given physical register. This information usually doesn't change between functions, so it is reused for compiling multiple functions when possible. The many possible combinations of reserved and callee saves registers makes it unfeasible to compute this information statically in TableGen. Use RegisterClassInfo to count available registers in various heuristics in SimpleRegisterCoalescing, making the pass run 4% faster. llvm-svn: 132450	2011-06-02 02:19:35 +00:00
Devang Patel	e7181b5fdb	A DBG_VALUE that truncates a range does not start another dbg value range. llvm-svn: 132433	2011-06-01 23:00:17 +00:00
Devang Patel	324f843107	Do not drop constant values when a variable's content is described using .debug_loc entries. llvm-svn: 132427	2011-06-01 22:03:25 +00:00
Stuart Hastings	7adc95f69e	Recommit 132404 with fixes. rdar://problem/5993888 llvm-svn: 132424	2011-06-01 21:33:14 +00:00
Eric Christopher	690030c116	Allow bitcasts between valid types of the same size and vector types if the vector type is legal. Fixes rdar://9306086 llvm-svn: 132420	2011-06-01 19:55:10 +00:00
Nadav Rotem	22ad9bb7d9	Refactor LegalizeTypes: Erase LegalizeAction and make the type legalizer use the TargetLowering enum. llvm-svn: 132418	2011-06-01 19:47:10 +00:00
Jakob Stoklund Olesen	e9cc8e90b7	Revert r132358 "Simplify the eviction policy by making the failsafe explicit." This commit caused regressions in i386 flops-[568], matrix, salsa20, 256.bzip2, and enc-md5. llvm-svn: 132413	2011-06-01 18:45:02 +00:00
Stuart Hastings	3ae49c03a4	Fix double FGETSIGN to work on x86_32; followup to 132396. rdar://problem/5660695 llvm-svn: 132411	2011-06-01 18:32:25 +00:00
Stuart Hastings	fd5ecd0cec	Turn on FGETSIGN for x86. Followup to 132388. rdar://problem/5660695 llvm-svn: 132396	2011-06-01 14:04:17 +00:00
Nadav Rotem	8b24a731f2	This patch is another step in the direction of adding vector select. In this patch we add a flag to enable a new type legalization decision - to promote integer elements in vectors. Currently, the rest of the codegen does not support this kind of legalization. This flag will be removed when the transition is complete. llvm-svn: 132394	2011-06-01 12:51:46 +00:00
Andrew Trick	18c9b37a42	Add an issue width check to the postRA scheduler. Patch by Max Kazakov! For targets with no itinerary (x86) it is a nop by default. For targets with issue width already expressed in the itinerary (ARM) it bypasses a scoreboard check but otherwise does not affect the schedule. It does make the code more consistent and complete and allows new targets to specify their issue width in an arbitrary way. llvm-svn: 132385	2011-06-01 03:27:56 +00:00
Bill Wendling	48581a6454	The ARM stuff already calls the Resume function, not the Resume_or_Rethrow. It turns out that it could cause an infinite loop in some situations. If this code is triggered and it converts a cleanup into a catchall, but that cleanup was in already in a cleanup, then the _Unwind_SjLj_Resume could infinite loop. I.e., the code doesn't consume the exception object and passes it on to _Unwind_SjLj_Resume. But _USjLjR expects it to be consumed (since it's landing at a catchall instead of a cleanup). So it uses the values that are presently there, which are the values that tell it to jump to the fake landing pad. <rdar://problem/9508402> llvm-svn: 132381	2011-06-01 01:49:35 +00:00
Devang Patel	562c74284f	Incomplete type may not have corresponding DIE, so do not check DIEEntry eagerly. llvm-svn: 132377	2011-06-01 00:23:24 +00:00
Devang Patel	1cb8ab456c	Refactor. llvm-svn: 132373	2011-05-31 23:30:30 +00:00
Devang Patel	e9853f25ad	Include global types, that are referenced through local variables, in debug_pubtypes list. llvm-svn: 132371	2011-05-31 22:56:51 +00:00
Jakob Stoklund Olesen	73e18b7aea	Simplify the eviction policy by making the failsafe explicit. When assigned ranges are evicted, they are put in the RS_Evicted stage and are not allowed to evict anything else. That prevents looping automatically. When evicting ranges just to get a cheaper register, use only spill weights to find the possible candidates. Avoid breaking hints for this purpose, it is not worth it. Start implementing more complex eviction heuristics, guarded by the temporary -complex-eviction flag. The initial version permits a heavier range to be evicted if it doesn't have any uses where the evicting range is live. This makes it a good candidate for live ranfge splitting. llvm-svn: 132358	2011-05-31 21:02:44 +00:00

... 6 7 8 9 10 ...

12604 Commits