llvm-project

Commit Graph

Author	SHA1	Message	Date
Elena Demikhovsky	14a4af0e66	Optimized load + SIGN_EXTEND patterns in the X86 backend. llvm-svn: 170506	2012-12-19 07:50:20 +00:00
Nadav Rotem	33360d8ae9	After reducing the size of an operation in the DAG we zero-extend the reduced bitwidth op back to the original size. If we reduce ANDs then this can cause an endless loop. This patch changes the ZEXT to ANY_EXTEND if the demanded bits are equal or smaller than the size of the reduced operation. llvm-svn: 170505	2012-12-19 07:39:08 +00:00
Bill Wendling	3d7b0b8ac7	Rename the 'Attributes' class to 'Attribute'. It's going to represent a single attribute in the future. llvm-svn: 170502	2012-12-19 07:18:57 +00:00
Craig Topper	3f194c8f4f	Remove more of 'else's after 'returns'. No functional change. llvm-svn: 170497	2012-12-19 06:43:58 +00:00
Craig Topper	5dd8291cbe	Remove a bunch of 'else's after 'returns' llvm-svn: 170496	2012-12-19 06:39:17 +00:00
Craig Topper	63f5921776	Teach SimplifySetCC that comparing AssertZext i1 against a constant 1 can be rewritten as a compare against a constant 0 with the opposite condition. llvm-svn: 170495	2012-12-19 06:12:28 +00:00
Jakob Stoklund Olesen	d742533dbc	Use bidirectional bundle flags to simplify important functions. The bundle_iterator::operator++ function now doesn't need to dig out the basic block and check against end(). It can use the isBundledWithSucc() flag to find the last bundled instruction safely. Similarly, MachineInstr::isBundled() no longer needs to look at iterators etc. It only has to look at flags. llvm-svn: 170473	2012-12-18 23:21:49 +00:00
Jakob Stoklund Olesen	00f6c7754b	Verify bundle flag consistency when setting them. Now that the bundle flag aware APIs are all in place, it is possible to continuously verify the flag consistency. llvm-svn: 170465	2012-12-18 23:00:28 +00:00
Jakob Stoklund Olesen	29c277197e	Verify bundle flags for consistency in MachineVerifier. The new bidirectional bundle flags are redundant, so inadvertent bundle tearing can be detected in the machine code verifier. llvm-svn: 170463	2012-12-18 22:55:07 +00:00
Jakob Stoklund Olesen	a33f504b3e	Don't allow the automatically updated MI flags to be set directly. The bundle-related MI flags need to be kept in sync with the neighboring instructions. Don't allow the bulk flag-setting setFlags() function to change them. Also don't copy MI flags when cloning an instruction. The clone's bundle flags will be set when it is explicitly inserted into a bundle. llvm-svn: 170459	2012-12-18 21:36:05 +00:00
Jakob Stoklund Olesen	78eaf05fa7	Tighten up the splice() API for bundled instructions. Remove the instr_iterator versions of the splice() functions. It doesn't seem useful to be able to splice sequences of instructions that don't consist of full bundles. The normal splice functions that take MBB::iterator arguments are not changed, and they can move whole bundles around without any problems. llvm-svn: 170456	2012-12-18 20:59:41 +00:00
Andrew Trick	ec2564818c	MISched: add dependence to ExitSU to model live-out latency. llvm-svn: 170454	2012-12-18 20:53:01 +00:00
Andrew Trick	ef23569858	MISched: Cleanup, redundant statement. llvm-svn: 170453	2012-12-18 20:52:58 +00:00
Andrew Trick	d6d5ad3d7b	MISched: Heuristics, compare latency more precisely. It matters more for some targets. llvm-svn: 170452	2012-12-18 20:52:56 +00:00
Andrew Trick	44f54d97a4	MISched: Remove SchedRemainder::IsResourceLimited. I don't know how to compute it. llvm-svn: 170451	2012-12-18 20:52:54 +00:00
Andrew Trick	493b867b5d	MISched: cleanup, use the proper iterator type. llvm-svn: 170450	2012-12-18 20:52:52 +00:00
Andrew Trick	ffb6168e85	MISched: minor improvement, initialize remaining resources before the first scheduling decision. llvm-svn: 170449	2012-12-18 20:52:49 +00:00
Jakob Stoklund Olesen	422e07b091	Tighten the insert() API for bundled instructions. The normal insert() function takes an MBB::iterator position, and inserts a stand-alone MachineInstr as before. The insert() function that takes an MBB::instr_iterator position can insert instructions inside a bundle, and will now update the bundle flags correctly when that happens. When the insert position is between two bundles, it is unclear whether the instruction should be appended to the previous bundle, prepended to the next bundle, or stand on its own. The MBB::insert() function doesn't bundle the instruction in that case, use the MIBundleBuilder class for that. llvm-svn: 170437	2012-12-18 17:54:53 +00:00
Hal Finkel	943f76d1b3	Check multiple register classes for inline asm tied registers A register can be associated with several distinct register classes. For example, on PPC, the floating point registers are each associated with both F4RC (which holds f32) and F8RC (which holds f64). As a result, this code would fail when provided with a floating point register and an f64 operand because it would happen to find the register in the F4RC class first and return that. From the F4RC class, SDAG would extract f32 as the register type and then assert because of the invalid implied conversion between the f64 value and the f32 register. Instead, search all register classes. If a register class containing the the requested register has the requested type, then return that register class. Otherwise, as before, return the first register class found that contains the requested register. llvm-svn: 170436	2012-12-18 17:50:58 +00:00
Jakob Stoklund Olesen	ccfb5fb472	Tighten up the erase/remove API for bundled instructions. Most code is oblivious to bundles and uses the MBB::iterator which only visits whole bundles. MBB::erase() operates on whole bundles at a time as before. MBB::remove() now refuses to remove bundled instructions. It is not safe to remove all instructions in a bundle without deleting them since there is no way of returning pointers to all the removed instructions. MBB::remove_instr() and MBB::erase_instr() will now update bundle flags correctly, lifting individual instructions out of bundles while leaving the remaining bundle intact. The MachineInstr convenience functions are updated so eraseFromParent() erases a whole bundle as before eraseFromBundle() erases a single instruction, leaving the rest of its bundle. removeFromParent() refuses to operate on bundled instructions, and removeFromBundle() lifts a single instruction out of its bundle. These functions will no longer accidentally split or coalesce bundles - bundle flags are updated to preserve the existing bundling, and explicit bundleWith* / unbundleFrom* functions should be used to change the instruction bundling. This API update is still a work in progress. I am going to update APIs first so they maintain bundle flags automatically when possible. Then I'll add stricter verification of the bundle flags. llvm-svn: 170384	2012-12-17 23:55:38 +00:00
Patrik Hagglund	c494d24a68	Revert/correct some FastISel changes in r170104 (EVT->MVT for TargetLowering::getRegClassFor). Some isSimple() guards were missing, or getSimpleVT() were hoisted too far, resulting in asserts on valid LLVM assembly input. llvm-svn: 170336	2012-12-17 14:30:06 +00:00
Craig Topper	588ceec0f7	Add debug prints for when optimizeLoadInstr folds a load. llvm-svn: 170298	2012-12-17 03:56:00 +00:00
Dmitri Gribenko	2943ce80f3	Declare class DwarfDebug before use instead of relying on a forward declaration from some other unrelated header. Patch by Kai. llvm-svn: 170284	2012-12-16 12:57:36 +00:00
Reed Kotler	aee4d5d194	This patch is needed to make c++ exceptions work for mips16. Mips16 is really a processor decoding mode (ala thumb 1) and in the same program, mips16 and mips32 functions can exist and can call each other. If a jal type instruction encounters an address with the lower bit set, then the processor switches to mips16 mode (if it is not already in it). If the lower bit is not set, then it switches to mips32 mode. The linker knows which functions are mips16 and which are mips32. When relocation is performed on code labels, this lower order bit is set if the code label is a mips16 code label. In general this works just fine, however when creating exception handling tables and dwarf, there are cases where you don't want this lower order bit added in. This has been traditionally distinguished in gas assembly source by using a different syntax for the label. lab1: ; this will cause the lower order bit to be added lab2=. ; this will not cause the lower order bit to be added In some cases, it does not matter because in dwarf and debug tables the difference of two labels is used and in that case the lower order bits subtract each other out. To fix this, I have added to mcstreamer the notion of a debuglabel. The default is for label and debug label to be the same. So calling EmitLabel and EmitDebugLabel produce the same result. For various reasons, there is only one set of labels that needs to be modified for the mips exceptions to work. These are the "$eh_func_beginXXX" labels. Mips overrides the debug label suffix from ":" to "=." . This initial patch fixes exceptions. More changes most likely will be needed to DwarfCFException to make all of this work for actual debugging. These changes will be to emit debug labels in some places where a simple label is emitted now. Some historical discussion on this from gcc can be found at: http://gcc.gnu.org/ml/gcc-patches/2008-08/msg00623.html http://gcc.gnu.org/ml/gcc-patches/2008-11/msg01273.html llvm-svn: 170279	2012-12-16 04:00:45 +00:00
Eric Christopher	a2de826d29	To simplify some code move the unit emission into the holders. Make emitDIE public accordingly. No functional change. llvm-svn: 170258	2012-12-15 00:04:07 +00:00
Eric Christopher	16485a5164	Use begin and end label names from the section for info. llvm-svn: 170257	2012-12-15 00:04:04 +00:00
Patrik Hagglund	55d6f47a37	Change TargetLowering::getLoadExtAction to take an MVT, instead of EVT. llvm-svn: 170183	2012-12-14 09:05:13 +00:00
Jakob Stoklund Olesen	7bb2f97a90	Use the new MI bundling API in MachineInstrBundle itself. The new API is higher level than just manipulating the bundle flags directly, and the setIsInsideBundle() function will disappear soon. llvm-svn: 170159	2012-12-13 23:23:46 +00:00
David Blaikie	37fefc3f8d	Debug Info: add support to mark member variables as artificial This is the LLVM portion of r170154. llvm-svn: 170156	2012-12-13 22:43:07 +00:00
Patrik Hagglund	13abe5ec3c	Change TargetLowering::setTypeAction to take an MVT, instead fo EVT. llvm-svn: 170148	2012-12-13 20:42:43 +00:00
Patrik Hagglund	05394352c0	Change TargetLowering::getRepRegClassFor to take an MVT, instead of EVT. Accordingly, change RegDefIter to contain MVTs instead of EVTs. llvm-svn: 170140	2012-12-13 18:45:35 +00:00
Patrik Hagglund	5e6c361bc0	Change TargetLowering::getRegClassFor to take an MVT, instead of EVT. Accordingly, add helper funtions getSimpleValueType (in parallel to getValueType) in SDValue, SDNode, and TargetLowering. This is the first, in a series of patches. This is the second attempt. In the first attempt (r169837), a few getSimpleVT() were hoisted too far, detected by bootstrap failures. llvm-svn: 170104	2012-12-13 06:34:11 +00:00
Eric Christopher	996b2b7ae6	Use default label name for a section in emitting abbreviation section to help prep some code to be split about. llvm-svn: 170088	2012-12-13 03:00:38 +00:00
Evan Cheng	bf0baa9de7	Fix a bug in DAGCombiner::MatchBSwapHWord. Make sure the node has operands before referencing them. rdar://12868039 llvm-svn: 170078	2012-12-13 01:34:32 +00:00
Pedro Artigas	7212ee4534	Make the MCStreamer have a reset method and call that after finalization of the asm printer, also changed MCContext to a single reset only method for simplicity as requested on the list llvm-svn: 170041	2012-12-12 22:59:46 +00:00
Evan Cheng	b7d3d03bf9	Fix a logic bug in inline expansion of memcpy / memset with an overlapping load / store pair. It's not legal to use a wider load than the size of the remaining bytes if it's the first pair of load / store. llvm-svn: 170018	2012-12-12 20:43:23 +00:00
Evan Cheng	962711ee71	Sorry about the churn. One more change to getOptimalMemOpType() hook. Did I mention the inline memcpy / memset expansion code is a mess? This patch split the ZeroOrLdSrc argument into two: IsMemset and ZeroMemset. The first indicates whether it is expanding a memset or a memcpy / memmove. The later is whether the memset is a memset of zero. It's totally possible (likely even) that targets may want to do different things for memcpy and memset of zero. llvm-svn: 169959	2012-12-12 02:34:41 +00:00
Evan Cheng	c3d1aca657	- Rename isLegalMemOpType to isSafeMemOpType. "Legal" is a very overloade term. Also added more comments to explain why it is generally ok to return true. - Rename getOptimalMemOpType argument IsZeroVal to ZeroOrLdSrc. It's meant to be true for loaded source (memcpy) or zero constants (memset). The poor name choice is probably some kind of legacy issue. llvm-svn: 169954	2012-12-12 01:32:07 +00:00
Manman Ren	82751a105c	DAGCombine: clamp hi bit in APInt::getBitsSet to avoid assertion rdar://12838504 llvm-svn: 169951	2012-12-12 01:13:50 +00:00
Evan Cheng	04e5518783	Avoid using lossy load / stores for memcpy / memset expansion. e.g. f64 load / store on non-SSE2 x86 targets. llvm-svn: 169944	2012-12-12 00:42:09 +00:00
Evan Cheng	eb54240dc2	Replace TargetLowering::isIntImmLegal() with ScalarTargetTransformInfo::getIntImmCost() instead. "Legal" is a poorly defined term for something like integer immediate materialization. It is always possible to materialize an integer immediate. Whether to use it for memcpy expansion is more a "cost" conceern. llvm-svn: 169929	2012-12-11 23:26:14 +00:00
Eric Christopher	d692c1dbb7	Update some comments. llvm-svn: 169907	2012-12-11 19:42:09 +00:00
Joel Jones	24e440d045	Add comment for load folding llvm-svn: 169880	2012-12-11 16:10:25 +00:00
Patrik Hagglund	e98b7a0389	Revert EVT->MVT changes, r169836-169851, due to buildbot failures. llvm-svn: 169854	2012-12-11 11:14:33 +00:00
Patrik Hagglund	b31465b09b	Change RegVT in BitTestBlock and RegsForValue, to contain MVTs, instead of EVTs. llvm-svn: 169851	2012-12-11 10:24:48 +00:00
Patrik Hagglund	ad432a8e70	Change TargetLowering::getTypeForExtArgOrReturn to take and return MVTs, instead of EVTs. Accordingly, add bitsLT (and similar) to MVT. llvm-svn: 169850	2012-12-11 10:20:51 +00:00
Patrik Hagglund	d34337495e	Change a parameter of TargetLowering::getVectorTypeBreakdown to MVT, from EVT. llvm-svn: 169849	2012-12-11 10:16:19 +00:00
Patrik Hagglund	03e9628cfa	Change TargetLowering::RegisterTypeForVT to contain MVTs, instead of EVTs. llvm-svn: 169848	2012-12-11 10:09:23 +00:00
Patrik Hagglund	c50489e203	Change TargetLowering::TransformToType to contain MVTs, instead of EVTs. llvm-svn: 169847	2012-12-11 10:05:04 +00:00
Patrik Hagglund	8d2e7cf561	Change TargetLowering::findRepresentativeClass to take an MVT, instead of EVT. llvm-svn: 169845	2012-12-11 09:57:18 +00:00
Patrik Hagglund	ffb60f7c08	Change TargetLowering::getTypeToPromoteTo to take and return MVTs, instead of EVTs. llvm-svn: 169844	2012-12-11 09:54:23 +00:00
Patrik Hagglund	a970281106	Change TargetLowering::isCondCodeLegal to take an MVT, instead of EVT. llvm-svn: 169843	2012-12-11 09:51:27 +00:00
Patrik Hagglund	e3bec6365a	Change TargetLowering::getCondCodeAction to take an MVT, instead of EVT. llvm-svn: 169842	2012-12-11 09:48:14 +00:00
Patrik Hagglund	7ffcd226dd	Change TargetLowering::getTruncStoreAction to take MVTs, instead of EVTs. llvm-svn: 169841	2012-12-11 09:42:24 +00:00
Patrik Hagglund	cbc9d4d0f9	Change TargetLowering::getLoadExtAction to take an MVT, instead of EVT. llvm-svn: 169840	2012-12-11 09:39:09 +00:00
Patrik Hagglund	40e1afe970	Change TargetLowering::setTypeAction to take an MVT, instead fo EVT. llvm-svn: 169839	2012-12-11 09:32:56 +00:00
Patrik Hagglund	57b1694df1	Change TargetLowering::getRepRegClassFor to take an MVT, instead of EVT. Accordingly, change RegDefIter to contain MVTs instead of EVTs. llvm-svn: 169838	2012-12-11 09:31:43 +00:00
Patrik Hagglund	3708e548f8	Change TargetLowering::getRegClassFor to take an MVT, instead of EVT. Accordingly, add helper funtions getSimpleValueType (in parallel to getValueType) in SDValue, SDNode, and TargetLowering. This is the first, in a series of patches. llvm-svn: 169837	2012-12-11 09:10:33 +00:00
Chandler Carruth	b27041c50b	Fix a miscompile in the DAG combiner. Previously, we would incorrectly try to reduce the width of this load, and would end up transforming: (truncate (lshr (sextload i48 <ptr> as i64), 32) to i32) to (truncate (zextload i32 <ptr+4> as i64) to i32) We lost the sext attached to the load while building the narrower i32 load, and replaced it with a zext because lshr always zext's the results. Instead, bail out of this combine when there is a conflict between a sextload and a zext narrowing. The rest of the DAG combiner still optimize the code down to the proper single instruction: movswl 6(...),%eax Which is exactly what we wanted. Previously we read past the end and missed the sign extension: movl 6(...), %eax llvm-svn: 169802	2012-12-11 00:36:57 +00:00
Chad Rosier	df42cf39ab	Fall back to the selection dag isel to select tail calls. This shouldn't affect codegen for -O0 compiles as tail call markers are not emitted in unoptimized compiles. Testing with the external/internal nightly test suite reveals no change in compile time performance. Testing with -O1, -O2 and -O3 with fast-isel enabled did not cause any compile-time or execution-time failures. All tests were performed on my x86 machine. I'll monitor our arm testers to ensure no regressions occur there. In an upcoming clang patch I will be marking the objc_autoreleaseReturnValue and objc_retainAutoreleaseReturnValue as tail calls unconditionally. While it's theoretically true that this is just an optimization, it's an optimization that we very much want to happen even at -O0, or else ARC applications become substantially harder to debug. Part of rdar://12553082 llvm-svn: 169796	2012-12-11 00:18:02 +00:00
Eric Christopher	c8a310edc1	Refactor out the abbreviation handling into a separate class that controls each of the abbreviation sets (only a single one at the moment) and computes offsets separately as well for each set of DIEs. No real function change, ordering of abbreviations for the skeleton CU changed but only because we're computing in a separate order. Fix the testcase not to care. llvm-svn: 169793	2012-12-10 23:34:43 +00:00
Evan Cheng	79e2ca90bc	Some enhancements for memcpy / memset inline expansion. 1. Teach it to use overlapping unaligned load / store to copy / set the trailing bytes. e.g. On 86, use two pairs of movups / movaps for 17 - 31 byte copies. 2. Use f64 for memcpy / memset on targets where i64 is not legal but f64 is. e.g. x86 and ARM. 3. When memcpy from a constant string, do not replace the load with a constant if it's not possible to materialize an integer immediate with a single instruction (required a new target hook: TLI.isIntImmLegal()). 4. Use unaligned load / stores more aggressively if target hooks indicates they are "fast". 5. Update ARM target hooks to use unaligned load / stores. e.g. vld1.8 / vst1.8. Also increase the threshold to something reasonable (8 for memset, 4 pairs for memcpy). This significantly improves Dhrystone, up to 50% on ARM iOS devices. rdar://12760078 llvm-svn: 169791	2012-12-10 23:21:26 +00:00
Lang Hames	517fc8b264	Defer call to InitSections until after MCContext has been initialized. If InitSections is called before the MCContext is initialized it could cause duplicate temporary symbols to be emitted later (after context initialization resets the temporary label counter). llvm-svn: 169785	2012-12-10 22:49:11 +00:00
Eric Christopher	0aa4a670ad	Rearrange vars and make comments more obvious. llvm-svn: 169780	2012-12-10 22:25:41 +00:00
Eric Christopher	81d091eed9	Remove blank line at top of file. llvm-svn: 169779	2012-12-10 22:25:38 +00:00
Eric Christopher	200dd760fa	Fix a coding style nit. llvm-svn: 169776	2012-12-10 22:00:20 +00:00
Tom Stellard	30e2aa5015	LegalizeDAG: Allow type promotion of scalar loads llvm-svn: 169773	2012-12-10 21:41:58 +00:00
Tom Stellard	b785bd776c	LegalizeDAG: Allow type promotion for scalar stores llvm-svn: 169772	2012-12-10 21:41:54 +00:00
Eric Christopher	cdf218d606	Use the somewhat semantic term "split dwarf" it more matches what's going on and makes a lot of the terminology in comments make more sense. llvm-svn: 169758	2012-12-10 19:51:21 +00:00
Eric Christopher	8afd7b6066	Delete the FissionCU. llvm-svn: 169757	2012-12-10 19:51:18 +00:00
Eric Christopher	d79f5480ac	Reorder fission variables. llvm-svn: 169756	2012-12-10 19:51:13 +00:00
Hal Finkel	66859ae0f6	Use GetUnderlyingObjects in misched misched used GetUnderlyingObject in order to break false load/store dependencies, and the -enable-aa-sched-mi feature similarly relied on GetUnderlyingObject in order to ensure it is safe to use the aliasing analysis. Unfortunately, GetUnderlyingObject does not recurse through phi nodes, and so (especially due to LSR) all of these mechanisms failed for induction-variable-dependent loads and stores inside loops. This change replaces uses of GetUnderlyingObject with GetUnderlyingObjects (which will recurse through phi and select instructions) in misched. Andy reviewed, tested and simplified this patch; Thanks! llvm-svn: 169744	2012-12-10 18:49:16 +00:00
Craig Topper	d8005db486	Teach DAG combine to handle vector add/sub with vectors of all 0s. llvm-svn: 169727	2012-12-10 08:12:29 +00:00
Craig Topper	5ea3bdd75b	Remove extra blank line. llvm-svn: 169692	2012-12-09 08:20:52 +00:00
Craig Topper	a183ddb0fe	Teach DAG combine to handle vector logical operations with vectors of all 1s or all 0s. These cases can show up when vectors are split for legalizing. Fix some tests that were dependent on these cases not being combined. llvm-svn: 169684	2012-12-08 22:49:19 +00:00
Jakob Stoklund Olesen	fead62d4f4	Add higher-level API for dealing with bundled MachineInstrs. This is still a work in progress. The purpose is to make bundling and unbundling operations explicit, and to catch errors where bundles are broken or created inadvertently. The old IsInsideBundle flag is replaced by two MI flags: BundledPred which has the same meaning as IsInsideBundle, and BundledSucc which is set on instructions that are bundled with a successor. Having two flags provdes redundancy to detect when a bundle is inadvertently torn by a splice() or insert(), and it makes it possible to write bundle iterators that don't need to peek at adjacent instructions. The new flags can't be manipulated directly (once setIsInsideBundle is gone). Instead there are MI functions to make and break bundle bonds. The setIsInsideBundle function will be removed in a future commit. It should be replaced by bundleWithPred(). llvm-svn: 169583	2012-12-07 04:23:29 +00:00
Pedro Artigas	e84b13f039	fixed valgrind issues of prior commit, this change applies r169456 changes back to the tree with fixes. on darwin no valgrind issues exist in the tests that used to fail. original change description: change MCContext to work on the doInitialization/doFinalization model reviewed by Evan Cheng <evan.cheng@apple.com> llvm-svn: 169553	2012-12-06 22:12:44 +00:00
Evan Cheng	9ec512d768	Replace r169459 with something safer. Rather than having computeMaskedBits to understand target implementation of any_extend / extload, just generate zero_extend in place of any_extend for liveouts when the target knows the zero_extend will be implicit (e.g. ARM ldrb / ldrh) or folded (e.g. x86 movz). rdar://12771555 llvm-svn: 169536	2012-12-06 19:13:27 +00:00
Nadav Rotem	ac450eb59e	Fix a bug in the code that merges consecutive stores. Previously we did not check if loads that happen in between stores alias with the first store in the chain, only with the second store onwards. llvm-svn: 169516	2012-12-06 17:34:13 +00:00
Bill Wendling	3495f9b6dd	s/getLowerBoundDefault/getDefaultLowerBound/ for consistency. Also put the more natural check first in the if-then statement. llvm-svn: 169486	2012-12-06 07:55:19 +00:00
Bill Wendling	28fe9e7a36	Handle non-default array bounds. Some languages, e.g. Ada and Pascal, allow you to specify that the array bounds are different from the default (1 in these cases). If we have a lower bound that's non-default, then we emit the lower bound. We also calculate the correct upper bound in those cases. llvm-svn: 169484	2012-12-06 07:38:10 +00:00
NAKAMURA Takumi	d985d76040	Revert r169456, "change MCContext to work on the doInitialization/doFinalization model" It broke many builders. llvm-svn: 169462	2012-12-06 02:00:13 +00:00
Evan Cheng	5213139f48	Let targets provide hooks that compute known zero and ones for any_extend and extload's. If they are implemented as zero-extend, or implicitly zero-extend, then this can enable more demanded bits optimizations. e.g. define void @foo(i16* %ptr, i32 %a) nounwind { entry: %tmp1 = icmp ult i32 %a, 100 br i1 %tmp1, label %bb1, label %bb2 bb1: %tmp2 = load i16* %ptr, align 2 br label %bb2 bb2: %tmp3 = phi i16 [ 0, %entry ], [ %tmp2, %bb1 ] %cmp = icmp ult i16 %tmp3, 24 br i1 %cmp, label %bb3, label %exit bb3: call void @bar() nounwind br label %exit exit: ret void } This compiles to the followings before: push {lr} mov r2, #0 cmp r1, #99 bhi LBB0_2 @ BB#1: @ %bb1 ldrh r2, [r0] LBB0_2: @ %bb2 uxth r0, r2 cmp r0, #23 bhi LBB0_4 @ BB#3: @ %bb3 bl _bar LBB0_4: @ %exit pop {lr} bx lr The uxth is not needed since ldrh implicitly zero-extend the high bits. With this change it's eliminated. rdar://12771555 llvm-svn: 169459	2012-12-06 01:28:01 +00:00
Pedro Artigas	bf7d3bab26	change MCContext to work on the doInitialization/doFinalization model reviewed by Evan Cheng <evan.cheng@apple.com> llvm-svn: 169456	2012-12-06 00:50:55 +00:00
Andrew Trick	d3226eee03	RegPressureTracker::dump(): Remove unnecessary argument. llvm-svn: 169443	2012-12-05 23:05:22 +00:00
Andrew Trick	fda7a8832d	RegisterPressureTracker: fix findUseBetween to handle DebugValue llvm-svn: 169427	2012-12-05 21:37:50 +00:00
Andrew Trick	7bbcad7bcd	RegisterPressureTracker: unify virtual registers and physical regunits. Now that live register units are tracked individually, the code can be simplified. llvm-svn: 169426	2012-12-05 21:37:47 +00:00
Andrew Trick	7f7cee39ab	RegisterPresssureTracker: Track live physical register by unit. This is much simpler to reason about, more efficient, and fixes some corner cases involving implicit super-register defs. Fixed rdar://12797931. llvm-svn: 169425	2012-12-05 21:37:42 +00:00
Jakob Stoklund Olesen	a97cec790f	Remove unused MachineInstr constructors. A MachineInstr can only ever be constructed by CreateMachineInstr() and CloneMachineInstr(), and those factories don't use the removed constructors. llvm-svn: 169395	2012-12-05 18:27:39 +00:00
Pedro Artigas	41b98843e8	- Added calls to doInitialization/doFinalization to immutable passes - fixed ordering of calls to doFinalization to be the reverse of the pass run order due to potential dependencies - fixed machine module info to operate in the doInitialization/doFinalization model, also fixes some FIXMEs reviewed by Evan Cheng <evan.cheng@apple.com> llvm-svn: 169391	2012-12-05 17:12:22 +00:00
Andrew Trick	d52ab339cb	Added RegisterPressureTracker::dump() for debugging. llvm-svn: 169359	2012-12-05 06:47:08 +00:00
Jakob Stoklund Olesen	3cb2cb800f	Speed up the AllocationOrder class a bit. Allow the central functions to be inlined, and use the argumentless isHint() function when possible. llvm-svn: 169319	2012-12-04 22:25:16 +00:00
David Blaikie	67cb31ebdd	Comment change made in r169304 as requested by Eric Christopher. llvm-svn: 169315	2012-12-04 22:02:33 +00:00
Bill Wendling	d7767125d5	Use the 'count' attribute to calculate the upper bound of an array. The count attribute is more accurate with regards to the size of an array. It also obviates the upper bound attribute in the subrange. We can also better handle an unbound array by setting the count to -1 instead of the lower bound to 1 and upper bound to 0. llvm-svn: 169312	2012-12-04 21:34:03 +00:00
David Blaikie	5a773bb601	Reapply r160148 (reverted in r163570) fixing spurious breakpoints in modern GDB This reapplies the fix for PR13303 now with more justification. Based on my execution of the GDB 7.5 test suite this results in: expected passes: 16101 -> 20890 (+30%) unexpected failures: 4826 -> 637 (-77%) There are 23 checks that used to pass and now fail. They are all in gdb.reverse. Investigating a few looks like they were accidentally passing due to extra breakpoints being set by this bug. They're generally due to the difference in end location between gcc and clang, the test suite is trying to set breakpoints on the closing '}' that clang doesn't associate with any instructions. llvm-svn: 169304	2012-12-04 21:05:36 +00:00
Chandler Carruth	802d755533	Sort includes for all of the .h files under the 'lib' tree. These were missed in the first pass because the script didn't yet handle include guards. Note that the script is now able to handle all of these headers without manual edits. =] llvm-svn: 169224	2012-12-04 07:12:27 +00:00
Bill Wendling	bfc0e5725f	Add a 'count' field to the DWARF subrange. The count field is necessary because there isn't a difference between the 'lo' and 'hi' attributes for a one-element array and a zero-element array. When the count is '0', we know that this is a zero-element array. When it's >=1, then it's a normal constant sized array. When it's -1, then the array is unbounded. llvm-svn: 169218	2012-12-04 06:20:49 +00:00
Jakub Staszak	ae551a853d	Simplify code. No functionality change. llvm-svn: 169198	2012-12-04 01:00:52 +00:00
Manman Ren	f563941adc	Stack Alignment: when creating stack objects in MachineFrameInfo, make sure the alignment is clamped to TargetFrameLowering.getStackAlignment if the target does not support stack realignment or the option "realign-stack" is off. This will cause miscompile if the address is treated as aligned and add is replaced with or in DAGCombine. Added a bool StackRealignable to TargetFrameLowering to check whether stack realignment is implemented for the target. Also added a bool RealignOption to MachineFrameInfo to check whether the option "realign-stack" is on. rdar://12713765 llvm-svn: 169197	2012-12-04 00:52:33 +00:00
Jakub Staszak	bac8ae6506	Use dyn_cast instead of isa and cast. No functionality change. llvm-svn: 169196	2012-12-04 00:50:06 +00:00
Jakob Stoklund Olesen	084665fa6d	Remove VirtRegMap::getRegAllocPref(). Now that there can be multiple hint registers from targets, it doesn't make sense to have a function that returns 'the' preferred register. llvm-svn: 169190	2012-12-04 00:35:59 +00:00
Jakob Stoklund Olesen	1dd82dd3fc	Use MRI::getSimpleHint() instead of getRegAllocPref() in remaining cases. Targets can provide multiple hints now, so getRegAllocPref() doesn't make sense any longer because it only returns one preferred register. Replace it with getSimpleHint() in the remaining heuristics. This function only llvm-svn: 169188	2012-12-04 00:30:22 +00:00
Manman Ren	26c73f93e0	Stack Alignment: move functions from header file MachineFrameInfo.h. No functional change for this commit. The follow-up patch will add more stuff to these functions. rdar://12713765 llvm-svn: 169186	2012-12-04 00:26:44 +00:00
Jakob Stoklund Olesen	74052b041b	Add VirtRegMap::hasKnownPreference(). Virtual registers with a known preferred register are prioritized by RAGreedy. This function makes the condition explicit without depending on getRegAllocPref(). llvm-svn: 169179	2012-12-03 23:23:50 +00:00
Jakob Stoklund Olesen	c784a1f906	Use the new getRegAllocationHints() hook from AllocationOrder. This simplifies the hinting code quite a bit while making the targets easier to write at the same time. llvm-svn: 169173	2012-12-03 22:51:04 +00:00
Pedro Artigas	e4348b0412	moves doInitialization and doFinalization to the Pass class and removes some unreachable code in MachineModuleInfo reviewed by Evan Cheng <evan.cheng@apple.com> llvm-svn: 169164	2012-12-03 21:56:57 +00:00
Jakob Stoklund Olesen	499cac486a	Add a new hook for providing register allocator hints more flexibly. The TargetRegisterInfo::getRegAllocationHints() function is going to replace the existing mechanisms for providing target-dependent hints to the register allocator: ResolveRegAllocHint() and getRawAllocationOrder(). The new hook is more flexible because it allows the target to provide multiple preferred candidate registers for each virtual register, and it is easier to use because targets are not required to return a reference to a constant array like getRawAllocationOrder(). An optional VirtRegMap argument can be used to provide target-dependent hints that depend on the provisional assignments of other virtual registers. llvm-svn: 169154	2012-12-03 21:17:00 +00:00
Eli Bendersky	b42d1466a0	Fix PR12942: Allow two CUs to be generated from the same source file. Thanks Eric for the review. llvm-svn: 169142	2012-12-03 18:45:45 +00:00
Chandler Carruth	ed0881b2a6	Use the new script to sort the includes of every file under lib. Sooooo many of these had incorrect or strange main module includes. I have manually inspected all of these, and fixed the main module include to be the nearest plausible thing I could find. If you own or care about any of these source files, I encourage you to take some time and check that these edits were sensible. I can't have broken anything (I strictly added headers, and reordered them, never removed), but they may not be the headers you'd really like to identify as containing the API being implemented. Many forward declarations and missing includes were added to a header files to allow them to parse cleanly when included first. The main module rule does in fact have its merits. =] llvm-svn: 169131	2012-12-03 16:50:05 +00:00
Nadav Rotem	1157e1410c	Allow merging multiple store sequences on the same chain. llvm-svn: 169111	2012-12-02 17:14:09 +00:00
Andrew Trick	b767d1eba8	misched: Fix RegisterPressureTracker handling of DebugVals. Assertion failed: (TopRPTracker.getPos() == RegionBegin && "bad initial Top tracker"). rdar://12790302. llvm-svn: 169072	2012-12-01 01:22:49 +00:00
Andrew Trick	d5953622ce	misched: Fix the DAG builder to handle an undef operand at ExitSU. Assertion failed: (VNI && "No value to read by operand") rdar://12790267. llvm-svn: 169071	2012-12-01 01:22:44 +00:00
Andrew Trick	a01302182c	misched: Fix LiveInterval update to better handle DebugVal. Assertion failed: (itr != mi2iMap.end() && "Instruction not found in maps.") rdar://12777252. llvm-svn: 169070	2012-12-01 01:22:41 +00:00
Andrew Trick	e7ea8aa48a	misched: fix RegionBegin when DebugValues get shuffled to the top. assert (RemainingInstrs == 0 && "Instruction count mismatch!") rdar://12776937. llvm-svn: 169069	2012-12-01 01:22:38 +00:00
Jakob Stoklund Olesen	da2b6b381a	Simplify REG_SEQUENCE lowering. The TwoAddressInstructionPass takes the machine code out of SSA form by expanding REG_SEQUENCE instructions into copies. It is no longer necessary to rewrite the registers used by a REG_SEQUENCE instruction because the new coalescer algorithm can do it now. REG_SEQUENCE is just converted to a sequence of sub-register copies now. llvm-svn: 169067	2012-12-01 01:06:44 +00:00
Eric Christopher	9c2ecd93d0	Add some first skeleton work for the DWARF5 Fission proposal. Emit part of the compile unit CU and start separating out information into the various sections that will be pulled out later. WIP. llvm-svn: 169061	2012-11-30 23:59:06 +00:00
Jakob Stoklund Olesen	bb1e98318f	Convert COPY instructions into KILLs if they have implicit defs. MachineCopyPropagation doesn't understand super-register liveness well enough to be able to remove implicit defs of super-registers. This fixes a problem in ARM/2012-01-26-CopyPropKills.ll that is exposed by an future TwoAddressInstructionPass change. The KILL instructions are removed before the machine code is emitted. llvm-svn: 169060	2012-11-30 23:53:00 +00:00
Bill Wendling	c786b31233	Replace r168930 with a more reasonable patch. The original patch removed a bunch of code that the SjLjEHPrepare pass placed into the entry block if all of the landing pads were removed during the CodeGenPrepare class. The more natural way of doing things is to run the CGP before we run the SjLjEHPrepare pass. Make it so! llvm-svn: 169044	2012-11-30 22:08:55 +00:00
Eric Christopher	42e3994e77	More comment. llvm-svn: 168952	2012-11-29 22:56:13 +00:00
Justin Holewinski	edec332437	Cleanup recent addition of DAGTypeLegalizer::SplitVecOp_VSELECT llvm-svn: 168932	2012-11-29 19:42:09 +00:00
Benjamin Kramer	aa598b3be6	misched: Recompute priority queue when DFSResults are updated. This was found by MSVC10's STL debug mode on a test from the test suite. Sadly std::is_heap isn't standard so there is no way to assert this without writing our own heap verify, which looks like overkill to me. llvm-svn: 168885	2012-11-29 14:36:26 +00:00
Justin Holewinski	0ac49bf846	Teach the legalizer how to handle operands for VSELECT nodes If we need to split the operand of a VSELECT, it must be the mask operand. We split the entire VSELECT operand with EXTRACT_SUBVECTOR. llvm-svn: 168883	2012-11-29 14:26:28 +00:00
Justin Holewinski	bc45119b44	Allow targets to prefer TypeSplitVector over TypePromoteInteger when computing the legalization method for vectors For some targets, it is desirable to prefer scalarizing <N x i1> instead of promoting to a larger legal type, such as <N x i32>. llvm-svn: 168882	2012-11-29 14:26:24 +00:00
Jakob Stoklund Olesen	bdb55e0c59	Use MCPhysReg for RegisterClassInfo allocation orders. This saves a bit of memory. llvm-svn: 168852	2012-11-29 03:34:17 +00:00
Jakob Stoklund Olesen	546e9e85f1	Avoid rewriting instructions twice. This could cause miscompilations in targets where sub-register composition is not always idempotent (ARM). <rdar://problem/12758887> llvm-svn: 168837	2012-11-29 00:26:11 +00:00
Nadav Rotem	307d767177	When combining consecutive stores allow loads in between the stores, if the loads do not alias. llvm-svn: 168832	2012-11-29 00:00:08 +00:00
Jakob Stoklund Olesen	26c9d70d28	Make the LiveRegMatrix analysis available to targets. No functional change, just moved header files. Targets can inject custom passes between register allocation and rewriting. This makes it possible to tweak the register allocation before rewriting, using the full global interference checking available from LiveRegMatrix. llvm-svn: 168806	2012-11-28 19:13:06 +00:00
Andrew Trick	48d392e81e	misched: Analysis that partitions the DAG into subtrees. This is a simple, cheap infrastructure for analyzing the shape of a DAG. It recognizes uniform DAGs that take the shape of bottom-up subtrees, such as the included matrix multiplication example. This is useful for heuristics that balance register pressure with ILP. Two canonical expressions of the heuristic are implemented in scheduling modes: -misched-ilpmin and -misched-ilpmax. llvm-svn: 168773	2012-11-28 05:13:28 +00:00
Andrew Trick	cd1c2f9fb1	misched: rename ScheduleDAGILP to ScheduleDFS to prepare for other heuristics. llvm-svn: 168772	2012-11-28 05:13:24 +00:00
Andrew Trick	0be19363d1	misched: better alias analysis. This fixes a hole in the "cheap" alias analysis logic implemented within the DAG builder itself, regardless of whether proper alias analysis is enabled. It now handles this pattern produced by LSR+CodeGenPrepare. %sunkaddr1 = ptrtoint * %obj to i64 %sunkaddr2 = add i64 %sunkaddr1, %lsr.iv %sunkaddr3 = inttoptr i64 %sunkaddr2 to i32* store i32 %v, i32* %sunkaddr3 llvm-svn: 168768	2012-11-28 03:42:49 +00:00
Andrew Trick	cf7e6971e8	misched: Debug output fix. Use an always valid iterator. llvm-svn: 168767	2012-11-28 03:42:47 +00:00
Jakob Stoklund Olesen	c351aed4b1	Move the guts of TargetInstrInfoImpl into the TargetInstrInfo class. The *Impl class no longer serves a purpose now that the super-class implementation is in CodeGen. llvm-svn: 168759	2012-11-28 02:35:13 +00:00
Jakob Stoklund Olesen	fcf14e8436	Move Target{Instr,Register}Info.cpp into lib/CodeGen. The Target library is not allowed to depend on the large CodeGen library, but the TRI and TII classes provide abstract interfaces that require both caller and callee to link to CodeGen. The implementation files for these classes provide default implementations of some of the hooks. These methods may need to reference CodeGen, so they belong in that library. We already have a number of methods implemented in the TargetInstrInfoImpl sub-class because of that. I will merge that class into the parent next. llvm-svn: 168758	2012-11-28 02:35:09 +00:00
Chad Rosier	ed119d542b	Revert r168630, r168631, and r168633 as these are causing nightly test failures. llvm-svn: 168751	2012-11-28 00:21:29 +00:00
Eric Christopher	acdcbdb17d	Attempt to make the comments for dwarf debug look more like the coding standard would like. llvm-svn: 168737	2012-11-27 22:43:45 +00:00
Eric Christopher	95198f5035	Reapply section moving, make sure string section is output last. llvm-svn: 168736	2012-11-27 22:43:42 +00:00
Manman Ren	f89406ac78	CSE: allow PerformTrivialCoalescing to check copies across basic block boundaries. Given the following case: BB0 %vreg1<def> = SUBrr %vreg0, %vreg7 %vreg2<def> = COPY %vreg7 BB1 %vreg10<def> = SUBrr %vreg0, %vreg2 We should be able to CSE between SUBrr in BB0 and SUBrr in BB1. rdar://12462006 llvm-svn: 168717	2012-11-27 18:58:41 +00:00
Jakub Staszak	38e2f52e85	Remove duplicated #includes. llvm-svn: 168712	2012-11-27 18:27:14 +00:00
Ulrich Weigand	e5f9405842	Never use .lcomm on platforms where it does not accept an alignment argument. Instead, use a pair of .local and .comm directives. This avoids spurious differences between binaries built by the integrated assembler vs. those built by the external assembler, since the external assembler may impose alignment requirements on .lcomm symbols where the integrated assembler does not. llvm-svn: 168704	2012-11-27 16:11:16 +00:00
Eric Christopher	6e20a16829	Revert rearrangement of debug info sections to unblock the bots and O0 + debug codegen. llvm-svn: 168680	2012-11-27 06:49:23 +00:00
Jakub Staszak	8262b885da	Remove unneeded #include. llvm-svn: 168670	2012-11-27 02:00:27 +00:00
Jakub Staszak	508888e446	Remove unneeded #include. llvm-svn: 168664	2012-11-27 01:22:15 +00:00
NAKAMURA Takumi	2e4a30709d	llvm/CodeGen: Remove empty files in r168659. llvm-svn: 168663	2012-11-27 01:21:50 +00:00
Jakub Staszak	08a28d248f	Remove unused forward declaration. llvm-svn: 168660	2012-11-27 01:16:37 +00:00
Jakub Staszak	0820b2a360	Remove unused MachineLoopRanges analysis. llvm-svn: 168659	2012-11-27 01:14:34 +00:00
Eric Christopher	69e328e5bd	Make comment names match function names. llvm-svn: 168644	2012-11-27 00:41:57 +00:00
Eric Christopher	4c9b119d64	Add in sections for the fission case (no change so incorrect) and add a TODO for starting. llvm-svn: 168643	2012-11-27 00:41:54 +00:00
Eric Christopher	c800b12bae	Reorder section output ordering. llvm-svn: 168638	2012-11-27 00:13:58 +00:00
Eric Christopher	735401cf29	Whitespace cleanup. llvm-svn: 168637	2012-11-27 00:13:51 +00:00
Chad Rosier	110b73e0e5	Add an assertion to ensure freezeReservedRegs() is only ever called once. llvm-svn: 168633	2012-11-26 23:37:07 +00:00
Chad Rosier	f8a3a62cdb	Now that the X86 Maximal Stack Alignment Check pass has been removed (i.e., r168627), we no longer need to call the freezeReservedRegs() function a second time. Previously, this pass was conservatively adding the FP to the set of reserved registers, requiring the second update to the reserved registers. rdar://12719844 llvm-svn: 168631	2012-11-26 23:25:41 +00:00
Chad Rosier	a44e1825a3	Now that the X86 Maximal Stack Alignment Check pass has been removed (i.e., r168627), we no longer need to call the freezeReservedRegs() function a second time. Previously, this pass was conservatively adding the FP to the set of reserved registers, requiring the second update to the reserved registers. rdar://12719844 llvm-svn: 168630	2012-11-26 23:14:37 +00:00
Jakub Staszak	f18753b8d0	Don't use iterator after being erased. llvm-svn: 168622	2012-11-26 22:14:19 +00:00
Jakub Staszak	e25344225d	Remove unneeded #includes. llvm-svn: 168608	2012-11-26 21:04:19 +00:00
Craig Topper	79bd205d8c	Refactor to make helper method static. llvm-svn: 168557	2012-11-25 08:08:58 +00:00
Craig Topper	268b62288e	Remove duplicate check of LimitFloatPrecision. It was already checked earlier before IsExp10 could be set to true. llvm-svn: 168553	2012-11-25 00:48:58 +00:00
Craig Topper	8571944cf1	Factor common code out of individual if blocks into common tail. llvm-svn: 168551	2012-11-25 00:15:07 +00:00
Craig Topper	d374694b07	Remove redundant calls to getCurDebugLoc in visitIntrinsicCall. It's already called at the start of the function and captured in a local variable. llvm-svn: 168548	2012-11-24 23:05:23 +00:00
Craig Topper	d2638c1894	Refactor a bit to make some helper methods static. llvm-svn: 168546	2012-11-24 18:52:06 +00:00
Craig Topper	4a98175800	Factor some common code out of individual if blocks. llvm-svn: 168538	2012-11-24 08:22:37 +00:00
Craig Topper	bef254ab16	Refactor a bit to make some helper functions static. llvm-svn: 168524	2012-11-23 18:38:31 +00:00
Patrik Hägglund	f77cc055cd	Cleanup: Simplify loop end logic in computeRegisterProperties(). llvm-svn: 168507	2012-11-23 08:35:04 +00:00
Eli Bendersky	26e7efeb1a	Fix 80-col violation llvm-svn: 168498	2012-11-22 14:10:40 +00:00
Lang Hames	e9541c820a	llvm.fmuladd.* lowering should be checking isOperationLegalOrCustom, rather than isOperationLegal. Thanks to Craig Topper for pointing this out. llvm-svn: 168485	2012-11-22 03:31:45 +00:00
Eric Christopher	960ac37832	Pull some code out into functions to make rearranging them a bit easier. llvm-svn: 168481	2012-11-22 00:59:49 +00:00
Eric Christopher	92331fde8c	Whitespace. llvm-svn: 168402	2012-11-21 00:34:38 +00:00
Eric Christopher	7b30f2e43b	Update for some of the coding standard before rearranging functions around. llvm-svn: 168401	2012-11-21 00:34:35 +00:00
Eric Christopher	5d1cf930df	Update some comments. llvm-svn: 168400	2012-11-21 00:17:49 +00:00
Eric Christopher	55c5181525	Update and add some comments. llvm-svn: 168399	2012-11-21 00:03:31 +00:00
Eric Christopher	27527b2b92	Whitespace. llvm-svn: 168398	2012-11-21 00:03:28 +00:00
Eric Christopher	383719592a	Remove constness from this, it modifies the output stream as does everything else underneath. llvm-svn: 168395	2012-11-20 23:30:11 +00:00
Eric Christopher	1f0cbb826f	Remove unused function argument, add a bit to the comment. llvm-svn: 168387	2012-11-20 22:14:13 +00:00
Eric Christopher	1d6bd41ee6	Formatting. llvm-svn: 168384	2012-11-20 20:34:47 +00:00
Eric Christopher	7c718e41c7	Whitespace. llvm-svn: 168383	2012-11-20 20:34:44 +00:00
Tim Northover	dd219d06c2	Fix physical register liveness calculations: + Take account of clobbers + Give outputs priority over inputs since they happen later. llvm-svn: 168360	2012-11-20 09:56:11 +00:00
Eric Christopher	58f4195942	Remove a function argument and propagate const around accordingly. llvm-svn: 168338	2012-11-19 22:42:15 +00:00
Eric Christopher	6a8413853f	Whitespace and 80-col. llvm-svn: 168337	2012-11-19 22:42:10 +00:00
Anton Korobeynikov	097b0e9d6a	Make AsmPrinter::EmitTTypeReference() more robust - put the zero GV check inside, so we won't forget it at the caller side. llvm-svn: 168328	2012-11-19 21:17:20 +00:00
Anton Korobeynikov	f65a638d94	Factor out type info emission into separate routine. It turned out that ARM wants different layout of type infos. This is yet another patch in attempt to fix PR7187 llvm-svn: 168325	2012-11-19 21:06:26 +00:00
Eric Christopher	cebb0ec764	Move section label emission to module end. Nothing should be depending on them being emitted before the text and/or data sections and testing didn't uncover any. llvm-svn: 168321	2012-11-19 19:43:59 +00:00
Jakob Stoklund Olesen	31ebe55808	Handle mixed normal and early-clobber defs on inline asm. PR14376. llvm-svn: 168320	2012-11-19 19:31:10 +00:00
Craig Topper	36f29122ef	Move else onto line with preceding closing brace. llvm-svn: 168294	2012-11-19 00:11:50 +00:00
Andrew Trick	28c000b234	Broaden isSchedulingBoundary to check aliases of SP. On PPC the stack pointer is X1, but ADJCALLSTACK writes R1. Fixes PR14315: Register regmask dependency problem with misched. llvm-svn: 168248	2012-11-17 03:35:11 +00:00
Eli Friedman	30834940ec	Mark FP_EXTEND form v2f32 to v2f64 as "expand" for ARM NEON. Patch by Pete Couperus. llvm-svn: 168240	2012-11-17 01:52:46 +00:00
Andrew Trick	9d0a1ae946	Use array_pod_sort instead of std::sort. llvm-svn: 168203	2012-11-16 21:33:38 +00:00
Craig Topper	ed756c5fc8	Remove conditions from 'else if' that were guaranteed by preceding 'if'. llvm-svn: 168191	2012-11-16 20:01:39 +00:00
Craig Topper	3669de4c97	Factor out the final FADD that's common to multiple code paths in the visitLog* functions. llvm-svn: 168183	2012-11-16 19:08:44 +00:00
Craig Topper	ae89426f07	Factor some common code to reduce compile size. llvm-svn: 168143	2012-11-16 07:48:23 +00:00
Eli Friedman	e6385e61b5	Mark FP_ROUND for converting NEON v2f64 to v2f32 as expand. Add a missing case to vector legalization so this actually works. Patch by Pete Couperus. Fixes PR12540. llvm-svn: 168107	2012-11-15 22:44:27 +00:00
Ulrich Weigand	dcee8ce8ed	Use std::stable_sort instead of std::sort when sorting stack slots to guarantee deterministic code generation. llvm-svn: 168074	2012-11-15 19:33:30 +00:00
Chad Rosier	2463f67c49	[reg scavenger] Fix the isUsed/isAliasUsed functions so as to not report a false positive. In this particular case, R6 was being spilled by the register scavenger when it was in fact dead. The isUsed function reported R6 as used because the R6_R7 alias was reserved (due to the fact that we've reserved R7 as the FP). The solution is to only check if the original register (i.e., R6) isReserved and not the aliases. The aliases are only checked to make sure they're available. The test case is derived from one of the nightly tester benchmarks and is rather intractable and difficult to reproduce, so I haven't included it. rdar://12592448 llvm-svn: 168054	2012-11-15 18:13:20 +00:00
Sergei Larin	e822148c80	Fix indeterminism in MI scheduler DAG construction. Similarly to several recent fixes throughout the code replace std::map use with the MapVector. Add find() method to the MapVector. llvm-svn: 168051	2012-11-15 17:45:50 +00:00
Craig Topper	61d045781a	Add llvm.ceil, llvm.trunc, llvm.rint, llvm.nearbyint intrinsics. llvm-svn: 168025	2012-11-15 06:51:10 +00:00
Andrew Trick	449eb3f3be	Fix an obvious merge bug in -join-globalcopies (disabled). Jakub Staszak spotted this in review. I don't notice these things until I manually rerun benchmarks. But reducing unit tests is a very high priority. llvm-svn: 168021	2012-11-15 02:32:22 +00:00
Jakub Staszak	ab0139cb90	Use reserve() to avoid vector reallocation. llvm-svn: 167991	2012-11-14 22:42:17 +00:00
Jakub Staszak	542db4a0bc	canJoinPhys method doesn't modify CoalescerPair. Make it const. llvm-svn: 167972	2012-11-14 20:31:04 +00:00
Chad Rosier	e18e4add6c	Remove dead code. llvm-svn: 167970	2012-11-14 20:25:37 +00:00
Anton Korobeynikov	b619a4138d	Fix really stupid ARM EHABI info generation bug: we should not emit eh table and handler data if there are no landing pads in the function. Patch by Logan Chien with some cleanups from me. llvm-svn: 167945	2012-11-14 19:13:30 +00:00
Craig Topper	04a5cc39f4	Add newlines to end of debug messages. llvm-svn: 167913	2012-11-14 05:20:09 +00:00
Rafael Espindola	c79532d101	Handle DAG CSE adding new uses during ReplaceAllUsesWith. Fixes PR14333. llvm-svn: 167912	2012-11-14 05:08:56 +00:00
Anton Korobeynikov	e42af3699b	Use TARGET2 relocation for TType references on ARM. Do some cleanup of the code while here. Inspired by patch by Logan Chien! llvm-svn: 167904	2012-11-14 01:47:00 +00:00
Eric Christopher	0f23b82147	Revert "Use the 'count' attribute instead of the 'upper_bound' attribute." temporarily as it is breaking the gdb bots. This reverts commit r167806/e7ff4c14b157746b3e0228d2dce9f70712d1c126. llvm-svn: 167886	2012-11-13 23:30:43 +00:00
Andrew Trick	459d891a43	Revert -join-splitedges to a boolean cmd line option. llvm-svn: 167880	2012-11-13 22:19:48 +00:00
Andrew Trick	47d58ce0df	The MachineScheduler does not currently require JoinSplitEdges. This option will eventually either be enabled unconditionally or replaced by a more general live range splitting optimization. llvm-svn: 167879	2012-11-13 22:15:40 +00:00
Michael J. Spencer	f1aef758a7	[MC][COFF] Emit weak symbols to the correct section. Patch by Dmitry Puzirev! llvm-svn: 167877	2012-11-13 22:04:09 +00:00
Ulrich Weigand	3946877f88	Do not consider a machine instruction that uses and defines the same physical register as candidate for common subexpression elimination in MachineCSE. This fixes a bug on PowerPC in MultiSource/Applications/oggenc/oggenc caused by MachineCSE invalidly merging two separate DYNALLOC insns. llvm-svn: 167855	2012-11-13 18:40:58 +00:00
Andrew Trick	449c7ad7d7	Fix -join-splitedges: my previous "cleanup" broke it. Working on reducing unit tests. This won't be enabled unless a subtarget enables misched. llvm-svn: 167851	2012-11-13 17:37:46 +00:00
Duncan Sands	b8d3caf65a	Codegen support for arbitrary vector getelementptrs. llvm-svn: 167830	2012-11-13 13:01:58 +00:00
Andrew Trick	108c88c5b7	misched: Allow subtargets to enable misched and dependent options. This allows me to begin enabling (or backing out) misched by default for one subtarget at a time. To run misched we typically want to: - Disable SelectionDAG scheduling (use the source order scheduler) - Enable more aggressive coalescing (until we decide to always run the coalescer this way) - Enable MachineScheduler pass itself. Disabling PostRA sched may follow for some subtargets. llvm-svn: 167826	2012-11-13 08:47:29 +00:00
Andrew Trick	40534fe9a5	Added RegisterCoalescer support for joining global copies first. This adds the -join-globalcopies option which can be enabled by default once misched is also enabled. Ideally, the register coalescer would be able to split local live ranges in a way that produces copies that can be easily resolved by the scheduler. Until then, this heuristic should be good enough to at least allow the scheduler to run after coalescing. llvm-svn: 167825	2012-11-13 08:47:25 +00:00
Andrew Trick	4b1f9e3bac	misched: Don't consider artificial edges weak edges. For now be more conservative in case other out-of-tree schedulers rely on the old behavior of artificial edges. llvm-svn: 167808	2012-11-13 02:35:06 +00:00
Bill Wendling	f454dfb6b5	Use the 'count' attribute instead of the 'upper_bound' attribute. If we have a type 'int a[1]' and a type 'int b[0]', the generated DWARF is the same for both of them because we use the 'upper_bound' attribute. Instead use the 'count' attrbute, which gives the correct number of elements in the array. <rdar://problem/12566646> llvm-svn: 167806	2012-11-13 02:31:47 +00:00
Andrew Trick	edac22a9f3	Cleanup the main RegisterCoalescer loop. Block priorities still apply outside loops. llvm-svn: 167793	2012-11-13 00:34:44 +00:00
Andrew Trick	c25d3fe71e	Cleanup -join-splitedges. Make the loop more obvious. llvm-svn: 167785	2012-11-12 23:59:48 +00:00
Eric Christopher	2942431175	Add an option to enable prototype "fission" capabilities and debug changes. llvm-svn: 167765	2012-11-12 22:22:20 +00:00
Andrew Trick	22d688a29c	Added a temporary option to avoid critical edges splitting. This teaches the register coalescer to be less prone to split critical edges. I am currently benchmarking this with the new (post-coalescer) scheduler. I plan to enable this by default and remove the option as soon as misched is enabled. llvm-svn: 167758	2012-11-12 21:42:40 +00:00
Andrew Trick	ec369d5316	misched: rename interfaceto avoid gcc warnings llvm-svn: 167753	2012-11-12 21:28:10 +00:00
Andrew Trick	263280248a	misched: Target-independent support for MacroFusion. Uses the infrastructure from r167742 to support clustering instructure that the target processor can "fuse". e.g. cmp+jmp. Next step: target hook implementations with test cases, and enable. llvm-svn: 167744	2012-11-12 19:52:20 +00:00
Andrew Trick	a7714a0ff9	misched: Target-independent support for load/store clustering. This infrastructure is generally useful for any target that wants to strongly prefer two instructions to be adjacent after scheduling. A following checkin will add target-specific hooks with unit tests. Then this feature will be enabled by default with misched. llvm-svn: 167742	2012-11-12 19:40:10 +00:00
Andrew Trick	f1ff84c64e	misched: Infrastructure for weak DAG edges. This adds support for weak DAG edges to the general scheduling infrastructure in preparation for MachineScheduler support for heuristics based on weak edges. llvm-svn: 167738	2012-11-12 19:28:57 +00:00
Jakob Stoklund Olesen	13d5562963	Fix assertions in updateRegMaskSlots(). The RegMaskSlots contains 'r' slots while NewIdx and OldIdx are 'B' slots. This broke the checks in the assertions. This fixes PR14302. llvm-svn: 167625	2012-11-09 19:18:49 +00:00
Benjamin Kramer	c280f41864	Silence GCC warning about falling off the end of a non-void function. llvm-svn: 167618	2012-11-09 15:45:22 +00:00
Andrew Trick	3ca33acb95	misched: Heuristics based on the machine model. misched is disabled by default. With -enable-misched, these heuristics balance the schedule to simultaneously avoid saturating processor resources, expose ILP, and minimize register pressure. I've been analyzing the performance of these heuristics on everything in the llvm test suite in addition to a few other benchmarks. I would like each heuristic check to be verified by a unit test, but I'm still trying to figure out the best way to do that. The heuristics are still in considerable flux, but as they are refined we should be rigorous about unit testing the improvements. llvm-svn: 167527	2012-11-07 07:05:09 +00:00
Andrew Trick	e145559b70	misched: handle on-the-fly regpressure queries better for 2-addr instructions without relying on liveintervals. llvm-svn: 167526	2012-11-07 07:05:05 +00:00
Bill Wendling	f720bf64d4	Add comment describing what's going on here. llvm-svn: 167525	2012-11-07 05:19:04 +00:00
Bill Wendling	d9bb9b611b	When we're updating the subprogram scope DIE, we want to determine if we're updating an abstract DIE or not. If we are, then we use that. Its children will be added on later, as well as the object pointer attribute. Otherwise, this function may be called with a concrete DIE twice and adding the children and object pointer attribute to it twice. <rdar://problem/12401423&12600340> llvm-svn: 167524	2012-11-07 04:42:18 +00:00
Chad Rosier	8d2c229006	[regallocfast] Make sure the MachineRegisterInfo is aware of clobbers from a register masks. This is an obvious and necessary fix for a soon to be committed patch. No test case possible at this time. Reviewed by Jakob. llvm-svn: 167498	2012-11-06 22:52:42 +00:00
Andrew Trick	e96390ea96	misched: TargetSchedule interface for machine resources. Expose the processor resources defined by the machine model to the scheduler and other clients through the TargetSchedule interface. Normalize each resource count with respect to other kinds of resources. This allows scheduling heuristics to balance resources against other kinds of resources and latency. llvm-svn: 167444	2012-11-06 07:10:38 +00:00
Andrew Trick	4d1fa712ac	misched: Rename RemainingCount to avoid confusion with remaining resources. llvm-svn: 167443	2012-11-06 07:10:34 +00:00
Andrew Trick	baeaabb2d0	ScheduleDAG interface. Added OrderKind to distinguish nonregister dependencies. This is in preparation for adding "weak" DAG edges, but generally simplifies the design. llvm-svn: 167435	2012-11-06 03:13:46 +00:00
Owen Anderson	15fd6ac4ba	Be careful not to optimize a SELECT_CC into a SETCC post-legalization if the SETCC node would be illegal. llvm-svn: 167344	2012-11-03 00:17:26 +00:00
Manman Ren	3d5af279b1	OutputArg: added an index of the original argument to match the change to InputArg in r165616. This will enable us to get the actual type for both InputArg and OutputArg. rdar://9932559 llvm-svn: 167265	2012-11-01 23:49:58 +00:00
Chandler Carruth	5da3f0512e	Revert the majority of the next patch in the address space series: r165941: Resubmit the changes to llvm core to update the functions to support different pointer sizes on a per address space basis. Despite this commit log, this change primarily changed stuff outside of VMCore, and those changes do not carry any tests for correctness (or even plausibility), and we have consistently found questionable or flat out incorrect cases in these changes. Most of them are probably correct, but we need to devise a system that makes it more clear when we have handled the address space concerns correctly, and ideally each pass that gets updated would receive an accompanying test case that exercises that pass specificaly w.r.t. alternate address spaces. However, from this commit, I have retained the new C API entry points. Those were an orthogonal change that probably should have been split apart, but they seem entirely good. In several places the changes were very obvious cleanups with no actual multiple address space code added; these I have not reverted when I spotted them. In a few other places there were merge conflicts due to a cleaner solution being implemented later, often not using address spaces at all. In those cases, I've preserved the new code which isn't address space dependent. This is part of my ongoing effort to clean out the partial address space code which carries high risk and low test coverage, and not likely to be finished before the 3.2 release looms closer. Duncan and I would both like to see the above issues addressed before we return to these changes. llvm-svn: 167222	2012-11-01 09:14:31 +00:00
Chandler Carruth	7ec5085e01	Revert the series of commits starting with r166578 which introduced the getIntPtrType support for multiple address spaces via a pointer type, and also introduced a crasher bug in the constant folder reported in PR14233. These commits also contained several problems that should really be addressed before they are re-committed. I have avoided reverting various cleanups to the DataLayout APIs that are reasonable to have moving forward in order to reduce the amount of churn, and minimize the number of commits that were reverted. I've also manually updated merge conflicts and manually arranged for the getIntPtrType function to stay in DataLayout and to be defined in a plausible way after this revert. Thanks to Duncan for working through this exact strategy with me, and Nick Lewycky for tracking down the really annoying crasher this triggered. (Test case to follow in its own commit.) After discussing with Duncan extensively, and based on a note from Micah, I'm going to continue to back out some more of the more problematic patches in this series in order to ensure we go into the LLVM 3.2 branch with a reasonable story here. I'll send a note to llvmdev explaining what's going on and why. Summary of reverted revisions: r166634: Fix a compiler warning with an unused variable. r166607: Add some cleanup to the DataLayout changes requested by Chandler. r166596: Revert "Back out r166591, not sure why this made it through since I cancelled the command. Bleh, sorry about this! r166591: Delete a directory that wasn't supposed to be checked in yet. r166578: Add in support for getIntPtrType to get the pointer type based on the address space. llvm-svn: 167221	2012-11-01 08:07:29 +00:00
Owen Anderson	b351c8d692	Add a few more simple fast-math constant propagations and cancellations. llvm-svn: 167200	2012-11-01 02:00:53 +00:00
Jakob Stoklund Olesen	9892a4b794	Exploit the new identity composition in composeSubRegIndices(). The static compose() function in RegisterCoalescer was doing the exact same thing. llvm-svn: 167198	2012-11-01 01:15:43 +00:00
Benjamin Kramer	1559127f6f	Replace some instances of UniqueVector with SetVector, which is slightly cheaper. No functionality change. llvm-svn: 167116	2012-10-31 13:45:49 +00:00
Akira Hatanaka	d837be780d	Change signature of function RAFast::spillAll to avoid conversion between type MachineInstr* and MachineBasicBlock::iterator. llvm-svn: 167088	2012-10-31 00:56:01 +00:00
Akira Hatanaka	ebb31e9c42	Check that iterator I is not the end iterator. llvm-svn: 167086	2012-10-31 00:50:52 +00:00
Chad Rosier	909f6a035f	[inline asm] Get the mayLoad/mayStore directly from the MIOp_ExtraInfo operand. llvm-svn: 167050	2012-10-30 20:39:19 +00:00
Chad Rosier	86f6050c54	Add a comment for r167040. llvm-svn: 167046	2012-10-30 20:01:12 +00:00
Chad Rosier	9e1274fb48	[inline asm] Implement mayLoad and mayStore for inline assembly. In general, the MachineInstr MayLoad/MayLoad flags are based on the tablegen implementation. For inline assembly, however, we need to compute these based on the constraints. Revert r166929 as this is no longer needed, but leave the test case in place. rdar://12033048 and PR13504 llvm-svn: 167040	2012-10-30 19:11:54 +00:00
Bill Wendling	10e0e2ec49	Fix grammar. llvm-svn: 167029	2012-10-30 17:51:02 +00:00
Ulrich Weigand	3abb34389d	In various places throughout the code generator, there were special checks to avoid performing compile-time arithmetic on PPCDoubleDouble. Now that APFloat supports arithmetic on PPCDoubleDouble, those checks are no longer needed, and we can treat the type like any other. llvm-svn: 166958	2012-10-29 18:35:49 +00:00
Jakob Stoklund Olesen	9a06696a77	Completely disallow partial copies in adjustCopiesBackFrom(). Partial copies can show up even when CoalescerPair.isPartial() returns false. For example: %vreg24:dsub_0<def> = COPY %vreg31:dsub_0; QPR:%vreg24,%vreg31 Such a partial-partial copy is not good enough for the transformation adjustCopiesBackFrom() needs to do. llvm-svn: 166944	2012-10-29 17:51:52 +00:00
Duncan Sands	5bdd9dda48	Remove a wrapper around getIntPtrType added to GVN by Hal in commit 166624 (the wrapper returns a vector of integers when passed a vector of pointers) by having getIntPtrType itself return a vector of integers in this case. Outside of this wrapper, I didn't find anywhere in the codebase that was relying on the old behaviour for vectors of pointers, so give this a whirl through the buildbots. llvm-svn: 166939	2012-10-29 17:31:46 +00:00
Preston Gurd	52dacca977	This patch addresses a problem with the Post RA scheduler generating an incorrect instruction sequence due to it not being aware that an inline assembly instruction may reference memory. This patch fixes the problem by causing the scheduler to always assume that any inline assembly code instruction could access memory. This is necessary because the internal representation of the inline instruction does not include any information about memory accesses. This should fix PR13504. llvm-svn: 166929	2012-10-29 15:01:23 +00:00
Lang Hames	ee6142c36b	Remove unused typedef. llvm-svn: 166910	2012-10-29 04:57:52 +00:00
Jakob Stoklund Olesen	57143f7e78	Never attempt to join an early-clobber def with a regular kill. This fixes PR14194. llvm-svn: 166880	2012-10-27 17:41:27 +00:00
Jakob Stoklund Olesen	1dfe4fc60c	Reduce indentation with early exit. No functional change. llvm-svn: 166829	2012-10-26 23:05:13 +00:00

... 3 4 5 6 7 ...

14675 Commits