llvm-project

Commit Graph

Author	SHA1	Message	Date
Nick Kledzik	204720bc71	Suppress stderr noise when test case runs. llvm-svn: 161085	2012-07-31 22:18:15 +00:00
Akira Hatanaka	2c64adf672	Add Mips16InstrInfo.cpp and MipsSEInstrInfo.cpp to CMakeLists.txt. llvm-svn: 161083	2012-07-31 22:11:05 +00:00
Michael J. Spencer	1e18841f62	[obj2yaml] Print the Relocations header. llvm-svn: 161082	2012-07-31 22:04:08 +00:00
Akira Hatanaka	b7fa3c9db0	Add definitions of two subclasses of MipsInstrInfo, MipsInstrInfo (for mips16), and MipsSEInstrInfo (for mips32/64). llvm-svn: 161081	2012-07-31 21:49:49 +00:00
Akira Hatanaka	30651805c4	Delete mips64 target machine classes. mips target machines can be used in place of them. llvm-svn: 161080	2012-07-31 21:39:17 +00:00
Akira Hatanaka	02de0e4425	Let PEI::calculateFrameObjectOffsets compute the final stack size rather than computing it in MipsFrameLowering::emitPrologue. llvm-svn: 161078	2012-07-31 21:28:49 +00:00
Akira Hatanaka	33a25af5a8	Expand DYNAMIC_STACKALLOC nodes rather than doing custom-lowering. The frame object which points to the dynamically allocated area will not be needed after changes are made to cease reserving call frames. llvm-svn: 161076	2012-07-31 20:54:48 +00:00
Manman Ren	f288d2f120	MachineSink: Sort the successors before trying to find SuccToSinkTo. Use stable_sort instead of sort. Follow-up to r161062. rdar://11980766 llvm-svn: 161075	2012-07-31 20:45:38 +00:00
Jakob Stoklund Olesen	059e647c6d	Compute instruction depths through the current trace. Assuming infinite issue width, compute the earliest each instruction in the trace can issue, when considering the latency of data dependencies. The issue cycle is record as a 'depth' from the beginning of the trace. This is half the computation required to find the length of the critical path through the trace. Heights are next. llvm-svn: 161074	2012-07-31 20:44:38 +00:00
Jakob Stoklund Olesen	1dfb101835	Rename CT -> MTM. MachineTraceMetrics is abbreviated MTM. llvm-svn: 161072	2012-07-31 20:25:13 +00:00
Akira Hatanaka	a66d676b20	Define ADJCALLSTACKDOWN/UP nodes. These nodes are emitted regardless of whether or not it is in mips16 mode. Define MipsPseudo (mode-independant pseudo) and PseudoSE (mips32/64 pseudo) classes. llvm-svn: 161071	2012-07-31 19:13:07 +00:00
Akira Hatanaka	3a810eda91	Change name of class MipsInst to InstSE to distinguish it from mips16's instruction class. SE stands for standard encoding. llvm-svn: 161069	2012-07-31 18:55:01 +00:00
Akira Hatanaka	beda2241a4	When store nodes or memcpy nodes are created to copy the function call arguments to the stack in MipsISelLowering::LowerCall, use stack pointer and integer offset operands rather than frame object operands. llvm-svn: 161068	2012-07-31 18:46:41 +00:00
Chad Rosier	710be7df71	[x86 frame lowering] In 32-bit mode, use ESI as the base pointer. Previously, we were using EBX, but PIC requires the GOT to be in EBX before function calls via PLT GOT pointer. llvm-svn: 161066	2012-07-31 18:29:21 +00:00
Ted Kremenek	cf8cfd7fc2	Use regex instead of special casing clang and llvm libraries. llvm-svn: 161065	2012-07-31 18:23:44 +00:00
Akira Hatanaka	4ce7c4060d	Fix type of LUXC1 and SUXC1. These instructions were incorrectly defined as single-precision load and store. Also avoid selecting LUXC1 and SUXC1 instructions during isel. It is incorrect to map unaligned floating point load/store nodes to these instructions. llvm-svn: 161063	2012-07-31 18:16:49 +00:00
Manman Ren	8c549b586c	MachineSink: Sort the successors before trying to find SuccToSinkTo. One motivating example is to sink an instruction from a basic block which has two successors: one outside the loop, the other inside the loop. We should try to sink the instruction outside the loop. rdar://11980766 llvm-svn: 161062	2012-07-31 18:10:39 +00:00
Micah Villmow	b67d7a3a33	Conform to LLVM coding style. llvm-svn: 161061	2012-07-31 18:07:43 +00:00
Micah Villmow	6b12f596ef	Don't generate ordered or unordered comparison operations if it is not legal to do so. llvm-svn: 161053	2012-07-31 16:48:03 +00:00
Chandler Carruth	94ed210702	Implement copy and move assignment for TinyPtrVector. These try to re-use allocated vectors as much as possible. llvm-svn: 161041	2012-07-31 09:42:24 +00:00
Sylvestre Ledru	ef7d8295b6	Fix some minor typos llvm-svn: 161037	2012-07-31 07:05:57 +00:00
Craig Topper	9caea12bd8	Use uint8_t to store the InstructionContext table. Saves 768 bytes of static data. llvm-svn: 161034	2012-07-31 06:15:39 +00:00
Craig Topper	6f142746e7	Tidy up. Move for loop index declarations into for statements. Use unsigned instead of uint16_t for loop indices. Use unsigned instead of uint32_t for arguments to raw_ostream.indent. llvm-svn: 161033	2012-07-31 06:02:05 +00:00
Craig Topper	b61024cfcc	Tidy up function argument formatting. llvm-svn: 161032	2012-07-31 05:42:02 +00:00
Craig Topper	347e8cf3b7	Remove trailing whitespace llvm-svn: 161031	2012-07-31 05:28:41 +00:00
Craig Topper	0c4253fe29	Remove trailing whitespace llvm-svn: 161030	2012-07-31 05:27:01 +00:00
Craig Topper	c2efce404e	Make INSTRUCTION_SPECIFIER_FIELDS match X86DisassemblerCommon.h. Also remove trailing whitespace. llvm-svn: 161029	2012-07-31 05:18:26 +00:00
Craig Topper	fb39f97d4c	Tidy up trailing whitespace llvm-svn: 161027	2012-07-31 04:58:05 +00:00
Craig Topper	5f33d90214	Tidy up trailing whitespace llvm-svn: 161026	2012-07-31 04:38:27 +00:00
Chandler Carruth	c0b8a0c216	Clean up trailing whitespace and unnecessary blank lines. llvm-svn: 161025	2012-07-31 04:13:57 +00:00
Chandler Carruth	a565375a18	Bring TinyPtrVector under test. Somehow we never picked up unit tests for this class. These tests exercise most of the basic properties, but the API for TinyPtrVector is very strange currently. My plan is to start fleshing out the API to match that of SmallVector, but I wanted a test for what is there first. Sadly, it doesn't look reasonable to just re-use the SmallVector tests, as this container can only ever store pointers, and much of the SmallVector testing is to get construction and destruction right. Just to get this basic test working, I had to add value_type to the interface. While here I found a subtle bug in the combination of 'erase', 'begin', and 'end'. Both 'begin' and 'end' wanted to use a null pointer to indicate the "end" iterator of an empty vector, regardless of whether there is actually a vector allocated or the pointer union is null. Everything else was fine with this except for erase. If you erase the last element of a vector after it has held more than one element, we return the end iterator of the underlying SmallVector which need not be a null pointer. Instead, simply use the pointer, and poniter + size() begin/end definitions in the tiny case, and delegate to the inner vector whenever it is present. llvm-svn: 161024	2012-07-31 02:48:31 +00:00
Jakob Stoklund Olesen	0c807dfae2	Clear kill flags in removeCopyByCommutingDef(). We are extending live ranges, so kill flags are not accurate. They aren't needed until they are recomputed after RA anyway. <rdar://problem/11950722> llvm-svn: 161023	2012-07-31 02:47:24 +00:00
Manman Ren	2b6a0dfd4c	Reverse order of the two branches at end of a basic block if it is profitable. We branch to the successor with higher edge weight first. Convert from je LBB4_8 --> to outer loop jmp LBB4_14 --> to inner loop to jne LBB4_14 jmp LBB4_8 PR12750 rdar: 11393714 llvm-svn: 161018	2012-07-31 01:11:07 +00:00
Andrew Trick	79795897b3	Use the latest MachineRegisterInfo APIs. No functionality. llvm-svn: 161010	2012-07-30 23:48:17 +00:00
Andrew Trick	79df0de4fc	Added MachineRegisterInfo::hasOneDef() llvm-svn: 161009	2012-07-30 23:48:14 +00:00
Andrew Trick	535a23c38b	Inline MachineRegisterInfo::hasOneUse llvm-svn: 161007	2012-07-30 23:48:12 +00:00
Chandler Carruth	e9cdc7f0d8	Extend the InstVisitor to visit the specialized classes wrapping CallInst for intrinsics. This allows users of the InstVisitor that would like to special case certain very common intrinsics to do so naturally in keeping with the type hierarchy's utility classes. llvm-svn: 161006	2012-07-30 23:45:06 +00:00
Jakob Stoklund Olesen	68c2cd059e	Avoid looking at stale data in verifyAnalysis(). llvm-svn: 161004	2012-07-30 23:15:12 +00:00
Jakob Stoklund Olesen	c14cf57ba9	Allow traces to enter nested loops. This lets traces include the final iteration of a nested loop above the center block, and the first iteration of a nested loop below the center block. We still don't allow traces to contain backedges, and traces are truncated where they would leave a loop, as seen from the center block. llvm-svn: 161003	2012-07-30 23:15:10 +00:00
Jim Grosbach	20666162e9	Keep empty assembly macro argument values in the middle of the list. Empty macro arguments at the end of the list should be as-if not specified at all, but those in the middle of the list need to be kept so as not to screw up the positional numbering. E.g.: .macro foo foo_-bash___: nop .endm foo 1, 2, 3, 4 foo 1, , 3, 4 Should create two labels, "foo_1_2_3_4" and "foo_1__3_4". rdar://11948769 llvm-svn: 161002	2012-07-30 22:44:17 +00:00
Chandler Carruth	0b01261cb0	Move the SmallVector unit tests to be type-parameterized so that we can test more than a single instantiation of SmallVector. Add testing for 0, 1, 2, and 4 element sized "small" buffers. These appear to be essentially untested in the unit tests until now. Fix several tests to be robust in the face of a '0' small buffer. As a consequence of this size buffer, the growth patterns are actually observable in the test -- yes this means that many tests never caused a grow to occur before. For some tests I've merely added a reserve call to normalize behavior. For others, the growth is actually interesting, and so I captured the fact that growth would occur and adjusted the assertions to not assume how rapidly growth occured. Also update the specialization for a '0' small buffer length to have all the same interface points as the normal small vector. llvm-svn: 161001	2012-07-30 22:17:52 +00:00
Jakob Stoklund Olesen	984cfe8322	Clarify invalidation strategy in comment. llvm-svn: 160997	2012-07-30 21:16:22 +00:00
Nick Lewycky	7e9f6d7d58	Fix grammar-o. Fixes PR13482! llvm-svn: 160996	2012-07-30 21:10:51 +00:00
Jakob Stoklund Olesen	f308c128ea	Assert that all trace candidate blocks have been visited by the PO. When computing a trace, all the candidates for pred/succ must have been visited. Filter out back-edges first, though. The PO traversal ignores them. Thanks to Andy for spotting this in review. llvm-svn: 160995	2012-07-30 21:10:27 +00:00
Jakob Stoklund Olesen	a12a7d5f74	Hook into PassManager's analysis verification. By overriding Pass::verifyAnalysis(), the pass contents will be verified by the pass manager. llvm-svn: 160994	2012-07-30 20:57:50 +00:00
Pete Cooper	91244268d7	Consider address spaces for hashing and CSEing DAG nodes. Otherwise two loads from different x86 segments but the same address would get CSEd llvm-svn: 160987	2012-07-30 20:23:19 +00:00
Eric Christopher	d79864c59b	Typo. llvm-svn: 160981	2012-07-30 20:09:37 +00:00
Kevin Enderby	5c490f1b8f	Fix a bug in ARMMachObjectWriter::RecordRelocation() in ARMMachObjectWriter.cpp where the other_half of the movt and movw relocation entries needs to get set and only with the 16 bits of the other half. rdar://10038370 llvm-svn: 160978	2012-07-30 18:46:15 +00:00
Jakob Stoklund Olesen	7361846f32	Add MachineInstr::isTransient(). This is a cleaned up version of the isFree() function in MachineTraceMetrics.cpp. Transient instructions are very unlikely to produce any code in the final output. Either because they get eliminated by RegisterCoalescing, or because they are pseudo-instructions like labels and debug values. llvm-svn: 160977	2012-07-30 18:34:14 +00:00
Jakob Stoklund Olesen	3df6c46fdd	Add MachineTraceMetrics::verify(). This function verifies the consistency of cached data in the MachineTraceMetrics analysis. llvm-svn: 160976	2012-07-30 18:34:11 +00:00
Jakob Stoklund Olesen	eb488fe165	Verify that the CFG hasn't changed during invalidate(). The MachineTraceMetrics analysis must be invalidated before modifying the CFG. This will catch some of the violations of that rule. llvm-svn: 160969	2012-07-30 17:36:49 +00:00
Jakob Stoklund Olesen	fee94ca15b	Add MachineBasicBlock::isPredecessor(). A->isPredecessor(B) is the same as B->isSuccessor(A), but it can tolerate a B that is null or dangling. This shouldn't happen normally, but it it useful for verification code. llvm-svn: 160968	2012-07-30 17:36:47 +00:00
Nadav Rotem	77f1b9c477	When constant folding GEP expressions, keep the address space information of pointers. Together with Ran Chachick <ran.chachick@intel.com> llvm-svn: 160954	2012-07-30 07:25:20 +00:00
Craig Topper	efd97044a3	Mark MOVZX16/MOVSX16 as neverHasSideEffects/mayLoad llvm-svn: 160953	2012-07-30 07:14:07 +00:00
Craig Topper	c6b7ef61f4	Mark MOVZX32_NOREX as isCodeGenOnly and neverHasSideEffects. The isCodeGenOnly change allows special detection of _NOREX instructions to be removed from tablegen disassembler code. llvm-svn: 160951	2012-07-30 06:48:11 +00:00
Craig Topper	08ead0b14e	Remove some unnecessary filter checks. They were already covered by IsCodeGenOnly llvm-svn: 160950	2012-07-30 06:27:19 +00:00
Craig Topper	6f4ad80dc8	Remove check for sub class of X86Inst from filter function since caller guaranteed it. Replace another sub class check with ShouldBeEmitted flag since it was factored in there already. llvm-svn: 160949	2012-07-30 05:39:34 +00:00
Craig Topper	b58dc17025	Simplify code that filtered certain instructions in two different ways. No functional change. llvm-svn: 160948	2012-07-30 05:10:05 +00:00
Craig Topper	60a58ac3e2	Remove check for f256mem from has256BitOperands as nothing depended on it and it isn't the only 256-bit memory type anyway. llvm-svn: 160946	2012-07-30 04:53:00 +00:00
Craig Topper	ac172e225d	Remove trailing whitespace. llvm-svn: 160945	2012-07-30 04:48:12 +00:00
Craig Topper	14eac5dda8	Give VCVTTPD2DQ priority over CVTTPD2DQ. llvm-svn: 160942	2012-07-30 02:20:32 +00:00
Craig Topper	f881d385da	Fix patterns for CVTTPS2DQ to specify SSE2 instead of SSE1. llvm-svn: 160941	2012-07-30 02:14:02 +00:00
Craig Topper	415b3586d0	Fix up patterns for VCVTSS2SD. Specifically give it priority over SSE form. Add an OptForSpeed to explicitly pair up with an OptForSize that was already on another pattern. llvm-svn: 160939	2012-07-30 01:38:57 +00:00
Craig Topper	28402efcb6	Fix load types on intrinsic forms of SS2SD and SD2SS AVX/SSE convert instruction patterns. llvm-svn: 160938	2012-07-29 23:26:34 +00:00
Craig Topper	b6767f3acd	Move more SSE/AVX convert instruction patterns into their definitions. llvm-svn: 160937	2012-07-29 22:30:06 +00:00
Benjamin Kramer	ef2932125d	APInt: Simplify code. No functionality change. llvm-svn: 160929	2012-07-29 12:33:29 +00:00
Manman Ren	f87dd7c01b	Revert r160920 and r160919 due to dragonegg and clang selfhost failure llvm-svn: 160927	2012-07-29 02:44:09 +00:00
Nick Lewycky	d2c3bdd269	Add testcases for GlobalOpt changes in r160693 and r160757. llvm-svn: 160925	2012-07-29 01:15:37 +00:00
Craig Topper	fc93281c07	Fold patterns for some of the SSE/AVX convert instructions into their instruction definitions. llvm-svn: 160922	2012-07-28 18:59:19 +00:00
Craig Topper	024797b9a2	Mark some of the SSE/AVX convert instructions as mayLoad/neverHasSideEffects. llvm-svn: 160921	2012-07-28 18:36:39 +00:00
Manman Ren	9de95e779c	X86 Peephole: fold loads to the source register operand if possible. Trying to fix the bot by specifying a triple in the failing testing cases. llvm-svn: 160920	2012-07-28 17:51:24 +00:00
Manman Ren	0fa3ab88ba	X86 Peephole: fold loads to the source register operand if possible. Machine CSE and other optimizations can remove instructions so folding is possible at peephole while not possible at ISel. rdar://10554090 and rdar://11873276 llvm-svn: 160919	2012-07-28 16:48:01 +00:00
Craig Topper	44f9b5343d	Make CVTSS2SI instruction definition consistent with CVTSD2SI. llvm-svn: 160914	2012-07-28 08:28:23 +00:00
Craig Topper	1c1aef07b8	Fix up memory load types for SSE scalar convert intrinsic patterns. llvm-svn: 160913	2012-07-28 07:59:59 +00:00
Manman Ren	32367c063b	X86 Peephole: fix PR13475 in optimizeCompare. It is possible that an instruction can use and update EFLAGS. When checking the safety, we should check the usage of EFLAGS first before declaring it is safe to optimize due to the update. llvm-svn: 160912	2012-07-28 03:15:46 +00:00
Andrew Trick	940534371b	Reenable a basic SSA DAG builder optimization. Jakob fixed ProcessImplicifDefs in r159149. llvm-svn: 160910	2012-07-28 01:48:15 +00:00
Jakob Stoklund Olesen	0563369755	Add more debug output to MachineTraceMetrics. llvm-svn: 160905	2012-07-27 23:58:38 +00:00
Jakob Stoklund Olesen	1152202cc2	Keep track of the head and tail of the trace through each block. This makes it possible to quickly detect blocks that are outside the trace. llvm-svn: 160904	2012-07-27 23:58:36 +00:00
Eric Christopher	86ca9f9e11	Add a DW_AT_high_pc for CUs that are a single address range. Update all tests accordingly. Fixes PR13351. Patch by shinichiro hamaji! llvm-svn: 160899	2012-07-27 22:00:05 +00:00
Jakob Stoklund Olesen	7dfe7abdee	Also compute register mask lists under -new-live-intervals. llvm-svn: 160898	2012-07-27 21:56:39 +00:00
Chad Rosier	bd9f2ba4d6	Typos. llvm-svn: 160897	2012-07-27 21:41:59 +00:00
Evan Cheng	249716e8ae	Teach CodeGenPrep to look past bitcast when it's duplicating return instruction into predecessor blocks to enable tail call optimization. rdar://11958338 llvm-svn: 160894	2012-07-27 21:21:26 +00:00
Jakob Stoklund Olesen	97e14e02f1	Eliminate the IS_PHI_DEF flag and VNInfo::setIsPHIDef(). A value number is a PHI def if and only if it begins at a block boundary. This can be derived from the def slot, a separate flag is not necessary. llvm-svn: 160893	2012-07-27 21:11:14 +00:00
Jakob Stoklund Olesen	4021a7bf25	Add a -new-live-intervals experimental option. This option replaces the existing live interval computation with one based on LiveRangeCalc.cpp. The new algorithm does not depend on LiveVariables, and it can be run at any time, before or after leaving SSA form. llvm-svn: 160892	2012-07-27 20:58:46 +00:00
Andrew Kaylor	8e87a75be7	Fixing problems with X86_64_32 relocations and making the assertions more readable. llvm-svn: 160889	2012-07-27 20:30:12 +00:00
Jakob Stoklund Olesen	bc65e8f94e	Add <imp-def> of super-register when lowering SUBREG_TO_REG. Patch by Tyler Nowicki! llvm-svn: 160888	2012-07-27 20:19:49 +00:00
Benjamin Kramer	718b007fe9	SmallVector: Crank up verbosity of asserts per Chandler's request. Also add assertions to validate the iterator in the insert method overloads. llvm-svn: 160882	2012-07-27 19:05:58 +00:00
Chad Rosier	c25f88b703	The TimePassesIsEnabled has since moved to PassManager.cpp. llvm-svn: 160881	2012-07-27 19:03:02 +00:00
Andrew Kaylor	782d5c434f	Test commit, clean up comment llvm-svn: 160880	2012-07-27 18:39:47 +00:00
Nuno Lopes	85591f899d	fix PR13390: do not loop forever with self-referencing self instructions llvm-svn: 160876	2012-07-27 18:21:15 +00:00
Nuno Lopes	20c7eb3549	fix infinite loop in instcombine in the presence of a (malformed) self-referencing select inst. This can happen as long as the instruction is not reachable. Instcombine does generate these unreachable malformed selects when doing RAUW llvm-svn: 160874	2012-07-27 18:03:57 +00:00
Andrew Kaylor	5c01090c49	Test commit, clean up comment llvm-svn: 160873	2012-07-27 17:52:42 +00:00
Jakob Stoklund Olesen	0c06121e4e	Give MCRegisterInfo an implementation file. Move some functions from MCRegisterInfo.h that don't need to be inline. This shrinks llc by 8K. llvm-svn: 160865	2012-07-27 16:25:20 +00:00
Benjamin Kramer	38862ecf79	SmallVector::erase: Assert that iterators are actually inside the vector. The rationale here is that it's hard to write loops containing vector erases and it only shows up if the vector contains non-trivial objects leading to crashes when forming them out of garbage memory. llvm-svn: 160854	2012-07-27 09:10:25 +00:00
Craig Topper	b63501397b	Clean up includes. llvm-svn: 160852	2012-07-27 06:44:02 +00:00
Jakob Stoklund Olesen	4914cced62	Eliminate the large XXXSubRegTable constant arrays. These tables were indexed by [register][subreg index] which made them, very large and sparse. Replace them with lists of sub-register indexes that match the existing lists of sub-registers. MCRI::getSubReg() becomes a very short linear search, like getSubRegIndex() already was. llvm-svn: 160843	2012-07-27 00:10:51 +00:00
Jakob Stoklund Olesen	5995936309	Remove support for 'CompositeIndices' and sub-register cycles. Now that the weird X86 sub_ss and sub_sd sub-register indexes are gone, there is no longer a need for the CompositeIndices construct in .td files. Sub-register index composition can be specified on the SubRegIndex itself using the ComposedOf field. Also enforce unique names for sub-registers in TableGen. The same sub-register cannot be available with multiple sub-register indexes. llvm-svn: 160842	2012-07-26 23:39:50 +00:00
Akira Hatanaka	97ba7696f8	Pass the correct call frame size to callseq_start node. This is needed to replace uses of function getMaxCallFrameSize defined in MipsFunctionInfo with the one MachineFrameInfo has. llvm-svn: 160841	2012-07-26 23:27:01 +00:00
Pete Cooper	abc13af9c6	Simplify demanded bits of select sources where the condition is a constant vector llvm-svn: 160835	2012-07-26 23:10:24 +00:00
Jakob Stoklund Olesen	7cd08536c2	Remove the X86 sub_ss and sub_sd sub-register indexes completely. llvm-svn: 160833	2012-07-26 23:07:20 +00:00
Jakob Stoklund Olesen	77cd55b4ee	Remove the last mentions of sub_ss and sub_sd from patterns. I'll remove these two sub-register indexes shortly. llvm-svn: 160831	2012-07-26 23:03:08 +00:00
Jakob Stoklund Olesen	b96d0b4e08	Eliminate sub_ss, sub_sd from broadcast patterns. The (COPY_TO_REGCLASS GR32:$src, VR128) pattern looks odd, but copyPhysReg does the right thing with it. (The old pattern would eventually produce the same cross-class copy). llvm-svn: 160830	2012-07-26 22:59:06 +00:00
Pete Cooper	e807e45bff	Teach SimplifyDemandedBits how to look through fpext and fptrunc to simplify their operand llvm-svn: 160823	2012-07-26 22:37:04 +00:00
Jakob Stoklund Olesen	206b825f5c	Eliminate more sub_ss / sub_sd patterns. This gets rid of some more INSERT_SUBREG - IMPLICIT_DEF patterns, simplifying the emitted code a bit. llvm-svn: 160820	2012-07-26 22:30:18 +00:00
Jakob Stoklund Olesen	75d17b0577	Eliminate some SUBREG_TO_REG patterns with sub_ss and sub_sd. The SUBREG_TO_REG instruction has magic semantics asserting that the source value was defined by an instruction that cleared the high half of the register. Those semantics are never actually exploited for xmm registers. llvm-svn: 160818	2012-07-26 22:03:21 +00:00
Jakob Stoklund Olesen	ceee4a9d0c	Eliminate a batch of uses of sub_ss and sub_sd in the X86 target. These idempotent sub-register indices don't do anything --- They simply map XMM registers to themselves. They no longer affect register classes either since the SubRegClasses field has been removed from Target.td. This patch replaces XMM->XMM EXTRACT_SUBREG and INSERT_SUBREG patterns with COPY_TO_REGCLASS patterns which simply become COPY instructions. The number of IMPLICIT_DEF instructions before register allocation is reduced, and that is the cause of the test case changes. llvm-svn: 160816	2012-07-26 21:40:42 +00:00
Micah Villmow	7b473d9f72	Add support for v16i32/v16i64 into the code generator. This is required for backends that use i32/i64 vectors for the getSetCCResultType function. llvm-svn: 160814	2012-07-26 21:22:00 +00:00
Chad Rosier	7c427c40cb	Make comments in Debug.cpp and Debug.h consistent. Rename SetCurrentDebugType; Function names should be camel case, and start with a lower case letter. No functional change intended. llvm-svn: 160813	2012-07-26 20:38:52 +00:00
Jakob Stoklund Olesen	35400b1dda	Use an otherwise unused variable. llvm-svn: 160798	2012-07-26 19:42:56 +00:00
Jakob Stoklund Olesen	f9029fef2a	Start scaffolding for a MachineTraceMetrics analysis pass. This is still a work in progress. Out-of-order CPUs usually execute instructions from multiple basic blocks simultaneously, so it is necessary to look at longer traces when estimating the performance effects of code transformations. The MachineTraceMetrics analysis will pick a typical trace through a given basic block and provide performance metrics for the trace. Metrics will include: - Instruction count through the trace. - Issue count per functional unit. - Critical path length, and per-instruction 'slack'. These metrics can be used to determine the performance limiting factor when executing the trace, and how it will be affected by a code transformation. Initially, this will be used by the early if-conversion pass. llvm-svn: 160796	2012-07-26 18:38:11 +00:00
Dan Gohman	0b3d782933	Add a floor intrinsic. llvm-svn: 160791	2012-07-26 17:43:27 +00:00
Nuno Lopes	5940c4a15f	do null checks for a few more Emit*() functions. Thanks Eli for noticing. llvm-svn: 160787	2012-07-26 17:10:46 +00:00
Duncan Sands	5651452076	Stop reassociate from looking through expressions of arbitrary complexity. This is a temporary measure until my fix for PR13021 is ready. llvm-svn: 160778	2012-07-26 09:26:40 +00:00
Duncan Sands	a2791b576f	Take people straight to the contents of the file. llvm-svn: 160777	2012-07-26 08:08:31 +00:00
Duncan Sands	c769ccaff3	Add the list of code owners to the top level of the LLVM source tree to hopefully make it more visible. Adjust the web-docs to have a link to this file rather than the list itself. I described code owners as also being gatekeepers for their part of the code, which I think is true but isn't in the code owner explanation on the web page. llvm-svn: 160776	2012-07-26 08:04:09 +00:00
Craig Topper	c7690ac7ac	Make l/q suffixes on AVX forms of scalar convert instructions consistent with their non-AVX forms. llvm-svn: 160775	2012-07-26 07:48:28 +00:00
Akira Hatanaka	64626fc20f	Fix call setup for PIC. Patch by Reed Kotler. llvm-svn: 160774	2012-07-26 02:24:43 +00:00
Sylvestre Ledru	4fb32b10e5	Fix two typos in the doc llvm-svn: 160762	2012-07-25 22:01:31 +00:00
Jakob Stoklund Olesen	abd254e1b6	Differentially encode all MC register lists. This simplifies MCRegisterInfo and shrinks the target descriptions a bit more. llvm-svn: 160758	2012-07-25 21:41:37 +00:00
Nick Lewycky	7d0f110cb3	It's not safe to blindly remove invoke instructions. This happens when we encounter an invoke of an allocation function. This should fix the dragonegg bootstrap. Testcase to follow, later. llvm-svn: 160757	2012-07-25 21:19:40 +00:00
Manman Ren	e8c6b15137	Update testing case for Atom when disabling rematerialization in TwoAddressInstructionPass. The generated code for Atom has a different code sequence. This is realted to commit r160749. llvm-svn: 160755	2012-07-25 20:17:14 +00:00
Chad Rosier	13198f8f9f	You cannot call removeModule on a JIT with no modules. Patch by Verena Beckham <verena@codeplay.com>. Reviewed by Jim Grosbach. llvm-svn: 160753	2012-07-25 19:06:29 +00:00
Nuno Lopes	f0626f2205	revert r160742: it's breaking CMake build original commit msg: MemoryBuiltins: add support to determine the size of strdup'ed non-constant strings llvm-svn: 160751	2012-07-25 18:49:28 +00:00
Manman Ren	cc1dc6dc11	Disable rematerialization in TwoAddressInstructionPass. It is redundant; RegisterCoalescer will do the remat if it can't eliminate the copy. Collected instruction counts before and after this. A few extra instructions are generated due to spilling but it is normal to see these kinds of changes with almost any small codegen change, according to Jakob. This also fixed rdar://11830760 where xor is expected instead of movi0. llvm-svn: 160749	2012-07-25 18:28:13 +00:00
David Blaikie	70fdf72a48	Don't add null characters to the end of the APFloat string buffer. Report/patch inspiration by Olaf Krzikalla. llvm-svn: 160744	2012-07-25 18:04:24 +00:00
Nuno Lopes	f0441e04bd	MemoryBuiltins: add support to determine the size of strdup'ed non-constant strings llvm-svn: 160742	2012-07-25 17:29:22 +00:00
Nuno Lopes	7ba5b98720	add EmitStrNLen() llvm-svn: 160741	2012-07-25 17:18:59 +00:00
Jakob Stoklund Olesen	cef9a618b1	Preserve 2-addr constraints in ConnectedVNInfoEqClasses. When a live range splits into multiple connected components, we would arbitrarily assign <undef> uses to component 0. This is wrong when the use is tied to a def that gets assigned to a different component: %vreg69<def> = ADD8ri %vreg68<undef>, 1 The use and def must get the same virtual register. Fix this by assigning <undef> uses to the same component as the value defined by the instruction, if any: %vreg69<def> = ADD8ri %vreg69<undef>, 1 This fixes PR13402. The PR has a test case which I am not including because it is unlikely to keep exposing this behavior in the future. llvm-svn: 160739	2012-07-25 17:15:15 +00:00
Jim Grosbach	6df755cc4e	ARM: Don't assume an SDNode is a constant. Before accessing a node as a ConstandSDNode, make sure it actually is one. No testcase of non-trivial size. rdar://11948669 llvm-svn: 160735	2012-07-25 17:02:47 +00:00
Jakob Stoklund Olesen	c6fd3deee6	Verify two-address constraints more carefully. Include <undef> operands and virtual registers after leaving SSA form. llvm-svn: 160734	2012-07-25 16:49:11 +00:00
Nuno Lopes	89702e94b5	make all Emit*() functions consult the TargetLibraryInfo information before creating a call to a library function. Update all clients to pass the TLI information around. Previous draft reviewed by Eli. llvm-svn: 160733	2012-07-25 16:46:31 +00:00
Rafael Espindola	73173c55c2	Fix typos. Thanks to Matt Beaumont-Gay for noticing it. llvm-svn: 160731	2012-07-25 15:42:45 +00:00
Axel Naumann	7b44fbb95b	Twine: fix link to source, add link to class doc and container section. 80 char lines. llvm-svn: 160726	2012-07-25 13:46:11 +00:00
Rafael Espindola	11c38b9657	When a return struct pointer is passed in registers, the called has nothing to pop. llvm-svn: 160725	2012-07-25 13:41:10 +00:00
Rafael Espindola	2caee7f4d2	Factor a long list of conditions into a predicate function. No functionality change. llvm-svn: 160724	2012-07-25 13:35:45 +00:00
Duncan Sands	77a1f3b564	Don't perform an overaligned load in this test, since that's undefined behaviour that might be exploited one day. llvm-svn: 160714	2012-07-25 09:45:37 +00:00
Duncan Sands	0b875a0c29	When folding a load from a global constant, if the load started in the middle of an array element (rather than at the beginning of the element) and extended into the next element, then the load from the second element was being handled wrong due to incorrect updating of the notion of which byte to load next. This fixes PR13442. Thanks to Chris Smowton for reporting the problem, analyzing it and providing a fix. llvm-svn: 160711	2012-07-25 09:14:54 +00:00
Akira Hatanaka	5a69c235ae	Eliminate the stack slot used to save the global base register. The long branch pass (fixed in r160601) no longer uses the global base register to compute addresses of branch destinations, so it is not necessary to reserve a slot on the stack. llvm-svn: 160703	2012-07-25 03:16:47 +00:00
Rafael Espindola	a92cf29f0d	Add a cpu to the test. Should fix the atom bot. llvm-svn: 160701	2012-07-24 22:56:06 +00:00
Rafael Espindola	f30e9bfb90	Add a triple to the test. llvm-svn: 160698	2012-07-24 21:55:04 +00:00
Rafael Espindola	a44e193a11	In order to correctly compile struct s { double x1; float x2; }; __attribute__((regparm(3))) struct s f(int a, int b, int c); void g(void) { f(41, 42, 43); } We need to be able to represent passing the address of s to f (sret) in a register (inreg). Turns out that all that is needed is to not mark them as mutually incompatible. llvm-svn: 160695	2012-07-24 21:40:17 +00:00
Kevin Enderby	216ac31971	Fix a bug in the x86 disassembler's symbolic disassembly support for Jcc-Jump if Condition Is Met instuctions that was not correctly determining the target instruction. So for a jne rel32 instruction: % cat x.s .byte 0x0f, 0x85, 0x09, 0x00, 0x00, 0x00 % as x.s it was incorrectly deterining the target: % otool -q -tv a.out a.out: (__TEXT,__text) section 0000000000000000 jne 0xd and with the fix it gets this correct as: % otool -q -tv a.out a.out: (__TEXT,__text) section 0000000000000000 jne 0xf rdar://11505997 llvm-svn: 160694	2012-07-24 21:40:01 +00:00
Nick Lewycky	38be931223	Don't delete one more instruction than we're allowed to. This should fix the Darwin bootstrap. Testcase exists but isn't fully reduced, I expect to commit the testcase this evening. llvm-svn: 160693	2012-07-24 21:33:00 +00:00
Michael J. Spencer	041c0d4c21	[Object] Remove unneeded const_cast. llvm-svn: 160692	2012-07-24 21:07:56 +00:00
Nuno Lopes	342cf787ef	add a few more functions to TargetLibraryInfo: fputc, memchr, memcmp, putchar, puts, strchr, strncmp llvm-svn: 160690	2012-07-24 21:00:36 +00:00
David Chisnall	5b8c1680de	ELF does not imply GNU/Linux. Do not assume GNU conventions just because we are targeting an ELF platform. Only fold gs-relative (and fs-relative) loads if it is actually sensible to do so for the target platform. This fixes PR13438. llvm-svn: 160687	2012-07-24 20:04:16 +00:00
Anshuman Dasgupta	eefe7c9cf9	Add new interfaces to support ldd's ReaderElf.cpp. Patch by Sid Manning! llvm-svn: 160685	2012-07-24 19:48:24 +00:00
Nuno Lopes	20f5a7aeb7	TargetLibraryInfo: add strn?cat, strn?cpy, and strn?len llvm-svn: 160678	2012-07-24 17:25:06 +00:00
Nuno Lopes	2a4b09c9de	teach objectsize about strdup() and strndup() llvm-svn: 160676	2012-07-24 16:28:13 +00:00
Nadav Rotem	465834c85f	Clean whitespaces. llvm-svn: 160668	2012-07-24 10:51:42 +00:00
Nick Lewycky	faa9c3b035	Teach globalopt to not nuke all stores to globals. Keep them around of they might be deliberate "one time" leaks, so that leak checkers can find them. This is a reapply of r160602 with the fix that this time I'm committing the code I thought I was committing last time; the I->eraseFromParent() goes after the break out of the loop. llvm-svn: 160664	2012-07-24 07:21:08 +00:00
Craig Topper	17300940ae	Change llvm_unreachable in SplitVectorOperand to report_fatal_error. Keeps release builds from crashing if code uses an intrinsic with an illegal type. llvm-svn: 160661	2012-07-24 04:11:21 +00:00
Akira Hatanaka	45da9e2653	Fix function MipsCodeEmitter::emitExternalSymbolAddress to pass test ExecutionEngine/test-fp.ll. Patch by Petar Jovanovic. llvm-svn: 160653	2012-07-24 00:08:26 +00:00
Akira Hatanaka	26e9ecb7a3	Add basic ability to setup call frame, and make procedure calls. Hello world will compile and execute with this patch. Patch by Reed Kotler. llvm-svn: 160651	2012-07-23 23:45:54 +00:00
Eric Christopher	2ce6541f3b	Fix a "Bad fd number" error on some platforms due to a less portable redirection in the system call. Patch by Andy Gibbs. llvm-svn: 160644	2012-07-23 20:54:17 +00:00
Nuno Lopes	eb9d2755b2	make ConstantRange::zeroExtend() optimal llvm-svn: 160643	2012-07-23 20:33:29 +00:00
Richard Trieu	1feac1cef9	Add operator== to APSInt. This will compare the signed bit before doing the comparison. This prevents large unsigned integers from being equal to signed negative integers of the same bit width. llvm-svn: 160642	2012-07-23 20:24:23 +00:00
Dan Gohman	f64ff8ed3a	An objc_retain can serve as a may-use for a different pointer. rdar://11931823. llvm-svn: 160637	2012-07-23 19:27:31 +00:00
Akira Hatanaka	adec58c091	Add comment for relocations MO_HIGHER and HIGHEST in MipsBaseInfo.h. llvm-svn: 160636	2012-07-23 19:19:20 +00:00
Micah Villmow	9eedce1e7c	Test revert of test changes. llvm-svn: 160632	2012-07-23 16:42:45 +00:00
Micah Villmow	780c24f19c	Test commit. llvm-svn: 160631	2012-07-23 16:37:24 +00:00
Nadav Rotem	1088811c33	Suppress a warning. llvm-svn: 160629	2012-07-23 13:44:15 +00:00
Nadav Rotem	7f829e4d32	Doxygenify the comments of ISD nodes. llvm-svn: 160623	2012-07-23 09:04:00 +00:00
Sylvestre Ledru	35521e2310	Fix a typo (the the => the) llvm-svn: 160621	2012-07-23 08:51:15 +00:00
Nadav Rotem	9056076cab	Fixed DAGCombine optimizations which generate select_cc for targets that do not support it (X86 does not lower select_cc). PR: 13428 Together with Michael Kuperstein <michael.m.kuperstein@intel.com> llvm-svn: 160619	2012-07-23 07:59:50 +00:00
Craig Topper	2694c05e86	Tidy up. Fix indentation and remove trailing whitespace. llvm-svn: 160617	2012-07-23 05:38:07 +00:00
Craig Topper	b49546a3b3	Change llvm_unreachable in SplitVectorResult to report_fatal_error. Keeps release builds from crashing if code uses an intrinsic with an illegal type. For instance 256-bit AVX intrinsics without having AVX enabled. llvm-svn: 160616	2012-07-23 04:34:49 +00:00
Chandler Carruth	c8acd7c96b	Move the initialization of the bounds checking pass. The pass itself moved earlier. This fixes some layering issues. llvm-svn: 160611	2012-07-22 05:19:32 +00:00
NAKAMURA Takumi	b5b4d8b06b	ExecutionEngine/TargetSelect.cpp: Override default triple as LLVM_HOSTTRIPLE. In current implementation, JIT should run only on host. llvm-svn: 160610	2012-07-22 03:04:57 +00:00
NAKAMURA Takumi	43652ae0de	autoconf: Re-introduce LLVM_HOSTTRIPLE since r143500, as rework of PR11060. cmake: Add LLVM_HOSTTRIPLE. For now, it is same as TARGET_TRIPLE. llvm-svn: 160609	2012-07-22 03:04:52 +00:00
Nick Lewycky	9669c198ba	Revert r160602. llvm-svn: 160603	2012-07-21 09:03:15 +00:00
Nick Lewycky	72b83e5eaa	Teach globalopt to play nice with leak checkers. This is a reapplication of r160529 that was subsequently reverted. The fix was to not call GV->eraseFromParent() right before the caller does the same. The existing testcases already caught this bug if run under valgrind. llvm-svn: 160602	2012-07-21 08:29:45 +00:00
Akira Hatanaka	f72efdb62f	Fix Mips long branch pass. This pass no longer requires that the global pointer value be saved to the stack or register since it uses bal instruction to compute branch distance. llvm-svn: 160601	2012-07-21 03:30:44 +00:00
Akira Hatanaka	6035fe78c7	Add HIGHER and HIGHEST relocations to Mips backend. llvm-svn: 160599	2012-07-21 03:09:04 +00:00
Akira Hatanaka	b49c68a65d	Revert accidental commit. llvm-svn: 160598	2012-07-21 02:20:33 +00:00
Akira Hatanaka	f73e362758	Add VK_Mips_HIGHER and VK_Mips_HIGHEST to MCSymbolRefExpr::VariantKind. Test case will be added later when long branch patch is checked in. llvm-svn: 160597	2012-07-21 02:15:19 +00:00
Nuno Lopes	705141d4df	baby steps toward fixing some problems with inbound GEPs that overflow, as discussed 2 months ago or so. Make sure we do not emit index computations with NSW flags so that we dont get an undef value if the GEP overflows llvm-svn: 160589	2012-07-20 23:07:40 +00:00
Nuno Lopes	20ea62527a	move the bounds checking pass to the instrumentation folder, where it belongs. I dunno why in the world I dropped it in the Scalar folder in the first place. No functionality change. llvm-svn: 160587	2012-07-20 22:39:33 +00:00
Benjamin Kramer	5be8f60126	Remove unused private member variables uncovered by the recent changes to clang's -Wunused-private-field. llvm-svn: 160583	2012-07-20 22:05:57 +00:00
Galina Kistanova	434efb29b5	Fix few warnings. llvm-svn: 160576	2012-07-20 21:30:52 +00:00
Jakob Stoklund Olesen	e2cfd0d45a	Avoid folding loads that are unsafe to move. LiveRangeEdit::foldAsLoad() can eliminate a register by folding a load into its only use. Only do that when the load is safe to move, and it won't extend any live ranges. This fixes PR13414. llvm-svn: 160575	2012-07-20 21:29:31 +00:00
Chandler Carruth	1f41bf0c3f	Fix a dangling StringRef bug in the auto upgrader. In one case, we reset CI's name, and then used the StringRef pointing at its old name. I'm fixing it by storing the name in a std::string, and hoisting the renaming logic to happen always. This is nicer anyways as it will allow the upgraded IR to have the same names as the input IR in more cases. Another bug found by AddressSanitizer. Woot. llvm-svn: 160572	2012-07-20 21:09:18 +00:00
Jakob Stoklund Olesen	f62c07f147	Split loop exiting edges more aggressively. PHIElimination splits critical edges when it predicts it can resolve interference and eliminate copies. It doesn't split the edge if the interference wouldn't be resolved anyway because the phi-use register is live in the critical edge anyway. Teach PHIElimination to split loop exiting edges with interference, even if it wouldn't resolve the interference. This removes the necessary copies from the loop, which is still an improvement from injecting the copies into the loop. The test case demonstrates the improvement. Before: LBB0_1: cmpb $0, (%rdx) leaq 1(%rdx), %rdx movl %esi, %eax je LBB0_1 After: LBB0_1: cmpb $0, (%rdx) leaq 1(%rdx), %rdx je LBB0_1 movl %esi, %eax llvm-svn: 160571	2012-07-20 20:49:53 +00:00
Benjamin Kramer	dfaa0f3a81	Try to unbreak the windows build. llvm-svn: 160567	2012-07-20 19:49:33 +00:00
Daniel Dunbar	c8b8c49d6f	SourceMgr: Use has_colors() instead of just is_displayed() before trying to use color. llvm-svn: 160559	2012-07-20 18:29:44 +00:00
Daniel Dunbar	04b4583c9b	raw_ostream: Add a has_colors() method. llvm-svn: 160558	2012-07-20 18:29:41 +00:00
Daniel Dunbar	712de82154	Process: Add sys::Process::FileDescriptorHasColors(). llvm-svn: 160557	2012-07-20 18:29:38 +00:00
Daniel Dunbar	2f529107a7	lit: Use close_fds=True on UNIX, to avoid file descriptor pollution of subprocesses. llvm-svn: 160556	2012-07-20 18:29:34 +00:00
Richard Osborne	0ab2b0df82	Fix assertion in jump threading (PR13405). GetBestDestForJumpOnUndef() assumes there is at least 1 successor, which isn't true if the block ends in an indirect branch with no successors. Fix this by bailing out earlier in this case. llvm-svn: 160546	2012-07-20 10:36:17 +00:00
Kostya Serebryany	f02c6069ac	[asan] make sure that the crash callbacks do not get merged (Chandler's idea: insert an empty InlineAsm). Change the order in which the new BBs are inserted: the slow path BB is insert between old BBs, the crash BB is inserted at the end. Don't create an empty BB (introduced by recent commits). Update the test. The experimental code that does manual crash callback merge will most likely be deleted later. llvm-svn: 160544	2012-07-20 09:54:50 +00:00
Craig Topper	0b94e46ce3	Don't use implicit register operands to calculate L-bit for AVX instructions. Needed because super reg defs and kills are added as implicit operands on 128-bit instructions. Fixes PR13349. Patch by Jose Fonseca. llvm-svn: 160543	2012-07-20 07:03:46 +00:00
Owen Anderson	3a8bdb5677	Make RegisterOperand a subclass of DAGOperand so that RegisterOperands can be passed into multiclasses that take DAGOperands as multiclass parameters. llvm-svn: 160540	2012-07-20 03:38:19 +00:00
Nick Lewycky	7707e23429	Revert r160529 due to crashes. llvm-svn: 160532	2012-07-19 23:59:21 +00:00
Pete Cooper	dcf94db677	Fix crash in machine verifier when trying to print the def of a register which has no def llvm-svn: 160531	2012-07-19 23:40:38 +00:00
Nick Lewycky	0fa6a28141	Don't wipe out global variables that are probably storing pointers to heap memory. This makes clang play nice with leak checkers. llvm-svn: 160529	2012-07-19 22:35:28 +00:00
Galina Kistanova	27540f8d8c	Reverting r 160419. llvm-svn: 160525	2012-07-19 21:43:55 +00:00
Preston Gurd	8e082688a1	Adds the family codes for the Midview Atom processors so that the Atom buildbot will auto-detect Atom. llvm-svn: 160521	2012-07-19 19:05:37 +00:00
Preston Gurd	f2ea70ae4a	Fix remaining lit tests which were failing when run on an Atom processor. Patches by Tyler Nowicki, Andy Zhang, and Preston Gurd! llvm-svn: 160520	2012-07-19 18:53:21 +00:00
Sebastian Pop	221e07e140	default to use -mv4 when no version of Hexagon has been specified This fixes a bunch of make check failures of the form: Unknown Architecture Version. UNREACHABLE executed at ../lib/Target/Hexagon/HexagonSubtarget.cpp:60! llvm-svn: 160518	2012-07-19 18:24:50 +00:00
Nuno Lopes	c14776d406	reimplement truncate() to make it optimal. It is optimal at least up to 7 bits (I've tested all such cases) This change to truncate() allows a little simplification to the multiplication code, and it also makes multiplication optimal :) llvm-svn: 160512	2012-07-19 16:27:45 +00:00
Benjamin Kramer	347d559323	Pull the simple parts of DenseMapInfo<DebugLoc> inline and prune includes. llvm-svn: 160507	2012-07-19 15:00:34 +00:00
NAKAMURA Takumi	67ce1930c1	test/DebugInfo/dwarfdump-test.test: Tweak expressions for Win32 to match backslashes. They are still odd, though. For example, Paths are printed on Win32 as below; /tmp/dbginfo\def2.cc:4:0 /tmp/dbginfo\include\decl2.h:1:0 /tmp/include\decl.h:5:0 llvm-svn: 160505	2012-07-19 13:40:09 +00:00
Benjamin Kramer	f364a63c3e	Replace some explicit compare loops with std::equal. No functionality change. llvm-svn: 160501	2012-07-19 10:46:05 +00:00
Jush Lu	e67e07b901	[arm-fast-isel] Add support for vararg function calls. llvm-svn: 160500	2012-07-19 09:49:00 +00:00
Alexey Samsonov	e16e16add6	DebugInfo library: add support for fetching absolute paths to source files (instead of basenames) from DWARF. Use this behavior in llvm-dwarfdump tool. Reviewed by Benjamin Kramer. llvm-svn: 160496	2012-07-19 07:03:58 +00:00
Galina Kistanova	aaf9735951	Fixed few warnings. llvm-svn: 160493	2012-07-19 04:50:12 +00:00
Bill Wendling	723444e767	Remove tabs. llvm-svn: 160483	2012-07-19 00:25:04 +00:00
Bill Wendling	bd8e5d537d	Remove tabs. llvm-svn: 160482	2012-07-19 00:23:13 +00:00
Bill Wendling	4e68e0673a	Remove tabs. llvm-svn: 160480	2012-07-19 00:17:40 +00:00
Bill Wendling	318f03f56f	Remove tabs. llvm-svn: 160479	2012-07-19 00:15:11 +00:00
Chad Rosier	09a06c257e	Tweak prose. llvm-svn: 160478	2012-07-19 00:11:45 +00:00
Bill Wendling	ea6397f67b	Remove tabs. llvm-svn: 160477	2012-07-19 00:11:40 +00:00
Bill Wendling	2b07965042	Remove tabs. llvm-svn: 160476	2012-07-19 00:06:06 +00:00
Bill Wendling	d163405df8	Remove tabs. llvm-svn: 160475	2012-07-19 00:04:14 +00:00
Bill Wendling	0de5913855	Remove tabs. llvm-svn: 160473	2012-07-19 00:01:33 +00:00
Bill Wendling	a88946e21a	Remove tabs. llvm-svn: 160472	2012-07-19 00:01:00 +00:00
Bill Wendling	efe80cb87e	Remove tabs. llvm-svn: 160471	2012-07-18 23:58:37 +00:00
Richard Trieu	9208abd7c3	Move around some enum elements so that lastMRM corrects gets assigned 56, which is one more that MRM_DF which is 55. Previously, it held value 45, the same as MRM_D0. llvm-svn: 160465	2012-07-18 23:04:22 +00:00
Jim Grosbach	66372684f7	TblGen: Tweak to pretty-print DAGISel.inc a bit better. llvm-svn: 160463	2012-07-18 22:41:03 +00:00
Jordan Rose	82632bffbc	Allow PointerIntPairs to be created from const void . For a measure of safety, this conversion is only permitted if the stored pointer type can also be created from a const void . llvm-svn: 160456	2012-07-18 21:58:49 +00:00
Manman Ren	d0a4ee8427	X86: remove redundant cmp against zero. Updated OptimizeCompare in peephole to remove redundant cmp against zero. We only remove Compare if CF and OF are not used. rdar://11855129 llvm-svn: 160454	2012-07-18 21:40:01 +00:00
Preston Gurd	f0a48ec8f1	This patch fixes 8 out of 20 unexpected failures in "make check" when run on an Intel Atom processor. The failures have arisen due to changes elsewhere in the trunk over the past 8 weeks or so. These failures were not detected by the Atom buildbot because the CPU on the Atom buildbot was not being detected as an Atom CPU. The fix for this problem is in Host.cpp and X86Subtarget.cpp, but shall remain commented out until the current set of Atom test failures are fixed. Patch by Andy Zhang and Tyler Nowicki! llvm-svn: 160451	2012-07-18 20:49:17 +00:00
Victor Oliveira	aa9ccee921	Adding some debug information to PassManager llvm-svn: 160446	2012-07-18 19:59:29 +00:00
Chad Rosier	848094e3ce	Whitespace. llvm-svn: 160445	2012-07-18 19:35:16 +00:00
Chandler Carruth	985454e0ac	Fix a somewhat nasty crasher in PR13378. This crashes inside of LiveIntervals due to the two-addr pass generating bogus MI code. The crux of the issue was a loop nesting problem. The intent of the code which attempts to transform instructions before converting them to two-addr form is to defer and reprocess any transformed instructions as the second processing is likely to have more opportunities to coalesce copies, etc. Unfortunately, there was one section of processing that was not deferred -- the INSERT_SUBREG rewriting. Due to quirks of how this rewriting proceeded, not only did it occur early, it removed the bits of information needed for the deferred processing to correctly generate the necessary two address form (specifically inserting a copy), but didn't trigger any immediate assertions and produced what appeared to be already valid two-address from code. Thus, the assertion only fired much later in the pipeline. The fix is to hoist the transformation logic up layer to where it can more firmly defer all further processing, and to teach the normal processing to handle an edge case previously handled as part of the transformation logic. This edge case (already matched tied register operands) needs to not defer any steps. As has been brought up repeatedly in the process: wow does this code need refactoring. I may squeeze in some time to at least bring sanity to this loop... but wow... =] Thanks to Jakob for helpful hints on the way here, and the review. llvm-svn: 160443	2012-07-18 18:58:22 +00:00
Andrew Trick	a22cdb713b	Fix ARMTargetLowering::isLegalAddImmediate to consider thumb encodings. Based on Evan's suggestion without a commitable test. llvm-svn: 160441	2012-07-18 18:34:27 +00:00
Andrew Trick	bc325168c3	whitespace llvm-svn: 160440	2012-07-18 18:34:24 +00:00
Andrew Trick	e002fb5da3	Added unit test for PR13361: LSR + SCEV "hangs" on reasonably sized test. llvm-svn: 160439	2012-07-18 18:07:52 +00:00
Victor Oliveira	a1de408aa7	test commit llvm-svn: 160438	2012-07-18 17:53:05 +00:00
Simon Atanasyan	8856ef886a	Add some missed ELF constants definitions: - section types - dynamic table entries tags - state flags for DT_FLAGS_1 entry The patch reviewed by Rafael Espindola. llvm-svn: 160433	2012-07-18 14:12:32 +00:00
NAKAMURA Takumi	5f8d8eb692	Update config.h.cmake corresponding to config.h.in. llvm-svn: 160431	2012-07-18 09:17:02 +00:00
Nadav Rotem	4c12245b3a	The vbroadcast family of instructions has 'fallback patterns' in case where the load source operand is used by multiple nodes. The v2i64 broadcast was emulated by shuffling the two lower i32 elements to the upper two. We had a bug in the immediate used for the broadcast. Replacing 0 to 0x44. 0x44 means [01\|00\|01\|00] which corresponds to the correct lane. Patch by Michael Kuperstein. llvm-svn: 160430	2012-07-18 08:14:48 +00:00
Jack Carter	a62ba82825	Mips specific inline asm operand modifier 'M': Print the high order register of a double word register operand. In 32 bit mode, a 64 bit double word integer will be represented by 2 32 bit registers. This modifier causes the high order register to be used in the asm expression. It is useful if you are using doubles in assembler and continue to control register to variable relationships. This patch also fixes a related bug in a previous patch: case 'D': // Second part of a double word register operand case 'L': // Low order register of a double word register operand case 'M': // High order register of a double word register operand I got 'D' and 'M' confused. The second part of a double word operand will only match 'M' for one of the endianesses. I had 'L' and 'D' be the opposite twins when 'L' and 'M' are. llvm-svn: 160429	2012-07-18 06:41:36 +00:00
Andrew Trick	0d10225fa2	SCEVTraversal: Add a visited set. Expression trees may be DAGs. Make sure traversal has linear complexity. llvm-svn: 160426	2012-07-18 05:14:03 +00:00
Craig Topper	6bf3ed454a	Remove tab characters. llvm-svn: 160425	2012-07-18 04:59:16 +00:00
Craig Topper	8532423268	Fix typo in error message and remove some tab characters. llvm-svn: 160423	2012-07-18 04:36:35 +00:00
Andrew Trick	0d07dfcd6f	indvars: drive by heuristics fix. Minor oversight noticed by inspection. Sorry no unit test. llvm-svn: 160422	2012-07-18 04:35:13 +00:00
Andrew Trick	c08726627c	indvars: Linear function test replace should avoid reusing undef. Fixes PR13371: indvars pass incorrectly substitutes 'undef' values. I do not like this fix. It's needed until/unless the meaning of undef changes. It attempts to be complete according to the IR spec, but I don't have much confidence in the implementation given the difficulty testing undefined behavior. Worse, this invalidates some of my hard-fought work on indvars and LSR to optimize pointer induction variables. It results benchmark regressions, which I'll track internally. On x86_64 no LTO I see: -3% huffbench -3% 400.perlbench -8% fhourstones My only suggestion for recovering is to change the meaning of undef. If we could trust an arbitrary instruction to produce a some real value that can be manipulated (e.g. incremented) according to non-undef rules, then this case could be easily handled with SCEV. llvm-svn: 160421	2012-07-18 04:35:10 +00:00
Craig Topper	01deb5f2df	Make x86 asm parser to check for xmm vs ymm for index register in gather instructions. Also fix Intel syntax for gather instructions to use 'DWORD PTR' or 'QWORD PTR' to match gas. llvm-svn: 160420	2012-07-18 04:11:12 +00:00
Galina Kistanova	5ac251b81a	Fixed few warnings. llvm-svn: 160419	2012-07-18 04:06:49 +00:00
Nuno Lopes	2151497dca	ignore 'invoke @llvm.donothing', but still keep the edge to the continuation BB llvm-svn: 160411	2012-07-18 00:07:17 +00:00
Joel Jones	b84f7bea09	More replacing of target-dependent intrinsics with target-indepdent intrinsics. The second instruction(s) to be handled are the vector versions of count set bits (ctpop). The changes here are to clang so that it generates a target independent vector ctpop when it sees an ARM dependent vector bits set count. The changes in llvm are to match the target independent vector ctpop and in VMCore/AutoUpgrade.cpp to update any existing bc files containing ARM dependent vector pop counts with target-independent ctpops. There are also changes to an existing test case in llvm for ARM vector count instructions and to a test for the bitcode upgrade. <rdar://problem/11892519> There is deliberately no test for the change to clang, as so far as I know, no consensus has been reached regarding how to test neon instructions in clang; q.v. <rdar://problem/8762292> llvm-svn: 160410	2012-07-18 00:02:16 +00:00
Nuno Lopes	acd8535de0	Apparently it's possible to do an 'invoke asm'. Update the language reference to reflect that. llvm-svn: 160408	2012-07-17 23:51:33 +00:00
Akira Hatanaka	f640f040d1	Clean up Mips16InstrFormats.td and Mips16InstrInfo.td. Patch by Reed Kotler. llvm-svn: 160403	2012-07-17 22:55:34 +00:00
Evan Cheng	f73d7553cc	Add test case for r160387 llvm-svn: 160389	2012-07-17 19:40:05 +00:00
Evan Cheng	e6a3b03ee0	Back out r160101 and instead implement a dag combine to recover from instcombine transformation. llvm-svn: 160387	2012-07-17 18:54:11 +00:00
Jim Grosbach	ab27c5e994	TableGen: Pattern<> references to null_frag are a nop. A standalone pattern defined in a multiclass expansion should handle null_frag references just like patterns on instructions. Follow-up to r160333. llvm-svn: 160384	2012-07-17 18:39:36 +00:00
Jakob Stoklund Olesen	6ca05ebd50	Fix broken ipo_ext_iterator constructors. These functions have obviously never been used before. They should be identical to the idf_ext_iterator counterparts. llvm-svn: 160381	2012-07-17 17:57:25 +00:00
Jakob Stoklund Olesen	0ef031186c	Add some trace output to TwoAddressInstructionPass. llvm-svn: 160380	2012-07-17 17:57:23 +00:00
Benjamin Kramer	7c1598caaa	Remove unused variable. llvm-svn: 160372	2012-07-17 17:00:11 +00:00
Nuno Lopes	216d571af7	simplify getSetSize() per Duncan's comments llvm-svn: 160368	2012-07-17 15:43:59 +00:00
NAKAMURA Takumi	7364c32b33	llvm/test/Transforms/LoopRotate/PhiRename-1.ll: FileCheck-ize. It fixes PR13301. It began choking since Chandler's r159547, possibly due to improper expression on grep from TclParser to ShParser. llvm-svn: 160367	2012-07-17 15:43:17 +00:00
Jakob Stoklund Olesen	c92bde7ba9	Allow for customized graph edge pruning in PostOrderIterator.h Make it possible to prune individual graph edges from a post-order traversal by specializing the po_iterator_storage template. Previously, it was only possible to prune full graph nodes. Edge pruning makes it possible to remove loop back-edges, for example. Also replace the existing DFSetTraits customization hook with a po_iterator_storage method for observing the post-order. DFSetTraits was only used by LoopIterator.h which now provides a po_iterator_storage specialization. Thanks to Sean and Chandler for reviewing. llvm-svn: 160366	2012-07-17 15:35:40 +00:00
Alexey Samsonov	b604ff2a07	Improve behavior of DebugInfoEntryMinimal::getSubprogramName() introduced in r159512. To fetch a subprogram name we should not only inspect the DIE for this subprogram, but optionally inspect its specification, or its abstract origin (even if there is no inlining), or even specification of an abstract origin. Reviewed by Benjamin Kramer. llvm-svn: 160365	2012-07-17 15:28:35 +00:00
Kostya Serebryany	986b8da500	[asan] more code to merge crash callbacks. Doesn't fully work yet, but allows to hold performance experiments llvm-svn: 160361	2012-07-17 11:04:12 +00:00
Nadav Rotem	277a40bc0a	Fix a crash in the legalization of large vectors. When truncating a result of a vector that is split we need to use the result of the split vector, and not re-split the dead node. llvm-svn: 160357	2012-07-17 09:07:37 +00:00
Evan Cheng	780f9b5f92	Implement r160312 as target indepedenet dag combine. llvm-svn: 160354	2012-07-17 08:31:11 +00:00
Simon Atanasyan	bb02d8de47	Revert commit r160307. We decide to move builtins selection to the backend. llvm-svn: 160352	2012-07-17 08:14:45 +00:00
Evan Cheng	47d7be9578	Make sure constant bitwidth is <= 64 bit before calling getSExtValue(). llvm-svn: 160350	2012-07-17 07:47:50 +00:00
Evan Cheng	f579beca6d	This is another case where instcombine demanded bits optimization created large immediates. Add dag combine logic to recover in case the large immediates doesn't fit in cmp immediate operand field. int foo(unsigned long l) { return (l>> 47) == 1; } we produce %shr.mask = and i64 %l, -140737488355328 %cmp = icmp eq i64 %shr.mask, 140737488355328 %conv = zext i1 %cmp to i32 ret i32 %conv which codegens to movq $0xffff800000000000,%rax andq %rdi,%rax movq $0x0000800000000000,%rcx cmpq %rcx,%rax sete %al movzbl %al,%eax ret TargetLowering::SimplifySetCC would transform (X & -256) == 256 -> (X >> 8) == 1 if the immediate fails the isLegalICmpImmediate() test. For x86, that's immediates which are not a signed 32-bit immediate. Based on a patch by Eli Friedman. PR10328 rdar://9758774 llvm-svn: 160346	2012-07-17 06:53:39 +00:00
Andrew Trick	c803706c18	Reapply r160340. LSR: Limit CollectSubexprs. Speculatively fix crashes by code inspection. Can't reproduce them yet. llvm-svn: 160344	2012-07-17 05:30:37 +00:00
Andrew Trick	e834cb465a	Revert "LSR: try not to blow up solving combinatorial problems brute force." Some units tests crashed on a different platform. llvm-svn: 160341	2012-07-17 05:05:21 +00:00
Andrew Trick	7cd6d426b3	LSR: try not to blow up solving combinatorial problems brute force. This places limits on CollectSubexprs to constrains the number of reassociation possibilities. It limits the recursion depth and skips over chains of nested recurrences outside the current loop. Fixes PR13361. Although underlying SCEV behavior is still potentially bad. llvm-svn: 160340	2012-07-17 05:00:56 +00:00
Jim Grosbach	514410ba07	TableGen: Allow conditional instruction pattern in multiclass. Define a 'null_frag' SDPatternOperator node, which if referenced in an instruction Pattern, results in the pattern being collapsed to be as-if '[]' had been specified instead. This allows supporting a multiclass definition where some instaniations have ISel patterns associated and others do not. For example, multiclass myMulti<RegisterClass rc, SDPatternOperator OpNode = null_frag> { def _x : myI<(outs rc:), (ins rc:), []>; def _r : myI<(outs rc:), (ins rc:), [(set rc:, (OpNode rc:))]>; } defm foo : myMulti<GRa, not>; defm bar : myMulti<GRb>; llvm-svn: 160333	2012-07-17 00:47:06 +00:00
Akira Hatanaka	046744467d	Fix function select_cc_f32 in test/CodeGen/Mips/selectcc.ll. llvm-svn: 160329	2012-07-16 23:56:51 +00:00
Owen Anderson	8a503f2d8d	Defer checking for registers in the MC AsmMatcher until the after user-defined match classes have been checked. This allows the creation of MatchClass's that are supersets of a register class. llvm-svn: 160327	2012-07-16 23:20:09 +00:00
Nuno Lopes	482fb19fd5	fix PR13339 (remove the predecessor from the unwind BB when removing an invoke) llvm-svn: 160325	2012-07-16 22:49:40 +00:00
Nuno Lopes	986cc181b0	teach ConstantRange that zero times X is always zero llvm-svn: 160317	2012-07-16 20:47:16 +00:00
Evan Cheng	75315b877c	For something like uint32_t hi(uint64_t res) { uint_32t hi = res >> 32; return !hi; } llvm IR looks like this: define i32 @hi(i64 %res) nounwind uwtable ssp { entry: %lnot = icmp ult i64 %res, 4294967296 %lnot.ext = zext i1 %lnot to i32 ret i32 %lnot.ext } The optimizer has optimize away the right shift and truncate but the resulting constant is too large to fit in the 32-bit immediate field. The resulting x86 code is worse as a result: movabsq $4294967296, %rax ## imm = 0x100000000 cmpq %rax, %rdi sbbl %eax, %eax andl $1, %eax This patch teaches the x86 lowering code to handle ult against a large immediate with trailing zeros. It will issue a right shift and a truncate followed by a comparison against a shifted immediate. shrq $32, %rdi testl %edi, %edi sete %al movzbl %al, %eax It also handles a ugt comparison against a large immediate with trailing bits set. i.e. X > 0x0ffffffff -> (X >> 32) >= 1 rdar://11866926 llvm-svn: 160312	2012-07-16 19:35:43 +00:00
Nadav Rotem	60f7904db7	Minor cleanup and docs. llvm-svn: 160311	2012-07-16 18:56:39 +00:00
Simon Atanasyan	ef2128c12c	MIPS: Create two definitions for __builtin_mips_shll_qb builtin. The first variant accepts immediate number as the second argument. The second variant accepts register operand as the second argument. llvm-svn: 160307	2012-07-16 18:51:39 +00:00
Nadav Rotem	839a06e9d7	Make ComputeDemandedBits return a deterministic result when computing an AssertZext value. In the added testcase the constant 55 was behind an AssertZext of type i1, and ComputeDemandedBits reported that some of the bits were both known to be one and known to be zero. Together with Michael Kuperstein <michael.m.kuperstein@intel.com> llvm-svn: 160305	2012-07-16 18:34:53 +00:00
Tom Stellard	1be1aa84ec	Revert "AMDGPU: Add core backend files for R600/SI codegen v6" This reverts commit 4ea70107c5e51230e9e60f0bf58a0f74aa4885ea. llvm-svn: 160303	2012-07-16 18:19:53 +00:00
Tom Stellard	adf452260f	Revert "include/llvm: Add R600 Intrinsics v6" This reverts commit 600f7a90f3eef4c5108179b43e27cfd9e5de7cdc. llvm-svn: 160302	2012-07-16 18:19:48 +00:00
Tom Stellard	95bd0be903	Revert "Build script changes for R600/SI Codegen v6" This reverts commit e3013202259ed1e006c21817c63cf25d75982721. llvm-svn: 160301	2012-07-16 18:19:46 +00:00
Tom Stellard	fc3db614c0	Revert "test/CodeGen/R600: Add some basic tests v6" This reverts commit 11d3457afcda7848448dd7f11b2ede6552ffb9ea. llvm-svn: 160300	2012-07-16 18:19:43 +00:00
Tom Stellard	151dc338e4	Revert "Target/AMDGPU/R600KernelParameters.cpp: Fix two includes, <llvm/IRBuilder.h> and <llvm/TypeBuilder.h>" This reverts commit 0258a6bdd30802f5cc0e8e57c8e768fde2aef590. llvm-svn: 160299	2012-07-16 18:19:41 +00:00
Tom Stellard	1bd3012505	Revert "Target/AMDGPU: [CMake] Fix dependencies. 1) Add intrinsics_gen. Add AMDGPUCommonTableGen." This reverts commit ebc934ba32ee71abbb8f0f2eb6a0fbaa613ba0d2. llvm-svn: 160298	2012-07-16 18:19:40 +00:00
Tom Stellard	781853e11f	Revert "Target/AMDGPU/R600KernelParameters.cpp: Don't use "and", "or" as conditional operator..." This reverts commit 29f28bc14ad5a907f5dc849f004fafeec0aab33a. llvm-svn: 160297	2012-07-16 18:19:38 +00:00
Tom Stellard	2e007de42d	Revert "Target/AMDGPU/AMDILIntrinsicInfo.cpp: Use llvm_unreachable() in nonreturn function, instead of assert(0)." This reverts commit 4ba4acc1bc2561b944a571edbb6a2dc78e357dfe. llvm-svn: 160296	2012-07-16 18:19:37 +00:00
Tom Stellard	f65e78b2fa	Revert "Target/AMDGPU: Fix includes, or msvc build failed." This reverts commit fef4aa1b16fcf7a472559abbbcf4c1adc9eb5ca6. llvm-svn: 160295	2012-07-16 18:19:32 +00:00
Nuno Lopes	99504c577c	make ConstantRange::getSetSize() properly compute the size of wrapped and full sets. Make it always return APInts with the same bitwidth for the same ConstantRange bitwidth to simply clients llvm-svn: 160294	2012-07-16 18:08:12 +00:00
Chad Rosier	10e8207c9e	With r160248 in place this code is no longer needed. llvm-svn: 160293	2012-07-16 17:42:13 +00:00
Kostya Serebryany	c4ce5dfe2d	[asan] a bit more refactoring, addressed some of the style comments from chandlerc, partially implemented crash callback merging (under flag) llvm-svn: 160290	2012-07-16 17:12:07 +00:00
Aaron Ballman	ed9b0a9114	MSVC's implementation of isalnum will assert on characters > 255, so we need to use an unsigned char to ensure the integer promotion happens properly. This fixes an assert in debug builds with CodeGen\X86\utf8.ll llvm-svn: 160286	2012-07-16 16:18:18 +00:00
Kostya Serebryany	874dae6119	[asan] refactor instrumentation to allow merging the crash callbacks (not fully implemented yet, no functionality change except the BB order) llvm-svn: 160284	2012-07-16 16:15:40 +00:00
NAKAMURA Takumi	96cc5e5bf9	Target/AMDGPU: Fix includes, or msvc build failed. llvm-svn: 160280	2012-07-16 15:43:50 +00:00
NAKAMURA Takumi	dc4261794f	Target/AMDGPU/AMDILIntrinsicInfo.cpp: Use llvm_unreachable() in nonreturn function, instead of assert(0). llvm-svn: 160279	2012-07-16 15:43:09 +00:00
NAKAMURA Takumi	5f5fd8e545	Target/AMDGPU/R600KernelParameters.cpp: Don't use "and", "or" as conditional operator... llvm-svn: 160278	2012-07-16 15:42:35 +00:00
Jack Carter	f649043aa5	Doubleword Shift Left Logical Plus 32 Mips shift instructions DSLL, DSRL and DSRA are transformed into DSLL32, DSRL32 and DSRA32 respectively if the shift amount is between 32 and 63 Here is a description of DSLL: Purpose: Doubleword Shift Left Logical Plus 32 To execute a left-shift of a doubleword by a fixed amount--32 to 63 bits Description: GPR[rd] <- GPR[rt] << (sa+32) The 64-bit doubleword contents of GPR rt are shifted left, inserting zeros into the emptied bits; the result is placed in GPR rd. The bit-shift amount in the range 0 to 31 is specified by sa. This patch implements the direct object output of these instructions. llvm-svn: 160277	2012-07-16 15:14:51 +00:00
NAKAMURA Takumi	bb42a5e2cf	Target/AMDGPU: [CMake] Fix dependencies. 1) Add intrinsics_gen. Add AMDGPUCommonTableGen. llvm-svn: 160276	2012-07-16 15:09:11 +00:00
NAKAMURA Takumi	3128d26124	Target/AMDGPU/R600KernelParameters.cpp: Fix two includes, <llvm/IRBuilder.h> and <llvm/TypeBuilder.h> llvm-svn: 160275	2012-07-16 15:08:47 +00:00
Alexey Samsonov	893d3d336a	Fix tests that failed on i686-win32 after r160248: 1. FileCheck-ize epilogue.ll and allow another asm instruction to restore %rsp. 2. Remove check in widen_arith-3.ll that was hitting instruction in epilogue instead of vector add. llvm-svn: 160274	2012-07-16 14:33:36 +00:00
Tom Stellard	6693fbe3eb	test/CodeGen/R600: Add some basic tests v6 llvm-svn: 160273	2012-07-16 14:17:19 +00:00
Tom Stellard	812e652b43	Build script changes for R600/SI Codegen v6 llvm-svn: 160272	2012-07-16 14:17:16 +00:00
Tom Stellard	ee1812b94f	include/llvm: Add R600 Intrinsics v6 llvm-svn: 160271	2012-07-16 14:17:14 +00:00
Tom Stellard	bcce80fa95	AMDGPU: Add core backend files for R600/SI codegen v6 llvm-svn: 160270	2012-07-16 14:17:08 +00:00
Kostya Serebryany	4273bb05d1	[asan] initialize asan error callbacks in runOnModule instead of doing that on-demand llvm-svn: 160269	2012-07-16 14:09:42 +00:00
Nadav Rotem	4968e45b9f	Fix a bug in the 3-address conversion of LEA when one of the operands is an undef virtual register. The problem is that ProcessImplicitDefs removes the definition of the register and marks all uses as undef. If we lose the undef marker then we get a register which has no def, is not marked as undef. The live interval analysis does not collect information for these virtual registers and we crash in later passes. Together with Michael Kuperstein <michael.m.kuperstein@intel.com> llvm-svn: 160260	2012-07-16 10:52:25 +00:00
Chandler Carruth	8b540ab337	Revert r160254 temporarily. It turns out that ASan relied on the at-the-end block insertion order to (purely by happenstance) disable some LLVM optimizations, which in turn start firing when the ordering is made more "normal". These optimizations in turn merge many of the instrumentation reporting calls which breaks the return address based error reporting in ASan. We're looking at several different options for fixing this. llvm-svn: 160256	2012-07-16 10:01:02 +00:00
Chandler Carruth	3dd6c81492	Teach AddressSanitizer to create basic blocks in a more natural order. This is particularly useful to the backend code generators which try to process things in the incoming function order. Also, cleanup some uses of IRBuilder to be a bit simpler and more clear. llvm-svn: 160254	2012-07-16 08:58:53 +00:00
Chandler Carruth	663943e23e	Add a basic test for AddressSanitizer. This is just a bare-bones functionality test. In general, unless the functionality is substantially separated, we should lump more basic testing into this file. The test running infrastructure likes having a few test files with more comprehensive testing within them. llvm-svn: 160253	2012-07-16 08:56:46 +00:00
Chandler Carruth	f5fe556c70	Add support for attaching branch weight metadata directly from the IRBuilder. Added a basic unit test for this with CreateCondBr. I didn't go all the way and test the switch side as the boilerplate for setting up the switch IRBuilder unit tests is a lot more. Fortunately, the two share all the interesting code paths. llvm-svn: 160251	2012-07-16 07:45:06 +00:00
Chandler Carruth	b39c55f4be	Add a boring bit of boilerplate to start testing IRBuilder::CreateCondBr. This is in anticipation of changing CreateCondBr and wanting to test those changes. llvm-svn: 160250	2012-07-16 07:44:51 +00:00
Chandler Carruth	ebabadf933	Move the IRBuilder unittest from Support to VMCore. This got missed in the original move of IRBuilder. llvm-svn: 160249	2012-07-16 07:44:45 +00:00
Alexey Samsonov	dcc1291d17	This CL changes the function prologue and epilogue emitted on X86 when stack needs realignment. It is intended to fix PR11468. Old prologue and epilogue looked like this: push %rbp mov %rsp, %rbp and $alignment, %rsp push %r14 push %r15 ... pop %r15 pop %r14 mov %rbp, %rsp pop %rbp The problem was to reference the locations of callee-saved registers in exception handling: locations of callee-saved had to be re-calculated regarding the stack alignment operation. It would take some effort to implement this in LLVM, as currently MachineLocation can only have the form "Register + Offset". Funciton prologue and epilogue are now changed to: push %rbp mov %rsp, %rbp push %14 push %15 and $alignment, %rsp ... lea -$size_of_saved_registers(%rbp), %rsp pop %r15 pop %r14 pop %rbp Reviewed by Chad Rosier. llvm-svn: 160248	2012-07-16 06:54:09 +00:00
Chandler Carruth	36e2ecf528	Move llvm/Support/TypeBuilder.h -> llvm/TypeBuilder.h. This completes the move of *Builder classes into the Core library. No uses of this builder in Clang or DragonEgg I could find. If there is a desire to have an IR-building-support library that contains all of these builders, that can be easily added, but currently it seems likely that these add no real overhead to VMCore. llvm-svn: 160243	2012-07-15 23:45:24 +00:00
Chandler Carruth	d9d363f8d7	Update the header guard I missed when moving the header. llvm-svn: 160242	2012-07-15 23:45:20 +00:00
Chandler Carruth	ec7ad6561f	Move llvm/Support/MDBuilder.h to llvm/MDBuilder.h, to live with IRBuilder, DIBuilder, etc. This is the proper layering as MDBuilder can't be used (or implemented) without the Core Metadata representation. Patches to Clang and Dragonegg coming up. llvm-svn: 160237	2012-07-15 23:26:50 +00:00
Nadav Rotem	3050e07108	Fix a bug in the scalarization of BUILD_VECTOR. BUILD_VECTOR elements may be wider than the output element type. Make sure to trunc them if needed. Together with Michael Kuperstein <michael.m.kuperstein@intel.com> llvm-svn: 160235	2012-07-15 20:39:08 +00:00
Nadav Rotem	eec74c7279	Teach getTargetVShiftNode about TargetConstant nodes. llvm-svn: 160234	2012-07-15 20:27:43 +00:00
NAKAMURA Takumi	032dc0a06c	llvm/test/CodeGen/X86/2012-07-15-broadcastfold.ll: Rewrite expressions to fit various targets. - Make sure existence of "barrier". - Confirm reload corresponding to spill. llvm-svn: 160232	2012-07-15 14:38:35 +00:00
Nadav Rotem	ee3552f88d	Rename VBROADCASTSDrm into VBROADCASTSDYrm to match the naming convention. Allow the folding of vbroadcastRR to vbroadcastRM, where the memory operand is a spill slot. PR12782. Together with Michael Kuperstein <michael.m.kuperstein@intel.com> llvm-svn: 160230	2012-07-15 12:26:30 +00:00
Nadav Rotem	a62368c965	Refactor the code that checks that all operands of a node are UNDEFs. Add a micro-optimization to getNode of CONCAT_VECTORS when both operands are undefs. Can't find a testcase for this because VECTOR_SHUFFLE already handles undef operands, but Duncan suggested that we add this. Together with Michael Kuperstein <michael.m.kuperstein@intel.com> llvm-svn: 160229	2012-07-15 08:38:23 +00:00
Chandler Carruth	db5536f09d	Reapply r160194, switching to use LV information for finding local kills. The notable fix is to look at any dependencies attached to the kill instruction (or other instructions between MI nad the kill) where the dependencies are specific to the register in question. The old code implicitly handled this by rejecting the transform if any other uses were found within the block, but after the start point. The new code directly finds the kill, and has to re-use the existing dependency scan to check for non-kill uses. This was caught by self-host, but I found the bug via inspection and use of absurd assert scaffolding to compute the kills in two ways and compare them. So I have no useful testcase for this other than "bootstrap". I'd work harder to reduce a test case if this particular code were likely to live for a long time. Thanks to Benjamin Kramer for reviewing the fix itself. llvm-svn: 160228	2012-07-15 03:29:46 +00:00
Eric Christopher	abb6ffd9b3	Move IsSameValue from clang's ASTImporter to be methods on the APInt/APSInt classes. Part of rdar://11875995 llvm-svn: 160223	2012-07-15 00:23:36 +00:00
Nadav Rotem	9466e81df6	AVX: Fix a bug in getTargetVShiftNode. The shift amount has to be a 128bit vector with the same element type as the input vector. This is needed because of the patterns we have for the VP[SLL/SRA/SRL][W/D/Q] instructions. llvm-svn: 160222	2012-07-14 22:26:05 +00:00
Nadav Rotem	018921002e	Add a dagcombine optimization to convert concat_vectors of undefs into a single undef. The unoptimized concat_vectors isd prevented the canonicalization of the vector_shuffle node. llvm-svn: 160221	2012-07-14 21:30:27 +00:00
Jakob Stoklund Olesen	8f324a2cc8	Account for early-clobber reload instructions. No test case, there are no in-tree targets that require this. llvm-svn: 160219	2012-07-14 18:45:35 +00:00
Jakob Stoklund Olesen	3d604ab933	Be more verbose when detecting dominance problems. Catch uses of undefined physregs that haven't been added to basic block live-in lists. Run the verifier to pinpoint the problem. Also run the verifier when a virtual register use is not jointly dominated by defs. llvm-svn: 160207	2012-07-13 23:39:05 +00:00
Andrew Trick	653513b8dd	LSR Fix: check SCEV expression safety before expansion. All SCEV expressions used by LSR formulae must be safe to expand. i.e. they may not contain UDiv unless we can prove nonzero denominator. Fixes PR11356: LSR hoists UDiv. llvm-svn: 160205	2012-07-13 23:33:10 +00:00
Andrew Trick	ee76065b7a	IVUsers should only generate SCEV's for values that are safe to speculate. This allows SCEVExpander to run on the IV expressions. This codifies an assumption made by LSR to complete the fix for PR11356, but I haven't been able to generate a separate unit test for this part. I'm adding it as an extra safety check. llvm-svn: 160204	2012-07-13 23:33:05 +00:00
Andrew Trick	365e31c36c	Factor SCEV traversal code so I can use it elsewhere. No functionality. llvm-svn: 160203	2012-07-13 23:33:03 +00:00
Joel Jones	43cb87839c	This is one of the first steps at moving to replace target-dependent intrinsics with target-indepdent intrinsics. The first instruction(s) to be handled are the vector versions of count leading zeros (ctlz). The changes here are to clang so that it generates a target independent vector ctlz when it sees an ARM dependent vector ctlz. The changes in llvm are to match the target independent vector ctlz and in VMCore/AutoUpgrade.cpp to update any existing bc files containing ARM dependent vector ctlzs with target-independent ctlzs. There are also changes to an existing test case in llvm for ARM vector count instructions and a new test for the bitcode upgrade. <rdar://problem/11831778> There is deliberately no test for the change to clang, as so far as I know, no consensus has been reached regarding how to test neon instructions in clang; q.v. <rdar://problem/8762292> llvm-svn: 160200	2012-07-13 23:25:25 +00:00
Chandler Carruth	9c97cd5672	Revert r160194, which switched to use LV information for finding local kills. This is causing miscompiles that I'm working on tracking down. llvm-svn: 160196	2012-07-13 22:23:32 +00:00
Chandler Carruth	58c470dc68	Use the LiveVariables information to efficiently get local kills. This removes the largest scaling problem in the test cases from PR13225 when ASan is switched to insert basic blocks in the natural CFG order. It may also solve some scaling problems for more normal code with large numbers of basic blocks and variables. llvm-svn: 160194	2012-07-13 21:18:38 +00:00
Galina Kistanova	8aded18c5d	Fixed few warnings. llvm-svn: 160192	2012-07-13 21:06:54 +00:00
Jakob Stoklund Olesen	ed6c0408fa	Remove variable_ops from call instructions in most targets. Call instructions are no longer required to be variadic, and variable_ops should only be used for instructions that encode a variable number of arguments, like the ARM stm/ldm instructions. llvm-svn: 160189	2012-07-13 20:44:29 +00:00
Jakob Stoklund Olesen	6a81d30269	Remove variable_ops from ARM call instructions. Function argument registers are added to the call SDNode, but InstrEmitter now knows how to make those operands implicit, and the call instruction doesn't have to be variadic. Explicit register operands should only be those that are encoded in the instruction, implicit register operands are for extra dependencies like call argument and return values. llvm-svn: 160188	2012-07-13 20:27:00 +00:00
Jack Carter	5ddcfda8ef	The Mips specific relocation R_MIPS_GOT_DISP is used in cases where global symbols are directly represented in the GOT and we use an offset into the global offset table. This patch adds direct object support for R_MIPS_GOT_DISP. llvm-svn: 160183	2012-07-13 19:15:47 +00:00
Jack Carter	2e3358a0f8	test case for revision 160084: Alignment filling between Mips function units llvm-svn: 160177	2012-07-13 18:14:01 +00:00
Benjamin Kramer	abbfe69356	Make helper functions static. llvm-svn: 160173	2012-07-13 13:25:15 +00:00
Alexander Kornienko	73221f5624	Initializers for some fields were missing in Option::Option llvm-svn: 160170	2012-07-13 12:55:23 +00:00
Hans Wennborg	e2679c50d7	ReleaseNotes.html: add note about specifying TLS models llvm-svn: 160168	2012-07-13 12:44:23 +00:00
Duncan Sands	5a5928a5eb	Post-dom frontier was removed in 3.0. Patch by chenwj. llvm-svn: 160166	2012-07-13 10:11:28 +00:00
Duncan Sands	a9c373e49d	Restrict this to x86, hopefully fixing ARM buildbots. llvm-svn: 160163	2012-07-13 07:02:00 +00:00
Craig Topper	b3bac4908e	Mark VINSERTI128rm as MayLoad=1. Fixes PR13348. llvm-svn: 160162	2012-07-13 05:46:28 +00:00
Galina Kistanova	fc25990582	Fixed few warnings; trimmed empty lines. llvm-svn: 160159	2012-07-13 01:25:27 +00:00
Jim Grosbach	1af8c8060c	Provide function name in 'Cannot select' fatal error. When dumping the DAG for a fatal 'Cannot select' back-end error, also provide the name of the function the construct is in. Useful when dealing with large testcases, as the next step is to llvm-extract the function in question to get a small(er) testcase. llvm-svn: 160152	2012-07-13 00:29:09 +00:00
Eric Christopher	bf57091f8b	The end of the prologue should be marked with is_stmt. Fixes PR13303. Patch by Paul Robinson! llvm-svn: 160148	2012-07-12 23:30:25 +00:00
Jim Grosbach	5f111b2721	TableGen: Assembly matcher 'insufficient operands' diagnostic. Make sure the tblgen'erated asm matcher correctly returns numoperands+1 as the ErrorInfo when the problem was that there weren't enough operands specified. rdar://9142751 llvm-svn: 160144	2012-07-12 21:37:20 +00:00
Akira Hatanaka	a13cd0666e	Fix check strings in test/MC/Disassembler/Mips/* and run FileCheck. Patch by Vladimir Medic. llvm-svn: 160143	2012-07-12 21:19:32 +00:00
Galina Kistanova	7da6578291	Fixed few warnings. llvm-svn: 160142	2012-07-12 20:45:36 +00:00
Benjamin Kramer	4d0916788d	Give the rdrand instructions a SideEffect flag and a chain so MachineCSE and MachineLICM don't touch it. I already had the necessary things in place for IR-level passes but missed the machine passes. llvm-svn: 160137	2012-07-12 18:14:57 +00:00
Eric Christopher	54c39e0688	Regenerate. llvm-svn: 160134	2012-07-12 17:59:12 +00:00
Nadav Rotem	fdce33a495	The LIT tests below do not specify the exact cpu model and fail on AVX2 machines, because we select different instructions such as vbroadcast, new shuffles, etc. Patch by Michael Liao. llvm-svn: 160129	2012-07-12 13:45:15 +00:00
Gabor Greif	c6c28ff8e6	detabify llvm-svn: 160128	2012-07-12 13:18:13 +00:00
Gabor Greif	1e71896bb4	fix typo in generated comment llvm-svn: 160127	2012-07-12 13:05:12 +00:00
NAKAMURA Takumi	f415fe70f3	llvm/test/CodeGen/X86/rdrand.ll: Relax expression corresponding to Win64 CC. llvm-svn: 160124	2012-07-12 10:22:57 +00:00
NAKAMURA Takumi	0b00f994a6	llvm/test/CMakeLists.txt: Add llvm-diff to deps. llvm-svn: 160123	2012-07-12 10:15:48 +00:00
Benjamin Kramer	cbac2f3bc9	Use %s instead of the explicit name, the latter doesn't work in out-of-tree builds. llvm-svn: 160120	2012-07-12 09:36:29 +00:00
Benjamin Kramer	0ab2794eda	Add intrinsics for Ivy Bridge's rdrand instruction. The rdrand/cmov sequence is the same that is emitted by both GCC and ICC. Fixes PR13284. llvm-svn: 160117	2012-07-12 09:31:43 +00:00
Duncan Sands	671cc2575d	The result type of EXTRACT_VECTOR_ELT doesn't have to match the element type of the input vector, it can be bigger (this is helpful for powerpc where <2 x i16> is a legal vector type but i16 isn't a legal type, IIRC). However this wasn't being taken into account by ExpandRes_EXTRACT_VECTOR_ELT, causing PR13220. Lightly tweaked version of a patch by Michael Liao. llvm-svn: 160116	2012-07-12 09:01:35 +00:00
Craig Topper	f7755df776	Update GATHER instructions to support 2 read-write operands. Patch from myself and Manman Ren. llvm-svn: 160110	2012-07-12 06:52:41 +00:00
Evan Cheng	493eb32ff4	Instcombine was transforming: %shr = lshr i64 %key, 3 %0 = load i64* %val, align 8 %sub = add i64 %0, -1 %and = and i64 %sub, %shr ret i64 %and to: %shr = lshr i64 %key, 3 %0 = load i64* %val, align 8 %sub = add i64 %0, 2305843009213693951 %and = and i64 %sub, %shr ret i64 %and The demanded bit optimization is actually a pessimization because add -1 would be codegen'ed as a sub 1. Teach the demanded constant shrinking optimization to check for negated constant to make sure it is actually reducing the width of the constant. rdar://11793464 llvm-svn: 160101	2012-07-12 01:45:35 +00:00
Jim Grosbach	d2aabd3bb2	TableGen: Location information for diagnostic. def Pat<...>; Results in 'record name is not a string!' diagnostic. Not the best, but the lack of location information moves it from not very helpful into completely useless. We're in the Record class when throwing the error, so just add the location info directly. llvm-svn: 160098	2012-07-12 00:53:31 +00:00
Manman Ren	88a0d3313b	ARM: fix typo in comments llvm-svn: 160093	2012-07-11 23:47:00 +00:00
Manman Ren	34cb93e192	ARM: Fix optimizeCompare to correctly check safe condition. It is safe if CPSR is killed or re-defined. When we are done with the basic block, check whether CPSR is live-out. Do not optimize away cmp if CPSR is live-out. llvm-svn: 160090	2012-07-11 22:51:44 +00:00
Jack Carter	570ae0b1f7	Patch for Mips direct object generation. When WriteFragmentData() case FT_align called Asm.getBackend().writeNopData() is called, nothing is done since Mips implementation of writeNopData just returned "true". For some reason this has not caused problems in 32 bit mode, but in 64 bit mode it caused an assert when processing multiple function units. The test case included will assert without this patch. It runs twice with different flags to prevent false positives due to changes in code generation over time. llvm-svn: 160084	2012-07-11 22:17:39 +00:00
Chad Rosier	26b8e1d03f	Fixup broken doc link. Patch by Sean Silva <silvas@purdue.edu>. llvm-svn: 160082	2012-07-11 21:49:14 +00:00
Jack Carter	42ebf98b04	This change removes an "initialization" warning. Even though variable in question could not be initialized before use, the code was such that the compiler had no way of knowing that. llvm-svn: 160081	2012-07-11 21:41:49 +00:00
Stepan Dyatkovskiy	326edc579a	Fixed diff comparison. llvm-svn: 160076	2012-07-11 21:02:57 +00:00
Argyrios Kyrtzidis	f141156e6c	In MemoryBuffer::getOpenFile() don't verify that the mmap'ed file buffer is null-terminated. If the file is smaller than we thought, mmap will not allow dereferencing past the pages that are enough to cover the actual file size, even though we asked for a larger address range. rdar://11612916 llvm-svn: 160075	2012-07-11 20:59:20 +00:00
Akira Hatanaka	bb5519154c	In register classes in MipsRegisterInfo.td, list the registers in ascending order of binary encoding. Patch by Vladimir Medic. llvm-svn: 160073	2012-07-11 20:51:50 +00:00
Chad Rosier	8446ede023	[x86 fast-isel] Per discussion with Eric, add all cases to switch with verbose comments. llvm-svn: 160069	2012-07-11 19:58:38 +00:00
Akira Hatanaka	20dced4dbb	Test case for r160036. llvm-svn: 160067	2012-07-11 19:50:46 +00:00
Manman Ren	1553ce0e81	X86: Update to peephole optimization to move Movr0 before (Sub, Cmp) pair. When Movr0 is between sub and cmp, we move Movr0 before sub if it enables removal of Cmp. llvm-svn: 160066	2012-07-11 19:35:12 +00:00
Akira Hatanaka	24cf4e36e5	Implement MipsTargetLowering::LowerSELECT_CC to custom lower SELECT_CC. llvm-svn: 160064	2012-07-11 19:32:27 +00:00
Evan Cheng	b17122859b	InstrEmitter::EmitSubregNode() optimize extract_subreg in this case: r1025 = s/zext r1024, 4 r1026 = extract_subreg r1025, 4 to a copy: r1026 = copy r1024 This is correct. However it uses TII->isCoalescableExtInstr() which can return true for instructions which essentially does a sext_in_reg so this can end up with an illegal copy where the source and destination register classes do not match. Add a check to avoid it. Sorry, no test case possible at this time. rdar://11849816 llvm-svn: 160059	2012-07-11 18:55:07 +00:00
Benjamin Kramer	3aab6a86a2	PR13326: Fix a subtle edge case in the udiv -> magic multiply generator. This caused 6 of 65k possible 8 bit udivs to be wrong. llvm-svn: 160058	2012-07-11 18:31:59 +00:00
Tom Stellard	73daa0f740	test commit llvm-svn: 160056	2012-07-11 17:34:12 +00:00
Chad Rosier	43218c59c3	[x86 fast-isel] Rather then call llvm_unreachable() have fast-isel fall back to Selection DAG isel. Patch by Andrew Kaylor <andrew.kaylor@intel.com>. llvm-svn: 160055	2012-07-11 17:23:17 +00:00
Nadav Rotem	d2bdcebb14	When ext-loading and trunc-storing vectors to memory, on x86 32bit systems, allow loads/stores of 64bit values from xmm registers. llvm-svn: 160044	2012-07-11 13:27:05 +00:00
Nadav Rotem	2a148668b6	Rename many of the Tmp1, Tmp2, Tmp3 variables to names such as Chain, Value, Ptr, etc. No functionality change. llvm-svn: 160042	2012-07-11 11:02:16 +00:00
Benjamin Kramer	9488100d46	Remove unused variable. llvm-svn: 160040	2012-07-11 09:39:04 +00:00
Nadav Rotem	de6fd282ef	Refactor the DAG Legalizer by extracting the legalization of Load and Store nodes into their own functions. No functional change. llvm-svn: 160037	2012-07-11 08:52:09 +00:00
Owen Anderson	b8844d6744	Only apply the SETCC+SITOFP -> SELECTCC optimization when the SETCC returns an MVT::i1, i.e. before type legalization. This is a speculative fix for a problem on Mips reported by Akira Hatanaka. llvm-svn: 160036	2012-07-11 06:38:55 +00:00
Akira Hatanaka	878ad8b28d	Lower RETURNADDR node in Mips backend. Patch by Sasa Stankovic. llvm-svn: 160031	2012-07-11 00:53:32 +00:00
Jack Carter	e8cb2fc616	Mips specific inline asm operand modifier 'L'. Low order register of a double word register operand. Operands are defined by the name of the variable they are marked with in the inline assembler code. This is a way to specify that the operand just refers to the low order register for that variable. It is the opposite of modifier 'D' which specifies the high order register. Example: main() { long long ll_input = 0x1111222233334444LL; long long ll_val = 3; int i_result = 0; __asm__ __volatile__( "or %0, %L1, %2" : "=r" (i_result) : "r" (ll_input), "r" (ll_val)); } Which results in: lui $2, %hi(_gp_disp) addiu $2, $2, %lo(_gp_disp) addiu $sp, $sp, -8 addu $2, $2, $25 sw $2, 0($sp) lui $2, 13107 ori $3, $2, 17476 <-- Low 32 bits of ll_input lui $2, 4369 ori $4, $2, 8738 <-- High 32 bits of ll_input addiu $5, $zero, 3 <-- Low 32 bits of ll_val addiu $2, $zero, 0 <-- High 32 bits of ll_val #APP or $3, $4, $5 <-- or i_result, high 32 ll_input, low 32 of ll_val #NO_APP addiu $sp, $sp, 8 jr $ra If not direction is done for the long long for 32 bit variables results in using the low 32 bits as ll_val shows. There is an existing bug if 'L' or 'D' is used for the destination register for 32 bit long longs in that the target value will be updated incorrectly for the non-specified part unless explicitly set within the inline asm code. llvm-svn: 160028	2012-07-10 22:41:20 +00:00
Jakob Stoklund Olesen	bc90a4ea82	Require and preserve LoopInfo for early if-conversion. It will surely be needed by heuristics. llvm-svn: 160027	2012-07-10 22:39:56 +00:00
Chandler Carruth	2207f76cd4	Teach the LiveInterval::join function to use the fast merge algorithm, generalizing its implementation sufficiently to support this value number scenario as well. This cuts out another significant performance hit in large functions (over 10k basic blocks, etc), especially those with "natural" CFG structures. llvm-svn: 160026	2012-07-10 22:25:21 +00:00
Jakob Stoklund Olesen	02638392c1	Run early if-conversion in domtree post-order. This ordering allows nested if-conversion without using a work list, and it makes it possible to update the dominator tree on the fly as well. Any erased basic blocks will always be dominated by the current post-order position, so the domtree can be pruned without invalidating the iterator. llvm-svn: 160025	2012-07-10 22:18:23 +00:00
Chad Rosier	97c2214277	Move [get\|set]BasePtrStackAdjustment() from MachineFrameInfo to X86MachineFunctionInfo as this is currently only used by X86. If this ever becomes an issue on another arch (e.g., ARM) then we can hoist it back out. llvm-svn: 160009	2012-07-10 18:27:15 +00:00
Chad Rosier	3ee9a4c29e	Add newline. llvm-svn: 160006	2012-07-10 17:57:00 +00:00
Chad Rosier	579b1fee6b	Add test case accidentally omitted from r160002. llvm-svn: 160004	2012-07-10 17:49:39 +00:00
Chad Rosier	bdb08ac50a	Add support for dynamic stack realignment in the presence of dynamic allocas on X86. Basically, this is a reapplication of r158087 with a few fixes. Specifically, (1) the stack pointer is restored from the base pointer before popping callee-saved registers and (2) in obscure cases (see comments in patch) we must cache the value of the original stack adjustment in the prologue and apply it in the epilogue. rdar://11496434 llvm-svn: 160002	2012-07-10 17:45:53 +00:00
Chandler Carruth	77d940011d	Fix a bug where I didn't test for an empty range before inspecting the back of it. I don't have anything even remotely close to a test case for this. It only broke two build bots, both of them doing bootstrap builds, one of them a dragonegg bootstrap. It doesn't break for me when I bootstrap either. It doesn't reproduce every time or on many machines during the bootstrap. Many thanks to Duncan Sands who got the exact command (and stage of the bootstrap) which failed on the dragonegg bootstrap and managed to get it to trigger under valgrind with debug symbols. The fix was then found by inspection. llvm-svn: 159993	2012-07-10 15:41:33 +00:00
Nadav Rotem	d908ddc186	Improve the loading of load-anyext vectors by allowing the codegen to load multiple scalars and insert them into a vector. Next, we shuffle the elements into the correct places, as before. Also fix a small dagcombine bug in SimplifyBinOpWithSameOpcodeHands, when the migration of bitcasts happened too late in the SelectionDAG process. llvm-svn: 159991	2012-07-10 13:25:08 +00:00
Richard Barton	1dc44dcedd	Fix instruction description of VMOV (between two ARM core registers and two single-precision resiters) (and do it properly this time! llvm-svn: 159989	2012-07-10 12:51:09 +00:00
Craig Topper	be41e2daa6	Reverse assembler/disassembler operand order for gather instructions. llvm-svn: 159983	2012-07-10 06:38:33 +00:00
Chandler Carruth	e18614dd17	Add an efficient merge operation to LiveInterval and use it to avoid quadratic behavior when performing pathological merges. Fixes the core element of PR12652. There is only one user of addRangeFrom left: join. I'm hoping to refactor further in a future patch and have join use this merge operation as well. llvm-svn: 159982	2012-07-10 05:16:17 +00:00
Chandler Carruth	ac766b9b42	Teach LiveIntervals how to verify themselves and start using it in some of the trick merge routines. This adds a layer of testing that was necessary when implementing more efficient (and complex) merge logic for this datastructure. No functionality changed here. llvm-svn: 159981	2012-07-10 05:06:03 +00:00
Jim Grosbach	16b43dbbfe	ARM: Allow more flexible patterns in NEON formats. Some NEON instructions want to match against normal SDNodes for some operand types and Intrinsics for others. For example, CTLZ. To enable this, switch from explicitly requiring Intrinsic on the class templates to using SDPatternOperator instead. llvm-svn: 159974	2012-07-10 00:51:13 +00:00
Jim Grosbach	700068206f	Allow intrinsics to be used in place of node matchables. TableGen has support for using an intrinics name directly in a DAG, but this breaks down when referring to just a node, as that's handled initializer list stuff entirely via subclassing in the parser. That is, using an instrinsic like "(int_my_intrinsic ...)" works fine. Using it standalone for parameterizing the operator in such a DAG does not. Fixing this is simple enough, as we simply declare Intrinsic as deriving from SDPatternOperator, which is the class name intended for exactly this purpose in TargetSelectionDAG.td. When the intrinsic is actually used in the DAG pattern, it will be recognized and expanded to an intrinsic_wo_chain (et. al.) just like when it's used directly. Incoming ARM NEON cleanup based on this and a bit of functionality improvement after that. llvm-svn: 159973	2012-07-10 00:51:11 +00:00
Akira Hatanaka	efff7b763b	Make register Mips::RA allocatable if not in mips16 mode. llvm-svn: 159971	2012-07-10 00:19:06 +00:00
Dan Gohman	3d1512384f	Delete code for folding undefs in ScalarEvolution. It's invalid in obscure ways, and it isn't actually important in the real world. llvm-svn: 159969	2012-07-09 23:51:20 +00:00
Chad Rosier	aeed158f75	Revert r159938 (and r159945) to appease the buildbots. llvm-svn: 159960	2012-07-09 20:43:34 +00:00
Andrew Trick	fb982ddeda	Machine model: allow itineraries to be shared by different processor models. llvm-svn: 159959	2012-07-09 20:43:03 +00:00
Andrew Trick	c50f06487c	indentation llvm-svn: 159958	2012-07-09 20:43:01 +00:00
Owen Anderson	d4b841f8f9	Teach the DAG combiner to turn sitofp/uitofp from i1 into a conditional move, since there are only two possible values. Previously, this would become an integer extension operation, followed by a real integer->float conversion. llvm-svn: 159957	2012-07-09 20:31:12 +00:00
Manman Ren	5f6fa428fa	X86: implement functions to analyze & synthesize CMOV\|SET\|Jcc getCondFromSETOpc, getCondFromCMovOpc, getSETFromCond, getCMovFromCond No functional change intended. If we want to update the condition code of CMOV\|SET\|Jcc, we first analyze the opcode to get the condition code, then update the condition code, finally synthesize the new opcode form the new condition code. llvm-svn: 159955	2012-07-09 18:57:12 +00:00
Akira Hatanaka	9bf2b5677d	Reapply r158846. Access mips register classes via MCRegisterInfo's functions instead of via the TargetRegisterClasses defined in MipsGenRegisterInfo.inc. llvm-svn: 159953	2012-07-09 18:46:47 +00:00
Nuno Lopes	95cc4f3cb5	instcombine: merge the functions that remove dead allocas and dead mallocs/callocs/... This patch removes ~70 lines in InstCombineLoadStoreAlloca.cpp and makes both functions a bit more aggressive than before :) In theory, we can be more aggressive when removing an alloca than a malloc, because an alloca pointer should never escape, but we are not taking advantage of this anyway llvm-svn: 159952	2012-07-09 18:38:20 +00:00
Richard Barton	984d0ba6b6	Some formatting to keep Clang happy llvm-svn: 159948	2012-07-09 18:30:56 +00:00
Richard Barton	5beef2d242	Oops - correct broken disassembly for VMOV llvm-svn: 159945	2012-07-09 18:20:02 +00:00
Richard Barton	c9e1c94fae	Fix instruction description of VMOV (between two ARM core registers and two single-precision resiters) llvm-svn: 159938	2012-07-09 16:41:33 +00:00
Richard Barton	35aceb86fe	Prevent ARM assembler from losing a right shift by #32 applied to a register llvm-svn: 159937	2012-07-09 16:31:14 +00:00
Richard Barton	d56603722e	Spelling! llvm-svn: 159936	2012-07-09 16:14:28 +00:00
Richard Barton	a39625ecc6	Teach the assembler to use the narrow thumb encodings of various three-register dp instructions where permissable. llvm-svn: 159935	2012-07-09 16:12:24 +00:00
Benjamin Kramer	a5e136b613	Remove some trivial copy ctors so the classes become trivially copyable and get the optimized SmallVector implementation. llvm-svn: 159916	2012-07-08 19:47:51 +00:00
Benjamin Kramer	c810a68923	SmallVector: Make use of move semantics to speed up moving objects in erase() and insert() llvm-svn: 159914	2012-07-08 12:06:35 +00:00
Andrew Trick	87255e340e	I'm introducing a new machine model to simultaneously allow simple subtarget CPU descriptions and support new features of MachineScheduler. MachineModel has three categories of data: 1) Basic properties for coarse grained instruction cost model. 2) Scheduler Read/Write resources for simple per-opcode and operand cost model (TBD). 3) Instruction itineraties for detailed per-cycle reservation tables. These will all live side-by-side. Any subtarget can use any combination of them. Instruction itineraries will not change in the near term. In the long run, I expect them to only be relevant for in-order VLIW machines that have complex contraints and require a precise scheduling/bundling model. Once itineraries are only actively used by VLIW-ish targets, they could be replaced by something more appropriate for those targets. This tablegen backend rewrite sets things up for introducing MachineModel type #2: per opcode/operand cost model. llvm-svn: 159891	2012-07-07 04:00:00 +00:00
Andrew Trick	91118a6155	whitespace llvm-svn: 159890	2012-07-07 03:59:51 +00:00
Andrew Trick	030e2f8f1a	Tweak spelling. llvm-svn: 159889	2012-07-07 03:59:48 +00:00
Manman Ren	bb36074047	X86: Fix optimizeCompare to correctly check safe condition. It is safe if EFLAGS is killed or re-defined. When we are done with the basic block, check whether EFLAGS is live-out. Do not optimize away cmp if EFLAGS is live-out. llvm-svn: 159888	2012-07-07 03:34:46 +00:00
NAKAMURA Takumi	80eb0c502d	LLVMConfig.cmake.in: Quote around @LLVM_INSTALL_PREFIX@, or it would not accept whitespace paths. Thanks to Kai. llvm-svn: 159887	2012-07-07 03:12:28 +00:00
Bill Wendling	786de35fa0	Use the DebugInfo wrappers instead of mucking about with the MDNode directly. llvm-svn: 159881	2012-07-07 00:52:35 +00:00
Bill Wendling	56543735c9	Print the name last. llvm-svn: 159879	2012-07-06 23:43:12 +00:00
Chad Rosier	73b02825d0	Fix the naming of ensureAlignment. Per the coding standard function names should be camel case, and start with a lower case letter. llvm-svn: 159877	2012-07-06 23:13:38 +00:00
Nuno Lopes	fa0dffccee	teach instcombine to remove allocated buffers even if there are stores, memcpy/memmove/memset, and objectsize users. This means we can do cheap DSE for heap memory. Nothing is done if the pointer excapes or has a load. The churn in the tests is mostly due to objectsize, since we want to make sure we don't delete the malloc call before evaluating the objectsize (otherwise it becomes -1/0) llvm-svn: 159876	2012-07-06 23:09:25 +00:00
Dmitri Gribenko	d01af8772d	Since SmallMap was removed in r158644, remove documentation in ProgrammersManual.html. llvm-svn: 159874	2012-07-06 23:06:47 +00:00
Bill Wendling	3270582ceb	Check if it's a scope last, because several things are scopes. llvm-svn: 159873	2012-07-06 23:06:16 +00:00
Jim Grosbach	09487775d3	ARM: Add test cleanup entry to the README. llvm-svn: 159864	2012-07-06 21:52:04 +00:00
Akira Hatanaka	b577ff116d	revert r159851. llvm-svn: 159854	2012-07-06 20:16:48 +00:00
Akira Hatanaka	cfa35fa0ff	Reapply r158846. Include file MipsGenRegisterInfo.inc. llvm-svn: 159851	2012-07-06 19:29:11 +00:00
Bill Wendling	aa02e36fa8	Add a print method to the ObjC property object. llvm-svn: 159848	2012-07-06 19:12:31 +00:00
Bill Wendling	5ef3159820	Remove trailing comma in array initialization list. llvm-svn: 159843	2012-07-06 17:49:19 +00:00
Bill Wendling	7154c43eff	Remove unnecessary 'llvm::'. llvm-svn: 159842	2012-07-06 17:47:36 +00:00
Bill Wendling	16d944ce11	Remove unnecessary 'llvm::'. llvm-svn: 159841	2012-07-06 17:46:28 +00:00
Chad Rosier	879c34f45a	Whitespace. llvm-svn: 159839	2012-07-06 17:44:22 +00:00
Manman Ren	c965673707	X86: peephole optimization to remove cmp instruction For each Cmp, we check whether there is an earlier Sub which make Cmp redundant. We handle the case where SUB operates on the same source operands as Cmp, including the case where the two source operands are swapped. llvm-svn: 159838	2012-07-06 17:36:20 +00:00
Chad Rosier	88d53eae56	[fast-isel] Tell fast-isel to do nothing with the new donothing intrinsic. llvm-svn: 159837	2012-07-06 17:33:39 +00:00
Chad Rosier	e3a87b1511	Update getFunction parameter documentation. Fixes PR13268. llvm-svn: 159835	2012-07-06 17:15:03 +00:00
Dmitri Gribenko	aa4f47f266	Revert r159789. llvm-svn: 159834	2012-07-06 16:42:25 +00:00
NAKAMURA Takumi	b8c7dada33	llvm/include/llvm/CMakeLists.txt: Cut dependency to intrinsics_gen. llvm-svn: 159831	2012-07-06 15:55:39 +00:00
Duncan Sands	c65aa3f6ae	Attempt to fix windows buildbots. Patch by James Benton. llvm-svn: 159826	2012-07-06 14:43:16 +00:00
NAKAMURA Takumi	4f934676fb	test/CodeGen/X86/sext-setcc-self.ll: Mark it as XFAIL: cygwin,mingw32,win32. Investigating. llvm-svn: 159820	2012-07-06 12:12:39 +00:00
NAKAMURA Takumi	0246724cd6	Revert r159804, "[arm-fast-isel] Add support for vararg function calls." It broke LLVM :: CodeGen/Thumb2/large-call.ll on several hosts. llvm-svn: 159817	2012-07-06 11:12:44 +00:00
Alexey Samsonov	39602781f6	Fix PR13202 and a regtest. DwarfDebug class could generate the same (inlined) DIVariable twice: 1) when trying to find abstract debug variable for a concrete inlined instance. 2) when explicitly collecting info for variables that were optimized out. This change makes sure that this duplication won't happen and makes Clang pass "gdb.opt/inline-locals" test from gdb testsuite. Reviewed by Eric Christopher. llvm-svn: 159811	2012-07-06 08:45:08 +00:00
Bill Wendling	fab09c66f3	Sphinxify the CMake document. llvm-svn: 159806	2012-07-06 05:51:50 +00:00
Jush Lu	5e6e6264f4	[arm-fast-isel] Add support for vararg function calls. llvm-svn: 159804	2012-07-06 03:02:37 +00:00
Jack Carter	2ab73b13a5	Changes per review of commit 159787 Mips specific inline asm operand modifier D. Comment changes and predicate change. llvm-svn: 159802	2012-07-06 02:44:22 +00:00
Eric Christopher	174266960e	Untabify and move a function near similar functions dealing with struct types. llvm-svn: 159801	2012-07-06 02:35:57 +00:00
Jakob Stoklund Olesen	3f1bb93cab	Add some comments suggested in code review. llvm-svn: 159800	2012-07-06 02:31:22 +00:00
Dmitri Gribenko	d5200f1bc4	Enable new[] on llvm::BumpPtrAllocator. llvm-svn: 159789	2012-07-06 00:25:39 +00:00
Jack Carter	b2af512cef	Mips specific inline asm operand modifier D. Print the second half of a double word operand. The include list was cleaned up a bit as well. Also the test case was modified to test for both big and little patterns. llvm-svn: 159787	2012-07-05 23:58:21 +00:00
Owen Anderson	00da236f7e	Fix an overzealous assertion. It is legitimate for a target to have multiple fixups on a single instruction that target the same byte, so long as their bit-offsets are coordinates appropriately. llvm-svn: 159785	2012-07-05 22:30:42 +00:00
Akira Hatanaka	bbf374c4c6	test case for r159770. llvm-svn: 159771	2012-07-05 19:29:31 +00:00
Akira Hatanaka	7d33c78e3b	Enclose instruction rdhwr with directives, which are needed when target is mips32 rev1 (the directives are emitted when target is mips32r2 too). llvm-svn: 159770	2012-07-05 19:26:38 +00:00
Akira Hatanaka	d359075e43	Enable target dependent directive parsing to hook before standard parser in AsmParser::ParseStatement. Patch by Vladimir Medic. llvm-svn: 159768	2012-07-05 19:09:33 +00:00

... 7 8 9 10 11 ...

84284 Commits