llvm-project

Commit Graph

Author	SHA1	Message	Date
Benjamin Kramer	82d1c371e2	X86: Match pmin/pmax as a target specific dag combine. This occurs during vectorization. Part of PR14667. llvm-svn: 170908	2012-12-21 17:46:58 +00:00
Roman Divacky	a229186a82	Remove duplicate includes. llvm-svn: 170902	2012-12-21 17:06:44 +00:00
Tom Stellard	a8b0351720	R600: Expand vec4 INT <-> FP conversions llvm-svn: 170901	2012-12-21 16:33:24 +00:00
Benjamin Kramer	4669d18893	X86: Match the SSE/AVX min/max vector ops using a custom node instead of intrinsics This is very mechanical, no functionality change. Preparation for PR14667. llvm-svn: 170898	2012-12-21 14:04:55 +00:00
Evgeniy Stepanov	4fbc0d08bf	[msan] Remove unreachable blocks before instrumenting a function. llvm-svn: 170883	2012-12-21 11:18:49 +00:00
Nadav Rotem	eacbb731d3	Add a missing "virtual" keyword. llvm-svn: 170842	2012-12-21 05:02:12 +00:00
Nadav Rotem	3b850b70b3	Enable if-conversion. llvm-svn: 170841	2012-12-21 04:47:54 +00:00
Quentin Colombet	b1b66e7a25	Add ARM cortex-r5 subtarget. llvm-svn: 170840	2012-12-21 04:35:05 +00:00
Rafael Espindola	73bf9fa7ba	Don't skip __DWARF, Now that we don't merge section and segment names, we don't need to skip the segment name to get to the section name. llvm-svn: 170839	2012-12-21 04:08:03 +00:00
Rafael Espindola	a9f810b6b5	Add a function to get the segment name of a section. On MachO, sections also have segment names. When a tool looking at a .o file prints a segment name, this is what they mean. In reality, a .o has only one anonymous, segment. This patch adds a MachO only function to fetch that segment name. I named it getSectionFinalSegmentName since the main use for the name seems to be inform the linker with segment this section should go to. The patch also changes MachOObjectFile::getSectionName to return just the section name instead of computing SegmentName,SectionName. The main difference from the previous patch is that it doesn't use InMemoryStruct. It is extremely dangerous: if the endians match it returns a pointer to the file buffer, if not, it returns a pointer to an internal buffer that is overwritten in the next API call. We should change all of this code to use support::detail::packed_endian_specific_integral like ELF, but since these functions only handle strings, they work with big and little endian machines as is. I have tested this by installing ubuntu 12.10 ppc on qemu, that is why it took so long :-) llvm-svn: 170838	2012-12-21 03:47:03 +00:00
Evan Cheng	59421aee3d	Add targets to skip running the GC passes. llvm-svn: 170836	2012-12-21 02:57:04 +00:00
Evan Cheng	99cafb1db2	Every pass deserves a name, even codegenprep. llvm-svn: 170831	2012-12-21 01:48:14 +00:00
Nadav Rotem	6d4fdd6d2c	Improve the X86 cost model for loads and stores. llvm-svn: 170830	2012-12-21 01:33:59 +00:00
Nadav Rotem	a4b53f20a3	BB-Vectorizer: Check the cost of the store pointer type and not the return type, which is void. A number of test cases fail after adding the assertion in TTImpl. llvm-svn: 170828	2012-12-21 01:24:36 +00:00
Reed Kotler	9bff1ead0e	Call llvm_unreachable instead of assert. llvm-svn: 170822	2012-12-21 00:44:59 +00:00
Nadav Rotem	e7785686a5	Fix a bug in the code that checks if we can vectorize loops while using dynamic memory bound checks. Before the fix we were able to vectorize this loop from the Livermore Loops benchmark: for ( k=1 ; k<n ; k++ ) x[k] = x[k-1] + y[k]; llvm-svn: 170811	2012-12-21 00:07:35 +00:00
Jakob Stoklund Olesen	2455b58551	Require the two-argument MI::addOperand(MF, MO) for dangling instructions. Instructions that are inserted in a basic block can still be decorated with addOperand(MO). Make the two-argument addOperand() function contain the actual implementation. This function will now always have a valid MF reference that it can use for memory allocation. llvm-svn: 170798	2012-12-20 22:54:05 +00:00
Jakob Stoklund Olesen	33f5d1492d	Add an MF argument to MI::copyImplicitOps(). This function is often used to decorate dangling instructions, so a context reference is required to allocate memory for the operands. Also add a corresponding MachineInstrBuilder method. llvm-svn: 170797	2012-12-20 22:54:02 +00:00
Jakob Stoklund Olesen	ac4210eacb	Use two-arg addOperand(MF, MO) internally in MachineInstr when possible. llvm-svn: 170796	2012-12-20 22:53:58 +00:00
Jakob Stoklund Olesen	2ea203694d	MachineInstrBuilderize ARM. llvm-svn: 170795	2012-12-20 22:53:55 +00:00
Jakob Stoklund Olesen	4255c96aed	MachineInstrBuilderize NVPTX. llvm-svn: 170794	2012-12-20 22:53:53 +00:00
Eli Bendersky	75a7a338fc	Fix an unitialized member variable that may have caused sporadic failures for code that wasn't even in bundling mode. llvm-svn: 170793	2012-12-20 22:51:52 +00:00
Eric Christopher	48fef599a4	Whitespace and 80-column cleanup. llvm-svn: 170771	2012-12-20 21:58:40 +00:00
Eric Christopher	e698f53740	Start splitting out the debug string section handling by moving it into the DwarfUnits class. llvm-svn: 170770	2012-12-20 21:58:36 +00:00
Bill Wendling	66e978f904	Some random comment, naming, and format changes. Rename the AttributeImpl* from Attrs to pImpl to be consistent with other code. Add comments where none were before. Or doxygen-ify other comments. llvm-svn: 170767	2012-12-20 21:28:43 +00:00
Jakob Stoklund Olesen	00b28ecfae	Remove two dead functions. llvm-svn: 170766	2012-12-20 21:12:42 +00:00
Bob Wilson	7bba4f8957	Revert "Adding support for llvm.arm.neon.vaddl[su].* and" This reverts r170694. The operations can be represented in IR without adding any new intrinsics. llvm-svn: 170765	2012-12-20 21:09:38 +00:00
Nadav Rotem	2ababf68d7	LoopVectorize: Fix a bug in the scalarization of instructions. Before if-conversion we could check if a value is loop invariant if it was declared inside the basic block. Now that loops have multiple blocks this check is incorrect. This fixes External/SPEC/CINT95/099_go/099_go llvm-svn: 170756	2012-12-20 20:24:40 +00:00
Evan Cheng	ddc0cb6dc5	On some ARM cpus, flags setting movs with shifter operand, i.e. lsl, lsr, asr, are more expensive than the non-flag setting variant. Teach thumb2 size reduction pass to avoid generating them unless we are optimizing for size. rdar://12892707 llvm-svn: 170728	2012-12-20 19:59:30 +00:00
Eli Bendersky	f483ff9204	Aligned bundling support. Following the discussion here: http://lists.cs.uiuc.edu/pipermail/llvmdev/2012-December/056754.html The proposal and implementation are fully documented here: https://sites.google.com/a/chromium.org/dev/nativeclient/pnacl/aligned-bundling-support-in-llvm Tests will follow shortly. llvm-svn: 170718	2012-12-20 19:05:53 +00:00
Jakob Stoklund Olesen	2705333253	Use MachineInstrBuilder for PHI nodes in SelectionDAGISel. llvm-svn: 170716	2012-12-20 18:46:29 +00:00
Jakob Stoklund Olesen	b109a7b430	Use MachineInstrBuilder in InstrEmitter. This is supposed to be a mechanical change with no functional effects. InstrEmitter can generate all types of MachineOperands which revealed that MachineInstrBuilder was missing a few methods, added by this patch. Besides providing a context pointer to MI::addOperand(), MachineInstrBuilder seems like a better fit for this code. llvm-svn: 170712	2012-12-20 18:08:09 +00:00
Jakob Stoklund Olesen	f623e9870d	Use MachineInstrBuilder in a few CodeGen passes. This automatically passes a context pointer to MI->addOperand(). llvm-svn: 170711	2012-12-20 18:08:06 +00:00
Nadav Rotem	8b20c0a814	Loop Vectorizer: turn-off if-conversion. llvm-svn: 170708	2012-12-20 17:42:53 +00:00
James Molloy	4f6fb953a7	Add a new attribute, 'noduplicate'. If a function contains a noduplicate call, the call cannot be duplicated - Jump threading, loop unrolling, loop unswitching, and loop rotation are inhibited if they would duplicate the call. Similarly inlining of the function is inhibited, if that would duplicate the call (in particular inlining is still allowed when there is only one callsite and the function has internal linkage). llvm-svn: 170704	2012-12-20 16:04:27 +00:00
Roman Divacky	ff95a1dc12	Remove MCTargetAsmLexer and its derived classes now that edis, its only user, is gone. llvm-svn: 170699	2012-12-20 14:43:30 +00:00
Renato Golin	6b2ea4a48f	Adding support for llvm.arm.neon.vaddl[su].* and llvm.arm.neon.vsub[su].* intrinsics. Patch by Pete Couperus <pjcoup@gmail.com> llvm-svn: 170694	2012-12-20 13:52:11 +00:00
Craig Topper	ae48cb2e5a	Formatting fixes. Remove some unnecessary 'else' after 'return'. No functional change. llvm-svn: 170676	2012-12-20 07:15:54 +00:00
Craig Topper	9d4171afed	Removing trailing whitespace llvm-svn: 170675	2012-12-20 07:09:41 +00:00
Reed Kotler	d11acc7dc0	Implement cfi_def_cfa_offset. "Make check" test case for this comming in the next few days but it's already tested a lot from test-suite and works fine. This patch completes almost 100% pass of test-suite for mips 16. llvm-svn: 170674	2012-12-20 06:59:37 +00:00
Reed Kotler	8965d24a2a	There is one more patch to finish large frames. Make sure we assert on code that has large frames which will not yet compile correctly. llvm-svn: 170673	2012-12-20 06:57:00 +00:00
Jyotsna Verma	56605448f2	Add constant extender support to GP-relative load/store instructions. llvm-svn: 170672	2012-12-20 06:52:46 +00:00
Jyotsna Verma	bf75aaf53e	Add TSFlags to ALU32 type instructions for constant-extender/Relationship maps. llvm-svn: 170671	2012-12-20 06:45:39 +00:00
Reed Kotler	7bff8f1d7a	set register class properly for mips16 here llvm-svn: 170669	2012-12-20 06:06:35 +00:00
Rafael Espindola	fb8ac2df09	Undefine PPC harder. This was causing a build failure while trying to build on ppc ubuntu 12.10 with cmake. llvm-svn: 170668	2012-12-20 05:13:09 +00:00
Reed Kotler	92fc33bc97	This assert is overly restrictive and does not work for mips16. llvm-svn: 170667	2012-12-20 05:09:15 +00:00
Reed Kotler	fd633229f7	Turn on register scavenger for Mips 16 We use an unused Mips 32 register for the emergency slot instead of using the stack. llvm-svn: 170665	2012-12-20 04:44:58 +00:00
Akira Hatanaka	e7f1acc7c0	[mips] Refactor SLT (set on less than) instructions. Separate encoding information from the rest. llvm-svn: 170664	2012-12-20 04:27:52 +00:00
Akira Hatanaka	bbd197e9c4	[mips] Refactor unconditional branch instruction. Separate encoding information from the rest. llvm-svn: 170663	2012-12-20 04:22:39 +00:00
Akira Hatanaka	b1527b7505	[mips] Remove asm string parameter from pseudo instructions. Add InstrItinClass parameter. llvm-svn: 170661	2012-12-20 04:20:09 +00:00
Akira Hatanaka	14f9ce0f83	[mips] Delete definition of CPRESTORE instruction. llvm-svn: 170660	2012-12-20 04:15:30 +00:00
Akira Hatanaka	c0ea0bb99b	[mips] Refactor conditional branch instructions with one register operand. Separate encoding information from the rest. llvm-svn: 170659	2012-12-20 04:13:23 +00:00
Richard Smith	4a8e454ab2	Don't use isa<CallInst>(this) in the constructor for CallInst's base class. This has undefined behavior, because the classof implementation attempts to access parts of the not-yet-constructed derived class. Found by clang -fsanitize=vptr. llvm-svn: 170658	2012-12-20 04:11:02 +00:00
Akira Hatanaka	f71ffd29d9	[mips] Refactor conditional branch instructions with two register operands. Separate encoding information from the rest. llvm-svn: 170657	2012-12-20 04:10:13 +00:00
Reed Kotler	d019dbf75e	fix most of remaining issues with large frames. these patches are tested a lot by test-suite but make check tests are forthcoming once the next few patches that complete this are committed. with the next few patches the pass rate for mips16 is near 100% llvm-svn: 170656	2012-12-20 04:07:42 +00:00
Akira Hatanaka	f423672117	[mips] Use "or $r0, $r1, $zero" instead of "addu $r0, $zero, $r1" to copy physical register $r1 to $r0. GNU disassembler recognizes an "or" instruction as a "move", and this change makes the disassembled code easier to read. Original patch by Reed Kotler. llvm-svn: 170655	2012-12-20 04:06:06 +00:00
Richard Smith	15b1e3727b	Fix use-before-construction of X86TargetLowering. llvm-svn: 170654	2012-12-20 04:04:17 +00:00
Richard Smith	e7701ebfec	Don't use -1 as a value of an unsigned 7-bit enumeration; that has undefined behavior and violates the !range constraints we put on loads of this enum. Found by clang -fsanitize=enum. llvm-svn: 170653	2012-12-20 04:02:58 +00:00
Akira Hatanaka	7d75f9e3d3	[mips] Change the order of template parameters. Move the default parameters to the end. llvm-svn: 170651	2012-12-20 03:52:08 +00:00
Akira Hatanaka	244f9e874c	[mips] Refactor shift instructions with register operands. Separate encoding information from the rest. llvm-svn: 170650	2012-12-20 03:48:24 +00:00
Akira Hatanaka	7f96ad325f	[mips] Refactor shift immediate instructions. Separate encoding information from the rest. llvm-svn: 170649	2012-12-20 03:44:41 +00:00
Akira Hatanaka	ab1b715bf2	[mips] Refactor arithmetic and logic instructions with immediate operands. Separate encoding information from the rest. llvm-svn: 170648	2012-12-20 03:40:03 +00:00
Akira Hatanaka	1b37c4af01	[mips] Refactor arithmetic and logic instructions. Separate encoding information from the rest. llvm-svn: 170647	2012-12-20 03:34:05 +00:00
Akira Hatanaka	73495897b1	[mips] Delete ArithOverflowR and ArithOverflow and use ArithLogicR and ArithLogicI as the instruction base classes. llvm-svn: 170642	2012-12-20 03:00:16 +00:00
Nadav Rotem	7bdc45b570	Loop Vectorizer: Enable if-conversion. llvm-svn: 170632	2012-12-20 02:00:02 +00:00
Bill Wendling	4607f4bdad	s/AttributesImpl/AttributeImpl/g This is going to apply to Attribute, not Attributes. llvm-svn: 170631	2012-12-20 01:36:59 +00:00
Bob Wilson	3365b80290	Do not introduce vector operations in functions marked with noimplicitfloat. <rdar://problem/12879313> llvm-svn: 170630	2012-12-20 01:36:20 +00:00
Nadav Rotem	28408a20c9	whitespace llvm-svn: 170626	2012-12-20 00:49:56 +00:00
NAKAMURA Takumi	2a0b40f584	Target/R600: Update MIB according to r170588. llvm-svn: 170620	2012-12-20 00:22:11 +00:00
Bill Wendling	6ad6c3b1c2	Add a context so that once we uniquify strings we can access them easily. llvm-svn: 170615	2012-12-19 23:55:43 +00:00
Jim Grosbach	6df94846ec	MC: Add MCInstrDesc::mayAffectControlFlow() method. MC disassembler clients (LLDB) are interested in querying if an instruction may affect control flow other than by virtue of being an explicit branch instruction. For example, instructions which write directly to the PC on some architectures. llvm-svn: 170610	2012-12-19 23:38:53 +00:00
Michael Ilseman	b99f80dea7	Refactor isIntrinsic() to be quicker, and change classof() (and thus, isa<IntrinsicInst>()) to use it. This decreases the number of occurrences of the slow-path string matching performed by getIntrinsicID(). llvm-svn: 170602	2012-12-19 23:17:20 +00:00
Bill Wendling	6848e38daf	s/AttributeListImpl/AttributeSetImpl/g to match the namechange of AttributeList. llvm-svn: 170600	2012-12-19 22:42:22 +00:00
Dmitri Gribenko	349d1a35ff	Add a missing 'else'. Found by grep '} if' No testcase because it is apparently not so trivial to construct. llvm-svn: 170595	2012-12-19 22:13:01 +00:00
Tom Stellard	1c315d5411	R600: Remove unecessary VREG alignment. Unlike SGPRs VGPRs doesn't need to be aligned. Patch by: Christian König Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Tested-by: Michel Dänzer <michel.daenzer@amd.com> Signed-off-by: Christian König <deathsimple@vodafone.de> llvm-svn: 170593	2012-12-19 22:10:34 +00:00
Tom Stellard	e7b907d85c	R600: control flow optimization Branch if we have enough instructions so that it makes sense. Also remove branches if they don't make sense. Patch by: Christian König Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Tested-by: Michel Dänzer <michel.daenzer@amd.com> Signed-off-by: Christian König <deathsimple@vodafone.de> llvm-svn: 170592	2012-12-19 22:10:33 +00:00
Tom Stellard	f8794354b2	R600: New control flow for SI v2 This patch replaces the control flow handling with a new pass which structurize the graph before transforming it to machine instruction. This has a couple of different advantages and currently fixes 20 piglit tests without a single regression. It is now a general purpose transformation that could be not only be used for SI/R6xx, but also for other hardware implementations that use a form of structurized control flow. v2: further cleanup, fixes and documentation Patch by: Christian König Signed-off-by: Christian König <deathsimple@vodafone.de> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Tested-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 170591	2012-12-19 22:10:31 +00:00
Eric Christopher	3c5a1914b6	Split out abbreviations for the skeleton info from the rest of the abbreviations. Part of implementing split dwarf. llvm-svn: 170589	2012-12-19 22:02:53 +00:00
Jakob Stoklund Olesen	b159b5ff0d	Remove the explicit MachineInstrBuilder(MI) constructor. Use the version that also takes an MF reference instead. It would technically be possible to extract an MF reference from the MI as MI->getParent()->getParent(), but that would not work for MIs that are not inserted into any basic block. Given the reasonably small number of places this constructor was used at all, I preferred the compile time check to a run time assertion. llvm-svn: 170588	2012-12-19 21:31:56 +00:00
Nadav Rotem	11350aafb4	Fix a bug that was found by building clang with -fsanitize. I introduced it in r166785. PR14291. If TD is unavailable use getScalarSizeInBits, but don't optimize pointers or vectors of pointers. llvm-svn: 170586	2012-12-19 20:47:04 +00:00
Evan Cheng	eae6d2ccea	LLVM sdisel normalize bit extraction of the form: ((x & 0xff00) >> 8) << 2 to (x >> 6) & 0x3fc This is general goodness since it folds a left shift into the mask. However, the trailing zeros in the mask prevents the ARM backend from using the bit extraction instructions. And worse since the mask materialization may require an addition instruction. This comes up fairly frequently when the result of the bit twiddling is used as memory address. e.g. = ptr[(x & 0xFF0000) >> 16] We want to generate: ubfx r3, r1, #16, #8 ldr.w r3, [r0, r3, lsl #2] vs. mov.w r9, #1020 and.w r2, r9, r1, lsr #14 ldr r2, [r0, r2] Add a late ARM specific isel optimization to ARMDAGToDAGISel::PreprocessISelDAG(). It folds the left shift to the 'base + offset' address computation; change the mask to one which doesn't have trailing zeros and enable the use of ubfx. Note the optimization has to be done late since it's target specific and we don't want to change the DAG normalization. It's also fairly restrictive as shifter operands are not always free. It's only done for lsh 1 / 2. It's known to be free on some cpus and they are most common for address computation. This is a slight win for blowfish, rijndael, etc. rdar://12870177 llvm-svn: 170581	2012-12-19 20:16:09 +00:00
Roman Divacky	e3d323052f	Remove edis - the enhanced disassembler. Fixes PR14654. llvm-svn: 170578	2012-12-19 19:55:47 +00:00
Paul Redmond	5917f4c715	Transform (x&C)>V into (x&C)!=0 where possible When the least bit of C is greater than V, (x&C) must be greater than V if it is not zero, so the comparison can be simplified. Although this was suggested in Target/X86/README.txt, it benefits any architecture with a directly testable form of AND. Patch by Kevin Schoedel llvm-svn: 170576	2012-12-19 19:47:13 +00:00
Benjamin Kramer	c5071466d4	PowerPC: Expand VSELECT nodes. There's probably a better expansion for those nodes than the default for altivec, but this is better than crashing. VSELECTs occur in loop vectorizer output. llvm-svn: 170551	2012-12-19 15:49:14 +00:00
Patrik Hagglund	f9934613e8	Change AsmOperandInfo::ConstraintVT to MVT, instead of EVT. Accordingly, add MVT::getVT. llvm-svn: 170550	2012-12-19 15:19:11 +00:00
Rafael Espindola	0f00de40dd	Revert 170545 while I debug the ppc failures. llvm-svn: 170547	2012-12-19 14:48:05 +00:00
Rafael Espindola	aa7b27801c	Add r170095 back. I cannot reproduce it the failures locally, so I will keep an eye at the ppc bots. This patch does add the change to the "Disassembly of section" message, but that is not what was failing on the bots. Original message: Add a funciton to get the segment name of a section. On MachO, sections also have segment names. When a tool looking at a .o file prints a segment name, this is what they mean. In reality, a .o has only one anonymous, segment. This patch adds a MachO only function to fetch that segment name. I named it getSectionFinalSegmentName since the main use for the name seems to be infor the linker with segment this section should go to. The patch also changes MachOObjectFile::getSectionName to return just the section name instead of computing SegmentName,SectionName. llvm-svn: 170545	2012-12-19 14:15:04 +00:00
Evgeniy Stepanov	abeae5c7d5	[msan] Add track-origins argument to the pass constructor. llvm-svn: 170544	2012-12-19 13:55:51 +00:00
Patrik Hagglund	00e7a11904	Split the usage of 'EVT PartVT' into 'MVT PartVT' and 'EVT PartEVT'. llvm-svn: 170540	2012-12-19 12:33:30 +00:00
Patrik Hagglund	4e0f828686	Change RegVT in BitTestBlock and RegsForValue, to contain MVTs, instead of EVTs. llvm-svn: 170538	2012-12-19 12:23:01 +00:00
Patrik Hagglund	e09cac9a67	Change TargetLowering::getTypeForExtArgOrReturn to take and return MVTs, instead of EVTs. llvm-svn: 170537	2012-12-19 12:02:25 +00:00
Patrik Hagglund	3f1905199b	Change a parameter of TargetLowering::getVectorTypeBreakdown to MVT, from EVT. llvm-svn: 170536	2012-12-19 11:53:21 +00:00
Patrik Hagglund	bad545ccba	Change TargetLowering::RegisterTypeForVT to contain MVTs, instead of EVTs. llvm-svn: 170535	2012-12-19 11:48:16 +00:00
Patrik Hagglund	93060569ba	Change TargetLowering::TransformToType to contain MVTs, instead of EVTs. llvm-svn: 170534	2012-12-19 11:42:00 +00:00
Patrik Hagglund	f9eb168ef4	Change TargetLowering::findRepresentativeClass to take an MVT, instead of EVT. llvm-svn: 170532	2012-12-19 11:30:36 +00:00
Evgeniy Stepanov	d7571cd4bc	[msan] Heuristically instrument unknown intrinsics. This changes adds shadow and origin propagation for unknown intrinsics by examining the arguments and ModRef behaviour. For now, only 3 classes of intrinsics are handled: - those that look like simple SIMD store - those that look like simple SIMD load - those that don't have memory effects and look like arithmetic/logic/whatever operation on simple types. llvm-svn: 170530	2012-12-19 11:22:04 +00:00
Patrik Hagglund	fd41b5b969	Change TargetLowering::getTypeToPromoteTo to take and return MVTs, instead of EVTs. llvm-svn: 170529	2012-12-19 11:21:04 +00:00
Benjamin Kramer	e300004bd5	LoopVectorize: Make iteration over induction variables not depend on pointer values. MapVector is a bit heavyweight, but I don't see a simpler way. Also the InductionList is unlikely to be large. This should help 3-stage selfhost compares (PR14647). llvm-svn: 170528	2012-12-19 11:09:15 +00:00
Patrik Hagglund	ffd057a3e1	Change TargetLowering::isCondCodeLegal to take an MVT, instead of EVT. llvm-svn: 170524	2012-12-19 10:19:55 +00:00
NAKAMURA Takumi	89209462fe	X86ISelLowering.cpp: Fix warnings. [-Wlogical-op-parentheses] llvm-svn: 170523	2012-12-19 10:12:48 +00:00
Patrik Hagglund	deee9003ed	Change TargetLowering::getCondCodeAction to take an MVT, instead of EVT. llvm-svn: 170522	2012-12-19 10:09:26 +00:00
Bill Wendling	a87cdc27d9	Inline hasFunctionOnlyAttrs into its only use. llvm-svn: 170518	2012-12-19 09:15:11 +00:00
Bill Wendling	e9506a211f	Inline the only use of the hasParameterOnlyAttrs method. llvm-svn: 170517	2012-12-19 09:04:58 +00:00
Bill Wendling	d97b75d816	Inline the 'hasIncompatibleWithVarArgsAttrs' method into its only uses. And some minor comment reformatting. llvm-svn: 170516	2012-12-19 08:57:40 +00:00
Patrik Hagglund	d7cdcf8cb5	Change TargetLowering::getTruncStoreAction to take MVTs, instead of EVTs. llvm-svn: 170510	2012-12-19 08:28:51 +00:00
Elena Demikhovsky	14a4af0e66	Optimized load + SIGN_EXTEND patterns in the X86 backend. llvm-svn: 170506	2012-12-19 07:50:20 +00:00
Nadav Rotem	33360d8ae9	After reducing the size of an operation in the DAG we zero-extend the reduced bitwidth op back to the original size. If we reduce ANDs then this can cause an endless loop. This patch changes the ZEXT to ANY_EXTEND if the demanded bits are equal or smaller than the size of the reduced operation. llvm-svn: 170505	2012-12-19 07:39:08 +00:00
Bill Wendling	3d7b0b8ac7	Rename the 'Attributes' class to 'Attribute'. It's going to represent a single attribute in the future. llvm-svn: 170502	2012-12-19 07:18:57 +00:00
Craig Topper	3f194c8f4f	Remove more of 'else's after 'returns'. No functional change. llvm-svn: 170497	2012-12-19 06:43:58 +00:00
Craig Topper	5dd8291cbe	Remove a bunch of 'else's after 'returns' llvm-svn: 170496	2012-12-19 06:39:17 +00:00
Craig Topper	63f5921776	Teach SimplifySetCC that comparing AssertZext i1 against a constant 1 can be rewritten as a compare against a constant 0 with the opposite condition. llvm-svn: 170495	2012-12-19 06:12:28 +00:00
Reed Kotler	3aad762d1d	Add some missing Defs and Uses. llvm-svn: 170493	2012-12-19 04:06:15 +00:00
Shuxin Yang	5b841c4a64	Make sure the buffer, which containas an instance of APFloat, has proper alignment. llvm-svn: 170486	2012-12-19 01:10:17 +00:00
Kevin Enderby	85cf531593	Add to the disassembler C API an option to print the disassembled instructions in the assembly code variant if one exists. The intended use for this is so tools like lldb and darwin's otool(1) can be switched to print Intel-flavored disassembly. I discussed extensively this API with Jim Grosbach and we feel while it may not be fully general, in reality there is only one syntax for each assembly with the exception of X86 which has exactly two for historical reasons. rdar://10989182 llvm-svn: 170477	2012-12-18 23:47:28 +00:00
Jakob Stoklund Olesen	d742533dbc	Use bidirectional bundle flags to simplify important functions. The bundle_iterator::operator++ function now doesn't need to dig out the basic block and check against end(). It can use the isBundledWithSucc() flag to find the last bundled instruction safely. Similarly, MachineInstr::isBundled() no longer needs to look at iterators etc. It only has to look at flags. llvm-svn: 170473	2012-12-18 23:21:49 +00:00
Shuxin Yang	37a1efe1c6	rdar://12801297 InstCombine for unsafe floating-point add/sub. llvm-svn: 170471	2012-12-18 23:10:12 +00:00
Nadav Rotem	9aee065e3c	Enable the loop vectorizer in clang and not in the pass manager, so that we can disable it in clang. llvm-svn: 170470	2012-12-18 23:09:44 +00:00
Jakob Stoklund Olesen	00f6c7754b	Verify bundle flag consistency when setting them. Now that the bundle flag aware APIs are all in place, it is possible to continuously verify the flag consistency. llvm-svn: 170465	2012-12-18 23:00:28 +00:00
Jakub Staszak	338863a546	Reverse order of checking SSE level when calculating compare cost, so we check AVX2 before AVX. llvm-svn: 170464	2012-12-18 22:57:56 +00:00
Jakob Stoklund Olesen	29c277197e	Verify bundle flags for consistency in MachineVerifier. The new bidirectional bundle flags are redundant, so inadvertent bundle tearing can be detected in the machine code verifier. llvm-svn: 170463	2012-12-18 22:55:07 +00:00
Quentin Colombet	23b404d5ad	Disable ARM partial flag dependency optimization at -Oz To not over constrain the scheduler for ARM in thumb mode, some optimizations for code size reduction, specific to ARM thumb, are blocked when they add a dependency (like write after read dependency). Disables this check when code size is the priority, i.e., code is compiled with -Oz. llvm-svn: 170462	2012-12-18 22:47:16 +00:00
Jakob Stoklund Olesen	a33f504b3e	Don't allow the automatically updated MI flags to be set directly. The bundle-related MI flags need to be kept in sync with the neighboring instructions. Don't allow the bulk flag-setting setFlags() function to change them. Also don't copy MI flags when cloning an instruction. The clone's bundle flags will be set when it is explicitly inserted into a bundle. llvm-svn: 170459	2012-12-18 21:36:05 +00:00
Jakob Stoklund Olesen	78eaf05fa7	Tighten up the splice() API for bundled instructions. Remove the instr_iterator versions of the splice() functions. It doesn't seem useful to be able to splice sequences of instructions that don't consist of full bundles. The normal splice functions that take MBB::iterator arguments are not changed, and they can move whole bundles around without any problems. llvm-svn: 170456	2012-12-18 20:59:41 +00:00
Andrew Trick	ec2564818c	MISched: add dependence to ExitSU to model live-out latency. llvm-svn: 170454	2012-12-18 20:53:01 +00:00
Andrew Trick	ef23569858	MISched: Cleanup, redundant statement. llvm-svn: 170453	2012-12-18 20:52:58 +00:00
Andrew Trick	d6d5ad3d7b	MISched: Heuristics, compare latency more precisely. It matters more for some targets. llvm-svn: 170452	2012-12-18 20:52:56 +00:00
Andrew Trick	44f54d97a4	MISched: Remove SchedRemainder::IsResourceLimited. I don't know how to compute it. llvm-svn: 170451	2012-12-18 20:52:54 +00:00
Andrew Trick	493b867b5d	MISched: cleanup, use the proper iterator type. llvm-svn: 170450	2012-12-18 20:52:52 +00:00
Andrew Trick	ffb6168e85	MISched: minor improvement, initialize remaining resources before the first scheduling decision. llvm-svn: 170449	2012-12-18 20:52:49 +00:00
Benjamin Kramer	f0e5d2f032	LoopVectorize: Emit reductions as log2(vectorsize) shuffles + vector ops instead of scalar operations. For example on x86 with SSE4.2 a <8 x i8> add reduction becomes movdqa %xmm0, %xmm1 movhlps %xmm1, %xmm1 ## xmm1 = xmm1[1,1] paddw %xmm0, %xmm1 pshufd $1, %xmm1, %xmm0 ## xmm0 = xmm1[1,0,0,0] paddw %xmm1, %xmm0 phaddw %xmm0, %xmm0 pextrb $0, %xmm0, %edx instead of pextrb $2, %xmm0, %esi pextrb $0, %xmm0, %edx addb %sil, %dl pextrb $4, %xmm0, %esi addb %dl, %sil pextrb $6, %xmm0, %edx addb %sil, %dl pextrb $8, %xmm0, %esi addb %dl, %sil pextrb $10, %xmm0, %edi pextrb $14, %xmm0, %edx addb %sil, %dil pextrb $12, %xmm0, %esi addb %dil, %sil addb %sil, %dl llvm-svn: 170439	2012-12-18 18:40:20 +00:00
Eli Bendersky	39e7c6e370	Get rid of the pesky -Woverloaded-virtual warning. No change in functionality. llvm-svn: 170438	2012-12-18 18:21:29 +00:00
Jakob Stoklund Olesen	422e07b091	Tighten the insert() API for bundled instructions. The normal insert() function takes an MBB::iterator position, and inserts a stand-alone MachineInstr as before. The insert() function that takes an MBB::instr_iterator position can insert instructions inside a bundle, and will now update the bundle flags correctly when that happens. When the insert position is between two bundles, it is unclear whether the instruction should be appended to the previous bundle, prepended to the next bundle, or stand on its own. The MBB::insert() function doesn't bundle the instruction in that case, use the MIBundleBuilder class for that. llvm-svn: 170437	2012-12-18 17:54:53 +00:00
Hal Finkel	943f76d1b3	Check multiple register classes for inline asm tied registers A register can be associated with several distinct register classes. For example, on PPC, the floating point registers are each associated with both F4RC (which holds f32) and F8RC (which holds f64). As a result, this code would fail when provided with a floating point register and an f64 operand because it would happen to find the register in the F4RC class first and return that. From the F4RC class, SDAG would extract f32 as the register type and then assert because of the invalid implied conversion between the f64 value and the f32 register. Instead, search all register classes. If a register class containing the the requested register has the requested type, then return that register class. Otherwise, as before, return the first register class found that contains the requested register. llvm-svn: 170436	2012-12-18 17:50:58 +00:00
Nadav Rotem	c0699854dd	Enable the loop vectorizer. llvm-svn: 170416	2012-12-18 06:37:12 +00:00
Nadav Rotem	a5024fc3e1	SROA: Replace calls to getScalarSizeInBits to DataLayout's API because getScalarSizeInBits could not handle vectors of pointers. llvm-svn: 170412	2012-12-18 05:23:31 +00:00
Rafael Espindola	46b9c8a2cd	Initialize NoRedZone and remove unused default values. llvm-svn: 170404	2012-12-18 03:35:05 +00:00
Jakob Stoklund Olesen	41bbf9c256	Repair bundles that were broken by removing and reinserting the first instruction. This isn't strictly necessary at the moment because Thumb2SizeReduction also copies all MI flags from the old instruction to the new. However, a future patch will make that kind of direct flag tampering illegal. llvm-svn: 170395	2012-12-18 00:46:39 +00:00
Eric Christopher	79f165699d	Formatting. llvm-svn: 170394	2012-12-18 00:42:26 +00:00
Eric Christopher	906da23229	Add support for passing -main-file-name all the way through to the assembler. Part of PR14624 llvm-svn: 170390	2012-12-18 00:31:01 +00:00
Eric Christopher	a7c3273e85	Cleanup formatting and whitespace. llvm-svn: 170389	2012-12-18 00:30:54 +00:00
Jakob Stoklund Olesen	43b1e13386	Extract a method, no functional change intended. Sadly, this costs us a perfectly good opportunity to use 'goto'. llvm-svn: 170385	2012-12-18 00:13:11 +00:00
Jakob Stoklund Olesen	ccfb5fb472	Tighten up the erase/remove API for bundled instructions. Most code is oblivious to bundles and uses the MBB::iterator which only visits whole bundles. MBB::erase() operates on whole bundles at a time as before. MBB::remove() now refuses to remove bundled instructions. It is not safe to remove all instructions in a bundle without deleting them since there is no way of returning pointers to all the removed instructions. MBB::remove_instr() and MBB::erase_instr() will now update bundle flags correctly, lifting individual instructions out of bundles while leaving the remaining bundle intact. The MachineInstr convenience functions are updated so eraseFromParent() erases a whole bundle as before eraseFromBundle() erases a single instruction, leaving the rest of its bundle. removeFromParent() refuses to operate on bundled instructions, and removeFromBundle() lifts a single instruction out of its bundle. These functions will no longer accidentally split or coalesce bundles - bundle flags are updated to preserve the existing bundling, and explicit bundleWith* / unbundleFrom* functions should be used to change the instruction bundling. This API update is still a work in progress. I am going to update APIs first so they maintain bundle flags automatically when possible. Then I'll add stricter verification of the bundle flags. llvm-svn: 170384	2012-12-17 23:55:38 +00:00
Reed Kotler	0c1745e56a	EmitDebugLabel should by default be the same as EmitLabel everywhere. It must be explicity set in MCPureStreamer because otherwise it will inherit incorrectly from the parent. llvm-svn: 170383	2012-12-17 23:41:45 +00:00
Eli Bendersky	d371eb3060	fix indentation llvm-svn: 170381	2012-12-17 22:50:56 +00:00
Chad Rosier	150d35bc1d	[arm fast-isel] Minor cleanup. No functional change intended. llvm-svn: 170379	2012-12-17 22:35:29 +00:00
Chandler Carruth	10700aad85	Prepare LLVM to fix PR14625, exposing a hook in MCContext to manage the compilation directory. This defaults to the current working directory, just as it always has, but now an assembler can choose to override it with a custom directory. I've taught llvm-mc about this option and added a test case. llvm-svn: 170371	2012-12-17 21:32:42 +00:00
Michael Ilseman	acdb76d339	Removed trailing whitespace llvm-svn: 170367	2012-12-17 20:37:55 +00:00
Chad Rosier	62a144f099	[arm fast-isel] Fast-isel only handles simple VTs, so make sure the necessary checks are in place. Some minor cleanup as well. llvm-svn: 170360	2012-12-17 19:59:43 +00:00
Chandler Carruth	e3f4119b06	Fix another SROA crasher, PR14601. This was a silly oversight, we weren't pruning allocas which were used by variable-length memory intrinsics from the set that could be widened and promoted as integers. Fix that. llvm-svn: 170353	2012-12-17 18:48:07 +00:00
Tim Northover	d05e6b5817	Query section for whether it should be executable. llvm-svn: 170350	2012-12-17 17:59:35 +00:00
Tim Northover	5edabc131a	Teach MachO which sections contain code llvm-svn: 170349	2012-12-17 17:59:32 +00:00
Evgeniy Stepanov	88b8dceddf	[msan] Fix lint warning. llvm-svn: 170347	2012-12-17 16:30:05 +00:00
Richard Osborne	459e35c261	Add instruction encodings / disassembly support for l2r instructions. llvm-svn: 170345	2012-12-17 16:28:02 +00:00
Tom Stellard	5a6879466a	R600: enable S_N2_ instructions They seem to work fine. Patch by: Christian König Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Tested-by: Michel Dänzer <michel.daenzer@amd.com> Signed-off-by: Christian König <deathsimple@vodafone.de> llvm-svn: 170343	2012-12-17 15:14:56 +00:00
Tom Stellard	9e90b5895d	R600: BB operand support for SI Patch by: Christian König Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Tested-by: Michel Dänzer <michel.daenzer@amd.com> Signed-off-by: Christian König <deathsimple@vodafone.de> llvm-svn: 170342	2012-12-17 15:14:54 +00:00
Tom Stellard	16a17c6d3e	R600: remove nonsense setPrefLoopAlignment The Align parameter is a power of two, so 16 results in 64K alignment. Additional to that even 16 byte alignment doesn't make any sense, so just remove it. Patch by: Christian König Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Tested-by: Michel Dänzer <michel.daenzer@amd.com> Signed-off-by: Christian König <deathsimple@vodafone.de> llvm-svn: 170341	2012-12-17 15:14:53 +00:00
Chandler Carruth	21eb4e96c2	Teach the rewriting of memcpy calls to support subvector copies. This also cleans up a bit of the memcpy call rewriting by sinking some irrelevant code further down and making the call-emitting code a bit more concrete. Previously, memcpy of a subvector would actually miscompile (!!!) the copy into a single vector element copy. I have no idea how this ever worked. =/ This is the memcpy half of PR14478 which we probably weren't noticing previously because it didn't actually assert. The rewrite relies on the newly refactored insert- and extractVector functions to do the heavy lifting, and those are the same as used for loads and stores which makes the test coverage a bit more meaningful here. llvm-svn: 170338	2012-12-17 14:51:24 +00:00
Patrik Hagglund	c494d24a68	Revert/correct some FastISel changes in r170104 (EVT->MVT for TargetLowering::getRegClassFor). Some isSimple() guards were missing, or getSimpleVT() were hoisted too far, resulting in asserts on valid LLVM assembly input. llvm-svn: 170336	2012-12-17 14:30:06 +00:00
Evgeniy Stepanov	95a80abead	Optimize tree walking in markAliveBlocks. Check whether a BB is known as reachable before adding it to the worklist. This way BB's with multiple predecessors are added to the list no more than once. llvm-svn: 170335	2012-12-17 14:28:00 +00:00
Richard Osborne	51bf1b269a	Add instruction encodings for PEEK and ENDIN. Previously these were marked with the wrong format. llvm-svn: 170334	2012-12-17 14:23:54 +00:00
Chandler Carruth	cacda256a1	Fix a secondary bug I introduced while fixing the first part of PR14478. The first half of fixing this bug was actually in r170328, but was entirely coincidental. It did however get me to realize the nature of the bug, and adapt the test case to test more interesting behavior. In turn, that uncovered the rest of the bug which I've fixed here. This should fix two new asserts that showed up in the vectorize nightly tester. llvm-svn: 170333	2012-12-17 14:03:01 +00:00
Richard Osborne	c104bf2769	Fix parameter name in prototypes in XCoreDisassembler. llvm-svn: 170332	2012-12-17 13:55:49 +00:00
Chandler Carruth	95e1fb8a42	Hoist a convertValue call to the two paths where it is needed. I noticed this while looking at r170328. We only ever do a vector rewrite when the alloca is the vector type, so it's good to not paper over bugs here by doing a convertValue that isn't needed. llvm-svn: 170331	2012-12-17 13:51:03 +00:00
Richard Osborne	041071c558	Add instruction encodings / disassembly support for rus instructions. llvm-svn: 170330	2012-12-17 13:50:04 +00:00
Chandler Carruth	ce4562bdcb	Hoist the insertVector helper to be a static helper. This will allow its use inside of memcpy rewriting as well. This routine is more complex than extractVector, and some of its uses are not 100% where I want them to be so there is still some work to do here. While this can technically change the output in some cases, it shouldn't be a change that matters -- IE, it can leave some dead code lying around that prior versions did not, etc. Yet another step in the refactorings leading up to the solution to the last component of PR14478. llvm-svn: 170328	2012-12-17 13:41:21 +00:00
Richard Osborne	e405e58639	Add instruction encodings for ZEXT and SEXT. Previously these were marked with the wrong format. llvm-svn: 170327	2012-12-17 13:20:37 +00:00
Chandler Carruth	b6bc8749e8	Lift the extractVector helper all the way out to a static helper function. The method helpers all implicitly act upon the alloca, and what we really want is a fully generic helper. Doing memcpy rewrites is more special than all other rewrites because we are at times rewriting instructions which touch pointers other than the alloca. As a consequence all of the helpers needed by memcpy rewriting of sub-vector copies will need to be generalized fully. Note that all of these helpers ({insert,extract}{Integer,Vector}) are woefully uncommented. I'm going to go back through and document them once I get the factoring correct. No functionality changed. llvm-svn: 170325	2012-12-17 13:07:30 +00:00
Chandler Carruth	769445ef03	Factor the vector load rewriting into a more generic form. This makes it suitable for use in rewriting memcpy in the presence of subvector memcpy intrinsics. No functionality changed. llvm-svn: 170324	2012-12-17 12:50:21 +00:00
Richard Osborne	3a0d5cc314	Add instruction encodings / disassembly support for 2r instructions. llvm-svn: 170323	2012-12-17 12:29:31 +00:00
Richard Osborne	016967e4ff	Add instruction encodings / disassembly support for 0r instructions. llvm-svn: 170322	2012-12-17 12:26:29 +00:00
Richard Osborne	1cc2b68ad6	Simplify assertion in XCoreInstPrinter. llvm-svn: 170321	2012-12-17 12:13:46 +00:00
Richard Osborne	4e1e14bccd	Update comments to match recommended doxygen style. llvm-svn: 170320	2012-12-17 12:13:41 +00:00
Richard Osborne	eb31fa483e	Remove unnecessary include. llvm-svn: 170319	2012-12-17 12:13:32 +00:00
Craig Topper	354ed773b8	Remove EFLAGS from the BLSI/BLSMSK/BLSR patterns. The nodes created by DAG combine don't contain an EFLAGS def. llvm-svn: 170308	2012-12-17 06:13:48 +00:00
Craig Topper	f3ff6ae066	Simplify BMI ANDN matching to use patterns instead of a DAG combine. Also add ANDN to isDefConvertible. llvm-svn: 170305	2012-12-17 05:12:30 +00:00
Craig Topper	f924a58af1	Add rest of BMI/BMI2 instructions to the folding tables as well as popcnt and lzcnt. llvm-svn: 170304	2012-12-17 05:02:29 +00:00
Craig Topper	5b08cf7736	Remove store forms of DEC/INC from isDefConvertible. Since they are stores they don't have a register def. llvm-svn: 170303	2012-12-17 04:55:07 +00:00
Chandler Carruth	ccca504f3a	Fix the first part of PR14478: memset now works. PR14478 highlights a serious problem in SROA that simply wasn't being exercised due to a lack of vector input code mixed with C-library function calls. Part of SROA was written carefully to handle subvector accesses via memset and memcpy, but the rewriter never grew support for this. Fixing it required refactoring the subvector access code in other parts of SROA so it could be shared, and then fixing the splat formation logic and using subvector insertion (this patch). The PR isn't quite fixed yet, as memcpy is still broken in the same way. I'm starting on that series of patches now. Hopefully this will be enough to bring the bullet benchmark back to life with the bb-vectorizer enabled, but that may require fixing memcpy as well. llvm-svn: 170301	2012-12-17 04:07:37 +00:00
Chandler Carruth	eae65a5629	Extract the logic for inserting a subvector into a vector alloca. No functionality changed. Another step of refactoring toward solving PR14487. llvm-svn: 170300	2012-12-17 04:07:35 +00:00
Chandler Carruth	514f34f9c4	Lift the integer splat computation into a helper function. No functionality changed. Refactoring leading up to the fix for PR14478 which requires some significant changes to the memset and memcpy rewriting. llvm-svn: 170299	2012-12-17 04:07:30 +00:00
Craig Topper	588ceec0f7	Add debug prints for when optimizeLoadInstr folds a load. llvm-svn: 170298	2012-12-17 03:56:00 +00:00
Richard Osborne	1b5562ad8e	Add instruction encodings and disassembly for 1r instructions. llvm-svn: 170293	2012-12-16 17:37:34 +00:00
Richard Osborne	e31735a52b	Add XCore disassembler. Currently there is no instruction encoding info and XCoreDisassembler::getInstruction() always returns Fail. I intend to add instruction encodings and tests in follow on commits. llvm-svn: 170292	2012-12-16 17:29:14 +00:00
Richard Osborne	872f51e301	Remove invalid instruction encodings. llvm-svn: 170291	2012-12-16 16:46:31 +00:00
Richard Osborne	e298556706	Mark anything deriving from PseudoInstXCore as a pseudo instruction. llvm-svn: 170290	2012-12-16 16:46:28 +00:00
Richard Osborne	f12cb9ef27	Set instruction size correctly in XCoreInstrFormats.td llvm-svn: 170289	2012-12-16 16:46:24 +00:00
Richard Osborne	3c31e21837	Change XCoreAsmPrinter to lower MachineInstrs to MCInsts before emission. This change adds XCoreMCInstLower to do the lowering to MCInst and XCoreInstPrinter to print the MCInsts. llvm-svn: 170288	2012-12-16 16:20:48 +00:00
Richard Osborne	b1de9f7e07	Replace ${:comment} with the comment symbol. llvm-svn: 170286	2012-12-16 15:59:02 +00:00
Dmitri Gribenko	2943ce80f3	Declare class DwarfDebug before use instead of relying on a forward declaration from some other unrelated header. Patch by Kai. llvm-svn: 170284	2012-12-16 12:57:36 +00:00
NAKAMURA Takumi	c7146e251d	MCPureStreamer.cpp: Try to fix build, pruning EmitDebugLabel(). llvm-svn: 170280	2012-12-16 04:23:20 +00:00
Reed Kotler	aee4d5d194	This patch is needed to make c++ exceptions work for mips16. Mips16 is really a processor decoding mode (ala thumb 1) and in the same program, mips16 and mips32 functions can exist and can call each other. If a jal type instruction encounters an address with the lower bit set, then the processor switches to mips16 mode (if it is not already in it). If the lower bit is not set, then it switches to mips32 mode. The linker knows which functions are mips16 and which are mips32. When relocation is performed on code labels, this lower order bit is set if the code label is a mips16 code label. In general this works just fine, however when creating exception handling tables and dwarf, there are cases where you don't want this lower order bit added in. This has been traditionally distinguished in gas assembly source by using a different syntax for the label. lab1: ; this will cause the lower order bit to be added lab2=. ; this will not cause the lower order bit to be added In some cases, it does not matter because in dwarf and debug tables the difference of two labels is used and in that case the lower order bits subtract each other out. To fix this, I have added to mcstreamer the notion of a debuglabel. The default is for label and debug label to be the same. So calling EmitLabel and EmitDebugLabel produce the same result. For various reasons, there is only one set of labels that needs to be modified for the mips exceptions to work. These are the "$eh_func_beginXXX" labels. Mips overrides the debug label suffix from ":" to "=." . This initial patch fixes exceptions. More changes most likely will be needed to DwarfCFException to make all of this work for actual debugging. These changes will be to emit debug labels in some places where a simple label is emitted now. Some historical discussion on this from gcc can be found at: http://gcc.gnu.org/ml/gcc-patches/2008-08/msg00623.html http://gcc.gnu.org/ml/gcc-patches/2008-11/msg01273.html llvm-svn: 170279	2012-12-16 04:00:45 +00:00
Benjamin Kramer	b16ccde7a4	X86: Add a couple of target-specific dag combines that turn VSELECTS into psubus if possible. We match the pattern "x >= y ? x-y : 0" into "subus x, y" and two special cases if y is a constant. DAGCombiner canonicalizes those so we first have to undo the canonicalization for those cases. The pattern occurs in gzip when the loop vectorizer is enabled. Part of PR14613. llvm-svn: 170273	2012-12-15 16:47:44 +00:00
Chandler Carruth	067edd342f	Relax an overly aggressive assert to fix PR14572. The alloca width is based on the alloc size, not the type size. llvm-svn: 170270	2012-12-15 09:26:06 +00:00
Chandler Carruth	7a28f95419	Make '-mtune=x86_64' assume fast unaligned memory accesses. Not all chips targeted by x86_64 have this feature, but a dramatically increasing number do. Specifying a chip-specific tuning parameter will continue to turn the feature on or off as appropriate for that particular chip, but the generic flag should try to achieve the best performance on the most widely available hardware. Today, the number of chips with fast UA access dwarfs those without in the x86-64 space. Note that this also brings LLVM's code generation for this '-march' flag more in line with that of modern GCCs. Reviewed by Dan Gohman. llvm-svn: 170269	2012-12-15 09:01:13 +00:00
NAKAMURA Takumi	8f45b6c709	Revert r170246, "Enable the loop vectorizer by default." llvm-svn: 170267	2012-12-15 06:11:13 +00:00
Reed Kotler	5fdeb21249	This code implements most of mips16 hardfloat as it is done by gcc. In this case, essentially it is soft float with different library routines. The next step will be to make this fully interoperational with mips32 floating point and that requires creating stubs for functions with signatures that contain floating point types. I have a more sophisticated design for mips16 hardfloat which I hope to implement at a later time that directly does floating point without the need for function calls. The mips16 encoding has no floating point instructions so one needs to switch to mips32 mode to execute floating point instructions. llvm-svn: 170259	2012-12-15 00:20:05 +00:00
Eric Christopher	a2de826d29	To simplify some code move the unit emission into the holders. Make emitDIE public accordingly. No functional change. llvm-svn: 170258	2012-12-15 00:04:07 +00:00
Eric Christopher	16485a5164	Use begin and end label names from the section for info. llvm-svn: 170257	2012-12-15 00:04:04 +00:00
Kevin Enderby	06aa3eb8ce	Make sure the alternate PC+imm syntax of LDR instruction with a small immediate generates the narrow version. Needed when doing round-trip assemble/disassemble testing using the alternate syntax that specifies 'pc' directly. llvm-svn: 170255	2012-12-14 23:04:25 +00:00
Michael Ilseman	e2754dc887	Add back FoldOpIntoPhi optimizations with fix. Included test cases to help catch these errors and to test the presence of the optimization itself llvm-svn: 170248	2012-12-14 22:08:26 +00:00

... 2 3 4 5 6 ...

58324 Commits