llvm-project

Commit Graph

Author	SHA1	Message	Date
Craig Topper	9a9468ee02	Remove underscores from TBM instruction names for consistency with other instruction naming. llvm-svn: 192040	2013-10-05 19:27:26 +00:00
Craig Topper	52196640a2	Remove unneeded TBM intrinsics. The arithmetic/logical operation patterns are sufficient. llvm-svn: 192039	2013-10-05 19:22:59 +00:00
Craig Topper	80bd135e7a	Add an additional pattern for BLCI since opt can turn (not (add x, 1)) into (sub -2, x). llvm-svn: 192037	2013-10-05 17:17:53 +00:00
Rafael Espindola	ac4ad25a00	Remove some really nasty uses of hasRawTextSupport. When MC was first added, targets could use hasRawTextSupport to keep features working before they were added to the MC interface. The design goal of MC is to provide an uniform api for printing assembly and object files. Short of relaxations and other corner cases, a object file is just another representation of the assembly. It was never the intention that targets would keep doing things like if (hasRawTextSupport()) Set flags in one way. else Set flags in another way. When they do that they create two code paths and the object file is no longer just another representation of the assembly. This also then requires testing with llc -filetype=obj, which is extremelly brittle. This patch removes some of these hacks by replacing them with smaller ones. The ARM flag setting is trivial, so I just moved it to the constructor. For Mips, the patch adds two temporary hack directives that allow the assembly to represent the same things as the object file was already able to. The hope is that the mips developers will replace the hack directives with the same ones that gas uses and drop the -print-hack-directives flag. I will also try to implement a target streamer interface, so that we can move this out of the common code. In summary, for any new work, two rules of the thumb are * Don't use "llc -filetype=obj" in tests. * Don't add calls to hasRawTextSupport. llvm-svn: 192035	2013-10-05 16:42:21 +00:00
Jiangning Liu	ad242fbb71	Implement aarch64 neon instruction set AdvSIMD (Across). llvm-svn: 192028	2013-10-05 08:22:10 +00:00
Craig Topper	a1bbc323fa	Add OPC_CheckChildSame0-3 to the DAG isel matcher. This replaces sequences of MoveChild, CheckSame, MoveParent. Saves 846 bytes from the X86 DAG isel matcher, ~300 from ARM, ~840 from Hexagon. llvm-svn: 192026	2013-10-05 05:38:16 +00:00
Venkatraman Govindaraju	ece63dbd0d	[Sparc] Use correct alignment while loading/storing fp128 values. llvm-svn: 192023	2013-10-05 02:29:47 +00:00
Andrew Kaylor	480dcb3ee7	Adding multiple GOT handling to RuntimeDyldELF Patch by Ashok Thirumurthi llvm-svn: 192020	2013-10-05 01:52:09 +00:00
Manman Ren	b3388601fb	Debug Info: In DIBuilder, the derived-from field of a DW_TAG_pointer_type is updated to use DITypeRef. Move isUnsignedDIType and getOriginalTypeSize from DebugInfo.h to be static helper functions in DwarfCompileUnit. We already have a static helper function "isTypeSigned" in DwarfCompileUnit, and a pointer to DwarfDebug is added to resolve the derived-from field. All three functions need to go across link for derived-from fields, so we need to get hold of a type identifier map. A pointer to DwarfDebug is also added to DbgVariable in order to resolve the derived-from field. Debug info verifier is updated to check a derived-from field is a TypeRef. Verifier will not go across link for derived-from fields, in debug info finder, we go across the link to add derived-from fields to types. Function getDICompositeType is only used by dragonegg and since dragonegg does not generate identifier for types, we use an empty map to resolve the derived-from field. When printing a derived-from field, we use DITypeRef::getName to either return the type identifier or getName of the DIType. A paired commit at clang is required due to changes to DIBuilder. llvm-svn: 192018	2013-10-05 01:43:03 +00:00
Eric Christopher	3264a48a45	Reorganize some member variables and update a comment. llvm-svn: 192017	2013-10-05 00:39:55 +00:00
Eric Christopher	87b9c49c72	Fix one comment and update another. Slightly reformat. llvm-svn: 192016	2013-10-05 00:32:34 +00:00
Venkatraman Govindaraju	30781deb1c	[Sparc] Respect hasHardQuad parameter correctly when lowering SINT_TO_FP with fp128 operand. llvm-svn: 192015	2013-10-05 00:31:41 +00:00
Eric Christopher	9e429ae779	Add a resolve method on CompileUnit that forwards to DwarfDebug. llvm-svn: 192014	2013-10-05 00:27:02 +00:00
Adrian Prantl	f01b562a15	Debug info: Don't crash in SelectionDAGISel when a vreg that is being pointed to by a dbg_value belonging to a function argument is eliminated during instruction selection. rdar://problem/15094721. llvm-svn: 192011	2013-10-05 00:08:27 +00:00
Eric Christopher	fa205cad7c	Make a bunch of CompileUnit member functions private. llvm-svn: 192009	2013-10-05 00:05:51 +00:00
Venkatraman Govindaraju	84f1523cac	[Sparc] Correct the floating point conditional code mapping in GetOppositeBranchCondition(). llvm-svn: 192006	2013-10-04 23:54:30 +00:00
David Blaikie	93ff1eb5fb	Minor formatting/comment rewording/etc. llvm-svn: 192005	2013-10-04 23:52:02 +00:00
Eric Christopher	fe3ae44179	Remove odd use of this. llvm-svn: 192004	2013-10-04 23:49:31 +00:00
Eric Christopher	f0388b7b39	Reformat some odd formattings. llvm-svn: 192003	2013-10-04 23:49:29 +00:00
Eric Christopher	08f7c8f1fe	Tighten up some type arguments to functions. Where we expect a scope, pass a scope. llvm-svn: 192002	2013-10-04 23:49:26 +00:00
Hal Finkel	f5a3eaea55	UpdatePHINodes in BasicBlockUtils should not crash on duplicate predecessors UpdatePHINodes has an optimization to reuse an existing PHI node, where it first deletes all of its entries and then replaces them. Unfortunately, in the case where we had duplicate predecessors (which are allowed so long as the associated PHI entries have the same value), the loop removing the existing PHI entries from the to-be-reused PHI would assert (if that PHI was not the one which had the duplicates). llvm-svn: 192001	2013-10-04 23:41:05 +00:00
David Blaikie	41369b5f41	Remove some dead code. llvm-svn: 192000	2013-10-04 23:37:30 +00:00
David Blaikie	fac5612ab0	Simplify setting of DIE tag for type DIEs by setting it in one* place. * two actually due to some weird template thing... investigating that. llvm-svn: 191998	2013-10-04 23:21:16 +00:00
Eric Christopher	baf3816283	Prune includes. llvm-svn: 191994	2013-10-04 22:54:28 +00:00
Jack Carter	215527449d	forgot to remove this file as well llvm-svn: 191993	2013-10-04 22:54:05 +00:00
Jack Carter	13d5f753f8	reverting per request llvm-svn: 191992	2013-10-04 22:52:31 +00:00
Eric Christopher	6b8209b6b7	Use addFlag to add the enum class attribute. This has the side effect of using DW_FORM_flag_present on dwarf4 and above. llvm-svn: 191991	2013-10-04 22:40:10 +00:00
Eric Christopher	dccd32866b	Use Die->addValue and DIEIntegerOne directly when we want to add a flag. No functional change. llvm-svn: 191990	2013-10-04 22:40:05 +00:00
Hal Finkel	dbc7a8a8a3	Fix DAGCombiner::visitFP_EXTEND to ignore indexed loads DAGCombiner::visitFP_EXTEND will apply the following transformation: fold (fpext (load x)) -> (fpext (fptrunc (extload x))) but the implementation does not handle indexed loads (pre/post inc.), but did not specifically ignore them either (unlike for extending loads, which it already ignored), causing an assert when the transformation was applied to an indexed load. This is the minimal fix for correctness (causing the transformation to be skipped for indexed loads). Unfortunately, I don't have an in-tree test case. llvm-svn: 191989	2013-10-04 22:18:12 +00:00
Reed Kotler	1b5b5c95cc	Support tblockaddr for static compilation in Mips16. llvm-svn: 191986	2013-10-04 22:01:40 +00:00
Jack Carter	721726adfc	[MC][AsmParser] Hook for post assembly file processing This patch handles LLVM standalone assembler (llvm-mc) ELF flag setting based on input file directive processing. Mips assembly requires processing inline directives that directly and indirectly affect the output ELF header flags. This patch handles one ".abicalls". To process these directives we are following the model the code generator uses by storing state in a container as we go through processing and when we detect the end of input file processing, AsmParser is notified and we update the ELF header flags through a MipsELFStreamer method with a call from MCTargetAsmParser::emitEndOfAsmFile(MCStreamer &OutStreamer). This patch will allow other targets the same functionality. Jack llvm-svn: 191982	2013-10-04 21:26:15 +00:00
Akira Hatanaka	55504b4ac9	[mips] Fix a bug in MipsLongBranch::replaceBranch, which was erasing instructions in delay slots along with the original branch instructions. llvm-svn: 191978	2013-10-04 20:51:40 +00:00
Arnold Schwaighofer	698d4ac8a8	SLPVectorizer: Sort inputs to commutative binary operations Sort the operands of the other entries in the current vectorization root according to the first entry's operands opcodes. %conv0 = uitofp ... %load0 = load float ... = fmul %conv0, %load0 = fmul %load0, %conv1 = fmul %load0, %conv2 Make sure that we recursively vectorize <%conv0, %conv1, %conv2> and <%load0, %load0, %load0>. This makes it more likely to obtain vectorizable trees. We have to be careful when we sort that we don't destroy 'good' existing ordering implied by source order. radar://15080067 llvm-svn: 191977	2013-10-04 20:39:16 +00:00
Eric Christopher	c19d6f096c	Temporarily revert r176882 as it needs to be implemented in a different way for all platforms. llvm-svn: 191975	2013-10-04 19:40:33 +00:00
Eric Christopher	e595bae4a4	Temporarily revert r191792 as it is causing some LTO debug failures on platforms with relocations in debug info and also temporarily revert r191800 due to conflicts with the revert of r191792. llvm-svn: 191967	2013-10-04 17:08:38 +00:00
Matthias Braun	caff764739	Fix comment llvm-svn: 191966	2013-10-04 16:53:02 +00:00
Matthias Braun	6a57acf44a	Fix indentation llvm-svn: 191965	2013-10-04 16:53:00 +00:00
Matthias Braun	c9d5c0f21d	Fix typo llvm-svn: 191964	2013-10-04 16:52:58 +00:00
Matthias Braun	2f169f900b	ARM: optimizeSelect has to consider the previous register class optimizeSelect folds (predicated) copy instructions, it must not ignore the original register class of the operand when replacing the register with the copies dest register. llvm-svn: 191963	2013-10-04 16:52:56 +00:00
Matthias Braun	c22630e164	ARM: do not add a regmask for TAILJUMPs The jump doesn't really kill the registers, the following call does but we never get back anyway. This avoids some verify-machineinstrs problems when TAILJUMPs are if-converted. llvm-svn: 191962	2013-10-04 16:52:54 +00:00
Matthias Braun	da621165ca	ARM: preserve undef flag in pseudo instruction expanders Copy over the whole register machine operand instead of creating a new one with an incomplete set of flags. llvm-svn: 191961	2013-10-04 16:52:51 +00:00
Jiangning Liu	ac5fd7e5d3	Implement aarch64 neon instruction set AdvSIMD (3V elem). llvm-svn: 191944	2013-10-04 09:20:44 +00:00
Craig Topper	d9a6cc031d	Revert r191940 to see if it fixes the build bots. llvm-svn: 191941	2013-10-04 05:52:17 +00:00
Craig Topper	a2efe9ebc6	Add OPC_CheckChildSame0-3 to the DAG isel matcher. This replaces sequences of MoveChild, CheckSame, MoveParent. Saves 846 bytes from the X86 DAG isel matcher, ~300 from ARM, ~840 from Hexagon. llvm-svn: 191940	2013-10-04 05:22:20 +00:00
David Blaikie	309ffe4016	DebugInfo: Fix ordering of members after r191928 In the case (shown in the attached test) where a member function definition was emitted into debug info the following could occur: 1) build the debug info for the member function definition 2) in (1), build the debug info for the member function declaration 3) construct and add the member function declaration DIE 4) add it to its context 5) build its context (the type it is a member of) 6) construct the members and add them to the type 7) except don't add member functions because "getOrCreateSubprogram" adds the function to its parent anyway 8) except we're only partway through building this subprogram declaration so it hasn't been added yet - but we returned the partially constructed DIE (since it's already in the MDNode->DIE mapping to avoid infinitely recursing trying to create the member function DIE) 9) once the type is constructed, add the member function to it 10) now the members are out of order (the member function being defined is listed as the last member, even though it was declared as the first) To avoid this, construct the context of the subprogram DIE before we query to see if it exists. That way we never end up creating it before creating its context and ending up in this situation. Alternatively, the type construction that visits/builds all the members could call something like getOrCreateSubprogram, but that doesn't ever do the "add to context" step. Then the type building code would always be responsible for adding members (and the subprogram "addToContextDIE" would no-op because the context building would have added the subprogram declaration to the type/context DIE already). (the test cases updated were overly-sensitive to offsets or abbreviation numbers. We don't have a nice way to make these tests more robust as yet - multiline FileCheck matches would be required) llvm-svn: 191939	2013-10-04 01:39:59 +00:00
Andrew Kaylor	1b2cfb6495	Adding support and tests for multiple module handling in lli llvm-svn: 191938	2013-10-04 00:49:38 +00:00
Richard Mitton	c250824772	Fixed a bug with section names containing special characters. Changed the dwarf aranges code to not use getLabelEndName, as it turns out it's not reliable to call that given user-defined section names. Section names can have characters in that aren't representable as symbol names. The dwarf-aranges test case has been updated to include a special character, to check this. This fixes pr17416. llvm-svn: 191932	2013-10-03 22:07:08 +00:00
Owen Anderson	5797bfd4a3	Pull fptrunc's upwards through selects when one of the select's selectands was a constant. This has a number of benefits, including producing small immediates (easier to materialize, smaller constant pools) as well as being more likely to allow the fptrunc to fuse with a preceding instruction (truncating selects are unusual). llvm-svn: 191929	2013-10-03 21:08:05 +00:00
David Blaikie	811bfe6395	DebugInfo: Avoid redundantly adding child DIEs to parents. DIE::addChild had a shortcircuit that silently no-op'd when a child was readded to the same parent. This hid some quirky/redundant code in DwarfDebug/CompileUnit. By removing that functionality and replacing it with an assert I was able to find and cleanup those cases, mostly centering around adding members to types in various circumstances. 1) The original oddity I noticed while working on type units (which actually was helping me in the short term, by accident) was the addToContextOwner call in constructTypeDIE. This call was completely bogus (why was it only done for non-virtual types? what relevance does that have at all) and redundant with the more uniform addToContextOwner made in getOrCreateTypeDIE. 2) If a member function definition was visited (createSubprogramDIE), it would attempt to build the member function declaration. The declaration DIE would then be added to its context, but in building the context (the type for which this function is a member) the members of the type would be added to the type automatically, so by the time the context was constructed, the member function was already associated with it. 3) The same as (2) but without the member function being constructed first. Whenever a type was constructed, the members would be created and member functions would be created by getOrCreateSubprogramDIE - this would lead to the subprogram being added to the (incomplete) type already, then the general member-construction code would add it again. llvm-svn: 191928	2013-10-03 20:07:20 +00:00
Matt Arsenault	40dddd7147	Rename DataLayout variables TD -> DL llvm-svn: 191927	2013-10-03 19:50:01 +00:00
Rafael Espindola	cda2911caa	Optimize linkonce_odr unnamed_addr functions during LTO. Generalize the API so we can distinguish symbols that are needed just for a DSO symbol table from those that are used from some native .o. The symbols that are only wanted for the dso symbol table can be dropped if llvm can prove every other dso has a copy (linkonce_odr) and the address is not important (unnamed_addr). llvm-svn: 191922	2013-10-03 18:29:09 +00:00
Matt Arsenault	bfa37e546d	Make gep i8* X, -(ptrtoint Y) transform work with address spaces llvm-svn: 191920	2013-10-03 18:15:57 +00:00
Tom Roeder	724143a752	Test commit. Fixed a copy-paste error in the Makefile for lib/LTO. llvm-svn: 191918	2013-10-03 18:05:12 +00:00
Quentin Colombet	76e5557981	[llvm-c][Disassembler] When printing latency information, fall back to the itinerary model in case the target does not supply a scheduling model. By doing this, targets like cortex-a8 can benefit from the latency printing feature added in r191859. This part of <rdar://problem/14687488>. llvm-svn: 191916	2013-10-03 17:51:49 +00:00
Eric Christopher	c948b9df23	Make sure we emit a section for pubnames even if that section is going to be empty. This is particularly important for the gnu pubnames case since we're emitting a relocation to the section. llvm-svn: 191915	2013-10-03 17:41:20 +00:00
Eric Christopher	f976c77ed7	Fix cut and paste typo. llvm-svn: 191914	2013-10-03 17:41:16 +00:00
Benjamin Kramer	8f5d425160	raw_fd_ostream: Be more verbose about the reason when opening a file fails. llvm-svn: 191911	2013-10-03 16:59:14 +00:00
Jin-Gu Kang	0bf8241d4b	Added checking code whehter target supports specific dag combining about rotate or not. The corresponding dag patterns are as following: "DAGCombier::MatchRotate" function in DAGCombiner.cpp Pattern1 // fold (or (shl (ext x), (ext y)), // (srl (ext x), (ext (sub 32, y)))) -> // (ext (rotl x, y)) // fold (or (shl (ext x), (ext y)), // (srl (ext x), (ext (sub 32, y)))) -> // (ext (rotr x, (sub 32, y))) pattern2 // fold (or (shl (ext x), (ext (sub 32, y))), // (srl (ext x), (ext y))) -> // (ext (rotl x, y)) // fold (or (shl (ext x), (ext (sub 32, y))), // (srl (ext x), (ext y))) -> // (ext (rotr x, (sub 32, y))) llvm-svn: 191905	2013-10-03 15:58:48 +00:00
Benjamin Kramer	d2757ba1be	CaptureTracking: Plug a loophole in the "too many uses" heuristic. The heuristic was added to avoid spending too much compile time A specially crafted test case (PR17461, PR16474) with many uses on a select or bitcast instruction can still trigger the slow case. Add a check for that case. This only affects compile time, don't have a good way to test it. llvm-svn: 191896	2013-10-03 13:24:02 +00:00
Elena Demikhovsky	85aeffaf5c	AVX-512: Fixed encoding of VMOVQ instruction. llvm-svn: 191889	2013-10-03 12:03:26 +00:00
Amara Emerson	52cfb6a99a	[ARM] Warn on deprecated IT blocks in v8 AArch32 assembly. Patch by Artyom Skrobov. llvm-svn: 191885	2013-10-03 09:31:51 +00:00
Alexey Samsonov	4436bf03e9	Remove wild .debug_aranges entries generated from unimportant labels r191052 added emitting .debug_aranges to Clang, but this functionality is broken: it uses all MC labels added in DWARF Asm printer, including the labels for build relocations between different DWARF sections, like .Lsection_line or .Ldebug_loc0. As a result, if any DIE .debug_info would contain "DW_AT_location=0x123" attribute, .debug_aranges would also contain a range starting from 0x123, breaking tools that rely on this section. This patch fixes this by using only MC labels that corresponds to the addresses in the user program. llvm-svn: 191884	2013-10-03 08:54:43 +00:00
Craig Topper	9eb8837ffa	Replace C++ style comment with a C style comment to satisfy some of the build bots. llvm-svn: 191880	2013-10-03 06:29:59 +00:00
Craig Topper	42e8a63e4f	Remove comma from the end of an enum. llvm-svn: 191877	2013-10-03 06:18:26 +00:00
Craig Topper	9e3e38ae3f	Add XOP disassembler support. Fixes PR13933. llvm-svn: 191874	2013-10-03 05:17:48 +00:00
Craig Topper	b01cd1aa74	Add patterns for selecting TBM instructions from logical operations. Patch from Yunzhong Gao. llvm-svn: 191871	2013-10-03 04:16:45 +00:00
Pete Cooper	d54381749d	Add v4f16 to supported value types. This is useful for some ARM intrinsics such as VCVTN which does a <4 x float> <-> <4 x half> conversion. llvm-svn: 191870	2013-10-03 03:29:21 +00:00
Quentin Colombet	c366504546	[llvm-c][Disassembler] When printing latency information, skip scheduling classes that are marked as Variant as those require an MI to pass to SubTargetInfo::resolveSchedClass. This is part of <rdar://problem/14687488>. llvm-svn: 191864	2013-10-02 23:11:47 +00:00
Matt Arsenault	0be1cb1c7b	Don't use runtime bounds check between address spaces. Don't vectorize with a runtime check if it requires a comparison between pointers with different address spaces. The values can't be assumed to be directly comparable. Previously it would create an illegal bitcast. llvm-svn: 191862	2013-10-02 22:38:17 +00:00
Quentin Colombet	5f09cb0dba	[llvm-c][Disassembler] Add an option to print latency information in disassembled output alongside the instructions. E.g., on a vector shuffle operation with a memory operand, disassembled outputs are: * Without the option: vpshufd $-0x79, (%rsp), %xmm0 * With the option: vpshufd $-0x79, (%rsp), %xmm0 ## Latency: 5 The printed latency is extracted from the schedule model available in the disassembler context. Thus, this option has no effect if there is not a scheduling model for the target. This boils down to one may need to specify the CPU string, so that this option could have an effect. Note: Latency < 2 are not printed. This part of <rdar://problem/14687488>. llvm-svn: 191859	2013-10-02 22:07:57 +00:00
Yi Jiang	8fd1a806d5	Apply slp vectorization on fully-vectorizable tree of height 2 llvm-svn: 191852	2013-10-02 20:20:39 +00:00
Matt Arsenault	39d592fe48	Fix debug printing spacing. Fix missing newlines, missing and extra spaces in printed messages. llvm-svn: 191851	2013-10-02 20:04:29 +00:00
Matt Arsenault	cccbe16785	Fix comment grammar and capitalization. llvm-svn: 191850	2013-10-02 20:04:26 +00:00
Benjamin Kramer	b9add84ef6	SLPVectorizer: Make store chain finding more aggressive with GetUnderlyingObject. This recursively strips all GEPs like the existing code. It also handles bitcasts and other operations that do not change the pointer value. llvm-svn: 191847	2013-10-02 19:06:06 +00:00
Tom Stellard	d3e916eb6a	StructurizeCFG: Add dependency on LowerSwitch pass Switch instructions were crashing the StructurizeCFG pass, and it's probably easier anyway if we don't need to handle them in this pass. Reviewed-by: Christian König <christian.koenig@amd.com> llvm-svn: 191841	2013-10-02 17:04:59 +00:00
Vincent Lejeune	6df39438af	R600: Add a ldptr intrinsic to support MSAA. llvm-svn: 191838	2013-10-02 16:00:33 +00:00
Chandler Carruth	ea56494625	Remove the very substantial, largely unmaintained legacy PGO infrastructure. This was essentially work toward PGO based on a design that had several flaws, partially dating from a time when LLVM had a different architecture, and with an effort to modernize it abandoned without being completed. Since then, it has bitrotted for several years further. The result is nearly unusable, and isn't helping any of the modern PGO efforts. Instead, it is getting in the way, adding confusion about PGO in LLVM and distracting everyone with maintenance on essentially dead code. Removing it paves the way for modern efforts around PGO. Among other effects, this removes the last of the runtime libraries from LLVM. Those are being developed in the separate 'compiler-rt' project now, with somewhat different licensing specifically more approriate for runtimes. llvm-svn: 191835	2013-10-02 15:42:23 +00:00
Alexey Samsonov	31540172d0	Remove "localize global" optimization Summary: As discussed in http://llvm-reviews.chandlerc.com/D1754, this optimization isn't really valid for C, and fires too rarely anyway. Reviewers: rafael, nicholas Reviewed By: nicholas CC: rnk, llvm-commits, nicholas Differential Revision: http://llvm-reviews.chandlerc.com/D1769 llvm-svn: 191834	2013-10-02 15:31:34 +00:00
Rafael Espindola	efa02d53ff	Fix option parsing in the gold plugin. This was broken when options were moved up in r191680. No test because this is specific LLVMgold.so/libLTO.so. Patch by Tom Roeder! llvm-svn: 191829	2013-10-02 14:36:23 +00:00
Rafael Espindola	3402c057db	Add Support For .bss Named Section Directive For Darwin Targets. Patch by Nicholas White. llvm-svn: 191824	2013-10-02 14:09:29 +00:00
Elena Demikhovsky	34586e7d41	AVX-512: fixed a bug in getLoadStoreRegOpcode() for AVX-512 target llvm-svn: 191818	2013-10-02 12:20:42 +00:00
Alexey Samsonov	15a2335db4	[DebugInfo] Further simplify DWARFDebugAranges public interface llvm-svn: 191813	2013-10-02 07:12:47 +00:00
Elena Demikhovsky	b30371cb6b	AVX-512: Added TB prefix to all instructions without prefixes, otherwise encoding fails after the last change in X86MCCodeEmitter.cpp. llvm-svn: 191812	2013-10-02 06:39:07 +00:00
Filip Pizlo	7aa695e026	This threads SectionName through the allocateCodeSection/allocateDataSection APIs, both in C++ and C land. It's useful for the memory managers that are allocating a section to know what the name of the section is. At a minimum, this is useful for low-level debugging - it's customary for JITs to be able to tell you what memory they allocated, and as part of any such dump, they should be able to tell you some meta-data about what each allocation is for. This allows clients that supply their own memory managers to do this. Additionally, we also envision the SectionName being useful for passing meta-data from within LLVM to an LLVM client. This changes both the C and C++ APIs, and all of the clients of those APIs within LLVM. I'm assuming that it's safe to change the C++ API because that API is allowed to change. I'm assuming that it's safe to change the C API because we haven't shipped the API in a release yet (LLVM 3.3 doesn't include the MCJIT memory management C API). llvm-svn: 191804	2013-10-02 00:59:25 +00:00
Manman Ren	9a0a67035e	Debug Info: In DIBuilder, the derived-from field of a DW_TAG_pointer_type is updated to use DITypeRef. Move isUnsignedDIType and getOriginalTypeSize from DebugInfo.h to be static helper functions in DwarfCompileUnit. We already have a static helper function "isTypeSigned" in DwarfCompileUnit, and a pointer to DwarfDebug is added to resolve the derived-from field. All three functions need to go across link for derived-from fields, so we need to get hold of a type identifier map. A pointer to DwarfDebug is also added to DbgVariable in order to resolve the derived-from field. Debug info verifier is updated to check a derived-from field is a TypeRef. Verifier will not go across link for derived-from fields, in debug info finder, we go across the link to add derived-from fields to types. Function getDICompositeType is only used by dragonegg and since dragonegg does not generate identifier for types, we use an empty map to resolve the derived-from field. When printing a derived-from field, we use DITypeRef::getName to either return the type identifier or getName of the DIType. A paired commit at clang is required due to changes to DIBuilder. llvm-svn: 191800	2013-10-01 23:45:54 +00:00
Quentin Colombet	93a98aac8b	[llvm-c][Disassembler] Add an option to reproduce in disassembled output the comments issued with verbose assembly. E.g., on a vector shuffle operation, disassembled output are: * Without the option: vpshufd $-0x79, (%rsp), %xmm0 * With the option: vpshufd $-0x79, (%rsp), %xmm0 ## xmm0 = mem[3,1,0,2] This part of <rdar://problem/14687488>. llvm-svn: 191799	2013-10-01 22:14:56 +00:00
Manman Ren	8990d7ee84	Debug Info: remove duplication of DIEs when a DIE is part of the type system and it is shared across CUs. We add a few maps in DwarfDebug to map MDNodes for the type system to the corresponding DIEs: MDTypeNodeToDieMap, MDSPNodeToDieMap, and MDStaticMemberNodeToDieMap. These DIEs can be shared across CUs, that is why we keep the maps in DwarfDebug instead of CompileUnit. Sometimes, when we try to add an attribute to a DIE, the DIE is not yet added to its owner yet, so we don't know whether we should use ref_addr or ref4. We create a worklist that will be processed during finalization to add attributes with the correct form (ref_addr or ref4). We add addDIEEntry to DwarfDebug to be a wrapper around DIE->addValue. It checks whether we know the correct form, if not, we update the worklist (DIEEntryWorklist). A testing case is added to show that we only create a single DIE for a type MDNode and we use ref_addr to refer to the type DIE. llvm-svn: 191792	2013-10-01 19:52:23 +00:00
Vincent Lejeune	a4da6fb535	R600: add a pass that merges clauses. llvm-svn: 191790	2013-10-01 19:32:58 +00:00
Vincent Lejeune	0b342d6f74	R600: Put PRED_X instruction in its own clause llvm-svn: 191789	2013-10-01 19:32:49 +00:00
Vincent Lejeune	269708b98d	R600: Enable -verify-machineinstrs in some tests. llvm-svn: 191788	2013-10-01 19:32:38 +00:00
Quentin Colombet	85f60ef633	[MC] When MCInstPrint::printAnnotation uses a comment stream, it has to ensure that each comment ends with a newline to match the definition in the header file. This is part of <rdar://problem/14687488>. llvm-svn: 191787	2013-10-01 19:21:24 +00:00
Matt Arsenault	517d84e268	Don't merge tiny functions. It's silly to merge functions like these: define void @foo(i32 %x) { ret void } define void @bar(i32 %x) { ret void } to get define void @bar(i32) { tail call void @foo(i32 %0) ret void } llvm-svn: 191786	2013-10-01 18:05:30 +00:00
Alexey Samsonov	ad4bf3db3a	[DebugInfo] Simplify and speedup .debug_aranges parsing Parsing .debug_aranges section now takes O(nlogn) operations instead of O(n^2), where "n" is the number of address ranges. With this change, the time required to symbolize an address from a random large Clang-generated binary drops from 165 seconds to 1.5 seconds. No functionality change. llvm-svn: 191781	2013-10-01 16:52:46 +00:00
Andrew Kaylor	89bdd103e5	Fixing MCJIT multiple module linking for OSX llvm-svn: 191780	2013-10-01 16:42:50 +00:00
Alexey Samsonov	97e8a87cfb	[DebugInfo] Further simplify DWARFDebugAranges. No functionality change. llvm-svn: 191779	2013-10-01 16:25:14 +00:00
Alexey Samsonov	0c9b72559c	[DebugInfo] Remove unused functions from DWARFDebugAranges and fix code style. llvm-svn: 191778	2013-10-01 15:48:10 +00:00
Richard Sandiford	b63e300b67	[SystemZ] Add comparisons of high words and memory llvm-svn: 191777	2013-10-01 15:00:44 +00:00
Richard Sandiford	a9ac0e0f75	[SystemZ] Add comparisons of large immediates using high words There are no corresponding patterns for small immediates because they would prevent the use of fused compare-and-branch instructions. llvm-svn: 191775	2013-10-01 14:56:23 +00:00
Richard Sandiford	42a694f44e	[SystemZ] Add immediate addition involving high words llvm-svn: 191774	2013-10-01 14:53:46 +00:00
Richard Sandiford	2cac763544	[SystemZ] Extend test-under-mask support to high GR32s llvm-svn: 191773	2013-10-01 14:41:52 +00:00
Richard Sandiford	3ad5a15b72	[SystemZ] Extend 32-bit RISBG optimizations to high words This involves using RISB[LH]G, whereas the equivalent z10 optimization uses RISBG. llvm-svn: 191770	2013-10-01 14:36:20 +00:00
Richard Sandiford	2896d044bd	[SystemZ] Extend pseudo conditional 8- and 16-bit stores to high words As the comment says, we always want to use STOC for 32-bit stores. llvm-svn: 191767	2013-10-01 14:33:55 +00:00
Tim Northover	d840745829	ARM: support interrupt attribute This function-attribute modifies the callee-saved register list and function epilogue (specifically the return instruction) so that a routine is suitable for use as an interrupt-handler of the specified type without disrupting user-mode applications. rdar://problem/14207019 llvm-svn: 191766	2013-10-01 14:33:28 +00:00
Richard Sandiford	f6377fba4c	[SystemZ] Optimize 32-bit FPR<->GPR moves for z196 and above Floats are stored in the high 32 bits of an FPR, and the only GPR<->FPR transfers are full-register transfers. This patch optimizes GPR<->FPR float transfers when the high word of a GPR is directly accessible. llvm-svn: 191764	2013-10-01 14:31:11 +00:00
Tareq A. Siraj	d88b9832c8	Add non-blocking Wait() for launched processes - New ProcessInfo class to encapsulate information about child processes. - Generalized the Wait() to support non-blocking wait on child processes. - ExecuteNoWait() now returns a ProcessInfo object with information about the launched child. Users will be able to use this object to perform non-blocking wait. - ExecuteNoWait() now accepts an ExecutionFailed param that tells if execution failed or not. These changes will allow users to implement basic process parallel tools. Differential Revision: http://llvm-reviews.chandlerc.com/D1728 llvm-svn: 191763	2013-10-01 14:28:18 +00:00
Richard Sandiford	7028428c2c	[SystemZ] Allow integer AND involving high words llvm-svn: 191762	2013-10-01 14:20:41 +00:00
Richard Sandiford	5718dacbdd	[SystemZ] Allow integer XOR involving high words llvm-svn: 191759	2013-10-01 14:08:44 +00:00
Rafael Espindola	44fee4e0eb	Remove several unused variables. Patch by Alp Toker. llvm-svn: 191757	2013-10-01 13:32:03 +00:00
Richard Sandiford	6e96ac600f	[SystemZ] Allow integer OR involving high words llvm-svn: 191755	2013-10-01 13:22:41 +00:00
Richard Sandiford	1a56931b22	[SystemZ] Allow integer insertions with a high-word destination llvm-svn: 191753	2013-10-01 13:18:56 +00:00
Richard Sandiford	7c5c0eabc9	[SystemZ] Allow selects with a high-word destination llvm-svn: 191751	2013-10-01 13:10:16 +00:00
Richard Sandiford	012402346f	[SystemZ] Add patterns to load a constant into a high word (IIHF) Similar to low words, we can use the shorter LLIHL and LLIHH if it turns out that the other half of the GR64 isn't live. llvm-svn: 191750	2013-10-01 13:02:28 +00:00
Joey Gouly	510de640c3	[ARM] Remove an unused function from the disassembler. Pointed out by Joerg. llvm-svn: 191749	2013-10-01 13:01:10 +00:00
Matheus Almeida	6de62d3966	Test commit. Updated comment. llvm-svn: 191748	2013-10-01 12:53:00 +00:00
Richard Sandiford	21235a256f	[SystemZ] Add register zero extensions involving at least one high word llvm-svn: 191746	2013-10-01 12:49:07 +00:00
Joey Gouly	ad98f1671d	[ARM] Introduce the 'sevl' instruction in ARMv8. This also removes the restriction on the immediate field of the 'hint' instruction. llvm-svn: 191744	2013-10-01 12:39:11 +00:00
Richard Sandiford	5469c39a26	[SystemZ] Add truncating high-word stores (STCH and STHH) llvm-svn: 191743	2013-10-01 12:22:49 +00:00
Richard Sandiford	0d46b1a30f	[SystemZ] Add zero-extending high-word loads (LLCH and LLHH) llvm-svn: 191742	2013-10-01 12:19:08 +00:00
Benjamin Kramer	58f1ced564	SCEVExpander: Fix a regression I introduced by to eagerly adding RAII objects. PR17425. llvm-svn: 191741	2013-10-01 12:17:11 +00:00
Richard Sandiford	89e160d975	[SystemZ] Add sign-extending high-word loads (LBH and LHH) llvm-svn: 191740	2013-10-01 12:11:47 +00:00
Richard Sandiford	0755c93b0c	[SystemZ] Use upper words of GR64s for codegen This just adds the basics necessary for allocating the upper words to virtual registers (move, load and store). The move support is parameterised in a way that makes it easy to handle zero extensions, but the associated zero-extend patterns are added by a later patch. The easiest way of testing this seemed to be add a new "h" register constraint for high words. I don't expect the constraint to be useful in real inline asms, but it should work, so I didn't try to hide it behind an option. llvm-svn: 191739	2013-10-01 11:26:28 +00:00
Richard Sandiford	a26a4b4f60	[SystemZ] Reapply: Add definitions of LFH and STFH Originally committed as r191661, but reverted because it changed the matching order of comparisons on some hosts. That should have been fixed by r191735. llvm-svn: 191738	2013-10-01 10:31:04 +00:00
Daniel Sanders	0210dd4b93	[mips][msa] Added support for matching mod_[us] from normal IR (i.e. not intrinsics) llvm-svn: 191737	2013-10-01 10:22:35 +00:00
Vladimir Medic	2b953d0b39	This patch adds aliases for Mips sub instruction with immediate operands. Corresponding test cases are added. llvm-svn: 191734	2013-10-01 09:48:56 +00:00
Elena Demikhovsky	3b75f5d282	AVX-512: Added X86vzmovl patterns llvm-svn: 191733	2013-10-01 08:38:02 +00:00
Craig Topper	766c934814	Remove 0 as a valid encoding for the m-mmmm field. llvm-svn: 191732	2013-10-01 07:10:28 +00:00
Craig Topper	8b278c5dc4	Remove unneeded fields from disassembler internal instruction format. llvm-svn: 191731	2013-10-01 06:56:57 +00:00
Craig Topper	3bf0317fec	BEXTR should be defined to take same type for bother operands. llvm-svn: 191728	2013-10-01 03:48:26 +00:00
Tom Stellard	6aada32dc4	SelectionDAG: Clarify comments from r191600 llvm-svn: 191724	2013-10-01 02:09:00 +00:00
Andrew Kaylor	ea395924d2	Adding multiple module support for MCJIT. Tests to follow. PIC with small code model and EH frame handling will not work with multiple modules. There are also some rough edges to be smoothed out for remote target support. llvm-svn: 191722	2013-10-01 01:47:35 +00:00
Eric Christopher	9a08f9e561	Add the DW_AT_GNU_ranges_base attribute if we've emitted any ranges into the debug_ranges section. llvm-svn: 191721	2013-10-01 00:43:36 +00:00
Eric Christopher	1d06eb5d86	Update comments. llvm-svn: 191720	2013-10-01 00:43:31 +00:00
Matt Arsenault	5ea37f8d89	Fix code duplication llvm-svn: 191716	2013-10-01 00:01:14 +00:00
Preston Gurd	f03a6e7fba	Forgot to add a break statement. llvm-svn: 191715	2013-09-30 23:51:22 +00:00
Matt Arsenault	a90a340fbb	Reuse variable llvm-svn: 191712	2013-09-30 23:31:50 +00:00
Preston Gurd	f0b6288cbf	The X86FixupLEAs pass for Intel Atom must not call convertToThreeAddress on ADD16rr opcodes, if src1 != src, since that would cause convertToThreeAddress to try to create a virtual register. This is not permitted after register allocation, which is when the X86FixupLEAs pass runs. This patch fixes PR16785. llvm-svn: 191711	2013-09-30 23:18:42 +00:00
Eric Christopher	39eebfada6	The DW_AT_GNU_pubnames/pubtypes attributes are actually form SEC_OFFSET from the beginning of the section so go ahead and emit a label at the beginning of each one. llvm-svn: 191710	2013-09-30 23:14:16 +00:00
Matt Arsenault	27e783e90d	Fix getOrInsertGlobal dropping the address space. Currently it will insert an illegal bitcast. Arguably, the address space argument should be added for the creation case. llvm-svn: 191702	2013-09-30 21:23:03 +00:00
Matt Arsenault	8468062c6e	Use right address space size in InstCombineCompares The test's output doesn't change, but this ensures this is actually hit with a different address space. llvm-svn: 191701	2013-09-30 21:11:01 +00:00
Matt Arsenault	06adecabe7	Constant fold ptrtoint + compare with address spaces llvm-svn: 191699	2013-09-30 21:06:18 +00:00
Manman Ren	aad5c3b81b	Debug Info: constify and rename from generateRef to getRef. No functionality change. llvm-svn: 191696	2013-09-30 19:42:10 +00:00
Anders Waldenborg	9515b31096	llvm-c: use typedef for function pointers This makes it consistent with other function pointers used in llvm-c Differential Revision: http://llvm-reviews.chandlerc.com/D1712 llvm-svn: 191693	2013-09-30 19:11:32 +00:00
Jack Carter	8ff70e3e26	[mips][msa] Direct Object Emission for I8 instructions. This patch adds Direct Object Emission support for I8 instructions: andi.b, bmnzi.b, bmzi.b, bseli.b, nori.b, ori.b, shf.{b,h,w} and xori.b. Patch by Matheus Almeida llvm-svn: 191688	2013-09-30 18:05:18 +00:00
Jack Carter	c3b25686b9	[mips][msa] Direct Object Emission for I5 instructions. This patch adds Direct Object Emission support for I5 instructions: addvi.{b,h,w,d}, ceqi.{b,h,w,d}, clei_s.{b,h,w,d}, clei_u.{b,h,w,d}, clti_s.{b,h,w,d}, clti_u.{b,h,w,d}, maxi_s.{b,h,w,d}, maxi_u.{b,h,w,d}, mini_s.{b,h,w,d}, mini_u.{b,h,w,d}, subvi.{b,h,w,d}. Patch by Matheus Almeida llvm-svn: 191687	2013-09-30 17:58:07 +00:00
Tilmann Scheller	be904775d2	[ARM] Clean up ARMAsmParser::validateInstruction(). Fix some LLVM Coding Standards violations. No changes in functionality. llvm-svn: 191686	2013-09-30 17:57:30 +00:00
Jack Carter	92e6e0f171	[mips][msa] Direct Object Emission for 2R instructions. This patch adds Direct Object Emission support for 2R instructions: nloc.{b,h,w}, nlzc.{b,h,w}, pcnt.{b,w,d}. Patch by Matheus Almeida llvm-svn: 191685	2013-09-30 17:52:33 +00:00
Jack Carter	6eed9cc6a8	[PATCH 1/4] [mips][msa] Source register of FILL instructions is GPR and not an MSA register Patch by Matheus Almeida llvm-svn: 191684	2013-09-30 17:43:04 +00:00
Rafael Espindola	0b385c77f7	Move command line options to the users of libLTO. Fixes --enable-shared build. Patch by Richard Sandiford. llvm-svn: 191680	2013-09-30 16:39:19 +00:00
Tilmann Scheller	255722beb8	[ARM] Assembler: ARM LDRD with writeback requires the base register to be different from the destination registers. See ARM ARM A8.8.72. Violating this constraint results in unpredictable behavior. llvm-svn: 191678	2013-09-30 16:11:48 +00:00
Arnold Schwaighofer	66eb921a82	Swift model: Fix uop description on some writes Those writes really need two/three uops. llvm-svn: 191677	2013-09-30 15:56:34 +00:00
Benjamin Kramer	f00472908a	BoundsChecking: Fix refacto. llvm-svn: 191676	2013-09-30 15:52:50 +00:00
Benjamin Kramer	6e931528fe	Convert manual insert point restores to the new RAII object. llvm-svn: 191675	2013-09-30 15:40:17 +00:00
Benjamin Kramer	6748576a0d	InstCombine: Replace manual fast math flag copying with the new IRBuilder RAII helper. Defines away the issue where cast<Instruction> would fail because constant folding happened. Also slightly cleaner. llvm-svn: 191674	2013-09-30 15:39:59 +00:00
Benjamin Kramer	d36f1abefd	IRBuilder: Add RAII objects to reset insertion points or fast math flags. Inspired by the object from the SLPVectorizer. This found a minor bug in the debug loc restoration in the vectorizer where the location of a following instruction was attached instead of the location from the original instruction. llvm-svn: 191673	2013-09-30 15:39:48 +00:00
Arnold Schwaighofer	d2f96b91ca	IfConverter: Use TargetSchedule for instruction latencies For targets that have instruction itineraries this means no change. Targets that move over to the new schedule model will use be able the new schedule module for instruction latencies in the if-converter (the logic is such that if there is no itineary we will use the new sched model for the latencies). Before, we queried "TTI->getInstructionLatency()" for the instruction latency and the extra prediction cost. Now, we query the TargetSchedule abstraction for the instruction latency and TargetInstrInfo for the extra predictation cost. The TargetSchedule abstraction will internally call "TTI->getInstructionLatency" if an itinerary exists, otherwise it will use the new schedule model. ATTENTION: Out of tree targets! (I will also send out an email later to LLVMDev) This means, if your target implements unsigned getInstrLatency(const InstrItineraryData ItinData, const MachineInstr MI, unsigned PredCost); and returns a value for "PredCost", you now also need to implement unsigned getPredictationCost(const MachineInstr MI); (if your target uses the IfConversion.cpp pass) radar://15077010 llvm-svn: 191671	2013-09-30 15:28:56 +00:00
Joey Gouly	d51a35c6a0	Fix a bug in InstCombine where it attempted to cast a Value* to an Instruction* when it was actually a Constant*. There are quite a few other casts to Instruction that might have the same problem, but this is the only one I have a test case for. llvm-svn: 191668	2013-09-30 14:18:35 +00:00
Richard Sandiford	a25f268c25	[SystemZ] Revert r191661: Add definitions of LFH and STFH For some reason, adding definitions for these load and store instructions changed whether some of the build bots matched comparisons as signed or unsigned. llvm-svn: 191663	2013-09-30 12:01:35 +00:00
Richard Sandiford	d30ac3a125	[SystemZ] Add definitions of LFH and STFH llvm-svn: 191661	2013-09-30 10:50:33 +00:00
Richard Sandiford	f9496060f6	[SystemZ] Add GRH32 for the high word of a GR64 The only thing this does on its own is make the definitions of RISB[HL]G a bit more precise. Those instructions are only used by the MC layer at the moment, so no behavioral change is intended. The class is needed by later patches though. llvm-svn: 191660	2013-09-30 10:45:16 +00:00
Richard Sandiford	87a4436456	[SystemZ] Rename subregs and add subreg_h32 Use subreg_hNN and subreg_lNN for the high and low NN bits of a register. List the low registers first, so that subreg_l32 also means the low 32 bits of a 128-bit register. Floats are stored in the upper 32 bits of a 64-bit register, so they should use subreg_h32 rather than subreg_l32. No behavioral change intended. llvm-svn: 191659	2013-09-30 10:28:35 +00:00
Richard Sandiford	ddec3e421b	[SystemZ] Add change missing from previous commit llvm-svn: 191656	2013-09-30 08:54:17 +00:00
Richard Sandiford	7789b0828a	[SystemZ] Rename 32-bit GPR registers I'm about to add support for high-word operations, so it seemed better for the low-word registers to have names like R0L rather than R0W. No behavioral change intended. llvm-svn: 191655	2013-09-30 08:48:38 +00:00
Craig Topper	ed59dd34fd	Various x86 disassembler fixes. Add VEX_LIG to scalar FMA4 instructions. Use VEX_LIG in some of the inheriting checks in disassembler table generator. Make use of VEX_L_W, VEX_L_W_XS, VEX_L_W_XD contexts. Don't let VEX_L_W, VEX_L_W_XS, VEX_L_W_XD, VEX_L_W_OPSIZE inherit from their non-L forms unless VEX_LIG is set. Let VEX_L_W, VEX_L_W_XS, VEX_L_W_XD, VEX_L_W_OPSIZE inherit from all of their non-L or non-W cases. Increase ranking on VEX_L_W, VEX_L_W_XS, VEX_L_W_XD, VEX_L_W_OPSIZE so they get chosen over non-L/non-W forms. llvm-svn: 191649	2013-09-30 02:46:36 +00:00
Benjamin Kramer	155c9d5d97	ObjectSizeOffsetEvaluator: Don't run into infinite recursion if we have a cyclic GEP. Those can occur in dead code. PR17402. llvm-svn: 191644	2013-09-29 19:39:13 +00:00
Benjamin Kramer	41fe88e7b4	Deallocate type units when destroying a DWARFContext. llvm-svn: 191637	2013-09-29 11:24:02 +00:00
Benjamin Kramer	c3c807b3bf	Allocate AtomicSDNode operands in SelectionDAG's allocator to stop leakage. SDNode destructors are never called. As an optimization use AtomicSDNode's internal storage if we have a small number of operands. llvm-svn: 191636	2013-09-29 11:18:56 +00:00
Craig Topper	3aef88b1c7	Change type of XOP flag in code emitters to a bool. Remove a some unneeded cases from switch. llvm-svn: 191632	2013-09-29 08:33:34 +00:00
Craig Topper	e75666f47a	Add comments for XOPA map introduced with TBM instructions.a llvm-svn: 191630	2013-09-29 06:31:18 +00:00
Robert Wilhelm	2788d3ec99	Even more spelling fixes for "instruction". llvm-svn: 191611	2013-09-28 13:42:22 +00:00
Robert Wilhelm	f0cfb83bb4	Fix spelling intruction -> instruction. llvm-svn: 191610	2013-09-28 11:46:15 +00:00
Tom Stellard	45015d9796	SelectionDAG: Silence unused variable warning on release builds llvm-svn: 191604	2013-09-28 03:10:17 +00:00
Tom Stellard	0351ea2010	R600: Fix handling of NAN in comparison instructions We were completely ignoring the unorder/ordered attributes of condition codes and also incorrectly lowering seto and setuo. Reviewed-by: Vincent Lejeune<vljn at ovi.com> llvm-svn: 191603	2013-09-28 02:50:50 +00:00
Tom Stellard	5694d3090a	SelectionDAG: Improve legalization of SELECT_CC with illegal condition codes SelectionDAG will now attempt to inverse an illegal conditon in order to find a legal one and if that doesn't work, it will attempt to swap the operands using the inverted condition. There are no new test cases for this, but a nubmer of the existing R600 tests hit this path. llvm-svn: 191602	2013-09-28 02:50:43 +00:00
Tom Stellard	cd42818d86	SelectionDAG: Try to expand all condition codes using getCCSwappedOperands() This is useful for targets like R600, which only support GT, GE, NE, and EQ condition codes as it removes the need to handle unsupported condition codes in target specific code. There are no tests with this commit, but R600 has been updated to take advantage of this new feature, so its existing selectcc tests are now testing the swapped operands path. llvm-svn: 191601	2013-09-28 02:50:38 +00:00
Tom Stellard	08690a146f	SelectionDAG: Clean up LegalizeSetCCCondCode() function Interpreting the results of this function is not very intuitive, so I cleaned it up to make it more clear whether or not a SETCC op was legalized and how it was legalized (either by swapping LHS and RHS or replacing with AND/OR). This patch does change functionality in the LHS and RHS swapping case, but unfortunately there are no in-tree tests for this. However, this patch is a prerequisite for R600 to take advantage of the LHS and RHS swapping, so tests will be added in subsequent commits. llvm-svn: 191600	2013-09-28 02:50:32 +00:00
NAKAMURA Takumi	3fddccfa43	MipsMachineFunction.cpp: Add missing #include <raw_ostream.h> llvm-svn: 191597	2013-09-28 01:35:07 +00:00
Matt Arsenault	5200fdf077	Fix typo llvm-svn: 191595	2013-09-28 01:08:00 +00:00
Manman Ren	209b17cdaa	AutoUpgrade: upgrade from scalar TBAA format to struct-path aware TBAA format. We treat TBAA tags as struct-path aware TBAA format when the first operand is a MDNode and the tag has 3 or more operands. llvm-svn: 191593	2013-09-28 00:22:27 +00:00
Akira Hatanaka	af4211ad94	[mips] Make sure loads from lazy-binding entries do not get CSE'd or hoisted out of loops. Previously, two consecutive calls to function "func" would result in the following sequence of instructions: 1. load $16, %got(func)($gp) // load address of lazy-binding stub. 2. move $25, $16 3. jalr $25 // jump to lazy-binding stub. 4. nop 5. move $25, $16 6. jalr $25 // jump to lazy-binding stub again. With this patch, the second call directly jumps to func's address, bypassing the lazy-binding resolution routine: 1. load $25, %got(func)($gp) // load address of lazy-binding stub. 2. jalr $25 // jump to lazy-binding stub. 3. nop 4. load $25, %got(func)($gp) // load resolved address of func. 5. jalr $25 // directly jump to func. llvm-svn: 191591	2013-09-28 00:12:32 +00:00
Manman Ren	f3a8c27e8d	TBAA: try to fix the dragonegg bots. llvm-svn: 191585	2013-09-27 22:59:21 +00:00
Eric Christopher	a51d3fc721	Unify conditionals and reformat. llvm-svn: 191582	2013-09-27 22:50:48 +00:00
Matt Arsenault	4c265906cc	Minor code simplification llvm-svn: 191579	2013-09-27 22:38:23 +00:00
Akira Hatanaka	e0657b2419	[mips] Define a derived class of PseudoSourceValue that represents a GOT entry resolved by lazy-binding. llvm-svn: 191578	2013-09-27 22:30:36 +00:00
Matt Arsenault	31cfc78f81	Use right pointer type in DebugIR llvm-svn: 191576	2013-09-27 22:26:25 +00:00
Matt Arsenault	fa25272db9	Use type helper functions llvm-svn: 191574	2013-09-27 22:18:51 +00:00
Eric Christopher	7857d489a9	Rework conditional for printing out pub sections. llvm-svn: 191571	2013-09-27 22:10:10 +00:00
Josh Magee	8ecfb52388	[stackprotector] Refactor the StackProtector pass from a single .cpp file into StackProtector.h and StackProtector.cpp. No functionality change. Future patches will add analysis which will be used in other passes (PEI, StackSlot). The end goal is to support ssp-strong stack layout rules. WIP. Differential Revision: http://llvm-reviews.chandlerc.com/D1521 llvm-svn: 191570	2013-09-27 21:58:43 +00:00
Rui Ueyama	bc654b18bc	Object/COFF: Rename getXXX{Begin,End} -> xxx_{begin,end}. It is mentioned in the LLVM coding standard that _begin() and _end() suffixes should be used. llvm-svn: 191569	2013-09-27 21:47:05 +00:00
Matt Arsenault	29f31735a2	Fix SLPVectorizer using wrong address space for load/store llvm-svn: 191564	2013-09-27 21:24:57 +00:00
Dmitri Gribenko	78fe2ba3ba	SourceMgr diagnotics printing: fix a bug where printing a fixit for a source range that includes a tab character will cause out-of-bounds access to the fixit string. llvm-svn: 191563	2013-09-27 21:24:36 +00:00
Dmitri Gribenko	8f944628ac	Make SourceMgr::PrintMessage() testable and add unit tests llvm-svn: 191558	2013-09-27 21:09:25 +00:00
Rui Ueyama	c2bed42904	Re-submit r191472 with a fix for big endian. llvm-objdump: Dump COFF import table if -private-headers option is given. llvm-svn: 191557	2013-09-27 21:04:00 +00:00
Justin Bogner	4a9ac8cd75	InstCombine: Only foldSelectICmpAndOr for integer types Currently foldSelectICmpAndOr asserts if the "or" involves a vector containing several of the same power of two. We can easily avoid this by only performing the fold on integer types, like foldSelectICmpAnd does. Fixes <rdar://problem/15012516> llvm-svn: 191552	2013-09-27 20:35:39 +00:00
Akira Hatanaka	d8f10ceb51	[mips] Rewrite MipsTargetLowering::getAddr functions as template functions. No intended functionality change. llvm-svn: 191546	2013-09-27 19:51:35 +00:00
Yunzhong Gao	b8bbcbfcc8	Adding intrinsics to the llvm backend for TBM instruction set. Phabricator code review is located here: http://llvm-reviews.chandlerc.com/D1750 llvm-svn: 191539	2013-09-27 18:38:42 +00:00
Manman Ren	0ed04fc9ab	TBAA: handle scalar TBAA format and struct-path aware TBAA format. Remove the command line argument "struct-path-tbaa" since we should not depend on command line argument to decide which format the IR file is using. Instead, we check the first operand of the tbaa tag node, if it is a MDNode, we treat it as struct-path aware TBAA format, otherwise, we treat it as scalar TBAA format. When clang starts to use struct-path aware TBAA format no matter whether struct-path-tbaa is no, and we can auto-upgrade existing bc files, the support for scalar TBAA format can be dropped. Existing testing cases are updated to use the struct-path aware TBAA format. llvm-svn: 191538	2013-09-27 18:34:27 +00:00
Justin Bogner	ca9bd8fac1	Transforms: Use getFirstNonPHI to set the insertion point for PHIs We were previously using getFirstInsertionPt to insert PHI instructions when vectorizing, but getFirstInsertionPt also skips past landingpads, causing this to generate invalid IR. We can avoid this issue by using getFirstNonPHI instead. llvm-svn: 191526	2013-09-27 15:30:25 +00:00
Richard Sandiford	067817ee05	[SystemZ] Rein back the use of block operations The backend tries to use block operations like MVC, NC, OC and XC for simple scalar operations. For correctness reasons, it rejects any case in which the regions might partially overlap. However, for performance reasons, it should also reject cases where the regions might be equal, since the instruction might then not use the fast path. This fixes a performance regression seen in bzip2. We may want to limit the optimisation even more in future, or even remove it entirely, but I'll try with this for now. llvm-svn: 191525	2013-09-27 15:29:20 +00:00
Richard Sandiford	54b369166f	[SystemZ] Improve handling of PC-relative addresses The backend previously folded offsets into PC-relative addresses whereever possible. That's the right thing to do when the address can be used directly in a PC-relative memory reference (using things like LRL). But if we have a register-based memory reference and need to load the PC-relative address separately, it's better to use an anchor point that could be shared with other accesses to the same area of the variable. Fixes a FIXME. llvm-svn: 191524	2013-09-27 15:14:04 +00:00
Daniel Sanders	6098b33515	[mips][msa] Implemented insert.d intrinsic. This intrinsic is lowered into an equivalent INSERT_VECTOR_ELT which is further lowered into a sequence of insert.w's on MIPS32. llvm-svn: 191521	2013-09-27 13:36:54 +00:00
Tilmann Scheller	1aebfa0a9b	ARM: Teach assembler to enforce constraints for ARM LDRD destination register operands. As specified in A8.8.72/A8.8.73/A8.8.74 in the ARM ARM, all variants of the ARM LDRD instruction have the following two constraints: LDRD<c> <Rt>, <Rt2>, ... (a) Rt must be even-numbered and not r14 (b) Rt2 must be R(t+1) If those two constraints are not met the result of executing the instruction will be unpredictable. Constraint (b) was already enforced, this commit adds support for constraint (a). Fixes rdar://14479793. llvm-svn: 191520	2013-09-27 13:28:17 +00:00
Daniel Sanders	c72593e69a	[mips][msa] Implemented fill.d intrinsic. This intrinsic is lowered into an equivalent BUILD_VECTOR which is further lowered into a sequence of insert.w's on MIPS32. llvm-svn: 191519	2013-09-27 13:20:41 +00:00
Daniel Sanders	7f3d946fb7	[mips][msa] Implemented copy_[us].d intrinsic. This intrinsic is lowered into equivalent copy_s.w instructions during legalization. llvm-svn: 191518	2013-09-27 13:04:21 +00:00
Daniel Sanders	51287b9355	[mips][msa] Rename arguments to MSA_INSERT_DESC_BASE to better match their expected values. No functional change. llvm-svn: 191517	2013-09-27 12:45:08 +00:00
Daniel Sanders	a515070eb3	[mips][msa] Implemented insert_vector_elt for v4f32 and v2f64. For v4f32 and v2f64, INSERT_VECTOR_ELT is matched by a pseudo-insn which is later expanded to appropriate insve.[wd] insns. llvm-svn: 191515	2013-09-27 12:31:32 +00:00
Daniel Sanders	39bb8ba023	[mips][msa] Implemented extract_vector_elt for v4f32 or v2f64 For v4f32 and v2f64, EXTRACT_VECTOR_ELT is matched by a pseudo-insn which may be expanded to subregister copies and/or instructions as appropriate. llvm-svn: 191514	2013-09-27 12:17:32 +00:00
Daniel Sanders	9ea9ff2da7	[mips][msa] Added support for MSA registers to copyPhysReg llvm-svn: 191512	2013-09-27 12:03:51 +00:00
Daniel Sanders	7e51fe19d5	[mips][msa] Added support for matching splati from normal IR (i.e. not intrinsics) Updated some of the vshf since they (correctly) emit splati's now llvm-svn: 191511	2013-09-27 11:48:57 +00:00
Andrea Di Biagio	56ce9c4e78	Re-apply the change from r191393 with fix for pr17380. This change fixes the problem reported in pr17380 and re-add the dagcombine transformation ensuring that the value types are always legal if the transformation is triggered after Legalization took place. Added the test case from pr17380. llvm-svn: 191509	2013-09-27 11:37:05 +00:00
Daniel Sanders	928920ab29	[mips][msa] Added MSA.txt to describe instruction selection quirks. This file contains notes about the instruction selection for MSA. For example, it notes that ilvl.d is cannot be selected because ilvev.d covers the same cases and is selected instead of ilvl.d. llvm-svn: 191507	2013-09-27 10:42:22 +00:00
Tilmann Scheller	041f717680	Fix comment. llvm-svn: 191505	2013-09-27 10:38:11 +00:00
Tilmann Scheller	88c8f16558	ARM: Teach assembler to enforce constraint for Thumb2 LDRD (literal/immediate) destination register operands. LDRD<c> <Rt>, <Rt2>, <label> LDRD<c> <Rt>, <Rt2>, [<Rn>{, #+/-<imm>}] LDRD<c> <Rt>, <Rt2>, [<Rn>], #+/-<imm> LDRD<c> <Rt>, <Rt2>, [<Rn>, #+/-<imm>]! As specified in A8.8.72/A8.8.73 in the ARM ARM, the T1 encoding has a constraint which enforces that Rt != Rt2. If this constraint is not met the result of executing the instruction will be unpredictable. Fixes rdar://14479780. llvm-svn: 191504	2013-09-27 10:30:18 +00:00
Daniel Sanders	84e7caf741	[mips][msa] Tidy up lowerMSABinaryIntr, lowerMSABinaryImmIntr, lowerMSABranchIntr, and lowerMSAUnaryIntr were trivially small functions. Inlined them into their callers. lowerMSASplat now takes its callers SDLoc instead of making a new one. No functional change. llvm-svn: 191503	2013-09-27 10:25:41 +00:00
Daniel Sanders	1b1e25b7c5	[mips][msa] MSA requires FR=1 mode (64-bit FPU register file). Report fatal error when using it in FR=0 mode. llvm-svn: 191498	2013-09-27 10:08:31 +00:00
Daniel Sanders	36c671e2c7	[mips][msa] Expand all truncstores and loadexts for MSA as well as DSP llvm-svn: 191496	2013-09-27 09:44:59 +00:00
Daniel Sanders	f4f1a872ca	[mips][msa] Added missing check in performSRACombine Reviewers: jacksprat, dsanders Reviewed By: dsanders Differential Revision: http://llvm-reviews.chandlerc.com/D1755 llvm-svn: 191495	2013-09-27 09:25:29 +00:00
Puyan Lotfi	74e38de492	First check in. Modified a comment. llvm-svn: 191491	2013-09-27 07:36:10 +00:00
Craig Topper	dbe8b7d236	Put HasAVX512 predicate on some patterns to properly disable them when AVX512 isn't enabled. Currently it works simply because the SSE and AVX version of the same patterns are checked first in the DAG isel table. llvm-svn: 191490	2013-09-27 07:20:47 +00:00
Craig Topper	8f14de8f32	Switch HasAVX to UseAVX in one spot to ensure that AVX512 form of VINSERTPS is used in AVX512 mode. llvm-svn: 191489	2013-09-27 07:16:24 +00:00
Craig Topper	c6a1aac735	Removal some duplicate patterns. llvm-svn: 191488	2013-09-27 07:11:17 +00:00
Yunzhong Gao	4467f33e3c	Fixing Intel format of the vshufpd instruction. Phabricator code review is located at: http://llvm-reviews.chandlerc.com/D1759 llvm-svn: 191481	2013-09-27 01:44:23 +00:00
Rui Ueyama	333d28a0bb	Revert "llvm-objdump: Dump COFF import table if -private-headers option is given." This reverts commit r191472 because it's failing on BE machine. llvm-svn: 191480	2013-09-27 01:29:36 +00:00
Rui Ueyama	5b1adbaad9	llvm-objdump: Dump COFF import table if -private-headers option is given. This is a patch to add capability to llvm-objdump to dump COFF Import Table entries, so that we can write tests for LLD checking Import Table contents. llvm-objdump did not print anything but just file name if the format is COFF and -private-headers option is given. This is a patch adds capability for dumping DLL Import Table, which is specific to the COFF format. In this patch I defined a new iterator to iterate over import table entries. Also added a few functions to COFFObjectFile.cpp to access fields of the entry. Differential Revision: http://llvm-reviews.chandlerc.com/D1719 llvm-svn: 191472	2013-09-27 00:07:01 +00:00
Adrian Prantl	6ac40036f1	MCParser/Debug info: Accept line number 0 as a legitimate value, since CFE produces it to indicate artificial locations. c.f.: DWARF standard, Table 6.2: line -- An unsigned integer indicating a source line number. Lines are numbered beginning at 1. The compiler may emit the value 0 in cases where an instruction cannot be attributed to any source line. llvm-svn: 191471	2013-09-26 23:37:11 +00:00
Jack Carter	cb8b40b08d	[mips][msa] Direct Object Emission for 3RF instructions. Patch by Matheus Almeida llvm-svn: 191461	2013-09-26 21:31:43 +00:00
Jack Carter	142ec8283d	[mips][msa] Updates encoding of 3RF instructions to match the latest revision of the MSA spec (1.06). This does not affect any of the existing output. Patch by Matheus Almeida llvm-svn: 191460	2013-09-26 21:18:57 +00:00
Weiming Zhao	286304a317	Fix PR 17372: Emitting PLD for stack address for ARM Thumb2 t2PLDi12, t2PLDi8, t2PLDs was omitted in Thumb2InstrInfo. This patch fixes it. llvm-svn: 191441	2013-09-26 17:25:10 +00:00
Bill Schmidt	cea1596205	[PowerPC] Fix PR17354: Generate nop after local calls for PIC code. When generating code for shared libraries, even local calls may be intercepted, so we need a nop after the call for the linker to fix up the TOC. Test case adapted from the one provided in PR17354. llvm-svn: 191440	2013-09-26 17:09:28 +00:00
Andrea Di Biagio	549d6605a0	Revert r191393 since it caused pr17380. llvm-svn: 191438	2013-09-26 16:54:01 +00:00
Venkatraman Govindaraju	4c0cdd734c	[Sparc] Implements exception handling in SPARC with DwarfCFI. llvm-svn: 191432	2013-09-26 15:11:00 +00:00
Venkatraman Govindaraju	3816d43a9a	Implements parsing and emitting of .cfi_window_save in MC. llvm-svn: 191431	2013-09-26 14:49:40 +00:00
Amara Emerson	b4ad2f396a	[ARM] Use the load-acquire/store-release instructions optimally in AArch32. Patch by Artyom Skrobov. llvm-svn: 191428	2013-09-26 12:22:36 +00:00
David Majnemer	7137420d94	PPC: Allow partial fills in writeNopData() When asked to pad an irregular number of bytes, we should fill with zeros. This is consistent with the behavior specified in the AIX Assembler Language Reference as well as other LLVM and binutils assemblers. N.B. There is a small deviation from binutils' PPC assembler: when handling pads which are greater than 4 bytes but not mod 4, binutils will not emit any NOP sequences at all and only use zeros. This may or may not be a bug but there is no excellent rationale as to why that behavior is important to emulate. If that behavior is needed, we can change writeNopData() to behave in the same way. This fixes PR17352. llvm-svn: 191426	2013-09-26 09:18:48 +00:00
Andrew Trick	71e8bb6d1d	Added temp flag -misched-bench for staging in default changes. llvm-svn: 191423	2013-09-26 05:53:35 +00:00
Andrew Trick	6f5aad7a24	whitespace llvm-svn: 191422	2013-09-26 05:53:31 +00:00
David Majnemer	08249a31b2	PPC: Do not introduce ISD nodes for fctid and fctiw llvm-svn: 191421	2013-09-26 05:22:11 +00:00
David Majnemer	6ad26d3364	PPC: Add support for fctid and fctiw Encodings were checked against the Power ISA documents and double checked against binutils. This fixes PR17350. llvm-svn: 191419	2013-09-26 04:11:24 +00:00
Jack Carter	3eb663b037	[mips][msa] Direct Object Emission for 3R instructions. This is the first set of instructions with a ".b" modifier thus we need to add the required code to disassemble a MSA128B register class. Patch by Matheus Almeida llvm-svn: 191415	2013-09-26 00:09:46 +00:00
Jack Carter	77551abef4	[mips][msa] Updates encoding of 3R instructions to match the latest revision of the MSA spec (1.06). Internal changes only. Patch by Matheus Almeida llvm-svn: 191414	2013-09-26 00:02:44 +00:00
Jack Carter	3381298227	[mips][msa] Direct Object Emission for 2RF instructions. Patch by Matheus Almeida llvm-svn: 191413	2013-09-25 23:56:25 +00:00
Jack Carter	5dc8ac92b9	[mips][msa] Direct Object Emission support for the MSA instruction set. In more detail, this patch adds the ability to parse, encode and decode MSA registers ($w0-$w31). The format of 2RF instructions (MipsMSAInstrFormat.td) was updated so that we could attach a test case to this patch i.e., the test case parses, encodes and decodes 2 MSA instructions. Following patches will add the remainder of the instructions. Note that DecodeMSA128BRegisterClass is missing from MipsDisassembler.td because it's not yet required at this stage and having it would cause a compiler warning (unused function). Patch by Matheus Almeida llvm-svn: 191412	2013-09-25 23:50:44 +00:00
Jack Carter	56c681eb7f	[mips][msa] Updates encoding of 2RF instructions to match the latest revision of the MSA spec (1.06). This only changes internal encodings and doesn't affect output. Patch by Matheus Almeida llvm-svn: 191411	2013-09-25 23:42:03 +00:00
Weiming Zhao	2052f4843b	Fix PR 17368: disable vector mul distribution for square of add/sub for ARM Generally, it is desirable to distribute (a + b) * c to ac + bc for ARM with VMLx forwarding, where a, b and c are vectors. However, for (a + b)(a + b), distribution will result in one extra instruction. With distribution: x = a + b (add) y = a x (mul) z = y + b * y (mla) Without distribution: x = a + b (add) z = x * x (mul) This patch checks if a mul is a square of add/sub. If yes, skip distribution. llvm-svn: 191410	2013-09-25 23:12:06 +00:00
Eric Christopher	4c7e6ba7d3	Dump the normal dwarf pubtypes section as well. llvm-svn: 191408	2013-09-25 23:02:41 +00:00
Eric Christopher	0de5359e20	Unify pubsection/gnu pubsection printing. llvm-svn: 191407	2013-09-25 23:02:36 +00:00
Eric Christopher	a88fd7fdb6	Slight formatting change for pubnames/pubtypes output. llvm-svn: 191401	2013-09-25 21:17:37 +00:00
Reed Kotler	a6ce797f05	Fix a bad typo in the inline assembly code for mips16 pic fp stubs and make one cosmetic cleanup to make it look the same as gcc in this area; adjusting test cases. llvm-svn: 191400	2013-09-25 20:58:50 +00:00
Andrea Di Biagio	9f3313109f	Teach DAGCombiner how to canonicalize dags according to the rule (shl (zext (shr A, X)), X) => (zext (shl (shr A, X), X)). The rule only triggers when there are no other uses of the zext to avoid materializing more instructions. This helps the DAGCombiner understand that the shl/shr sequence can then be converted into an and instruction. llvm-svn: 191393	2013-09-25 19:01:01 +00:00
Andrew Trick	b6854d80e3	Mark the x86 machine model as incomplete. PR17367. Ideally, the machinel model is added at the time the instructions are defined. But many instructions in X86InstrSSE.td still need a model. Without this workaround the scheduler asserts because x86 already has itinerary classes for these instructions, indicating they should be modeled by the scheduler. Since we use the new machine model for other instructions, it expects a new machine model for these too. llvm-svn: 191391	2013-09-25 18:14:12 +00:00
Arnold Schwaighofer	07520324f5	SLPVectorize: Put horizontal reductions feeding a store under separate flag Put them under a separate flag for experimentation. They are more likely to interfere with loop vectorization which happens later in the pass pipeline. llvm-svn: 191371	2013-09-25 14:02:32 +00:00
Richard Sandiford	652784e29a	[SystemZ] Define the GR64 low-word logic instructions as pseudo aliases. Another patch to avoid duplication of encoding information. Things like NILF, NILL and NILH are used as both 32-bit and 64-bit instructions. Here the 64-bit versions are defined as aliases of the 32-bit ones. llvm-svn: 191369	2013-09-25 11:11:53 +00:00
David Majnemer	0c58bc64a4	MC: Add support for treating $ as a reference to the PC The binutils assembler supports a mode called DOLLAR_DOT which treats the dollar sign token as a reference to the current program counter if the dollar sign doesn't precede a constant or identifier. This commit adds a new MCAsmInfo flag stating whether or not a given target supports this interpretation of the dollar sign token; by default, this flag is not enabled. Further, enable this flag for PPC. The system assembler for AIX and binutils both support using the dollar sign in this manner. This fixes PR17353. llvm-svn: 191368	2013-09-25 10:47:21 +00:00
Richard Sandiford	f348f831d5	[SystemZ] Define the call instructions as pseudo aliases. Similar to r191364, but for calls. This patch also removes the shortening of BRASL to BRAS within a TU. Doing that was a bit controversial internally, since there's a strong expectation with the z assembler that WYWIWYG. llvm-svn: 191366	2013-09-25 10:37:17 +00:00
Richard Sandiford	6cbd7f0c5d	[SystemZ] Use subregs for 64-bit truncating stores Another patch to reduce the duplication of encoding information. Rather than define separate patterns for truncating 64-bit stores, use the 32-bit stores with a subreg. No behavioral changed intended. llvm-svn: 191365	2013-09-25 10:29:47 +00:00
Richard Sandiford	9ab97cd147	[SystemZ] Define the return instruction as a pseudo alias of BR This is the first of a few patches to reduce the dupliation of encoding information. The return instruction is a normal BR in which one of the registers is fixed. llvm-svn: 191364	2013-09-25 10:20:08 +00:00
Richard Sandiford	35ec4e356c	[SystemZ] Add instruction-shortening pass When loading immediates into a GR32, the port prefered LHI, followed by LLILH or LLILL, followed by IILF. LHI and IILF are natural 32-bit operations, but LLILH and LLILL also clear the upper 32 bits of the register. This was represented as taking a 32-bit subreg of a 64-bit assignment. Using subregs for something as simple as a move immediate was probably a bad idea. Also, I have patches to add support for the high-word facility, and we don't want something like LLILH and LLILL to stop the high word of the same GPR from being used. This patch therefore uses LHI and IILF to begin with and adds a late machine-specific pass to use LLILH and LLILL if the other half of the register is not live. The high-word patches extend this behavior to IIHF, LLIHL and LLIHH. No behavioral change intended. llvm-svn: 191363	2013-09-25 10:11:07 +00:00
David Majnemer	1ccd2f2aee	MC: Remove vestigial PCSymbol field from AsmInfo llvm-svn: 191362	2013-09-25 09:36:11 +00:00
Evgeniy Stepanov	32be0340f5	[msan] Fix -Wreturn-type warnings in non-self-hosted build. llvm-svn: 191361	2013-09-25 08:56:00 +00:00
Akira Hatanaka	7d92b346a7	Revert r191350. llvm-svn: 191353	2013-09-25 00:52:34 +00:00
Akira Hatanaka	f215e077bb	[mips] Move public functions to the beginning of the class definition. No intended functionality change. llvm-svn: 191352	2013-09-25 00:34:42 +00:00
Akira Hatanaka	30f97cfa82	[mips] Define getTargetNode as a template function. No intended functionality change. llvm-svn: 191350	2013-09-25 00:30:25 +00:00
Quentin Colombet	fa403ab3fb	[PR16882] Ignore noreturn definitions when setting isPhysRegUsed. PEI inserts a save/restore sequence for the link register, according to the information it gets from the MachineRegisterInfo. MachineRegisterInfo is populated by the VirtRegMap pass. This pass was not aware of noreturn calls and was registering the definitions of these calls the same way as regular operations. Modify VirtRegPass so that it does not set the isPhysRegUsed information for registers only defined by noreturn calls. The rational is that a noreturn call is the "last instruction" of the program (if it returns the behavior is undefined), so everything that is defined by it cannot be used and will not interfere with anything else. Therefore, it is pointless to account for then. llvm-svn: 191349	2013-09-25 00:26:17 +00:00
Andrew Trick	d24698c8ef	CriticalAntiDepBreaker is no longer needed for armv7 scheduling. This is being disabled because it is no longer needed for performance. It is only used by postRAscheduler which is also planned for removal, and it is implemented with an out-dated view of register liveness. It consideres aliases instead of register units, assumes valid kill flags, and assumes implicit uses on partial register defs. Kill flags and implicit operands are error prone and impossible to verify. We should gradually eliminate dependence on them in the postRA phases. Targets that still benefit from this should move to the MI scheduler. If that doesn't solve the problem, then we should add a hook to regalloc to optimize reload placement. llvm-svn: 191348	2013-09-25 00:26:16 +00:00
Jim Grosbach	aff6a0caa2	MachO: Improve backend diagnostic for overalignment. Give the symbol's name and disengage the enchanced crash reporting. llvm-svn: 191344	2013-09-24 23:56:31 +00:00
Peter Collingbourne	4ccf0f1bef	Move LTO support library to a component, allowing it to be tested more reliably across platforms. Patch by Tom Roeder! llvm-svn: 191343	2013-09-24 23:52:22 +00:00
Eli Friedman	a961d694e2	Add missing check to SETCC optimization. PR17338. llvm-svn: 191337	2013-09-24 22:50:14 +00:00
David Blaikie	b9c7f6aef4	llvm-dwarfdump: add missing opening quotation mark lost in r191330 llvm-svn: 191333	2013-09-24 20:23:36 +00:00
David Blaikie	ea4ca1a099	llvm-dwarfdump: re-add field formatting for the entry kind lost in r191329 CR feedback from Eric Christopher llvm-svn: 191330	2013-09-24 19:56:27 +00:00
David Blaikie	ecd21fff61	llvm-dwarfdump support for gnu_pubtypes llvm-svn: 191329	2013-09-24 19:50:00 +00:00
Yunzhong Gao	dd36e9387b	Adding a feature flag to the llvm backend for x86 TBM instruction set. Adding TBM feature to bdver2 processor; piledriver supports this instruction set according to the following document: http://developer.amd.com/wordpress/media/2012/10/New-Bulldozer-and-Piledriver-Instructions.pdf Phabricator code review is located here: http://llvm-reviews.chandlerc.com/D1692 llvm-svn: 191324	2013-09-24 18:21:52 +00:00
Benjamin Kramer	01df817a33	MemoryBuiltins: Remove posix_memalign from the list and replace it with a TODO. This code isn't ready to deal with allocation functions where the return is not the allocated pointer. The checks below will reject posix_memalign anyways. llvm-svn: 191319	2013-09-24 17:49:08 +00:00
Roman Divacky	e33098f5cb	Make the size and expr arguments of .fill directive optional. llvm-svn: 191318	2013-09-24 17:44:41 +00:00
Benjamin Kramer	2939dd3d11	MemoryBuiltins: Reinstate optimizing (uninitialized) loads from operator new. llvm-svn: 191315	2013-09-24 17:34:29 +00:00
Yi Jiang	edf2d9179e	set the cost of tiny trees to INT_MAX in SLP vectorizer to disable vectorization on them llvm-svn: 191314	2013-09-24 17:26:43 +00:00
Benjamin Kramer	4d4df04353	MemoryBuiltins: Fix operator new bits. We really don't want to optimize malloc return value checks away. llvm-svn: 191313	2013-09-24 17:15:14 +00:00
Andrew Trick	dc4c1adfc7	Comment typo. llvm-svn: 191312	2013-09-24 17:11:19 +00:00
Benjamin Kramer	fd4777c046	Teach MemoryBuiltins and InstructionSimplify that operator new never returns NULL. This is safe per C++11 18.6.1.1p3: [operator new returns] a non-null pointer to suitably aligned storage (3.7.4), or else throw a bad_alloc exception. This requirement is binding on a replacement version of this function. Brings us a tiny bit closer to eliminating more vector push_backs. llvm-svn: 191310	2013-09-24 16:37:51 +00:00
Benjamin Kramer	30d249a1b3	Push analysis passes to InstSimplify when they're around anyways. llvm-svn: 191309	2013-09-24 16:37:40 +00:00
Daniel Sanders	fae5f2a9c9	[mips][msa] Added support for matching pckev, and pckod from normal IR (i.e. not intrinsics) llvm-svn: 191306	2013-09-24 14:53:25 +00:00
Daniel Sanders	2ed228b29b	[mips][msa] Added support for matching ilv[lr], ilvod, and ilvev from normal IR (i.e. not intrinsics) llvm-svn: 191304	2013-09-24 14:36:12 +00:00
Benjamin Kramer	64bdb29a83	DAGCombiner: Unify rotate matching for extended and unextended amounts. No functionality change, lots of indentation changes. llvm-svn: 191303	2013-09-24 14:21:28 +00:00
Daniel Sanders	2630718ae8	[mips][msa] Added support for matching shf from normal IR (i.e. not intrinsics) llvm-svn: 191302	2013-09-24 14:20:00 +00:00
Daniel Sanders	e508704b52	[mips][msa] Added support for matching vshf from normal IR (i.e. not intrinsics) llvm-svn: 191301	2013-09-24 14:02:15 +00:00
Daniel Sanders	f49dd82e06	[mips][msa] Remove the VSPLAT and VSPLATD nodes in favour of matching BUILD_VECTOR. Most constant BUILD_VECTOR's are matched using ComplexPatterns which cover bitcasted as well as normal vectors. However, it doesn't seem to be possible to match ldi.[bhwd] in a type-agnostic manner (e.g. to support the widest range of immediates, it should be possible to use ldi.b to load v2i64) using TableGen so ldi.[bhwd] is matched using custom code in MipsSEISelDAGToDAG.cpp This made the majority of the constant splat BUILD_VECTOR lowering redundant. The only transformation remaining for constant splats is when an (up-to) 32-bit constant splat is possible but the value does not fit into a 10-bit signed integer. In this case, the BUILD_VECTOR is transformed into a bitcasted BUILD_VECTOR so that fill.[bhw] can be used to splat the vector from a GPR32 register (which is initialized using the usual lui/addui sequence). There are no additional tests since this is a re-implementation of previous functionality. The change is intended to make it easier to implement some of the upcoming instruction selection patches since they can rely on existing support for BUILD_VECTOR's in the DAGCombiner. compare_float.ll changed slightly because a BITCAST is no longer introduced during legalization. llvm-svn: 191299	2013-09-24 13:33:07 +00:00
Daniel Sanders	f86622ba13	[mips][msa] Non-constant BUILD_VECTOR's should be expanded to INSERT_VECTOR_ELT instead of memory operations. The resulting code is the same length, but doesnt cause memory traffic or latency. llvm-svn: 191297	2013-09-24 13:16:15 +00:00
Daniel Sanders	4f3ff1b9e8	[mips][msa] Added partial support for matching fmax_a from normal IR (i.e. not intrinsics) This covers the case where fmax_a can be used to implement ISD::FABS. llvm-svn: 191296	2013-09-24 13:02:08 +00:00
Daniel Sanders	5a942e6fd0	[mips][msa] Line wrapping. No functional change. llvm-svn: 191295	2013-09-24 12:45:36 +00:00
Daniel Sanders	bfc39cedf2	[mips][msa] Added support for matching andi, ori, nori, and xori from normal IR (i.e. not intrinsics) llvm-svn: 191293	2013-09-24 12:32:47 +00:00
Daniel Sanders	3ce56622c8	[mips][msa] Added support for matching max, maxi, min, mini from normal IR (i.e. not intrinsics) llvm-svn: 191291	2013-09-24 12:18:31 +00:00
Daniel Sanders	e1d2435543	[mips][msa] Added support for matching bsel and bseli from normal IR (i.e. not intrinsics) This required correcting the definition of the bsel and bseli intrinsics. llvm-svn: 191290	2013-09-24 12:04:44 +00:00
Evgeniy Stepanov	5522a70674	[msan] Handling of atomic load/store, atomic rmw, cmpxchg. llvm-svn: 191287	2013-09-24 11:20:27 +00:00
Daniel Sanders	fd538dc745	[mips][msa] Added support for matching comparisons from normal IR (i.e. not intrinsics) MIPS SelectionDAG changes: * Added VCEQ, VCL[ET]_[SU] nodes to represent vector comparisons that produce a bitmask. llvm-svn: 191286	2013-09-24 10:46:19 +00:00
Daniel Sanders	cba1922915	[mips][msa] Added support for matching slli, srai, and srli from normal IR (i.e. not intrinsics) llvm-svn: 191285	2013-09-24 10:28:18 +00:00
Bill Wendling	c63c30c9a2	Followup to r191252. Make sure that the code that handles the constant addresses is run for the GEPs. This just refactors that code and then calls it for the GEPs that are collected during the iteration. <rdar://problem/12445434> llvm-svn: 191281	2013-09-24 07:19:30 +00:00
NAKAMURA Takumi	cc57dd58e0	DWARFTypeUnit::dump(): Use PRIx64 to format uint64_t. llvm-svn: 191266	2013-09-24 03:23:07 +00:00
Jiangning Liu	63dc840fc5	Initial support for Neon scalar instructions. Patch by Ana Pazos. 1.Added support for v1ix and v1fx types. 2.Added Scalar Pairwise Reduce instructions. 3.Added initial implementation of Scalar Arithmetic instructions. llvm-svn: 191263	2013-09-24 02:47:27 +00:00
Michael Gottesman	5e3600c1ce	[stackprotector] Allow for copies from vreg -> vreg to be in a terminator sequence. Sometimes a copy from a vreg -> vreg sneaks into the middle of a terminator sequence. It is safe to slice this into the stack protector success bb. This fixes PR16979. llvm-svn: 191260	2013-09-24 01:50:26 +00:00
Eli Friedman	410393a15d	Misc fixes for cpp backend. PR17317. llvm-svn: 191258	2013-09-24 00:36:09 +00:00
Eric Christopher	55364d71d0	Add namespaces to the list of items that we expose via pubnames. llvm-svn: 191257	2013-09-24 00:17:57 +00:00
Eric Christopher	0840a5a69c	Format the index entry kind string to align. llvm-svn: 191255	2013-09-24 00:17:49 +00:00
Bill Wendling	585a901a12	Selecting the address from a very long chain of GEPs can blow the stack. The recursive nature of the address selection code can cause the stack to explode if there is a long chain of GEPs. Convert the recursive bit into a iterative method to avoid this. <rdar://problem/12445434> llvm-svn: 191252	2013-09-24 00:13:08 +00:00
David Blaikie	427e435b21	Comments for r191234 as suggested by Eric Christopher. llvm-svn: 191244	2013-09-23 23:39:55 +00:00
Eric Christopher	6d0f1e683a	Add more external types to the pubtypes table. Expand the asm checking patch until we get full dumping support. llvm-svn: 191239	2013-09-23 23:15:58 +00:00
David Blaikie	8de5e98f6a	Unbreak the build (from r191233)since we're calling printf. llvm-svn: 191238	2013-09-23 23:15:57 +00:00
Eric Christopher	ccac5c4bf9	Rename IsStatic variable to Linkage in order to be a bit more descriptive. llvm-svn: 191236	2013-09-23 22:59:14 +00:00
Eric Christopher	b0fc0b9a7b	Formatting. llvm-svn: 191235	2013-09-23 22:59:11 +00:00
David Blaikie	03c089cf97	llvm-dwarfdump/libDebugInfo support for type units llvm-svn: 191234	2013-09-23 22:44:47 +00:00
David Blaikie	07e22449a3	Exract most of DWARFCompileUnit into a new DWARFUnit to prepare for the coming DWARFTypeUnit. llvm-svn: 191233	2013-09-23 22:44:40 +00:00
Reed Kotler	e883f501bb	Make nomips16 mask not repeat if it ends with a '.'. This mask is purely for debugging and testing. llvm-svn: 191231	2013-09-23 22:36:11 +00:00
Bill Wendling	8faa30ef4b	Reformat code with clang-format. llvm-svn: 191226	2013-09-23 20:57:47 +00:00
Eric Christopher	261d234302	Handle gnu pubtypes sections: a) Make sure we are emitting the correct section in our section labels when we begin the module. b) Make sure we are emitting the correct pubtypes section in the presence of gnu pubtypes. c) For C++ struct, union, class, and enumeration types are default external. llvm-svn: 191225	2013-09-23 20:55:35 +00:00
Kay Tiong Khoo	9195a5b081	fix typo: than -> then llvm-svn: 191214	2013-09-23 18:43:51 +00:00
Richard Mitton	089ed89e76	Fixed debug_aranges handling for common symbols. The size of common symbols is now tracked correctly, so they can be listed in the arange section without needing knowledge of other following symbols. .comm (and .lcomm) do not indicate to the system assembler any particular section to use, so we have to treat them as having no section. Test case update to account for this. llvm-svn: 191210	2013-09-23 17:56:20 +00:00
David Blaikie	1b5ee5d9f1	DebugInfo: Wrap section data and relocs together for dwarf dumping support This is a small step that may enable some simplifications in producer (DWARFContext) and consumer (DWARFCompileUnit and other places) by making a more complete abstraction around the data and relocations for a section. Small initial steps could include simple changes such as passing the pair to DWARFCompileUnit's ctor rather than passing the data and relocs separately. I don't intend to pursue any such changes immediately, however. The motivation for doing this now is that type unit dumping will need to deal with these data+reloc pairs moreso than the existing dumping support has needed to associate the data as type unit sections are named the same (debug_types) and comdat group folded. So to implement dumping and reloc handling we'll need a mapping of section->data+relocs. llvm-svn: 191209	2013-09-23 17:42:01 +00:00
Arnold Schwaighofer	22639407d7	Revert "LoopVectorizer: Only allow vectorization of intrinsics." Revert 191122 - with extra checks we are allowed to vectorize math library function calls. Standard library indentifiers are reserved names so functions with external linkage must not overrided them. However, functions with internal linkage can. Therefore, we can vectorize calls to math library functions with a check for external linkage and matching signature. This matches what we do during SelectionDAG building. llvm-svn: 191206	2013-09-23 14:54:39 +00:00
Daniel Sanders	86d0c8d751	[mips][msa] Added support for matching addvi, and subvi from normal IR (i.e. not intrinsics) llvm-svn: 191203	2013-09-23 14:29:55 +00:00
Amara Emerson	330afb54d3	[ARM] Split A/R class into separate subtarget features. Patch by Bradley Smith. llvm-svn: 191202	2013-09-23 14:26:15 +00:00
Benjamin Kramer	942dfe625b	InstSimplify: Fold equality comparisons between non-inbounds GEPs. Overflow doesn't affect the correctness of equalities. Computing this is cheap, we just reuse the computation for the inbounds case and try to peel of more non-inbounds GEPs. This pattern is unlikely to ever appear in code generated by Clang, but SCEV occasionally produces it. llvm-svn: 191200	2013-09-23 14:16:38 +00:00
Daniel Sanders	a4c8f3a7b0	[mips][msa] Added support for matching insert and copy from normal IR (i.e. not intrinsics) Changes to MIPS SelectionDAG: * Added nodes VEXTRACT_[SZ]EXT_ELT to represent extract and extend in a single operation and implemented the DAG combines necessary to fold sign/zero extends into the extract. llvm-svn: 191199	2013-09-23 14:03:12 +00:00
Daniel Sanders	766cb697a8	[mips][msa] Added support for matching pcnt from normal IR (i.e. not intrinsics) llvm-svn: 191198	2013-09-23 13:40:21 +00:00
Daniel Sanders	f7456c78f0	[mips][msa] Added support for matching nor from normal IR (i.e. not intrinsics) llvm-svn: 191195	2013-09-23 13:22:24 +00:00
Daniel Sanders	8ca81e484e	[mips][msa] Added support for matching and, or, and xor from normal IR (i.e. not intrinsics) llvm-svn: 191194	2013-09-23 12:57:42 +00:00
Daniel Sanders	5f38701d5a	Partially revert r191192: Fix -Wunused-variable error when assertions are disabled and -Werror is in use. An unrelated change crept in because 'svn revert' isn't recursive by default. The unrelated changes have been reverted. llvm-svn: 191193	2013-09-23 12:33:38 +00:00
Daniel Sanders	3253de49c9	Fix -Wunused-variable error when assertions are disabled and -Werror is in use. llvm-svn: 191192	2013-09-23 12:26:55 +00:00
Daniel Sanders	7a289d0e39	[mips][msa] Implemented build_vector using ldi, fill, and custom SelectionDAG nodes (VSPLAT and VSPLATD) Note: There's a later patch on my branch that re-implements this to select build_vector without the custom SelectionDAG nodes. The future patch avoids the constant-folding problems stemming from the custom node (i.e. it doesn't need to re-implement all the DAG combines related to BUILD_VECTOR). Changes to MIPS specific SelectionDAG nodes: * Added VSPLAT This is a special case of BUILD_VECTOR that covers the case the BUILD_VECTOR is a splat operation. * Added VSPLATD This is a special case of VSPLAT that handles the cases when v2i64 is legal llvm-svn: 191191	2013-09-23 12:02:46 +00:00
Venkatraman Govindaraju	94629eb861	[Sparc] Use correct instruction pattern for CMPri. llvm-svn: 191180	2013-09-22 18:54:54 +00:00
David Blaikie	4f099cb2ba	Remove dead code llvm-svn: 191179	2013-09-22 18:25:32 +00:00
David Blaikie	ba860d74ba	StringRef-ize some things llvm-svn: 191178	2013-09-22 17:01:50 +00:00
Benjamin Kramer	8817cca5ce	Provide basic type safety for array_pod_sort comparators. This makes using array_pod_sort significantly safer. The implementation relies on function pointer casting but that should be safe as we're dealing with void* here. llvm-svn: 191175	2013-09-22 14:09:50 +00:00
Benjamin Kramer	5626259506	Drop spurious handle in comment. llvm-svn: 191172	2013-09-22 11:24:58 +00:00
Venkatraman Govindaraju	51270837aa	[Sparc] Make SPARC instructions' encoding well defined such that TableGen can automatically generate code emitter. llvm-svn: 191168	2013-09-22 09:54:42 +00:00
Venkatraman Govindaraju	709d154d69	[Sparc] Clean up MOVcc instructions so that TableGen can encode them correctly. No functionality change intended. llvm-svn: 191167	2013-09-22 09:18:26 +00:00
Venkatraman Govindaraju	2fb440fbad	[Sparc] Clean up branch instructions, so that TableGen can encode branch conditions as well. No functionality change intended. llvm-svn: 191166	2013-09-22 08:51:55 +00:00
Tim Northover	31d093c705	ISelDAG: spot chain cycles involving MachineNodes Previously, the DAGISel function WalkChainUsers was spotting that it had entered already-selected territory by whether a node was a MachineNode (amongst other things). Since it's fairly common practice to insert MachineNodes during ISelLowering, this was not the correct check. Looking around, it seems that other nodes get their NodeId set to -1 upon selection, so this makes sure the same thing happens to all MachineNodes and uses that characteristic to determine whether we should stop looking for a loop during selection. This should fix PR15840. llvm-svn: 191165	2013-09-22 08:21:56 +00:00
Venkatraman Govindaraju	cb1dca602c	[Sparc] Add support for TLS in sparc. llvm-svn: 191164	2013-09-22 06:48:52 +00:00
David Majnemer	7b1cdb980b	X86: Use R_X86_64_TPOFF64 for FK_Data_8 Summary: LLVM would crash when trying to come up with a relocation type for assembly like: movabsq $V@TPOFF, %rax Instead, we say the relocation type is R_X86_64_TPOFF64. Fixes PR17274. Reviewers: dblaikie, nrieck, rafael CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D1717 llvm-svn: 191163	2013-09-22 05:30:16 +00:00
Venkatraman Govindaraju	7e7eb8ce69	[SPARC] Make functions with GLOBAL_OFFSET_TABLE access as non-leaf functions. llvm-svn: 191160	2013-09-22 01:40:24 +00:00
Venkatraman Govindaraju	e9ef51222b	[Sparc] Emit .register directive to declare the use of global registers %g2, %g4, %g6 and %g7. llvm-svn: 191158	2013-09-22 00:42:30 +00:00
Hal Finkel	25415c2b83	Correct the pre-increment load latencies in the PPC A2 itinerary Pre-increment loads are microcoded on the A2, and the address increment occurs only after the load completes. As a result, the latency of the GPR address update is an additional 2 cycles on top of the load latency. llvm-svn: 191156	2013-09-22 00:08:14 +00:00
Venkatraman Govindaraju	829aec5900	[Sparc] Fix lowering FABS on fp128 (long double) on pre-v9 targets. llvm-svn: 191154	2013-09-21 23:51:08 +00:00
Benjamin Kramer	90901a35ce	SROA: Handle casts involving vectors of pointers and integer scalars. SROA wants to convert any types of equivalent widths but it's not possible to convert vectors of pointers to an integer scalar with a single cast. As a workaround we add a bitcast to the corresponding int ptr type first. This type of cast used to be an edge case but has become common with SLP vectorization. Fixes PR17271. llvm-svn: 191143	2013-09-21 20:36:04 +00:00
Juergen Ributzka	f043a65327	Revert "SelectionDAG: Teach the legalizer to split SETCC if VSELECT needs splitting too." This reverts commit r191130. llvm-svn: 191138	2013-09-21 15:09:46 +00:00
Craig Topper	58f6e64e07	Remove alignment restrictions from FMA load folding. llvm-svn: 191136	2013-09-21 05:58:59 +00:00
Arnold Schwaighofer	d743feef81	SLPVectorizer: Fix multiline comment warning llvm-svn: 191135	2013-09-21 05:37:30 +00:00
David Majnemer	f90c3b5a1c	ELF: Parse types in directives like binutils gas Allow binutils .type and .section directives to take the following forms: - @<type> - %<type> - "<type>" llvm-svn: 191134	2013-09-21 05:25:12 +00:00
Juergen Ributzka	c2551eb4ff	Fix the buildbot llvm-svn: 191133	2013-09-21 05:15:01 +00:00
Juergen Ributzka	ab930591c7	[X86] Emulate AVX 256bit MIN/MAX support by splitting the vector. In AVX 256bit vectors are valid vectors and therefore the Type Legalizer doesn't split the VSELECT and SETCC nodes. AVX only supports MIN/MAX on 128bit vectors and this fix enables vector splitting for this special case in the X86 DAG Combiner. This fix is related to PR16695, PR17002, and <rdar://problem/14594431>. llvm-svn: 191131	2013-09-21 04:55:22 +00:00
Juergen Ributzka	e9a80fc912	SelectionDAG: Teach the legalizer to split SETCC if VSELECT needs splitting too. The Type Legalizer recognizes that VSELECT needs to be split, because the type is to wide for the given target. The same does not always apply to SETCC, because less space is required to encode the result of a comparison. As a result VSELECT is split and SETCC is unrolled into scalar comparisons. This commit fixes the issue by checking for VSELECT-SETCC patterns in the DAG Combiner. If a matching pattern is found, then the result mask of SETCC is promoted to the expected vector mask for the given target. This mask has usually te same size as the VSELECT return type (except for Intel KNL). Now the type legalizer will split both VSELECT and SETCC. This allows the following X86 DAG Combine code to sucessfully detect the MIN/MAX pattern. This fixes PR16695, PR17002, and <rdar://problem/14594431>. llvm-svn: 191130	2013-09-21 04:55:18 +00:00
NAKAMURA Takumi	68fa6f9d36	Initialize BSSSection explicitly in InitMachOMCObjectFileInfo() to appease msvc. This can revert r191087. llvm-svn: 191128	2013-09-21 02:34:45 +00:00
Reed Kotler	78fb291e62	Set .reorder for the stub so that gas takes care of delay slot processing. llvm-svn: 191125	2013-09-21 01:37:52 +00:00

... 5 6 7 8 9 ...

64709 Commits