llvm-project

Commit Graph

Author	SHA1	Message	Date
Matthias Braun	c1988f384c	LiveIntervalAnalysis: Mark subregister defs as undef when we determined they are only reading a dead superregister value This was not necessary before as this case can only be detected when the liveness analysis is at subregister level. llvm-svn: 226733	2015-01-21 22:55:13 +00:00
Chris Bieneman	9e13af7ac3	Adding a new cl::HideUnrelatedOptions API to allow clang to migrate off cl::getRegisteredOptions. Summary: cl::getRegisteredOptions really exposes some of the innards of how command line parsing is implemented. Exposing new APIs that allow us to disentangle client code from implementation details will allow us to make more extensive changes to command line parsing. Reviewers: chandlerc, dexonsmith, beanz Reviewed By: dexonsmith Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7100 llvm-svn: 226729	2015-01-21 22:45:52 +00:00
Simon Pilgrim	b16b09b154	[X86][SSE] Added support for SSE3 lane duplication shuffle instructions This patch adds shuffle matching for the SSE3 MOVDDUP, MOVSLDUP and MOVSHDUP instructions. The big use of these being that they avoid many single source shuffles from needing to use (pre-AVX) dual source instructions such as SHUFPD/SHUFPS: causing extra moves and preventing load folds. Adding these instructions uncovered an issue in XFormVExtractWithShuffleIntoLoad which crashed on single operand shuffle instructions (now fixed). It also involved fixing getTargetShuffleMask to correctly identify theses instructions as unary shuffles. Also adds a missing tablegen pattern for MOVDDUP. Differential Revision: http://reviews.llvm.org/D7042 llvm-svn: 226716	2015-01-21 22:44:35 +00:00
Jonathan Roelofs	229eb4ca5c	Fix load-store optimizer on thumbv4t Thumbv4t does not have lo->lo copies other than MOVS, and that can't be predicated. So emit MOVS when needed and bail if there's a predicate. http://reviews.llvm.org/D6592 llvm-svn: 226711	2015-01-21 22:39:43 +00:00
David Majnemer	4c0a6e918a	InstCombine: Don't strip bitcasts off of callsites marked 'thunk' The return type of a thunk is meaningless, we just want the arguments and return value to be forwarded. llvm-svn: 226708	2015-01-21 22:32:04 +00:00
Simon Pilgrim	47af023ada	[X86][SSE] movddup shuffle mask decodes Patch to provide shuffle decodes and asm comments for the SSE3/AVX1 movddup double duplication instructions. llvm-svn: 226705	2015-01-21 22:02:30 +00:00
Matthias Braun	311730ac78	LiveIntervalAnalysis: Factor out code to update liveness on vreg def removal This cleans up code and is more in line with the general philosophy of modifying LiveIntervals through LiveIntervalAnalysis instead of changing them directly. This also fixes a case where SplitEditor::removeBackCopies() would miss the subregister ranges. llvm-svn: 226690	2015-01-21 19:02:30 +00:00
Matthias Braun	cfb8ad29b5	LiveIntervalAnalysis: Factor out code to update liveness on physreg def removal This cleans up code and is more in line with the general philosophy of modifying LiveIntervals through LiveIntervalAnalysis instead of changing them directly. llvm-svn: 226687	2015-01-21 18:50:21 +00:00
Matthias Braun	1002baf7b9	LiveIntervalAnalysis: Remove unused pruneValue() variant. llvm-svn: 226686	2015-01-21 18:45:57 +00:00
Adrian Prantl	1292e24d0e	Let subprograms with instructions without parent scopes fail the verification. Tested via a unit test. Follow-up to r226616. llvm-svn: 226684	2015-01-21 18:32:56 +00:00
Matt Arsenault	b00554886f	R600/SI: Custom lower fround This fixes it for SI. It also removes the pattern used previously for Evergreen for f32. I'm not sure if the the new R600 output is better or not, but it uses 1 fewer instructions if BFI is available. llvm-svn: 226682	2015-01-21 18:18:25 +00:00
Colin LeMahieu	94269db8ba	[Hexagon] Converting multiply and accumulate with immediate intrinsics to patterns. llvm-svn: 226681	2015-01-21 18:13:15 +00:00
Ahmed Bougacha	8f09e9f7c5	[X86] Declare SSE4.1/AVX2 vector extloads covered by PMOV[SZ]X legal. Now that we can fully specify extload legality, we can declare them legal for the PMOVSX/PMOVZX instructions. This for instance enables a DAGCombine to fire on code such as (and (<zextload-equivalent> ...), <redundant mask>) to turn it into: (zextload ...) as seen in the testcase changes. There is one regression, in widen_load-2.ll: we're no longer able to do store-to-load forwarding with illegal extload memory types. This will be addressed separately. Differential Revision: http://reviews.llvm.org/D6533 llvm-svn: 226676	2015-01-21 17:07:06 +00:00
George Burgess IV	3c898c2119	Fixed a bug with how we determine bitset indices. llvm-svn: 226671	2015-01-21 16:37:21 +00:00
Yaron Keren	3f02c14cc7	Add missing include guards to WindowsSupport.h. llvm-svn: 226669	2015-01-21 16:20:38 +00:00
Tim Northover	cf3d80fedb	Revert "DAGCombine: fold (or (and X, M), (and X, N)) -> (and X, (or M, N))" It hadn't gone through review yet, but was still on my local copy. This reverts commit r226663 llvm-svn: 226665	2015-01-21 15:48:52 +00:00
Tim Northover	b9184f2b1a	AArch64: add backend option to reserve x18 (platform register) AAPCS64 says that it's up to the platform to specify whether x18 is reserved, and a first step on that way is to add a flag controlling it. From: Andrew Turner <andrew@fubar.geek.nz> llvm-svn: 226664	2015-01-21 15:43:31 +00:00
Tim Northover	85cd2791c9	DAGCombine: fold (or (and X, M), (and X, N)) -> (and X, (or M, N)) llvm-svn: 226663	2015-01-21 15:43:28 +00:00
Michael Kuperstein	ada9fa1ca9	[x32] Fast ISel should use LEA64_32r instead of LEA32r to adjust addresses in x32 mode. llvm-svn: 226661	2015-01-21 14:44:05 +00:00
Evgeniy Stepanov	79ca0fd1a0	[msan] Update origin for the entire destination range on memory store. Previously we always stored 4 bytes of origin at the destination address even for 8-byte (and longer) stores. This should fix rare missing, or incorrect, origin stacks in MSan reports. llvm-svn: 226658	2015-01-21 13:21:31 +00:00
Jozef Kolek	5cfebdde2b	[mips][microMIPS] MicroMIPS 16-bit unconditional branch instruction B Implement microMIPS 16-bit unconditional branch instruction B. Implemented 16-bit microMIPS unconditional instruction has real name B16, and B is an alias which expands to either B16 or BEQ according to the rules: b 256 --> b16 256 # R_MICROMIPS_PC10_S1 b 12256 --> beq $zero, $zero, 12256 # R_MICROMIPS_PC16_S1 b label --> beq $zero, $zero, label # R_MICROMIPS_PC16_S1 Differential Revision: http://reviews.llvm.org/D3514 llvm-svn: 226657	2015-01-21 12:39:30 +00:00
Jozef Kolek	2c6d73207e	[mips][microMIPS] Implement ADDIUPC instruction Differential Revision: http://reviews.llvm.org/D6582 llvm-svn: 226656	2015-01-21 12:10:11 +00:00
Chandler Carruth	df5747a900	[PM] Refactor the InstCombiner interface to use an external worklist. Because in its primary function pass the combiner is run repeatedly over the same function until doing so produces no changes, it is essentially to not re-allocate the worklist. However, as a utility, the more common pattern would be to put a limited set of instructions in the worklist rather than the entire function body. That is also the more likely pattern when used by the new pass manager. The result is a very light weight combiner that does the visiting with a separable worklist. This can then be wrapped up in a helper function for users that want a combiner utility, or as I have here it can be wrapped up in a pass which manages the iterations used when combining an entire function's instructions. Hopefully this removes some of the worst of the interface warts that became apparant with the last patch here. However, there is clearly more work. I've again left some FIXMEs for the most egregious. The ones that stick out to me are the exposure of the worklist and IR builder as public members, and the use of pointers rather than references. However, fixing these is likely to be much more mechanical and less interesting so I didn't want to touch them in this patch. llvm-svn: 226655	2015-01-21 11:38:17 +00:00
Chandler Carruth	ba4c5179a0	[PM] Simplify (ha! ha!) the way that instcombine calls the SimplifyLibCalls utility by sinking it into the specific call part of the combiner. This will avoid us needing to do any contortions to build this object in a subsequent refactoring I'm doing and seems generally better factored. We don't need this utility everywhere and it carries no interesting state so we might as well build it on demand. llvm-svn: 226654	2015-01-21 11:23:40 +00:00
Vladimir Medic	435cf8a415	[Mips][Disassembler]When disassembler meets load/store from coprocessor 2 instructions for mips r6 it crashes as the access to operands array is out of range. This patch adds dedicated decoder method that properly handles decoding of these instructions. llvm-svn: 226652	2015-01-21 10:47:36 +00:00
Craig Topper	42b326ea12	[x86] Remove some unnecessary and slightly confusing typecasts from some patterns. I think it actually went i32->iPtr->i32 in some of these cases. llvm-svn: 226647	2015-01-21 08:43:57 +00:00
Craig Topper	7ff6ab30a9	[X86] Convert all the i8imm used by AVX512 and MMX instructions to u8imm. llvm-svn: 226646	2015-01-21 08:43:49 +00:00
Craig Topper	620b50cc23	[X86] Convert all the i8imm used by SSE and AVX instructions to u8imm. This makes the assembler check their size and removes a hack from the disassembler to avoid sign extending the immediate. llvm-svn: 226645	2015-01-21 08:15:54 +00:00
Craig Topper	f38dea1cfa	[x86] Add assembly parser bounds checking to the immediate value for cmpss/cmpsd/cmpps/cmppd. llvm-svn: 226642	2015-01-21 06:07:53 +00:00
Chandler Carruth	9280382ac6	[PM] Replace an abuse of inheritance to override a single function with a more direct approach: a type-erased glorified function pointer. Now we can pass a function pointer into this for the easy case and we can even pass a lambda into it in the interesting case in the instruction combiner. I'll be using this shortly to simplify the interfaces to InstCombiner, but this helps pave the way and seems like a better design for the libcall simplifier utility. llvm-svn: 226640	2015-01-21 02:11:59 +00:00
Adrian Prantl	34bcbeed03	Make DIExpression::Verify() stricter by checking that the number of elements and the ordering is sane and cleanup the accessors. llvm-svn: 226627	2015-01-21 00:59:20 +00:00
Chandler Carruth	1edb9d63e9	[PM] Separate the InstCombiner from its pass. This creates a small internal pass which runs the InstCombiner over a function. This is the hard part of porting InstCombine to the new pass manager, as at this point none of the code in InstCombine has access to a Pass object any longer. The resulting interface for the InstCombiner is pretty terrible. I'm not planning on leaving it that way. The key thing missing is that we need to separate the worklist from the combiner a touch more. Once that's done, it should be possible for any part of LLVM to just create a worklist with instructions, populate it, and then combine it until empty. The pass will just be the (obvious and important) special case of doing that for an entire function body. For now, this is the first increment of factoring to make all of this work. llvm-svn: 226618	2015-01-20 22:44:35 +00:00
Adrian Prantl	de200dfad2	DebugLocs without a scope should fail the verification. Follow-up to r226588. llvm-svn: 226616	2015-01-20 22:37:25 +00:00
Chandler Carruth	b3d03df3ac	[PM] Reformat this code with clang-format so that subsequent changes don't get muddied up by formatting changes. Some of these don't really seem like improvements to me, but they also don't seem any worse and I care much more about not formatting them manually than I do about the particular formatting. =] llvm-svn: 226610	2015-01-20 21:10:35 +00:00
Colin LeMahieu	988c68f2a7	[Hexagon] Adding intrinsics for doubleword ALU operations. llvm-svn: 226606	2015-01-20 20:45:05 +00:00
Daniel Jasper	6b77455f81	Prevent binary-tree deterioration in sparse switch statements. This addresses part of llvm.org/PR22262. Specifically, it prevents considering the densities of sub-ranges that have fewer than TLI.getMinimumJumpTableEntries() elements. Those densities won't help jump tables. This is not a complete solution but works around the most pressing issue. Review: http://reviews.llvm.org/D7070 llvm-svn: 226600	2015-01-20 19:43:33 +00:00
Ramkumar Ramachandra	be10ece5ed	[GC] Verify-pass void vararg functions in gc.statepoint With the appropriate Verifier changes, exactracting the result out of a statepoint wrapping a vararg function crashes. However, a void vararg function works fine: commit this first step. Differential Revision: http://reviews.llvm.org/D7071 llvm-svn: 226599	2015-01-20 19:42:46 +00:00
Adrian Prantl	565cc18d8f	Reapply: Teach SROA how to update debug info for fragmented variables. This reapplies r225379. ChangeLog: - The assertion that this commit previously ran into about the inability to handle indirect variables has since been removed and the backend can handle this now. - Testcases were upgrade to the new MDLocation format. - Instead of keeping a DebugDeclares map, we now use llvm::FindAllocaDbgDeclare(). Original commit message follows. Debug info: Teach SROA how to update debug info for fragmented variables. This allows us to generate debug info for extremely advanced code such as typedef struct { long int a; int b;} S; int foo(S s) { return s.b; } which at -O1 on x86_64 is codegen'd into define i32 @foo(i64 %s.coerce0, i32 %s.coerce1) #0 { ret i32 %s.coerce1, !dbg !24 } with this patch we emit the following debug info for this TAG_formal_parameter [3] AT_location( 0x00000000 0x0000000000000000 - 0x0000000000000006: rdi, piece 0x00000008, rsi, piece 0x00000004 0x0000000000000006 - 0x0000000000000008: rdi, piece 0x00000008, rax, piece 0x00000004 ) AT_name( "s" ) AT_decl_file( "/Volumes/Data/llvm/_build.ninja.release/test.c" ) Thanks to chandlerc, dblaikie, and echristo for their feedback on all previous iterations of this patch! llvm-svn: 226598	2015-01-20 19:42:22 +00:00
Tom Stellard	e99fb65d87	R600/SI: Add subtarget feature to enable VGPR spilling for all shader types This is disabled by default, but can be enabled with the subtarget feature: 'vgpr-spilling' llvm-svn: 226597	2015-01-20 19:33:04 +00:00
Tom Stellard	021053f500	R600/SI: Fix simple-loop.ll test llvm-svn: 226596	2015-01-20 19:33:02 +00:00
Jozef Kolek	0d49117769	Reverted revision 226577. llvm-svn: 226595	2015-01-20 19:29:28 +00:00
Chandler Carruth	3a62216a8a	[PM] Clean up a bunch of the doxygen / API docs on the InstCombiner pass prior to refactoring it. llvm-svn: 226594	2015-01-20 19:27:58 +00:00
Manman Ren	dab999d54f	[llvm link] Destroy ConstantArrays in LLVMContext if they are not used. ConstantArrays constructed during linking can cause quadratic memory explosion. An example is the ConstantArrays constructed when linking in GlobalVariables with appending linkage. Releasing all unused constants can cause a 20% LTO compile-time slowdown for a large application. So this commit releases unused ConstantArrays only. rdar://19040716. It reduces memory footprint from 20+G to 6+G. llvm-svn: 226592	2015-01-20 19:24:59 +00:00
Tom Stellard	3a70d07f51	R600/SI: Remove stray debugging code from r226586 llvm-svn: 226591	2015-01-20 19:24:31 +00:00
Adrian Prantl	f88b2c8c74	Add an assertion and prefer a crash over an infinite loop. llvm-svn: 226588	2015-01-20 18:03:37 +00:00
Tom Stellard	95292bbfcd	R600/SI: Use external symbols for scratch buffer We were passing the scratch buffer address to the shaders via user sgprs, but now we use external symbols and have the driver patch the shader using reloc information. llvm-svn: 226586	2015-01-20 17:49:47 +00:00
Tom Stellard	8255af45cb	R600/SI: Add kill flag when copying scratch offset to a register This allows us to re-use the same register for the scratch offset when accessing large private arrays. llvm-svn: 226585	2015-01-20 17:49:45 +00:00
Tom Stellard	8058069529	R600/SI: Don't store scratch buffer frame index in MUBUF offset field We don't have a good way of legalizing this if the frame index offset is more than the 12-bits, which is size of MUBUF's offset field, so now we store the frame index in the vaddr field. llvm-svn: 226584	2015-01-20 17:49:43 +00:00
Tom Stellard	1106b1c662	R600/SI: Update SIInstrInfo:verifyInstruction() after r225662 Now that we have our own custom register operand types, we need to handle them in the verifiier. llvm-svn: 226583	2015-01-20 17:49:41 +00:00
Aaron Ballman	6fa2141dca	Silencing a -Wunused-variable warning in non-asserts builds; NFC. llvm-svn: 226581	2015-01-20 17:10:45 +00:00
Jozef Kolek	45f7f9c1ab	[mips][microMIPS] MicroMIPS 16-bit unconditional branch instruction B Implement microMIPS 16-bit unconditional branch instruction B. Implemented 16-bit microMIPS unconditional instruction has real name B16, and B is an alias which expands to either B16 or BEQ according to the rules: b 256 --> b16 256 # R_MICROMIPS_PC10_S1 b 12256 --> beq $zero, $zero, 12256 # R_MICROMIPS_PC16_S1 b label --> beq $zero, $zero, label # R_MICROMIPS_PC16_S1 Differential Revision: http://reviews.llvm.org/D3514 llvm-svn: 226577	2015-01-20 16:45:27 +00:00
Kai Nacke	63072f81b3	[mips] Add octeon branch instructions bbit0/bbit032/bbit1/bbit132 This commits adds the octeon branch instructions bbit0/bbit032/bbit1/bbit132. It also includes patterns for instruction selection and test cases. Reviewed by D. Sanders llvm-svn: 226573	2015-01-20 16:10:51 +00:00
Evgeniy Stepanov	c5b974e6d2	[msan] Optimize -msan-check-constant-shadow. The new code does not create new basic blocks in the case when shadow is a compile-time constant; it generates either an unconditional __msan_warning call or nothing instead. llvm-svn: 226569	2015-01-20 15:21:35 +00:00
Mohit K. Bhakkad	46ad7f7ec5	[MSan][LLVM][MIPS] Shadow and Origin offsets for MIPS Reviewers: kcc, samsonov, petarj, eugenis Differential Revision: http://reviews.llvm.org/D6146 llvm-svn: 226565	2015-01-20 13:05:42 +00:00
Craig Topper	9f4d485610	[x86] Add some mayLoad/hasSideEffects flags. Remove one that was already covered by a pattern. llvm-svn: 226562	2015-01-20 12:15:30 +00:00
Chandler Carruth	aaf0b4cd57	[PM] Port LoopInfo to the new pass manager, adding both a LoopAnalysis pass and a LoopPrinterPass with the expected associated wiring. I've added a RUN line to the only test case (!!!) we have that actually prints loops. Everything seems to be working. This is somewhat exciting as this is the first analysis using another analysis to go in for the new pass manager. =D I also believe it is the last analysis necessary for porting instcombine, but of course I may yet discover more. llvm-svn: 226560	2015-01-20 10:58:50 +00:00
Daniel Jasper	d106b734cf	Factor out a splitSwitchCase() function so that it can be reused. This is in preparation for a fix to llvm.org/PR22262. One of the ideas here is to first find a good jump table range first and then split before and after it. Thereby, we don't need to use the split-based-on-density heuristic at all, which can make the "binary tree" deteriorate in various cases. Also some minor cleanups. No functional changes. llvm-svn: 226551	2015-01-20 08:57:44 +00:00
Chandler Carruth	5175b9a7b9	[PM] Move the LoopInfo analysis pointer into the InstCombiner class along with the other analyses. The most obvious reason why is because eventually I need to separate out the pass layer from the rest of the instcombiner. However, it is also probably a compile time win as every query through the pass manager layer is pretty slow these days. llvm-svn: 226550	2015-01-20 08:35:24 +00:00
Karthik Bhat	0b0f4660fa	Fix Operandreorder logic in SLPVectorizer to generate longer vectorizable chain. This patch fixes 2 issues in reorderInputsAccordingToOpcode 1) AllSameOpcodeLeft and AllSameOpcodeRight was being calculated incorrectly resulting in code not being vectorized in few cases. 2) Adds logic to reorder operands if we get longer chain of consecutive loads enabling vectorization. Handled the same for cases were we have AltOpcode. Thanks Michael for inputs and review. Review: http://reviews.llvm.org/D6677 llvm-svn: 226547	2015-01-20 06:11:00 +00:00
David Majnemer	3087b22e1a	Bitcode: Don't create comdats when autoupgrading macho bitcode Don't infer COMDAT groups from older bitcode if the target is macho, it doesn't have COMDATs. llvm-svn: 226546	2015-01-20 05:58:07 +00:00
Duncan P. N. Exon Smith	aa687a3d4c	Reapply "IR: Simplify DIBuilder's HeaderBuilder API, NFC" This reverts commit r226542, effectively reapplying r226540. This time, initialize `IsEmpty` in the copy and move constructors as well. llvm-svn: 226545	2015-01-20 05:02:42 +00:00
Duncan P. N. Exon Smith	5f39dfd429	Revert "IR: Simplify DIBuilder's HeaderBuilder API, NFC" This reverts commit r226540, since I hit an unexpected bot failure [1]. I'll investigate. [1]: http://bb.pgr.jp/builders/cmake-llvm-x86_64-linux/builds/20244 llvm-svn: 226542	2015-01-20 03:01:27 +00:00
Duncan P. N. Exon Smith	03e0583a2d	IR: Move MDNode clone() methods from ValueMapper to MDNode, NFC Now that the clone methods used by `MapMetadata()` don't do any remapping (and return a temporary), they make more sense as member functions on `MDNode` (and subclasses). llvm-svn: 226541	2015-01-20 02:56:57 +00:00
Duncan P. N. Exon Smith	8a07e7f657	IR: Simplify DIBuilder's HeaderBuilder API, NFC Change `HeaderBuilder` API to work well even when it's not starting with a tag. There's already one case like this, and the tag is moving elsewhere as part of PR22235. llvm-svn: 226540	2015-01-20 02:54:07 +00:00
Duncan P. N. Exon Smith	a7477285b9	AsmParser: PARSE_MD_FIELD() => ParseMDField(), NFC Extract most of `PARSE_MD_FIELD()` into a function. llvm-svn: 226539	2015-01-20 02:42:29 +00:00
Duncan P. N. Exon Smith	8839cb1dc8	AsmParser: Refactor duplicate code, NFC llvm-svn: 226538	2015-01-20 02:39:21 +00:00
Chandler Carruth	10f28f26fd	[PM] Replace the Pass argument in MergeBasicBlockIntoOnlyPred with a DominatorTree argument as that is the analysis that it wants to update. This removes the last non-loop utility function in Utils/ which accepts a raw Pass argument. llvm-svn: 226537	2015-01-20 01:37:09 +00:00
Duncan P. N. Exon Smith	408f5a25fa	IR: Delete GenericDwarfNode during teardown Fix a leak in `LLVMContextImpl` teardown that the leak sanitizer tracked down [1]. I've just switched to automatic dispatch here (since I'll inevitably forget again with the next class). [1]: http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-fast/builds/811/steps/check-llvm%20asan/logs/stdio llvm-svn: 226536	2015-01-20 01:18:32 +00:00
Duncan P. N. Exon Smith	db6bc8bfdf	Bitcode: Simplify MDNode subclass dispatch, NFC llvm-svn: 226535	2015-01-20 01:03:09 +00:00
Duncan P. N. Exon Smith	6592deeab2	Bitcode: WriteMDNode() => WriteMDTuple(), NFC llvm-svn: 226534	2015-01-20 01:01:53 +00:00
Duncan P. N. Exon Smith	9a6f64e7b8	Bitcode: Add ValueEnumerator::getMetadataOrNullID(), NFC llvm-svn: 226533	2015-01-20 01:00:23 +00:00
Duncan P. N. Exon Smith	2da09e4408	IR: Canonicalize GenericDwarfNode empty headers to null llvm-svn: 226532	2015-01-20 00:58:46 +00:00
Duncan P. N. Exon Smith	0f529998a5	IR: Detect whether to call recalculateHash() via SFINAE, NFC Rather than relying on updating switch statements correctly, detect whether `setHash()` exists in the subclass. If so, call `recalculateHash()` and `setHash(0)` appropriately. llvm-svn: 226531	2015-01-20 00:57:33 +00:00
Duncan P. N. Exon Smith	fed199a758	IR: Introduce GenericDwarfNode As part of PR22235, introduce `DwarfNode` and `GenericDwarfNode`. The former is a metadata node with a DWARF tag. The latter matches our current (generic) schema of a header with string (and stringified integer) data and an arbitrary number of operands. This doesn't move it into place yet; that change will require a large number of testcase updates. llvm-svn: 226529	2015-01-20 00:01:43 +00:00
Duncan P. N. Exon Smith	2a6b5fcfaa	AsmParser: Abstract more of MDLocation parser, NFC llvm-svn: 226527	2015-01-19 23:44:41 +00:00
Duncan P. N. Exon Smith	66ca92e509	AsmParser: Split up ParseMDFieldsImpl(), NFC llvm-svn: 226526	2015-01-19 23:39:32 +00:00
Duncan P. N. Exon Smith	13890af51c	AsmParser: Fix error location for missing fields llvm-svn: 226524	2015-01-19 23:32:36 +00:00
Duncan P. N. Exon Smith	909131b95f	IR: Cleanup MDNode field use, NFC Swap usage of `SubclassData32` and `MDNodeSubclassData`, and rename `MDNodeSubclassData` to `NumUnresolved`. Small drive-by cleanup to `countUnresolvedOperands()` since otherwise the name clash with local vars named `NumUnresolved` would be confusing. llvm-svn: 226523	2015-01-19 23:18:34 +00:00
Duncan P. N. Exon Smith	8647529250	IR: Move replaceWithUniqued(), etc., to source file, NFC llvm-svn: 226522	2015-01-19 23:17:09 +00:00
Duncan P. N. Exon Smith	a1ae4f6b30	IR: Cleanup MDNode::MDNode(), NFC llvm-svn: 226521	2015-01-19 23:15:21 +00:00
Duncan P. N. Exon Smith	2bc00f4a38	IR: Merge UniquableMDNode back into MDNode, NFC As pointed out in r226501, the distinction between `MDNode` and `UniquableMDNode` is confusing. When we need subclasses of `MDNode` that don't use all its functionality it might make sense to break it apart again, but until then this makes the code clearer. llvm-svn: 226520	2015-01-19 23:13:14 +00:00
Duncan P. N. Exon Smith	93e983e707	IR: Extract MDNodeOpsKey, NFC Make the MDTuple operand hashing logic reusable. llvm-svn: 226519	2015-01-19 22:53:18 +00:00
Duncan P. N. Exon Smith	f9d1bc9919	IR: Simplify uniquifyImpl(), NFC llvm-svn: 226518	2015-01-19 22:52:07 +00:00
Duncan P. N. Exon Smith	6cf10d2786	IR: Simplify erasing from uniquing store, NFC llvm-svn: 226517	2015-01-19 22:47:08 +00:00
Duncan P. N. Exon Smith	6dc22bf27b	Utils: Simplify MapMetadata(), NFC Extract out the operand remapping loops, which are now very similar. llvm-svn: 226515	2015-01-19 22:44:32 +00:00
Duncan P. N. Exon Smith	9fa10658ce	Skip upcast, NFC llvm-svn: 226514	2015-01-19 22:41:14 +00:00
Simon Pilgrim	20bc37c7db	[X86][AVX] Missing AVX1 memory folding float instructions Now that we can create much more exhaustive X86 memory folding tests, this patch adds the missing AVX1/F16C floating point instruction stack foldings we can easily test for including the scalar intrinsics (add, div, max, min, mul, sub), conversions float/int to double, half precision conversions, rounding, dot product and bit test. The patch also adds a couple of obviously missing SSE instructions (more to follow once we have full SSE testing). Now that scalar folding is working it broke a very old test (2006-10-07-ScalarSSEMiscompile.ll) - this test appears to make no sense as its trying to ensure that a scalar subtraction isn't folded as it 'would zero the top elts of the loaded vector' - this test just appears to be wrong to me. Differential Revision: http://reviews.llvm.org/D7055 llvm-svn: 226513	2015-01-19 22:40:45 +00:00
Duncan P. N. Exon Smith	c862be860d	Fix whitespace, NFC llvm-svn: 226512	2015-01-19 22:40:25 +00:00
Duncan P. N. Exon Smith	0dcffe2cdc	Utils: Simplify MapMetadata(), NFC Take advantage of the new ability of temporary nodes to mutate to distinct and uniqued nodes to greatly simplify the `MapMetadata()` helper functions. llvm-svn: 226511	2015-01-19 22:39:07 +00:00
Duncan P. N. Exon Smith	e33530909d	IR: Allow temporary nodes to become uniqued or distinct Add `MDNode::replaceWithUniqued()` and `MDNode::replaceWithDistinct()`, which mutate temporary nodes to become uniqued or distinct. On uniquing collisions, the unique version is returned and the node is deleted. This takes advantage of temporary nodes being folded back in, and should let me clean up some awkward logic in `MapMetadata()`. llvm-svn: 226510	2015-01-19 22:24:52 +00:00
Duncan P. N. Exon Smith	c5a0e2e3a7	IR: Split out countUnresolvedOperands(), NFC llvm-svn: 226508	2015-01-19 22:18:29 +00:00
Duncan P. N. Exon Smith	422e5c7acc	Cleanup whitespace, NFC llvm-svn: 226507	2015-01-19 22:16:01 +00:00
Duncan P. N. Exon Smith	7d82313bcd	IR: Return unique_ptr from MDNode::getTemporary() Change `MDTuple::getTemporary()` and `MDLocation::getTemporary()` to return (effectively) `std::unique_ptr<T, MDNode::deleteTemporary>`, and clean up call sites. (For now, `DIBuilder` call sites just call `release()` immediately.) There's an accompanying change in each of clang and polly to use the new API. llvm-svn: 226504	2015-01-19 21:30:18 +00:00
Rafael Espindola	2658554aec	Add r224985 back with fixes. The fixes are to note that AArch64 has additional restrictions on when local relocations can be used. In particular, ld64 requires that relocations to cstring/cfstrings use linker visible symbols. Original message: In an assembly expression like bar: .long L0 + 1 the intended semantics is that bar will contain a pointer one byte past L0. In sections that are merged by content (strings, 4 byte constants, etc), a single position in the section doesn't give the linker enough information. For example, it would not be able to tell a relocation must point to the end of a string, since that would look just like the start of the next. The solution used in ELF to use relocation with symbols if there is a non-zero addend. In MachO before this patch we would just keep all symbols in some sections. This would miss some cases (only cstrings on x86_64 were implemented) and was inefficient since most relocations have an addend of 0 and can be represented without the symbol. This patch implements the non-zero addend logic for MachO too. llvm-svn: 226503	2015-01-19 21:11:14 +00:00
Duncan P. N. Exon Smith	946fdcc50c	IR: Remove MDNodeFwdDecl Remove `MDNodeFwdDecl` (as promised in r226481). Aside from API changes, there's no real functionality change here. `MDNode::getTemporary()` now forwards to `MDTuple::getTemporary()`, which returns a tuple with `isTemporary()` equal to true. The main point is that we can now add temporaries of other `MDNode` subclasses, needed for PR22235 (I introduced `MDNodeFwdDecl` in the first place because I didn't recognize this need, and thought they were only needed to handle forward references). A few things left out of (or highlighted by) this commit: - I've had to remove the (few) uses of `std::unique_ptr<>` to deal with temporaries, since the destructor is no longer public. `getTemporary()` should probably return the equivalent of `std::unique_ptr<T, MDNode::deleteTemporary>`. - `MDLocation::getTemporary()` doesn't exist yet (worse, it actually does exist, but does the wrong thing: `MDNode::getTemporary()` is inherited and returns an `MDTuple`). - `MDNode` now only has one subclass, `UniquableMDNode`, and the distinction between them is actually somewhat confusing. I'll fix those up next. llvm-svn: 226501	2015-01-19 20:36:39 +00:00
Colin LeMahieu	0ee02fc9fe	[Hexagon] Updating muxir/ri/ii intrinsics. Setting predicate registers as compatible with i32 rather than doing custom type conversion. llvm-svn: 226500	2015-01-19 20:31:18 +00:00
Duncan P. N. Exon Smith	5b8c440100	IR: Extract out and reuse `storeImpl()`, NFC llvm-svn: 226499	2015-01-19 20:18:13 +00:00
Duncan P. N. Exon Smith	b57f9e9735	IR: Extract out getUniqued(), NFC llvm-svn: 226498	2015-01-19 20:16:50 +00:00
Duncan P. N. Exon Smith	1b0064d0d2	IR: Reuse `getImpl()` for `getDistinct()`, NFC Merge `getDistinct()`'s implementation with those of `get()` and `getIfExists()` for both `MDTuple` and `MDLocation`. This will make it easier to scale to supporting temporaries. llvm-svn: 226497	2015-01-19 20:14:15 +00:00
Duncan P. N. Exon Smith	efdf285bbe	IR: Simplify MDNode::setOperand(), NFC llvm-svn: 226492	2015-01-19 19:29:25 +00:00
Duncan P. N. Exon Smith	3d5805685b	IR: Simplify handleChangedOperand() fast path, NFC Use `isUniqued()` instead of `isStoredDistinctInContext()`, and remove an assertion that won't be valid once temporaries are merged back in. llvm-svn: 226491	2015-01-19 19:28:28 +00:00
Duncan P. N. Exon Smith	b8f796031f	IR: Remove direct comparisons against Metadata::Storage, NFC llvm-svn: 226490	2015-01-19 19:26:24 +00:00
Duncan P. N. Exon Smith	f08b8b4be6	IR: Assert that resolve() is only called on uniqued nodes, NFC Add an assertion in `UniquableMDNode::resolve()` to prevent temporaries from being resolved (once they're merged back in). Needed to shuffle order of `resolve()` and `storeDistinctInContext()` to prevent it from firing. llvm-svn: 226489	2015-01-19 19:25:33 +00:00
Duncan P. N. Exon Smith	105acf7885	IR: Remove isa<UniquableMDNode>, NFC llvm-svn: 226488	2015-01-19 19:10:14 +00:00
Duncan P. N. Exon Smith	9b1c6d34e5	IR: Simplify DIBuilder::trackIfUnresolved(), NFC llvm-svn: 226487	2015-01-19 19:09:14 +00:00
Duncan P. N. Exon Smith	e34014d11c	IR: Remove isa<MDNodeFwdDecl>, NFC llvm-svn: 226486	2015-01-19 19:06:41 +00:00
Duncan P. N. Exon Smith	66ed52231f	IR: Unify code for MDNode::isResolved(), NFC Unify the definitions of `MDNode::isResolved()` and `UniquableMDNode::isResolved()`. Previously, `UniquableMDNode` could answer this question more efficiently, but now that RAUW support has been unified with `MDNodeFwdDecl`, `MDNode` doesn't need any casts to figure out the answer. llvm-svn: 226485	2015-01-19 19:03:18 +00:00
Duncan P. N. Exon Smith	2711ca7c28	IR: Store RAUW support and Context in the same pointer, NFC Add an `LLVMContext &` to `ReplaceableMetadataImpl`, create a class that either holds a reference to an `LLVMContext` or owns a `ReplaceableMetadataImpl`, and use the new class in `MDNode`. - This saves a pointer in `UniquableMDNode` at the cost of a pointer in `ValueAsMetadata` (which didn't used to store the `LLVMContext`). There are far more of the former. - Unifies RAUW support between `MDNodeFwdDecl` (which is going away, see r226481) and `UniquableMDNode`. llvm-svn: 226484	2015-01-19 19:02:06 +00:00
Colin LeMahieu	fcd4569af6	[Hexagon] Converting intrinsics combine imm/imm, simple shifts and extends. llvm-svn: 226483	2015-01-19 18:56:19 +00:00
Duncan P. N. Exon Smith	de03a8b38d	IR: Add isUniqued() and isTemporary() Change `MDNode::isDistinct()` to only apply to 'distinct' nodes (not temporaries), and introduce `MDNode::isUniqued()` and `MDNode::isTemporary()` for the other two possibilities. llvm-svn: 226482	2015-01-19 18:45:35 +00:00
Duncan P. N. Exon Smith	f134045365	IR: Use an enum to describe Metadata storage, NFC More clearly describe the type of storage used for `Metadata`. - `Uniqued`: uniqued, stored in the context. - `Distinct`: distinct, stored in the context. - `Temporary`: not owned by anyone. This is the first in a series of commits to fix a design problem with `MDNodeFwdDecl` that I need to solve for PR22235. While `MDNodeFwdDecl` works well as a forward declaration, we use `MDNode::getTemporary()` for more than forward declarations -- we also need to create early versions of nodes (with fields not filled in) that we'll fill out later (see `DIBuilder::finalize()` and `CGDebugInfo::finalize()` for examples). This was a blind spot I had when I introduced `MDNodeFwdDecl` (which David Blaikie (indirectly) highlighted in an unrelated review [1]). [1]: http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20150112/252381.html In general, we need `MDTuple::getTemporary()` to give a temporary tuple (like `MDNodeFwdDecl`), `MDLocation::getTemporary()` to give a temporary location, and (the problem at hand) `GenericDebugMDNode::getTemporary()` to give a temporary generic debug node. So I need to fold the idea of "temporary" nodes back into `UniquableMDNode`. (More commits to follow as I refactor.) llvm-svn: 226481	2015-01-19 18:36:18 +00:00
Colin LeMahieu	9327bdad2f	[Hexagon] Converting remaining ALU32/ALU intrinsics. llvm-svn: 226480	2015-01-19 18:33:58 +00:00
Colin LeMahieu	663419b008	[Hexagon] Converting ALU32/ALU intrinsics to new patterns. llvm-svn: 226478	2015-01-19 18:22:19 +00:00
Adrian Prantl	5883af3faa	Remove support for DIVariable's FlagIndirectVariable and expect frontends to use a DIExpression with a DW_OP_deref instead. This is not only a much more natural place for this informationl; there is also a technical reason: The FlagIndirectVariable is used to mark a variable that is turned into a reference by virtue of the calling convention; this happens for example to aggregate return values. The inliner, for example, may actually need to undo this indirection to correctly represent the value in its new context. This is impossible to implement because the DIVariable can't be safely modified. We can however safely construct a new DIExpression on the fly. llvm-svn: 226476	2015-01-19 17:57:29 +00:00
Greg Fitzgerald	fa78d08675	[AArch64] Implement GHC calling convention Original patch by Luke Iannini. Minor improvements and test added by Erik de Castro Lopo. Differential Revision: http://reviews.llvm.org/D6877 From: Erik de Castro Lopo <erikd@mega-nerd.com> llvm-svn: 226473	2015-01-19 17:40:05 +00:00
Colin LeMahieu	310bad8b7e	[Hexagon] Converting halfword to double accumulating multiply intrinsics. llvm-svn: 226472	2015-01-19 17:36:32 +00:00
Rafael Espindola	c569ac46eb	Produce errors when an assignment expression would use a common symbol. An assignment will produce a symbol with a given section and offset. There is no way to represent something like "1 byte after a common symbol". This matches the behavior of GNU as. Part of PR22217. llvm-svn: 226470	2015-01-19 17:30:24 +00:00
Bradley Smith	3131e85edd	[ARM] SSAT/USAT with an 'asr #32' shift should result in an undefined encoding rather than unpredictable llvm-svn: 226469	2015-01-19 16:37:17 +00:00
Bradley Smith	30057b245e	[ARM] Fixup sign extend instruction availability w.r.t. DSP extension llvm-svn: 226468	2015-01-19 16:36:02 +00:00
Rafael Espindola	12ca34f53f	Bring r226038 back. No change in this commit, but clang was changed to also produce trivial comdats when needed. Original message: Don't create new comdats in CodeGen. This patch stops the implicit creation of comdats during codegen. Clang now sets the comdat explicitly when it is required. With this patch clang and gcc now produce the same result in pr19848. llvm-svn: 226467	2015-01-19 15:16:06 +00:00
Chandler Carruth	d450056c78	[PM] Replace the Pass argument to SplitEdge with specific analyses used and updated. This may appear to remove handling for things like alias analysis when splitting critical edges here, but in fact no callers of SplitEdge relied on this. Similarly, all of them wanted to preserve LCSSA if there was any update of the loop info. That makes the interface much simpler. With this, all of BasicBlockUtils.h is free of Pass arguments and prepared for the new pass manager. This is tho majority of utilities that relied on pass arguments. llvm-svn: 226459	2015-01-19 12:36:53 +00:00
Chandler Carruth	f8753fc48d	[PM] Cleanup a dead option to critical edge splitting that I noticed while refactoring this API for the new pass manager. No functionality changed here, the code didn't actually support this option. llvm-svn: 226457	2015-01-19 12:12:00 +00:00
Chandler Carruth	37df2cfbf8	[PM] Remove the Pass argument from all of the critical edge splitting APIs and replace it and numerous booleans with an option struct. The critical edge splitting API has a really large surface of flags and so it seems worth burning a small option struct / builder. This struct can be constructed with the various preserved analyses and then flags can be flipped in a builder style. The various users are now responsible for directly passing along their analysis information. This should be enough for the critical edge splitting to work cleanly with the new pass manager as well. This API is still pretty crufty and could be cleaned up a lot, but I've focused on this change just threading an option struct rather than a pass through the API. llvm-svn: 226456	2015-01-19 12:09:11 +00:00
Chandler Carruth	ad34d91343	[PM] Relax asserts and always try to reconstruct loop simplify form when we can while splitting critical edges. The only code which called this and didn't require simplified loops to be preserved is polly, and the code behaves correctly there anyways. Without this change, it becomes really hard to share this code with the new pass manager where things like preserving loop simplify form don't make any sense. If anyone discovers this code behaving incorrectly, what it should be testing for is whether the loops it needs to be in simplified form are in fact in that form. It should always be trying to preserve that form when it exists. llvm-svn: 226443	2015-01-19 10:23:00 +00:00
Erik Eckstein	76cb53a839	SLPVectorizer: limit the number of alias checks to reduce the runtime. In case of blocks with many memory-accessing instructions, alias checking can take lot of time (because calculating the memory dependencies has quadratic complexity). I chose a limit which resulted in no changes when running the benchmarks. llvm-svn: 226439	2015-01-19 09:33:38 +00:00
Hal Finkel	c3168129af	[PowerPC] Minor correction to r226432 We don't need to exclude patchpoints from the implicit r2 dependence in FastISel because it is added as an implicit operand and, thus, should not confuse that StackMap code. By inspection / no test case. llvm-svn: 226434	2015-01-19 07:44:45 +00:00
Michael Kuperstein	54c61edee7	[MIScheduler] Slightly better handling of constrainLocalCopy when both source and dest are local This fixes PR21792. Differential Revision: http://reviews.llvm.org/D6823 llvm-svn: 226433	2015-01-19 07:30:47 +00:00
Hal Finkel	af51993ee1	[PowerPC] Add r2 as an operand for all calls under both PPC64 ELF V1 and V2 Our PPC64 ELF V2 call lowering logic added r2 as an operand to all direct call instructions in order to represent the dependency on the TOC base pointer value. Restricting this to ELF V2, however, does not seem to make sense: calls under ELF V1 have the same dependence, and indirect calls have an r2 dependence just as direct ones. Make sure the dependence is noted for all calls under both ELF V1 and ELF V2. llvm-svn: 226432	2015-01-19 07:20:27 +00:00
Craig Topper	f4bf9119a1	[x86] Change AVX512 intrinsics to take a 8-bit immediate for the comparision kind instead of a 32-bit immediate. This better aligns with the emitted instruction. It also matches SSE and AVX1 equivalents. Also add auto upgrade support. llvm-svn: 226430	2015-01-19 06:07:27 +00:00
Chandler Carruth	0eae112009	[PM] Lift the analyses into the interface for SplitLandingPadPredecessors and remove the Pass argument from its interface. Another step to the utilities being usable with both old and new pass managers. llvm-svn: 226426	2015-01-19 03:03:39 +00:00
David Blaikie	186db431c0	unique_ptrify the RelInfo parameter to TargetRegistry::createMCSymbolizer llvm-svn: 226416	2015-01-18 20:45:48 +00:00
David Blaikie	9459832ebd	std::unique_ptrify the MCStreamer argument to createAsmPrinter llvm-svn: 226414	2015-01-18 20:29:04 +00:00
Hal Finkel	58884f9fe6	[PowerPC] Don't hard-code R2 as register when processing TOC relocations Instructions that have high-order TOC relocations always carry R2 as their base register, so it does not matter whether we take the register from the instruction or just hard-code it in PPCAsmPrinter. In the future, however, we might want to apply these relocations to instructions using a different register, so taking the register from the instruction is a better thing to do. No change in functionality here, however. llvm-svn: 226403	2015-01-18 15:59:44 +00:00
Hal Finkel	8ea446b6a4	[PowerPC] Add some FIXMEs for fastcc and FPR <-> GPR moves So we don't forget, once we support FPR <-> GPR moves on the P8, we'll likely want to re-visit this part of the calling convention. llvm-svn: 226401	2015-01-18 14:31:10 +00:00
Hal Finkel	f81b6dd7a2	[PowerPC] Initial PPC64 calling-convention changes for fastcc The default calling convention specified by the PPC64 ELF (V1 and V2) ABI is designed to work with both prototyped and non-prototyped/varargs functions. As a result, GPRs and stack space are allocated for every argument, even those that are passed in floating-point or vector registers. GlobalOpt::OptimizeFunctions will transform local non-varargs functions (that do not have their address taken) to use the 'fast' calling convention. When functions are using the 'fast' calling convention, don't allocate GPRs for arguments passed in other types of registers, and don't allocate stack space for arguments passed in registers. Other changes for the fast calling convention may be added in the future. llvm-svn: 226399	2015-01-18 12:08:47 +00:00
Chandler Carruth	b5797b659f	[PM] Pull the analyses used for another utility routine into its API rather than relying on the pass object. This one is a bit annoying, but will pay off. First, supporting this one will make the next one much easier, and for utilities like LoopSimplify, this is moving them (slowly) closer to not having to pass the pass object around throughout their APIs. llvm-svn: 226396	2015-01-18 09:21:15 +00:00
Chandler Carruth	32c52c7e04	[PM] Sink the specific analyses preserved by SplitBlock into its interface, removing Pass from its interface. This also makes those analyses optional so that passes which don't even preserve these (or use them) can skip the logic entirely. llvm-svn: 226394	2015-01-18 02:39:37 +00:00
Chandler Carruth	b5c115357c	[PM] Replace another Pass argument with specific analyses that are optionally updated by MergeBlockIntoPredecessors. No functionality changed, just refactoring to clear the way for the new pass manager. llvm-svn: 226392	2015-01-18 02:11:23 +00:00
Chandler Carruth	94209094a5	[PM] Refactor how the LoopRotation pass access the DominatorTree. Instead of querying the pass every where we need to, do that once and cache a pointer in the pass object. This is both simpler and I'm about to add yet another place where we need to dig out that pointer. llvm-svn: 226391	2015-01-18 02:08:05 +00:00
Chandler Carruth	5eee895ccf	[PM] Lift the actual analyses used into the inferface rather than accepting a Pass and querying it for analyses. This is necessary to allow the utilities to work both with the old and new pass managers, and I also think this makes the interface much more clear and helps the reader know what analyses the utility can actually handle. I plan to repeat this process iteratively to clean up all the pass utilities. llvm-svn: 226386	2015-01-18 01:45:07 +00:00
Chandler Carruth	691addc25f	[PM] Now that LoopInfo isn't in the Pass type hierarchy, it is much cleaner to derive from the generic base. Thise removes a ton of boiler plate code and somewhat strange and pointless indirections. It also remove a bunch of the previously needed friend declarations. To fully remove these, I also lifted the verify logic into the generic LoopInfoBase, which seems good anyways -- it is generic and useful logic even for the machine side. llvm-svn: 226385	2015-01-18 01:25:51 +00:00
Chandler Carruth	bc045a5a33	[PM] Cleanup more warnings my refactoring exposed where now we have unused variables in a no-asserts build. I've fixed this by putting the entire loop behind an #ifndef as it contains nothing other than asserts. llvm-svn: 226377	2015-01-17 14:49:23 +00:00
Chandler Carruth	24fd029a60	[PM] Remove a dead field. This was dead even before I refactored how we initialized it, but my refactoring made it trivially dead and it is now caught by a Clang warning. This fixes the warning and should clean up the -Werror bot failures (sorry!). llvm-svn: 226376	2015-01-17 14:31:35 +00:00
Chandler Carruth	4f8f307c77	[PM] Split the LoopInfo object apart from the legacy pass, creating a LoopInfoWrapperPass to wire the object up to the legacy pass manager. This switches all the clients of LoopInfo over and paves the way to port LoopInfo to the new pass manager. No functionality change is intended with this iteration. llvm-svn: 226373	2015-01-17 14:16:18 +00:00
Hal Finkel	c19805a75d	[PowerPC] Don't list R11 as a patchpoint scratch register R11's status is the same under both the PPC64 ELF V1 and V2 ABIs: it is reserved for use as an "environment pointer" for compilation models that require such a thing. We don't, we also don't need a second scratch register, and because we support only "local" patchpoint call targets, we might as well let R11 be used for anyregcc patchpoints. llvm-svn: 226369	2015-01-17 03:57:34 +00:00
Mehdi Amini	37f316afaf	Improve DAG combine pass on certain IR vector patterns Loading 2 2x32-bit float vectors into the bottom half of a 256-bit vector produced suboptimal code in AVX2 mode with certain IR combinations. In particular, the IR optimizer folded 2f32 + 2f32 -> 4f32, 4f32 + 4f32 (undef) -> 8f32 into a 2f32 + 2f32 -> 8f32, which seems more canonical, but then mysteriously generated rather bad code; the movq/movhpd combination didn't match. The problem lay in the BUILD_VECTOR optimization path. The 2f32 inputs would get promoted to 4f32 by the type legalizer, eventually resulting in a BUILD_VECTOR on two 4f32 into an 8f32. The BUILD_VECTOR then, recognizing these were both half the output size, concatted them and then produced a shuffle. However, the resulting concat + shuffle was more complex than it should be; in the case where the upper half of the output is undef, we probably want to generate shuffle + concat instead. This enhancement causes the vector_shuffle combine step to recognize this suboptimal pattern and correct it. I included it there instead of in BUILD_VECTOR in case the same suboptimal pattern occurs for other reasons. This results in the optimizer correctly producing the optimal movq + movhpd sequence for all three variations on this IR, even with AVX2. I've included a test case. Radar link: rdar://problem/19287012 Fix for PR 21943. From: Fiona Glaser <fglaser@apple.com> llvm-svn: 226360	2015-01-17 01:35:56 +00:00
Lang Hames	2996895f28	[RuntimeDyld] Tidy up emitCommonSymbols a little. NFC. llvm-svn: 226358	2015-01-17 00:55:05 +00:00
Richard Trieu	73d06526ba	Remove std::move that was preventing return value optimization. llvm-svn: 226356	2015-01-17 00:46:44 +00:00
Matthias Braun	7618b2b23d	RegisterCoalescer: Cleanup and improved comment for a subtle detail. llvm-svn: 226353	2015-01-17 00:33:13 +00:00
Matthias Braun	0eb940aed0	RegisterCoalescer: Cleanup by factoring out a common expression llvm-svn: 226352	2015-01-17 00:33:11 +00:00
Matthias Braun	e2fa081615	RegisterCoalescer: Cleanup comment style - Consistenly put comments above the function declaration, not the definition. To achieve this some duplicate comments got merged and some comment parts describing implementation details got moved into their functions. - Consistently use doxygen comments above functions. - Do not use doxygen comments inside functions. llvm-svn: 226351	2015-01-17 00:33:09 +00:00
Matthias Braun	fc6ef3a270	RegisterCoalescer: Drive-by typo + whitespace fix llvm-svn: 226350	2015-01-17 00:33:06 +00:00
Lang Hames	1f7eab338f	[RuntimeDyld] Remove the brace initialization that was introduced in r226341. Evidently MSVC doesn't like it. llvm-svn: 226349	2015-01-17 00:32:56 +00:00
Philip Reames	287987ca13	Update a comment Be a bit more explicit about the fact that addrspace(1) is not reserved. llvm-svn: 226344	2015-01-16 23:21:07 +00:00
Philip Reames	36319538d0	clang-format all the GC related files (NFC) Nothing interesting here... llvm-svn: 226342	2015-01-16 23:16:12 +00:00
Lang Hames	6bfd398022	[RuntimeDyld] Track symbol visibility in RuntimeDyld. RuntimeDyld symbol info previously consisted of just a Section/Offset pair. This patch replaces that pair type with a SymbolInfo class that also tracks symbol visibility. A new method, RuntimeDyld::getExportedSymbolLoadAddress, is introduced which only returns a non-zero result for exported symbols. For non-exported or non-existant symbols this method will return zero. The RuntimeDyld::getSymbolAddress method retains its current behavior, returning non-zero results for all symbols regardless of visibility. No in-tree clients of RuntimeDyld are changed. The newly introduced functionality will be used by the Orc APIs. No test case: Since this patch doesn't modify the behavior for any in-tree clients we don't have a good tool to test this with yet. Once Orc is in we can use it to write regression tests that test these changes. llvm-svn: 226341	2015-01-16 23:13:56 +00:00
Kevin Enderby	c1271893af	Fix the Archive::Child::getRawSize() method used by llvm-objdump’s -archive-headers option and tweak its use in llvm-objdump. Add back the test case for the -archive-headers option. llvm-svn: 226332	2015-01-16 22:10:36 +00:00
Colin LeMahieu	823415b881	[Hexagon] Converting halfword to doubleword multiply intrinsics. llvm-svn: 226326	2015-01-16 21:41:57 +00:00
Colin LeMahieu	cd9b276966	[Hexagon] Converting accumulating halfword multiply intrinsics to patterns. llvm-svn: 226324	2015-01-16 21:36:34 +00:00
Colin LeMahieu	3b047e0ee5	[Hexagon] Beginning converting intrinsics to patterns instead of duplicated definitions. Converting halfword multiply intrinsics. llvm-svn: 226318	2015-01-16 20:38:54 +00:00
Colin LeMahieu	54adb6a5d5	[Hexagon] Fix 226309, replacement atomic store patterns didn't actually exist, added new versions. llvm-svn: 226315	2015-01-16 20:16:14 +00:00
Saleem Abdulrasool	c3f8ad3e83	X86: fix comment typo in AsmParser Fix a typo. NFC. llvm-svn: 226313	2015-01-16 20:16:06 +00:00
Philip Reames	2b45395876	Move ownership of GCStrategy objects to LLVMContext Note: This change ended up being slightly more controversial than expected. Chandler has tentatively okayed this for the moment, but I may be revisiting this in the near future after we settle some high level questions. Rather than have the GCStrategy object owned by the GCModuleInfo - which is an immutable analysis pass used mainly by gc.root - have it be owned by the LLVMContext. This simplifies the ownership logic (i.e. can you have two instances of the same strategy at once?), but more importantly, allows us to access the GCStrategy in the middle end optimizer. To this end, I add an accessor through Function which becomes the canonical way to get at a GCStrategy instance. In the near future, this will allows me to move some of the checks from http://reviews.llvm.org/D6808 into the Verifier itself, and to introduce optimization legality predicates for some of the recent additions to InstCombine. (These will follow as separate changes.) Differential Revision: http://reviews.llvm.org/D6811 llvm-svn: 226311	2015-01-16 20:07:33 +00:00
Colin LeMahieu	bb6718b30e	[Hexagon] Removing old duplicate atomic load/store patterns. llvm-svn: 226309	2015-01-16 19:53:35 +00:00
Philip Reames	7de640a876	Remove gc.root's findCustomSafePoints mechanism Searching all of the existing gc.root implementations I'm aware of (all three of them), there was exactly one use of this mechanism, and that was to implement a performance improvement that should have been applied to the default lowering. Having this function is requiring a dependency on a CodeGen class (MachineFunction), in a class which is otherwise completely independent of CodeGen. I could solve this differently, but given that I see absolutely no value in preserving this mechanism, I going to just get rid of it. Note: Tis is the first time I'm intentionally breaking previously supported gc.root functionality. Given 3.6 has branched, I believe this is a good time to do this. Differential Revision: http://reviews.llvm.org/D7004 llvm-svn: 226305	2015-01-16 19:33:28 +00:00
Colin LeMahieu	7d1f632380	[Hexagon] Converting old patterns to new versions using classes. llvm-svn: 226304	2015-01-16 19:29:59 +00:00
Adam Nemet	3e8b22bc1b	[AVX512] Add intrinsics for masked aligned FP loads and stores Similar to the unaligned cases. Test was generated with update_llc_test_checks.py. Part of <rdar://problem/17688758> llvm-svn: 226296	2015-01-16 18:50:09 +00:00
Duncan P. N. Exon Smith	2f5bb31302	IR: Allow 16-bits for column info Raise the limit for column information from 8 bits to 16 bits. llvm-svn: 226291	2015-01-16 17:33:08 +00:00
Duncan P. N. Exon Smith	c9cddb0837	IR: Cleanup dead code, NFC Line/column fixups already exist in `MDLocation`. Delete the duplicated logic in `DebugLoc`. llvm-svn: 226290	2015-01-16 17:31:29 +00:00
Colin LeMahieu	2e3a26de0c	[Hexagon] Updating call/jump instruction patterns. llvm-svn: 226288	2015-01-16 17:05:27 +00:00
Andrea Di Biagio	ae47bc6ab9	[X86][DAG] Disable target specific combine on INSERTPS dag nodes at -O0. This patch disables target specific combine on X86ISD::INSERTPS dag nodes if optlevel is CodeGenOpt::None. The backend currently implements a target specific combine rule that converts a vector load used by an INSERTPS dag node into a scalar load plus a scalar_to_vector. This allows ISel to select a single INSERTPSrm instead of two instructions (i.e. a vector load plus INSERTPSrr). However, the existing target combine rule on INSERTPS nodes only works under the assumption that ISel will always be able to match an INSERTPSrm. This is not true in general at -O0, since the backend only allows folding a load into the memory operand of an instruction if the optimization level is not CodeGenOpt::None. In the example below: // __m128 test(__m128 a, __m128 b) { __m128 c = _mm_insert_ps(a, b, 1 << 6); return c; } // Before this patch, at -O0, the backend would have canonicalized the load to 'b' into a scalar load plus scalar_to_vector. Later on, ISel would have selected an INSERTPSrr leaving the insertps mask in an inconsistent state: movss 4(%rdi), %xmm1 insertps $64, %xmm1, %xmm0 # xmm0 = xmm1[1],xmm0[1,2,3]. With this patch, the backend avoids folding the vector load into the operand of the INSERTPS. The new codegen at -O0 is: movaps (%rdi), %xmm1 insertps $64, %xmm1, %xmm0 # %xmm1[1],xmm0[1,2,3]. llvm-svn: 226277	2015-01-16 14:55:26 +00:00
Toma Tabacu	f476200c63	[mips] Remove a redundant semicolon and add space before curly brackets. NFC. llvm-svn: 226269	2015-01-16 10:45:15 +00:00
Timur Iskhodzhanov	60b721363c	Revert r226242 - Revert Revert Don't create new comdats in CodeGen This breaks AddressSanitizer (ninja check-asan) on Windows llvm-svn: 226251	2015-01-16 08:38:45 +00:00
Hal Finkel	52f7c018d3	[PowerPC] Adjust PatchPoints for ppc64le Bill Schmidt pointed out that some adjustments would be needed to properly support powerpc64le (using the ELF V2 ABI). For one thing, R11 is not available as a scratch register, so we need to use R12. R12 is also available under ELF V1, so to maintain consistency, I flipped the order to make R12 the first scratch register in the array under both ABIs. llvm-svn: 226247	2015-01-16 04:40:58 +00:00
Mehdi Amini	590a2700fc	Fix Reassociate handling of constant in presence of undef float http://reviews.llvm.org/D6993 llvm-svn: 226245	2015-01-16 03:00:58 +00:00
Rafael Espindola	67a79e72f5	Revert "Revert Don't create new comdats in CodeGen" This reverts commit r226173, adding r226038 back. No change in this commit, but clang was changed to also produce trivial comdats for costructors, destructors and vtables when needed. Original message: Don't create new comdats in CodeGen. This patch stops the implicit creation of comdats during codegen. Clang now sets the comdat explicitly when it is required. With this patch clang and gcc now produce the same result in pr19848. llvm-svn: 226242	2015-01-16 02:22:55 +00:00
Sanjoy Das	a1837a342d	Add a new pass "inductive range check elimination" IRCE eliminates range checks of the form 0 <= A * I + B < Length by splitting a loop's iteration space into three segments in a way that the check is completely redundant in the middle segment. As an example, IRCE will convert len = < known positive > for (i = 0; i < n; i++) { if (0 <= i && i < len) { do_something(); } else { throw_out_of_bounds(); } } to len = < known positive > limit = smin(n, len) // no first segment for (i = 0; i < limit; i++) { if (0 <= i && i < len) { // this check is fully redundant do_something(); } else { throw_out_of_bounds(); } } for (i = limit; i < n; i++) { if (0 <= i && i < len) { do_something(); } else { throw_out_of_bounds(); } } IRCE can deal with multiple range checks in the same loop (it takes the intersection of the ranges that will make each of them redundant individually). Currently IRCE does not do any profitability analysis. That is a TODO. Please note that the status of this pass is experimental, and it is not part of any default pass pipeline. Having said that, I will love to get feedback and general input from people interested in trying this out. This pass was originally r226201. It was reverted because it used C++ features not supported by MSVC 2012. Differential Revision: http://reviews.llvm.org/D6693 llvm-svn: 226238	2015-01-16 01:03:22 +00:00
Kevin Enderby	a975d4df1d	This should fix the build bot clang-cmake-armv7-a15-full failing on the macho-archive-headers.test added with r226228. llvm-svn: 226232	2015-01-16 00:27:31 +00:00
Matt Arsenault	eeb2a7e688	R600/SI: Add patterns for v_cvt_{flr\|rpi}_i32_f32 llvm-svn: 226230	2015-01-15 23:58:35 +00:00
Filipe Cabecinhas	c552c9abce	Fix edge case when Start overflowed in 32 bit mode llvm-svn: 226229	2015-01-15 23:50:44 +00:00
Kevin Enderby	13023a1af6	Add the option, -archive-headers, used with -macho to print the Mach-O archive headers to llvm-objdump. llvm-svn: 226228	2015-01-15 23:19:11 +00:00
Matt Arsenault	268757ba60	R600/SI: Fix trailing comma with modifiers Instructions with 1 operand can still use source modifiers, so make sure we don't print an extra comma afterwards. llvm-svn: 226226	2015-01-15 23:17:03 +00:00
Colin LeMahieu	cd9c4e3e07	[Hexagon] Adding new-value store and bit reverse instructions. llvm-svn: 226224	2015-01-15 23:10:29 +00:00
Filipe Cabecinhas	4013950034	Report fatal errors instead of segfaulting/asserting on a few invalid accesses while reading MachO files. Summary: Shift an older “invalid file” test to get a consistent naming for these tests. Bugs found by afl-fuzz Reviewers: rafael Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D6945 llvm-svn: 226219	2015-01-15 22:52:38 +00:00
Lang Hames	7e0692b614	[Object] Add SF_Exported flag. This flag will be set on all symbols that would be exported from a dylib if their containing object file were linked into one. No test case: No command line tools query this flag, and there are no Object unit tests. llvm-svn: 226217	2015-01-15 22:33:30 +00:00
Sanjoy Das	7f62ac8e4d	Revert r226201 (Add a new pass "inductive range check elimination") The change used C++11 features not supported by MSVC 2012. I will fix the change to use things supported MSVC 2012 and recommit shortly. llvm-svn: 226216	2015-01-15 22:18:10 +00:00
David Majnemer	f1f72c9e43	InductiveRangeCheckElimination: Remove extra ';' This silences a GCC warning. llvm-svn: 226215	2015-01-15 21:55:16 +00:00
Andrew Kaylor	204096b59e	Fixing pedantic build warnings. llvm-svn: 226214	2015-01-15 21:50:53 +00:00
Colin LeMahieu	c59328e627	[Hexagon] Fix 226206 by uncommenting required pattern and changing patterns for simple load-extends. llvm-svn: 226210	2015-01-15 21:35:49 +00:00
Hal Finkel	e2ab0f17cf	[PowerPC] Loosen ELFv1 PPC64 func descriptor loads for indirect calls Function pointers under PPC64 ELFv1 (which is used on PPC64/Linux on the POWER7, A2 and earlier cores) are really pointers to a function descriptor, a structure with three pointers: the actual pointer to the code to which to jump, the pointer to the TOC needed by the callee, and an environment pointer. We used to chain these loads, and make them opaque to the rest of the optimizer, so that they'd always occur directly before the call. This is not necessary, and in fact, highly suboptimal on embedded cores. Once the function pointer is known, the loads can be performed ahead of time; in fact, they can be hoisted out of loops. Now these function descriptors are almost always generated by the linker, and thus the contents of the descriptors are invariant. As a result, by default, we'll mark the associated loads as invariant (allowing them to be hoisted out of loops). I've added a target feature to turn this off, however, just in case someone needs that option (constructing an on-stack descriptor, casting it to a function pointer, and then calling it cannot be well-defined C/C++ code, but I can imagine some JIT-compilation system doing so). Consider this simple test: $ cat call.c typedef void (fp)(); void bar(fp x) { for (int i = 0; i < 1600000000; ++i) x(); } $ cat main.c typedef void (fp)(); void bar(fp x); void foo() {} int main() { bar(foo); } On the PPC A2 (the BG/Q supercomputer), marking the function-descriptor loads as invariant brings the execution time down to ~8 seconds from ~32 seconds with the loads in the loop. The difference on the POWER7 is smaller. Compiling with: gcc -std=c99 -O3 -mcpu=native call.c main.c : ~6 seconds [this is 4.8.2] clang -O3 -mcpu=native call.c main.c : ~5.3 seconds clang -O3 -mcpu=native call.c main.c -mno-invariant-function-descriptors : ~4 seconds (looks like we'd benefit from additional loop unrolling here, as a first guess, because this is faster with the extra loads) The -mno-invariant-function-descriptors will be added to Clang shortly. llvm-svn: 226207	2015-01-15 21:17:34 +00:00
Colin LeMahieu	f87697f05e	[Hexagon] Updating indexed load-extend patterns and changing test to new expected output. llvm-svn: 226206	2015-01-15 21:07:52 +00:00
Sanjoy Das	7059e2959d	Add a new pass "inductive range check elimination" IRCE eliminates range checks of the form 0 <= A * I + B < Length by splitting a loop's iteration space into three segments in a way that the check is completely redundant in the middle segment. As an example, IRCE will convert len = < known positive > for (i = 0; i < n; i++) { if (0 <= i && i < len) { do_something(); } else { throw_out_of_bounds(); } } to len = < known positive > limit = smin(n, len) // no first segment for (i = 0; i < limit; i++) { if (0 <= i && i < len) { // this check is fully redundant do_something(); } else { throw_out_of_bounds(); } } for (i = limit; i < n; i++) { if (0 <= i && i < len) { do_something(); } else { throw_out_of_bounds(); } } IRCE can deal with multiple range checks in the same loop (it takes the intersection of the ranges that will make each of them redundant individually). Currently IRCE does not do any profitability analysis. That is a TODO. Please note that the status of this pass is experimental, and it is not part of any default pass pipeline. Having said that, I will love to get feedback and general input from people interested in trying this out. Differential Revision: http://reviews.llvm.org/D6693 llvm-svn: 226201	2015-01-15 20:45:46 +00:00
Hal Finkel	5ef58eb86d	Revert "r226086 - Revert "r226071 - [RegisterCoalescer] Remove copies to reserved registers"" Reapply r226071 with fixes. Two fixes: 1. We need to manually remove the old and create the new 'deaf defs' associated with physical register definitions when we move the definition of the physical register from the copy point to the point of the original vreg def. This problem was picked up by the machinstr verifier, and could trigger a verification failure on test/CodeGen/X86/2009-02-12-DebugInfoVLA.ll, so I've turned on the verifier in the tests. 2. When moving the def point of the phys reg up, we need to make sure that it is neither defined nor read in between the two instructions. We don't, however, extend the live ranges of phys reg defs to cover uses, so just checking for live-range overlap between the pair interval and the phys reg aliases won't pick up reads. As a result, we manually iterate over the range and check for reads. A test soon to be committed to the PowerPC backend will test this change. Original commit message: [RegisterCoalescer] Remove copies to reserved registers This allows the RegisterCoalescer to join "non-flipped" range pairs with a physical destination register -- which allows the RegisterCoalescer to remove copies like this: <vreg> = something (maybe a load, for example) ... (things that don't use PHYSREG) PHYSREG = COPY <vreg> (with all of the restrictions normally applied by the RegisterCoalescer: having compatible register classes, etc. ) Previously, the RegisterCoalescer handled only the opposite case (copying from a physical register). I don't handle the problem fully here, but try to get the common case where there is only one use of <vreg> (the COPY). An upcoming commit to the PowerPC backend will make this pattern much more common on PPC64/ELF systems. llvm-svn: 226200	2015-01-15 20:32:09 +00:00
Philip Reames	66c9fb0d52	Style cleanup of old gc.root lowering code Use static functions for helpers rather than static member functions. a) this changes the linking (minor at best), and b) this makes it obvious no object state is involved. llvm-svn: 226198	2015-01-15 19:49:25 +00:00
Philip Reames	b87144160e	clang-format GCStrategy.cpp & GCRootLowering.cpp (NFC) llvm-svn: 226196	2015-01-15 19:39:17 +00:00
Philip Reames	f27f373895	Split GCStrategy.cpp into two files (NFC) This preparation for an update to http://reviews.llvm.org/D6811. GCStrategy.cpp will hopefully be moving into IR/, where as the lowering logic needs to stay in CodeGen/ llvm-svn: 226195	2015-01-15 19:29:42 +00:00
Colin LeMahieu	538b85810c	[Hexagon] Removing old versions of vsplice, valign, cl0, ct0 and updating references to new versions. llvm-svn: 226194	2015-01-15 19:28:32 +00:00
Marek Olsak	f0b130ace0	R600/SI: Unify VOP2 instructions which are VOP3-only on VI This removes some duplicated classes and definitions. These instructions are defined: _e32 // pseudo _e32_si _e64 // pseudo _e64_si _e64_vi llvm-svn: 226191	2015-01-15 18:43:06 +00:00
Marek Olsak	c536850526	R600/SI: Use 64-bit encoding by default for opcodes that are VOP3-only on VI llvm-svn: 226190	2015-01-15 18:43:01 +00:00
Marek Olsak	15e4a59899	R600/SI: Add V_READLANE_B32 and V_WRITELANE_B32 for VI These are VOP3-only on VI. The new multiclass doesn't define VOP3 versions of VOP2 instructions. llvm-svn: 226189	2015-01-15 18:42:55 +00:00
Marek Olsak	a93603d508	R600/SI: Don't shrink instructions whose e32 encoding doesn't exist v2: modify hasVALU32BitEncoding instead v3: - add pseudoToMCOpcode helper to AMDGPUInstInfo, which is used by both hasVALU32BitEncoding and AMDGPUMCInstLower::lower - report an error if a pseudo can't be lowered llvm-svn: 226188	2015-01-15 18:42:51 +00:00
Marek Olsak	dc4d202f10	R600/SI: Add common class VOPAnyCommon llvm-svn: 226187	2015-01-15 18:42:44 +00:00
Marek Olsak	eae20ab5fd	R600/SI: Don't select SI-only VOP3 opcodes on VI llvm-svn: 226186	2015-01-15 18:42:40 +00:00
Colin LeMahieu	504157f1ae	[Hexagon] Adding vmux instruction. Removing old transfer instructions and updating references. llvm-svn: 226184	2015-01-15 18:16:00 +00:00
Joerg Sonnenberger	b6956e113a	Support @PLT loads on 32bit x86. llvm-svn: 226182	2015-01-15 17:59:02 +00:00
Colin LeMahieu	2d1c14563e	[Hexagon] Deleting old float comparison instruction and updating references to new ones. llvm-svn: 226179	2015-01-15 17:28:14 +00:00
Colin LeMahieu	7959cac725	[Hexagon] Replacing old fadd/fsub instructions and updating references. llvm-svn: 226176	2015-01-15 16:30:07 +00:00
Timur Iskhodzhanov	f5adf13fac	Revert Don't create new comdats in CodeGen It breaks AddressSanitizer on Windows. llvm-svn: 226173	2015-01-15 16:14:34 +00:00
Daniel Sanders	023c806109	[mips] Fix a typo in the compare patterns for MIPS32r6/MIPS64r6. Summary: The patterns intended for the SETLE node were actually matching the SETLT node. Reviewers: atanasyan, sstankovic, vmedic Reviewed By: vmedic Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D6997 llvm-svn: 226171	2015-01-15 15:41:03 +00:00
Mehdi Amini	fa546b29a0	Fix SelectionDAG -view-*-dags filtering llvm-svn: 226163	2015-01-15 12:03:32 +00:00
Alexander Kornienko	8c0809c7f8	Replace size method call of containers to empty method where appropriate This patch was generated by a clang tidy checker that is being open sourced. The documentation of that checker is the following: /// The emptiness of a container should be checked using the empty method /// instead of the size method. It is not guaranteed that size is a /// constant-time function, and it is generally more efficient and also shows /// clearer intent to use empty. Furthermore some containers may implement the /// empty method but not implement the size method. Using empty whenever /// possible makes it easier to switch to another container in the future. Patch by Gábor Horváth! llvm-svn: 226161	2015-01-15 11:41:30 +00:00
Chandler Carruth	8ca43224db	[PM] Port TargetLibraryInfo to the new pass manager, provided by the TargetLibraryAnalysis pass. There are actually no direct tests of this already in the tree. I've added the most basic test that the pass manager bits themselves work, and the TLI object produced will be tested by an upcoming patches as they port passes which rely on TLI. This is starting to point out the awkwardness of the invalidate API -- it seems poorly fitting on the result object. I suspect I will change it to live on the analysis instead, but that's not for this change, and I'd rather have a few more passes ported in order to have more experience with how this plays out. I believe there is only one more analysis required in order to start porting instcombine. =] llvm-svn: 226160	2015-01-15 11:39:46 +00:00
Chandler Carruth	b98f63dbdb	[PM] Separate the TargetLibraryInfo object from the immutable pass. The pass is really just a means of accessing a cached instance of the TargetLibraryInfo object, and this way we can re-use that object for the new pass manager as its result. Lots of delta, but nothing interesting happening here. This is the common pattern that is developing to allow analyses to live in both the old and new pass manager -- a wrapper pass in the old pass manager emulates the separation intrinsic to the new pass manager between the result and pass for analyses. llvm-svn: 226157	2015-01-15 10:41:28 +00:00
Craig Topper	9fdd078afb	Hide some redundant AVX512 instructions from the asm parser, but force them to show up in the disassembler. llvm-svn: 226155	2015-01-15 09:37:15 +00:00
David Majnemer	f0982d0ac6	SimplifyIndVar: Remove unused variable OtherOperandIdx is not used anymore, remove it to silence warnings. llvm-svn: 226138	2015-01-15 07:11:23 +00:00
NAKAMURA Takumi	24ebfcb619	Update libdeps since TLI was moved from Target to Analysis in r226078. llvm-svn: 226126	2015-01-15 05:21:00 +00:00
NAKAMURA Takumi	ab7289dd0e	Reorder. llvm-svn: 226125	2015-01-15 05:20:46 +00:00
Hal Finkel	dd669615dd	Revert "r226071 - [RegisterCoalescer] Remove copies to reserved registers" Reverting this while I investigate some bad behavior this is causing. As a possibly-related issue, adding -verify-machineinstrs to one of the test cases now fails because of this change: llc test/CodeGen/X86/2009-02-12-DebugInfoVLA.ll -march=x86-64 -o - -verify-machineinstrs * Bad machine code: No instruction at def index * - function: foo - basic block: BB#0 return (0x10007e21f10) [0B;736B) - liverange: [128r,128d:9)[160r,160d:8)[176r,176d:7)[336r,336d:6)[464r,464d:5)[480r,480d:4)[624r,624d:3)[752r,752d:2)[768r,768d:1)[78 4r,784d:0) 0@784r 1@768r 2@752r 3@624r 4@480r 5@464r 6@336r 7@176r 8@160r 9@128r - register: %DS Valno #3 is defined at 624r * Bad machine code: Live segment doesn't end at a valid instruction * - function: foo - basic block: BB#0 return (0x10007e21f10) [0B;736B) - liverange: [128r,128d:9)[160r,160d:8)[176r,176d:7)[336r,336d:6)[464r,464d:5)[480r,480d:4)[624r,624d:3)[752r,752d:2)[768r,768d:1)[78 4r,784d:0) 0@784r 1@768r 2@752r 3@624r 4@480r 5@464r 6@336r 7@176r 8@160r 9@128r - register: %DS [624r,624d:3) LLVM ERROR: Found 2 machine code errors. where 624r corresponds exactly to the interval combining change: 624B %RSP<def> = COPY %vreg16; GR64:%vreg16 Considering merging %vreg16 with %RSP RHS = %vreg16 [608r,624r:0) 0@608r updated: 608B %RSP<def> = MOV64rm <fi#3>, 1, %noreg, 0, %noreg; mem:LD8[%saved_stack.1] Success: %vreg16 -> %RSP Result = %RSP llvm-svn: 226086	2015-01-15 03:08:59 +00:00
Chandler Carruth	a61ef990dd	Switch this header file to not hard-code Windows line endings. llvm-svn: 226081	2015-01-15 02:21:56 +00:00
Chandler Carruth	62d4215baa	[PM] Move TargetLibraryInfo into the Analysis library. While the term "Target" is in the name, it doesn't really have to do with the LLVM Target library -- this isn't an abstraction which LLVM targets generally need to implement or extend. It has much more to do with modeling the various runtime libraries on different OSes and with different runtime environments. The "target" in this sense is the more general sense of a target of cross compilation. This is in preparation for porting this analysis to the new pass manager. No functionality changed, and updates inbound for Clang and Polly. llvm-svn: 226078	2015-01-15 02:16:27 +00:00
NAKAMURA Takumi	95b3880dd0	Win64Exception.cpp: Try to fix crash for x64 EH. "Per" might be null there. llvm-svn: 226077	2015-01-15 02:15:21 +00:00
Sanjoy Das	8c252bde36	Fix PR22222 The bug was introduced in r225282. r225282 assumed that sub X, Y is the same as add X, -Y. This is not correct if we are going to upgrade the sub to sub nuw. This change fixes the issue by making the optimization ignore sub instructions. Differential Revision: http://reviews.llvm.org/D6979 llvm-svn: 226075	2015-01-15 01:46:09 +00:00
Hal Finkel	8299646236	[RegisterCoalescer] Remove copies to reserved registers This allows the RegisterCoalescer to join "non-flipped" range pairs with a physical destination register -- which allows the RegisterCoalescer to remove copies like this: <vreg> = something (maybe a load, for example) ... (things that don't use PHYSREG) PHYSREG = COPY <vreg> (with all of the restrictions normally applied by the RegisterCoalescer: having compatible register classes, etc. ) Previously, the RegisterCoalescer handled only the opposite case (copying from a physical register). I don't handle the problem fully here, but try to get the common case where there is only one use of <vreg> (the COPY). An upcoming commit to the PowerPC backend will make this pattern much more common on PPC64/ELF systems. llvm-svn: 226071	2015-01-15 01:25:28 +00:00
Hal Finkel	64202167c5	[PowerPC] Add assembler support for mcrfs and friends Fill out our support for the floating-point status and control register instructions (mcrfs and friends). As it turns out, these are necessary for compiling src/test/harness_fp.h in TBB for PowerPC. Thanks to Raf Schietekat for reporting the issue! llvm-svn: 226070	2015-01-15 01:00:53 +00:00
Richard Smith	e78bb1249e	For PR21145: recognise a builtin call to a known deallocation function even if it's defined in the current module. Clang generates this situation for the C++14 sized deallocation functions, because it generates a weak definition in case one isn't provided by the C++ runtime library. llvm-svn: 226069	2015-01-15 01:00:33 +00:00
Colin LeMahieu	8ffce23cda	[Hexagon] Replacing old versions of stores and loads. llvm-svn: 226065	2015-01-15 00:15:30 +00:00
Ramkumar Ramachandra	dba7329ebb	[GC] CodeGenPrep transform: simplify offsetable relocate The transform is somewhat involved, but the basic idea is simple: find derived pointers that have been offset from the base pointer using gep and replace the relocate of the derived pointer with a gep to the relocated base pointer (with the same offset). llvm-svn: 226060	2015-01-14 23:27:07 +00:00
Colin LeMahieu	c7522f31f1	[Hexagon] Replacing old version of convert and load f64. llvm-svn: 226057	2015-01-14 23:07:36 +00:00
Philip Reames	8d5d68f1aa	getMangledTypeStr: clarify how it mangles types, and add tests "Write a set of tests that show how name mangling is done for overloaded intrinsics." These happen to use gc.relocates to exercise the codepath in question, but is not a GC specific test. Patch by: artagnon@gmail.com Differential Revision: http://reviews.llvm.org/D6915 llvm-svn: 226056	2015-01-14 23:05:17 +00:00
NAKAMURA Takumi	a50a89a081	Update libdeps in NVPTXCodeGen, since r225944. llvm-svn: 226055	2015-01-14 23:01:36 +00:00
Reid Kleckner	e80a0a7572	Use MMI->getPersonality() instead of MMI->getPersonalities()[MMI->getPersonalityIndex()] Also nuke the comment about supporting multiple personalities in a single function, aka PR1414. That's just crazy. llvm-svn: 226052	2015-01-14 22:47:54 +00:00
Duncan P. N. Exon Smith	9885469922	IR: Move MDLocation into place This commit moves `MDLocation`, finishing off PR21433. There's an accompanying clang commit for frontend testcases. I'll attach the testcase upgrade script I used to PR21433 to help out-of-tree frontends/backends. This changes the schema for `DebugLoc` and `DILocation` from: !{i32 3, i32 7, !7, !8} to: !MDLocation(line: 3, column: 7, scope: !7, inlinedAt: !8) Note that empty fields (line/column: 0 and inlinedAt: null) don't get printed by the assembly writer. llvm-svn: 226048	2015-01-14 22:27:36 +00:00
Matthias Braun	96a319588a	MachineVerifier: Allow undef reads if a matching superreg is defined. Summary: Some pseudo instruction expansions break down a wide register use into multiple uses of smaller sub registers. If the super register was partially undefined the broken down sub registers may be completely undefined now leading to MachineVerifier complaints. Unfortunately liveness information to add the required dead flags is not easily (cheaply) available when expanding pseudo instructions. This commit changes the verifier to be quiet if there is an additional implicit use of a super register. Pseudo instruction expanders can use this to mark cases where partially defined values get potentially broken into completely undefined ones. Differential Revision: http://reviews.llvm.org/D6973 llvm-svn: 226047	2015-01-14 22:25:14 +00:00
Duncan P. N. Exon Smith	503cf3bff9	IR: Always print MDLocation line Print `MDLocation`'s `line` field even when it's 0. llvm-svn: 226046	2015-01-14 22:14:26 +00:00
Duncan P. N. Exon Smith	e16d587515	IR: Drop metadata references more aggressively during teardown Sometimes teardown happens before the debug info graph is complete (e.g., when clang throws an error). In that case, `MDNode`s will still have RAUW, so deleting constants that the `MDNode`s point at will be relatively expensive -- it'll cause re-uniquing all up the chain (what I've been referring to as "teardown madness"). So, drop references before deleting constants. We need to drop a few more references now: the metadata side of the metadata/value bridges needs to be dropped off the cliff along with the rest of it (previously, the bridges were cleaned before we did anything with the `MDNode`s). There's no real functionality change here -- state before and after `LLVMContextImpl::~LLVMContextImpl()` is unchanged -- so no testcase. llvm-svn: 226044	2015-01-14 21:58:17 +00:00
Rafael Espindola	fad1639a12	Don't create new comdats in CodeGen. This patch stops the implicit creation of comdats during codegen. Clang now sets the comdat explicitly when it is required. With this patch clang and gcc now produce the same result in pr19848. llvm-svn: 226038	2015-01-14 20:55:48 +00:00
Colin LeMahieu	11a34b385d	[Hexagon] Removing old, unused !tstbit instructions. llvm-svn: 226036	2015-01-14 20:26:15 +00:00
Chandler Carruth	e3288147f0	[MBP] Add flags to disable the BadCFGConflict check in MachineBlockPlacement. Some benchmarks have shown that this could lead to a potential performance benefit, and so adding some flags to try to help measure the difference. A possible explanation. In diamond-shaped CFGs (A followed by either B or C both followed by D), putting B and C both in between A and D leads to the code being less dense than it could be. Always either B or C have to be skipped increasing the chance of cache misses etc. Moving either B or C to after D might be beneficial on average. In the long run, but we should probably do a better job of analyzing the basic block and branch probabilities to move the correct one of B or C to after D. But even if we don't use this in the long run, it is a good baseline for benchmarking. Original patch authored by Daniel Jasper with test tweaks and a second flag added by me. Differential Revision: http://reviews.llvm.org/D6969 llvm-svn: 226034	2015-01-14 20:19:29 +00:00
Bill Schmidt	082cfc05f1	[PPC64] Add support for the ICBT instruction on POWER8. Patch by Kit Barton. Support for the ICBT instruction is currently present, but limited to embedded processors. This change adds a new FeatureICBT that can be used to identify whether the ICBT instruction is available on a specific processor. Two new tests are added: * Positive test to ensure the icbt instruction is present when using -mcpu=pwr8 * Negative test to ensure the icbt instruction is not generated when using -mcpu=pwr7 Both test cases use the Prefetch opcode in LLVM. They are based on the ppc64-prefetch.ll test case. llvm-svn: 226033	2015-01-14 20:17:10 +00:00
Duncan P. N. Exon Smith	4a4f78583e	IR: Fix a use-after-free in RAUW Happened pretty commonly during `LLVMContext` teardown when `clang -g` hit an error. This fixes the use-after-free. Next I'll clean up teardown so that it's not RAUW'ing when metadata-tracked values are deleted (only really causes a problem if the graph is mid-construction when teardown starts, but it's still unnecessary work). llvm-svn: 226029	2015-01-14 19:56:10 +00:00
David Majnemer	a0afb55ff9	InstCombine: Don't take A-B<0 into A<B if A-B has other uses This fixes PR22226. llvm-svn: 226023	2015-01-14 19:26:56 +00:00
Rafael Espindola	7244bb3c17	Revert "Add r224985 back with two fixes." This reverts commit r225644 while I debug a regression. llvm-svn: 226022	2015-01-14 19:07:23 +00:00
Reid Kleckner	9b5eaf0d5a	Emit the Itanium LSDA for unknown EH personalities on Win64 This fixes lots of generic CodeGen tests that use __gcc_personality_v0. This suggests that using ExceptionHandling::MSVC was a mistake, and we should instead classify each function by personality function. This would, for example, allow us to LTO a binary containing uses of SEH and Itanium EH. llvm-svn: 226019	2015-01-14 18:50:10 +00:00
Reid Kleckner	b57c1dc0f7	Remove dead code for llvm.eh.selector in the old EH model llvm-svn: 226018	2015-01-14 18:49:39 +00:00
Colin LeMahieu	c91fabc233	[Hexagon] Removing old versions of cmph and updating references. llvm-svn: 226013	2015-01-14 18:26:14 +00:00
Rafael Espindola	4e74d3be35	Add support for comdats with names larger than 256 characters. llvm-svn: 226012	2015-01-14 18:25:45 +00:00
Colin LeMahieu	ffacc6eac6	[Hexagon] Removing old versions of cmpb and updating references. llvm-svn: 226006	2015-01-14 18:05:44 +00:00
Colin LeMahieu	fa947906bf	[Hexagon] Deleting versions of compare-not that don't have encoding information. Updating references. llvm-svn: 226003	2015-01-14 16:49:12 +00:00
Tom Stellard	0febe685ed	R600/SI: Use IMPLICIT_DEF and KILL when failing to spill VGPRs This helps us avoid 'invalid register class for operand' verifier errors. llvm-svn: 225989	2015-01-14 15:42:34 +00:00
Tom Stellard	42fb60e1a7	R600/SI: Spill VGPRs to scratch space for compute shaders llvm-svn: 225988	2015-01-14 15:42:31 +00:00
Olivier Sallenave	c8d13bd370	Override the TLI callback enableAggressiveFMAFusion and return true. Indeed, fmul, fmadd and fadd nodes cost the same number of cycles, so we can enable more combining heuristics to produce more fmadd nodes. llvm-svn: 225984	2015-01-14 14:47:24 +00:00
Erik Eckstein	13c4ab89ba	reapply: SLPVectorizer: Cache results from memory alias checking. This speeds up the dependency calculations for blocks with many load/store/call instructions. Beside the improved runtime, there is no functional change. Compared to the original commit, this re-applied commit contains a bug fix which ensures that there are no incorrect collisions in the alias cache. llvm-svn: 225977	2015-01-14 11:24:47 +00:00
Chandler Carruth	d9903888d9	[cleanup] Re-sort all the #include lines in LLVM using utils/sort_includes.py. I clearly haven't done this in a while, so more changed than usual. This even uncovered a missing include from the InstrProf library that I've added. No functionality changed here, just mechanical cleanup of the include order. llvm-svn: 225974	2015-01-14 11:23:27 +00:00
Jyoti Allur	5a1391410d	Correct POP handling for v7m llvm-svn: 225972	2015-01-14 10:48:16 +00:00
Chandler Carruth	64764b446b	[PM] Port domtree to the new pass manager (at last). This adds the domtree analysis to the new pass manager. The analysis returns the same DominatorTree result entity used by the old pass manager and essentially all of the code is shared. We just have different boilerplate for running and printing the analysis. I've converted one test to run in both modes just to make sure this is exercised while both are live in the tree. llvm-svn: 225969	2015-01-14 10:19:28 +00:00
Kai Nacke	755b6e8a42	[mips] Refine octeon instructions seq/seqi/sne/snei This commit refines the pattern for the octeon seq/seqi/sne/snei instructions. The target register is set to 0 or 1 according to the result of the comparison. In C, this is something like rd = (unsigned long)(rs == rt) This commit adds a zext to bring the result to i64. With this change the instruction is selected for this type of code. (gcc produces the same code for the above C code.) llvm-svn: 225968	2015-01-14 10:19:09 +00:00
Brad Smith	dd6675cef9	Use the integrated assembler by default on SPARC. llvm-svn: 225957	2015-01-14 07:53:39 +00:00
David Majnemer	7efc6139d9	Use the operand vector instead so inline assembly can be validated too The buildbots got upset after r225941, this should hopefully fix things. llvm-svn: 225954	2015-01-14 06:14:36 +00:00
Mehdi Amini	d8976b8ed3	SelectionDAG: add a -filter-view-dags option to llc This option takes the name of the basic block you want to visualize with -view-*-dags Differential Revision: http://reviews.llvm.org/D6948 llvm-svn: 225953	2015-01-14 06:03:18 +00:00
Mehdi Amini	648eff1695	DAG Combiner: Fold SelectCC When Cond is UNDEF In case folding a node end up with a NaN as operand for the select, the folding of the condition of the selectcc node returns "UNDEF". Differential Revision: http://reviews.llvm.org/D6889 llvm-svn: 225952	2015-01-14 05:45:24 +00:00
Mehdi Amini	7b068f6ba4	Add assertions for out of bound index in ComputeLinearIndex llvm-svn: 225951	2015-01-14 05:38:48 +00:00
Saleem Abdulrasool	aa32297fb8	X86: only access operands if they are present If there is no associated immediate (MS style inline asm), do not try to access the operand, assume that it is valid. This should fix the buildbots after SVN r225941. llvm-svn: 225950	2015-01-14 05:37:10 +00:00
Mehdi Amini	8923cc5470	Fold a loop for array processing in ComputeLinearIndex When processing an array, every Elt has the same layout, it is useless to recursively call each ComputeLinearIndex on each element. Just do it once and multiply by the number of elements. Differential Revision: http://reviews.llvm.org/D6832 llvm-svn: 225949	2015-01-14 05:33:01 +00:00
JF Bastien	eeea8970b4	Revert "Insert random noops to increase security against ROP attacks (llvm)" This reverts commit: http://reviews.llvm.org/D3392 llvm-svn: 225948	2015-01-14 05:24:33 +00:00
Duncan P. N. Exon Smith	9f6bddd4b2	NVPTX: Use MapMetadata() instead of custom/stale/untested logic Copy the `GVMap` over to a standard `ValueToValueMapTy` so that we can reuse the `MapMetadata()` logic. Unfortunately the `GVMap` can't just be replaced, since `MapMetadata()` likes to modify the map, but at least this will prevent NVPTX from bitrotting. llvm-svn: 225944	2015-01-14 05:14:30 +00:00
Duncan P. N. Exon Smith	f864ae2745	NVPTX: Remove bogus remap logic for global variable address spaces The comment is incorrect, and the code mangles debug info. Remove the bad logic, which wasn't tested anyway. llvm-svn: 225943	2015-01-14 05:13:18 +00:00
Saleem Abdulrasool	ca24b1d638	X86: validate 'int' instruction The int instruction takes as an operand an 8-bit immediate value. Validate that the input is valid rather than silently truncating the value. llvm-svn: 225941	2015-01-14 05:10:21 +00:00
Hao Liu	e28d154cd5	Fix a wrong comment in LoopVectorize. I.E. more than two -> exactly two Fix a typo function name in LoopVectorize. I.E. collectStrideAcccess() -> collectStrideAccess() llvm-svn: 225935	2015-01-14 03:02:16 +00:00
Duncan P. N. Exon Smith	e65b0663e6	Remove trailing slash from r225924 llvm-svn: 225929	2015-01-14 01:42:43 +00:00
Matt Arsenault	e698663687	R600/SI: Fix bad code with unaligned byte vector loads Don't do the v4i8 -> v4f32 combine if the load will need to be expanded due to alignment. This stops adding instructions to repack into a single register that the v_cvt_ubyteN_f32 instructions read. llvm-svn: 225926	2015-01-14 01:35:22 +00:00
Matt Arsenault	bd22342322	Implement new way of expanding extloads. Now that the source and destination types can be specified, allow doing an expansion that doesn't use an EXTLOAD of the result type. Try to do a legal extload to an intermediate type and extend that if possible. This generalizes the special case custom lowering of extloads R600 has been using to work around this problem. This also happens to fix a bug that would incorrectly use more aligned loads than should be used. llvm-svn: 225925	2015-01-14 01:35:17 +00:00
Duncan P. N. Exon Smith	e54cd9a6f3	Utils: Remove unreachable break, NFC llvm-svn: 225924	2015-01-14 01:31:34 +00:00
Duncan P. N. Exon Smith	a5a0f5766a	Utils: Handle remapping distinct MDLocations Part of PR21433. llvm-svn: 225921	2015-01-14 01:29:32 +00:00
Duncan P. N. Exon Smith	b84840c04e	Utils: Thread distinct-ness through the cloneMD*() functions, NFC The new logic isn't actually reachable yet, so no functionality change. llvm-svn: 225918	2015-01-14 01:24:38 +00:00
Duncan P. N. Exon Smith	7c69c1ebda	Utils: Extract cloneMDNode(), NFC llvm-svn: 225917	2015-01-14 01:22:47 +00:00
Duncan P. N. Exon Smith	b6515d6a71	Utils: Move cloneMD*() up, NFC llvm-svn: 225915	2015-01-14 01:21:24 +00:00
Duncan P. N. Exon Smith	47d82981d6	Utils: Add mapping for uniqued MDLocations Still doesn't handle distinct ones. Part of PR21433. llvm-svn: 225914	2015-01-14 01:20:27 +00:00
Tom Stellard	ae38f30d7b	R600/SI: Define a schedule model The machine scheduler is still disabled by default. The schedule model is not complete yet, and could be improved. llvm-svn: 225913	2015-01-14 01:13:19 +00:00
Duncan P. N. Exon Smith	4766e01250	Utils: Extract cloneMDTuple(), NFC llvm-svn: 225912	2015-01-14 01:12:14 +00:00
Duncan P. N. Exon Smith	fb9d128ab1	Utils: Extract shouldRemapUniquedNode(), NFC llvm-svn: 225911	2015-01-14 01:08:47 +00:00
Hal Finkel	934361a4b8	Revert "r225811 - Revert "r225808 - [PowerPC] Add StackMap/PatchPoint support"" This re-applies r225808, fixed to avoid problems with SDAG dependencies along with the preceding fix to ScheduleDAGSDNodes::RegDefIter::InitNodeNumDefs. These problems caused the original regression tests to assert/segfault on many (but not all) systems. Original commit message: This commit does two things: 1. Refactors PPCFastISel to use more of the common infrastructure for call lowering (this lets us take advantage of this common code for lowering some common intrinsics, stackmap/patchpoint among them). 2. Adds support for stackmap/patchpoint lowering. For the most part, this is very similar to the support in the AArch64 target, with the obvious differences (different registers, NOP instructions, etc.). The test cases are adapted from the AArch64 test cases. One difference of note is that the patchpoint call sequence takes 24 bytes, so you can't use less than that (on AArch64 you can go down to 16). Also, as noted in the docs, we take the patchpoint address to be the actual code address (assuming the call is local in the TOC-sharing sense), which should yield higher performance than generating the full cross-DSO indirect-call sequence and is likely just as useful for JITed code (if not, we'll change it). StackMaps and Patchpoints are still marked as experimental, and so this support is doubly experimental. So go ahead and experiment! llvm-svn: 225909	2015-01-14 01:07:51 +00:00
JF Bastien	dcdd5ad252	Insert random noops to increase security against ROP attacks (llvm) A pass that adds random noops to X86 binaries to introduce diversity with the goal of increasing security against most return-oriented programming attacks. Command line options: -noop-insertion // Enable noop insertion. -noop-insertion-percentage=X // X% of assembly instructions will have a noop prepended (default: 50%, requires -noop-insertion) -max-noops-per-instruction=X // Randomly generate X noops per instruction. ie. roll the dice X times with probability set above (default: 1). This doesn't guarantee X noop instructions. In addition, the following 'quick switch' in clang enables basic diversity using default settings (currently: noop insertion and schedule randomization; it is intended to be extended in the future). -fdiversify This is the llvm part of the patch. clang part: D3393 http://reviews.llvm.org/D3392 Patch by Stephen Crane (@rinon) llvm-svn: 225908	2015-01-14 01:07:26 +00:00
Hal Finkel	665026838b	Adjust ScheduleDAGSDNodes::RegDefIter for patchpoints PATCHPOINT is a strange pseudo-instruction. Depending on how it is used, and whether or not the AnyReg calling convention is being used, it might or might not define a value. However, its TableGen definition says that it defines one value, and so when it doesn't, the code in ScheduleDAGSDNodes::RegDefIter becomes confused and the code that uses the RegDefIter will try to get the register class of the MVT::Other type associated with the PATCHPOINT's chain result (under certain circumstances). This will be covered by the PPC64 PatchPoint test cases once that support is re-committed. llvm-svn: 225907	2015-01-14 01:07:03 +00:00
Duncan P. N. Exon Smith	637e765907	Utils: Simplify code, NFC llvm-svn: 225906	2015-01-14 01:07:03 +00:00
Duncan P. N. Exon Smith	b557989a40	Utils: Extract mapUniquedNode(), NFC llvm-svn: 225905	2015-01-14 01:06:21 +00:00
Reid Kleckner	0a57f65514	CodeGen support for x86_64 SEH catch handlers in LLVM This adds handling for ExceptionHandling::MSVC, used by the x86_64-pc-windows-msvc triple. It assumes that filter functions have already been outlined in either the frontend or the backend. Filter functions are used in place of the landingpad catch clause type info operands. In catch clause order, the first filter to return true will catch the exception. The C specific handler table expects the landing pad to be split into one block per handler, but LLVM IR uses a single landing pad for all possible unwind actions. This patch papers over the mismatch by synthesizing single instruction BBs for every catch clause to fill in the EH selector that the landing pad block expects. Missing functionality: - Accessing data in the parent frame from outlined filters - Cleanups (from __finally) are unsupported, as they will require outlining and parent frame access - Filter clauses are unsupported, as there's no clear analogue in SEH In other words, this is the minimal set of changes needed to write IR to catch arbitrary exceptions and resume normal execution. Reviewers: majnemer Differential Revision: http://reviews.llvm.org/D6300 llvm-svn: 225904	2015-01-14 01:05:27 +00:00
Duncan P. N. Exon Smith	8725ca8c60	Utils: MDNode => UniquableMDNode, NFC Although this makes the `cast<>` assert more often, the `assert(Node->isResolved())` on the following line would assert in all those cases. So, no functionality change here. llvm-svn: 225903	2015-01-14 01:05:17 +00:00
Duncan P. N. Exon Smith	14cc94c1c6	Utils: Separate out mapDistinctNode(), NFC llvm-svn: 225902	2015-01-14 01:03:05 +00:00
Duncan P. N. Exon Smith	3956a85e6e	Utils: Use helper function directly, NFC llvm-svn: 225901	2015-01-14 01:02:17 +00:00
Adrian Prantl	7813d9c979	Debug Info: Implement DwarfCompileUnit::addComplexAddress() using DIEDwarfExpression (and get rid of a bunch of redundant code). NFC llvm-svn: 225900	2015-01-14 01:01:30 +00:00
Adrian Prantl	ad768c3719	Debug Info: Emitting a register in DwarfExpression may fail. Report the status in a bool and let the users deal with the error. NFC. llvm-svn: 225899	2015-01-14 01:01:28 +00:00
Adrian Prantl	658676c3ea	Debug Info: Move DIEDwarfExpression into DwarfExpression.h because it needs to be accessed from both DwarfCompileUnit.cpp and DwarfUnit.cpp. NFC. llvm-svn: 225898	2015-01-14 01:01:22 +00:00
Duncan P. N. Exon Smith	077affdbb9	Utils: Extract helper function, NFC llvm-svn: 225897	2015-01-14 01:01:19 +00:00
Duncan P. N. Exon Smith	34651ee2f6	Utils: Use MDTuple::get() directly, NFC Working towards supporting `MDLocation` in `MapMetadata()`. llvm-svn: 225896	2015-01-14 00:59:57 +00:00
Ahmed Bougacha	71d7b18e3d	[SimplifyLibCalls] Don't try to simplify indirect calls. It turns out, all callsites of the simplifier are guarded by a check for CallInst::getCalledFunction (i.e., to make sure the callee is direct). This check wasn't done when trying to further optimize a simplified fortified libcall, introduced by a refactoring in r225640. Fix that, add a testcase, and document the requirement. llvm-svn: 225895	2015-01-14 00:55:05 +00:00
Eric Christopher	16370678e3	Remove unused predicate. llvm-svn: 225893	2015-01-14 00:50:33 +00:00
Eric Christopher	6e30cd95cb	Migrate ABIName to MCTargetOptions so that it can be shared between the TargetMachine level and the MC level. llvm-svn: 225891	2015-01-14 00:50:31 +00:00
Chandler Carruth	11f5032368	Revert r225854: [PM] Move the LazyCallGraph printing functionality to a print method. This was formulated on a bad idea, but sadly I didn't uncover how bad this was until I got further down the path. I had hoped that we could provide a low boilerplate way of printing analyses, but it just doesn't seem like this really fits the needs of the analyses. Not all analyses really want to do printing, and those that do don't all use the same interface. Instead, with the new pass manager let's just take advantage of the fact that creating an explicit printer pass like the LCG has is pretty low boilerplate already and rely on that for testing. llvm-svn: 225861	2015-01-14 00:27:45 +00:00
Adrian Prantl	8efadbf868	Debug Info: Don't bother emitting DW_AT_frame_base if the function has no frame register. "Tested" via an assertion triggered by DwarfExpression. llvm-svn: 225858	2015-01-14 00:15:16 +00:00
Adrian Prantl	1411577ad9	Revert "Debug Info: Bail out of AddMachineRegPiece() if MachineReg is not a" This reverts commit r225852, it was a bad idea. MachineReg should always be a physical register. If it isn't this DebugLoc shouldn't have been created in the first place. llvm-svn: 225857	2015-01-14 00:15:12 +00:00

... 4 5 6 7 8 ...

76152 Commits