llvm-project

Commit Graph

Author	SHA1	Message	Date
David Majnemer	45ebda4278	[Verifier] Don't abort on invalid cleanuprets Code in visitEHPadPredecessors assume a little too much about the validity of a cleanupret with an invalid cleanuppad operand. llvm-svn: 262364	2016-03-01 18:59:50 +00:00
Easwaran Raman	8832e5e2f5	Fix breakage caused by r262360. llvm-svn: 262363	2016-03-01 18:59:11 +00:00
Daniel Berlin	83fc77b4c0	Add the beginnings of an update API for preserving MemorySSA Summary: This adds the beginning of an update API to preserve MemorySSA. In particular, this patch adds a way to remove memory SSA accesses when instructions are deleted. It also adds relevant unit testing infrastructure for MemorySSA's API. (There is an actual user of this API, i will make that diff dependent on this one. In practice, a ton of opt passes remove memory instructions, so it's hopefully an obviously useful API :P) Reviewers: hfinkel, reames, george.burgess.iv Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D17157 llvm-svn: 262362	2016-03-01 18:46:54 +00:00
Simon Atanasyan	f69c7e5382	[DebugInfo] Dump CIE augmentation data as a list of hex bytes CIE augmentation data might contain non-printable characters. The patch prints the data as a list of hex bytes. Differential Revision: http://reviews.llvm.org/D17759 llvm-svn: 262361	2016-03-01 18:38:05 +00:00
Easwaran Raman	7c4f25d2ed	Metadata support for profile summary. This adds support to convert ProfileSummary object to Metadata and create a ProfileSummary object from metadata. This would allow attaching profile summary information to Module allowing optimization passes to use it. llvm-svn: 262360	2016-03-01 18:30:58 +00:00
Matt Arsenault	03dac8d8e4	DAGCombiner: Turn extract of bitcasted integer into truncate This reduces the number of bitcast nodes and generally cleans up the DAG when bitcasting between integers and vectors everywhere. llvm-svn: 262358	2016-03-01 18:01:37 +00:00
Matt Arsenault	e55c1658ea	Add isScalarInteger helper to EVT/MVT llvm-svn: 262357	2016-03-01 18:01:28 +00:00
Changpeng Fang	24f035af32	AMDGPU/SI: Implement DS_PERMUTE/DS_BPERMUTE Instruction Definitions and Intrinsics Summary: This patch impleemnts DS_PERMUTE/DS_BPERMUTE instruction definitions and intrinsics, which are new since VI. Reviewers: tstellarAMD, arsenm Subscribers: llvm-commits, arsenm Differential Revision: http://reviews.llvm.org/D17614 llvm-svn: 262356	2016-03-01 17:51:23 +00:00
Kostya Serebryany	f84df30e4f	[libFuzzer] remove FuzzerSanitizerOptions.cpp llvm-svn: 262354	2016-03-01 17:46:32 +00:00
Michael Zuckerman	433b241570	[LLVM][AVX512] PSRL{DI\|QI} Change imm8 to int Differential Revision: http://reviews.llvm.org/D17713 llvm-svn: 262353	2016-03-01 17:46:32 +00:00
Hans Wennborg	e64cf9dddb	[X86] Check that attribute parameters match for tail calls (PR26590) In the code below on 32-bit targets, x would previously get forwarded to g() without sign-extension to 32 bits as required by the parameter attribute. void g(signed short); void f(unsigned short x) { g(x); } llvm-svn: 262352	2016-03-01 17:45:23 +00:00
Sanjay Patel	2ca144f14c	fix documentation comments; NFC llvm-svn: 262351	2016-03-01 17:25:35 +00:00
Petar Jovanovic	6315f3f9b7	Revert "calculate builtin_object_size if argument is a removable pointer" Revert r262337 as "check-llvm ubsan" step failed on sanitizer-x86_64-linux-fast buildbot. llvm-svn: 262349	2016-03-01 16:50:08 +00:00
Sanjay Patel	9fea531fec	function names start with a lowercase letter; NFC llvm-svn: 262347	2016-03-01 16:17:48 +00:00
Nikolay Haustov	e309e1415d	[AMDGPU] Remove unused disassembler code. llvm-svn: 262346	2016-03-01 16:02:40 +00:00
Rafael Espindola	5cd721ae12	Refactor duplicated code for linking with pthread. llvm-svn: 262344	2016-03-01 15:54:40 +00:00
Nikolay Haustov	47a115cd41	[AMDGPU] Fix build warnings. llvm-svn: 262338	2016-03-01 14:50:59 +00:00
Petar Jovanovic	8aef99aa86	calculate builtin_object_size if argument is a removable pointer This patch fixes calculating correct value for builtin_object_size function when pointer is used only in builtin_object_size function call and never after that. Patch by Strahinja Petrovic. Differential Revision: http://reviews.llvm.org/D17337 llvm-svn: 262337	2016-03-01 14:39:55 +00:00
Nikolay Haustov	ac106add0f	[AMDGPU] Disassembler code refactored + error messages. Idea behind this change is to make code shorter and as much common for all targets as possible. Let's even accept more code than is valid for a particular target, leaving it for the assembler to sort out. 64bit instructions decoding added. Error\warning messages on unrecognized instructions operands added, InstPrinter allowed to print invalid operands helping to find invalid/unsupported code. The change is massive and hard to compare with previous version, so it makes sense just to take a look on the new version. As a bonus, with a few TD changes following, it disassembles the majority of instructions. Currently it fully disassembles >300K binary source of some blas kernel. Previous TODOs were saved whenever possible. Patch by: Valery Pykhtin Differential Revision: http://reviews.llvm.org/D17720 llvm-svn: 262332	2016-03-01 13:57:29 +00:00
Petr Pavlu	7ad9ec9fcf	[LTO] Fix error reporting from lto_module_create_in_local_context() Function lto_module_create_in_local_context() would previously rely on the default LLVMContext being created for it by LTOModule::makeLTOModule(). This context exits the program on error and is not arranged to update sLastStringError in tools/lto/lto.cpp. Function lto_module_create_in_local_context() now creates an LLVMContext by itself, sets it up correctly to its needs and then passes it to LTOModule::createInLocalContext() which takes ownership of the context and keeps it present for the lifetime of the returned LTOModule. Function LTOModule::makeLTOModule() is modified to take a reference to LLVMContext (instead of a pointer) and no longer creates a default context when nullptr is passed to it. Method LTOModule::createInContext() that takes a pointer to LLVMContext is removed because it allows to pass a nullptr to it. Instead LTOModule::createFromBuffer() (that takes a reference to LLVMContext) should be used. Differential Revision: http://reviews.llvm.org/D17715 llvm-svn: 262330	2016-03-01 13:13:49 +00:00
Michael Zuckerman	7878888690	[AVX512][PSRAQ][PSRAD] Change imm8 to int. Differential Revision: http://reviews.llvm.org/D17692 llvm-svn: 262320	2016-03-01 11:36:23 +00:00
Amjad Aboud	719325fe11	Disallow generating vzeroupper before return instruction (iret) in interrupt handler function. This resolves https://llvm.org/bugs/show_bug.cgi?id=26412 Differential Revision: http://reviews.llvm.org/D17542 llvm-svn: 262319	2016-03-01 11:32:03 +00:00
Simon Atanasyan	255689c3ad	[MC][YAML] Rangify the loop. NFC llvm-svn: 262317	2016-03-01 10:11:27 +00:00
Vasileios Kalintiris	3a8f7f9e31	[mips] Promote the result of SETCC nodes to GPR width. Summary: This patch modifies the existing comparison, branch, conditional-move and select patterns, and adds new ones where needed. Also, the updated SLT{u,i,iu} set of instructions generate a GPR width result. The majority of the code changes in the Mips back-end fix the wrong assumption that the result of SETCC nodes always produce an i32 value. The changes in the common code path account for the fact that in 64-bit MIPS targets, i1 is promoted to i32 instead of i64. Reviewers: dsanders Subscribers: dsanders, llvm-commits Differential Revision: http://reviews.llvm.org/D10970 llvm-svn: 262316	2016-03-01 10:08:01 +00:00
Nikolay Haustov	ea8febde04	[TableGen] AsmMatcher: Skip optional operands in the midle of instruction if it is not present Previosy, if actual instruction have one of optional operands then other optional operands listed before this also should be presented. For example instruction v_fract_f32 v0, v1, mul:2 have one optional operand - OMod and do not have optional operand clamp. Previously this was not allowed because clamp is listed before omod in AsmString: string AsmString = "v_fract_f32$vdst, $src0_modifiers$clamp$omod"; Making this work required some hacks (both OMod and Clamp match classes have same PredicateMethod). Now, if MatchInstructionImpl meets formal optional operand that is not presented in actual instruction it skips this formal operand and tries to match current actual operand with next formal. Patch by: Sam Kolton Review: http://reviews.llvm.org/D17568 [AMDGPU] Assembler: Check immediate types for several optional operands in predicate methods With this change you should place optional operands in order specified by asm string: clamp -> omod offset -> glc -> slc -> tfe Fixes for several tests. Depends on D17568 Patch by: Sam Kolton Review: http://reviews.llvm.org/D17644 llvm-svn: 262314	2016-03-01 08:34:43 +00:00
Nikolay Haustov	95b4fcd377	AsmParser: Fix nested .irp/.irpc Count .irp/.irpc in parseMacroLikeBody similar to .rept Update tests. Review: http://reviews.llvm.org/D17707 llvm-svn: 262313	2016-03-01 08:18:28 +00:00
Craig Topper	073e947f1b	[X86] Centralize the masking of TSFlags with FormMask into a variable earlier so we can stop masking in multiple places. NFC llvm-svn: 262312	2016-03-01 07:15:59 +00:00
Craig Topper	5c8dc5f064	[X86] Localize a temporary variable into the cases its need in. NFC llvm-svn: 262310	2016-03-01 06:42:48 +00:00
Craig Topper	b8c29b4ae9	[X86] Be consistent about using pre/post increment/decrement in nearby code. NFC llvm-svn: 262309	2016-03-01 06:42:46 +00:00
Craig Topper	d40a55064f	[X86] Combine some initialization code with variable declaration and comments. NFC llvm-svn: 262301	2016-03-01 05:42:16 +00:00
Matt Arsenault	a67c4916cf	LegalizeDAG: Use correct ptr type when expanding unaligned load/store This fixes regressions exposed in existing AMDGPU tests in a future commit when all loads are custom lowered. llvm-svn: 262299	2016-03-01 05:13:35 +00:00
Matt Arsenault	d275fcabcb	AMDGPU: Don't emit build_pair during udivrem legalization Technically you aren't supposed to emit these after type legalization for some reason, and we use vector extracts of bitcasted integers as the canonical way to do this. llvm-svn: 262298	2016-03-01 05:06:05 +00:00
Matt Arsenault	f4dfc1a027	AMDGPU: Don't use estimated stack size when we know the real stack size llvm-svn: 262297	2016-03-01 04:58:20 +00:00
Matt Arsenault	59b8b77405	AMDGPU: Set HasExtractBitInsn This currently does not have the control over the bitwidth, and there are missing optimizations to reduce the integer to 32-bit if it can be. But in most situations we do want the sinking to occur. llvm-svn: 262296	2016-03-01 04:58:17 +00:00
David Majnemer	cb305dea1c	[WinEH] Allocate the registration node before the catch objects The CatchObjOffset is relative to the end of the EH registration node for 32-bit x86 WinEH targets. A special sentinel value, 0, is used to indicate that no catch object should be initialized. This means that a catch object allocated immediately before the registration node would be assigned a CatchObjOffset of 0, leading the runtime to believe that a catch object should not be initialized. To handle this, allocate the registration node prior to any other frame object. This will ensure that catch objects will not be allocated before the registration node. This fixes PR26757. Differential Revision: http://reviews.llvm.org/D17689 llvm-svn: 262294	2016-03-01 04:30:16 +00:00
David Majnemer	f08579f5a8	[Verifier] Diagnose when unwinding out of cycles of blocks Generally speaking, this can only happen with unreachable code. However, neglecting to check for this condition would lead us to loop forever. llvm-svn: 262284	2016-03-01 01:19:05 +00:00
Adam Nemet	948775196d	[LLE] Add testcase for the fix in r262267 llvm-svn: 262280	2016-03-01 00:50:14 +00:00
Adam Nemet	b8486e5a32	[LAA] Add missing debug output llvm-svn: 262279	2016-03-01 00:50:08 +00:00
Sanjay Patel	6f2c01f712	[x86, InstCombine] transform more x86 masked loads to LLVM intrinsics Continuation of: http://reviews.llvm.org/rL262269 llvm-svn: 262273	2016-02-29 23:59:00 +00:00
Adam Nemet	efc091f457	[LLE] Fix a comment llvm-svn: 262270	2016-02-29 23:21:12 +00:00
Sanjay Patel	98a71505f5	[x86, InstCombine] transform x86 AVX masked loads to LLVM intrinsics The intended effect of this patch in conjunction with: http://reviews.llvm.org/rL259392 http://reviews.llvm.org/rL260145 is that customers using the AVX intrinsics in C will benefit from combines when the load mask is constant: __m128 mload_zeros(float f) { return _mm_maskload_ps(f, _mm_set1_epi32(0)); } __m128 mload_fakeones(float f) { return _mm_maskload_ps(f, _mm_set1_epi32(1)); } __m128 mload_ones(float f) { return _mm_maskload_ps(f, _mm_set1_epi32(0x80000000)); } __m128 mload_oneset(float f) { return _mm_maskload_ps(f, _mm_set_epi32(0x80000000, 0, 0, 0)); } ...so none of the above will actually generate a masked load for optimized code. This is the masked load counterpart to: http://reviews.llvm.org/rL262064 llvm-svn: 262269	2016-02-29 23:16:48 +00:00
David Majnemer	fe2f7f367a	[Verifier] Handle more funclet edge cases This change makes the verifier a little more paranoid. It was possible to trick the verifier into crashing or infinite looping. llvm-svn: 262268	2016-02-29 22:56:36 +00:00
Adam Nemet	83be06e529	[LLE] Fix SingleSource/Benchmarks/Polybench/stencils/jacobi-2d-imper with Polly We can actually have dependences between accesses with different underlying types. Bail in this case. A test will follow shortly. llvm-svn: 262267	2016-02-29 22:53:59 +00:00
Eric Christopher	114fa1c3f6	Simplify some boolean conditional return statements in AArch64. http://reviews.llvm.org/D9979 Patch by Richard Thomson (and some conflict resolution by me). llvm-svn: 262266	2016-02-29 22:50:49 +00:00
Adrian Prantl	dba58fbdd9	Improve the debug output of DwarfDebug::buildLocationList(). llvm-svn: 262265	2016-02-29 22:28:22 +00:00
Adrian Prantl	a349714bf9	Document an anomaly in this testcase. llvm-svn: 262264	2016-02-29 22:28:16 +00:00
Paul Robinson	a908e7bd4d	Reapply r262092: [FileCheck] Abort if -NOT is combined with another suffix. Combinations of suffixes that look useful are actually ignored; complaining about them will avoid mistakes. Differential Revision: http://reviews.llvm.org/D17587 llvm-svn: 262263	2016-02-29 22:13:03 +00:00
Sanjoy Das	999dc75c12	[Verifier] Minor fix to error message; NFC llvm-svn: 262262	2016-02-29 22:04:25 +00:00
Colin LeMahieu	ab9eca4d9f	[Hexagon] As a size optimization, not lazy extending TPREL or DTPREL variants since they're usually in range. llvm-svn: 262258	2016-02-29 21:21:56 +00:00
Colin LeMahieu	9e5a9c32db	[Hexagon] Missed member initialization causing ubsan failure. llvm-svn: 262252	2016-02-29 20:42:25 +00:00
Adam Nemet	dd9e637aca	Enable LoopLoadElimination by default Summary: I re-benchmarked this and results are similar to original results in D13259: On ARM64: SingleSource/Benchmarks/Polybench/linear-algebra/solvers/dynprog -59.27% SingleSource/Benchmarks/Polybench/stencils/adi -19.78% On x86: SingleSource/Benchmarks/Polybench/linear-algebra/solvers/dynprog -27.14% And of course the original ~20% gain on SPECint_2006/456.hmmer with Loop Distribution. In terms of compile time, there is ~5% increase on both SingleSource/Benchmarks/Misc/oourafft and SingleSource/Benchmarks/Linkpack/linkpack-pc. These are both very tiny loop-intensive programs where SCEV computations dominates compile time. The reason that time spent in SCEV increases has to do with the design of the old pass manager. If a transform pass does not preserve an analysis we invalidate the analysis even if there was no modification made by the transform pass. This means that currently we don't take advantage of LLE and LV sharing the same analysis (LAA) and unfortunately we recompute LAA and SCEV for LLE. (There should be a way to work around this limitation in the case of SCEV and LAA since both compute things on demand and internally cache their result. Thus we could pretend that transform passes preserve these analyses and manually invalidate them upon actual modification. On the other hand the new pass manager is supposed to solve so I am not sure if this is worthwhile.) Reviewers: hfinkel, dberlin Subscribers: dberlin, reames, mssimpso, aemerson, joker.eph, llvm-commits Differential Revision: http://reviews.llvm.org/D16300 llvm-svn: 262250	2016-02-29 20:35:11 +00:00
Adrian Prantl	c0a85eca6c	Fixup MIPS testcase after r262247 and make it a little more robust. llvm-svn: 262249	2016-02-29 20:25:10 +00:00
Geoff Berry	f5ba61d18c	[AArch64] Fix isLegalAddImmediate() to return true for valid negative values. Reviewers: t.p.northover, jmolloy Subscribers: mcrosier, aemerson, llvm-commits, rengolin Differential Revision: http://reviews.llvm.org/D17463 llvm-svn: 262248	2016-02-29 19:53:22 +00:00
Adrian Prantl	fb2add2be1	Fix PR26585 by improving the promotion of DBG_VALUEs to DW_AT_locations. When a variable is described by a single DBG_VALUE instruction we can often use a more efficient inline DW_AT_location instead of using a location list. This commit makes the heuristic that decides when to apply this optimization stricter by also verifying that the DBG_VALUE is live at the entry of the function (instead of just checking that it is valid until the end of the function). <rdar://problem/24611008> llvm-svn: 262247	2016-02-29 19:49:46 +00:00
Steven Wu	f2fe0141ca	Rename embedded bitcode section in MachO Summary: Rename the section embeds bitcode from ".llvmbc,.llvmbc" to "__LLVM,__bitcode". The new name matches MachO section naming convention. Reviewers: rafael, pcc Subscribers: davide, llvm-commits, joker.eph Differential Revision: http://reviews.llvm.org/D17388 llvm-svn: 262245	2016-02-29 19:40:10 +00:00
Ahmed Bougacha	bb5d7d7ed8	[X86] Move the ATOMIC_LOAD_OP ISel from DAGToDAG to ISelLowering. NFCI. This is long-standing dirtiness, as acknowledged by r77582: The current trick is to select it into a merge_values with the first definition being an implicit_def. The proper solution is to add new ISD opcodes for the no-output variant. Doing this before selection will let us combine away some constructs. Differential Revision: http://reviews.llvm.org/D17659 llvm-svn: 262244	2016-02-29 19:28:07 +00:00
Colin LeMahieu	b9f1eae328	[Hexagon] Setting sign mismatch flag on expression instead of using bit tricks. llvm-svn: 262243	2016-02-29 19:17:56 +00:00
Rong Xu	9e926e8b92	Minor code cleanup. NFC llvm-svn: 262242	2016-02-29 19:16:04 +00:00
David Majnemer	e60ee3b8ce	[WinEH] Make setjmp work correctly with EH 32-bit X86 EH on Windows utilizes a stack of registration nodes allocated and deallocated on entry/exit. A registration node contains a bunch of EH personality specific information like which try-state we are currently in. Because a setjmp target allows control flow from arbitrary program points, there is no way to ensure that the try-state we are in is correctly updated once we transfer control. MSVC compatible compilers, like MSVC and ICC, utilize runtime helpers to reinitialize the try-state when a longjmp occurs. This is implemented by adding additional arguments to _setjmp3: the desired try-state and a helper routine to update the try-state. Differential Revision: http://reviews.llvm.org/D17721 llvm-svn: 262241	2016-02-29 19:16:03 +00:00
Dehao Chen	939993ff2f	Move discriminator assignment to the right place. Summary: Now discriminator is assigned per-function instead of per-module. Reviewers: davidxl, dnovillo Subscribers: dblaikie, llvm-commits Differential Revision: http://reviews.llvm.org/D17664 llvm-svn: 262240	2016-02-29 18:59:48 +00:00
Colin LeMahieu	73cd686ce1	[Hexagon] Using MustExtend flag on expression instead of passing around bools. llvm-svn: 262238	2016-02-29 18:39:51 +00:00
Adrian Prantl	693e8de0fa	fix typo in comment llvm-svn: 262236	2016-02-29 17:06:46 +00:00
Nemanja Ivanovic	1a5706ca1b	Fix for PR26180 Corresponds to Phabricator review: http://reviews.llvm.org/D16592 This fix includes both an update to how we handle the "generic" CPU on LE systems as well as Anton's fix for the Fast Isel issue. llvm-svn: 262233	2016-02-29 16:42:27 +00:00
Daniel Sanders	03a8d2f8ec	[mips] Range check uimm20 and fixed a bug this revealed. Summary: The bug was that dextu's operand 3 would print 0-31 instead of 32-63 when printing assembly. This came up when replacing MipsInstPrinter::printUnsignedImm() with a version that could handle arbitrary bit widths. MipsAsmPrinter::printUnsignedImm*() don't seem to be used so they have been removed. Reviewers: vkalintiris Subscribers: dsanders, llvm-commits Differential Revision: http://reviews.llvm.org/D15521 llvm-svn: 262231	2016-02-29 16:06:38 +00:00
Vasileios Kalintiris	29620aca3e	[mips] Do not use SLL for ANY_EXTEND nodes as the high bits are undefined. Reviewers: dsanders Subscribers: dsanders, llvm-commits Differential Revision: http://reviews.llvm.org/D15420 llvm-svn: 262230	2016-02-29 15:58:12 +00:00
Daniel Sanders	611eb82953	[mips] Make isel select the correct DEXT variant up front. Summary: Previously, it would always select DEXT and substitute any invalid matches for DEXTU/DEXTM during MipsMCCodeEmitter::encodeInstruction(). This works but causes problems when adding range checked immediates to IAS. Now isel selects the correct variant up front. Reviewers: vkalintiris Subscribers: dsanders, llvm-commits Differential Revision: http://reviews.llvm.org/D16810 llvm-svn: 262229	2016-02-29 15:26:54 +00:00
Rafael Espindola	8d6fbc3a4e	IRObject: Mark extern_weak as weak. llvm-svn: 262222	2016-02-29 14:26:06 +00:00
Benjamin Kramer	6bb15021b3	[InstSimplify] Restore fsub 0.0, (fsub 0.0, X) ==> X optzn I accidentally removed this in r262212 but there was no test coverage to detect it. llvm-svn: 262215	2016-02-29 12:18:25 +00:00
Daniel Sanders	90f0d0b8e3	[mips] Make symbols an acceptable branch target when expanding compare-to-immediate-and-branch macros. Reviewers: vkalintiris Subscribers: llvm-commits, vkalintiris, dim, seanbruno, dsanders Differential Revision: http://reviews.llvm.org/D15369 llvm-svn: 262213	2016-02-29 11:24:49 +00:00
Benjamin Kramer	f5b2a47ac6	[InstSimplify] fsub 0.0, (fsub -0.0, X) ==> X is only safe if signed zeros are ignored. Only allow fsub -0.0, (fsub -0.0, X) ==> X without nsz. PR26746. llvm-svn: 262212	2016-02-29 11:12:23 +00:00
Daniel Sanders	27ba83fd45	[test-release.sh] Add lldb to list of projects (disabled by default) Reviewers: hans Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D17070 llvm-svn: 262211	2016-02-29 11:04:39 +00:00
Chandler Carruth	8b5a7419b8	[PM] Wire up optimization levels and default pipeline construction APIs in the PassBuilder. These are really just stubs for now, but they give a nice API surface that Clang or other tools can start learning about and enabling for experimentation. I've also wired up parsing various synthetic module pass names to generate these set pipelines. This allows the pipelines to be combined with other passes and have their order controlled, with clear separation between the kind of canned pipeline, and the level of optimization to be used within that canned pipeline. The most interesting part of this patch is almost certainly the spec for the different optimization levels. I don't think we can ever have hard and fast rules that would make it easy to determine whether a particular optimization makes sense at a particular level -- it will always be in large part a judgement call. But hopefully this will outline the expected rationale that should be used, and the direction that the pipelines should be taken. Much of this was based on a long llvm-dev discussion I started years ago to try and crystalize the intent behind these pipelines, and now, at long long last I'm returning to the task of actually writing it down somewhere that we can cite and try to be consistent with. Differential Revision: http://reviews.llvm.org/D12826 llvm-svn: 262196	2016-02-28 22:16:03 +00:00
NAKAMURA Takumi	df0cd72657	[PM] Appease mingw32's auto-import DLL build with minimal tweaks, with fix for clang. char AnalysisBase::ID should be declared as extern and defined in one module. llvm-svn: 262188	2016-02-28 17:17:00 +00:00
Vasileios Kalintiris	c9aaa3171d	[mips] Remove unused function declarations from MipsRegisterInfo.h. NFC. llvm-svn: 262187	2016-02-28 16:55:28 +00:00
NAKAMURA Takumi	ca04a1f720	Revert r262185, "[PM] Appease mingw32's auto-import DLL build with minimal tweaks." I'll rework soon. llvm-svn: 262186	2016-02-28 16:54:06 +00:00
NAKAMURA Takumi	de40e7437e	[PM] Appease mingw32's auto-import DLL build with minimal tweaks. char AnalysisBase::ID should be declared as extern and defined in one module. llvm-svn: 262185	2016-02-28 16:38:46 +00:00
JF Bastien	3a0814ac1a	WebAssembly: fix test Operand order seems to have changed, the new one is nicer. llvm-svn: 262180	2016-02-28 15:44:54 +00:00
JF Bastien	1afd1e2baa	WebAssembly: fix build More API churn, experimental target got sad. llvm-svn: 262179	2016-02-28 15:33:53 +00:00
Michael Zuckerman	96836fc81c	[AVX512][PSLLW ][PSLLV] Change imm8 to int Differential Revision: http://reviews.llvm.org/D17684 llvm-svn: 262176	2016-02-28 07:32:10 +00:00
Xinliang David Li	985ff20a9c	[PGO] Remove redundant counter copies for avail_extern functions. Differential Revision: http://reviews.llvm.org/D17654 llvm-svn: 262157	2016-02-27 23:11:30 +00:00
Duncan P. N. Exon Smith	ebcce78f65	CodeGen: Remove an iterator => pointer conversion, NFC Part of PR26753. llvm-svn: 262154	2016-02-27 20:27:44 +00:00
Matt Arsenault	3a61985b2f	AMDGPU: More bits of frame index are known to be zero The maximum private allocation for the whole GPU is 4G, so the maximum possible index for a single workitem is the maximum size divided by the smallest granularity for a dispatch. This increases the number of known zero high bits, which enables more offset folding. The maximum private size per workitem with this is 128M but may be smaller still. llvm-svn: 262153	2016-02-27 20:26:57 +00:00
Duncan P. N. Exon Smith	d6ebd07b8d	CodeGen: Use MachineInstr& in InlineSpiller::rematerializeFor() InlineSpiller::rematerializeFor() never uses its parameter as an iterator, so take it by reference instead. This removes an implicit conversion from MachineBasicBlock::iterator to MachineInstr*. llvm-svn: 262152	2016-02-27 20:23:14 +00:00
Duncan P. N. Exon Smith	be8f8c4478	CodeGen: Update LiveIntervalAnalysis API to use MachineInstr&, NFC These parameters aren't expected to be null, so take them by reference. llvm-svn: 262151	2016-02-27 20:14:29 +00:00
Duncan P. N. Exon Smith	fd8cc23220	CodeGen: Change MachineInstr to use MachineInstr&, NFC Change MachineInstr API to prefer MachineInstr& over MachineInstr* whenever the parameter is expected to be non-null. Slowly inching toward being able to fix PR26753. llvm-svn: 262149	2016-02-27 20:01:33 +00:00
Matt Arsenault	982224cfb8	DAGCombiner: Don't unnecessarily swap operands in ReassociateOps In the case where op = add, y = base_ptr, and x = offset, this transform: (op y, (op x, c1)) -> (op (op x, y), c1) breaks the canonical form of add by putting the base pointer in the second operand and the offset in the first. This fix is important for the R600 target, because for some address spaces the base pointer and the offset are stored in separate register classes. The old pattern caused the ISel code for matching addressing modes to put the base pointer and offset in the wrong register classes, which required no-trivial code transformations to fix. llvm-svn: 262148	2016-02-27 19:57:45 +00:00
Duncan P. N. Exon Smith	d3a7467221	CodeGen: Use MachineInstr& in HashMachineInstr, NFC Also update HashEndOfMBB to take MachineBasicBlock&. llvm-svn: 262146	2016-02-27 19:48:01 +00:00
Duncan P. N. Exon Smith	5e6e8c7a0a	CodeGen: Use MachineInstr& in AntiDepBreaker API, NFC Take parameters as MachineInstr& instead of MachineInstr* in AntiDepBreaker API, since these are required to be non-null. No functionality change intended. Looking toward PR26753. llvm-svn: 262145	2016-02-27 19:33:37 +00:00
Duncan P. N. Exon Smith	bd529fbb4a	CodeGen: Assert valid MI in AntiDepBreaker::UpdateDbgValue This already assumes a valid MI, since it dereferences the MI in an assertion before checking for null. At an explicit assert. llvm-svn: 262144	2016-02-27 19:23:34 +00:00
Duncan P. N. Exon Smith	1f6624ae34	AArch64: Use MachineInstr& in guaranteesZeroRegInBlock(), NFC llvm-svn: 262143	2016-02-27 19:12:54 +00:00
Duncan P. N. Exon Smith	5702287809	CodeGen: Update DFAPacketizer API to take MachineInstr&, NFC In all but one case, change the DFAPacketizer API to take MachineInstr& instead of MachineInstr*. In DFAPacketizer::endPacket(), take MachineBasicBlock::iterator. Besides cleaning up the API, this is in search of PR26753. llvm-svn: 262142	2016-02-27 19:09:00 +00:00
Duncan P. N. Exon Smith	f9ab416d70	WIP: CodeGen: Use MachineInstr& in MachineInstrBundle.h, NFC Update APIs in MachineInstrBundle.h to take and return MachineInstr& instead of MachineInstr* when the instruction cannot be null. Besides being a nice cleanup, this is tacking toward a fix for PR26753. llvm-svn: 262141	2016-02-27 17:05:33 +00:00
JF Bastien	13d3b9b777	WebAssembly: fix build It was broken by the work for PR26753. llvm-svn: 262140	2016-02-27 16:38:23 +00:00
Renato Golin	9a5419ecf7	Revert "[sancov] do not instrument nodes that are full pre-dominators" This reverts commit r262103, as it broke all ARM and AArch64 bots. llvm-svn: 262139	2016-02-27 14:19:19 +00:00
Simon Pilgrim	9e10b1655c	Tidyup for loops - don't repeat upper limit evaluation if you don't have to. NFCI. llvm-svn: 262137	2016-02-27 13:26:58 +00:00
Chris Dewhurst	0a2c033e2d	Addition of tests to previous check-in. Tests for coprocessor register usage in Sparc. Previous check-in message was: The patch adds missing registers and instructions to complete all the registers supported by the Sparc v8 manual. These are all co-processor registers, with the exception of the floating-point deferred-trap queue register. Although these will not be lowered automatically by any instructions, it allows the use of co-processor instructions implemented by inline-assembly. Code Reviewed at http://reviews.llvm.org/D17133, with the exception of a very small change in brace placement in SparcInstrInfo.td, which was formerly causing a problem in the disassembly of the %fq register. llvm-svn: 262135	2016-02-27 12:52:26 +00:00
Simon Pilgrim	83e76327e8	[X86][AVX] vpermilvar.pd mask element indices only use bit1 llvm-svn: 262134	2016-02-27 12:51:46 +00:00
Chris Dewhurst	053826af69	The patch adds missing registers and instructions to complete all the registers supported by the Sparc v8 manual. These are all co-processor registers, with the exception of the floating-point deferred-trap queue register. Although these will not be lowered automatically by any instructions, it allows the use of co-processor instructions implemented by inline-assembly. Code Reviewed at http://reviews.llvm.org/D17133, with the exception of a very small change in brace placement in SparcInstrInfo.td, which was formerly causing a problem in the disassembly of the %fq register. llvm-svn: 262133	2016-02-27 12:49:59 +00:00
Simon Pilgrim	a9a7bf68ee	[X86][AVX] Added AVX1 target shuffle combine tests llvm-svn: 262132	2016-02-27 12:33:08 +00:00
Simon Pilgrim	3b42ca0760	Strip trailing whitespace. NFCI. llvm-svn: 262131	2016-02-27 11:49:16 +00:00
Chandler Carruth	30811a4dde	[PM] Loosen the regex for the proxy template name even further to cope with 'class' keywords in the template arguments and other silliness. llvm-svn: 262130	2016-02-27 11:07:16 +00:00
Chandler Carruth	08a25ce0e3	[PM] Use a boring regex instead of explicitly naming the analysis manager as some compilers print the typedef name and others print the "canonical" name of the underlying class template. This isn't really an important artifact of the test anyways so it seems fine to just loosen the test assertions here. llvm-svn: 262129	2016-02-27 10:48:14 +00:00
Chandler Carruth	afcec4c55a	[PM] Provide explicit instantiation declarations and definitions for the PassManager and AnalysisManager template specializations as well. llvm-svn: 262128	2016-02-27 10:45:35 +00:00
Chandler Carruth	2a54094d40	[PM] Provide two templates for the two directionalities of analysis manager proxies and use those rather than repeating their definition four times. There are real differences between the two directions: outer AMs are const and don't need to have invalidation tracked. But every proxy in a particular direction is identical except for the analysis manager type and the IR unit they proxy into. This makes them prime candidates for nice templates. I've started introducing explicit template instantiation declarations and definitions as well because we really shouldn't be emitting all this everywhere. I'm going to go back and add the same for the other templates like this in a follow-up patch. I've left the analysis manager as an opaque type rather than using two IR units and requiring it to be an AnalysisManager template specialization. I think its important that users retain the ability to provide their own custom analysis management layer and provided it has the appropriate API everything should Just Work. llvm-svn: 262127	2016-02-27 10:38:10 +00:00
Matt Arsenault	360d244d5b	DAGCombiner: Relax sqrt NaN folding check This is OK for +0 since compares to +/-0 give the same result. llvm-svn: 262125	2016-02-27 09:38:05 +00:00
Matt Arsenault	9d82ee7526	AMDGPU: Split vi-insts subtarget feature This will be more useful for marking builtins acceptable for which subtargets. llvm-svn: 262121	2016-02-27 08:53:55 +00:00
Matt Arsenault	274d34e725	AMDGPU: Add s_sleep intrinsic llvm-svn: 262120	2016-02-27 08:53:52 +00:00
Matt Arsenault	61738cbcb6	AMDGPU: Implement readcyclecounter This matches the behavior of the HSAIL clock instruction. s_realmemtime is used if the subtarget supports it, and falls back to s_memtime if not. Also introduces new intrinsics for each of s_memtime / s_memrealtime. llvm-svn: 262119	2016-02-27 08:53:46 +00:00
Duncan P. N. Exon Smith	353c84e747	CodeGen: Avoid implicit conversion in MachineInstrBuilder, NFC Avoid another implicit conversion from MachineInstrBundleIterator to MachineInstr*, this time in MachineInstrBuilder.h (this is in pursuit of PR26753). llvm-svn: 262118	2016-02-27 07:00:35 +00:00
Duncan P. N. Exon Smith	b6bb889dfd	CodeGen: Remove implicit iterator to pointer conversions, NFC Remove a couple of implicit conversions from MachineInstrBundleIterator to MachineInstr*. llvm-svn: 262116	2016-02-27 06:51:00 +00:00
Duncan P. N. Exon Smith	3ac9cc6156	CodeGen: Take MachineInstr& in SlotIndexes and LiveIntervals, NFC Take MachineInstr by reference instead of by pointer in SlotIndexes and the SlotIndex wrappers in LiveIntervals. The MachineInstrs here are never null, so this cleans up the API a bit. It also incidentally removes a few implicit conversions from MachineInstrBundleIterator to MachineInstr* (see PR26753). At a couple of call sites it was convenient to convert to a range-based for loop over MachineBasicBlock::instr_begin/instr_end, so I added MachineBasicBlock::instrs. llvm-svn: 262115	2016-02-27 06:40:41 +00:00
Sean Silva	ea399f0242	[instrprof] Use __{start,stop}_SECNAME on PS4 too. Summary: The PS4 linker seems to handle this fine. Hi David, it seems that indeed most ELF linkers support __{start,stop}_SECNAME, as our proprietary linker does as well. This follows the pattern of r250679 w.r.t. the testing. Maggie, Phillip, Paul: I've tested this with the PS4 SDK 3.5 toolchain prerelease and it seems to work fine. Reviewers: davidxl Subscribers: probinson, phillip.power, MaggieYi Differential Revision: http://reviews.llvm.org/D17672 llvm-svn: 262112	2016-02-27 06:01:26 +00:00
Mike Aizatsky	9056284912	[sancov] properly initializing pass. llvm-svn: 262111	2016-02-27 05:50:40 +00:00
Kostya Serebryany	3c767db3c5	[libFuzzer] don't emit callbacks to sanitizer run-time in -fsanitize-coverage=trace-pc mode; update libFuzzer doc for previous commit llvm-svn: 262110	2016-02-27 05:45:12 +00:00
Philip Reames	70b391864d	Suppress an uncovered switch warning [NFC] llvm-svn: 262109	2016-02-27 05:18:30 +00:00
Chandler Carruth	ad8cb382fa	[LICM] Teach LICM how to handle cases where the alias set tracker was merged into a loop that was subsequently unrolled (or otherwise nuked). In this case it can't merge in the ASTs for any remaining nested loops, it needs to re-add their instructions dircetly. The fix is very isolated, but I've pulled the code for merging blocks into the AST into a single place in the process. The only behavior change is in the case which would have crashed before. This fixes a crash reported by Mikael Holmen on the list after r261316 restored much of the loop pass pipelining and allowed us to actually do this kind of nested transformation sequenc. I've taken that test case and further reduced it into the somewhat twisty maze of loops in the included test case. This does in fact trigger the bug even in this reduced form. llvm-svn: 262108	2016-02-27 04:34:07 +00:00
Kostya Serebryany	bf821db932	[libFuzzer] fixing the bot llvm-svn: 262106	2016-02-27 03:14:23 +00:00
Mike Aizatsky	0d202ffa7c	[sancov] print_coverage_points command. Differential Revision: http://reviews.llvm.org/D17670 llvm-svn: 262104	2016-02-27 02:21:44 +00:00
Mike Aizatsky	9b53ab7121	[sancov] do not instrument nodes that are full pre-dominators Summary: Without tree pruning clang has 2,667,552 points. Wiht only dominators pruning: 1,515,586. With both dominators & predominators pruning: 1,340,534. Differential Revision: http://reviews.llvm.org/D17671 llvm-svn: 262103	2016-02-27 02:10:27 +00:00
Kostya Serebryany	2d4f8f168b	[libFuzzer] speedup path coverage handling llvm-svn: 262102	2016-02-27 01:50:16 +00:00
Junmo Park	272a2bc365	Minor code cleanup. NFC. llvm-svn: 262096	2016-02-27 01:10:43 +00:00
Reid Kleckner	892ae2e2b6	[InstCombine] Be more conservative about removing stackrestore We ended up removing a save/restore pair around an inalloca call, leading to a miscompile in Chromium. llvm-svn: 262095	2016-02-27 00:53:54 +00:00
Paul Robinson	4b618dcc93	Revert r262092, caught LLD tests llvm-svn: 262093	2016-02-26 23:44:10 +00:00
Paul Robinson	abcfa39566	[FileCheck] Abort if -NOT is combined with another suffix. Combinations of suffixes that look useful actually are ignored; complaining about them will avoid mistakes. Differential Revision: http://reviews.llvm.org/D17587 llvm-svn: 262092	2016-02-26 23:34:02 +00:00
Cong Hou	e0eb8bfe37	Fix a bug in isVectorReductionOp() in SelectionDAGBuilder.cpp that may cause assertion failure on AArch64. llvm-svn: 262091	2016-02-26 23:25:30 +00:00
Ahmed Bougacha	0c95decaaa	[X86] Move an encoding test from CodeGen to MC. NFC. llvm-svn: 262089	2016-02-26 23:00:03 +00:00
Ahmed Bougacha	ccf38fd0e2	[X86] Delete old redundant test. NFC. llvm-svn: 262088	2016-02-26 23:00:00 +00:00
Ahmed Bougacha	ffcab7bf32	[X86] Fix a stale comment. NFC. llvm-svn: 262087	2016-02-26 22:59:57 +00:00
Ahmed Bougacha	55e1592889	[X86] Remove the unused SDTX86atomicBinary. NFC. llvm-svn: 262086	2016-02-26 22:59:41 +00:00
Philip Reames	adf0e35308	[LVI] Extend select handling to catch min/max/clamp idioms Most of this is fairly straight forward. Add handling for min/max via existing matcher utility and ConstantRange routines. Add handling for clamp by exploiting condition constraints on inputs. Note that I'm only handling two constant ranges at this point. It would be reasonable to consider treating overdefined as a full range if the instruction is typed as an integer, but that should be a separate change. Differential Revision: http://reviews.llvm.org/D17184 llvm-svn: 262085	2016-02-26 22:53:59 +00:00
Kostya Serebryany	66ff0756e4	[libFuzzer] add -print_final_stats=1 flag llvm-svn: 262084	2016-02-26 22:42:23 +00:00
Simon Pilgrim	4d1a088323	Strip trailing whitespace. NFCI. llvm-svn: 262083	2016-02-26 22:28:50 +00:00
Philip Reames	ba31312f63	[ConstantRange] Add umin/smin operators This was split off from http://reviews.llvm.org/D17184. Reviewed by: Sanjoy llvm-svn: 262080	2016-02-26 22:08:18 +00:00
Kit Barton	915c5ecee1	[PPC] Legalize FNEG on PPC when possible Currently we always expand ISD::FNEG. For v4f32 and v2f64 vector types VSX has native support for this opcode Phabricator: http://reviews.llvm.org/D17647 llvm-svn: 262079	2016-02-26 21:59:44 +00:00
Simon Pilgrim	10e3ca2cc1	Fix spelling. NFCI. llvm-svn: 262078	2016-02-26 21:56:27 +00:00
Sanjay Patel	fc7e7ebf36	[x86, InstCombine] transform x86 AVX2 masked stores to LLVM intrinsics Replicate everything for integers...because x86. Continuation of: http://reviews.llvm.org/rL262064 llvm-svn: 262077	2016-02-26 21:51:44 +00:00
Kostya Serebryany	da63c1d09a	[libFuzzer] initial implementation of path coverage based on -fsanitize-coverage=trace-pc. This does not scale well yet, but already cracks FullCoverageSetTest in seconds llvm-svn: 262073	2016-02-26 21:33:56 +00:00
Chris Bieneman	be22727598	[CMake] Allow LLVM_TARGETS_TO_BUILD to accept "Native" This allows a user to specify "Native" as a target when configuring LLVM. Native will resolve to the LLVM_NATIVE_ARCH, which is the target that supports code generation for the host. llvm-svn: 262070	2016-02-26 21:21:40 +00:00
Paul Robinson	1d412f6457	Reapply r262054 with triple fix. llvm-svn: 262069	2016-02-26 21:18:34 +00:00
Kit Barton	93612ec5f2	Power9] Implement new vsx instructions: compare and conversion This change implements the following vsx instructions: Quad/Double-Precision Compare: xscmpoqp xscmpuqp xscmpexpdp xscmpexpqp xscmpeqdp xscmpgedp xscmpgtdp xscmpnedp xvcmpnedp(.) xvcmpnesp(.) Quad-Precision Floating-Point Conversion xscvqpdp(o) xscvdpqp xscvqpsdz xscvqpswz xscvqpudz xscvqpuwz xscvsdqp xscvudqp xscvdphp xscvhpdp xvcvhpsp xvcvsphp xsrqpi xsrqpix xsrqpxp 28 instructions Phabricator: http://reviews.llvm.org/D16709 llvm-svn: 262068	2016-02-26 21:11:55 +00:00
Chris Bieneman	e50f744743	[CMake] Add the gold plugin before clang This is needed to connect dependencies between the LLVMgold plugin and the clang stage-2 builds due to limitations in ExternalProject_Add. Patch by Mike Edwards Differential Revision: http://reviews.llvm.org/D17655 llvm-svn: 262067	2016-02-26 21:07:04 +00:00
Chris Bieneman	142f4cac26	[CMake] Assigning the LTO component to lto.h This makes it so lto.h is installed when you run the install-LTO target. llvm-svn: 262066	2016-02-26 21:07:02 +00:00
Sanjay Patel	1ace99351f	[x86, InstCombine] transform x86 AVX masked stores to LLVM intrinsics The intended effect of this patch in conjunction with: http://reviews.llvm.org/rL259392 http://reviews.llvm.org/rL260145 is that customers using the AVX intrinsics in C will benefit from combines when the store mask is constant: void mstore_zero_mask(float f, __m128 v) { _mm_maskstore_ps(f, _mm_set1_epi32(0), v); } void mstore_fake_ones_mask(float f, __m128 v) { _mm_maskstore_ps(f, _mm_set1_epi32(1), v); } void mstore_ones_mask(float f, __m128 v) { _mm_maskstore_ps(f, _mm_set1_epi32(0x80000000), v); } void mstore_one_set_elt_mask(float f, __m128 v) { _mm_maskstore_ps(f, _mm_set_epi32(0x80000000, 0, 0, 0), v); } ...so none of the above will actually generate a masked store for optimized code. Differential Revision: http://reviews.llvm.org/D17485 llvm-svn: 262064	2016-02-26 21:04:14 +00:00
Sanjay Patel	51488ed2d5	[x86] refactor to eliminate duplicated code; NFCI llvm-svn: 262062	2016-02-26 20:59:05 +00:00
Amaury Sechet	b2055c53ba	Fix warning in DwarfCFIException. NFC llvm-svn: 262061	2016-02-26 20:49:07 +00:00
Paul Robinson	d68c435a5d	Revert r262054 on one file that fails sometimes. llvm-svn: 262060	2016-02-26 20:41:07 +00:00
Amaury Sechet	7067ad3c27	Extract the method to begin and end a fragment in AsmPrinterHandler in their own method. NFC Summary: This is extracted from D17555 Reviewers: davidxl, reames, sanjoy, MatzeB, pete Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D17580 llvm-svn: 262058	2016-02-26 20:30:37 +00:00
Quentin Colombet	87e23e5733	[GlobalISel] Fix a ranlib warning about empty TOC. Fixes PR26733 llvm-svn: 262057	2016-02-26 20:05:02 +00:00
Paul Robinson	51fa0a87c3	Fix tests that used CHECK-NEXT-NOT and CHECK-DAG-NOT. FileCheck actually doesn't support combo suffixes. Differential Revision: http://reviews.llvm.org/D17588 llvm-svn: 262054	2016-02-26 19:40:34 +00:00
Nirav Dave	2993854bb4	Fix Sparc 32bit Lowering to rebundle up v2i32 values. Summary: Fix LowerCall to rebundle v2i32 values after lowering and add testcase Reviewers: jyknight Subscribers: llvm-commits, jyknight Differential Revision: http://reviews.llvm.org/D17615 llvm-svn: 262048	2016-02-26 18:55:22 +00:00
Sanjay Patel	155193c3aa	[x86, AVX] fold 'isPositive' 256-bit vector integer operations (PR26701) This extends the fold introduced with: http://reviews.llvm.org/rL262036 llvm-svn: 262047	2016-02-26 18:42:50 +00:00
Reid Kleckner	1762ad3e73	[IR] Optimize bitfield layout of Value for MSVC This should save a pointer of padding from all MSVC Value subclasses. Recall that MSVC will not pack the following bitfields together: unsigned Bits : 29; unsigned Flag1 : 1; unsigned Flag2 : 1; unsigned Flag3 : 1; Add a static_assert because LLVM developers always trip over this behavior. This regressed in June. llvm-svn: 262045	2016-02-26 18:08:59 +00:00
Sanjay Patel	334685b486	[x86, AVX] add 256-bit tests llvm-svn: 262044	2016-02-26 18:07:58 +00:00
Renato Golin	9590c532b8	[CMAKE] Update build on recent Haiku This patch updates cmake build scripts to build on Haiku. It adds Haiku x86_64 to config.guess. Please consider reviewing. Pathc by Jérôme Duval. llvm-svn: 262038	2016-02-26 17:01:45 +00:00
Sanjay Patel	4402a32b32	[x86, SSE] fold 'isPositive' vector integer operations (PR26701) This is one of the cases shown in: https://llvm.org/bugs/show_bug.cgi?id=26701 Shift and negate is what InstCombine appears to prefer, so I've started with that pattern. Note that the 'pcmpeq' instructions are always generating the negative one for the actual 'pcmpgt' comparison in each case (side note: why isn't there an alias mnemonic for that?). Differential Revision: http://reviews.llvm.org/D17630 llvm-svn: 262036	2016-02-26 16:56:03 +00:00
Reid Kleckner	70c9bc71d4	[WinEH] Fix funclet return block clobber mask placement MBB slot index intervals are half open, not closed. getMBBEndIndex() returns the slot index of the start of the next block in layout order. Placing a register mask there is incorrect if the successor of the funclet return is not laid out after the return. Clang generates IR for catch bodies before generating the following normal code, so we never noticed this issue until the D frontend authors filed a bug about it. Instead, we can put the clobber mask on the last instruction of the funclet return block. We still aren't using a register mask operand on the CATCHRET instruction because it would cause PEI to spill all CSRs, including XMM regs, in the prologue. Fixes PR26679. llvm-svn: 262035	2016-02-26 16:53:19 +00:00
Chandler Carruth	470734b512	[PM] Finish removing references to fix MSVC builds. Somehow adding base classes changed whether the decltype of these expressions was a reference. I'm somewhat horrified why, and there may need to be a deeper fix on MSVC, but this should at least get the bots a step further. llvm-svn: 262008	2016-02-26 12:30:18 +00:00
Chris Dewhurst	829b104dc2	Reverting breaking change. Sorry. llvm-svn: 262007	2016-02-26 12:20:10 +00:00
Chandler Carruth	58dde8cbc5	[PM] Speculative patch to try and fix MSVC's compilation. No idea why r262004 triggered this, but just trying to fix somehow. llvm-svn: 262006	2016-02-26 12:17:54 +00:00
Chris Dewhurst	9c3bf91d6e	Reviewed at reviews.llvm.org/D17133 llvm-svn: 262005	2016-02-26 11:46:47 +00:00
Chandler Carruth	3a63435551	[PM] Introduce CRTP mixin base classes to help define passes and analyses in the new pass manager. These just handle really basic stuff: turning a type name into a string statically that is nice to print in logs, and getting a static unique ID for each analysis. Sadly, the format of passes in anonymous namespaces makes using their names in tests really annoying so I've customized the names of the no-op passes to keep tests sane to read. This is the first of a few simplifying refactorings for the new pass manager that should reduce boilerplate and confusion. llvm-svn: 262004	2016-02-26 11:44:45 +00:00
Chris Dewhurst	6456376fe9	Initial test commit only llvm-svn: 262003	2016-02-26 11:38:24 +00:00
Chandler Carruth	610c408855	[PM] Remove a FIXME now that it is no longer needed. This has been fixed for some time, but the code hadn't been updated. llvm-svn: 261996	2016-02-26 10:02:04 +00:00
Nikolay Haustov	2f684f1347	[AMDGPU] Assembler: Basic support for MIMG Add parsing and printing of image operands. Matches legacy sp3 assembler. Change image instruction order to have data/image/sampler operands in the beginning. This is needed because optional operands in MC are always last. Update SITargetLowering for new order. Add basic MC test. Update CodeGen tests. Review: http://reviews.llvm.org/D17574 llvm-svn: 261995	2016-02-26 09:51:05 +00:00
Chandler Carruth	5582532c0a	[PM] Clean up some formatting with the latest clang-format. llvm-svn: 261992	2016-02-26 09:37:52 +00:00
James Molloy	4eba0154fb	[AArch64] Slight cleanup in FPLoadBalancing Instead of the convoluted if-statment we can just use getColor. This also fixes a bug where we relied upon the parity of tablegen-generated register indexes (instead of using the machine encoding). llvm-svn: 261990	2016-02-26 09:10:53 +00:00
Simon Pilgrim	cf5352db84	[X86][F16C] Added native IR half/float conversion tests. Placeholder tests until we start improving native vector support. llvm-svn: 261989	2016-02-26 08:52:29 +00:00
David Blaikie	f1958da1c3	llvm-dwp: provide diagnostics for duplicate DWO IDs These diagnostics aren't perfect - in the case of merging several dwos into dwps and those dwps into more dwps - just getting the message about the original source file name might not be much help (since it's the same in both dwos, by definition - but doesn't tell you which chain of dwps to backtrack) It might be worth adding the DW_AT_dwo_id to the split debug info to improve the diagnostic experience - might help track down the duplicates better. llvm-svn: 261988	2016-02-26 07:30:15 +00:00
David Blaikie	5d6d4dc306	llvm-dwp: Support empty .dwo files Though a bit odd, this is handy for a few reasons - for example, in a build system that wants consistent input/output of build steps, but where split-dwarf might be overriden/disabled by the user on a per-file basis. llvm-svn: 261987	2016-02-26 07:04:58 +00:00
Craig Topper	c929349912	[X86] Null out some redundant patterns for masked vector register to register moves. These can be accomplished with both aligned and unaligned opcodes. Currently aligned is what is being used so remove the redundant patterns for the unaligned versions. But don't do this for the byte and word vector types since they don't have aligned versions. llvm-svn: 261985	2016-02-26 06:50:29 +00:00
Craig Topper	7f36be935e	[TableGen] Fix typos in comments. NFC llvm-svn: 261984	2016-02-26 06:50:27 +00:00
Craig Topper	d50b5f8abc	[X86] Add test cases for r261977 and fix a grammatical error. llvm-svn: 261983	2016-02-26 06:50:24 +00:00
Haicheng Wu	5539f852ae	[JumpThreading] Simplify Instructions first in ComputeValueKnownInPredecessors() This change tries to find more opportunities to thread over basic blocks. llvm-svn: 261981	2016-02-26 06:06:04 +00:00
Craig Topper	4d187630de	[X86] Remove a couple returns after llvm_unreachables. NFC llvm-svn: 261979	2016-02-26 05:29:39 +00:00
Craig Topper	a11be0be89	[X86] Use inclusive ranges for XMM/YMM/ZMM registers in is32Extended and isX86_64ExtendedReg. NFC llvm-svn: 261978	2016-02-26 05:29:35 +00:00
Craig Topper	29c2273369	[X86] Explicitly diagnose use of %xmm16-%xmm31, %ymm16-%ymm31 and %zmm16-%zmm31 when AVX512 is not enabled in the asm parser. llvm-svn: 261977	2016-02-26 05:29:32 +00:00
Hongbin Zheng	b8bb0d8813	Another fix the testcase introduced by r261903 - Add the missing matches llvm-svn: 261971	2016-02-26 03:41:47 +00:00
Sanjoy Das	7a4c94d3a7	Minor doc fix: statepoints are invokable too llvm-svn: 261968	2016-02-26 03:33:59 +00:00
Matthias Braun	9dcd65f478	MachineCopyPropagation: Catch copies of the form A<-B;A<-B Differential Revision: http://reviews.llvm.org/D17475 llvm-svn: 261966	2016-02-26 03:18:55 +00:00
Matthias Braun	e39ff70685	MachineCopyPropagation: Keep scanning through instructions with regmasks This also simplifies the code by removing the overly conservative NoInterveningSideEffect() function. This function checked: - That the two copies belong to the same block: We only process one block at a time and clear our maps in between it is impossible to find a copy from a different block. - There is no terminator between the two copy instructions: This is not allowed anyway (the MachineVerifier would complain) - Does not have instructions with hasUnmodeledSideEffects() or isCall() set: Even for those instructuction we must have all clobbers/defs of registers explicit as an operand. If the register is explicitely clobbered we would never come to the point of checking for NoInterveningSideEffect() anyway. (I also checked this with a temporary build of the test-suite with all potentially failing conditions in NoInterveningSideEffect() turned into asserts) Differential Revision: http://reviews.llvm.org/D17474 llvm-svn: 261965	2016-02-26 03:18:50 +00:00
Xinliang David Li	23682e9cab	[PGO] Add test case to ensure covmap section is not allocatable. Differential Revision: http://reviews.llvm.org/D17324 llvm-svn: 261959	2016-02-26 03:05:10 +00:00
Michael Zolotukhin	9f520ebc54	[LoopUnrollAnalyzer] Check that we're using SCEV for the same loop we're simulating. Summary: Check that we're using SCEV for the same loop we're simulating. Otherwise, we might try to use the iteration number of the current loop in SCEV expressions for inner/outer loops IVs, which is clearly incorrect. Reviewers: chandlerc, hfinkel Subscribers: sanjoy, llvm-commits, mzolotukhin Differential Revision: http://reviews.llvm.org/D17632 llvm-svn: 261958	2016-02-26 02:57:05 +00:00
Junmo Park	820e392601	Minor code cleanups. NFC. llvm-svn: 261955	2016-02-26 02:07:36 +00:00
Michael Zolotukhin	374651d9aa	[UnitTests] UnrollAnalyzer: make unit-test more general so that it can cover more cases in future. llvm-svn: 261954	2016-02-26 01:44:04 +00:00
Mike Aizatsky	5971f18133	[sancov] Pruning full dominator blocks from instrumentation. Summary: This is the first simple attempt to reduce number of coverage- instrumented blocks. If a basic block dominates all its successors, then its coverage information is useless to us. Ingore such blocks if santizer-coverage-prune-tree option is set. Differential Revision: http://reviews.llvm.org/D17626 llvm-svn: 261949	2016-02-26 01:17:22 +00:00
Sanjay Patel	7ed9361896	[x86, SSE] add tests to show missing pcmp folds llvm-svn: 261948	2016-02-26 01:14:27 +00:00
Xinliang David Li	c1f74d1cfe	Add forward declarations /NFC llvm-svn: 261946	2016-02-26 00:54:08 +00:00
David Majnemer	08dd52dc75	[WinEH] Don't remove unannotated inline-asm calls Inline-asm calls aren't annotated with funclet bundle operands because they don't throw and cannot be inlined through. We shouldn't require them to bear an funclet bundle operand. llvm-svn: 261942	2016-02-26 00:04:25 +00:00
Owen Anderson	7bd3499d05	More internal details of SROA pass to library visibility. llvm-svn: 261934	2016-02-25 23:34:21 +00:00
Justin Bogner	78cd1ddfbb	Support: Give ManagedStatic's helper object library visibility It doesn't make much sense to export these symbols. llvm-svn: 261931	2016-02-25 22:05:19 +00:00
Hemant Kulkarni	de1152f444	Reverts change r261907 and r261918 llvm-svn: 261927	2016-02-25 20:47:07 +00:00
Hongbin Zheng	8c70ab75a0	Use regex in testcase, do not fail windows bots llvm-svn: 261922	2016-02-25 19:16:40 +00:00
Hemant Kulkarni	c518d9b6ec	Fix endianness issue on BE machines introduced by r261907 llvm-svn: 261918	2016-02-25 18:56:01 +00:00
David L Kreitzer	602ba70a0b	Reformatted a comment to fit the 80 column limit. NFC. llvm-svn: 261916	2016-02-25 18:50:45 +00:00
Hongbin Zheng	bb48b353a1	Try to fix windows fail at r261902. Introduce move constructor and move assignment operator to PostDominatorTree. llvm-svn: 261910	2016-02-25 18:24:19 +00:00
Hemant Kulkarni	2a834115bf	[llvm-readobj] Enable GNU style sections and relocations printing http://reviews.llvm.org/D17523 llvm-svn: 261907	2016-02-25 18:02:00 +00:00
Hongbin Zheng	bc53977a0d	Introduce RegionInfoAnalysis, which compute Region Tree in the new PassManager. NFC Differential Revision: http://reviews.llvm.org/D17571 llvm-svn: 261904	2016-02-25 17:54:25 +00:00
Hongbin Zheng	751337faa7	Introduce DominanceFrontierAnalysis to the new PassManager to compute DominanceFrontier. NFC Differential Revision: http://reviews.llvm.org/D17570 llvm-svn: 261903	2016-02-25 17:54:15 +00:00
Hongbin Zheng	3f97840721	Introduce analysis pass to compute PostDominators in the new pass manager. NFC Differential Revision: http://reviews.llvm.org/D17537 llvm-svn: 261902	2016-02-25 17:54:07 +00:00
Tim Northover	aa35bd26c7	ARM: disallow pc as a base register in Thumb2 memory ops. These should all be deferring to the "OP (literal)" variant according to the ARM ARM. llvm-svn: 261895	2016-02-25 16:54:52 +00:00

... 2 3 4 5 6 ...

128295 Commits