llvm-project

Commit Graph

Author	SHA1	Message	Date
Simon Pilgrim	83fb4019f7	[X86][SSE] Added vector urem combine tests llvm-svn: 285119	2016-10-25 21:10:12 +00:00
Simon Pilgrim	4ebb04510a	[DAGCombiner] Enable sdiv(x.y) -> udiv(x,y) combine for vectors SelectionDAG::SignBitIsZero (via SelectionDAG::computeKnownBits) has supported vectors since rL280927 llvm-svn: 285118	2016-10-25 20:56:42 +00:00
Guozhi Wei	ae541f6a71	[InstCombine] Resubmit the combine of A->B->A BitCast and fix for pr27996 The original patch of the A->B->A BitCast optimization was reverted by r274094 because it may cause infinite loop inside compiler https://llvm.org/bugs/show_bug.cgi?id=27996. The problem is with following code xB = load (type B); xA = load (type A); +yA = (A)xB; B -> A +zAn = PHI[yA, xA]; PHI +zBn = (B)zAn; // A -> B store zAn; store zBn; optimizeBitCastFromPhi generates +zBn = (B)zAn; // A -> B and expects it will be combined with the following store instruction to another store zAn Unfortunately before combineStoreToValueType is called on the store instruction, optimizeBitCastFromPhi is called on the new BitCast again, and this pattern repeats indefinitely. optimizeBitCastFromPhi only generates BitCast for load/store instructions, only the BitCast before store can cause the reexecution of optimizeBitCastFromPhi, and BitCast before store can easily be handled by InstCombineLoadStoreAlloca.cpp. So the solution to the problem is if all users of a CI are store instructions, we should not do optimizeBitCastFromPhi on it. Then optimizeBitCastFromPhi will not be called on the new BitCast instructions. Differential Revision: https://reviews.llvm.org/D23896 llvm-svn: 285116	2016-10-25 20:43:42 +00:00
Simon Pilgrim	61cdeb46c9	[X86][SSE] Added vector sdiv combine tests llvm-svn: 285112	2016-10-25 20:25:47 +00:00
Robert Lougher	3080d71fc8	revert: "Remove debug location from common tail when tail-merging" This reverts r285093, as it caused unexpected buildbot failures on clang-ppc64le-linux, clang-ppc64be-linux, clang-ppc64be-linux-multistage and clang-ppc64be-linux-lnt. Failing test ubsan/TestCases/TypeCheck/vptr.cpp. llvm-svn: 285110	2016-10-25 20:17:58 +00:00
Kostya Serebryany	c48c93184a	[libFuzzer] when mutating based on CMP traces also try adding +/- 1 to the desired bytes. Add another test for use_cmp llvm-svn: 285109	2016-10-25 20:15:15 +00:00
Sanjay Patel	f3dda13bd2	[InstCombine] Ensure that truncated int types are legal. Fixes the FIXMEs in D25952 and rL285075. Patch by bryant! Differential Revision: https://reviews.llvm.org/D25955 llvm-svn: 285108	2016-10-25 20:11:47 +00:00
Evandro Menezes	7696dc0685	[AArch64] Adjust the cost model for Exynos M1. Modify the maximum jump table size. llvm-svn: 285106	2016-10-25 20:05:42 +00:00
Tim Shen	85de51db85	[APFloat] Make APFloat an interface class to the internal IEEEFloat. NFC. Summary: The intention is to make APFloat an interface class, so that later I can add a second implementation class DoubleAPFloat to correctly implement PPCDoubleDouble semantic. The interface of IEEEFloat is not public, and can be simplified (currently it's exactly the same as the old APFloat), but that belongs to a separate patch. DoubleAPFloat should look like: class DoubleAPFloat { const fltSemantics *Semantics; std::unique_ptr<APFloat> APFloats; // Two heap-allocated APFloats. }; There is no functional change, nor public interface change. Reviewers: hfinkel, chandlerc, iteratee, echristo, kbarton Subscribers: llvm-commits, mehdi_amini Differential Revision: https://reviews.llvm.org/D25536 llvm-svn: 285105	2016-10-25 19:55:59 +00:00
Evandro Menezes	eb97e3554c	Add option to specify minimum number of entries for jump tables Add an option to allow easier experimentation by target maintainers with the minimum number of entries to create jump tables. Also clarify the name of the other existing option governing the creation of jump tables. Differential revision: https://reviews.llvm.org/D25883 llvm-svn: 285104	2016-10-25 19:53:51 +00:00
Vedant Kumar	22c1b7c1d8	[llvm-cov] Don't use colored output until we know it's supported CodeCoverageTool::error() depends on CoverageViewOptions::Colors being initialized. Should fix: http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-fast/builds/56 llvm-svn: 285103	2016-10-25 19:52:57 +00:00
Evandro Menezes	601f4cb9f7	Switch lowering: improve partitioning of jump tables When there's a tie between partitionings of jump tables, consider also cases that result in no jump tables, but in one or a few cases. The motivation is that many contemporary processors typically perform case switches fairly quickly. Differential revision: https://reviews.llvm.org/D25212 llvm-svn: 285099	2016-10-25 19:11:43 +00:00
Matthew Simpson	c62266d680	[LV] Sink scalar operands of predicated instructions When we predicate an instruction (div, rem, store) we place the instruction in its own basic block within the vectorized loop. If a predicated instruction has scalar operands, it's possible to recursively sink these scalar expressions into the predicated block so that they might avoid execution. This patch sinks as much scalar computation as possible into predicated blocks. We previously were able to sink such operands only if they were extractelement instructions. Differential Revision: https://reviews.llvm.org/D25632 llvm-svn: 285097	2016-10-25 18:59:45 +00:00
Sanjay Patel	8f9235dd7f	[InstCombine] add tests for missing icmp + shl nuw fold Patch by bryant! Differential Revision: https://reviews.llvm.org/D25952 llvm-svn: 285095	2016-10-25 18:47:56 +00:00
Michael Ilseman	e542804343	Add -strip-nonlinetable-debuginfo capability This adds a new function to DebugInfo.cpp that takes an llvm::Module as input and removes all debug info metadata that is not directly needed for line tables, thus effectively stripping all type and variable information from the module. The primary motivation for this feature was the bitcode work flow (cf. http://lists.llvm.org/pipermail/llvm-dev/2016-June/100643.html for more background). This is not wired up yet, but will be in subsequent patches. For testing, the new functionality is exposed to opt with a -strip-nonlinetable-debuginfo option. The secondary use-case (and one that works right now!) is as a reduction pass in bugpoint. I added two new bugpoint options (-disable-strip-debuginfo and -disable-strip-debug-types) to control the new features. By default it will first attempt to remove all debug information, then only the type info, and then proceed to hack at any remaining MDNodes. Thanks to Adrian Prantl for stewarding this patch! llvm-svn: 285094	2016-10-25 18:44:13 +00:00
Robert Lougher	e32564774c	Remove debug location from common tail when tail-merging The branch folding pass tail merges blocks into a common-tail. However, the tail retains the debug information from one of the original inputs to the merge (chosen randomly). This is a problem for sampled-based PGO, as hits on the common-tail will be attributed to whichever block was chosen, irrespective of which path was actually taken to the common-tail. This patch fixes the issue by nulling the debug location for the common-tail. Differential Revision: https://reviews.llvm.org/D25742 llvm-svn: 285093	2016-10-25 18:44:07 +00:00
Michael Kuperstein	cffedc4a94	Fix 80-char violations. NFC. llvm-svn: 285092	2016-10-25 18:31:23 +00:00
Vedant Kumar	d9f7446c84	[unittests] STLExtrasTest: Remove an MSVC 2013 workaround, NFCI. Let's see what the bots have to say about this... llvm-svn: 285091	2016-10-25 18:11:17 +00:00
Vedant Kumar	29d6302736	[unittests] Remove an MSVC 2013 workaround, NFCI. Let's see what the bots have to say about this... llvm-svn: 285090	2016-10-25 17:58:25 +00:00
Nico Weber	262a1334b1	Revert 285087. The sanitizer-windows bot turned red with: FAILED: utils/TableGen/CMakeFiles/obj.llvm-tblgen.dir/IntrinsicEmitter.cpp.obj C:\PROGRA~2\MICROS~1.0\VC\bin\AMD64_~2\cl.exe ... -c C:\...\llvm\utils\TableGen\IntrinsicEmitter.cpp c:\...\llvm\utils\tablegen\intrinsicemitter.cpp(254) : fatal error C1001: An internal error has occurred in the compiler. http://lab.llvm.org:8011/builders/sanitizer-windows/builds/114/steps/build%20clang%20lld/logs/stdio llvm-svn: 285089	2016-10-25 17:46:29 +00:00
Vedant Kumar	a3661effd9	[llvm-cov] Add support for loading coverage from multiple objects Differential Revision: https://reviews.llvm.org/D25086 llvm-svn: 285088	2016-10-25 17:40:55 +00:00
Nico Weber	04fbf573ce	Try removing an MSVC2010 workaround. Things seem to build fine locally without this, so let's see what the bots think. llvm-svn: 285087	2016-10-25 17:35:00 +00:00
Dan Gohman	f50d964bdb	[WebAssembly] Add immediate fields to call_indirect and memory operators. call_indirect, grow_memory, and current_memory now have immediate operands in the 0xd binary encoding. llvm-svn: 285085	2016-10-25 16:55:52 +00:00
Dehao Chen	c1472b5092	Move discriminator assignment to where it is used. (NFC) llvm-svn: 285084	2016-10-25 16:50:27 +00:00
Andrea Di Biagio	824cabd06d	[IndVarSimplify][Dwarf] When widening the IV increment, correctly set the debug loc. When indvars widened an induction variable, the debug location for the loop increment computation was incorrectly set equal to the debug loc of the loop latch terminator. This patch fixes the issue by propagating the correct location from the original loop increment instruction to the new widened increment. Differential Revision: https://reviews.llvm.org/D25872 llvm-svn: 285083	2016-10-25 16:45:17 +00:00
Pavel Labath	ec534e6dfe	Replace TimeValue by TimePoint in LegacyPassManager. NFC. llvm-svn: 285081	2016-10-25 16:20:07 +00:00
Geoff Berry	91e9a5cc23	[EarlyCSE] Make MemorySSA memory dependency check more aggressive. Now that MemorySSA keeps track of whether MemoryUses are optimized, use getClobberingMemoryAccess() to check MemoryUse memory dependencies since it should no longer be so expensive. This is a follow-up change to https://reviews.llvm.org/D25881 llvm-svn: 285080	2016-10-25 16:18:47 +00:00
Sanjay Patel	e3de152530	fix formatting; NFC llvm-svn: 285078	2016-10-25 16:12:31 +00:00
Ulrich Weigand	7bdb485e18	[SystemZ] Do not use LOC(G) for volatile loads It is not safe to use LOAD ON CONDITION to implement access to a memory location marked "volatile", since the architecture leaves it unspecified whether or not an access happens if the condition is false. The current code already appears to care about that: def LOC : CondUnaryRSY<"loc", 0xEBF2, nonvolatile_load, GR32, 4>; Unfortunately, that "nonvolatile_load" operator is simply ignored by the CondUnaryRSY class, and there was no test to catch it. llvm-svn: 285077	2016-10-25 15:39:15 +00:00
Sanjay Patel	d59f7f9047	[InstCombine] add test and code comment to show potentially misguided icmp trunc transform llvm-svn: 285075	2016-10-25 15:16:39 +00:00
Simon Pilgrim	5c3c9707c3	[X86][SSE] Add support for (V)PMOVSX* constant folding We already have (V)PMOVZX* combining support, this is the beginning of handling (V)PMOVSX* similarly - other combines in combineVSZext can be generalized in future patches. This unearthed an interesting bug in that we were generating illegal build vectors on 32-bit targets - it was proving difficult to create a test for it from PMOVZX, but it fired immediately with PMOVSX. I've created a more general form of the existing getConstVector to handle these cases - ideally this should be handled in non-target-specific code but I couldn't find an equivalent. Differential Revision: https://reviews.llvm.org/D25874 llvm-svn: 285072	2016-10-25 14:29:25 +00:00
Sanjay Patel	62fbfe4e21	[InstCombine] fix checks for previous commit (r285069) Accidentally put in the hoped-for checks ahead of the transform! llvm-svn: 285070	2016-10-25 13:30:19 +00:00
Sanjay Patel	97beffe037	[InstCombine] add tests for bitcast interference with min/max (PR28001) llvm-svn: 285069	2016-10-25 13:27:56 +00:00
Rafael Espindola	20aa1779d0	fix warning llvm-svn: 285064	2016-10-25 12:28:26 +00:00
Zvi Rackover	124470a202	[DAGCombine] Preserve shuffles when one of the vector operands is constant Summary: Do not perform combines such as: vector_shuffle<4,1,2,3>(build_vector(Ud, C0, C1 C2), scalar_to_vector(X)) -> build_vector(X, C0, C1, C2) Keeping the shuffle allows lowering the constant build_vector to a materialized constant vector (such as a vector-load from the constant-pool or some other idiom). Reviewers: delena, igorb, spatel, mkuper, andreadb, RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D25524 llvm-svn: 285063	2016-10-25 12:14:19 +00:00
Rafael Espindola	7912110ddc	Make the LTO comdat api more symbol table friendly. In an IR symbol table I would expect the comdats to be represented as: - A table of strings, one for each comdat name. - Each symbol has an optional index into that table. The natural api for accessing that would be InputFile: ArrayRef<StringRef> getComdatTable() const; Symbol: int getComdatIndex() const; This patch implements an API as close to that as possible. The implementation on top of the current IRObjectFile is a bit hackish, but should map just fine over a symbol table and is very convenient to use. llvm-svn: 285061	2016-10-25 12:02:03 +00:00
Benjamin Kramer	7df3043db3	Fix an unused warning in WebAssemblyInstPrinter with NDEBUG. Patch by Sam McCall! Differential Revision: https://reviews.llvm.org/D25934 llvm-svn: 285055	2016-10-25 09:08:50 +00:00
Craig Topper	01e4667e02	[AVX-512] Add support for creating SIGN_EXTEND_VECTOR_INREG and ZERO_EXTEND_VECTOR_INREG for 512-bit vectors to support vpmovzxbq and vpmovsxbq. Summary: The one tricky thing about this is that the sign/zero_extend_inreg uses v64i8 as an input type which isn't legal without BWI support. Though the vpmovsxbq and vpmovzxbq instructions themselves don't require BWI. To support this we need to add custom lowering for ZERO_EXTEND_VECTOR_INREG with v64i8 input. This can mostly reuse the existing sign extend code with a couple checks for sign extend vs zero extend added. Reviewers: delena, RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D25594 llvm-svn: 285053	2016-10-25 04:00:29 +00:00
Peter Collingbourne	4f3b2df9bb	GlobalDCE: Restore a statement accidentally removed in r285048. llvm-svn: 285052	2016-10-25 02:57:27 +00:00
Matthias Braun	c8440dddb2	MachineInstrBundle: Pass iterators to getBundle(Start\|End); NFC This is a function to go backwards in a block to find the first instruction in a bundle, so iterator is a more natural choice for parameter/return rather than a reference to a MachineInstruction. llvm-svn: 285051	2016-10-25 02:55:17 +00:00
Peter Collingbourne	ca7664e761	IR: Deduplicate getParent() functions on derived classes of GlobalValue into the base class. NFCI. llvm-svn: 285050	2016-10-25 02:54:08 +00:00
Kostya Serebryany	3364f90783	[libFuzzer] simplify the code for use_cmp, also use the position hint when available, add a test llvm-svn: 285049	2016-10-25 02:04:43 +00:00
Peter Collingbourne	7695cb6da8	GlobalDCE: Deduplicate code. NFCI. llvm-svn: 285048	2016-10-25 01:58:26 +00:00
Sanjay Patel	02be063351	[InstCombine] auto-generate checks llvm-svn: 285046	2016-10-25 00:44:02 +00:00
Sanjay Patel	5cff621dce	[InstCombine] auto-generate checks llvm-svn: 285045	2016-10-25 00:41:00 +00:00
Dan Gohman	48abaa9c74	[WebAssembly] Reorder load/store operands to match binary encoding. The p2align operand of a load/store is encoded before the offset operand; reorder the MachineInstr operands accordingly. llvm-svn: 285044	2016-10-25 00:17:11 +00:00
Vedant Kumar	5c61c70387	[llvm-cov] Do not print out the filename of the object file When we load coverage data from multiple objects, we don't have a way to attribute a source object to a function record. Printing out the object filename next to the source filename is already not very useful: soon, it'll actually become misleading. Stop printing out the filename now. llvm-svn: 285043	2016-10-25 00:08:33 +00:00
Dan Gohman	3acb187d95	[WebAssembly] Implement more WebAssembly binary encoding. This changes locals from being declared by the emitLocal hook in WebAssemblyTargetStreamer, rather than with an instruction. After exploring the infastructure in LLVM more, this seems to make more sense since declaring locals doesn't use an encoded opcode. This also adds more 0xd opcodes, type encodings, and miscellaneous binary encoding bits. llvm-svn: 285040	2016-10-24 23:27:49 +00:00
Matthias Braun	8b38ffaa98	CodeGen/Passes: Pass MachineFunction as functor arg; NFC Passing a MachineFunction as argument is more natural and avoids an unnecessary round-trip through the logic determining the correct Subtarget because MachineFunction already has a reference anyway. llvm-svn: 285039	2016-10-24 23:23:02 +00:00
Sanjay Patel	60f80d7a8b	[InstCombine] regenerate some checks llvm-svn: 285036	2016-10-24 22:50:26 +00:00
Justin Bogner	b63109944d	cmake: Rename installhdrs to install-llvm-headers and fix the dependencies The installhdrs target was inconsistently named and would behave differently depending on whether or not you ran a build first. This renames it to install-llvm-headers to match other target names and adds a dependency on intrinsics_gen so that it will always install the same set of things. llvm-svn: 285035	2016-10-24 21:58:58 +00:00
Eli Friedman	c5b7262073	Fix regression from my recent GlobalsAA fix. There are two fixes here: one, AnalyzeUsesOfPointer can't return false until it has checked all the uses of the pointer. Two, if a global uses another global, we have to assume the address of the first global escapes. Fixes https://llvm.org/bugs/show_bug.cgi?id=30707 . Differential Revision: https://reviews.llvm.org/D25798 llvm-svn: 285034	2016-10-24 21:47:44 +00:00
Simon Pilgrim	e3e6585c2d	[SelectionDAG] Update ComputeNumSignBits SRA/SHL handlers to accept scalar or vector splats Use isConstOrConstSplat helper. Also use APInt instead of getZExtValue directly to avoid out of range issues. llvm-svn: 285033	2016-10-24 21:47:19 +00:00
Justin Bogner	51c6d93fed	cmake: Make /usr/share/cmake installable with LLVM_DISTRIBUTION_COMPONENTS Add a cmake-exports install component and appropriate targets for LLVM_DISTRIBUTION_COMPONENTS to work with. llvm-svn: 285030	2016-10-24 21:40:15 +00:00
Matthias Braun	fc371558a0	Use MachineInstr::mop_iterator instead of MIOperands; NFC (Const)?MIOperands is equivalent to the C++ style MachineInstr::mop_iterator. Use the latter for consistency except for a few callers of MIOperands::analyzePhysReg(). llvm-svn: 285029	2016-10-24 21:36:43 +00:00
Kevin Enderby	79d6c63f61	nother additional error check for an invalid Mach-O file when contained in a Mach-O universal file and the cputypes in both headers don’t match. llvm-svn: 285026	2016-10-24 21:15:11 +00:00
Simon Pilgrim	d8ec09c74f	Use SDValue::getConstantOperandVal() helper. NFCI. llvm-svn: 285025	2016-10-24 20:56:52 +00:00
Vedant Kumar	26f89df41f	[pbqp] unique_ptr-ify (Vector\|Matrix)::Data, NFC. Suggested by David Blaikie. llvm-svn: 285024	2016-10-24 20:51:46 +00:00
Vedant Kumar	06ca6ea6c3	[pbqp] Delete some dead code, NFC. I took the opportunity to replace some copy\|move constructors\|assignment operators with default implementations. As a follow-up, I plan on threading unique_ptr<T []> through a few areas per David Blaikie's advice. Differential Revision: https://reviews.llvm.org/D24424 llvm-svn: 285018	2016-10-24 20:35:35 +00:00
Dan Gohman	5d3391f859	[WebAssembly] Fix a broken URL. llvm-svn: 285017	2016-10-24 20:35:17 +00:00
Dan Gohman	4becc58587	[WebAssembly] Define the `end` opcode value. CFGStackify differentiates between END_LOOP and END_BLOCK, but wasm itself doesn't. For now, just use the same opcode for both. llvm-svn: 285016	2016-10-24 20:32:04 +00:00
Dan Gohman	c968297b95	[WebAssembly] Update opcode values according to recent spec changes. This corresponds to the "0xd" opcode renumbering. llvm-svn: 285014	2016-10-24 20:21:49 +00:00
Dan Gohman	4fc4e42dea	[WebAssembly] Add an option to make get_local/set_local explicit. This patch adds a pass, controlled by an option and off by default for now, for making implicit get_local/set_local explicit. This simplifies emitting wasm with MC. Differential Revision: https://reviews.llvm.org/D25836 llvm-svn: 285009	2016-10-24 19:49:43 +00:00
Davide Italiano	c3e0ce8f85	Merge two if conditions into one. NFCI. llvm-svn: 285008	2016-10-24 19:41:47 +00:00
Peter Collingbourne	6733564e5a	Target: Change various section classifiers in TargetLoweringObjectFile to take a GlobalObject. These functions are about classifying a global which will actually be emitted, so it does not make sense for them to take a GlobalValue which may for example be an alias. Change the Mach-O object writer and the Hexagon, Lanai and MIPS backends to look through aliases before using TargetLoweringObjectFile interfaces. These are functional changes but all appear to be bug fixes. Differential Revision: https://reviews.llvm.org/D25917 llvm-svn: 285006	2016-10-24 19:23:39 +00:00
Sanjay Patel	87eea10711	[x86] add tests for {-1,0,1} select of constants llvm-svn: 285005	2016-10-24 19:13:29 +00:00
Mandeep Singh Grang	da99e33ae3	[llvm] Remove redundant --check-prefix=CHECK from tests Reviewers: MatzeB, mcrosier, rengolin Differential Revision: https://reviews.llvm.org/D25894 llvm-svn: 285003	2016-10-24 18:57:55 +00:00
Peter Collingbourne	16e9b944e9	CodeGen: Do not add a global's address space to the folding set profile. It is already part of the type (which is part of the global, which is already being added), so there's no need to do it. llvm-svn: 285002	2016-10-24 18:56:09 +00:00
Adrian Prantl	28d2d281e7	add-discriminators: Fix handling of lexical scopes. This fixes a bug in the handling of lexical scopes, when more than one scope is defined on the same line or functions are inlined into call sites that are on the same line as the function definition. This situation can easily happen in macro expansions. The problem is solved by introducing a SmallDenseMap<DIScope , DILexicalBlockFile , 1> that keeps track of all the different lexical scopes that share a line/file location. Fixes PR30681. llvm-svn: 284998	2016-10-24 18:23:51 +00:00
Benjamin Kramer	bc473621b8	Clarify that MSVC is not the issue here anymore. llvm-svn: 284997	2016-10-24 18:11:05 +00:00
Krzysztof Parzyszek	eb6172404d	Revert r284972 and remove other defaulted copy/move constructors/= David Blaikie pointed out that we get them for free without having to write anything. llvm-svn: 284996	2016-10-24 17:40:46 +00:00
Ehsan Amiri	c90b02cf50	[PPC] Generate positive FP zero using xor insn instead of loading from constant area https://reviews.llvm.org/D23614 Currently we load +0.0 from constant area. That can change to be generated using XOR instruction. llvm-svn: 284995	2016-10-24 17:31:09 +00:00
Hans Wennborg	47f2616b6a	ReleaseNotes: mention new compiler requirements llvm-svn: 284994	2016-10-24 17:29:52 +00:00
Eli Friedman	b37864b58d	Revert r284580+r284917. ("Synthesize TBB/TBH instructions") The optimization has correctness issues, so reverting for now to fix tests on thumb1 targets. llvm-svn: 284993	2016-10-24 17:20:50 +00:00
Simon Pilgrim	a072e375b5	Removed FIXME from include ordering comment Nothing to fix, it's just the way it has to be. llvm-svn: 284991	2016-10-24 17:15:05 +00:00
Rong Xu	b05bac940d	Check the number of Args in LibCallsShrinkWrap. Some library fucntions can have no argument. llvm-svn: 284989	2016-10-24 16:50:12 +00:00
Evandro Menezes	eff2bd9d4f	[AArch64] Optionally use the Newton series for reciprocal estimation Add support for estimating the square root or its reciprocal and division or reciprocal using the combiner generic Newton series. Differential revision: https://reviews.llvm.org/D25291 llvm-svn: 284986	2016-10-24 16:14:58 +00:00
Justin Lebar	566eb864b6	[ADT] static_assert that SmallDenseMap is instantiated with a power-of-two number of inline buckets. Summary: If you try to instantiate it with a non-power-of-two buckets, DenseMap will assert at runtime (!) if we ever outgrow our inline storage. I believe using a constexpr function inside of a static_assert is safe now that we've unsupported MSVC 2013 and GCC < 4.8. Reviewers: bkramer, qcolombet, escha Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D25900 llvm-svn: 284985	2016-10-24 16:06:53 +00:00
Geoff Berry	6815468768	[EarlyCSE] Optimize MemoryPhis and reduce memory clobber queries w/ MemorySSA Summary: When using MemorySSA, re-optimize MemoryPhis when removing a store since this may create MemoryPhis with all identical arguments. Also, when using MemorySSA to check if two MemoryUses are reading from the same version of the heap, use the defining access instead of calling getClobberingAccess, since the latter can currently result in many more AA calls. Once the MemorySSA use optimization tracking changes are done, we can remove this limitation, which should result in more loads being CSE'd. Reviewers: dberlin Subscribers: mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D25881 llvm-svn: 284984	2016-10-24 15:54:00 +00:00
Ehsan Amiri	1f31e9157d	[PPC] Better codegen for AND, ANY_EXT, SRL sequence https://reviews.llvm.org/D24924 This improves the code generated for a sequence of AND, ANY_EXT, SRL instructions. This is a targetted fix for this special pattern. The pattern is generated by target independet dag combiner and so a more general fix may not be necessary. If we come across other similar cases, some ideas for handling it are discussed on the code review. llvm-svn: 284983	2016-10-24 15:46:58 +00:00
Sanjay Patel	7b12e36740	[x86] regenerate checks llvm-svn: 284982	2016-10-24 15:43:40 +00:00
Nicolai Haehnle	a785209bc2	AMDGPU: Fix Two Address problems with v_movreld Summary: The v_movreld machine instruction is used with three operands that are in a sense tied to each other (the explicit VGPR_32 def and the implicit VGPR_NN def and use). There is no way to express that using the currently available operand bits, and indeed there are cases where the Two Address instructions pass does the wrong thing. This patch introduces a new set of pseudo instructions that are identical in intended semantics as v_movreld, but they only have two tied operands. Having to add a new set of pseudo instructions is admittedly annoying, but it's a fairly straightforward and solid approach. The only alternative I see is to try to teach the Two Address instructions pass about Three Address instructions, and I'm afraid that's trickier and is going to end up more fragile. Note that v_movrels does not suffer from this problem, and so this patch does not touch it. This fixes several GL45-CTS.shaders.indexing.* tests. Reviewers: tstellarAMD, arsenm Subscribers: kzhuravl, wdng, yaxunl, llvm-commits, tony-tye Differential Revision: https://reviews.llvm.org/D25633 llvm-svn: 284980	2016-10-24 14:56:02 +00:00
Nico Weber	b38d341106	Revert 284971. It seems to break selfhost on some bots, see e.g. http://lab.llvm.org:8011/builders/clang-x86-windows-msvc2015/builds/21 http://lab.llvm.org:8011/builders/clang-ppc64be-linux-multistage/builds/20 http://lab.llvm.org:8011/builders/clang-ppc64be-linux-lnt/builds/22 llvm-svn: 284979	2016-10-24 14:52:04 +00:00
Nirav Dave	1a9044b782	[MC] Fix Various End Of Line Comment checkings Fix AsmParser lines to correctly handle end-of-line pre-processor comments parsing when '#' is not the assembly line comment prefix. Reviewers: rnk Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D25567 llvm-svn: 284978	2016-10-24 14:35:29 +00:00
Pavel Labath	676a875b06	[Chrono] Fix !HAVE_FUTIMENS build If we don't have futimens(), we fall back to futimes(), which only supports microsecond timestamps. In that case, we need to explicitly cast away the extra precision in setLastModificationAndAccessTime(). llvm-svn: 284977	2016-10-24 14:19:28 +00:00
Pavel Labath	51c454c1a9	Remove unused #includes of TimeValue.h. NFC. llvm-svn: 284975	2016-10-24 14:00:26 +00:00
Pavel Labath	bff47b51b6	[Object] Replace TimeValue with std::chrono Summary: Most of the changes are very straight-forward. The only choice I had to make was to use second-precision time points in the Archive classes. I did this because the archive files use that precision in the on-disk representation anyway. Reviewers: rafael, zturner Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D25773 llvm-svn: 284974	2016-10-24 13:38:27 +00:00
Joel Jones	504bf334b0	AArch64 ILP32 relocations for assembly and ELF Summary: Add relocations for AArch64 ILP32. Includes: - Addition of definitions for R_AARCH32_* - Definition of new -target-abi: ilp32 - Definition of data layout string - Tests for added relocations. Not comprehensive, but matches existing tests for 64-bit. Renames "CHECK-OBJ" to "CHECK-OBJ-LP64". - Tests for llvm-readobj Reviewers: zatrazz, peter.smith, echristo, t.p.northover Subscribers: aemerson, rengolin, mehdi_amini Differential Revision: https://reviews.llvm.org/D25159 llvm-svn: 284973	2016-10-24 13:37:13 +00:00
Krzysztof Parzyszek	f74683f930	[RDF] Add default move constructors/assignment operators llvm-svn: 284972	2016-10-24 13:15:20 +00:00
Pablo Barrio	f9e0d0b7d0	[JumpThreading] Unfold selects that depend on the same condition Summary: These are good candidates for jump threading. This enables later opts (such as InstCombine) to combine instructions from the selects with instructions out of the selects. SimplifyCFG will fold the select again if unfolding wasn't worth it. Patch by James Molloy and Pablo Barrio. Reviewers: reames, bkramer, mcrosier, gberry, haicheng, jmolloy, sebpop Subscribers: jojo, rengolin, llvm-commits Differential Revision: https://reviews.llvm.org/D25477 llvm-svn: 284971	2016-10-24 13:04:45 +00:00
Simon Pilgrim	6d2de6aa9e	Fix windows builds by swapping windows.h and wincrypt.h ordering. We need to include windows.h first even though it breaks default include ordering rules llvm-svn: 284968	2016-10-24 12:39:23 +00:00
Pavel Labath	757ca886cd	Remove TimeValue usage from llvm/Support Summary: This is a follow-up to D25416. It removes all usages of TimeValue from llvm/Support library (except for the actual TimeValue declaration), and replaces them with appropriate usages of std::chrono. To facilitate this, I have added small utility functions for converting time points and durations into appropriate OS-specific types (FILETIME, struct timespec, ...). Reviewers: zturner, mehdi_amini Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D25730 llvm-svn: 284966	2016-10-24 10:59:17 +00:00
Simon Dardis	9c34854833	[mips] synci microMIPS instruction definition. Add synci to the microMIPS instruction definitions, mark the MIPS sync & synci as not being part of microMIPS. This does not cover the sync instruction alias, as that will be handled with a different patch. Add sync to the valid tests for microMIPS. Reviewers: vkalintiris Differential Revision: https://reviews.llvm.org/D25795 llvm-svn: 284962	2016-10-24 10:23:59 +00:00
Hal Finkel	fd448408e3	[llvm-opt-report] Fix unroll-count reporting Fix the implementation of OptReportLocationInfo's operator < so that contexts with different unroll counts are reported separately. llvm-svn: 284957	2016-10-24 05:07:18 +00:00
Craig Topper	8ec5c7326d	[AVX-512] Remove masked pmin/pmax intrinsics and autoupgrade to native IR. Clang patch to replace 512-bit vector and 64-bit element versions with native IR will follow. llvm-svn: 284955	2016-10-24 04:04:16 +00:00
Sanjay Patel	9ca028c2d6	[DAG] enhance computeKnownBits to handle SRL/SRA with vector splat constant llvm-svn: 284953	2016-10-23 23:13:31 +00:00
Simon Pilgrim	d06641d3dc	Use SDValue::getConstantOperandVal() helper. NFCI. llvm-svn: 284949	2016-10-23 20:17:21 +00:00
Justin Lebar	a45e6fde58	Remove LLVM_CONSTEXPR. Summary: With MSVC 2013 and GCC < 4.8 gone, we can use the "constexpr" keyword. Reviewers: bkramer, mehdi_amini Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D25901 llvm-svn: 284947	2016-10-23 19:39:16 +00:00
Simon Pilgrim	d09c04d267	[CostModel][X86] Added tests for current integer signed/unsigned remainder costs llvm-svn: 284940	2016-10-23 18:35:02 +00:00
Simon Pilgrim	6ac1e98b09	[X86][SSE] Add SSE41/AVX1 costs for vector shifts. We were defaulting to SSE2 costs which weren't taking into account the availability of PBLENDW/PBLENDVB to improve merging of per-element shift results. llvm-svn: 284939	2016-10-23 16:49:04 +00:00
Simon Pilgrim	e16b1e2271	[CostModel][X86] Added tests for current integer trunc costs llvm-svn: 284938	2016-10-23 15:17:52 +00:00
Simon Pilgrim	96ef0c1103	Use APInt::isAllOnesValue instead of popcnt. NFCI. More obvious implementation and faster too. llvm-svn: 284937	2016-10-23 15:09:44 +00:00
Dylan McKay	479a13c0aa	[AVR] Add the machine code disassembler This adds a super basic implementation of a machine code disassembler. It doesn't support any operands with custom encoding. llvm-svn: 284930	2016-10-22 23:57:59 +00:00
Simon Pilgrim	d3829c89bc	[X86][AVX512VL] Added support for combining target 256-bit shuffles to AVX512VL VPERMV3 llvm-svn: 284922	2016-10-22 20:15:39 +00:00
Simon Pilgrim	56c0524f0f	[X86][AVX512] Added support for combining target shuffles to AVX512 VPERMV3 llvm-svn: 284921	2016-10-22 19:53:59 +00:00
Brian Gesiak	ae8c2517b4	[lit] Add more testing instructions to README Summary: r283710 introduced two regressions, one to llvm-lit, and the other to lit executables that were installed via setuptools. Add instructions on how to test for these regressions in the future. Reviewers: ddunbar, delcypher, beanz, chapuni, cmatthews, echristo Subscribers: llvm-commits, mehdi_amini Differential Revision: https://reviews.llvm.org/D25459 llvm-svn: 284919	2016-10-22 17:27:31 +00:00
James Molloy	2bae8640d7	[ARM] Fix crash in ConstantIslands tPCRelJT may not be the first instruction in a block. Check that instead of dereferencing a broken iterator. llvm-svn: 284917	2016-10-22 09:58:37 +00:00
Zvi Rackover	8eb99d88d4	[X86] Apply the Update LLC Test Checks tool on the mmx-bitcast test llvm-svn: 284916	2016-10-22 07:15:00 +00:00
Craig Topper	b084c90a18	[X86] Add support for printing shuffle comments for VALIGN instructions. llvm-svn: 284915	2016-10-22 06:51:56 +00:00
Craig Topper	7b2b8db438	[X86] Add support for lowering v4i64 and v8i64 shuffles directly to PALIGNR. I think shuffle combine can figure it out later, but we should try to get it right up front. llvm-svn: 284914	2016-10-22 06:51:52 +00:00
Craig Topper	9f374533e3	[X86] Remove unnecessary AVX2 check that was already covered by an assertion earlier in the function. NFC llvm-svn: 284913	2016-10-22 06:51:49 +00:00
Craig Topper	bea5cb5491	[X86] Remove 128-bit lane handling from the main loop of matchVectorShuffleAsByteRotate. Instead check for is128LaneRepeatedSuffleMask before the loop and just loop over the repeated mask. I plan to use the loop to support VALIGND/Q shuffles so this makes it easier to reuse. llvm-svn: 284912	2016-10-22 06:51:44 +00:00
Simon Pilgrim	0d376bcbf0	[X86][SSE] Use getConstVector helper for VPERMV mask generation. NFCI. llvm-svn: 284911	2016-10-22 06:18:36 +00:00
Daniel Berlin	f5361139bb	Now that VS2013 is gone, make a memoryssa structure an anonymous union again llvm-svn: 284910	2016-10-22 04:15:41 +00:00
Kostya Serebryany	65f102d4d2	[libFuzzer] mutation: insert the size of the input in bytes as one of the ways to mutate a binary integer llvm-svn: 284909	2016-10-22 03:48:53 +00:00
Gerolf Hoflehner	9e2afa8bd7	[BasicAA] Fix - missed alias in GEP expressions In BasicAA GEP operand values get adjusted ("wrap-around") based on the pointersize. Otherwise, in non-64b modes, AA could report false negatives. However, a wrap-around is valid only for a fully evaluated expression. It had been introduced to fix an alias problem in http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20160118/326163.html. This commit restricts the wrap-around to constant gep operands only where the value is known at compile-time. llvm-svn: 284908	2016-10-22 02:41:39 +00:00
Davide Italiano	738837eed9	[CtorUtils] Modernize. No functional changes intended. llvm-svn: 284904	2016-10-22 01:21:24 +00:00
Kostya Serebryany	10ae9e23a3	[libFuzzer] typo in a test llvm-svn: 284903	2016-10-22 01:07:38 +00:00
Kostya Serebryany	2bfff021ad	[libFuzzer] add a test for asan's strict_string_checks=1 llvm-svn: 284902	2016-10-22 00:05:44 +00:00
David L. Jones	0db4549a1e	Fix map insertion that is elided in release build. The assert() macro doesn't actually execute its body in Release builds, so using it to check cache invariants requires that the insertion be outside of the assert() statement. This change does that, and also makes sure to return the actual map contents. llvm-svn: 284898	2016-10-21 23:30:39 +00:00
Sanjay Patel	5b7767f646	[x86] add test for missing vector SRA combine via computeKnownBits llvm-svn: 284896	2016-10-21 23:02:31 +00:00
Justin Lebar	7b15e2946f	[ADT] Don't rely on string literals not being convertible to non-const char* in CachedHashString. The build was breaking on some platforms because we assumed that CachedHashString("foo") would match the CachedHashString(StringRef) constructor rather than the CachedHashString(char) constructor. To fix this, provide a CachedHashString(const char) constructor, and add a dummy argument to the old CachedHashString(char*) constructor. llvm-svn: 284892	2016-10-21 22:10:23 +00:00
Konstantin Zhuravlyov	fda33eaf0c	[AMDGPU] Perform uchar to float combine for ISD::SINT_TO_FP This will prevent following regression when enabling i16 support (D18049): test/CodeGen/AMDGPU/cvt_f32_ubyte.ll Differential Revision: https://reviews.llvm.org/D25805 llvm-svn: 284891	2016-10-21 22:10:03 +00:00
Justin Lebar	5e83dfedb8	Switch SmallSetVector to use DenseSet when it overflows its inline space. Summary: SetVector already used DenseSet, but SmallSetVector used std::set. This leads to surprising performance differences. Moreover, it means that the set of key types accepted by SetVector and SmallSetVector are quite different! In order to make this change, we had to convert some callsites that used SmallSetVector<std::string, N> to use SmallSetVector<CachedHashString, N> instead. Reviewers: timshen Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D25648 llvm-svn: 284887	2016-10-21 21:45:01 +00:00
Justin Bogner	8659d16631	Support: Annotate Error and Expected<> with LLVM_NODISCARD It's always a mistake to ignore return values with Error or Expected types. llvm-svn: 284884	2016-10-21 21:01:12 +00:00
Bob Haarman	62c730c9bd	fixed typo in InstrProf.h; NFC llvm-svn: 284880	2016-10-21 20:38:37 +00:00
Justin Lebar	3c37dc0cb2	[ADT] Get rid of use of LLVM_NOEXCEPT in CachedHashString.h. LLVM_NOEXCEPT is no longer necessary (yay). llvm-svn: 284876	2016-10-21 20:28:00 +00:00
Tom Stellard	6c7dd980e4	AMDGPU/SI: Fix crash caused by r284267 Reviewers: arsenm, nhaehnle Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, tony-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D25782 llvm-svn: 284875	2016-10-21 20:25:11 +00:00
Sanjay Patel	ca92c36e01	[DAG] enhance computeKnownBits to handle SHL with vector splat constant Also, use APInt to avoid crashing on types larger than vNi64. llvm-svn: 284874	2016-10-21 20:16:27 +00:00
Justin Lebar	c9d66eb05b	[ADT] Compare strings' hashes first before comparing their values. Summary: We already have the hashes in hand, and comparing hashes should be much more discriminatory than comparing the StringRefs' sizes. Reviewers: rafael Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D25705 llvm-svn: 284872	2016-10-21 20:10:51 +00:00
Justin Lebar	d87ea9a1c9	[ADT] Add CachedHashString. Summary: This is like CachedHashStringRef, but owns its data. This lets us use strings inside of DenseMaps. Reviewers: timshen Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D25645 llvm-svn: 284871	2016-10-21 20:10:48 +00:00
Justin Lebar	245c3e75cd	[IR] Add DenseMapInfo<CallSite>. Summary: A CallSite is basically an Instruction, and you can put Instructions into DenseMaps, so you should be able to do the same with CallSites. This is used in a later patch. Reviewers: timshen Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D25643 llvm-svn: 284870	2016-10-21 20:10:44 +00:00
Li Huang	faa857dba7	[SCEV] Memoize visitMulExpr results in SCEVRewriteVisitor. Summary: When SCEVRewriteVisitor traverses the SCEV DAG, it may visit the same SCEV multiple times if this SCEV is referenced by multiple other SCEVs. This has exponential time complexity in the worst case. Memoizing the results will avoid re-visiting the same SCEV. Add a map to save the results, and override the visit function of SCEVVisitor. Now SCEVRewriteVisitor only visit each SCEV once and thus returns the same result for the same input SCEV. This patch fixes PR18606, PR18607. Reviewers: Sanjoy Das, Mehdi Amini, Michael Zolotukhin Differential Revision: https://reviews.llvm.org/D25810 llvm-svn: 284868	2016-10-21 20:05:21 +00:00
Kevin Enderby	2f9d8d0702	Fix a bug in the code of llvm-cxxdump in dumpArchive() when iterating over an archive with object and non-object members that would cause an Abort because to was not calling consumeError() when the code was wanting to ignore a non-object file. Found by Justin Bogner! llvm-svn: 284867	2016-10-21 20:03:14 +00:00
Kostya Serebryany	42909a6f3a	[libFuzzer] mention one more trophie llvm-svn: 284866	2016-10-21 20:01:45 +00:00
Peter Collingbourne	ecdd58f1d6	Analysis: Move llvm::getConstantRangeFromMetadata to IR library. We're about to start using it there. Differential Revision: https://reviews.llvm.org/D25877 llvm-svn: 284865	2016-10-21 19:59:26 +00:00
Peter Collingbourne	e9bd49824d	X86: Improve BT instruction selection for 64-bit values. If a 64-bit value is tested against a bit which is known to be in the range [0..31) (modulo 64), we can use the 32-bit BT instruction, which has a slightly shorter encoding. Differential Revision: https://reviews.llvm.org/D25862 llvm-svn: 284864	2016-10-21 19:57:55 +00:00
Simon Pilgrim	ab48872313	[X86][AVX512BWVL] Added support for lowering v16i16 shuffles to AVX512BWVL vpermw llvm-svn: 284863	2016-10-21 19:54:38 +00:00
Bob Haarman	653baa2aaa	[pdb] added support for dumping globals stream Summary: This adds support for dumping the globals stream from PDB files using llvm-pdbdump, similar to the support we have for the publics stream. Reviewers: ruiu, zturner Subscribers: beanz, mgorny, modocache Differential Revision: https://reviews.llvm.org/D25801 llvm-svn: 284861	2016-10-21 19:43:19 +00:00
Simon Pilgrim	da814cba0d	[X86][AVX512BWVL] Added support for combining target v16i16 shuffles to AVX512BWVL vpermw llvm-svn: 284860	2016-10-21 19:40:29 +00:00
Simon Pilgrim	0109bf116f	[X86][AVX512] Added support for combining target shuffles to AVX512 vpermpd/vpermq/vpermps/vpermd/vpermw llvm-svn: 284858	2016-10-21 19:18:09 +00:00
Krzysztof Parzyszek	6e7fa99d3a	[RDF] Use RegisterId typedef more consistently, NFC llvm-svn: 284857	2016-10-21 19:12:13 +00:00
Anna Thomas	0860259434	[StripGCRelocates] New pass to remove gc.relocates added by RS4GC Summary: Utility pass to remove gc.relocates created by rewrite statepoints for GC. With respect to safepoint verification, the IR generated would be incorrect, and cannot run as such. This would be a single transformation on the final optimized IR. The benefit of the pass is for easy analysis when the IRs are 'polluted' by too many gc.relocates. Added tests. test run: All RS4GC tests with -verify option. Local downstream tests on large IR files. This also works when the pointer being gc.relocated is another gc.relocate. Reviewers: sanjoy, reames Subscribers: beanz, mgorny, llvm-commits Differential Revision: https://reviews.llvm.org/D25096 llvm-svn: 284855	2016-10-21 18:43:16 +00:00
Kevin Enderby	41c9c00bf0	For llvm-objdump for Mach-O files add printing of the ARM_THREAD_STATE in the same format as otool-classic(1) on darwin. Also remove an extra space in printing the initprot to make the output match otool-classic(1) on darwin. rdar://28851457 llvm-svn: 284852	2016-10-21 18:22:35 +00:00
Sanjay Patel	81029f6a76	[DAG] fold negation of sign-bit 0 - X --> 0, if the sub is NUW 0 - X --> 0, if X is 0 or the minimum signed value and the sub is NSW 0 - X --> X, if X is 0 or the minimum signed value This is the DAG equivalent of: https://reviews.llvm.org/rL284649 plus the fold for the NUW case which already existed in InstSimplify. Note that we miss a vector fold because of a deficiency in the DAG version of computeKnownBits(). llvm-svn: 284844	2016-10-21 17:24:26 +00:00
Sanjay Patel	97de8a4c3a	[x86] add tests for potential negation folds These are the backend equivalents for the tests added in r284627. The patterns may emerge late, so we should have folds for these in the DAG too. llvm-svn: 284842	2016-10-21 16:56:29 +00:00
Krzysztof Parzyszek	b71085b547	[Hexagon] Handle spills of partially defined double vector registers After register allocation it is possible to have a spill of a register that is only partially defined. That in itself it fine, but creates a problem for double vector registers. Stores of such registers are pseudo instructions that are expanded into pairs of individual vector stores, and in case of a partially defined source, one of the stores may use an entirely undefined register. To avoid this, track the defined parts and only generate actual stores for those. llvm-svn: 284841	2016-10-21 16:38:29 +00:00
Derek Schuff	6f69783f1f	[WebAssembly] Fix for 0xc call_indirect changes Summary: Need to reorder the operands to have the callee as the last argument. Adds a pseudo-instruction, and a pass to lower it into a real call_indirect. This is the first of two options for how to fix the problem. Reviewers: dschuff, sunfish Subscribers: jfb, beanz, mgorny, llvm-commits Differential Revision: https://reviews.llvm.org/D25708 llvm-svn: 284840	2016-10-21 16:38:07 +00:00
Abderrazek Zaafrani	9daf8110c8	Set the vectorizer MaxInterleaveFactor for Exynos. llvm-svn: 284839	2016-10-21 16:28:27 +00:00
Reid Kleckner	ac2a2a86e4	Fix -Wunused-variable warning in libFuzzer llvm-svn: 284838	2016-10-21 16:26:27 +00:00
Simon Pilgrim	372e916a3f	[X86][SSE] Regenerated sext/zext constant folding tests and added i686 tests llvm-svn: 284837	2016-10-21 16:22:16 +00:00
Simon Pilgrim	2d96daa885	[X86] Use DAG::getBuildVector helper wrapper where possible. NFCI. llvm-svn: 284835	2016-10-21 16:07:51 +00:00
Konstantin Zhuravlyov	0ff17b627d	[MachineMemOperand][AtomicSDNode] Remove getSuccessOrdering() Differential Revision: https://reviews.llvm.org/D25786 llvm-svn: 284834	2016-10-21 16:02:35 +00:00
Simon Pilgrim	098bd72804	[X86][SSE] Regenerated chained pmovsx store tests and added i686 tests llvm-svn: 284833	2016-10-21 15:51:24 +00:00
Abderrazek Zaafrani	9f382f53d1	Test commit llvm-svn: 284832	2016-10-21 15:24:08 +00:00
Artur Pilipenko	47dc098c06	[LVI] Fix a bug with a guard being the very first instruction in a BB not taken into account While looking for guards use reverse iterator and scan up to rend() not to begin() llvm-svn: 284827	2016-10-21 15:02:21 +00:00
Sanjay Patel	501be9b3d7	fix variable names; NFCI Because we're just 'or-ing' these 2 variables later in the code, I don't think there's a logical bug here, but of course the string with "no size" is the one that should have the size suffix stripped off. llvm-svn: 284826	2016-10-21 14:58:30 +00:00
Artem Tamazov	751985a757	[AMDGPU][mc] Fix ds_min/max[_rtn]_f32 - extra source operand removed. Fixes Bug 28215. Lit tests updated. Differential Revision: https://reviews.llvm.org/D25837 llvm-svn: 284825	2016-10-21 14:49:22 +00:00
Sanjay Patel	cbaba93ce8	[DAG] use SDNode flags 'nsz' to enable fadd/fsub with zero folds As discussed in D24815, let's start the process of killing off the broken fast-math global state housed in TargetOptions and eliminate the need for function-level fast-math attributes. Here we enable two similar folds that are possible when we don't care about signed-zero: fadd nsz x, 0 --> x fsub nsz 0, x --> -x Note that although the test cases include a 'sin' function call, I'm side-stepping the FMF-on-calls question (and lack of support in the DAG) for now. It's not needed for these tests - isNegatibleForFree/GetNegatedExpression just look through a ISD::FSIN node. Also, when we create an FNEG node and propagate the Flags of the FSUB to it, this doesn't actually do anything today because Flags are silently dropped for any node that is not a binary operator. Differential Revision: https://reviews.llvm.org/D25297 llvm-svn: 284824	2016-10-21 14:36:58 +00:00
Simon Pilgrim	c98d99a600	[X86][AVX2] Begun generalizing lowering to VPERMD/VPERMPS in preparation for AVX512 support. llvm-svn: 284823	2016-10-21 13:00:47 +00:00
Simon Pilgrim	f3e9ddbf77	Wdocumentation fix llvm-svn: 284822	2016-10-21 12:51:16 +00:00
Simon Pilgrim	32b06235da	[X86][AVX512] Add mask/maskz writemask support to subvector broadcast shuffle decode comments llvm-svn: 284821	2016-10-21 12:14:24 +00:00
John Brawn	84b21835f1	[LoopUnroll] Keep the loop test only on the first iteration of max-or-zero loops When we have a loop with a known upper bound on the number of iterations, and furthermore know that either the number of iterations will be either exactly that upper bound or zero, then we can fully unroll up to that upper bound keeping only the first loop test to check for the zero iteration case. Most of the work here is in plumbing this 'max-or-zero' information from the part of scalar evolution where it's detected through to loop unrolling. I've also gone for the safe default of 'false' everywhere but howManyLessThans which could probably be improved. Differential Revision: https://reviews.llvm.org/D25682 llvm-svn: 284818	2016-10-21 11:08:48 +00:00
Simon Pilgrim	f447fbf913	[X86][AVX] Add 32-bit target tests for vector lzcnt/tzcnt to demonstrate missed folding opportunities llvm-svn: 284816	2016-10-21 10:50:52 +00:00
Bjorn Pettersson	9fcd605d1e	[AArch64] Corrected spill size for DDD register class. NFCI Summary: The spill size was incorrectly set to 196 bits, which isn't a multiple of 8. This problem was detected when experimenting with asserts that the spill size should be a multiple of the byte size. New corrected value for the spill size is set to 192 bits. Note that tablegen (RegisterInfoEmitter) will divide the size set in the RegisterClass definition by 8. So this change should not have any impact on the tablegen output (trunc(192/8) == trunc(196/8) == 24 bytes). Reviewers: t.p.northover Subscribers: llvm-commits, aemerson, rengolin Differential Revision: https://reviews.llvm.org/D25818 llvm-svn: 284814	2016-10-21 09:53:42 +00:00
Benjamin Kramer	0d41eb8cd1	Fix WebAssembly test after r284757. The change to MachineSink shuffles code around, disable it. llvm-svn: 284813	2016-10-21 09:51:41 +00:00
Benjamin Kramer	aa48572b9d	[Support] Fix AlignOf test on i386-linux. On i386 alignof(double) = 8 is not the same as alignof(struct { double }) = 4. This used to be not an issue because the old implementation always measured alignment inside of structs. Wrap a dummy struct around the test to avoid this issue. llvm-svn: 284812	2016-10-21 09:15:57 +00:00
Craig Topper	c1db0db864	[AVX-512] Add tests to show opportunities for commuting vpermi2/vpermt2 instructions. Commuting will be added in a future commit. llvm-svn: 284808	2016-10-21 05:55:40 +00:00
Davide Italiano	d15477b09d	Revert "[GVN/PRE] Hoist global values outside of loops." There's no agreement about this patch. I personally find the PRE machinery of the current GVN hard enough to reason about that I'm not sure I'll try to land this again, instead of working on the rewrite). llvm-svn: 284796	2016-10-21 01:37:02 +00:00
Keno Fischer	b04df8eaa2	Fix cross-endianness RuntimeDyld relocation for ARM rL284780 fixed the PREL31 relocation and added a test for it. Being the first such test for ARM relocations, it exposed incorrect endianness assumptions (causing buildbot failures on big-endian hosts). Fix that by using the same helpers used for the x86 case. llvm-svn: 284789	2016-10-20 22:15:56 +00:00
Li Huang	fcfe8cd3ae	[SCEV] Add a threshold to restrict number of mul operands to be inlined into SCEV This is to avoid inlining too many multiplication operands into a SCEV, which could take exponential time in the worst case. Reviewers: Sanjoy Das, Mehdi Amini, Michael Zolotukhin Differential Revision: https://reviews.llvm.org/D25794 llvm-svn: 284784	2016-10-20 21:38:39 +00:00
Keno Fischer	c32ffe3916	Fix PREL31 relocation on ARM Summary: This is a 31bits relative relocation instead of a 32bits absolute relocation. Reviewers: t.p.northover, peter.smith, rengolin Subscribers: aemerson, llvm-commits, samparker Differential Revision: https://reviews.llvm.org/D25069 llvm-svn: 284780	2016-10-20 21:15:29 +00:00
Michael Kuperstein	b2443ed62b	[X86] Enable interleaved memory access by default This lets the loop vectorizer generate interleaved memory accesses on x86. Differential Revision: https://reviews.llvm.org/D25350 llvm-svn: 284779	2016-10-20 21:04:31 +00:00
Daniel Berlin	cd2deacac6	[MSSA] Avoid unnecessary use walks when calling getClobberingMemoryAccess Summary: This allows us to mark when uses have been optimized. This lets us avoid rewalking (IE when people call getClobberingAccess on everything), and also enables us to later relax the requirement of use optimization during updates with less cost. Reviewers: george.burgess.iv Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D25172 llvm-svn: 284771	2016-10-20 20:13:45 +00:00
Kevin Enderby	c8bb42283e	Another additional error check for invalid Mach-O files for the load commands that use the MachO::twolevel_hints_command type which includes only the LC_TWOLEVEL_HINTS load command. This is not used in llvm libObject code or in llvm tool code. But does appear in one of the binary test files. While this load command is obsolete it is easier to add code for it in libObject than edit or change the binary test case. llvm-svn: 284769	2016-10-20 20:10:30 +00:00
Zachary Turner	6e2c89b54a	Fix case of file include path. llvm-svn: 284763	2016-10-20 18:34:37 +00:00
Zachary Turner	4d49eb9fa0	[CodeView] Refactor serialization to use StreamInterface. This was all using ArrayRef<>s before which presents a problem when you want to serialize to or deserialize from an actual PDB stream. An ArrayRef<> is really just a special case of what can be handled with StreamInterface though (e.g. by using a ByteStream), so changing this to use StreamInterface allows us to plug in a PDB stream and get all the record serialization and deserialization for free on a MappedBlockStream. Subsequent patches will try to remove TypeTableBuilder and TypeRecordBuilder in favor of class that operate on Streams as well, which should allow us to completely merge the reading and writing codepaths for both types and symbols. Differential Revision: https://reviews.llvm.org/D25831 llvm-svn: 284762	2016-10-20 18:31:19 +00:00
Konstantin Zhuravlyov	521e5ef4ce	[AMDGPU] Make note record name a static const member of target streamer Differential Revision: https://reviews.llvm.org/D25746 llvm-svn: 284760	2016-10-20 18:22:36 +00:00
Konstantin Zhuravlyov	08326b6256	[AMDGPU] Emit constant address space data in .rodata section and use relocations instead of fixups (amdhsa only) Differential Revision: https://reviews.llvm.org/D25693 llvm-svn: 284759	2016-10-20 18:12:38 +00:00
Dehao Chen	f03f51555a	Using branch probability to guide critical edge splitting. Summary: The original heuristic to break critical edge during machine sink is relatively conservertive: when there is only one instruction sinkable to the critical edge, it is likely that the machine sink pass will not break the critical edge. This leads to many speculative instructions executed at runtime. However, with profile info, we could model the splitting benefits: if the critical edge has 50% taken rate, it would always be beneficial to split the critical edge to avoid the speculated runtime instructions. This patch uses profile to guide critical edge splitting in machine sink pass. The performance impact on speccpu2006 on Intel sandybridge machines: spec/2006/fp/C++/444.namd 25.3 +0.26% spec/2006/fp/C++/447.dealII 45.96 -0.10% spec/2006/fp/C++/450.soplex 41.97 +1.49% spec/2006/fp/C++/453.povray 36.83 -0.96% spec/2006/fp/C/433.milc 23.81 +0.32% spec/2006/fp/C/470.lbm 41.17 +0.34% spec/2006/fp/C/482.sphinx3 48.13 +0.69% spec/2006/int/C++/471.omnetpp 22.45 +3.25% spec/2006/int/C++/473.astar 21.35 -2.06% spec/2006/int/C++/483.xalancbmk 36.02 -2.39% spec/2006/int/C/400.perlbench 33.7 -0.17% spec/2006/int/C/401.bzip2 22.9 +0.52% spec/2006/int/C/403.gcc 32.42 -0.54% spec/2006/int/C/429.mcf 39.59 +0.19% spec/2006/int/C/445.gobmk 26.98 -0.00% spec/2006/int/C/456.hmmer 24.52 -0.18% spec/2006/int/C/458.sjeng 28.26 +0.02% spec/2006/int/C/462.libquantum 55.44 +3.74% spec/2006/int/C/464.h264ref 46.67 -0.39% geometric mean +0.20% Manually checked 473 and 471 to verify the diff is in the noise range. Reviewers: rengolin, davidxl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D24818 llvm-svn: 284757	2016-10-20 18:06:52 +00:00
Simon Pilgrim	365be4f95c	[CostModel][X86] Fixed AVX1/AVX512 sdiv/udiv uniformconst costs for 256/512 bit integer vectors We weren't checking for uniform const costs before the general cost, resulting in very high estimates. llvm-svn: 284755	2016-10-20 18:00:35 +00:00
Pirama Arumuga Nainar	05b0f93ad3	Fix _EXTEND_VECTOR_INREG legalization Summary: While promoting _EXTEND_VECTOR_INREG nodes whose inputs are already promoted, perform the appropriate sign extension for the promoted node before doing the *_EXTEND_VECTOR_INREG operation. If not, the undefined high-order bits of the promoted operand may (a) be garbage inc ase of zext) or (b) contribute the wrong sign-bit (in case of sext) Updated the promote-vec3.ll test after this change. The diff shows explicit zeroing in case of zext and intermediate sign extension in case of sext. Reviewers: RKSimon Subscribers: llvm-commits, srhines Differential Revision: https://reviews.llvm.org/D25790 llvm-svn: 284752	2016-10-20 17:56:36 +00:00
Simon Pilgrim	1388c0acc1	[CostModel][X86] Added tests for sdiv/udiv costs for uniform const and uniform const power-of-2 Shows poor costings in AVX1/AVX512BW for certain vector types llvm-svn: 284748	2016-10-20 17:16:38 +00:00
Sanjay Patel	0051efcf97	[Target] remove TargetRecip class; 2nd try This is a retry of r284495 which was reverted at r284513 due to use-after-scope bugs caused by faulty usage of StringRef. This version also renames a pair of functions: getRecipEstimateDivEnabled() getRecipEstimateSqrtEnabled() as suggested by Eric Christopher. original commit msg: [Target] remove TargetRecip class; move reciprocal estimate isel functionality to TargetLowering This is a follow-up to https://reviews.llvm.org/D24816 - where we changed reciprocal estimates to be function attributes rather than TargetOptions. This patch is intended to be a structural, but not functional change. By moving all of the TargetRecip functionality into TargetLowering, we can remove all of the reciprocal estimate state, shield the callers from the string format implementation, and simplify/localize the logic needed for a target to enable this. If a function has a "reciprocal-estimates" attribute, those settings may override the target's default reciprocal preferences for whatever operation and data type we're trying to optimize. If there's no attribute string or specific setting for the op/type pair, just use the target default settings. As noted earlier, a better solution would be to move the reciprocal estimate settings to IR instructions and SDNodes rather than function attributes, but that's a multi-step job that requires infrastructure improvements. I intend to work on that, but it's not clear how long it will take to get all the pieces in place. Differential Revision: https://reviews.llvm.org/D25440 llvm-svn: 284746	2016-10-20 16:55:45 +00:00
Benjamin Kramer	d65ec2ceb2	Put the move ctor for PassManager back for now, it breaks some builds. For some reason using the default move ctor creates undefined references to it. llvm-svn: 284745	2016-10-20 16:50:07 +00:00
Simon Pilgrim	025e26dd32	[CostModel][X86] Fixed AVX1/AVX512 sdiv/udiv general costs for 256/512 bit integer vectors We weren't accounting for legal types on every subtarget, meaning that many of the costs were using defaults. We still don't correctly cost (or test) the 512-bit sdiv/udiv by uniform const cases, nor the power-of-2 cases. llvm-svn: 284744	2016-10-20 16:39:11 +00:00
Valery Pykhtin	e55fd41f73	[AMDGPU] add fcopysign(f64, f32) pattern Differential revision: https://reviews.llvm.org/D25827 llvm-svn: 284743	2016-10-20 16:17:54 +00:00
Benjamin Kramer	b667ef79ea	[Support] Put back the MSVC hack for AlignedCharArray. MSVC 2015 still errors when passing an alignas'd object by value as an argument. llvm-svn: 284738	2016-10-20 15:53:04 +00:00
Benjamin Kramer	abe323ef8e	[Support] Remove llvm::alignOf now that all uses are gone. Also clean up the legacy hacks for AlignedCharArray. I'm keeping LLVM_ALIGNAS alive for a bit longer because GCC 4.8.0 (which we still support apparently) shipped a buggy alignas(). All other supported compilers have a working alignas. llvm-svn: 284736	2016-10-20 15:36:38 +00:00
Benjamin Kramer	b2505005c7	Retire llvm::alignOf in favor of C++11 alignof. No functionality change intended. llvm-svn: 284733	2016-10-20 15:02:18 +00:00
Benjamin Kramer	26b2593b24	[GVN] Use defaulted members. No functional change. llvm-svn: 284726	2016-10-20 13:09:12 +00:00
Simon Dardis	226752c15d	[mips][mcjit] Add the majority of N32 support. The missing piece is relocation composition for %hi(%neg(%gp_rel(x))) and similar. Patch by: Daniel Sanders llvm-svn: 284724	2016-10-20 13:02:23 +00:00
Simon Pilgrim	16cc616ebc	[CostModel][X86] Added tests for sdiv/udiv costs for scalar and 128/256/512 bit integer vectors Shows current bug in AVX1/AVX512BW costs for 256 bit vector types llvm-svn: 284723	2016-10-20 12:34:00 +00:00
Benjamin Kramer	2a8bef8769	Do a sweep over move ctors and remove those that are identical to the default. All of these existed because MSVC 2013 was unable to synthesize default move ctors. We recently dropped support for it so all that error-prone boilerplate can go. No functionality change intended. llvm-svn: 284721	2016-10-20 12:20:28 +00:00
Pavel Labath	59838f7ea6	Reapply "Add Chrono.h - std::chrono support header" This is a resubmission of r284590. The mingw build should be fixed now. The problem was we were matching time_t with _localtime_64s, which was incorrect on _USE_32BIT_TIME_T systems. Instead I use localtime_s, which should always evaluate to the correct function. llvm-svn: 284720	2016-10-20 12:05:50 +00:00
Simon Pilgrim	618d3aedaf	[DAGCombiner] Add general constant vector support to (srl (shl x, c), c) -> (and x, cst2) We already supported scalar constant / splatted constant vector - now accepts any (non opaque) constant scalar / vector llvm-svn: 284717	2016-10-20 11:10:21 +00:00
Simon Pilgrim	46aa89399d	Wdocumentation fix llvm-svn: 284715	2016-10-20 10:53:38 +00:00
Simon Pilgrim	25059360d5	Fix spelling mistake in comment. llvm-svn: 284714	2016-10-20 10:42:14 +00:00
Simon Pilgrim	071da46a35	Fix MSVC bool -> uint64_t promotion warning llvm-svn: 284713	2016-10-20 10:37:58 +00:00
Benjamin Kramer	656d821aa8	[Go bindings] Update for r284678 API changes. Alignment moved from createBasicType to createAutoVariable. llvm-svn: 284707	2016-10-20 09:14:39 +00:00
Jonas Paulsson	8010b631d5	[SystemZ] Post-RA scheduler implementation Post-RA sched strategy and scheduling instruction annotations for z196, zEC12 and z13. This scheduler optimizes decoder grouping and balances processor resources (including side steering the FPd unit instructions). The SystemZHazardRecognizer keeps track of the scheduling state, which can be dumped with -debug-only=misched. Reviers: Ulrich Weigand, Andrew Trick. https://reviews.llvm.org/D17260 llvm-svn: 284704	2016-10-20 08:27:16 +00:00
George Rimar	de69e9f5db	[Object/ELF] - Check index argument in getSymbol(). Without this check LLD crashes when SHT_GROUP section has invalid symbol index because of next code: template <class ELFT> StringRef elf::ObjectFile<ELFT>::getShtGroupSignature(const Elf_Shdr &Sec) { .. const Elf_Sym *Sym = Obj.getSymbol(Symtab, Sec.sh_info); .. } If sh_info is too large, &Symbols[Index] just asserts. No testcases provided because llvm-objdump/llvm-readelf does not use getSymbol() function. I`ll commit testcase for LLD separatelly. Differential revision: https://reviews.llvm.org/D25516 llvm-svn: 284702	2016-10-20 08:03:10 +00:00
Peter Collingbourne	c7766778a0	X86: Allow expressions to appear as u8imm operands. llvm-svn: 284688	2016-10-20 01:58:34 +00:00
Peter Collingbourne	de1f039360	X86: Deduplicate some lowering code. NFCI. llvm-svn: 284686	2016-10-20 01:21:26 +00:00
Victor Leschuk	2461a2e2c5	DebugInfo: remove broken bitcode upgrade test llvm-svn: 284682	2016-10-20 00:26:36 +00:00
Reid Kleckner	40d7230f2f	Use __func__ directly now that all supported compilers support it Remove the portability macro now that it is unused. llvm-svn: 284681	2016-10-20 00:22:23 +00:00
Victor Leschuk	2ede126b1b	DebugInfo: preparation to implement DW_AT_alignment - Add alignment attribute to DIVariable family - Modify bitcode format to match new DIVariable representation - Update tests to match these changes (also add bitcode upgrade test) - Expect that frontend passes non-zero align value only when it is not default (was forcibly aligned by alignas()/_Alignas()/__atribute__(aligned()) Differential Revision: https://reviews.llvm.org/D25073 llvm-svn: 284678	2016-10-20 00:13:12 +00:00
Derek Schuff	7edf93bc5b	[WebAssembly] Update extending load test for new i1 behavior r284611 changed the behavior of the DAG legalizer for sign-extending i1 values. Update the wasm extending load test to match. llvm-svn: 284677	2016-10-20 00:10:34 +00:00
Reid Kleckner	990504e625	Remove LLVM_NOEXCEPT and replace it with noexcept Now that we have dropped MSVC 2013, all supported compilers support noexcept and we can drop this portability macro. llvm-svn: 284672	2016-10-19 23:52:38 +00:00
Kevin Enderby	210030ba95	Next set of additional error checks for invalid Mach-O files for the load commands that use the MachO::thread_command type but are not used in llvm libObject code but used in llvm tool code. This includes the LC_UNIXTHREAD and LC_THREAD load commands. A quick note about the philosophy of the error checking in libObject for Mach-O files, the idea behind the checking is that we never will return a Mach-O file out of libObject that contains unknown things in the load commands. To do this the 32-bit ARM and PPC general tread states needed to be defined as two test case binaries contained them. If other thread states for other CPUs need to be added we will do that as needed. Going forward the LC_MAIN load command is used to set the entry point in Mach-O executables these days instead of an LC_UNIXTHREAD as was done in the past. So today only in core files are LC_THREAD load commands and thread states usually found. Other thread states have not yet been defined in include/Support/MachO.h at this time. But that can be added as needed with their corresponding checking also added. llvm-svn: 284668	2016-10-19 23:44:34 +00:00
Reid Kleckner	e1d6998f5d	Update Compiler.h to fail fast when building with MSVC 2013 llvm-svn: 284665	2016-10-19 23:34:58 +00:00
Rong Xu	fe90d86c8e	[PGO] Fix a use-after-move. NFC. llvm-svn: 284664	2016-10-19 23:31:59 +00:00
Reid Kleckner	98508a8344	Revert "DenseSet: Appease msc18 to define derived constructors explicitly." This reverts commit r284570. MSVC 18 / 2013 is not supported anymore. llvm-svn: 284661	2016-10-19 23:04:57 +00:00
Reid Kleckner	6130a2df5c	Update docs to reflect new minimum MSVC version requirement Mailing list discussion about this: http://lists.llvm.org/pipermail/llvm-dev/2016-September/104631.html Code changes to simplify the ifdefs will come next, and can be reverted without affecting the policy if someone needs it. llvm-svn: 284660	2016-10-19 23:04:41 +00:00
Rong Xu	2c684cfd94	[PGO] Fix bogus warning for merging empty llvm profile file Profile runtime can generate an empty raw profile (when there is no function in the shared library). This empty profile is treated as a text format profile. A test format profile without the flag of "#IR" is thought to be a clang generated profile. So in llvm profile merging, we will get a bogus warning of "Merge IR generated profile with Clang generated profile." The fix here is to skip the empty profile (when the buffer size is 0) for profile merge. Reviewers: vsk, davidxl Differential Revision: http://reviews.llvm.org/D25687 llvm-svn: 284659	2016-10-19 22:51:17 +00:00
Lang Hames	1009b0b30f	[BuildingAJIT] Use the remote target triple to construct the TargetMachine in Chapter 5. Chapter 5 demonstrates remote JITing: code is executed on the remote, not the machine running the REPL, so it's the remote's triple (and TargetMachine) that we need. llvm-svn: 284657	2016-10-19 22:41:03 +00:00
Mehdi Amini	db46b7d217	Add computeHostNumPhysicalCores() implementation for Darwin Differential Revision: https://reviews.llvm.org/D25800 llvm-svn: 284656	2016-10-19 22:36:07 +00:00
Wei Ding	3cb2a1e8d1	AMDGPU : Add a function to enable and disable IEEEBit for SC and shader respectively. Differential Revision: http://reviews.llvm.org/D25789 llvm-svn: 284655	2016-10-19 22:34:49 +00:00
Lang Hames	412d045e81	Remove the JIT EH/small code model tests for now. These tests rely on two sections being allocated with a limited displacement from one to the other to work. We've never guaranteed this, and consequently these tests usually fail. That led to them being XFAILed, but now they XPASS whenever the sections do happen to be allocated nearby in memory. So I'm removing these for now to get rid of the noise. We can re-instate them if/when we take the time to implement a displacement-respecting allocator. llvm-svn: 284654	2016-10-19 22:19:38 +00:00
Chris Bieneman	4646fe0ac9	[CMake] Make the runtimes directory work with bootstrap builds This patch builds on clang r284648, and allows the runtime directory to make the bootstrap builds depend on the builtin libraries. This patch also make the bootstrap build depend on configuring the other runtimes because the libcxx headers are copied during configuration. I have left a TODO in the code to remove that once I come up with a better solution. llvm-svn: 284650	2016-10-19 21:50:25 +00:00
Sanjay Patel	efd8885772	[InstSimplify] fold negation of sign-bit 0 - X --> X, if X is 0 or the minimum signed value 0 - X --> 0, if X is 0 or the minimum signed value and the sub is NSW I noticed this pattern might be created in the backend after the change from D25485, so we'll want to add a similar fold for the DAG. The use of computeKnownBits in InstSimplify may be something to investigate if the compile time of InstSimplify is noticeable. We could replace computeKnownBits with specific pattern matchers or limit the recursion. Differential Revision: https://reviews.llvm.org/D25785 llvm-svn: 284649	2016-10-19 21:23:45 +00:00
Lang Hames	9ca9a31948	[BuildingAJIT] Make the chapter 5 server export symbols. This will allow chapter 5 to work on Linux. llvm-svn: 284637	2016-10-19 20:22:12 +00:00
Hans Wennborg	2d55d67c62	Typo: nomed struct -> named struct llvm-svn: 284635	2016-10-19 20:10:03 +00:00
Reid Kleckner	f8d1d12fef	[GlobalMerge] Handle non-landingpad EH pads This code crashed on funclet-style EH instructions such as catchpad, catchswitch, and cleanuppad. Just treat all EH pad instructions equivalently and avoid merging the globals they reference through any use. llvm-svn: 284633	2016-10-19 19:56:22 +00:00
Artur Pilipenko	5c6ef75485	[IndVarSimplify] Teach calculatePostIncRange to take guards into account Reviewed By: sanjoy Differential Revision: https://reviews.llvm.org/D25739 llvm-svn: 284632	2016-10-19 19:43:54 +00:00
Matthew Simpson	41fa838f07	[LV] Avoid emitting trivially dead instructions Some instructions from the original loop, when vectorized, can become trivially dead. This happens because of the way we structure the new loop. For example, we create new induction variables and induction variable "steps" in the new loop. Thus, when we go to vectorize the original induction variable update, it may no longer be needed due to the instructions we've already created. This patch prevents us from creating these redundant instructions. This reduces code size before simplification and allows greater flexibility in code generation since we have fewer unnecessary instruction uses. Differential Revision: https://reviews.llvm.org/D25631 llvm-svn: 284631	2016-10-19 19:22:02 +00:00
Chad Rosier	6e3a92ec88	[AliasSetTracker] Add support for memcpy and memmove. Differential Revision: https://reviews.llvm.org/D25776 llvm-svn: 284630	2016-10-19 19:09:03 +00:00
Artur Pilipenko	f2d5dc5dc6	[IndVarSimplify] Use control-dependent range information to prove non-negativity This change is motivated by the case when IndVarSimplify doesn't widen a comparison of IV increment because it can't prove IV increment being non-negative. We end up with a redundant trunc of the widened increment on this example. for.body: %i = phi i32 [ %start, %for.body.lr.ph ], [ %i.inc, %for.inc ] %within_limits = icmp ult i32 %i, 64 br i1 %within_limits, label %continue, label %for.end continue: %i.i64 = zext i32 %i to i64 %arrayidx = getelementptr inbounds i32, i32* %base, i64 %i.i64 %val = load i32, i32* %arrayidx, align 4 br label %for.inc for.inc: %i.inc = add nsw nuw i32 %i, 1 %cmp = icmp slt i32 %i.inc, %limit br i1 %cmp, label %for.body, label %for.end There is a range check inside of the loop which guarantees the IV to be non-negative. NSW on the increment guarantees that the increment is also non-negative. Teach IndVarSimplify to use the range check to prove non-negativity of loop increments. Reviewed By: sanjoy Differential Revision: https://reviews.llvm.org/D25738 llvm-svn: 284629	2016-10-19 18:59:03 +00:00
Chad Rosier	16970a847c	[AliasSetTracker] Return void for add() functions. NFC. Differential Revision: https://reviews.llvm.org/D25748 llvm-svn: 284628	2016-10-19 18:50:32 +00:00
Sanjay Patel	cf26c27478	[InstSimplify] move one and add more tests for potential negation folds llvm-svn: 284627	2016-10-19 18:42:12 +00:00
Rafael Espindola	5d8ad4f757	Add a macro for prefetching data. It will be used in lld. llvm-svn: 284626	2016-10-19 18:34:32 +00:00
Mehdi Amini	a85154518f	[ADT] Zip range adapter This augments the STLExtras toolset with a zip iterator and range adapter. Zip comes in two varieties: `zip`, which will zip to the shortest of the input ranges, and `zip_first`, which limits its `begin() == end()` checks to just the first range. Recommit r284035 after MSVC2013 support has been dropped. Patch by: Bryant Wong <github.com/bryant> Differential Revision: https://reviews.llvm.org/D23252 llvm-svn: 284623	2016-10-19 18:02:21 +00:00
Vedant Kumar	6fd94bf47c	[llvm-cov] Don't spawn a thread unless ThreadCount > 1 Initializing a ThreadPool with ThreadCount = 1 spawns a thread even though we don't need to. This is at least slower than it needs to be, and at worst may somehow be exacerbating PR30735 (llvm-cov times out on ARM bots). As a follow-up, I'll try to add logic to llvm::ThreadPool to avoid spawning a thread when ThreadCount = 1. llvm-svn: 284621	2016-10-19 17:55:44 +00:00
Krzysztof Parzyszek	c87155037b	[AMDGPU] Stop using MCRegisterClass::getSize() Differential Review: https://reviews.llvm.org/D24675 llvm-svn: 284619	2016-10-19 17:40:36 +00:00
Teresa Johnson	ec544c552e	[ThinLTO] Default backend threads to heavyweight_hardware_concurrency Summary: Changes default backend parallelism from thread::hardware_concurrency to the new llvm::heavyweight_hardware_concurrency, which for X86 Linux defaults to the number of physical cores (and will fall back to thread::hardware_concurrency otherwise). This avoid oversubscribing the physical cores using hyperthreading. Reviewers: mehdi_amini, pcc Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D25775 llvm-svn: 284618	2016-10-19 17:35:01 +00:00
Simon Pilgrim	e32d0f8413	Merged nested ifs. NFCI. llvm-svn: 284616	2016-10-19 17:30:24 +00:00
Pavel Labath	504f3844ae	Revert "Add Chrono.h - std::chrono support header" This reverts commit r284590 as it fails on the mingw buildbot. I think I know the fix, but I cannot test it right now. Will reapply when I verify it works ok. This reverts r284590. llvm-svn: 284615	2016-10-19 17:17:53 +00:00
Simon Pilgrim	a20aeea998	[DAGCombiner] Add general constant vector support to (shl (add x, c1), c2) -> (add (shl x, c2), c1 << c2) We already supported scalar constant / splatted constant vector - now accepts any (non opaque) constant scalar / vector llvm-svn: 284613	2016-10-19 17:12:22 +00:00
Reid Kleckner	f7ad5341d0	[WinEH] Allow catchpads to reuse the same catch object This code used a regular when it should have used a multimap. llvm-svn: 284612	2016-10-19 17:08:23 +00:00
Sanjay Patel	3a3aaf67e0	[DAG] optimize negation of bool Use mask and negate for legalization of i1 source type with SIGN_EXTEND_INREG. With the mask, this should be no worse than 2 shifts. The mask can be eliminated in some cases, so that should be better than 2 shifts. This change exposed some missing folds related to negation: https://reviews.llvm.org/rL284239 https://reviews.llvm.org/rL284395 There may be others, so please let me know if you see any regressions. Differential Revision: https://reviews.llvm.org/D25485 llvm-svn: 284611	2016-10-19 16:58:59 +00:00
Zachary Turner	383803230b	[pdb] Improve error messages when DIA is not found. llvm-svn: 284610	2016-10-19 16:42:20 +00:00
Krzysztof Parzyszek	7bb63ac029	[RDF] Switch RefMap in liveness calculation to use lane masks This required reengineering of some of the part of liveness calculation, including fixing some issues caused by the limitations of the previous approach. The current code is not necessarily the fastest, but it should be functionally correct (at least more so than before). The compile-time performance will be addressed in the future. llvm-svn: 284609	2016-10-19 16:30:56 +00:00
Simon Pilgrim	4554e161be	[DAGCombiner] Add general constant vector support to (shl (sra x, c1), c1) -> (and x, (shl -1, c1)) We already supported scalar constant / splatted constant vector - now accepts any (non opaque) constant scalar / vector llvm-svn: 284608	2016-10-19 16:15:30 +00:00
Simon Pilgrim	c2e9724909	[DAGCombiner] Add general constant vector support to (shl (mul x, c1), c2) -> (mul x, c1 << c2) We already supported scalar constant / splatted constant vector - now accepts any (non opaque) constant scalar / vector llvm-svn: 284607	2016-10-19 15:59:28 +00:00
Tim Northover	e699929507	Revert r284604. A.K.A. "TMP" Committed by mistake. llvm-svn: 284606	2016-10-19 15:56:12 +00:00
Tim Northover	34c83da4c3	Sparc: fix test. llvm-svn: 284605	2016-10-19 15:55:11 +00:00
Tim Northover	3d4cd3d930	TMP llvm-svn: 284604	2016-10-19 15:55:09 +00:00
Tim Northover	7152dcaf77	GlobalISel: support translating volatile loads and stores. llvm-svn: 284603	2016-10-19 15:55:06 +00:00
Artur Pilipenko	ed84103a1a	Introduce ConstantRange.addWithNoSignedWrap To be used by upcoming change to IndVarSimplify Reviewed By: sanjoy Differential Revision: https://reviews.llvm.org/D25732 llvm-svn: 284597	2016-10-19 14:44:23 +00:00
Chris Dewhurst	2c3cdd66d2	[Sparc][LEON] Detects an erratum on UT699 LEON 3 processors involving rounding mode changes and issues an appropriate user error message. Differential Revision: https://reviews.llvm.org/D24665 llvm-svn: 284591	2016-10-19 14:01:06 +00:00
Pavel Labath	13b6a10e7b	Add Chrono.h - std::chrono support header Summary: std::chrono mostly covers the functionality of llvm::sys::TimeValue and lldb_private::TimeValue. This header adds a bit of utility functions and typedefs, which make the usage of the library and porting code from TimeValues easier. Rationale: - TimePoint typedef - precision of system_clock is implementation defined - using a well-defined precision helps maintain consistency between platforms, makes it interact better with existing TimeValue classes, and avoids cases there a time point is implicitly convertible to a specific precision on some platforms but not on others. - system_clock::to_time_t only accepts time_points with the default system precision (even though time_t has only second precision on all platforms we support). To avoid the need for explicit casts, I have added a toTimeT() wrapper function. toTimePoint(time_t) was not strictly necessary, but I have added it for symmetry. Reviewers: zturner, mehdi_amini Subscribers: beanz, mgorny, llvm-commits, modocache Differential Revision: https://reviews.llvm.org/D25416 llvm-svn: 284590	2016-10-19 13:58:55 +00:00
Sjoerd Meijer	2fc4cb6f72	Reapply r284571 (with the new tests fixed). llvm-svn: 284588	2016-10-19 13:43:02 +00:00
NAKAMURA Takumi	74ed5a8c15	llvm/test/MC/Mips/macro-ld-sd.s: Sweep a spurious character \xA0 in the test file. It might crash FileCheck. llvm-svn: 284587	2016-10-19 13:41:12 +00:00
Ulrich Weigand	6e31ab388a	[SystemZ] Add missing vector instructions for the assembler Most z13 vector instructions have a base form where the data type of the operation (whether to consider the vector to be 16 bytes, 8 halfwords, 4 words, or 2 doublewords) is encoded into a mask field, and then a set of extended mnemonics where the mask field is not present but the data type is encoded into the mnemonic name. Currently, LLVM only supports the type-specific forms (since those are really the ones needed for code generation), but not the base type-generic forms. To complete the assembler support and make it fully compatible with the GNU assembler, this commit adds assembler aliases for all the base forms of the various vector instructions. It also adds two more alias forms that are documented in the PoP: VFPSO/VFPSODB/WFPSODB -- generic form of VFLCDB etc. VNOT -- special variant of VNO llvm-svn: 284586	2016-10-19 13:03:18 +00:00
Ulrich Weigand	556a90c00c	[SystemZ] Add optional argument to some vector string instructions The vfee[bhf], vfene[bhf], and vistr[bhf] assembler mnemonics are documented in the Principles of Operation to have an optional last operand to encode arbitrary values in a mask field. This commit adds support for those optional operands, and cleans up the patterns to generate vector string instruction as bit. No change to code generation intended. llvm-svn: 284585	2016-10-19 12:57:46 +00:00
Michal Gorny	e22b7f5aea	[cmake] Declare LLVM_CMAKE_PATH for use in subprojects Declare the LLVM_CMAKE_PATH to the source directory location of CMake files, in order to make it possible to easily use them in subprojects. Such a variable is already declared in most of LLVM projects (and inconsistently mixed with direct source tree references), including Clang, LLDB, compiler-rt, libcxx... Declaring it inside main LLVM tree makes it possible to avoid having to declare fallback values or use conditionals in those projects. It should be noted that in some of the subprojects LLVM_CMAKE_PATH is used to reference generated LLVMConfig.cmake file. However, these references are conditional to stand-alone builds and explicitly including this file is unnecessary in combined builds. Differential Revision: https://reviews.llvm.org/D25724 llvm-svn: 284581	2016-10-19 12:18:34 +00:00
James Molloy	fbfd173447	[Thumb-1] Synthesize TBB/TBH instructions to make use of compressed jump tables The TBB and TBH instructions in Thumb-2 allow jump tables to be compressed into sequences of bytes or shorts respectively. These instructions do not exist in Thumb-1, however it is possible to synthesize them out of a sequence of other instructions. It turns out this sequence is so short that it's almost never a lose for performance and is ALWAYS a significant win for code size. TBB example: Before: lsls r0, r0, #2 After: add r0, pc adr r1, .LJTI0_0 ldrb r0, [r0, #6] ldr r0, [r0, r1] lsls r0, r0, #1 mov pc, r0 add pc, r0 => No change in prologue code size or dynamic instruction count. Jump table shrunk by a factor of 4. The only case that can increase dynamic instruction count is the TBH case: Before: lsls r0, r4, #2 After: lsls r4, r4, #1 adr r1, .LJTI0_0 add r4, pc ldr r0, [r0, r1] ldrh r4, [r4, #6] mov pc, r0 lsls r4, r4, #1 add pc, r4 => 1 more instruction in prologue. Jump table shrunk by a factor of 2. So there is an argument that this should be disabled when optimizing for performance (and a TBH needs to be generated). I'm not so sure about that in practice, because on small cores with Thumb-1 performance is often tied to code size. But I'm willing to turn it off when optimizing for performance if people want (also note that TBHs are fairly rare in practice!) llvm-svn: 284580	2016-10-19 12:06:49 +00:00
Simon Pilgrim	7dcb6e572e	[DAGCombiner] Just call isConstOrConstSplat directly. NFCI. This will get the same ConstantSDNode scalar or vector splat value as the current separate dyn_cast<ConstantSDNode> / isVector() approach. llvm-svn: 284578	2016-10-19 11:28:15 +00:00
Simon Pilgrim	9122ac994b	Fix line endings llvm-svn: 284576	2016-10-19 11:16:58 +00:00
Simon Pilgrim	b2ca2505cc	[DAGCombine] Generalize distributeTruncateThroughAnd to work with any non-opaque constant or constant vector llvm-svn: 284574	2016-10-19 08:57:37 +00:00
Sjoerd Meijer	3f5111d363	Revert of r284571 because of failing tests. llvm-svn: 284572	2016-10-19 07:45:48 +00:00
Sjoerd Meijer	a318779263	Checking FP function attribute values and adding more build attribute tests. This renames the function for checking FP function attribute values and also adds more build attribute tests (which are in separate files because build attributes are set per file). Differential Revision: https://reviews.llvm.org/D25625 llvm-svn: 284571	2016-10-19 07:25:06 +00:00
NAKAMURA Takumi	da9dc6ad84	DenseSet: Appease msc18 to define derived constructors explicitly. msc18 doesn't recognize "using BaseT::BaseT;" llvm\include\llvm/ADT/DenseSet.h(213) : error C2875: using-declaration causes a multiple declaration of 'BaseT' llvm\include\llvm/ADT/DenseSet.h(214) : see reference to class template instantiation 'llvm::DenseSet<ValueT,ValueInfoT>' being compiled llvm\include\llvm/ADT/DenseSet.h(231) : error C2875: using-declaration causes a multiple declaration of 'BaseT' llvm\include\llvm/ADT/DenseSet.h(232) : see reference to class template instantiation 'llvm::SmallDenseSet<ValueT,InlineBuckets,ValueInfoT>' being compiled llvm-svn: 284570	2016-10-19 05:43:17 +00:00
Craig Topper	a4dc340cf2	[AVX-512] Teach isel lowering that a subvector broadcast being inserted into both halves of a 512-bit vector can be combined into a larger subvector broadcast. Summary: This allows us to create broadcasts of 128-bit vector loads into 512-bit vectors. New patterns added to support 8-bit and 16-bit vector types and v2f64/v2i64->v8f64/v8i64 without DQI instructions. There also fallback patterns when the load can't be folded. These patterns are a little complex as we first need to insert the lower 128-bits into the second 128-bits using a zmm subvector insert instruction. We need to use a zmm insert in case VLX isn't available. Then use another zmm sub vector insert to take those 256-bits and insert them into the upper bits. Since we used a zmm insert to create the 256-bits we also need to do a extract_subreg to get just the lower 256-bits to pass to the second insert. The outer insert for the fallback patterns should have its type correct because eventually we should also supported masked operations here too. So we need a DQI and a NoDQI version of the v16f32/v16i32 patterns. Reviewers: RKSimon, delena, igorb Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D25651 llvm-svn: 284567	2016-10-19 04:44:17 +00:00
Dehao Chen	4b41571d24	Update the section.ll to fix non-x86 failure. llvm-svn: 284566	2016-10-19 03:53:41 +00:00
Dehao Chen	95fc43143d	Revert r284545 again as the regression in ppc still exists. There is bug in MBPI exposed by th patch. Also update the section.ll to fix non-x86 failure. llvm-svn: 284563	2016-10-19 01:18:25 +00:00
Vitaly Buka	490fda3366	[asan] Replace std::to_string with llvm::to_string llvm-svn: 284557	2016-10-19 00:16:56 +00:00
Kostya Serebryany	95b1a434d2	[libFuzzer] extend -print_coverage to also print uncovered lines, functions, and files. Example of output: COVERAGE: COVERED: in DSO2(int) /pathto/DSO2.cpp:6 COVERED: in DSO2(int) /pathto/DSO2.cpp:8 COVERED: in DSO1(int) /pathto/DSO1.cpp:6 COVERED: in DSO1(int) /pathto/DSO1.cpp:8 COVERED: in LLVMFuzzerTestOneInput /pathto/DSOTestMain.cpp:16 COVERED: in LLVMFuzzerTestOneInput /pathto/DSOTestMain.cpp:19 COVERED: in LLVMFuzzerTestOneInput /pathto/DSOTestMain.cpp:25 COVERED: in LLVMFuzzerTestOneInput /pathto/DSOTestMain.cpp:26 MODULE_WITH_COVERAGE: /pathto/libLLVMFuzzer-DSO1.so UNCOVERED_LINE: in DSO1(int) /pathto/DSO1.cpp:9 UNCOVERED_FUNC: in Uncovered1() MODULE_WITH_COVERAGE: /pathto/libLLVMFuzzer-DSO2.so UNCOVERED_LINE: in DSO2(int) /pathto/DSO2.cpp:9 UNCOVERED_FUNC: in Uncovered2() MODULE_WITH_COVERAGE: /pathto/LLVMFuzzer-DSOTest UNCOVERED_LINE: in LLVMFuzzerTestOneInput /pathto/DSOTestMain.cpp:21 UNCOVERED_LINE: in LLVMFuzzerTestOneInput /pathto/DSOTestMain.cpp:27 UNCOVERED_FILE: /pathto/DSOTestExtra.cpp Several things are not perfect here: * we are using objdump+awk instead of sancov because sancov does not support DSOs yet. * this breaks in the presence of ASAN_OPTIONS=strip_path_prefix=... (need to implement another API to get the module name by PC) llvm-svn: 284554	2016-10-19 00:12:03 +00:00
Vitaly Buka	5910a92560	[asan] Simplify calculation of stack frame layout extraction calculation of stack description into separate function. Reviewers: eugenis Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D25754 llvm-svn: 284547	2016-10-18 23:29:52 +00:00
Vitaly Buka	d88e52012b	[asan] Append line number to variable name if line is available and in the same file as the function. PR30498 Reviewers: eugenis Differential Revision: https://reviews.llvm.org/D25715 llvm-svn: 284546	2016-10-18 23:29:41 +00:00
Dehao Chen	f8ac3d26d5	Using branch probability to guide critical edge splitting. Summary: The original heuristic to break critical edge during machine sink is relatively conservertive: when there is only one instruction sinkable to the critical edge, it is likely that the machine sink pass will not break the critical edge. This leads to many speculative instructions executed at runtime. However, with profile info, we could model the splitting benefits: if the critical edge has 50% taken rate, it would always be beneficial to split the critical edge to avoid the speculated runtime instructions. This patch uses profile to guide critical edge splitting in machine sink pass. The performance impact on speccpu2006 on Intel sandybridge machines: spec/2006/fp/C++/444.namd 25.3 +0.26% spec/2006/fp/C++/447.dealII 45.96 -0.10% spec/2006/fp/C++/450.soplex 41.97 +1.49% spec/2006/fp/C++/453.povray 36.83 -0.96% spec/2006/fp/C/433.milc 23.81 +0.32% spec/2006/fp/C/470.lbm 41.17 +0.34% spec/2006/fp/C/482.sphinx3 48.13 +0.69% spec/2006/int/C++/471.omnetpp 22.45 +3.25% spec/2006/int/C++/473.astar 21.35 -2.06% spec/2006/int/C++/483.xalancbmk 36.02 -2.39% spec/2006/int/C/400.perlbench 33.7 -0.17% spec/2006/int/C/401.bzip2 22.9 +0.52% spec/2006/int/C/403.gcc 32.42 -0.54% spec/2006/int/C/429.mcf 39.59 +0.19% spec/2006/int/C/445.gobmk 26.98 -0.00% spec/2006/int/C/456.hmmer 24.52 -0.18% spec/2006/int/C/458.sjeng 28.26 +0.02% spec/2006/int/C/462.libquantum 55.44 +3.74% spec/2006/int/C/464.h264ref 46.67 -0.39% geometric mean +0.20% Manually checked 473 and 471 to verify the diff is in the noise range. Reviewers: rengolin, davidxl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D24818 llvm-svn: 284545	2016-10-18 23:24:02 +00:00
Dehao Chen	62d0e64e9e	revert r284541. llvm-svn: 284544	2016-10-18 23:11:20 +00:00
Rong Xu	1c0e9b97d2	Conditionally eliminate library calls where the result value is not used Summary: This pass shrink-wraps a condition to some library calls where the call result is not used. For example: sqrt(val); is transformed to if (val < 0) sqrt(val); Even if the result of library call is not being used, the compiler cannot safely delete the call because the function can set errno on error conditions. Note in many functions, the error condition solely depends on the incoming parameter. In this optimization, we can generate the condition can lead to the errno to shrink-wrap the call. Since the chances of hitting the error condition is low, the runtime call is effectively eliminated. These partially dead calls are usually results of C++ abstraction penalty exposed by inlining. This optimization hits 108 times in 19 C/C++ programs in SPEC2006. Reviewers: hfinkel, mehdi_amini, davidxl Subscribers: modocache, mgorny, mehdi_amini, xur, llvm-commits, beanz Differential Revision: https://reviews.llvm.org/D24414 llvm-svn: 284542	2016-10-18 21:36:27 +00:00
Dehao Chen	ea62ae9844	Using branch probability to guide critical edge splitting. Summary: The original heuristic to break critical edge during machine sink is relatively conservertive: when there is only one instruction sinkable to the critical edge, it is likely that the machine sink pass will not break the critical edge. This leads to many speculative instructions executed at runtime. However, with profile info, we could model the splitting benefits: if the critical edge has 50% taken rate, it would always be beneficial to split the critical edge to avoid the speculated runtime instructions. This patch uses profile to guide critical edge splitting in machine sink pass. The performance impact on speccpu2006 on Intel sandybridge machines: spec/2006/fp/C++/444.namd 25.3 +0.26% spec/2006/fp/C++/447.dealII 45.96 -0.10% spec/2006/fp/C++/450.soplex 41.97 +1.49% spec/2006/fp/C++/453.povray 36.83 -0.96% spec/2006/fp/C/433.milc 23.81 +0.32% spec/2006/fp/C/470.lbm 41.17 +0.34% spec/2006/fp/C/482.sphinx3 48.13 +0.69% spec/2006/int/C++/471.omnetpp 22.45 +3.25% spec/2006/int/C++/473.astar 21.35 -2.06% spec/2006/int/C++/483.xalancbmk 36.02 -2.39% spec/2006/int/C/400.perlbench 33.7 -0.17% spec/2006/int/C/401.bzip2 22.9 +0.52% spec/2006/int/C/403.gcc 32.42 -0.54% spec/2006/int/C/429.mcf 39.59 +0.19% spec/2006/int/C/445.gobmk 26.98 -0.00% spec/2006/int/C/456.hmmer 24.52 -0.18% spec/2006/int/C/458.sjeng 28.26 +0.02% spec/2006/int/C/462.libquantum 55.44 +3.74% spec/2006/int/C/464.h264ref 46.67 -0.39% geometric mean +0.20% Manually checked 473 and 471 to verify the diff is in the noise range. Reviewers: rengolin, davidxl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D24818 llvm-svn: 284541	2016-10-18 21:36:11 +00:00
David Blaikie	69494a9805	dwarfdump: add space missing from the type unit header description llvm-svn: 284540	2016-10-18 21:18:43 +00:00
David Blaikie	e4c3915a5a	dwarfdump: Include the name in the unit description, even in non-summarized mode (accidentally removed this from my previous change when I was rejecting some clang-format formatting... ) llvm-svn: 284539	2016-10-18 21:16:45 +00:00
Dehao Chen	83033e0b9a	Add target for test to fix regression introduced by r284533. llvm-svn: 284538	2016-10-18 21:13:31 +00:00
David Blaikie	50cc27ecb9	dwarfdump: -summarize-types: print a short summary (unqualified type name, hash, length) of type units rather than dumping contents This is just a quick utility handy for getting rough summaries of types in a given object or dwo file. I've been using it to investigate the amount of type info redundancy across a project build, for example. llvm-svn: 284537	2016-10-18 21:09:48 +00:00
Eli Friedman	c0a717ba5b	Improve ARM lowering for "icmp <2 x i64> eq". The custom lowering is pretty straightforward: basically, just AND together the two halves of a <4 x i32> compare. Differential Revision: https://reviews.llvm.org/D25713 llvm-svn: 284536	2016-10-18 21:03:40 +00:00
Davide Italiano	36efa68463	[GVN] Consistently use division instead of shift. NFCI. This is in line with other places of GVN (e.g. load coercion logic). llvm-svn: 284535	2016-10-18 21:02:27 +00:00
Davide Italiano	64cd985e44	[GVN] Remove dead code. NFC. llvm-svn: 284534	2016-10-18 21:00:26 +00:00
Dehao Chen	302b69c940	Use profile info to set function section prefix to group hot/cold functions. Summary: The original implementation is in r261607, which was reverted in r269726 to accomendate the ProfileSummaryInfo analysis pass. The new implementation: 1. add a new metadata for function section prefix 2. query against ProfileSummaryInfo in CGP to set the correct section prefix for each function 3. output the section prefix set by CGP Reviewers: davidxl, eraman Subscribers: vsk, llvm-commits Differential Revision: https://reviews.llvm.org/D24989 llvm-svn: 284533	2016-10-18 20:42:47 +00:00
Evandro Menezes	4dd6c68d67	[AArch64] Fix test triplet llvm-svn: 284532	2016-10-18 20:41:30 +00:00
Evandro Menezes	ce8d60156c	[AArch64] Avoid materializing 0.0 when generating FP SELECT Transform `a == 0.0 ? 0.0 : x` to `a == 0.0 ? a : x` and `a != 0.0 ? x : 0.0` to `a != 0.0 ? x : a` to avoid materializing 0.0 for FCSEL, since it does not have to be materialized beforehand for FCMP, as it has a form that has 0.0 as an implicit operand. Differential Revision: https://reviews.llvm.org/D24808 llvm-svn: 284531	2016-10-18 20:37:35 +00:00
Kevin Enderby	89baf99c92	One more additional error check for invalid Mach-O files for a load command that use the MachO:: linkedit_data_command type but is not used in llvm libObject code but used in llvm tool code. This is for the LC_CODE_SIGNATURE load command. llvm-svn: 284529	2016-10-18 20:24:12 +00:00
Tim Northover	6e9043009e	GlobalISel: translate the @llvm.objectsize intrinsic. llvm-svn: 284527	2016-10-18 20:03:51 +00:00
Tim Northover	55782222c0	GlobalISel: select small binary operations on AArch64. AArch64 actually supports many 8-bit operations under the definition used by GlobalISel: the designated information-carrying bits of a GPR32 get the right value if you just use the normal 32-bit instruction. llvm-svn: 284526	2016-10-18 20:03:48 +00:00
Tim Northover	3f18603c52	GlobalISel: translate memcpy intrinsics. llvm-svn: 284525	2016-10-18 20:03:45 +00:00
Mandeep Singh Grang	0f7f030c44	Remove unused typedef Summary: Unused: typedef SmallSetVector<RegionT *, 4> RegionSet Reviewers: MatzeB, grosser Subscribers: zinob Differential Revision: https://reviews.llvm.org/D25744 llvm-svn: 284524	2016-10-18 20:01:12 +00:00
Tim Northover	4494d69862	GlobalISel: support floating-point constants on AArch64. Patch from Ahmed Bougacha. llvm-svn: 284523	2016-10-18 19:47:57 +00:00
Krzysztof Parzyszek	5bb417bed2	[Hexagon] Handle block live-ins with lane masks in HexagonBlockRanges llvm-svn: 284522	2016-10-18 19:47:20 +00:00
Benjamin Kramer	4c2582ad78	Reduce global namespace pollution. NFC. llvm-svn: 284521	2016-10-18 19:39:31 +00:00
Benjamin Kramer	ee042234ae	[esan] Remove global variable. It's not thread safe and completely unnecessary. llvm-svn: 284520	2016-10-18 19:39:23 +00:00
Simon Pilgrim	476560aac4	[X86][SSE] Added vector lshr/shl combine tests This doesn't cover all combines in DAGCombiner::visitSRL/visitSHL yet, but identifies several cases where we fail to combine vectors (or non-splatted) vectors llvm-svn: 284518	2016-10-18 19:28:12 +00:00
Benjamin Kramer	1e425c9f24	[InterleavedAccessPass] Remove global variable. This is a threading hazard and rightfully complained about by tsan. No functionality change. llvm-svn: 284515	2016-10-18 18:59:58 +00:00
Kostya Serebryany	bb59ef77ca	[libFuzzer] detect leaks after every run when executing fixed inputs (./fuzzer -runs=1000000 my-file) llvm-svn: 284514	2016-10-18 18:38:08 +00:00
Sanjay Patel	19601fa587	revert r284495: [Target] remove TargetRecip class There's something wrong with the StringRef usage while parsing the attribute string. llvm-svn: 284513	2016-10-18 18:36:49 +00:00
Kostya Serebryany	652ede62e9	[sancov] add __sanitizer_cov_trace_pc_guard to the supported coverage points llvm-svn: 284509	2016-10-18 18:11:42 +00:00
Kostya Serebryany	8dfed45cd4	[libFuzzer] reshuffle the code for -exit_on_src_pos and -exit_on_item llvm-svn: 284508	2016-10-18 18:06:05 +00:00
Vitaly Buka	8dbe20c1c7	[asan] Rename test file as the poisoning is not "experimental" Reviewers: eugenis Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D25737 llvm-svn: 284507	2016-10-18 18:05:06 +00:00
Vitaly Buka	b1cdd1f6d7	[asan] Combine check-prefixes in stack-poisoning test Reviewers: eugenis Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D25736 llvm-svn: 284506	2016-10-18 18:05:04 +00:00
Vitaly Buka	8e1906ea7e	[asan] Make -asan-experimental-poisoning the only behavior Reviewers: eugenis Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D25735 llvm-svn: 284505	2016-10-18 18:04:59 +00:00
Kevin Enderby	6f69582e9b	Next set of additional error checks for invalid Mach-O files for the load commands that use the MachO::routines_command and and MachO::routines_command_64 types but are not used in llvm libObject code but used in llvm tool code. This includes the LC_ROUTINES and LC_ROUTINES_64 load commands. llvm-svn: 284504	2016-10-18 17:54:17 +00:00
Justin Lebar	688c8347c9	[ADT] Remove CachedHash<T>. Nobody is using it. Differential Revision: https://reviews.llvm.org/D25630 llvm-svn: 284503	2016-10-18 17:50:39 +00:00
Sanjoy Das	507dd40a4a	[SCEV] Make CompareValueComplexity a little bit smarter This helps canonicalization in some cases. Thanks to Pankaj Chawla for the investigation and the test case! llvm-svn: 284501	2016-10-18 17:45:16 +00:00
Sanjoy Das	9cd877a25a	[SCEV] Extract out a helper function; NFC llvm-svn: 284500	2016-10-18 17:45:13 +00:00
Simon Pilgrim	a2778b2591	[X86][SSE] Added vector ashr combine tests This doesn't cover all combines in DAGCombiner::visitSRA yet, but identifies several cases where we fail to combine vectors (or non-splatted) vectors llvm-svn: 284498	2016-10-18 17:31:07 +00:00
Teresa Johnson	f7f02fa089	[docs] Increase minimum supported GCC version for building LLVM to 4.8 Summary: The RFC proposal sent to increase the minimum required GCC version to 4.8 received a lot of support. See the following thread: http://lists.llvm.org/pipermail/llvm-dev/2016-October/105955.html, This patch implements that by updating the docs. I believe the references to libstdc++ 4.7 issues can be removed as well, please let me know if that is not the case or if they should be updated a different way. Reviewers: rengolin Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D25683 llvm-svn: 284497	2016-10-18 17:17:37 +00:00
Sanjay Patel	08fff9ca81	[Target] remove TargetRecip class; move reciprocal estimate isel functionality to TargetLowering This is a follow-up to D24816 - where we changed reciprocal estimates to be function attributes rather than TargetOptions. This patch is intended to be a structural, but not functional change. By moving all of the TargetRecip functionality into TargetLowering, we can remove all of the reciprocal estimate state, shield the callers from the string format implementation, and simplify/localize the logic needed for a target to enable this. If a function has a "reciprocal-estimates" attribute, those settings may override the target's default reciprocal preferences for whatever operation and data type we're trying to optimize. If there's no attribute string or specific setting for the op/type pair, just use the target default settings. As noted earlier, a better solution would be to move the reciprocal estimate settings to IR instructions and SDNodes rather than function attributes, but that's a multi-step job that requires infrastructure improvements. I intend to work on that, but it's not clear how long it will take to get all the pieces in place. Differential Revision: https://reviews.llvm.org/D25440 llvm-svn: 284495	2016-10-18 17:05:05 +00:00
Simon Pilgrim	25e9628978	[DAGCombiner] Add splatted vector support to (udiv x, (shl pow2, y)) -> x >>u (log2(pow2)+y) llvm-svn: 284491	2016-10-18 16:36:00 +00:00
Adrian Prantl	919bdf1d4f	Disable fatal errors in the Verifier instantiated by bugpoint's crash debugger. When bugpoint hacks at a testcase it may at one point create illegal debug info metadata that won't even pass the Verifier. A bugpoint driver built with assertions should not assert on it, but reject the malformed intermediate step and continue to do its job. llvm-svn: 284490	2016-10-18 16:24:43 +00:00
Simon Pilgrim	ca3072ac58	[X86][AVX512] Add mask/maskz writemask support to constant pool shuffle decode commentx llvm-svn: 284488	2016-10-18 15:45:37 +00:00
Simon Dardis	858915f054	[mips][ias] Handle more complicated expressions for memory operands This patch teaches ias for mips to handle expressions such as (84)+(831)($sp). Such expression typically occur from the expansion of multiple macro definitions. This partially resolves PR/30383. Thanks to Sean Bruno for reporting the issue! Reviewers: zoran.jovanovic, vkalintiris Differential Revision: https://reviews.llvm.org/D24667 llvm-svn: 284485	2016-10-18 15:17:17 +00:00
Simon Dardis	c4463c942c	[mips] Fix sync instruction definition The 'sync' instruction for MIPS was defined in MIPS-II as taking no operands. MIPS32 extended the define of 'sync' as taking an optional unsigned 5 bit immediate. This patch correct the definition of sync so that it is accepted with an operand of 0 or no operand for MIPS-II to MIPS-V, and a 5 bit unsigned immediate for MIPS32 and later revisions. Additionally a clear error is given when the MIPS32 version of sync is used when targeting pre MIPS32. This partially resolves PR/30714. Thanks to Daniel Sanders for reporting this issue! Reveiwers: vkalintiris Differential Revision: https://reviews.llvm.org/D25672 llvm-svn: 284483	2016-10-18 14:42:13 +00:00
Victor Leschuk	197aa3192d	DebugInfo: change alignment type from uint64_t to uint32_t to save space. In futher patches we shall have alignment field added to DIVariable family and switching from uint64_t to uint32_t will save 4 bytes per variable. Differential Revision: https://reviews.llvm.org/D25620 llvm-svn: 284482	2016-10-18 14:31:22 +00:00
Simon Dardis	aff4d141b9	[mips] Macro expansion for ld, sd for O32 ld and sd when assembled for the O32 ABI expand to a pair of 32 bit word loads or stores using the specified source or destination register and the next register. This patch does not add support for the cases where the offset is greater than a 16 bit signed immediate as that would lead to a wrong/misleading error message as the assembler would report "instruction requires a CPU feature not currently enabled" for ld & sd for MIPS64 when their offset is not a signed 16 bit number. This fixes PR/29159. Thanks to Sean Bruno for reporting this issue! Reviewers: vkalintiris, seanbruno, zoran.jovanovic Differential Review: https://reviews.llvm.org/D24556 llvm-svn: 284481	2016-10-18 14:28:00 +00:00
Michael Zuckerman	1bee6340ef	[x86][inline-asm][avx512] allow swapping of '{k<num>}' & '{z}' marks Committing on behalf of Coby Tayree: After check-all and LGTM Desc: AVX512 allows dest operand to be followed by an op-mask register specifier ('{k<num>}', which in turn may be followed by a merging/zeroing specifier ('{z}') Currently, the following forms are allowed: {k<num>} {k<num>}{z} This patch allows the following forms: {z}{k<num>} and ignores the next form: {z} Justification would be quite simple - GCC Differential Revision: http://reviews.llvm.org/D25013 llvm-svn: 284479	2016-10-18 13:52:39 +00:00
Simon Pilgrim	65e0c73875	Strip trailing whitespace (NFCI) llvm-svn: 284478	2016-10-18 13:44:00 +00:00
Vasileios Kalintiris	3955b75ba9	[mips][FastISel] Instantiate the MipsFastISel class only for targets that support FastISel. Summary: Instead of instantiating the MipsFastISel class and checking if the target is supported in the overriden methods, we should perform that check before creating the class. This allows us to enable FastISel only for targets that truly support it, ie. MIPS32 to MIPS32R5. Reviewers: sdardis Subscribers: ehostunreach, llvm-commits Differential Revision: https://reviews.llvm.org/D24824 llvm-svn: 284475	2016-10-18 13:05:42 +00:00
George Rimar	bcfcb9e60f	[llvm-readobj] - Teach readobj to print PT_OPENBSD_RANDOMIZE/PT_OPENBSD_WXNEEDED headers. These are OpenBSD specific program headers and currently we support them in LLD. Description of headers (just in case) available here: http://man.openbsd.org/OpenBSD-current/man5/elf.5 OpenBSD commits were: For PT_OPENBSD_RANDOMIZE: `c494713c45` For PT_OPENBSD_WXNEEDED: `2a5a8fc7e3` Differential revision: https://reviews.llvm.org/D25616 llvm-svn: 284471	2016-10-18 10:54:56 +00:00
John Brawn	ecf79300dd	[SCEV] More accurate calculation of max backedge count of some less-than loops In loops that look something like i = n; do { ... } while(i++ < n+k); where k is a constant, the maximum backedge count is k (in fact the backedge count will be either 0 or k, depending on whether n+k wraps). More generally for LHS < RHS if RHS-(LHS of first comparison) is a constant then the loop will iterate either 0 or that constant number of times. This allows for more loop unrolling with the recent upper bound loop unrolling changes, and I'm working on a patch that will let loop unrolling additionally make use of the loop being executed either 0 or k times (we need to retain the loop comparison only on the first unrolled iteration). Differential Revision: https://reviews.llvm.org/D25607 llvm-svn: 284465	2016-10-18 10:10:53 +00:00
Renato Golin	9ce5074d29	Revert "Resubmit "Add support for advanced number formatting."" This reverts commits 284436 and 284437 because they still break AArch64 bots: Value of: format_number(-10, IntegerStyle::Integer, 1) Actual: "-0" Expected: "-10" llvm-svn: 284462	2016-10-18 09:30:18 +00:00
Simon Pilgrim	33f138b566	[X86][SSE] Added extra (mul x, (1 << c)) -> x << c style vector tests vXi64 will benefit more from lowering to shifts than multiplies llvm-svn: 284461	2016-10-18 09:29:13 +00:00
Javed Absar	e7c338081a	[ARM] Assign cost of scaling for Cortex-R52 This patch assigns cost of the scaling used in addressing for Cortex-R52. On Cortex-R52 a negated register offset takes longer than a non-negated register offset, in a register-offset addressing mode. Differential Revision: http://reviews.llvm.org/D25670 Reviewer: jmolloy llvm-svn: 284460	2016-10-18 09:08:54 +00:00
Simon Pilgrim	4ddc92b6cd	[X86][SSE] Add lowering to cvttpd2dq/cvttps2dq for sitofp v2f64/2f32 to 2i32 As discussed on PR28461 we currently miss the chance to lower "fptosi <2 x double> %arg to <2 x i32>" to cvttpd2dq due to its use of illegal types. This patch adds support for fptosi to 2i32 from both 2f64 and 2f32. It also recognises that cvttpd2dq zeroes the upper 64-bits of the xmm result (similar to D23797) - we still don't do this for the cvttpd2dq/cvttps2dq intrinsics - this can be done in a future patch. Differential Revision: https://reviews.llvm.org/D23808 llvm-svn: 284459	2016-10-18 07:42:15 +00:00
Dean Michael Berris	156f6cafc2	[XRay] Support for for tail calls for ARM no-Thumb This patch adds simplified support for tail calls on ARM with XRay instrumentation. Known issue: compiled with generic flags: `-O3 -g -fxray-instrument -Wall -std=c++14 -ffunction-sections -fdata-sections` (this list doesn't include my specific flags like --target=armv7-linux-gnueabihf etc.), the following program #include <cstdio> #include <cassert> #include <xray/xray_interface.h> [[clang::xray_always_instrument]] void __attribute__ ((noinline)) fC() { std::printf("In fC()\n"); } [[clang::xray_always_instrument]] void __attribute__ ((noinline)) fB() { std::printf("In fB()\n"); fC(); } [[clang::xray_always_instrument]] void __attribute__ ((noinline)) fA() { std::printf("In fA()\n"); fB(); } // Avoid infinite recursion in case the logging function is instrumented (so calls logging // function again). [[clang::xray_never_instrument]] void simplyPrint(int32_t functionId, XRayEntryType xret) { printf("XRay: functionId=%d type=%d.\n", int(functionId), int(xret)); } int main(int argc, char* argv[]) { __xray_set_handler(simplyPrint); printf("Patching...\n"); __xray_patch(); fA(); printf("Unpatching...\n"); __xray_unpatch(); fA(); return 0; } gives the following output: Patching... XRay: functionId=3 type=0. In fA() XRay: functionId=3 type=1. XRay: functionId=2 type=0. In fB() XRay: functionId=2 type=1. XRay: functionId=1 type=0. XRay: functionId=1 type=1. In fC() Unpatching... In fA() In fB() In fC() So for function fC() the exit sled seems to be called too much before function exit: before printing In fC(). Debugging shows that the above happens because printf from fC is also called as a tail call. So first the exit sled of fC is executed, and only then printf is jumped into. So it seems we can't do anything about this with the current approach (i.e. within the simplification described in https://reviews.llvm.org/D23988 ). Differential Revision: https://reviews.llvm.org/D25030 llvm-svn: 284456	2016-10-18 05:54:15 +00:00
Craig Topper	72b9f9864f	[AVX-512] Add test case to check shuffle decoding for masked vpermilps for r284450. This is harder to do for vpermilpd as shuffle combining turns the constant vector into an immediate since all vpermilpd's inputs with constant vector can also be encoded with the immediate form. llvm-svn: 284455	2016-10-18 05:44:04 +00:00
Justin Bogner	b7c9deb587	Object: Add a missing return in ObjectFile::createObjectFile When Error was threaded through these APIs back in r265606 the "return" was missed here, which triggers a warning if/when I add LLVM_NODISCARD to the Error type. llvm-svn: 284454	2016-10-18 05:17:23 +00:00
Craig Topper	448358b5f1	[X86] Fix DecodeVPERMVMask to handle cases where the constant pool entry has a different type than the shuffle itself. This is especially important for 32-bit targets with 64-bit shuffle elements. llvm-svn: 284453	2016-10-18 04:48:33 +00:00
Andrew Trick	f9df92c915	Improve tablegen gen-subtarget diagnostics for missing machine models. -debug-only=subtarget-emitter prints a lot of machine model diagnostics. This prunes the output so that the "No machine model for XXX on processor YYY" only appears when there is definitely no machine model for that opcode. Previously it was printing that error even if the opcode was covered by a more general scheduling class. <rdar://problem/15919845> [TableGen][CodeGenSchedule] Debug output does not help spotting the missing scheduling classes llvm-svn: 284452	2016-10-18 04:17:44 +00:00
Craig Topper	7268bf99ab	[AVX-512] Fix DecodeVPERMV3Mask to handle cases where the constant pool entry has a different type than the shuffle itself. Summary: This is especially important for 32-bit targets with 64-bit shuffle elements.This is similar to how PSHUFB and VPERMIL handle the same problem. Reviewers: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D25666 llvm-svn: 284451	2016-10-18 04:00:32 +00:00
Craig Topper	175a415e78	[AVX-512] Add support for decoding shuffle mask from constant pool for masked VPERMILPS/PD. llvm-svn: 284450	2016-10-18 03:36:52 +00:00
Mandeep Singh Grang	e82678a657	Fix differences in codegen between Linux and Windows toolchains Summary: There are differences in codegen between Linux and Windows due to: 1. Using std::sort which uses quicksort which is a non-stable sort. 2. Iterating over Set data structure where the iteration order is non deterministic. Reviewers: arsenm, grosbach, junbuml, zinob, MatzeB Subscribers: MatzeB, wdng Differential Revision: https://reviews.llvm.org/D25695 llvm-svn: 284441	2016-10-18 00:11:19 +00:00
Zachary Turner	0d31d9c012	Rename HexStyle -> HexFormatStyle, and remove a constexpr. This should fix the remaining broken builds. llvm-svn: 284437	2016-10-17 23:08:47 +00:00
Zachary Turner	7cd0745c95	Resubmit "Add support for advanced number formatting." This resubmits commits 284425 and r284428, which were reverted in r284429 due to some infinite recursion caused by an incorrect selection of function overloads. Reproduced the failure on Linux using GCC 4.8.4, and confirmed that with the new patch the tests path on GCC as well as MSVC. So hopefully this fixes everything. llvm-svn: 284436	2016-10-17 22:49:24 +00:00
Konstantin Zhuravlyov	98a3ac7106	[AMDGPU] Mark .note section SHF_ALLOC so lld creates a segment for it Differential Revision: https://reviews.llvm.org/D25694 llvm-svn: 284435	2016-10-17 22:40:15 +00:00
Justin Lebar	ee34a7343d	[ADT] Move CachedHashString to its own header in ADT, and rename to CachedHashStringRef. Summary: Reclaiming the name 'CachedHashString' will let us add a type with that name that owns its value. Reviewers: timshen Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D25644 llvm-svn: 284434	2016-10-17 22:24:36 +00:00
Justin Lebar	61b9b6a9d5	[ADT] Add an initializer_list constructor to {Small,}DenseSet. Reviewers: timshen Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D25629 llvm-svn: 284433	2016-10-17 22:24:32 +00:00
Justin Lebar	4bccb581bf	[ADT] Add SmallDenseSet. Summary: This matches SmallDenseMap. Reviewers: timshen Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D25628 llvm-svn: 284432	2016-10-17 22:24:28 +00:00
Kevin Enderby	2490de06f7	Next set of additional error checks for invalid Mach-O files for the load commands that use the MachO::sub_framework_command, MachO::sub_umbrella_command, MachO::sub_library_command and MachO::sub_client_command types but are not used in llvm libObject code but used in llvm tool code. This includes the LC_SUB_FRAMEWORK, LC_SUB_UMBRELLA, LC_SUB_LIBRARY and LC_SUB_CLIENT load commands. llvm-svn: 284431	2016-10-17 22:09:25 +00:00
Zachary Turner	9d58e362d2	Revert formatting changes. This reverts r288425 and r284428 as they are causing test crashes on some systems. llvm-svn: 284429	2016-10-17 21:25:41 +00:00
Zachary Turner	47e2c0a9cb	Try to fix build after invalid pointer conversion. llvm-svn: 284428	2016-10-17 21:14:27 +00:00
Sanjay Patel	8716b3cbe0	remove FIXME comment (fixed with r284424); NFC llvm-svn: 284427	2016-10-17 21:08:39 +00:00
Zachary Turner	99eef2d736	[Support] Add support for "advanced" number formatting. raw_ostream has not afforded a lot of flexibility in terms of how to format numbers when outputting. Wrap this all up into a set of low level helper functions that can be used to output numbers with arbitrary precision, alignment, format, etc and then update raw_ostream to use these functions. This will be useful for upcoming improvements to llvm's string formatting libraries, but are still useful independently. Differential Revision: https://reviews.llvm.org/D25497 llvm-svn: 284425	2016-10-17 20:57:45 +00:00
Sanjay Patel	523cd8290a	[DAG] use isConstOrConstSplat in ComputeNumSignBits to optimize SRA The scalar version of this pattern was noted in: https://reviews.llvm.org/D25485 and fixed with: https://reviews.llvm.org/rL284395 More refactoring of the constant/splat helpers is needed and will happen in follow-up patches. Differential Revision: https://reviews.llvm.org/D25685 llvm-svn: 284424	2016-10-17 20:41:39 +00:00
Sanjay Patel	a7cab58055	[DAG] make isConstOrConstSplat and isConstOrConstSplatFP more accessible; NFC As noted in: https://reviews.llvm.org/D25685 This is the next-to-smallest step needed to enable the ComputeNumSignBits fix in that patch. In a minor attempt to keep some structure, we're pulling the FP helper over along with its integer sibling, but clearly we can and should do more refactoring of the similar helper functions in DAGCombiner and SelectionDAG to simplify and not duplicate functionality. llvm-svn: 284421	2016-10-17 20:26:46 +00:00
Davide Italiano	84bd58e915	[opt] Strip coverage if debug info is not present. If -coverage is passed, but -g is not, clang populates the PassManager pipeline with StripSymbols(debugOnly = true). The stripSymbol pass therefore scans the list of named metadata, drops !llvm.dbg.cu, but leaves !llvm.gcov and !0 (the compileUnit MD) around. The verifier runs, and finds out that there's a CU not listed in !llvm.dbg.cu (as it was previously dropped) -> crash. When we strip debug info, so, check if there's coverage data, and strip it as well, in order to avoid pending metadata left around. Differential Revision: https://reviews.llvm.org/D25689 llvm-svn: 284418	2016-10-17 20:05:35 +00:00
Dehao Chen	018a3afa99	Ignore debug info when making optimization decisions in SimplifyCFG. Summary: Debug info should not affect code generation. This patch properly handles debug info to make sure the generated code are the same with or without debug info. Reviewers: davidxl, mzolotukhin, jmolloy Subscribers: aprantl, llvm-commits Differential Revision: https://reviews.llvm.org/D25286 llvm-svn: 284415	2016-10-17 19:28:44 +00:00
Rafael Espindola	d6d2bafacc	Delete dead code. llvm-svn: 284414	2016-10-17 19:25:42 +00:00
Mehdi Amini	8b16663eb3	[doc] use double `` to prevent html output of merging double dash llvm-svn: 284412	2016-10-17 19:23:19 +00:00

... 5 6 7 8 9 ...

140138 Commits