llvm-project

Commit Graph

Author	SHA1	Message	Date
Peter Collingbourne	c74cf06ee4	Object: Use offset+size as the irsymtab string representation. This is consistent with the bitcode string table. Differential Revision: https://reviews.llvm.org/D31922 llvm-svn: 300465	2017-04-17 17:55:24 +00:00
Peter Collingbourne	a0f371a106	Bitcode: Add a string table to the bitcode format. Add a top-level STRTAB block containing a string table blob, and start storing strings for module codes FUNCTION, GLOBALVAR, ALIAS, IFUNC and COMDAT in the string table. This change allows us to share names between globals and comdats as well as between modules, and improves the efficiency of loading bitcode files by no longer using a bit encoding for symbol names. Once we start writing the irsymtab to the bitcode file we will also be able to share strings between it and the module. On my machine, link time for Chromium for Linux with ThinLTO decreases by about 7% for no-op incremental builds or about 1% for full builds. Total bitcode file size decreases by about 3%. As discussed on llvm-dev: http://lists.llvm.org/pipermail/llvm-dev/2017-April/111732.html Differential Revision: https://reviews.llvm.org/D31838 llvm-svn: 300464	2017-04-17 17:51:36 +00:00
Konstantin Zhuravlyov	dc77b2e960	Distinguish between code pointer size and DataLayout::getPointerSize() in DWARF info generation llvm-svn: 300463	2017-04-17 17:41:25 +00:00
Tim Northover	879a0b2e1b	AArch64: support nonlazybind It's almost certainly not a good idea to actually use it in most cases (there's a pretty large code size overhead on AArch64), but we can't do those experiments until it's supported. llvm-svn: 300462	2017-04-17 17:27:56 +00:00
Craig Topper	d23004c37b	Introduce APInt::isSignBitSet/isSignBitClear. Use in place isSignBitSet in place of isNegative in known bits tracking. This makes statements like KnownZero.isNegative() (which means the value we're tracking is positive) less confusing. llvm-svn: 300457	2017-04-17 16:38:20 +00:00
Matt Arsenault	7205f3c2e4	AMDGPU: SimplifyDemandedElts for image intrinsics Causes some VGPR usage improvements in shaderdb, but introduces some SGPR spilling regressions due to random scheduling changes later. llvm-svn: 300453	2017-04-17 15:12:44 +00:00
Davide Italiano	ce161a7812	[LCSSA] Don't insert tokens into the worklist at all. We're gonna skip them anyway, so there's no point in inserting them in the first place. llvm-svn: 300452	2017-04-17 14:32:05 +00:00
Amaury Sechet	f8429754d8	Introducing LLVMMetadataRef Summary: This seems like an uncontroversial first step toward providing access to the metadata hierarchy that now exists in LLVM. This should allow for good debug info support from C. Future plans are to deprecate API that take mixed bags of values and metadata (mainly the LLVMMDNode family of functions) and migrate the rest toward the use of LLVMMetadataRef. Once this is in place, mapping of DIBuilder will be able to start. Reviewers: mehdi_amini, echristo, whitequark, jketema, Wallbraker Reviewed By: Wallbraker Subscribers: Eugene.Zelenko, axw, mehdi_amini, llvm-commits Differential Revision: https://reviews.llvm.org/D19448 llvm-svn: 300447	2017-04-17 11:52:54 +00:00
Max Kazantsev	751579cac0	[LoopPeeling] Get rid of Phis that become invariant after N steps This patch is a generalization of the improvement introduced in rL296898. Previously, we were able to peel one iteration of a loop to get rid of a Phi that becomes an invariant on the 2nd iteration. In more general case, if a Phi becomes invariant after N iterations, we can peel N times and turn it into invariant. In order to do this, we for every Phi in loop's header we define the Invariant Depth value which is calculated as follows: Given %x = phi <Inputs from above the loop>, ..., [%y, %back.edge]. If %y is a loop invariant, then Depth(%x) = 1. If %y is a Phi from the loop header, Depth(%x) = Depth(%y) + 1. Otherwise, Depth(%x) is infinite. Notice that if we peel a loop, all Phis with Depth = 1 become invariants, and all other Phis with finite depth decrease the depth by 1. Thus, peeling N first iterations allows us to turn all Phis with Depth <= N into invariants. Reviewers: reames, apilipenko, mkuper, skatkov, anna, sanjoy Reviewed By: sanjoy Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31613 llvm-svn: 300446	2017-04-17 09:52:02 +00:00
Serguei Katkov	11d9c4f691	[BPI] NFC: reorder ifs to bail out earlier This is non-functional change to re-order if statements to bail out earlier from unreachable and ColdCall heuristics. Reviewers: sanjoy, reames, junbuml, vsk, chandlerc Reviewed By: chandlerc Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31704 llvm-svn: 300442	2017-04-17 06:39:47 +00:00
Max Kazantsev	8ed6b66d85	[LoopPeeling] Fix condition for phi-eliminating peeling When peeling loops basing on phis becoming invariants, we make a wrong loop size check. UP.Threshold should be compared against the total numbers of instructions after the transformation, which is equal to 2 * LoopSize in case of peeling one iteration. We should also check that the maximum allowed number of peeled iterations is not zero. Reviewers: sanjoy, anna, reames, mkuper Reviewed By: mkuper Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31753 llvm-svn: 300441	2017-04-17 05:38:28 +00:00
Serguei Katkov	2616bbb16d	[BPI] Use metadata info before any other heuristics Metadata potentially is more precise than any heuristics we use, so it makes sense to use first metadata info if it is available. However it makes sense to examine it against other strong heuristics like unreachable one. If edge coming to unreachable block has higher probability then it is expected by unreachable heuristic then we use heuristic and remaining probability is distributed among other reachable blocks equally. An example where metadata might be more strong then unreachable heuristic is as follows: it is possible that there are two branches and for the branch A metadata says that its probability is (0, 2^25). For the branch B the probability is (1, 2^25). So the expectation is that first edge of B is hotter than first edge of A because first edge of A did not executed at least once. If first edge of A points to the unreachable block then using the unreachable heuristics we'll set the probability for A to (1, 2^20) and now edge of A becomes hotter than edge of B. This is unexpected behavior. This fixed the biggest part of https://bugs.llvm.org/show_bug.cgi?id=32214 Reviewers: sanjoy, junbuml, vsk, chandlerc Reviewed By: chandlerc Subscribers: llvm-commits, reames, davidxl Differential Revision: https://reviews.llvm.org/D30631 llvm-svn: 300440	2017-04-17 04:33:04 +00:00
Craig Topper	218a359fbd	[InstCombine] Simplify 1/X for vectors. llvm-svn: 300439	2017-04-17 03:41:47 +00:00
Craig Topper	eee53c030a	[InstCombine] Add test cases for missing support for simplifying 1/X for vectors. NFC llvm-svn: 300438	2017-04-17 03:41:44 +00:00
Craig Topper	1a18a7c51e	[InstCombine] Add support for vector srem->urem. llvm-svn: 300437	2017-04-17 01:51:24 +00:00
Craig Topper	b60f300afb	[InstCombine] Add missing testcases for srem->urem conversion. The vector version isn't currently supported. NFC llvm-svn: 300436	2017-04-17 01:51:21 +00:00
Craig Topper	f248468359	[InstCombine] Add support for turning vector sdiv into udiv. llvm-svn: 300435	2017-04-17 01:51:19 +00:00
Craig Topper	43b012b1b3	[InstCombine] Add test cases for missing support for turning vector sdiv into udiv. NFC llvm-svn: 300434	2017-04-17 01:51:16 +00:00
Davide Italiano	ee654bf5f1	[LCSSA] Simplify a loop. NFCI. llvm-svn: 300433	2017-04-17 00:02:45 +00:00
Craig Topper	da886c665b	[InstCombine][ValueTracking] When computing known bits for Srem make sure we don't compute known bits for the LHS twice. If we already called computeKnownBits for the RHS being a constant power of 2, we've already computed everything we can and should just stop. I think previously we would still recurse if we had determined the result was negative or had not determined the sign bit at all. llvm-svn: 300432	2017-04-16 21:46:12 +00:00
Davide Italiano	dd37c67d81	[LCSSA] Fix non-determinism due to iterating over a SmallPtrSet. Use a SmallSetVector instead. llvm-svn: 300431	2017-04-16 21:07:04 +00:00
Craig Topper	0d304f01b4	[InstCombine] In SimplifyDemandedUseBits, don't bother to mask known bits of constants with DemandedMask. Just because we didn't demand them doesn't mean they aren't known. llvm-svn: 300430	2017-04-16 20:55:58 +00:00
Benjamin Kramer	f5f593b674	[X86] Remove special handling for 16 bit for A asm constraints. Our 16 bit support is assembler-only + the terrible hack that is .code16gcc. Simply using 32 bit registers does the right thing for the latter. Fixes PR32681. llvm-svn: 300429	2017-04-16 20:13:08 +00:00
Bryant Wong	c819ba8874	MemorySSA: Stop tracking def-or-use blocks. The tracking is unused, since MemoryPhis are not pruned as of r282419. Differential Revision: https://reviews.llvm.org/D32121 llvm-svn: 300428	2017-04-16 19:45:51 +00:00
Sanjay Patel	35ed2413af	[InstSimplify] improve getTrue/getFalse; NFCI The ConstantInt version has the same assert, and using null/allOnes is likely less efficient. The only advantage of these local variants (and there's probably a better way to achieve this?) is to save typing "ConstantInt::" over and over. llvm-svn: 300426	2017-04-16 17:43:11 +00:00
Dimitry Andric	6380f0ffce	Garbage collect HAVE_EXECINFO_H from config.h.cmake after r300062. NFCI. llvm-svn: 300425	2017-04-16 17:22:44 +00:00
Sanjay Patel	70a575a468	[Constants] simplify get true/false code; NFCI llvm-svn: 300424	2017-04-16 17:00:21 +00:00
Michael Zuckerman	16b20d2fc5	[X86][X86 intrinsics]Folding cmp(sub(a,b),0) into cmp(a,b) optimization This patch adds new optimization (Folding cmp(sub(a,b),0) into cmp(a,b)) to instCombineCall pass and was written specific for X86 CMP intrinsics. Differential Revision: https://reviews.llvm.org/D31398 llvm-svn: 300422	2017-04-16 13:26:08 +00:00
Craig Topper	9edfb08d93	[APInt] Fix a bug in lshr by a value more than 64 bits above the bit width. This was throwing an assert because we determined the intra-word shift amount by subtracting the size of the full word shift from the total shift amount. But we failed to account for the fact that we clipped the full word shifts by total words first. To fix this just calculate the intra-word shift as the remainder of dividing by bits per word. llvm-svn: 300405	2017-04-16 01:03:51 +00:00
Dimitry Andric	909b3376ba	Use correct registers for "A" inline asm constraint Summary: In PR32594, inline assembly using the 'A' constraint on x86_64 causes llvm to crash with a "Cannot select" stack trace. This is because `X86TargetLowering::getRegForInlineAsmConstraint` hardcodes that 'A' means the EAX and EDX registers. However, on x86_64 it means the RAX and RDX registers, and on 16-bit x86 (ia16?) it means the old AX and DX registers. Add new register classes in `X86RegisterInfo.td` to support these cases, and amend the logic in `getRegForInlineAsmConstraint` to cope with different subtargets. Also add a test case, derived from PR32594. Reviewers: craig.topper, qcolombet, RKSimon, ab Reviewed By: ab Subscribers: ab, emaste, royger, llvm-commits Differential Revision: https://reviews.llvm.org/D31902 llvm-svn: 300404	2017-04-15 22:15:01 +00:00
Sanjay Patel	ef9f586bb2	[InstCombine] allow (X != C1 && X != C2) and similar patterns to match splat vector constants llvm-svn: 300402	2017-04-15 17:55:06 +00:00
Sanjay Patel	c8405b82a1	[InstCombine] add tests to show missing transforms for vectors; NFC llvm-svn: 300401	2017-04-15 17:50:45 +00:00
Eric Christopher	908ed7f20c	Tidy checking for the soft float attribute. llvm-svn: 300394	2017-04-15 06:14:52 +00:00
Eric Christopher	85be8ca881	Cache the DataLayout rather than looking it up frequently. llvm-svn: 300393	2017-04-15 06:14:50 +00:00
Vedant Kumar	1a6a2b642b	[ProfileData] Unify getInstrProfSectionName helpers This is a version of D32090 that unifies all of the `getInstrProfSectionName` helper functions. (Note: the build failures which D32090 would have addressed were fixed with r300352.) We should unify these helper functions because they are hard to use in their current form. E.g we recently introduced more helpers to fix section naming for COFF files. This scheme doesn't totally succeed at hiding low-level details about section naming, so we should switch to an API that is easier to maintain. This is not an NFC commit because it fixes llvm-cov's testing support for COFF files (this falls out of the API change naturally). This is an area where we lack tests -- I will see about adding one as a follow up. Testing: check-clang, check-profile, check-llvm. Differential Revision: https://reviews.llvm.org/D32097 llvm-svn: 300381	2017-04-15 00:09:57 +00:00
Sanjoy Das	044f956f9a	Generalize SCEV's unit testing helper a bit llvm-svn: 300379	2017-04-14 23:47:53 +00:00
Craig Topper	9a458cd517	[InstCombine] MakeAnd/Or/Xor handling to reuse previous APInt computations When checking if we should return a constant, we create some temporary APInts to see if we know all bits. But the exact computations we do are needed in several other locations in the same code. This patch moves them to named temporaries so we can reuse them. Ideally we'd write directly to KnownZero/One, but we currently seem to only write those variables after all the simplifications checks and I didn't want to change that with this patch. Differential Revision: https://reviews.llvm.org/D32094 llvm-svn: 300376	2017-04-14 22:34:14 +00:00
Krzysztof Parzyszek	9edaea21af	[RDF] No longer ignore implicit defs or uses on any instructions This used to be a Hexagon-specific treatment, but is no longer needed since it's switched to subregister liveness tracking. llvm-svn: 300369	2017-04-14 21:19:17 +00:00
Krzysztof Parzyszek	fabb68fc06	[RDF] Correctly enumerate reg units for reg masks llvm-svn: 300368	2017-04-14 21:17:36 +00:00
Reid Kleckner	fb502d2f5e	[IR] Make paramHasAttr to use arg indices instead of attr indices This avoids the confusing 'CS.paramHasAttr(ArgNo + 1, Foo)' pattern. Previously we were testing return value attributes with index 0, so I introduced hasReturnAttr() for that use case. llvm-svn: 300367	2017-04-14 20:19:02 +00:00
Kostya Serebryany	23f28e6c75	[libFuzzer] more trophies llvm-svn: 300366	2017-04-14 20:11:16 +00:00
Sam Clegg	135a4b8ea1	[WebAssembly] Improve readobj and nm support for wasm Now that the libObect support for wasm is better we can have readobj and nm produce more useful output too. Differential Revision: https://reviews.llvm.org/D31514 llvm-svn: 300365	2017-04-14 19:50:44 +00:00
Sanjay Patel	7cfe41659c	[InstCombine] (X != C1 && X != C2) --> (X \| (C1 ^ C2)) != C2 ...when C1 differs from C2 by one bit and C1 <u C2: http://rise4fun.com/Alive/Vuo And move related folds to a helper function. This reduces code duplication and will make it easier to remove the scalar-only restriction as a follow-up step. llvm-svn: 300364	2017-04-14 19:23:50 +00:00
Craig Topper	fb71b7d3e0	[InstCombine] Support folding a subtract with a constant LHS into a phi node We currently only support folding a subtract into a select but not a PHI. This fixes that. I had to fix an assumption in FoldOpIntoPhi that assumed the PHI node was always in operand 0. Now we pass it in like we do for FoldOpIntoSelect. But we still require some dancing to find the Constant when we create the BinOp or ConstantExpr. This is based code is similar to what we do for selects. Since I touched all call sites, this also renames FoldOpIntoPhi to foldOpIntoPhi to match coding standards. Differential Revision: https://reviews.llvm.org/D31686 llvm-svn: 300363	2017-04-14 19:20:12 +00:00
Stanislav Mekhanoshin	eff0bc7839	[AMDGPU] set read_only access qualifier for pointers If a kernel's pointer argument is known to be readonly set access qualifier accordingly. This allows RT not to flush caches before dispatches. Differential Revision: https://reviews.llvm.org/D32091 llvm-svn: 300362	2017-04-14 19:11:40 +00:00
Sam Clegg	dd2d7bf100	[Test commit] Cleanup some whitespace in a test file llvm-svn: 300361	2017-04-14 18:43:57 +00:00
Craig Topper	d61ccd735e	[InstCombine] Regenerate test checks using script. NFC llvm-svn: 300360	2017-04-14 18:42:55 +00:00
Sanjay Patel	9d39a9d860	[InstCombine] add/move tests for and/or-of-icmps equality folds; NFC llvm-svn: 300357	2017-04-14 18:19:27 +00:00
Craig Topper	8580cd4e1a	[ValueTracking] Avoid undefined behavior in unittest by not making a named ArrayRef from a std::initializer_list One of the ValueTracking unittests creates a named ArrayRef initialized by a std::initializer_list. The underlying array for an std::initializer_list is only guaranteed to have a lifetime as long as the initializer_list object itself. So this can leave the ArrayRef pointing at an array that no long exists. This fixes this to just create an explicit array instead of an ArrayRef. Differential Revision: https://reviews.llvm.org/D32089 llvm-svn: 300354	2017-04-14 17:59:19 +00:00
Craig Topper	c22c7b1459	[InstCombine] Refactor SimplifyUsingDistributiveLaws to more explicitly skip code when LHS/RHS aren't BinaryOperators Currently this code always makes 2 or 3 calls to tryFactorization regardless of whether the LHS/RHS are BinaryOperators. We make 3 calls when both operands are BinaryOperators with the same opcode. Or surprisingly, when neither are BinaryOperators. This is because getBinOpsForFactorization returns Instruction::BinaryOpsEnd when the operand is not a BinaryOperator. If both LHS and RHS are not BinaryOperators then they both have an Opcode of Instruction::BinaryOpsEnd. When this happens we rely on tryFactorization to early out due to A/B/C/D being null. Similar behavior occurs for the other calls, we rely on getBinOpsForFactorization having made A/B or C/D null to get tryFactorization to early out. We also rely on these null checks to check the result of getIdentityValue and early out for it. This patches refactors this to pull these checks up to SimplifyUsingDistributiveLaws so we don't rely on BinaryOpsEnd as a sentinel or this A/B/C/D null behavior. I think this makes this code easier to reason about. Should also give a tiny performance improvement for cases where the LHS or RHS isn't a BinaryOperator. Differential Revision: https://reviews.llvm.org/D31913 llvm-svn: 300353	2017-04-14 17:55:41 +00:00
Xinliang David Li	4a5ddf8038	[Profile] Make host tool aware of object format when quering prof section names Differential Revision: https://reviews.llvm.org/D32073 llvm-svn: 300352	2017-04-14 17:48:40 +00:00
Alexey Bataev	9c27d79520	Update tests for the patch. llvm-svn: 300351	2017-04-14 17:47:07 +00:00
Sanjoy Das	6a46f767a0	Use range-for in a few places llvm-svn: 300350	2017-04-14 17:42:12 +00:00
Sanjoy Das	3470e14ba4	Rewrite SCEV Normalization using SCEVRewriteVisitor; NFC Removes all of the boilerplate, cache management etc. from ScalarEvolutionNormalization, and keeps only the interesting bits. llvm-svn: 300349	2017-04-14 17:42:10 +00:00
Sanjoy Das	7ea3cb1008	Make SCEVRewriteVisitor smarter about when it trys to create SCEVs This change really saves just one foldingset lookup, but makes SCEVRewriteVisitor "feature compatible" with the handwritten logic in ScalarEvolutionNormalization, so that I can change ScalarEvolutionNormalization to use SCEVRewriteVisitor in a next step. This is a non-functional change, but _may_ improve performance in some pathological cases, but that's unlikely. llvm-svn: 300348	2017-04-14 17:42:08 +00:00
Sanjoy Das	988f32d303	Add missing #include Again, caught by the modules build. llvm-svn: 300346	2017-04-14 17:25:23 +00:00
Krzysztof Parzyszek	74b1f254d4	[RDF] Switch RegisterAggr to a bit vector of register units This avoids many complications related to the complex register aliasing schemes. llvm-svn: 300345	2017-04-14 17:25:13 +00:00
Davide Italiano	91239088a1	[FunctionImport] assert(false) -> llvm_unreachable(). NFCI. llvm-svn: 300344	2017-04-14 17:22:02 +00:00
Sanjoy Das	01545beb75	Remove "#if 0"ed out assert It won't compile after the recent changes I've made, and I think keeping it in provides very little value. Instead I've added (in an earlier commit) a C++ unit test to check the Denormalize(Normalized(X)) == X property for specific instances of X, which is what the assert was trying to do anyway. llvm-svn: 300339	2017-04-14 16:47:15 +00:00
Sanjoy Das	369f3039a3	Delete some unnecessary boilerplate The PostIncTransform class was not pulling its weight, so delete it and use free functions instead. This also makes the use of `function_ref` more idiomatic. We were storing an instance of function_ref in the PostIncTransform class before, which was fine in that specific case, but the usage after this change is more obviously okay. llvm-svn: 300338	2017-04-14 16:47:12 +00:00
Krzysztof Parzyszek	4fe9d6c640	[RDF] Refine propagation of reached uses in liveness computation llvm-svn: 300337	2017-04-14 16:33:54 +00:00
Sanjoy Das	e32214b08c	Add missing #include for STLExtras Looks like earlier I was relying on #include ordering in files that used ScalarEvolutionNormalization.h. Found thanks to the selfhost modules buildbot! llvm-svn: 300336	2017-04-14 16:28:12 +00:00
Krzysztof Parzyszek	f928e24d2a	[Hexagon] Fix a latent problem with interpreting live-in lane masks A non-zero lane mask on a register with no subregister means that the whole register is live-in. It is equivalent to a full mask. llvm-svn: 300335	2017-04-14 16:21:55 +00:00
Sanjoy Das	478cd98b22	Use range for llvm-svn: 300334	2017-04-14 15:50:19 +00:00
Sanjoy Das	c5a87a1949	Simplify PostIncTransform further; NFC Instead of having two ways to check if an add recurrence needs to be normalized, just pass in one predicate to decide that. llvm-svn: 300333	2017-04-14 15:50:07 +00:00
Sanjoy Das	b600d3f2c4	Add a unit test for SCEV Normalization llvm-svn: 300332	2017-04-14 15:50:04 +00:00
Sanjoy Das	e3a15e832c	Tighten the API for ScalarEvolutionNormalization llvm-svn: 300331	2017-04-14 15:49:59 +00:00
Sanjoy Das	ac9f3ea0b4	Remove NormalizeAutodetect; NFC It is cleaner to have a callback based system where the logic of whether an add recurrence is normalized or not lives on IVUsers. This is one step in a multi-step cleanup. llvm-svn: 300330	2017-04-14 15:49:53 +00:00
Krzysztof Parzyszek	643aaea59e	[Hexagon] Make a couple of passes compliant with -opt-bisect-limit llvm-svn: 300329	2017-04-14 15:26:34 +00:00
Simon Pilgrim	0c559f6d9e	[Bugpoint] Use boolean AND instead of bitwise AND (PR32660) llvm-svn: 300327	2017-04-14 15:21:15 +00:00
Simon Pilgrim	5a22eaa2bf	[X86][SSE] Update MOVNTDQA non-temporal loads to generic implementation (LLVM) MOVNTDQA non-temporal aligned vector loads can be correctly represented using generic builtin loads, allowing us to remove the existing x86 intrinsics. Clang companion patch: D31766. Differential Revision: https://reviews.llvm.org/D31767 llvm-svn: 300325	2017-04-14 15:05:35 +00:00
Nirav Dave	353158c177	Fix missing virtual destructor to silence build warning. llvm-svn: 300322	2017-04-14 13:34:33 +00:00
Nirav Dave	642ed1ef7e	Reorder StoreMergeCandidates to run faster. NFCI. llvm-svn: 300321	2017-04-14 13:34:30 +00:00
Dmitry Preobrazhensky	e6ef099dcd	[AMDGPU][MC] Corrected ds_write_src2_* to require one offset instead of two. Fixed bug 32551: https://bugs.llvm.org//show_bug.cgi?id=32551 Reviewers: vpykhtin Differential Revision: https://reviews.llvm.org/D31809 llvm-svn: 300319	2017-04-14 12:28:07 +00:00
Dmitry Preobrazhensky	5714860ee4	[AMDGPU][MC] Enabled constants for src operands of s_cbranch_g_fork Fixed bug 32619: https://bugs.llvm.org//show_bug.cgi?id=32619 Reviewers: artem.tamazov, vpykhtin Differential Revision: https://reviews.llvm.org/D31973 llvm-svn: 300318	2017-04-14 11:52:26 +00:00
Andrew V. Tischenko	4e7bcd5216	Fix for PR#30562: Selection DAG error: Detected cycle in SelectionDAG. Patch by Dinar Temirbulatov llvm-svn: 300314	2017-04-14 09:17:09 +00:00
Alex Denisov	3aa1d004b6	Add more test cases for StringRef::edit_distance Example strings taken from here: http://www.let.rug.nl/~kleiweg/lev/ llvm-svn: 300312	2017-04-14 08:34:32 +00:00
Andrew V. Tischenko	75745d0c3e	This patch closes PR#32216: Better testing of schedule model instruction latencies/throughputs. The details are here: https://reviews.llvm.org/D30941 llvm-svn: 300311	2017-04-14 07:44:23 +00:00
Gil Rapaport	334f8fbe47	[LV] Remove implicit single basic block assumption This patch is part of D28975's breakdown - no change in output intended. LV's code currently assumes the vectorized loop is a single basic block up until predicateInstructions() is called. This patch removes two manifestations of this assumption (loop phi incoming values, dominator tree update) by replacing the use of vectorLoopBody with the vectorized loop's latch/header. Differential Revision: https://reviews.llvm.org/D32040 llvm-svn: 300310	2017-04-14 07:30:23 +00:00
Craig Topper	66df10ff63	[ValueTracking] Calculate the KnownZeros for Intrinsic::ctpop without using a temporary APInt to count leading zeros on. The APInt was created from an 'unsigned' and we just wanted to know how many bits the value needed to represent it. We can just use Log2_32 from MathExtras.h to get the info. llvm-svn: 300309	2017-04-14 06:43:34 +00:00
Craig Topper	1281deaa00	[ValueTracking] Use APInt::isNegative(). NFC llvm-svn: 300308	2017-04-14 06:43:32 +00:00
Craig Topper	f8631cd1de	[ValueTracking] Use APInt::sext instead of zext and setBitsFrom. NFC llvm-svn: 300307	2017-04-14 06:43:29 +00:00
Craig Topper	c9a4fc0750	[InstCombine] Use APInt::setSignBit and APInt::isNegative(). NFC llvm-svn: 300305	2017-04-14 05:09:04 +00:00
Xinliang David Li	9a71766751	Fix test failure on windows: pass module to getInstrProfXXName calls llvm-svn: 300302	2017-04-14 03:03:24 +00:00
Peter Collingbourne	8446f1fe6a	Object, LTO: Add target triple to irsymtab and LTO API. Start using it in LLD to avoid needing to read bitcode again just to get the target triple, and in llvm-lto2 to avoid printing symbol table information that is inappropriate for the target. Differential Revision: https://reviews.llvm.org/D32038 llvm-svn: 300300	2017-04-14 02:55:06 +00:00
Daniel Berlin	2f72b19b05	NewGVN: Don't propagate over phi backedges where undef causes us to have >1 value, unless we can prove the phi node is cycle free. Fixes PR 32607. llvm-svn: 300299	2017-04-14 02:53:37 +00:00
Sanjoy Das	b4654299f3	Use range-for; NFC llvm-svn: 300292	2017-04-14 01:33:15 +00:00
Sanjoy Das	62f4b6bece	Use transform instead of manual loop; NFC llvm-svn: 300291	2017-04-14 01:33:13 +00:00
NAKAMURA Takumi	216db54678	LLVMCodeGen: Add ProfileData into deps corresponding to r300277. llvm-svn: 300289	2017-04-14 00:36:06 +00:00
Stanislav Mekhanoshin	86b0a5465b	[AMDGPU] added SIInstrInfo::getAddNoCarry() helper Addressed rest of post submit comments from D31993. Differential Revision: https://reviews.llvm.org/D32057 llvm-svn: 300288	2017-04-14 00:33:44 +00:00
Lang Hames	c7b9ecaa63	[ORC] Re-enable the Error/Expected unit tests that were disabled in r300177. The tests were failing due to an occasional deadlock in SerializationTraits for Error: Both serializers and deserializers were protected by a single mutex and in the unit test (where both ends of the RPC are in the same process) one side might obtain the mutex, then block waiting for input, leaving the other side of the connection unable to obtain the mutex to write the data the first side was waiting for. Splitting the mutex into two (one for serialization, one for deserialization) appears to have fixed the issue. llvm-svn: 300286	2017-04-14 00:06:12 +00:00
Reid Kleckner	a77172a744	Simplify some Verifier attribute checks with AttributeSet Now that we have a type that can represent the attributes on a single return, function, or parameter, we can pass it around directly rather than passing around AttributeList and Idx. Removes some more one-based argument attribute index counting. NFC llvm-svn: 300285	2017-04-14 00:06:06 +00:00
Matthias Braun	836c383b30	MIRLangRef: Add a section on simplifying .mir tests Differential Revision: http://reviews.llvm.org/D32058 llvm-svn: 300282	2017-04-13 23:45:14 +00:00
Xinliang David Li	57dea2d359	[Profile] PE binary coverage bug fix PR/32584 Differential Revision: https://reviews.llvm.org/D32023 llvm-svn: 300277	2017-04-13 23:37:12 +00:00
Adam Nemet	c5779460f4	[AArch64] Avoid partial register writes on lane 0 of BUILD_VECTOR for i8/i16/f16 This further improves Ahmed's change in rL299482. See the new comment for the rationale. The patch recovers most of the regression for bzip2 after D31965. We're down to +2.68% from +6.97%. Differential Revision: https://reviews.llvm.org/D32028 llvm-svn: 300276	2017-04-13 23:32:47 +00:00
Konstantin Zhuravlyov	d24aeb20fc	AMDGPU/GFX9: Do not use v_pack_b32_f16 when packing Differential Revision: https://reviews.llvm.org/D31819 llvm-svn: 300275	2017-04-13 23:17:00 +00:00
Hans Wennborg	f93c58b81b	build_llvm_package.bat: Move to VS2017 It's required for building the clang-format plugin after r300225. llvm-svn: 300273	2017-04-13 23:13:23 +00:00
Reid Kleckner	f021fab2af	[IR] Make getParamAttributes take argument numbers, not ArgNo+1 Add hasParamAttribute() and use it instead of hasAttribute(ArgNo+1, Kind) everywhere. The fact that the AttributeList index for an argument is ArgNo+1 should be a hidden implementation detail. NFC llvm-svn: 300272	2017-04-13 23:12:13 +00:00
Alexei Starovoitov	56db145164	[bpf] Fix memory offset check for loads and stores If the offset cannot fit into the instruction, an addition to the pointer is emitted before the actual access. However, BPF offsets are 16-bit but LLVM considers them to be, for the matter of this check, to be 32-bit long. This causes the following program: int bpf_prog1(void ign) { volatile unsigned long t = 0x8983984739ull; return (unsigned long )((0xffffffff8fff0002ull) + t); } To generate the following (wrong) code: 0: 18 01 00 00 39 47 98 83 00 00 00 00 89 00 00 00 r1 = 590618314553ll 2: 7b 1a f8 ff 00 00 00 00 (u64 )(r10 - 8) = r1 3: 79 a1 f8 ff 00 00 00 00 r1 = (u64 )(r10 - 8) 4: 79 10 02 00 00 00 00 00 r0 = (u64 *)(r1 + 2) 5: 95 00 00 00 00 00 00 00 exit Fix it by changing the offset check to 16-bit. Patch by Nadav Amit <nadav.amit@gmail.com> Signed-off-by: Alexei Starovoitov <ast@kernel.org> Differential Revision: https://reviews.llvm.org/D32055 llvm-svn: 300269	2017-04-13 22:24:13 +00:00
Matthias Braun	e6185b70e9	MIRLangRef: Simplify/update documentation - Refer to options by `-option` instead of `option` - Use `-mtriple=` instead of `-march` in the example (-march will still target the default operating system which is usually not what you want in a test) - Rephrase sentence because output does not go to stdout by default (you need -o - for that as should be expected). llvm-svn: 300268	2017-04-13 22:14:45 +00:00

1 2 3 4 5 ...

147606 Commits