llvm-project

Commit Graph

Author	SHA1	Message	Date
Zhan Jun Liau	4fbc3f4a37	[SystemZ] Add support for the .insn directive Summary: Add support for the .insn directive. .insn is an s390 specific directive that allows encoding of an instruction instead of using a mnemonic. The motivating case is some code in node.js that requires support for the .insn directive. Reviewers: koriakin, uweigand Subscribers: koriakin, llvm-commits Differential Revision: https://reviews.llvm.org/D21809 llvm-svn: 278012	2016-08-08 15:13:08 +00:00
Sebastian Pop	bfb96c5bfd	GVN-hoist: enable by default llvm-svn: 278010	2016-08-08 14:46:15 +00:00
Artur Pilipenko	eed618d5c0	[LVI] NFC. On the fast dest path use inverse predicate instead of inverse range result Gathering constantins from a condition on the false path ask makeAllowedICmpRegion about inverse predicate instead of inversing the resulting range. This change was separated from the review "[LVI] Make LVI smarter about comparisons with non-constants" (https://reviews.llvm.org/D23205#inline-198361) llvm-svn: 278009	2016-08-08 14:33:11 +00:00
Artur Pilipenko	54b50cc1a8	[LVI] NFC. Rename confusing local NegOffset to Offset NegOffset is not necessarily negative llvm-svn: 278008	2016-08-08 14:13:56 +00:00
Artur Pilipenko	21472910c1	[LVI] NFC. Extract LHS, RHS, Predicate locals in getValueFromCondition llvm-svn: 278007	2016-08-08 14:08:37 +00:00
Silviu Baranga	fa00ba3c1a	[AArch64] PR28877: Don't assume we're running after legalization when creating vcvtfp2fxs Summary: The DAG combine transformation that was generating the aarch64_neon_vcvtfp2fxs node was assuming that all inputs where legal and wasn't accounting that the input could be a v4f64 if we're trying to do the transformation before legalization. We now bail out in this case. All illegal types besides v4f64 were already rejected. Fixes https://llvm.org/bugs/show_bug.cgi?id=28877. Reviewers: jmolloy Subscribers: aemerson, rengolin, llvm-commits Differential Revision: https://reviews.llvm.org/D23261 llvm-svn: 278002	2016-08-08 13:13:57 +00:00
Daniel Sanders	3feeb9c851	Re-commit r277988: [mips][ias] Fix all the hacks related to MIPS-specific unary operators (%hi/%lo/%gp_rel/etc.). Hopefully with the MSVC builds fixed. I've added a missing '#include <tuple>' that gcc and clang don't seem to need. llvm-svn: 277995	2016-08-08 11:50:25 +00:00
Simon Pilgrim	33fc788374	[X86][SSE] Assert if the shuffle mask indices are not -1 or within a valid input range As discussed in post-review rL277959 llvm-svn: 277993	2016-08-08 11:07:34 +00:00
Daniel Sanders	cae9aeed39	Revert r277988: [mips][ias] Fix all the hacks related to MIPS-specific unary operators (%hi/%lo/%gp_rel/etc.). It seems that MSVC doesn't like std::tie(). llvm-svn: 277990	2016-08-08 09:33:14 +00:00
Daniel Sanders	2ab623b5a3	[mips][ias] Fix all the hacks related to MIPS-specific unary operators (%hi/%lo/%gp_rel/etc.). Summary: They are now lexed as a single token on targets where MCAsmInfo::HasMipsExpressions is true and then parsed in a similar way to the '~' operator as part of MCExpr::parseExpression. As a result: * expressions and immediates no longer have different parsing rules. The difference is now solely down to whether evaluateAsAbsolute() succeeds. * %hi(%neg(%gp_rel(x))) are no longer parsed as a single operator and decomposed into the three MipsMCExpr nodes. They are parsed directly as three MipsMCExpr nodes. * parseMemOperand no longer needs to eat all the surrounding parenthesis to get at the outermost operator to make this work * %hi(%neg(%gp_rel(x))) and %lo(%neg(%gp_rel(x))) are no longer the only 3-in-1 relocs that parse for N64. They're still the only combinations that are permitted in relocatable expressions though. Fixing that should be a later patch. * We no longer need to list all the tokens that can occur as the first token of an expression or immediate. test/MC/Mips/expr1.s: This change also prevents the incorrect lowering of %lo(2*4)+foo to %lo(8+foo) which is not an equivalent expression (the difference is whether foo is truncated to 16-bit or not) and the test has been updated to account for the macro expansion the correct expression requires. Reviewers: sdardis Subscribers: dsanders, sdardis, llvm-commits Differential Revision: https://reviews.llvm.org/D23110 llvm-svn: 277988	2016-08-08 09:20:52 +00:00
Diana Picus	4dd6c249ac	[SelectionDAG] Refactor visitInlineAsm a bit. NFCI. This shaves off ~100 lines from visitInlineAsm. llvm-svn: 277987	2016-08-08 08:54:39 +00:00
Sean Silva	0873e7d218	Add some comments linking back to PR28400. Thanks to Mehdi for the suggestion! llvm-svn: 277984	2016-08-08 07:03:49 +00:00
Sean Silva	7f21f4b264	[PM] More workaround for PR28400 llvm-svn: 277982	2016-08-08 05:38:06 +00:00
Sean Silva	744f7a843f	[PM] Invalidate CallGraphAnalysis because it holds AssertingVH This is essentially PR28400. The fix here is similar to that implemented in r274656. llvm-svn: 277980	2016-08-08 05:38:01 +00:00
Daniel Berlin	4b4c722e79	[MSSA] Fix PR28880 by fixing use optimizer's lower bound tracking behavior. Summary: In the use optimizer, we need to keep of whether the lower bound still dominates us or else we may decide a lower bound is still valid when it is not due to intervening pushes/pops. Fixes PR28880 (and probably a bunch of other things). Reviewers: george.burgess.iv Subscribers: MatzeB, llvm-commits, sebpop Differential Revision: https://reviews.llvm.org/D23237 llvm-svn: 277978	2016-08-08 04:44:53 +00:00
Eli Friedman	02419a9849	[JumpThreading] Fix handling of aliasing metadata. Summary: The correctness fix here is that when we CSE a load with another load, we need to combine the metadata on the two loads. This matches the behavior of other passes, like instcombine and GVN. There's also a minor optimization improvement here: for load PRE, the aliasing metadata on the inserted load should be the same as the metadata on the original load. Not sure why the old code was throwing it away. Issue found by inspection. Differential Revision: http://reviews.llvm.org/D21460 llvm-svn: 277977	2016-08-08 04:10:22 +00:00
Davide Italiano	151e5be5ea	[MC] Delete use of *structors_used. Jim Grosbach and Kevin Enderby think those are not used anymore. Originally submitted by: Rafael Espindola llvm-svn: 277973	2016-08-08 03:30:01 +00:00
Davide Italiano	e3b916d164	[SimplifyLibCalls] Emit sqrt intrinsic instead of a libcall. llvm-svn: 277972	2016-08-08 03:23:01 +00:00
Eli Friedman	2a65dd1ba6	[SROA] Fix crash with lifetime intrinsic partially covering alloca. Summary: PromoteMemToReg looks specifically for the pattern bitcast+lifetime.start (or a bitcast-equivalent GEP); any offset will lead to an assertion failure. Fixes https://llvm.org/bugs/show_bug.cgi?id=27999 . Differential Revision: https://reviews.llvm.org/D22737 llvm-svn: 277969	2016-08-08 01:30:53 +00:00
Craig Topper	f44423120f	[AVX-512] Improve lowering of inserting a single element into lowest element of a 512-bit vector of zeroes by using vmovq/vmovd/vmovss/vmovsd. llvm-svn: 277965	2016-08-07 21:52:59 +00:00
Davide Italiano	27da131f32	[SLC] Emit an intrinsic instead of a libcall for pow. Differential Revision: https://reviews.llvm.org/D22104 llvm-svn: 277963	2016-08-07 20:27:03 +00:00
Nico Weber	99ceee8a85	Revert r277905, it caused PR28894 llvm-svn: 277962	2016-08-07 20:18:04 +00:00
Craig Topper	2c51c74d52	[AVX-512] Add 512-bit logical operations to load folding tables. Add avx512f stack folding test and move some tests from the avx512vl test. llvm-svn: 277961	2016-08-07 17:14:09 +00:00
Craig Topper	938e7ab9e1	[AVX-512] Add EVEX encoded floating point MAX/MIN instructions to the load folding tables. llvm-svn: 277960	2016-08-07 17:14:05 +00:00
Simon Pilgrim	21c61fba45	[X86] lowerVectorShuffle - ensure that undefined mask elements only use SM_SentinelUndef Help lowering and combining (which can specify SM_SentinelZero mask elements) share more shuffle matching code. llvm-svn: 277959	2016-08-07 15:29:12 +00:00
Elena Demikhovsky	dca03bebd3	AVX-512: Changed lowering of BITCAST between i1 vectors and i8/i16/i32 integer values Optimized lowering of BITCAST node. The BITCAST node can be replaced with COPY_TO_REG instead of KMOV. It allows to suppress two opposite BITCAST operations and avoid redundant "movs". Differential Revision: https://reviews.llvm.org/D23247 llvm-svn: 277958	2016-08-07 13:05:58 +00:00
David Majnemer	d150137f64	[InstSimplify] Fold gep (gep V, C), (sub 0, V) to C llvm-svn: 277952	2016-08-07 07:58:12 +00:00
David Majnemer	dc8767a49a	[InstSimplify] Try hard to simplify pointer comparisons Simplify ptrtoint comparisons involving operands with different source types. llvm-svn: 277951	2016-08-07 07:58:10 +00:00
David Majnemer	4e4f4437c2	[InstCombine] Infer inbounds on geps of allocas llvm-svn: 277950	2016-08-07 07:58:00 +00:00
Craig Topper	49841c3812	[X86] Add commutable floating point max/min instructions to the load folding tables. llvm-svn: 277949	2016-08-07 05:39:51 +00:00
Craig Topper	c4d757093e	[X86] Simplify a shuffle mask copy. NFC llvm-svn: 277947	2016-08-07 05:39:46 +00:00
Michael Zolotukhin	442b82f0eb	Revert "Revert "[LoopSimplify] Fix updating LCSSA after separating nested loops."" This reverts commit r277901. Reaaply the commit as it looks like it has nothing to do with the bots failures. llvm-svn: 277946	2016-08-07 01:56:54 +00:00
Lang Hames	4679644c53	[ExecutionEngine][RuntimeDyld] Move JITSymbol from ExecutionEngine to RuntimeDyld. JITSymbol really belongs in RuntimeDyld. This should fix the llvm-rtdyld link failures caused by r277943. llvm-svn: 277945	2016-08-07 01:19:37 +00:00
Lang Hames	71f089c82b	[RuntimeDyld] Remove symbol that is unused as of r277943. llvm-svn: 277944	2016-08-07 01:12:44 +00:00
Lang Hames	00769a0904	[RuntimeDyld] Replace manual flag checks with JITSymbolFlags::fromObjectSymbol. llvm-svn: 277943	2016-08-07 00:18:14 +00:00
Lang Hames	73976f622d	[ORC] Re-apply r277896, removing bogus triples and datalayouts that broke tests on linux last time. llvm-svn: 277942	2016-08-06 22:36:26 +00:00
Kostya Serebryany	728447bd3b	[libFuzzer] make libFuzzer work with a bit older clang versions llvm-svn: 277941	2016-08-06 21:28:56 +00:00
Kostya Serebryany	ff1f2107ec	[libFuzzer] don't print bogus error message llvm-svn: 277940	2016-08-06 21:23:29 +00:00
Simon Pilgrim	bc573ca1b8	[X86][AVX2] Improve sign/zero extension on AVX2 targets Split extensions to large vectors into 256-bit chunks - the equivalent of what we do with pre-AVX2 into 128-bit chunks llvm-svn: 277939	2016-08-06 21:21:12 +00:00
Gor Nishanov	28c889593a	CoroSplit: Squash unused variable FnTrigger warning in NDEBUG llvm-svn: 277938	2016-08-06 21:11:10 +00:00
Gor Nishanov	2ed6e788a8	[Coroutines] Part 5: Add CGSCC restart trigger Summary: CoroSplit pass processes the coroutine twice. First, it lets it go through complete IPO optimization pipeline as a single function. It forces restart of the pipeline by inserting an indirect call to an empty function "coro.devirt.trigger" which is devirtualized by CoroElide pass that triggers a restart of the pipeline by CGPassManager. (In later patches, when CoroSplit pass sees the same coroutine the second time, it splits it up, adds coroutine subfunctions to the SCC to be processed by IPO pipeline.) Documentation and overview is here: http://llvm.org/docs/Coroutines.html. Upstreaming sequence (rough plan) 1.Add documentation. (https://reviews.llvm.org/D22603) 2.Add coroutine intrinsics. (https://reviews.llvm.org/D22659) 3.Add empty coroutine passes. (https://reviews.llvm.org/D22847) 4.Add coroutine devirtualization + tests. ab) Lower coro.resume and coro.destroy (https://reviews.llvm.org/D22998) c) Do devirtualization (https://reviews.llvm.org/D23229) 5.Add CGSCC restart trigger + tests. <= we are here 6.Add coroutine heap elision + tests. 7.Add the rest of the logic (split into more patches) Reviewers: mehdi_amini, majnemer Subscribers: llvm-commits, mehdi_amini Differential Revision: https://reviews.llvm.org/D23234 llvm-svn: 277936	2016-08-06 20:44:39 +00:00
Craig Topper	9d8676acc0	[AVX-512] Add SQRT/RCP14/RNDSCALE to hasUndefRegUpdate. llvm-svn: 277934	2016-08-06 19:31:52 +00:00
Craig Topper	19505bc354	[AVX-512] Add AVX-512 scalar CVT instructions to hasUndefRegUpdate. llvm-svn: 277933	2016-08-06 19:31:50 +00:00
Craig Topper	f5d05fb0ce	[X86] Add VRCPSSr_Int, VRSQRTSSr_Int, VSQRTSSr_Int, and VSQRTSDr_Int to hasUndefRegUpdate. llvm-svn: 277931	2016-08-06 19:31:44 +00:00
Simon Pilgrim	7d168e19e8	[X86][SSE] Enable commutation between MOVHLPS and UNPCKHPD Assuming SSE2 is available then we can safely commute between these, removing some unnecessary register moves and improving memory folding opportunities. VEX encoded versions don't benefit so I haven't added support to them. llvm-svn: 277930	2016-08-06 18:40:28 +00:00
Mike Aizatsky	a8e84b9b37	[libfuzzer] do not warn about missing pcbuffer functions: they are new. llvm-svn: 277927	2016-08-06 17:03:22 +00:00
Benjamin Kramer	3f0c1e625d	[ARM] Don't copy MCInsts in loop. NFC. llvm-svn: 277924	2016-08-06 12:58:24 +00:00
Benjamin Kramer	41e66dade1	[Inliner] Use function_ref for functors which are never taken ownership of. llvm-svn: 277922	2016-08-06 12:33:46 +00:00
Benjamin Kramer	a3d4def878	[LoadCombine] Simplify code with a brace init. NFC. llvm-svn: 277921	2016-08-06 12:11:11 +00:00
Simon Pilgrim	f56309f11a	[X86][SSE] Add 2 input shuffle support to matchBinaryVectorShuffle Not actually used yet... llvm-svn: 277919	2016-08-06 11:22:39 +00:00
Benjamin Kramer	b7d3311c77	Move helpers into anonymous namespaces. NFC. llvm-svn: 277916	2016-08-06 11:13:10 +00:00
David Majnemer	70c93fa69a	[CodeGen] Fix a -Wdocumentation warning A parameter was documented with the wrong name. No functionality change is intended. llvm-svn: 277915	2016-08-06 08:37:12 +00:00
David Majnemer	a19d0f2f3e	[ValueTracking] Teach computeKnownBits about [su]min/max Reasoning about a select in terms of a min or max allows us to derive a tigher bound on the result. llvm-svn: 277914	2016-08-06 08:16:00 +00:00
David Majnemer	1665d8635e	[CallGraphSCCPass] Use an ArrayRef instead of a pair of iterators No functional change is intended. llvm-svn: 277913	2016-08-06 06:21:02 +00:00
Sanjoy Das	ba04d3a620	[InstCombine] Don't coerce non-integral pointers to integers Reviewers: majnemer Subscribers: mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D23231 llvm-svn: 277910	2016-08-06 02:58:48 +00:00
Matthias Braun	9a0035d8d2	Revert "(refs/bisect/bad) GVN-hoist: enable by default" GVN-Hoist appears to miscompile llvm-testsuite SingleSource/Benchmarks/Misc/fbench.c at the moment. I filed http://llvm.org/PR28880 This reverts commit r277786. llvm-svn: 277909	2016-08-06 02:23:15 +00:00
Gor Nishanov	31d8c9af89	Part 4c: Coroutine Devirtualization: Devirtualize coro.resume and coro.destroy. Summary: This is the 4c patch of the coroutine series. CoroElide pass now checks if PostSplit coro.begin is referenced by coro.subfn.addr intrinsics. If so replace coro.subfn.addrs with an appropriate coroutine subfunction associated with that coro.begin. Documentation and overview is here: http://llvm.org/docs/Coroutines.html. Upstreaming sequence (rough plan) 1.Add documentation. (https://reviews.llvm.org/D22603) 2.Add coroutine intrinsics. (https://reviews.llvm.org/D22659) 3.Add empty coroutine passes. (https://reviews.llvm.org/D22847) 4.Add coroutine devirtualization + tests. ab) Lower coro.resume and coro.destroy (https://reviews.llvm.org/D22998) c) Do devirtualization <= we are here 5.Add CGSCC restart trigger + tests. 6.Add coroutine heap elision + tests. 7.Add the rest of the logic (split into more patches) Reviewers: majnemer Subscribers: mehdi_amini, llvm-commits Differential Revision: https://reviews.llvm.org/D23229 llvm-svn: 277908	2016-08-06 02:16:35 +00:00
Nico Weber	c893e603ab	Revert r277896. It breaks ExecutionEngine/OrcLazy/weak-function.ll on most bots. Script: -- ... -- Exit Code: 1 Command Output (stderr): -- Could not find main function. llvm-svn: 277907	2016-08-06 02:00:45 +00:00
Kyle Butt	71cb44d969	CodeGen: If Convert blocks that would form a diamond when tail-merged. The following function currently relies on tail-merging for if conversion to succeed. The common tail of cond_true and cond_false is extracted, and this then forms a diamond pattern that can be successfully if converted. If this block does not get extracted, either because tail-merging is disabled or the threshold is higher, we should still recognize this pattern and if-convert it. define i32 @t2(i32 %a, i32 %b) nounwind { entry: %tmp1434 = icmp eq i32 %a, %b ; <i1> [#uses=1] br i1 %tmp1434, label %bb17, label %bb.outer bb.outer: ; preds = %cond_false, %entry %b_addr.021.0.ph = phi i32 [ %b, %entry ], [ %tmp10, %cond_false ] %a_addr.026.0.ph = phi i32 [ %a, %entry ], [ %a_addr.026.0, %cond_false ] br label %bb bb: ; preds = %cond_true, %bb.outer %indvar = phi i32 [ 0, %bb.outer ], [ %indvar.next, %cond_true ] %tmp. = sub i32 0, %b_addr.021.0.ph %tmp.40 = mul i32 %indvar, %tmp. %a_addr.026.0 = add i32 %tmp.40, %a_addr.026.0.ph %tmp3 = icmp sgt i32 %a_addr.026.0, %b_addr.021.0.ph br i1 %tmp3, label %cond_true, label %cond_false cond_true: ; preds = %bb %tmp7 = sub i32 %a_addr.026.0, %b_addr.021.0.ph %tmp1437 = icmp eq i32 %tmp7, %b_addr.021.0.ph %indvar.next = add i32 %indvar, 1 br i1 %tmp1437, label %bb17, label %bb cond_false: ; preds = %bb %tmp10 = sub i32 %b_addr.021.0.ph, %a_addr.026.0 %tmp14 = icmp eq i32 %a_addr.026.0, %tmp10 br i1 %tmp14, label %bb17, label %bb.outer bb17: ; preds = %cond_false, %cond_true, %entry %a_addr.026.1 = phi i32 [ %a, %entry ], [ %tmp7, %cond_true ], [ %a_addr.026.0, %cond_false ] ret i32 %a_addr.026.1 } Without tail-merging or diamond-tail if conversion: LBB1_1: @ %bb @ =>This Inner Loop Header: Depth=1 cmp r0, r1 ble LBB1_3 @ BB#2: @ %cond_true @ in Loop: Header=BB1_1 Depth=1 subs r0, r0, r1 cmp r1, r0 it ne cmpne r0, r1 bgt LBB1_4 LBB1_3: @ %cond_false @ in Loop: Header=BB1_1 Depth=1 subs r1, r1, r0 cmp r1, r0 bne LBB1_1 LBB1_4: @ %bb17 bx lr With diamond-tail if conversion, but without tail-merging: @ BB#0: @ %entry cmp r0, r1 it eq bxeq lr LBB1_1: @ %bb @ =>This Inner Loop Header: Depth=1 cmp r0, r1 ite le suble r1, r1, r0 subgt r0, r0, r1 cmp r1, r0 bne LBB1_1 @ BB#2: @ %bb17 bx lr llvm-svn: 277905	2016-08-06 01:52:37 +00:00
Kyle Butt	54bf3cef92	IfConverter: Split ScanInstructions into 2 functions. ScanInstructions is now 2 functions: AnalyzeBranches and ScanInstructions. ScanInstructions also now takes a pair of arguments delimiting the instructions to be scanned. This will be used for forked diamond support to re-scan only a portion of the block. llvm-svn: 277904	2016-08-06 01:52:34 +00:00
Kyle Butt	4f0e287906	IfConversion: Document countDuplicatedInstructions. NFC llvm-svn: 277903	2016-08-06 01:52:33 +00:00
Kyle Butt	fe916828ee	IfConversion: factor out 2 functions to skip debug instrs. NFC Skipping debug instructions occurrs repeatedly, factor it out. llvm-svn: 277902	2016-08-06 01:52:31 +00:00
Michael Zolotukhin	09cf304ebc	Revert "[LoopSimplify] Fix updating LCSSA after separating nested loops." This reverts commit r277877. Try to appease clang-x64-ninja-win7 buildbot. llvm-svn: 277901	2016-08-06 01:48:51 +00:00
Lang Hames	62a459603c	[ORC] Add (partial) weak symbol support to the CompileOnDemand layer. This adds partial support for weak functions to the CompileOnDemandLayer by modifying the addLogicalModule method to check for existing stub definitions before building a new stub for a weak function. This scheme is sufficient to support ODR definitions, but fails for general weak definitions if strong definition is encountered after the first weak definition. (A more extensive refactor will be required to fully support weak symbols). This patch does not add weak symbol support to RuntimeDyld: I hope to add that in the near future. llvm-svn: 277896	2016-08-06 00:54:43 +00:00
Sanjoy Das	b8c2ebea08	[IRCE] Remove unused headers; NFC llvm-svn: 277892	2016-08-06 00:02:01 +00:00
Sanjoy Das	cf181867a6	[IRCE] Preserve loop-simplify form Fixes PR28764. Right now there is no way to test this, but (as mentioned on the PR) with Michael Zolotukhin's yet to be checked in LoopSimplify verfier, 8 of the llvm-lit tests for IRCE crash. llvm-svn: 277891	2016-08-06 00:01:56 +00:00
Sanjay Patel	8e3ab17c44	[InstCombine] refactor ctlz/cttz folds (NFCI) Note that this fold really belongs in InstSimplify. Refactoring here anyway as an intermediate step because there's a planned addition to this function in D23134. Differential Revision: https://reviews.llvm.org/D23223 llvm-svn: 277883	2016-08-05 22:42:46 +00:00
Daniel Berlin	7ac3d74017	[MSSA] Use depth first iterator instead of custom version. Summary: Originally the plan was to use the custom worklist to do some block popping, and because we don't actually need a visited set. The custom one we have here is slightly broken, and it's not worth fixing vs using depth_first_iterator since we aren't going to go the route we originally were. Fixes PR28874 Reviewers: george.burgess.iv Subscribers: llvm-commits, gberry Differential Revision: https://reviews.llvm.org/D23187 llvm-svn: 277880	2016-08-05 22:09:14 +00:00
Justin Bogner	272cbacc25	CodeView: Remove an unused variable It was breaking the -Werror build. llvm-svn: 277878	2016-08-05 21:57:10 +00:00
Michael Zolotukhin	4c65c3596a	[LoopSimplify] Fix updating LCSSA after separating nested loops. This fixes PR28825. The problem was that we only checked if a value from a created inner loop is used in the outer loop, and fixed LCSSA for them. But we missed to fixup LCSSA for values used in exits of the outer loop. llvm-svn: 277877	2016-08-05 21:52:58 +00:00
Zachary Turner	5e35eaac83	Fix non portable include path. llvm-svn: 277876	2016-08-05 21:50:02 +00:00
Daniel Berlin	7af95876cf	[MSSA] Match assert vs llvm_unreachable style in verification functions. llvm-svn: 277873	2016-08-05 21:47:20 +00:00
Daniel Berlin	2919b1c41b	Rewrite domination verifier to handle local domination as well. Summary: Rewrite domination verifier to handle local domination as well. This catches a bug Geoff Berry noticed. Reviewers: george.burgess.iv Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D23184 llvm-svn: 277872	2016-08-05 21:46:52 +00:00
Zachary Turner	5e3e4bb26b	[CodeView] Decouple record deserialization from visitor dispatch. Until now, our use case for the visitor has been to take a stream of bytes representing a type stream, deserialize the records in sequence, and do something with them, where "something" is determined by how the user implements a particular set of callbacks on an abstract class. For actually writing PDBs, however, we want to do the reverse. We have some kind of description of the list of records in their in-memory format, and we want to process each one. Perhaps by serializing them to a byte stream, or perhaps by converting them from one description format (Yaml) to another (in-memory representation). This was difficult in the current model because deserialization and invoking the callbacks were tightly coupled. With this patch we change this so that TypeDeserializer is itself an implementation of the particular set of callbacks. This decouples deserialization from the iteration over a list of records and invocation of the callbacks. TypeDeserializer is initialized with another implementation of the callback interface, so that upon deserialization it can pass the deserialized record through to the next set of callbacks. In a sense this is like an implementation of the Decorator design pattern, where the Deserializer is a decorator. This will be useful for writing Pdbs from yaml, where we have a description of the type records in Yaml format. In this case, the visitor implementation would have each visitation callback method implemented in such a way as to extract the proper set of fields from the Yaml, and it could maintain state that builds up a list of these records. Finally at the end we can pass this information through to another set of callbacks which serializes them into a byte stream. Reviewed By: majnemer, ruiu, rnk Differential Revision: https://reviews.llvm.org/D23177 llvm-svn: 277871	2016-08-05 21:45:34 +00:00
Marek Olsak	355a8642b4	AMDGPU/SI: Increase SGPR limit to 96 on Tonga/Iceland Summary: This is the setting of the Vulkan closed source driver. It decreases the max wave count from 10 to 8. 26010 shaders in 14650 tests Totals: VGPRS: 829593 -> 808440 (-2.55 %) Spilled SGPRs: 81878 -> 42226 (-48.43 %) Spilled VGPRs: 367 -> 358 (-2.45 %) Scratch VGPRs: 1764 -> 1748 (-0.91 %) dwords per thread Code Size: 36677864 -> 35923932 (-2.06 %) bytes There is a massive decrease in SGPR spilling in general and -7.4% spilled VGPRs for DiRT Showdown (= SGPRs spilled to scratch?) Reviewers: arsenm, tstellarAMD, nhaehnle Subscribers: arsenm, llvm-commits, kzhuravl Differential Revision: https://reviews.llvm.org/D23034 llvm-svn: 277867	2016-08-05 21:23:29 +00:00
Weiming Zhao	f68a6a720c	[ARM] Constant Materialize: imms with specific value can be encoded into mov.w Summary: Thumb2 supports encoding immediates with specific patterns into mov.w by splatting the low 8 bits into other bytes. I'm resubmitting this patch. The test case in the original commit r277610 does not specify triple, so builds with differnt default triple will have different output. This patch fixed trile as thumb-darwin-apple. Reviewers: john.brawn, jmolloy, bruno Subscribers: jmolloy, aemerson, rengolin, samparker, llvm-commits Differential Revision: https://reviews.llvm.org/D23090 llvm-svn: 277865	2016-08-05 20:58:29 +00:00
Davide Italiano	500929df9c	[FlattenCFG] Simplify + remove unused variable. NFCI. llvm-svn: 277864	2016-08-05 20:53:35 +00:00
Dehao Chen	e1c7c57d11	Remove cold callsite heuristic that is not necessary because of cold callee heuristic. llvm-svn: 277863	2016-08-05 20:49:04 +00:00
Dehao Chen	de39cb9384	Replace hot-callsite based heuristic to use its own threshold parameter instead of share inline-hint parameter Summary: Hot callsites should have higher threshold than inline hints. This patch uses separate threshold parameter for hot callsites. Reviewers: davidxl, eraman Subscribers: Prazek, llvm-commits Differential Revision: https://reviews.llvm.org/D22368 llvm-svn: 277860	2016-08-05 20:28:41 +00:00
Mike Aizatsky	b4bbc3bb7a	[sanitizers] trace buffer API to use user-allocated buffer. Differential Revision: https://reviews.llvm.org/D23185 llvm-svn: 277859	2016-08-05 20:09:53 +00:00
Ivan Krasin	b05e06e4fd	WholeProgramDevirt: print remarks with devirtualized method names. Summary: Chrome on Linux uses WholeProgramDevirt for speed ups, and it's important to detect regressions on both sides: the toolchain, if fewer methods get devirtualized after an update, and Chrome, if an innocently looking change caused many hot methods become virtual again. The need to track devirtualized methods is not Chrome-specific, but it's probably the only user of the pass at this time. Reviewers: kcc Differential Revision: https://reviews.llvm.org/D23219 llvm-svn: 277856	2016-08-05 19:45:16 +00:00
David Callahan	45e442ebaa	[ADCE] Refactoring for new functionality (NFC) Summary: This is another refactoring to break up the one function into three logical components functions. Another non-functional change before we start added in features. Reviewers: nadav, mehdi_amini, majnemer Subscribers: twoh, freik, llvm-commits Differential Revision: https://reviews.llvm.org/D23102 llvm-svn: 277855	2016-08-05 19:38:11 +00:00
Sanjoy Das	6fa08aafcc	[ConstantFolding] Don't create illegal (non-integral) inttoptrs Reviewers: majnemer, arsenm Subscribers: mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D23182 llvm-svn: 277854	2016-08-05 19:23:29 +00:00
David Callahan	c1c810de0b	[AutoFDO] Fix handling of empty profiles Summary: If a profile has no samples for a function, then the function "entry count" is set to the value 0. Several places in the code test that if the Function::getEntryCount is defined at all. Here we change to treat a 0 entry count the same as undefined. In particular, this fixes a problem in getLayoutSuccessorProbThreshold in MachineBlockPlacement.cpp where we use a different and inferior heuristic for laying out basic blocks. Reviewers: danielcdh, dnovillo Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D23082 llvm-svn: 277849	2016-08-05 18:38:19 +00:00
Sanjoy Das	b0b4e86215	[SCEV] Don't infinitely recurse on unreachable code llvm-svn: 277848	2016-08-05 18:34:14 +00:00
Kevin Enderby	600fb3f28e	Add the first of what will be a long line of additional error checks for invalid Mach-O files. This is where an LC_SEGMENT load command has a fileoff field that extends past the end of the file. Also fix llvm-nm and llvm-size to remove the errorToErrorCode() call so error messages are printed. And needed to update a few test cases now that they do print the error messages just a bit differently. llvm-svn: 277845	2016-08-05 18:19:40 +00:00
Dehao Chen	17c6afc35b	Do not assign new discriminator for all intrinsics. Summary: We do not care about intrinsic calls when assigning discriminators. Reviewers: davidxl, dnovillo Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D23212 llvm-svn: 277843	2016-08-05 17:56:49 +00:00
Tim Northover	14e7f73a0f	GlobalISel: clear pending phis after MachineFunction translated Test is just reordering the existing functions (it would trigger for any function after one with a phi). llvm-svn: 277841	2016-08-05 17:50:36 +00:00
Simon Pilgrim	69b6a70834	[X86][SSE] Add initial support for 2 input target shuffle combining. At the moment only the INSERTPS matching can actually use 2 inputs but the plumbing is now in place. llvm-svn: 277839	2016-08-05 17:36:14 +00:00
Tim Northover	97d0cb3165	GlobalISel: IRTranslate PHI instructions llvm-svn: 277835	2016-08-05 17:16:40 +00:00
Ulrich Weigand	c3b495a649	[PowerPC] Wrong fast-isel codegen for VSX floating-point loads There were two locations where fast-isel would generate a LFD instruction with a target register class VSFRC instead of F8RC when VSX was enabled. This can ccause invalid registers to be used in certain cases, like: lfd 36, ... instead of using a VSX load instruction. The wrong register number gets silently truncated, causing invalid code to be generated. The first place is PPCFastISel::PPCEmitLoad, which had multiple problems: 1.) The IsVSSRC and IsVSFRC flags are not initialized correctly, since they are computed from resultReg, which is still zero at this point in many cases. Fixed by changing the helper routines to operate on a register class instead of a register and passing in UseRC. 2.) Even with this fixed, Is64VSXLoad is still wrong due to a typo: bool Is32VSXLoad = IsVSSRC && Opc == PPC::LFS; bool Is64VSXLoad = IsVSSRC && Opc == PPC::LFD; The second line needs to use isVSFRC (like PPCEmitStore does). 3.) Once both the above are fixed, we're now generating a VSX instruction -- but an incorrect one, since generation of an indexed instruction with null index is wrong. Fixed by copying the code handling the same issue in PPCEmitStore. The second place is PPCFastISel::PPCMaterializeFP, where we would emit an LFD to load a constant from the literal pool, and use the wrong result register class. Fixed by hardcoding a F8RC class even on systems supporting VSX. Fixes: https://llvm.org/bugs/show_bug.cgi?id=28630 Differential Revision: https://reviews.llvm.org/D22632 llvm-svn: 277823	2016-08-05 15:22:05 +00:00
Zhan Jun Liau	8d3f29759f	[SystemZ] Add missing classes and instructions Summary: Add instruction formats E, RSI, SSd, SSE, and SSF. Added BRXH, BRXLE, PR, MVCK, STRAG, and ECTG instructions to test out those formats. Reviewers: uweigand Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D23179 llvm-svn: 277822	2016-08-05 15:14:34 +00:00
Benjamin Kramer	aa160c22f7	[SimplifyCFG] Make range reduction code deterministic. This generated IR based on the order of evaluation, which is different between GCC and Clang. With that in mind you get bootstrap miscompares if you compare a Clang built with GCC-built Clang vs. Clang built with Clang-built Clang. Diagnosing that made my head hurt. This also reverts commit r277337, which "fixed" the test case. llvm-svn: 277820	2016-08-05 14:55:02 +00:00
Simon Pilgrim	24dc1e7a90	[X86][SSE] Update the the target shuffle matches to use the effective mask's value type directly instead of via the input value type. Preparation for adding 2 input support so we want to avoid unnecessary references to the input value type. llvm-svn: 277817	2016-08-05 14:33:11 +00:00
Simon Pilgrim	7080005e67	[X86][SSE] Consistently use the target shuffle root value type for vector size calculations. NFCI. Preparation for adding 2 input support so we want to avoid unnecessary references to the input value type. llvm-svn: 277814	2016-08-05 13:02:53 +00:00
NAKAMURA Takumi	f72c663ac5	LLLexer.cpp: Avoid using BitsToDouble() to preserve SNaN like "double 0x7FF4000000000000". We should not use double (or float) in the LLVM, unless it is really needed. x87 FP register doesn't preserve SNaN to move the value. FIXME: APFloat() may have the constructor by raw bit. llvm-svn: 277813	2016-08-05 11:59:49 +00:00
NAKAMURA Takumi	2b8c774ce7	Reformat. llvm-svn: 277812	2016-08-05 11:59:45 +00:00
Simon Pilgrim	6f7b0cd530	[X86][SSE] Added target shuffle combine binary compute matching function. NFCI. Added matchBinaryPermuteVectorShuffle and moved the blend+zero and insertps matching code into it. llvm-svn: 277808	2016-08-05 11:16:53 +00:00
John Brawn	4d79ec7fe8	Reapply r276973 "Adjust Registry interface to not require plugins to export a registry" This differs from the previous version by being more careful about template instantiation/specialization in order to prevent errors when building with clang -Werror. Specifically: * begin is not defined in the template and is instead instantiated when Head is. I think the warning when we don't do that is wrong (PR28815) but for now at least do it this way to avoid the warning. * Instead of performing template specializations in LLVM_INSTANTIATE_REGISTRY instead provide a template definition then do explicit instantiation. No compiler I've tried has problems with doing it the other way, but strictly speaking it's not permitted by the C++ standard so better safe than sorry. Original commit message: Currently the Registry class contains the vestiges of a previous attempt to allow plugins to be used on Windows without using BUILD_SHARED_LIBS, where a plugin would have its own copy of a registry and export it to be imported by the tool that's loading the plugin. This only works if the plugin is entirely self-contained with the only interface between the plugin and tool being the registry, and in particular this conflicts with how IR pass plugins work. This patch changes things so that instead the add_node function of the registry is exported by the tool and then imported by the plugin, which solves this problem and also means that instead of every plugin having to export every registry they use instead LLVM only has to export the add_node functions. This allows plugins that use a registry to work on Windows if LLVM_EXPORT_SYMBOLS_FOR_PLUGINS is used. llvm-svn: 277806	2016-08-05 11:01:08 +00:00
Strahinja Petrovic	30e0ce8e9f	[PowerPC] fix passing long double arguments to function (soft-float) This patch fixes passing long double type arguments to function in soft float mode. If there is less than 4 argument registers free (long double type is mapped in 4 gpr registers in soft float mode) long double type argument must be passed through stack. Differential Revision: https://reviews.llvm.org/D20114. llvm-svn: 277804	2016-08-05 08:47:26 +00:00

1 2 3 4 5 ...

93627 Commits