llvm-project

Commit Graph

Author	SHA1	Message	Date
Aditya Nandakumar	892979effc	[GISel]: Implement widenScalar for Legalizing G_PHI https://reviews.llvm.org/D37018 llvm-svn: 311763	2017-08-25 04:57:27 +00:00
Chandler Carruth	46259260c7	[x86] NFC - normalize test case formatting of IR and generate CHECK lines with the script rather than using manually written checks. llvm-svn: 311753	2017-08-25 02:32:51 +00:00
Chandler Carruth	5c69dac589	Teach the llc check updater to recognize the end-of-function comment used on Windows and sometimes Darwin. Cleans up generated patterns for me quite a bit. llvm-svn: 311752	2017-08-25 02:32:48 +00:00
Gor Nishanov	e29e94cf87	[coroutines] Add support for symmetric control transfer (musttail on coro.resumes followed by a suspend) Summary: Add musttail to any resume instructions that is immediately followed by a suspend (i.e. ret). We do this even in -O0 to support guaranteed tail call for symmetrical coroutine control transfer (C++ Coroutines TS extension). This transformation is done only in the resume part of the coroutine that has identical signature and calling convention as the coro.resume call. Reviewers: GorNishanov Reviewed By: GorNishanov Subscribers: EricWF, majnemer, llvm-commits Differential Revision: https://reviews.llvm.org/D37125 llvm-svn: 311751	2017-08-25 02:25:10 +00:00
Chandler Carruth	96db308f03	[x86] NFC: More refactoring to pave the way to extending this ISel logic to handle other x86 pseudos that carry flags and thus can't be matched by our ISel patterns with fused memory accesses. Differential Revision: https://reviews.llvm.org/D37088 llvm-svn: 311749	2017-08-25 02:06:36 +00:00
Chandler Carruth	03258f251f	[x86] NFC - Refactor the custom lowering of `(load; op; store)` RMW sequences. This extracts the code out of a giant switch in preparation for expanding it to handle operations other than `inc` and `dec`. Add a FIXME indicating what's coming here. Differential Revision: https://reviews.llvm.org/D37045 llvm-svn: 311748	2017-08-25 02:04:03 +00:00
Craig Topper	355d8cff49	[X86] Add TBM instructions to X86InstrInfo::isDefConvertible. This allows us to remove "test" instructions and use the flags from the TBM instructions directly. llvm-svn: 311747	2017-08-25 01:59:06 +00:00
Matt Arsenault	f5fb1e8bca	DAG: Fix naming crime Because isOperationCustom was only checking for custom lowering on illegal types, this was behaving inconsistently with the other isOperation* functions, so that isOperationLegalOrCustom != (isOperationLegal || isOperationCustom) Luckily this is only used in one place which already checks the type legality on its own. llvm-svn: 311743	2017-08-25 01:26:13 +00:00
Justin Bogner	ad96ff1228	[sanitizer-coverage] Make sure pc-tables aren't dead stripped Add a reference to the PC array in llvm.used so that linkers that aggressively dead strip (like ld64) don't remove it. llvm-svn: 311742	2017-08-25 01:24:54 +00:00
Mandeep Singh Grang	12bd32937e	[unittests] Remove reverse iteration tests which use pointer-like keys Summary: The expected order of pointer-like keys is hash-function-dependent which in turn depends on the platform/environment. Need to come up with a better way to test reverse iteration of containers with pointer-like keys. Reviewers: dblaikie, mehdi_amini, efriedma, mgrang Reviewed By: mgrang Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D37128 llvm-svn: 311741	2017-08-25 01:11:28 +00:00
Chandler Carruth	5b491808f5	[x86] Back out one aspect of r311318: don't generically set FeatureSlowUAMem32. The idea was to mark things that are slow on widely available processors as slow in the generic CPU so that the code generated for that CPU would be fast across those processors. However, for this feature that doesn't work out very well at all. The problem here is that you can very easily enable AVX or AVX2 on top of this generic CPU. For example, this can happen just by using AVX2 intrinsics from Clang within a region of code guarded by a dynamic CPU feature test. When you do that, the generated code with SlowUAMem32 set is ... amazingly slower. The problem is that there really aren't very good alternatives to the unaligned loads, and so our vector codegen regresses significantly. The other issue is that there are plenty of AMD CPUs with AVX1 that don't set FeatureSlowUAMem32 and so we shouldn't just check for AVX2 instead of this special feature. =/ It would be nice to have the target attribute logic be able to enable/disable more than just one feature at a time and control this in a more fine grained and useful way, but that doesn't seem easy. Given that it is only Sandybridge and Ivybridge that set this feature, for now I'm just backing it out of the generic CPU. That has the additional advantage of going back to the previous state that people seemed vaguely happy with. llvm-svn: 311740	2017-08-25 00:56:05 +00:00
Stephen Hines	cc14a386d8	Fix two (three) more issues with unchecked Error. Summary: If assertions are disabled, but LLVM_ABI_BREAKING_CHANGES is enabled, this will cause an issue with an unchecked Success. Switching to consumeError() is the correct way to bypass the check. This patch also includes disabling 2 tests that can't work without assertions enabled, since llvm_unreachable() with NDEBUG won't crash. Reviewers: llvm-commits, lhames Reviewed By: lhames Subscribers: lhames, pirama Differential Revision: https://reviews.llvm.org/D36729 llvm-svn: 311739	2017-08-25 00:48:21 +00:00
Chandler Carruth	8ac488b161	[x86] Fix an amazing goof in the handling of sub, or, and xor lowering. The comment for this code indicated that it should work similar to our handling of add lowering above: if we see uses of an instruction other than flag usage and store usage, it tries to avoid the specialized X86ISD::* nodes that are designed for flag+op modeling and emits an explicit test. Problem is, only the add case actually did this. In all the other cases, the logic was incomplete and inverted. Any time the value was used by a store, we bailed on the specialized X86ISD node. All of this appears to have been historical where we had different logic here. =/ Turns out, we have quite a few patterns designed around these nodes. We should actually form them. I fixed the code to match what we do for add, and it has quite a positive effect just within some of our test cases. The only thing close to a regression I see is using: notl %r testl %r, %r instead of: xorl -1, %r But we can add a pattern or something to fold that back out. The improvements seem more than worth this. I've also worked with Craig to update the comments to no longer be actively contradicted by the code. =[ Some of this still remains a mystery to both Craig and myself, but this seems like a large step in the direction of consistency and slightly more accurate comments. Many thanks to Craig for help figuring out this nasty stuff. Differential Revision: https://reviews.llvm.org/D37096 llvm-svn: 311737	2017-08-25 00:34:07 +00:00
Sanjay Patel	e404cbff66	[DAG] convert vector select-of-constants to logic/math This goes back to a discussion about IR canonicalization. We'd like to preserve and convert more IR to 'select' than we currently do because that's likely the best choice in IR: http://lists.llvm.org/pipermail/llvm-dev/2016-September/105335.html ...but that's often not true for codegen, so we need to account for this pattern coming in to the backend and transform it to better DAG ops. Steps in this patch: 1. Add an EVT param to the existing convertSelectOfConstantsToMath() TLI hook to more finely enable this transform. Other targets will probably want that anyway to distinguish scalars from vectors. We're using that here to exclude AVX512 targets, but it may not be necessary. 2. Convert a vselect to ext+add. This eliminates a constant load/materialization, and the vector ext is often free. Implementing a more general fold using xor+and can be a follow-up for targets that don't have a legal vselect. It's also possible that we can remove the TLI hook for the special case fold implemented here because we're eliminating a constant, but it needs to be tested on other targets. Differential Revision: https://reviews.llvm.org/D36840 llvm-svn: 311731	2017-08-24 23:24:43 +00:00
Mandeep Singh Grang	872f689d0a	[ADT] Enable reverse iteration for DenseMap Reviewers: mehdi_amini, dexonsmith, dblaikie, davide, chandlerc, davidxl, echristo, efriedma Reviewed By: dblaikie Subscribers: rsmith, mgorny, emaste, llvm-commits Differential Revision: https://reviews.llvm.org/D35043 llvm-svn: 311730	2017-08-24 23:02:48 +00:00
Xinliang David Li	66531dd10a	[Profile] backward propagate profile info in JumpThreading Take-2 after fixing bugs in the original patch. Differential Revision: http://reviews.llvm.org/D36864 llvm-svn: 311727	2017-08-24 22:54:01 +00:00
Sanjay Patel	bb789381fc	[InstCombine] fix and enhance udiv/urem narrowing There are 3 small independent changes here: 1. Account for multiple uses in the pattern matching: avoid the transform if it increases the instruction count. 2. Add a missing fold for the case where the numerator is the constant: http://rise4fun.com/Alive/E2p 3. Enable all folds for vector types. There's still one more potential change - use "shouldChangeType()" to keep from transforming to an illegal integer type. Differential Revision: https://reviews.llvm.org/D36988 llvm-svn: 311726	2017-08-24 22:54:01 +00:00
Dehao Chen	f0e27e63e7	Move accurate-sample-profile into the function attribute. Summary: We need to have accurate-sample-profile in function attribute so that it works with LTO. Reviewers: davidxl, rsmith Reviewed By: davidxl Subscribers: sanjoy, mehdi_amini, javed.absar, llvm-commits, eraman Differential Revision: https://reviews.llvm.org/D37113 llvm-svn: 311706	2017-08-24 21:37:04 +00:00
Eugene Zelenko	5df3d89009	[CodeGen] Fix some Clang-tidy modernize-use-using and Include What You Use warnings; other minor fixes (NFC). llvm-svn: 311703	2017-08-24 21:21:39 +00:00
Chad Rosier	f98335e0b0	[PartialInlining] Formatting. NFC. llvm-svn: 311702	2017-08-24 21:21:09 +00:00
Nathan Hawes	9b656ffbef	test commit: fix typo in comment llvm-svn: 311701	2017-08-24 21:20:41 +00:00
Chad Rosier	4cb2e82774	[PartialInlining] Type. NFC. llvm-svn: 311699	2017-08-24 20:29:02 +00:00
Konstantin Zhuravlyov	68107657d4	AMDGPU: Fix gfx801 features gfx801 has 1/2 rate F64, Fast F32 FMA Differential Revision: https://reviews.llvm.org/D36981 llvm-svn: 311694	2017-08-24 20:03:07 +00:00
Jacob Gravelle	690b76e13d	[WebAssembly] FastISel : Bail to SelectionDAG for constexpr calls Summary: Currently FastISel lowers constexpr calls as indirect calls. We'd like those to be direct calls, and falling back to SelectionDAGISel handles that. Reviewers: dschuff, sunfish Subscribers: jfb, sbc100, llvm-commits, aheejin Differential Revision: https://reviews.llvm.org/D37073 llvm-svn: 311693	2017-08-24 19:53:44 +00:00
Heejin Ahn	34672faf49	[WebAssembly] Update GCC test suite failure expectations Summary: Update GCC test suite failure expectations as we add -O0 to the bare tests in WebAssembly waterfall. There are still several untriaged lld failures. Reviewers: sbc100, jgravelle-google, dschuff Reviewed By: dschuff Subscribers: jfb Differential Revision: https://reviews.llvm.org/D37100 llvm-svn: 311691	2017-08-24 19:43:09 +00:00
Krzysztof Parzyszek	c802d27a93	[Hexagon] Set access size for vector pseudo loads/stores llvm-svn: 311690	2017-08-24 19:19:24 +00:00
Daniel Sanders	069bb8d45f	[globalisel][tablegen] Predicates should start from GIPFP_Invalid+1 not GIPFP_Invalid This fixes a warning when there are zero defined predicates and also fixes an unnoticed bug where the first predicate in the table was unusable. llvm-svn: 311684	2017-08-24 18:54:16 +00:00
Victor Leschuk	6aedf785c5	Remove duplicate code llvm-svn: 311675	2017-08-24 17:02:38 +00:00
Victor Leschuk	471579b52e	Add missing break in switch llvm-svn: 311673	2017-08-24 16:57:10 +00:00
Pete Couperus	2d1f6d67c5	[ARC] Add ARC backend. Add the ARC backend as an experimental target to lib/Target. Reviewed at: https://reviews.llvm.org/D36331 llvm-svn: 311667	2017-08-24 15:40:33 +00:00
Krasimir Georgiev	719f97cf65	[X86AsmParser] Refactor AsmRewrite constructors, NFCI Summary: This is a follow-up of https://reviews.llvm.org/D37105, where a slight refactoring of the constructors of AsmRewrite is proposed. Reviewers: coby Reviewed By: coby Differential Revision: https://reviews.llvm.org/D37110 llvm-svn: 311666	2017-08-24 15:03:18 +00:00
Sanjay Patel	1cc58ecc8a	fix typo; NFC llvm-svn: 311665	2017-08-24 15:00:13 +00:00
Sjoerd Meijer	b0eb5fb317	[AArch64] Add FMOVH0: materialize 0 using zero register for f16 values Instead of loading 0 from a constant pool, it's of course much better to materialize it using an fmov and the zero register. Thanks to Ahmed Bougacha for the suggestion. Differential Revision: https://reviews.llvm.org/D37102 llvm-svn: 311662	2017-08-24 14:47:06 +00:00
Sanjay Patel	5d67d8916e	[BypassSlowDivision] move map helper code to header; NFC We can reuse this code with other div/rem transforms as shown in: https://reviews.llvm.org/D31037 https://bugs.llvm.org/show_bug.cgi?id=31028 llvm-svn: 311661	2017-08-24 14:43:33 +00:00
Chad Rosier	bfd4014304	[TargetParser][AArch64] Add support for RDM feature in the target parser. Differential Revision: https://reviews.llvm.org/D37081 llvm-svn: 311659	2017-08-24 14:30:44 +00:00
Michael Zuckerman	9ee61d9b00	Adding base lit test for x86interleaved llvm-svn: 311658	2017-08-24 14:11:28 +00:00
Coby Tayree	ee1bc325c0	[fixup][rL311639] rL311639 created X86AsmParser a dependency in X86AsmPrinter, which broke builds this fix adds the necessary dep llvm-svn: 311657	2017-08-24 14:10:50 +00:00
Krasimir Georgiev	9ee966548e	[X86AsmParser] Fix msan: use-of-uninitialized-value after r311639 Summary: CodeGen/ms-inline-asm.c test triggers msan use-of-uninitialized-value here: llvm/lib/MC/MCParser/AsmParser.cpp:5629:7 Reviewers: bkramer, coby Differential Revision: https://reviews.llvm.org/D37105 llvm-svn: 311653	2017-08-24 13:38:18 +00:00
Krzysztof Parzyszek	c09a14eeb2	[Hexagon] Generate correct runtime check when recognizing memmove The check (assuming positive stride) for validity of memmove should be (a) the destination is at a lower address than the source, or (b) the distance between the source and destination is greater than or equal the number of bytes copied. For the second part it is sufficient to assume that the destination is at a higher address, since the opposite case is covered by (a). The distance calculation was previously done by subtracting the pointers in the wrong order. llvm-svn: 311650	2017-08-24 11:59:53 +00:00
Evgeny Astigeevich	540a39adf7	[ARM, Thumb1] Prevent ARMTargetLowering::isLegalAddressingMode from accepting illegal modes ARMTargetLowering::isLegalAddressingMode can accept illegal addressing modes for the Thumb1 target. This causes generation of redundant code and affects performance. This fixes PR34106: https://bugs.llvm.org/show_bug.cgi?id=34106 Differential Revision: https://reviews.llvm.org/D36467 llvm-svn: 311649	2017-08-24 10:00:25 +00:00
Tobias Grosser	d7eb619299	Model cache size and associativity in TargetTransformInfo Summary: We add the precise cache sizes and associativity for the following Intel architectures: - Penryn - Nehalem - Westmere - Sandy Bridge - Ivy Bridge - Haswell - Broadwell - Skylake - Kabylake Polly has for several months used a performance model for BLAS computations that derives optimal cache and register tile sizes from cache and latency information (based on ideas from "Analytical Modeling Is Enough for High-Performance BLIS", by Tze Meng Low published at TOMS 2016). While bootstrapping this model, these target values have been kept in Polly. However, as our implementation is now rather mature, it seems time to teach LLVM itself about cache sizes. Interestingly, L1 and L2 cache sizes are pretty constant across micro-architectures, hence a set of architecture specific default values seems like a good start. They can be expanded to more target specific values, in case certain newer architectures require different values. For now a set of Intel architectures are provided. Just as a little teaser, for a simple gemm kernel this model allows us to improve performance from 1.2s to 0.27s. For gemm kernels with less optimal memory layouts even larger speedups can be reported. Reviewers: Meinersbur, bollu, singam-sanjay, hfinkel, gareevroman, fhahn, sebpop, efriedma, asb Reviewed By: fhahn, asb Subscribers: lsaba, asb, pollydev, llvm-commits Differential Revision: https://reviews.llvm.org/D37051 llvm-svn: 311647	2017-08-24 09:46:25 +00:00
Sjoerd Meijer	afc2cd3c9e	[AArch64] Custom lowering of copysign f16 This is a follow up patch of r311154 and introduces custom lowering of copysign f16 to avoid promotions to single precision types when the subtarget supports fullfp16. Differential Revision: https://reviews.llvm.org/D36893 llvm-svn: 311646	2017-08-24 09:21:10 +00:00
Daniel Sanders	2c269f6bf8	Re-commit: [globalisel][tablegen] Add support for ImmLeaf without SDNodeXForm Summary: This patch adds support for predicates on imm nodes but only for ImmLeaf and not for PatLeaf or PatFrag and only where the value does not need to be transformed before being rendered into the instruction. The limitation on PatLeaf/PatFrag/SDNodeXForm is due to differences in the necessary target-supplied C++ for GlobalISel. Depends on D36085 The previous commit was reverted for breaking the build but this appears to have been the recurring problem on the Windows bots with tablegen not being re-run when llvm-tblgen is changed but the .td's aren't. If it re-occurs then forcing a build with clean=True should fix it but this string should do this in advance: Requires a clean build. Reviewers: ab, t.p.northover, qcolombet, rovka, aditya_nandakumar Reviewed By: rovka Subscribers: kristof.beyls, javed.absar, igorb, llvm-commits Differential Revision: https://reviews.llvm.org/D36086 llvm-svn: 311645	2017-08-24 09:11:20 +00:00
Coby Tayree	21c312d8c6	[LLVM][x86][Inline Asm] support for GCC style inline asm - Y<x> constraints This patch is intended to enable the use of basic double letter constraints used in GCC extended inline asm {Yi Y2 Yz Y0 Ym Yt}. Supersedes D35204 Clang counterpart: D36371 Differential Revision: https://reviews.llvm.org/D36369 llvm-svn: 311644	2017-08-24 09:08:33 +00:00
Mikael Holmen	7a99e33b8e	[Reassociate] Do not drop debug location if replacement is missing Summary: When reassociating an expression, do not drop the instruction's original debug location in case the replacement location is missing. The debug location must at least not be dropped for inlinable callsites of debug-info-bearing functions in debug-info-bearing functions. Failing to do so would result in an "inlinable function " "call in a function with debug info must have a !dbg location" error in the verifier. As preserving the original debug location is not expected to result in overly jumpy debug line information, it is preserved for all other cases too. This fixes PR34231: https://bugs.llvm.org/show_bug.cgi?id=34231 Original patch by David Stenberg Reviewers: davide, craig.topper, mcrosier, dblaikie, aprantl Reviewed By: davide, aprantl Subscribers: aprantl Differential Revision: https://reviews.llvm.org/D36865 llvm-svn: 311642	2017-08-24 09:05:00 +00:00
Coby Tayree	d89128925b	[X86AsmParser] Refactoring, (almost) NFC. Some refactoring to X86AsmParser, mostly regarding the way rewrites are conducted. Mainly, we try to concentrate all the rewrite effort under one hood, so it'll hopefully be less of a mess and easier to maintain and understand. naturally, some frontend tests were affected: D36794 Differential Revision: https://reviews.llvm.org/D36793 llvm-svn: 311639	2017-08-24 08:46:25 +00:00
Matt Arsenault	d664315ae8	IPRA: Don't assume called function is first call operand Fixes not finding the called global for AMDGPU call pseudoinstructions, which prevented IPRA from doing much. llvm-svn: 311637	2017-08-24 07:55:15 +00:00
Matt Arsenault	00459e4a06	IPRA: Exit early on functions without calls llvm-svn: 311636	2017-08-24 07:55:13 +00:00
Sjoerd Meijer	046a969360	[AArch64] fix for fcos and frem f16 promotion Fix for copy-paste mistake in r311154; setOperationAction for fcos and frem f16 operands appeared twice (and it should be set to 'promote'). Differential Revision: https://reviews.llvm.org/D37071 llvm-svn: 311635	2017-08-24 07:43:52 +00:00
Chandler Carruth	dc2556934c	[x86] NFC: Clean up two tests and generate precise checks for them. Mostly this involved giving unnamed values names and running the IR through `opt` to re-format it but merging in any important comments in the original. I then deleted pointless comments and inlined the function attributes for ease of reading and editing. All of this is to make it much easier to see the instructions being generated here and evaluate any updates to the tests. llvm-svn: 311634	2017-08-24 07:38:36 +00:00
Igor Breger	47be5fbbe9	[GlobalISel][X86] Support G_IMPLICIT_DEF. Summary: Support G_IMPLICIT_DEF. Reviewers: zvi, guyblank, t.p.northover Reviewed By: guyblank Subscribers: rovka, llvm-commits, kristof.beyls Differential Revision: https://reviews.llvm.org/D36733 llvm-svn: 311633	2017-08-24 07:06:27 +00:00
Lang Hames	cbe694be03	[docs] In the CMake primer, correct the description of the ARGV/ARGN variables. ARGN is the sublist of unnamed arguments, not the count of the arguments. llvm-svn: 311632	2017-08-24 05:38:39 +00:00
Lang Hames	7febf2baff	[Support] Rewrite handleAllErrors in terms of cantFail. This just switches handleAllErrors from using custom assertions that all errors have been handled to using cantFail. This change involves moving some of the class and function definitions around though. llvm-svn: 311631	2017-08-24 05:35:27 +00:00
Wei Ding	a131d3fb29	Add ‘llvm.experimental.constrained.fma‘ Intrinsic. Differential Revision: http://reviews.llvm.org/D36335 llvm-svn: 311629	2017-08-24 04:18:24 +00:00
Adam Nemet	0ada0d5b21	Support all integer types in DiagnosticInfoOptimizationBase::Argument We were missing size_t (unsigned long) on macOS. llvm-svn: 311628	2017-08-24 04:04:49 +00:00
Daniel Berlin	f948603a15	NewGVN: We weren't properly simplifying selects with equal arguments due to a thinko. llvm-svn: 311626	2017-08-24 02:43:17 +00:00
Eric Beckmann	b85172f6ff	Fix bug 34051 by handling empty .res files gracefully. Summary: Previously, llvm-cvtres crashes on .res files which are empty except for the null header. This allows the library to simply pass over them. Subscribers: llvm-commits, hiraditya Differential Revision: https://reviews.llvm.org/D37044 llvm-svn: 311625	2017-08-24 02:36:50 +00:00
Hans Wennborg	c39ec95d88	[DAG] Fix Node Replacement in PromoteIntBinOp When one operand is a user of another in a promoted binary operation we may replace and delete the returned value before returning triggering an assertion. Reorder node replacements to prevent this. Fixes PR34137. Landing on behalf of Nirav. Differential Revision: https://reviews.llvm.org/D36581 llvm-svn: 311623	2017-08-24 01:08:27 +00:00
Dylan McKay	4f5002198b	[AVR] Use the correct register classes for 16-bit atomic operations llvm-svn: 311620	2017-08-24 00:14:38 +00:00
Dehao Chen	b2d1de5a7c	Add test to cover accurate-sample-profile. Summary: This patch adds test to cover the logic guarded by "accurate-sample-profile" flag. Reviewers: davidxl Reviewed By: davidxl Subscribers: sanjoy, llvm-commits, eraman Differential Revision: https://reviews.llvm.org/D37084 llvm-svn: 311618	2017-08-23 23:19:11 +00:00
Tim Northover	4bafa16748	ARM: use internal relocations for local symbols after all. Switching to external relocations for ARM-mode branches (to allow Thumb interworking when the offset is unencodable) causes calls to temporary symbols to be miscompiled and instead go to the parent externally visible symbol. Calling a temporary never happens in compiled code, but can occasionally in hand-written assembly. llvm-svn: 311611	2017-08-23 22:07:10 +00:00
Adrian Prantl	7db6b5e2b3	Retire the llvm.dbg.mir hack after r311594. llvm-svn: 311610	2017-08-23 22:02:36 +00:00
Aditya Nandakumar	850b983455	Fix Verifier test - add REQUIRES aarch64-registered-target llvm-svn: 311609	2017-08-23 21:55:36 +00:00
Adrian Prantl	33aa8acb40	Add a Verifier check for DILocation's scopes. Found via https://bugs.llvm.org/show_bug.cgi?id=33997. llvm-svn: 311608	2017-08-23 21:52:24 +00:00
Jonas Devlieghere	a845167dca	[WebAssembly] Fix overflow for input with missing version Differential revision: https://reviews.llvm.org/D37070 llvm-svn: 311605	2017-08-23 21:36:04 +00:00
Rong Xu	15848e5977	[PGO] Set edge weights for indirectbr instruction with profile counts Current PGO only annotates the edge weight for branch and switch instructions with profile counts. We should also annotate the indirectbr instruction as all the information is there. This patch enables the annotating for indirectbr instructions. Also uses this annotation in branch probability analysis. Differential Revision: https://reviews.llvm.org/D37074 llvm-svn: 311604	2017-08-23 21:36:02 +00:00
Geoff Berry	90bef32219	[AArch64][Falkor] Fix bug in Falkor HWPF tag collision avoidance LDPDi was incorrectly marked as ignoring the destination register in the prefetcher tag. llvm-svn: 311599	2017-08-23 21:11:28 +00:00
Pete Couperus	ed9569dac8	Test commit. Fix instrinsic -> intrinsic typo. llvm-svn: 311598	2017-08-23 20:58:22 +00:00
Aditya Nandakumar	efd8a84cd5	[GISEl]: Translate phi into G_PHI G_PHI has the same semantics as PHI but also has types. This lets us verify that the types in the G_PHI are consistent. This also allows specifying legalization actions for G_PHIs. https://reviews.llvm.org/D36990 llvm-svn: 311596	2017-08-23 20:45:48 +00:00
Reid Kleckner	950567aac4	Attempt to fix the BUILD_SHARED_LIBS build after the DIExpression change llvm-svn: 311595	2017-08-23 20:39:35 +00:00
Reid Kleckner	6d353348e5	Parse and print DIExpressions inline to ease IR and MIR testing Summary: Most DIExpressions are empty or very simple. When they are complex, they tend to be unique, so checking them inline is reasonable. This also avoids the need for CodeGen passes to append to the llvm.dbg.mir named md node. See also PR22780, for making DIExpression not be an MDNode. Reviewers: aprantl, dexonsmith, dblaikie Subscribers: qcolombet, javed.absar, eraman, hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D37075 llvm-svn: 311594	2017-08-23 20:31:27 +00:00
Lei Huang	0cb591fc4c	Update branch coalescing to be a PowerPC specific pass Implementing this pass as a PowerPC specific pass. Branch coalescing utilizes the analyzeBranch method which currently does not include any implicit operands. This is not an issue on PPC but must be handled on other targets. Differential Revision: https://reviews.llvm.org/D32776 llvm-svn: 311588	2017-08-23 19:25:04 +00:00
Greg Clayton	27bfabaf82	Updated my email address. llvm-svn: 311581	2017-08-23 18:00:07 +00:00
Benjamin Kramer	3c56b0bb8f	[X86] Fix -Wenum-compare warning lib/Target/X86/X86ISelLowering.cpp:34613:25: error: enumeral mismatch in conditional expression: 'llvm::ISD::NodeType' vs 'llvm::X86ISD::NodeType' llvm-svn: 311580	2017-08-23 17:50:46 +00:00
Craig Topper	853a8d9ffc	[AVX512] Don't create SHRUNKBLEND SDNodes for 512-bit vectors There are no 512-bit blend instructions so we shouldn't create SHRUNKBLEND for them. On a side note, it looks like there may be a missed opportunity for constant folding TESTM when LHS and RHS are equal. This fixes PR34139. Differential Revision: https://reviews.llvm.org/D36992 llvm-svn: 311572	2017-08-23 16:41:02 +00:00
Craig Topper	f1417ca625	[X86] Remove X86ISD::FMADD in favor ISD::FMA There's no reason to have a target specific node with the same semantics as a target independent opcode. This should simplify D36335 so that it doesn't need to touch X86ISelDAGToDAG.cpp Differential Revision: https://reviews.llvm.org/D36983 llvm-svn: 311568	2017-08-23 16:28:04 +00:00
Yonghong Song	c6d2571031	bpf: close the file descriptor after probe inside getHostCPUNameForBPF Signed-off-by: Yonghong Song <yhs@fb.com> llvm-svn: 311567	2017-08-23 16:24:31 +00:00
Hans Wennborg	66f6fc0a49	LowerAtomic: Don't skip optnone functions; atomic still need lowering (PR34020) The lowering isn't really an optimization, so optnone shouldn't make a difference. ARM relies on the pass running when using "-mthread-model single", because in that mode, it doesn't run AtomicExpand. See bug for more details. Differential Revision: https://reviews.llvm.org/D37040 llvm-svn: 311565	2017-08-23 15:43:28 +00:00
Ilya Biryukov	b2c0794e30	Fixed invalid variable name in Dockerfile scripts. LLVM_SVN_REVISION was used instead of LLVM_SVN_REV. This caused a revision option to be ignored in Dockerfiles. llvm-svn: 311564	2017-08-23 15:36:44 +00:00
Victor Leschuk	3697ebe25f	Revert r311546 as it breaks build http://lab.llvm.org:8011/builders/llvm-clang-x86_64-expensive-checks-win/builds/4394 llvm-svn: 311560	2017-08-23 15:21:10 +00:00
Victor Leschuk	9f11c0bddf	Make lit :: shtest-format.py supported on Windows again It was marked as unsupported on Windows in r311230 because on some Win10 machines it failed or caused hang. The problem was that on these machines system bash (C:\Windows\System32\bash.exe) was used which requires paths to be passed like '/mnt/c/path/to/my/script' instead of 'C:\path\to\my\script'. TODO: we should make lit detect if system bash is used instead of msys and set appropriate path format. llvm-svn: 311558	2017-08-23 14:59:09 +00:00
Rui Ueyama	a93f087d3e	Revert r311552: [Bash-autocompletion] Add support for static analyzer flags This reverts commit r311552 because it broke ubsan and asan bots. llvm-svn: 311557	2017-08-23 14:48:58 +00:00
Gor Nishanov	2f55b958b1	[coroutines] CoroBegin from inner coroutines should be considered for spills Summary: If a coroutine outer calls another coroutine inner and the inner coroutine body is inlined into the outer, coro.begin from the inner coroutine should be considered for spilling if accessed across suspends. Prior to this change, coroutine frame building code was not considering any coro.begins for spilling. With this change, we only ignore coro.begin for the current coroutine, but, any coro.begins that were inlined into the current coroutine are eligible for spills. Fixes PR34267 Reviewers: GorNishanov Subscribers: qcolombet, llvm-commits, EricWF Differential Revision: https://reviews.llvm.org/D37062 llvm-svn: 311556	2017-08-23 14:47:52 +00:00
Chad Rosier	8db41e9dbd	[Reassociate] Don't canonicalize x + (-Constant * y) -> x - (Constant * y).. ..if the resulting subtract will be broken up later. This can cause us to get into an infinite loop. x + (-5.0 * y) -> x - (5.0 * y) ; Canonicalize neg const x - (5.0 * y) -> x + (0 - (5.0 * y)) ; Break up subtract x + (0 - (5.0 * y)) -> x + (-5.0 * y) ; Replace 0-X with X*-1. PR34078 llvm-svn: 311554	2017-08-23 14:10:06 +00:00
Yuka Takahashi	5e7071f5d7	[Bash-autocompletion] Add support for static analyzer flags Summary: This is a patch for clang autocomplete feature. It will collect values which -analyzer-checker takes, which is defined in clang/StaticAnalyzer/Checkers/Checkers.inc, dynamically. First, from ValuesCode class in Options.td, TableGen will generate C++ code in Options.inc. Options.inc will be included in DriverOptions.cpp, and calls OptTable's addValues function. addValues function will add second argument to Option's Values class. Values contains string like "foo,bar,.." which is handed to Values class in OptTable. Reviewers: v.g.vassilev, teemperor, ruiu Subscribers: hiraditya, cfe-commits Differential Revision: https://reviews.llvm.org/D36782 llvm-svn: 311552	2017-08-23 13:39:47 +00:00
Daniel Sanders	c3885c4589	[globalisel][tablegen] Add support for ImmLeaf without SDNodeXForm Summary: This patch adds support for predicates on imm nodes but only for ImmLeaf and not for PatLeaf or PatFrag and only where the value does not need to be transformed before being rendered into the instruction. The limitation on PatLeaf/PatFrag/SDNodeXForm is due to differences in the necessary target-supplied C++ for GlobalISel. Depends on D36085 Reviewers: ab, t.p.northover, qcolombet, rovka, aditya_nandakumar Reviewed By: rovka Subscribers: kristof.beyls, javed.absar, igorb, llvm-commits Differential Revision: https://reviews.llvm.org/D36086 llvm-svn: 311546	2017-08-23 12:14:18 +00:00
Florian Hahn	5b92960091	[ARM] Check for assembler instructions in test. Currently this test causes test failures on some machines, due to isel not being registered. Update the test to run all passes and check emitted assembly instructions for now. llvm-svn: 311545	2017-08-23 11:53:24 +00:00
Florian Hahn	214e13d949	[ARM] Add missing patterns for insert_subvector. Summary: In some cases, shufflevector instruction can be transformed involving insert_subvector instructions. The ARM backend was missing some insert_subvector patterns, causing a failure during instruction selection. AArch64 has similar patterns. Reviewers: t.p.northover, olista01, javed.absar, rengolin Reviewed By: javed.absar Subscribers: aemerson, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D36796 llvm-svn: 311543	2017-08-23 10:20:59 +00:00
Daniel Sanders	499807079b	[globalisel][tablegen] Add tests for FeatureBitsets and ComplexPattern predicates. llvm-svn: 311542	2017-08-23 10:09:25 +00:00
Davide Italiano	06d9eda150	[gold] Test we don't strip globals when producing relocatables. lld was broken in this regard (PR33097). The gold plugin gets this right so, no changes needed, but better adding a test. llvm-svn: 311541	2017-08-23 09:43:41 +00:00
Davide Italiano	c78885818a	[InstCombine] Fold branches with irrelevant conditions to a constant. InstCombine folds instructions with irrelevant conditions to undef. This, as Nuno confirmed is a bug. (see https://bugs.llvm.org/show_bug.cgi?id=33409#c1 ) Given the original motivation for the change is that of removing an USE, we now fold to false instead (which reaches the same goal without undesired side effects). Fixes PR33409. Differential Revision: https://reviews.llvm.org/D36975 llvm-svn: 311540	2017-08-23 09:14:37 +00:00
Hiroshi Inoue	cc555bd0ac	[PowerPC] better instruction selection for OR (XOR) with a 32-bit immediate - recommitting after fixing a test failure on MacOS On PPC64, OR (XOR) with a 32-bit immediate can be done with only two instructions, i.e. ori + oris. But the current LLVM generates three or four instructions for this purpose (and also it clobbers one GPR). This patch makes PPC backend generate ori + oris (xori + xoris) for OR (XOR) with a 32-bit immediate. e.g. (x | 0xFFFFFFFF) should be ori 3, 3, 65535 oris 3, 3, 65535 but LLVM generates without this patch li 4, 0 oris 4, 4, 65535 ori 4, 4, 65535 or 3, 3, 4 Differential Revision: https://reviews.llvm.org/D34757 llvm-svn: 311538	2017-08-23 08:55:18 +00:00
Krasimir Georgiev	3d55cef48b	[AArch64] Silence unused variable warning in opt mode after r311533 llvm-svn: 311535	2017-08-23 08:40:22 +00:00
Sjoerd Meijer	24c98189ed	[AArch64] ISel legalization debug messages. NFCI. Debugging AArch64 instruction legalization and custom lowering is really an unpleasant experience because it shows nodes that appear out of thin air. In commit r311444, some debug messages have been added to SelectionDAG, the target independent part, and this patch adds some AArch64 specific messages. Differential Revision: https://reviews.llvm.org/D36964 llvm-svn: 311533	2017-08-23 08:18:37 +00:00
Alex Bradbury	d5d559421f	[Lanai] Remove dead functions from LanaiRegisterInfo getEHExceptionRegister and getEHHandlerRegister are unused and were removed from most backends in rL192099. This patch removes them from Lanai. Differential Revision: https://reviews.llvm.org/D36829 llvm-svn: 311531	2017-08-23 07:14:48 +00:00
Hiroshi Inoue	dbb285ca51	Revert rL311526: [PowerPC] better instruction selection for OR (XOR) with a 32-bit immediate This reverts commit rL311526 due to failures in some buildbot. llvm-svn: 311530	2017-08-23 06:38:05 +00:00
Craig Topper	a85f86225a	[InstCombine] Remove unused argument. NFC llvm-svn: 311529	2017-08-23 05:46:09 +00:00
Craig Topper	a94069fb4c	[InstCombine] Replace a simple matcher with a plain old dyn_cast. NFC llvm-svn: 311528	2017-08-23 05:46:08 +00:00
Craig Topper	524c44f74e	[InstCombine] Remove an unnecessary dyn_cast to Instruction and a switch over two opcodes. Just dyn_cast to the specific instruction classes individually. NFC Change the helper methods to take the more specific class as well. llvm-svn: 311527	2017-08-23 05:46:07 +00:00
Hiroshi Inoue	c4449df1b0	[PowerPC] better instruction selection for OR (XOR) with a 32-bit immediate On PPC64, OR (XOR) with a 32-bit immediate can be done with only two instructions, i.e. ori + oris. But the current LLVM generates three or four instructions for this purpose (and also it clobbers one GPR). This patch makes PPC backend generate ori + oris (xori + xoris) for OR (XOR) with a 32-bit immediate. e.g. (x | 0xFFFFFFFF) should be ori 3, 3, 65535 oris 3, 3, 65535 but LLVM generates without this patch li 4, 0 oris 4, 4, 65535 ori 4, 4, 65535 or 3, 3, 4 Differential Revision: https://reviews.llvm.org/D34757 llvm-svn: 311526	2017-08-23 05:15:15 +00:00
Dean Michael Berris	0884b73220	[XRay][CodeGen] Use PIC-friendly code in XRay sleds; remove synthetic references in .text Summary: This change achieves two things: - Redefine the Custom Event handling instrumentation points emitted by the compiler to not require dynamic relocation of references to the __xray_CustomEvent trampoline. - Remove the synthetic reference we emit at the end of a function that we used to keep auxiliary sections alive in favour of SHF_LINK_ORDER associated with the section where the function is defined. To achieve the custom event handling change, we've had to introduce the concept of sled versioning -- this will need to be supported by the runtime to allow us to understand how to turn on/off the new version of the custom event handling sleds. That change has to land first before we change the way we write the sleds. To remove the synthetic reference, we rely on a relatively new linker feature that preserves the sections that are associated with each other. This allows us to limit the effects on the .text section of ELF binaries. Because we're still using absolute references that are resolved at runtime for the instrumentation map (and function index) maps, we mark these sections write-able. In the future we can re-define the entries in the map to use relative relocations instead that can be statically determined by the linker. That change will be a bit more invasive so we defer this for later. Depends on D36816. Reviewers: dblaikie, echristo, pcc Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D36615 llvm-svn: 311525	2017-08-23 04:49:41 +00:00
Yonghong Song	dc1dbf6ef3	bpf: add variants of -mcpu=# and support for additional jmp insns -mcpu=# will support: . generic: the default insn set . v1: insn set version 1, the same as generic . v2: insn set version 2, version 1 + additional jmp insns . probe: the compiler will probe the underlying kernel to decide proper version of insn set. We did not use -mcpu=native since llc/llvm will interpret -mcpu=native as the underlying hardware architecture regardless of -march value. Currently, only x86_64 supports -mcpu=probe. Other architectures will silently revert to "generic". Also added -mcpu=help to print available cpu parameters. llvm will print out the information only if there is at least one cpu and at least one feature. Add an unused dummy feature to enable the printout. Examples for usage: $ llc -march=bpf -mcpu=v1 -filetype=asm t.ll $ llc -march=bpf -mcpu=v2 -filetype=asm t.ll $ llc -march=bpf -mcpu=generic -filetype=asm t.ll $ llc -march=bpf -mcpu=probe -filetype=asm t.ll $ llc -march=bpf -mcpu=v3 -filetype=asm t.ll 'v3' is not a recognized processor for this target (ignoring processor) ... $ llc -march=bpf -mcpu=help -filetype=asm t.ll Available CPUs for this target: generic - Select the generic processor. probe - Select the probe processor. v1 - Select the v1 processor. v2 - Select the v2 processor. Available features for this target: dummy - unused feature. Use +feature to enable a feature, or -feature to disable it. For example, llc -mcpu=mycpu -mattr=+feature1,-feature2 ... Signed-off-by: Daniel Borkmann <daniel@iogearbox.net> Signed-off-by: Yonghong Song <yhs@fb.com> Acked-by: Alexei Starovoitov <ast@kernel.org> llvm-svn: 311522	2017-08-23 04:25:57 +00:00
Matthias Braun	d6c0868da5	Fix tail-merge-after-mbp test The output of this test changed after the fix in r311520 to have -run-pass=block-placement behave like it does in a normal pipeline. Adjust the test. llvm-svn: 311521	2017-08-23 03:49:53 +00:00
Matthias Braun	8426d1342d	Add test case for r311511 This also changes the TailDuplicator to be configured explicitly pre/post regalloc rather than relying on the isSSA() flag. This was necessary to have `llc -run-pass` work reliably. llvm-svn: 311520	2017-08-23 03:17:59 +00:00
Martell Malone	cc82cdfffc	NFC: fix ToolDrivers syntax and typo errors infoTable -> InfoTable camelCase Libtool Options #define offset llvm-svn: 311517	2017-08-23 02:10:28 +00:00
George Karpenkov	0ac90d3f78	Update LLVM fuzzers to use the libFuzzer bundled with the compiler toolchain Differential Revision: https://reviews.llvm.org/D37041 llvm-svn: 311515	2017-08-23 00:40:58 +00:00
George Karpenkov	218ea7f69c	Remove llvm-pdbutil/fuzzer. The code does not compile, is not maintained, and does not have a buildbot. Differential Revision: https://reviews.llvm.org/D37032 llvm-svn: 311512	2017-08-23 00:02:10 +00:00
Matthias Braun	55bc9b3f9e	TargetInstrInfo: Change duplicate() to work on bundles. Adds infrastructure to clone whole instruction bundles rather than just single instructions. This fixes a bug where tail duplication would unbundle instructions while cloning. This should unbreak the "Clang Stage 1: cmake, RA, with expensive checks enabled" build on greendragon. The bot broke with r311139 hitting this pre-existing bug. A proper testcase will come next. llvm-svn: 311511	2017-08-22 23:56:30 +00:00
Craig Topper	35189d5221	[SelectionDAG] Make ISD::isConstantSplatVector always return an element sized APInt. This partially reverts r311429 in favor of making ISD::isConstantSplatVector do something not confusing. Turns out the only other user of it was also having to deal with the weird property of it returning a smaller size. So rather than continue to deal with this quirk everywhere, just make the interface do something sane. Differential Revision: https://reviews.llvm.org/D37039 llvm-svn: 311510	2017-08-22 23:54:13 +00:00
Craig Topper	ec4b82571c	[InstCombine] Remove check for sext of vector icmp from shouldOptimizeCast Looks like for 'and' and 'or' we end up performing at least some of the transformations this is blocking in a roundabout way anyway. For 'and sext(cmp1), sext(cmp2)' we end up later turning it into 'select cmp1, sext(cmp2), 0'. Then we optimize that back to sext (and cmp1, cmp2). This is the same result we would have gotten if shouldOptimizeCast hadn't blocked it. We do something analogous for 'or'. With this patch we allow that transformation to happen directly in foldCastedBitwiseLogic. And we now support the same thing for 'xor'. This is definitely opening up many other cases, but since we already went around it for some cases hopefully it's ok. Differential Revision: https://reviews.llvm.org/D36213 llvm-svn: 311508	2017-08-22 23:40:15 +00:00
Jonas Devlieghere	4942a0b0f3	Revert "[llvm-dwarfdump] Print type names in DW_AT_type DIEs" This reverts commit r311492. llvm-svn: 311499	2017-08-22 21:59:46 +00:00
Jonas Devlieghere	f456d1864d	[llvm-dwarfdump] Print type names in DW_AT_type DIEs This patch adds printing for DW_AT_type DIEs like it's currently already the case for DW_AT_specification DIEs. llvm-svn: 311492	2017-08-22 21:41:49 +00:00
Peter Collingbourne	001052a067	WholeProgramDevirt: Create bitcast to i8* at each virtual call site. We can't reuse the llvm.assume instruction's bitcast because it may not dominate every user of the vtable pointer. Differential Revision: https://reviews.llvm.org/D36994 llvm-svn: 311491	2017-08-22 21:41:19 +00:00
Matt Morehouse	b1fa8255db	[SanitizerCoverage] Optimize stack-depth instrumentation. Summary: Use the initialexec TLS type and eliminate calls to the TLS wrapper. Fixes the sanitizer-x86_64-linux-fuzzer bot failure. Reviewers: vitalybuka, kcc Reviewed By: kcc Subscribers: hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D37026 llvm-svn: 311490	2017-08-22 21:28:29 +00:00
Jakub Kuderski	2724d45325	[ADCE][Dominators] Reapply: Teach ADCE to preserve dominators Summary: This patch teaches ADCE to preserve both DominatorTrees and PostDominatorTrees. This reapplies the original patch r311057 that was reverted in r311381. The previous version wasn't using the batch update api for updating dominators, which in very rare cases caused assertion failures. This also fixes PR34258. Reviewers: dberlin, chandlerc, sanjoy, davide, grosser, brzycki Reviewed By: davide Subscribers: grandinj, zhendongsu, llvm-commits, david2050 Differential Revision: https://reviews.llvm.org/D35869 llvm-svn: 311467	2017-08-22 16:30:21 +00:00
Jonas Devlieghere	a680a8f5f8	[Debug info] Add new DbgValues after looping over DAG I was contacted by Jesper Antonsson from Ericsson who ran into problems with r311181 in their test suites for an out-of-tree target. Because of the latter I don't have a reproducer, but we definitely don't want to modify the data structure on which we are iterating inside the loop. llvm-svn: 311466	2017-08-22 16:28:07 +00:00
Sanjay Patel	0ab50f6d68	[x86] auto-generate full checks; NFC I don't see anything Darwin-specific here, so I made the target generic x86-64. llvm-svn: 311465	2017-08-22 16:27:00 +00:00
Sanjay Patel	40b8e3bfe5	[x86] simplify runs and auto-generate full checks I've replaced the two OS-specific runs with a generic run because there's no functional difference in the resulting output that we're checking. Also, the script still doesn't work with a Win target. llvm-svn: 311463	2017-08-22 16:21:45 +00:00
Erich Keane	0343ef8672	Emit section information for extern variables Update IR generated to retain section information for external declarations. This is related to https://reviews.llvm.org/D36487 Patch By: eandrews Differential Revision: https://reviews.llvm.org/D36712 llvm-svn: 311459	2017-08-22 15:30:43 +00:00
Sam Parker	d65e19f7b3	[ARM][AArch64] Add Armv8.3-a unittests Add Armv8.3-A to the architecture to the TargetParser unittests. Differential Revision: https://reviews.llvm.org/D36748 llvm-svn: 311450	2017-08-22 12:46:33 +00:00
Sam Parker	6dc3fcb1c6	[ARM][AArch64] v8.3-A Javascript Conversion Armv8.3-A adds instructions that convert a double-precision floating point number to a signed 32-bit integer with round towards zero, designed for improving Javascript performance. Differential Revision: https://reviews.llvm.org/D36785 llvm-svn: 311448	2017-08-22 11:08:21 +00:00
Renato Golin	c070c73d5e	[ARM] Avoid creating duplicate ANDs in SelectionDAG When expanding a BRCOND into a BR_CC, do not create an AND 1 if one already exists. Review: D36705 Patch by Joel Galenson <jgalenson@google.com> llvm-svn: 311447	2017-08-22 11:02:45 +00:00
Renato Golin	f63d701669	[ARM] Call setBooleanContents(ZeroOrOneBooleanContent) The ARM backend should call setBooleanContents so that it can use known bits to make some optimizations. Review: D35821 Patch by Joel Galenson <jgalenson@google.com> llvm-svn: 311446	2017-08-22 11:02:37 +00:00
Sjoerd Meijer	e0c933f5d6	[SelectionDAG] Add getNode debug messages This adds debug messages to various functions that create new SDValue nodes. This is e.g. useful to have during legalization, as otherwise it can print legalization info of nodes that did not appear in the dumps before. Differential Revision: https://reviews.llvm.org/D36984 llvm-svn: 311444	2017-08-22 10:43:51 +00:00
Sjoerd Meijer	b9de2b4871	[AArch64] Cleanup of HasFullFP16 argument. NFC. This is a clean up of commit r311154; it's not necessary to pass HasFullFP16 as an argument, instead just query the DAG. Differential Revision: https://reviews.llvm.org/D36978 llvm-svn: 311438	2017-08-22 09:21:08 +00:00
Chandler Carruth	b866178067	Fix a typo in r311435. llvm-svn: 311437	2017-08-22 09:20:52 +00:00
Alex Bradbury	080f6976c0	Use report_fatal_error for unsupported calling conventions The calling convention can be specified by the user in IR. Failing to support a particular calling convention isn't a programming error, and so relying on llvm_unreachable to catch and report an unsupported calling convention is not appropriate. Differential Revision: https://reviews.llvm.org/D36830 llvm-svn: 311435	2017-08-22 09:11:41 +00:00
George Rimar	1e94ca115d	[lib/Analysis] - Mark personality functions as live. This is PR33245. The case I am fixing is as follows: Imagine we have 2 BC files; one defines and uses a personality routine, the second has only a declaration and also uses it. Previously the algorithm computing dead symbols (llvm::computeDeadSymbols) did not know about personality routines and left them dead even if the function that has the routine was live. As a result the thinLTOInternalizeAndPromoteGUID() method changed the binding for such a symbol to local. Later when LLD tried to link these objects it failed because one object had an undefined global symbol for the routine and the second object contained a local definition instead of a global one. The patch sets the live root flag on the corresponding FunctionSummary for personality routines when we build the per-module summaries during the compile step. Differential revision: https://reviews.llvm.org/D36834 llvm-svn: 311432	2017-08-22 08:50:56 +00:00
Craig Topper	b49f0893b2	[X86] Prevent several calls to ISD::isConstantSplatVector from returning a narrower APInt than the original scalar type ISD::isConstantSplatVector can shrink to the smallest splat width. But we don't check the size of the resulting APInt at all. This can cause us to misinterpret the results. This patch just adds a flag to prevent the APInt from changing width. Fixes PR34271. Differential Revision: https://reviews.llvm.org/D36996 llvm-svn: 311429	2017-08-22 05:40:17 +00:00
Eric Beckmann	87c6acf38a	Integrate manifest merging library into LLD. Summary: Now that the llvm-mt manifest merging libraries are complete, we may use them to merge manifests instead of needing to shell out to mt.exe. Subscribers: mgorny, llvm-commits Differential Revision: https://reviews.llvm.org/D36255 llvm-svn: 311424	2017-08-22 03:15:28 +00:00
Adrian Prantl	acdc3a7bff	dsymutil: don't copy compile units without children from PCM files rdar://problem/33830532 llvm-svn: 311416	2017-08-22 01:10:48 +00:00
George Karpenkov	748bf121bb	Moving libFuzzer from LLVM to compiler-rt. This change only removes libFuzzer tests and CMake machinery, the source copy temporarily remains at the old location. Differential Revision: https://reviews.llvm.org/D36980 llvm-svn: 311405	2017-08-21 23:25:12 +00:00
Justin Bogner	7d449d31a4	Re-apply "Introduce FuzzMutate library" Same as r311392 with some fixes for library dependencies. Thanks to Chapuni for helping work those out! Original commit message: This introduces the FuzzMutate library, which provides structured fuzzing for LLVM IR, as described in my EuroLLVM 2017 talk. Most of the basic mutators to inject and delete IR are provided, with support for most basic operations. llvm-svn: 311402	2017-08-21 22:57:06 +00:00
Quentin Colombet	4056e80719	[RegAlloc] Make sure live-ranges reflect the state of the IR when removing them When removing a live-range we used to not touch them making debug prints harder to read because the IR was not matching what the live-ranges information was saying. This only affects debug printing and allows to put stronger asserts in the code (see r308906 for instance). llvm-svn: 311401	2017-08-21 22:56:18 +00:00
Craig Topper	7227ebad9c	[ValueTracking] Add assertions that the starting Depth in isKnownToBeAPowerOfTwo and ComputeNumSignBitsImpl is not above MaxDepth The function does an equality check later to terminate the recursion, but that won't work if it starts out too high. A similar assert already exists in computeKnownBits. llvm-svn: 311400	2017-08-21 22:56:12 +00:00
Sanjay Patel	6f527aae0b	[InstCombine] add udiv/urem tests with constant numerator; NFC llvm-svn: 311396	2017-08-21 22:40:02 +00:00
Justin Bogner	6e39755d84	Revert "Re-apply "Introduce FuzzMutate library"" The dependencies for the new library seem to be misconfigured on some linux configs: http://bb.pgr.jp/builders/llvm-i686-linux-RA/builds/5435/steps/build_all/logs/stdio This reverts r311392. llvm-svn: 311393	2017-08-21 22:28:47 +00:00
Justin Bogner	f5c8736482	Re-apply "Introduce FuzzMutate library" Redo r311356 with a fix to avoid std::uniform_int_distribution<bool>. The bool specialization is undefined according to the standard, even though libc++ seems to have it. Original commit message: This introduces the FuzzMutate library, which provides structured fuzzing for LLVM IR, as described in my [EuroLLVM 2017 talk][1]. Most of the basic mutators to inject and delete IR are provided, with support for most basic operations. llvm-svn: 311392	2017-08-21 22:25:04 +00:00
Sanjay Patel	5e3037cfc4	[InstCombine] add more tests for udiv/urem narrowing; NFC We don't currently limit these folds with hasOneUse() or shouldChangeType(). llvm-svn: 311390	2017-08-21 21:57:52 +00:00
Evandro Menezes	bc11ca1a31	[AArch64] Restore the test of conditional branch fusion Restore the functionality of this test that was broken by https://reviews.llvm.org/rL306144. Differential revision: https://reviews.llvm.org/D36807 llvm-svn: 311389	2017-08-21 21:57:43 +00:00
Tim Northover	ef1fc5ae89	GlobalISel (AArch64): fix ABI at border between GPRs and SP. If a struct would end up half in GPRs and half on SP the ABI says it should actually go entirely on the stack. We were getting this wrong in GlobalISel before, causing compatibility issues. llvm-svn: 311388	2017-08-21 21:56:11 +00:00
Steven Wu	010fc49e42	[IR] AutoUpgrade ModuleFlagBehavior for PIC and PIE level Summary: From r303590, ModuleFlagBehavior for PIC and PIE level is changed from Error to Max. This will cause a bitcode compatibility issue when linking against a bitcode static archive built with an old compiler. Add an auto-upgrade path to upgrade the ModuleFlagBehavior in the old bitcode to match the new one so IRLinker can link them. Reviewers: tejohnson, mehdi_amini, dexonsmith Reviewed By: dexonsmith Subscribers: hans, llvm-commits Differential Revision: https://reviews.llvm.org/D36556 llvm-svn: 311387	2017-08-21 21:49:13 +00:00
Craig Topper	775ffcc8f5	[InstCombine] Move the checks for pointer types in getMaskedTypeForICmpPair earlier in the function I don't think there's any reason to have them scattered about and on all 4 operands. We already have an early check that both compares must be the same type. And within a given compare the LHS and RHS must have the same type. Beyond that I don't think there's any way this function returns anything valid for pointer types. So let's just return early and be done with it. Differential Revision: https://reviews.llvm.org/D36561 llvm-svn: 311383	2017-08-21 21:00:45 +00:00
Pirama Arumuga Nainar	3d48bb5fc2	[Support, Windows] Handle long paths with unix separators Summary: The function widenPath() for Windows also normalizes long path names by iterating over the path's components and calling append(). The assumption during the iteration that separators are not returned by the iterator doesn't hold because the iterators do return a separator when the path has a drive name. Handle this case by ignoring separators during iteration. Reviewers: rnk Subscribers: danalbert, srhines Differential Revision: https://reviews.llvm.org/D36752 llvm-svn: 311382	2017-08-21 20:49:44 +00:00
Sanjoy Das	08a38fe71e	Revert "Reapply: [ADCE][Dominators] Teach ADCE to preserve dominators" Summary: This partially reverts commit r311057 since it breaks ADCE. See PR34258. Reviewers: kuhar Subscribers: mcrosier, david2050, llvm-commits Differential Revision: https://reviews.llvm.org/D36979 llvm-svn: 311381	2017-08-21 20:39:18 +00:00
Sam Elliott	6f9a9b5769	[ORE] Remove Old Optimization Remark API Summary: https://bugs.llvm.org/show_bug.cgi?id=33789 Reviewers: anemet Reviewed By: anemet Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D36972 llvm-svn: 311380	2017-08-21 20:30:44 +00:00
Zachary Turner	5641c07d6b	[PDB] Serialize records into a stack-allocated buffer. We were using a std::vector<> and resizing to MaxRecordLength, which is ~64KB. We would then do this repeatedly, often many times in a tight loop, which was causing a measurable performance impact when linking PDBs. Patch by Alex Telishev Differential Revision: https://reviews.llvm.org/D36940 llvm-svn: 311375	2017-08-21 20:17:19 +00:00
George Karpenkov	fb0994b37e	Always compile libFuzzer with no coverage Do not compile libFuzzer itself with coverage, regardless of LLVM variables Differential Revision: https://reviews.llvm.org/D36887 llvm-svn: 311374	2017-08-21 20:12:58 +00:00
Zachary Turner	d76dc2d31e	[lld/pdb] Speed up construction of publics & globals addr map. computeAddrMap function calls std::stable_sort with a comparison function that computes deserialized symbols every time it's called. As a result deserializeAs<PublicSym32> is called 20-30 times per symbol. It's much faster to calculate it beforehand and pass a pointer to it to the comparison function. Patch by Alex Telishev Differential Revision: https://reviews.llvm.org/D36941 llvm-svn: 311373	2017-08-21 20:08:40 +00:00
Haicheng Wu	0812c5bea3	[InlineCost] Add cl::opt to allow full inline cost to be computed for debugging purposes. Currently, the inline cost model will bail once the inline cost exceeds the inline threshold in order to avoid unnecessary compile-time. However, when debugging it is useful to compute the full cost, so this command line option is added to override the default behavior. I took over this work from Chad Rosier (mcrosier@codeaurora.org). Differential Revision: https://reviews.llvm.org/D35850 llvm-svn: 311371	2017-08-21 20:00:09 +00:00
Chad Rosier	4eb18742ca	[InlineCost] Add more debug during inline cost computation. llvm-svn: 311370	2017-08-21 19:56:46 +00:00
Zachary Turner	abc037927b	[BinaryStream] Defaultify copy and move constructors. The various BinaryStream classes had explicit copy constructors which resulted in deleted move constructors. This was causing the internal std::shared_ptr to get copied rather than moved very frequently, since these classes are often used as return values. Patch by Alex Telishev Differential Revision: https://reviews.llvm.org/D36942 llvm-svn: 311368	2017-08-21 19:46:46 +00:00
Sanjay Patel	82ec872990	[LibCallSimplifier] try harder to fold memcmp with constant arguments (2nd try) The 1st try was reverted because it could inf-loop by creating a dead instruction. Fixed that to not happen and added a test case to verify. Original commit message: Try to fold: memcmp(X, C, ConstantLength) == 0 --> load X == *C Without this change, we're unnecessarily checking the alignment of the constant data, so we miss the transform in the first 2 tests in the patch. I noted this shortcoming of LibCallSimplifier in one of the recent CGP memcmp expansion patches. This doesn't help the example in: https://bugs.llvm.org/show_bug.cgi?id=34032#c13 ...directly, but it's worth short-circuiting more of these simple cases since we're already trying to do that. The benefit of transforming to load+cmp is that existing IR analysis/transforms may further simplify that code. For example, if the load of the variable is common to multiple memcmp calls, CSE can remove the duplicate instructions. Differential Revision: https://reviews.llvm.org/D36922 llvm-svn: 311366	2017-08-21 19:13:14 +00:00
Craig Topper	74177e1ed1	[InstCombine] Teach foldSelectICmpAnd to recognize a (icmp slt X, 0) and (icmp sgt X, -1) as equivalent to an and with the sign bit of the truncated type This is similar to what was already done in foldSelectICmpAndOr. Ultimately I'd like to see if we can call foldSelectICmpAnd from foldSelectIntoOp if we detect a power of 2 constant. This would allow us to remove foldSelectICmpAndOr entirely. Differential Revision: https://reviews.llvm.org/D36498 llvm-svn: 311362	2017-08-21 19:02:06 +00:00
Justin Bogner	b5fb3b56d7	Revert "Introduce FuzzMutate library" Looks like this fails to build with libstdc++. This reverts r311356 llvm-svn: 311358	2017-08-21 17:57:12 +00:00
Justin Bogner	0233637085	Introduce FuzzMutate library This introduces the FuzzMutate library, which provides structured fuzzing for LLVM IR, as described in my [EuroLLVM 2017 talk][1]. Most of the basic mutators to inject and delete IR are provided, with support for most basic operations. I will follow up with the instruction selection fuzzer, which is implemented in terms of this library. [1]: http://llvm.org/devmtg/2017-03//2017/02/20/accepted-sessions.html#2 llvm-svn: 311356	2017-08-21 17:44:36 +00:00
Sean Fertile	00393cce3a	[PPC] Refine checks for emitting TOC restore nop and tail-call eligibility. For the medium and large code models we only need to check if a call crosses dso-boundaries when considering tail-call eligibility. Differential Revision: https://reviews.llvm.org/D34245 llvm-svn: 311353	2017-08-21 17:35:32 +00:00
Sam Elliott	e963c89d11	Migrate WholeProgramDevirt to new Optimization Remark API Summary: This is an attempt to move WholeProgramDevirt to the new remark API. https://bugs.llvm.org/show_bug.cgi?id=33793 Reviewers: anemet Reviewed By: anemet Subscribers: fhahn, llvm-commits Differential Revision: https://reviews.llvm.org/D36943 llvm-svn: 311352	2017-08-21 16:57:21 +00:00
Davide Italiano	5a2530da05	[APFloat] Fix IsInteger() for DoubleAPFloat. Previously, we would just assert instead. Differential Revision: https://reviews.llvm.org/D36961 llvm-svn: 311351	2017-08-21 16:51:54 +00:00
Sanjay Patel	cf081f9a30	[InstCombine] add tests for memcmp with constant; NFC This is the baseline (current) version of the tests that would have been added with the transform in r311333 (reverted at r311340 due to inf-looping). Adding these now to aid in testing and minimize the patch if/when it is reinstated. llvm-svn: 311350	2017-08-21 16:47:12 +00:00
Sam Elliott	e604b563ea	Emit only A Single Opt Remark When Inlining Summary: This updates the Inliner to only add a single Optimization Remark when Inlining, rather than an Analysis Remark and an Optimization Remark. Fixes https://bugs.llvm.org/show_bug.cgi?id=33786 Reviewers: anemet, davidxl, chandlerc Reviewed By: anemet Subscribers: haicheng, fhahn, mehdi_amini, dblaikie, llvm-commits, eraman Differential Revision: https://reviews.llvm.org/D36054 llvm-svn: 311349	2017-08-21 16:45:47 +00:00
Craig Topper	cc255bcd77	[InstCombine] Fix a weakness in canEvaluateZExtd around 'and' instructions Summary: If the bitsToClear from the LHS of an 'and' comes back non-zero, but all of those bits are known zero on the RHS, we can reset bitsToClear. Without this, the 'or' in the modified test case blocks the transform because it has non-zero bits in its RHS in those bits. Reviewers: spatel, majnemer, davide Reviewed By: davide Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D36944 llvm-svn: 311343	2017-08-21 16:04:11 +00:00
Craig Topper	8078dd2984	[X86] When selecting sse_load_f32/f64 pattern, make sure there's only one use of every node all the way back to the root of the match Summary: With masked operations, it's possible for the operation node like fadd, fsub, etc. to be used by multiple different vselects. Since the pattern matching will start at the vselect, we need to make sure the operation node itself is only used once before we can fold a load. Otherwise we'll end up folding the same load into multiple instructions. Reviewers: RKSimon, spatel, zvi, igorb Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D36938 llvm-svn: 311342	2017-08-21 16:04:04 +00:00
Xinliang David Li	d2838fc4b9	Revert 311208, 311209 llvm-svn: 311341	2017-08-21 16:00:38 +00:00
Sanjay Patel	707f786cc5	revert r311333: [LibCallSimplifier] try harder to fold memcmp with constant arguments We're getting lots of compile-timeout bot failures like: http://lab.llvm.org:8011/builders/clang-native-arm-lnt/builds/7119 http://lab.llvm.org:8011/builders/clang-cmake-x86_64-avx2-linux llvm-svn: 311340	2017-08-21 15:16:25 +00:00
Sanjay Patel	0707434ce8	[InstCombine] add vector tests; NFC llvm-svn: 311339	2017-08-21 15:11:39 +00:00
Zachary Turner	d1de2f4f5e	[llvm-pdbutil] Add support for dumping detailed module stats. This adds support for dumping a summary of module symbols and CodeView debug chunks. This option prints a table for each module of all of the symbols that occurred in the module and the number of times it occurred and total byte size. Then at the end it prints the totals for the entire file. Additionally, this patch adds the -jmc (just my code) option, which suppresses modules which are from external libraries or linker imports, so that you can focus only on the object files and libraries that originate from your own source code. llvm-svn: 311338	2017-08-21 14:53:25 +00:00
Sanjay Patel	48c67c9965	[InstCombine] regenerate test checks; NFC llvm-svn: 311337	2017-08-21 14:34:06 +00:00
Sanjay Patel	7756edfa93	[LibCallSimplifier] try harder to fold memcmp with constant arguments Try to fold: memcmp(X, C, ConstantLength) == 0 --> load X == *C Without this change, we're unnecessarily checking the alignment of the constant data, so we miss the transform in the first 2 tests in the patch. I noted this shortcoming of LibCallSimplifier in one of the recent CGP memcmp expansion patches. This doesn't help the example in: https://bugs.llvm.org/show_bug.cgi?id=34032#c13 ...directly, but it's worth short-circuiting more of these simple cases since we're already trying to do that. The benefit of transforming to load+cmp is that existing IR analysis/transforms may further simplify that code. For example, if the load of the variable is common to multiple memcmp calls, CSE can remove the duplicate instructions. Differential Revision: https://reviews.llvm.org/D36922 llvm-svn: 311333	2017-08-21 13:55:49 +00:00
Stefan Pintilie	9495f33e45	[PowerPC] Check if the pre-increment PHI Node already exists Preparations to use the pre-increment are sometimes done in the target independent pass Loop Strength Reduction. We try to detect them in the PowerPC specific pass so that they are not done twice and so that we do not add PHIs that are not required. Differential Revision: https://reviews.llvm.org/D36736 llvm-svn: 311332	2017-08-21 13:36:18 +00:00
Igor Breger	685889cf9b	[GlobalISel][X86] Support G_BRCOND operation. Summary: Support G_BRCOND operation. For now don't try to fold cmp/trunc instructions. Reviewers: zvi, guyblank Reviewed By: guyblank Subscribers: rovka, llvm-commits, kristof.beyls Differential Revision: https://reviews.llvm.org/D34754 llvm-svn: 311327	2017-08-21 10:51:54 +00:00
Oliver Stannard	9bd18aa7d8	[AsmParser] Recommit: Hash is not a comment on some targets Re-committing after r311325 fixed an unintentional use of '#' comments in clang. The '#' token is not a comment for all targets (on ARM and AArch64 it marks an immediate operand), so we shouldn't treat it as such. Comments are already converted to AsmToken::EndOfStatement by AsmLexer::LexLineComment, so this check was unnecessary. Differential Revision: https://reviews.llvm.org/D36405 llvm-svn: 311326	2017-08-21 09:58:37 +00:00
Igor Breger	03c2208d5f	[GlobalISel][X86] InstructionSelector, for now use fallback path for LOAD_STACK_GUARD and PHI nodes. llvm-svn: 311323	2017-08-21 09:17:28 +00:00
Igor Breger	1b5e3d3e28	[GlobalISel][X86] LowerCall, for now don't handle ByValue function arguments. llvm-svn: 311321	2017-08-21 08:59:59 +00:00
Michael Zuckerman	bdb6673151	[InterLeaved] Adding lit test for future work interleaved load stride 3 llvm-svn: 311320	2017-08-21 08:56:39 +00:00
Chandler Carruth	98c51cbee1	[x86] Teach the "generic" x86 CPU to avoid patterns that are slow on widely used processors. This occurred to me when I saw that we were generating 'inc' and 'dec' when for Haswell and newer we shouldn't. However, there were a few "X is slow" things that we should probably just set. I've avoided any of the "X is fast" features because most of those would be pretty serious regressions on processors where X isn't actually fast. The slow things are likely to be negligible costs on processors where these aren't slow and a significant win when they are slow. In retrospect this seems somewhat obvious. Not sure why we didn't do this a long time ago. Differential Revision: https://reviews.llvm.org/D36947 llvm-svn: 311318	2017-08-21 08:45:22 +00:00
Chandler Carruth	63dd5e0ef6	[x86] Handle more cases where we can re-use an atomic operation's flags rather than doing a separate comparison. This both saves an explicit comparison and avoids the use of `xadd` which introduces register constraints and other challenges to the generated code. The motivating case is from atomic reference counts where `1` is the sentinel rather than `0` for whatever reason. This can and should be lowered efficiently on x86 by just using a different flag, however the x86 code only handled the `0` case. There remain some further opportunities here that are currently hidden due to canonicalization. I've included test cases that show these and FIXMEs. However, I don't at the moment have any production use cases and they seem substantially harder to address. Differential Revision: https://reviews.llvm.org/D36945 llvm-svn: 311317	2017-08-21 08:45:19 +00:00
Sam Parker	b252ffd2cc	[ARM][AArch64] Cortex-A75 and Cortex-A55 support This patch introduces support for Cortex-A75 and Cortex-A55, Arm's latest big.LITTLE A-class cores. They implement the ARMv8.2-A architecture, including the cryptography and RAS extensions, plus the optional dot product extension. They also implement the RCpc AArch64 extension from ARMv8.3-A. Cortex-A75: https://developer.arm.com/products/processors/cortex-a/cortex-a75 Cortex-A55: https://developer.arm.com/products/processors/cortex-a/cortex-a55 Differential Revision: https://reviews.llvm.org/D36667 llvm-svn: 311316	2017-08-21 08:43:06 +00:00
George Rimar	d7305ef06c	[Support/Parallel] - Do not use a task group for a very small task. parallel_for_each_n splits a given task into small pieces of tasks and then passes them to background threads managed by a thread pool to process them in parallel. TaskGroup then waits for all tasks to be done, which is done by TaskGroup's destructor. In the previous code, all tasks were passed to background threads, and the main thread just waited for them to finish their jobs. This patch changes the logic so that the main thread processes a task just like other worker threads instead of just waiting for workers. This patch improves the performance of parallel_for_each_n for a task that is so small that we do not split it into multiple tasks. Previously, such a task was submitted to another thread and the main thread waited for its completion. That involves multiple inter-thread synchronizations, which are not cheap for small tasks. Now, such a task is processed by the main thread, so no inter-thread communication is necessary. Differential revision: https://reviews.llvm.org/D36607 llvm-svn: 311312	2017-08-21 08:00:54 +00:00
Coby Tayree	c54c5cbe67	[X86] Allow xacquire/xrelease prefixes Allow those prefixes on assembly code Differential Revision: https://reviews.llvm.org/D36845 llvm-svn: 311309	2017-08-21 07:50:15 +00:00
Craig Topper	d6f4be97e6	[AVX-512] Don't change which instructions we use for unmasked subvector broadcasts when AVX512DQ is enabled. There's no functional difference between the AVX512DQ instructions if we're not masking. This change unifies test checks and removes extra isel entries. Similar was done for subvector insert and extracts recently. llvm-svn: 311308	2017-08-21 05:29:02 +00:00
Craig Topper	485cca1ecb	[AVX512] Add 128->256 vbroadcastf64x2/vbroadcasti64x2 instructions to the EVEX->VEX table. llvm-svn: 311307	2017-08-21 05:03:28 +00:00
Dean Michael Berris	c5caf3e9c6	[XRay][tools] Support new kinds of instrumentation map entries Summary: When extracting the instrumentation map from a binary, we should be able to recognize the new kinds of instrumentation sleds we've been emitting with the compiler using -fxray-instrument. This change adds a test for all the kinds of sleds we currently support (sans the tail-call sled, which is a bit harder to force in a simple prebuilt input). Reviewers: kpw, dblaikie Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D36819 llvm-svn: 311305	2017-08-21 00:14:06 +00:00
Chandler Carruth	bd6dc14230	Revert r311077: [LV] Using VPlan ... This causes LLVM to assert fail on PPC64 and crash / infloop in other cases. Filed http://llvm.org/PR34248 with reproducer attached. llvm-svn: 311304	2017-08-20 23:17:11 +00:00
Craig Topper	a152903c1b	[InstCombine] Add a test case for a weakness in canEvaluateZExtd. NFC llvm-svn: 311303	2017-08-20 21:38:28 +00:00
Craig Topper	d63b33f9c4	[AVX512] Add a test to check what happens when a load is referenced by two different masked scalar intrinsics with the same op inputs, but different masking node. We're missing some single use checks in the sse_load_f32/f64 handling that cause us to replicate the load. llvm-svn: 311300	2017-08-20 19:47:00 +00:00
Kuba Mracek	6734671dda	Fix archive-update.test after r311296. llvm-svn: 311299	2017-08-20 18:31:30 +00:00
Craig Topper	702097dafc	[AVX-512] Use a scalar load pattern for FPCLASSSS/FPCLASSSD patterns. llvm-svn: 311297	2017-08-20 18:30:24 +00:00
Kuba Mracek	2c0bca49b1	Remove uses of "%T" from test/Object/archive-* tests. llvm-svn: 311296	2017-08-20 18:18:44 +00:00
Benjamin Kramer	806ae44012	[NVPTX] Reduce copypasta. No functionality change intended. llvm-svn: 311295	2017-08-20 17:30:32 +00:00
Kuba Mracek	d3f3fae32d	Get rid of even more "%T" expansions, see <https://reviews.llvm.org/D35396>. llvm-svn: 311294	2017-08-20 17:05:22 +00:00
Kuba Mracek	5c393d2565	Get rid of some more "%T" expansions, see <https://reviews.llvm.org/D35396>. llvm-svn: 311293	2017-08-20 17:00:08 +00:00
Benjamin Kramer	760e00b0bc	[MachO] Use Twines more efficiently. llvm-svn: 311291	2017-08-20 15:13:39 +00:00
Benjamin Kramer	2e5be849cc	[Mem2Reg] Modernize code a bit. No functionality change intended. llvm-svn: 311290	2017-08-20 14:34:44 +00:00
Benjamin Kramer	49a49fe816	Move helper classes into anonymous namespaces. No functionality change intended. llvm-svn: 311288	2017-08-20 13:03:48 +00:00
Benjamin Kramer	df8c2628ac	[dlltool] Make memory buffer ownership less weird. There's no reason to destroy them in a global destructor. llvm-svn: 311287	2017-08-20 13:03:32 +00:00
Elena Demikhovsky	f58f838495	Changed basic cost of store operation on X86 Store operation takes 2 UOps on X86 processors. The exact cost calculation affects several optimization passes including loop unrolling. This change compensates for the performance degradation caused by https://reviews.llvm.org/D34458 and shows improvements on some benchmarks. Differential Revision: https://reviews.llvm.org/D35888 llvm-svn: 311285	2017-08-20 12:34:29 +00:00
Aditya Kumar	a525fffd07	[Loop Vectorize] Added a separate metadata Added a separate metadata to indicate when the loop has already been vectorized instead of setting width and count to 1. Patch written by Divya Shanmughan and Aditya Kumar Differential Revision: https://reviews.llvm.org/D36220 llvm-svn: 311281	2017-08-20 10:32:41 +00:00
Igor Breger	88a3d5c855	[GlobalISel][X86] Support call ABI. Summary: Support call ABI. For now only Linux C and X86_64_SysV calling conventions supported. Variadic function not supported. Reviewers: zvi, guyblank, oren_ben_simhon Reviewed By: oren_ben_simhon Subscribers: rovka, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D34602 llvm-svn: 311279	2017-08-20 09:25:22 +00:00
Igor Breger	b3a860a5e8	[GlobalISel][X86] Support asymmetric copy from/to GPR physical register. Usually this case is generated by ABI lowering; it requires performing a truncate/anyext. llvm-svn: 311278	2017-08-20 07:14:40 +00:00

153521 Commits