llvm-project

Commit Graph

Author	SHA1	Message	Date
Serge Rogatch	fa846a116e	[XRay][AArch64] More staging for tail call support in XRay AArch64 - in compiler-rt Summary: This patch provides a trampoline for function tail exit tracing. Still, it's staging because code `1` is passed to the handler function (indicating a normal exit) instead of `2`, which would indicate tail exit. This is so until the logging part of XRay supports tail exits too. Related: https://reviews.llvm.org/D28947 (LLVM) Reviewers: dberris, rengolin Reviewed By: rengolin Subscribers: aemerson, llvm-commits, iid_iunknown Differential Revision: https://reviews.llvm.org/D28948 llvm-svn: 293082	2017-01-25 20:27:19 +00:00
Matt Arsenault	5d9101941f	AMDGPU: Set call_convention bit in kernel_code_t According to the documentation this is supposed to be -1 if indirect calls are not supported. llvm-svn: 293081	2017-01-25 20:21:57 +00:00
Serge Rogatch	bc2d34394d	[XRay][AArch64] More staging for tail call support in XRay on AArch64 - in LLVM Summary: This patch prepares more for tail call support in XRay. Until the logging part supports tail calls, this is just staging, so it seems LLVM part is mostly ready with this patch. Related: https://reviews.llvm.org/D28948 (compiler-rt) Reviewers: dberris, rengolin Reviewed By: dberris Subscribers: llvm-commits, iid_iunknown, aemerson Differential Revision: https://reviews.llvm.org/D28947 llvm-svn: 293080	2017-01-25 20:21:49 +00:00
Marshall Clow	071aded6ee	Fixed a typo in the synopsis (noecept -> noexcept). Thanks to Kim for the catch llvm-svn: 293079	2017-01-25 20:14:03 +00:00
Michal Gorny	30dc91a707	[cmake] Fix -rpath-link in stand-alone build Set LLVM_LIBRARY_OUTPUT_INTDIR as expected by llvm_setup_rpath() macro when doing stand-alone builds. This is required to pass correct -rpath-link when linking shared libraries, and therefore ensure that the linker can find dependency libraries correctly during the build. Differential Revision: https://reviews.llvm.org/D29099 llvm-svn: 293078	2017-01-25 19:33:14 +00:00
Krzysztof Parzyszek	ee9aa3ffee	Add iterator_range<regclass_iterator> to {Target,MC}RegisterInfo, NFC llvm-svn: 293077	2017-01-25 19:29:04 +00:00
Alexey Bataev	1da8ba2adc	[SLP] Extra test for functionality with extra args. llvm-svn: 293076	2017-01-25 17:24:31 +00:00
Chad Rosier	4f724dce42	Revert "Do not verify dominator tree if it has no roots" This reverts commit r293033, per Danny's comment. In short, we require domtrees to have roots at all times. llvm-svn: 293075	2017-01-25 17:15:48 +00:00
Matthias Braun	aeb8e33968	PowerPC: Slight cleanup of getReservedRegs(); NFC Change getReservedRegs() to not mark a register as reserved and then revert that decision in some cases. Motivated by the discussion in https://reviews.llvm.org/D29056 llvm-svn: 293073	2017-01-25 17:12:10 +00:00
Mehdi Amini	fd7165364b	[libcxx] Mentions "targeting C++11 and above" instead of "targeting C++11" in the doc llvm-svn: 293071	2017-01-25 17:00:30 +00:00
Arpith Chacko Jacob	2cd6eeabfd	[OpenMP] Support for the proc_bind-clause on 'target parallel' on the NVPTX device. This patch adds support for the proc_bind clause on the Spmd construct 'target parallel' on the NVPTX device. Since the parallel region is created upon kernel launch, this clause can be safely ignored on the NVPTX device at codegen time for level 0 parallelism. Reviewers: ABataev Differential Revision: https://reviews.llvm.org/D29128 llvm-svn: 293069	2017-01-25 16:55:10 +00:00
Kostya Kortchinsky	198f864c07	[scudo] Enabling AArch64 support for Scudo Summary: Adding ARM64 as a supported architecture for Scudo. The random shuffle is not yet supported for SizeClassAllocator32, which is used by the AArch64 allocator, so disable the associated test for now. Reviewers: kcc, alekseyshl, rengolin Reviewed By: rengolin Subscribers: aemerson, mgorny, llvm-commits Differential Revision: https://reviews.llvm.org/D28960 llvm-svn: 293068	2017-01-25 16:35:18 +00:00
Krzysztof Parzyszek	0fd6296b82	Add loop pass insertion point EP_LateLoopOptimizations Differential Revision: https://reviews.llvm.org/D28694 llvm-svn: 293067	2017-01-25 16:12:25 +00:00
Bill Seurer	710e821ddc	[powerpc] deactivate ThreadedOneSizeMallocStressTest asan test on powerpc64 This has not reliably worked on powerpc since r279664. Re-enable this once the problem is tracked down and fixed. llvm-svn: 293066	2017-01-25 16:03:27 +00:00
Nico Weber	b1706ca107	Clarify how to forward-declare __llvm_profile symbols. llvm-svn: 293065	2017-01-25 16:01:32 +00:00
Artur Pilipenko	8fb3d57e67	[Guards] Introduce loop-predication pass This patch introduces guard based loop predication optimization. The new LoopPredication pass tries to convert loop variant range checks to loop invariant by widening checks across loop iterations. For example, it will convert for (i = 0; i < n; i++) { guard(i < len); ... } to for (i = 0; i < n; i++) { guard(n - 1 < len); ... } After this transformation the condition of the guard is loop invariant, so loop-unswitch can later unswitch the loop by this condition which basically predicates the loop by the widened condition: if (n - 1 < len) for (i = 0; i < n; i++) { ... } else deoptimize This patch relies on an NFC change to make ScalarEvolution::isMonotonicPredicate public (revision 293062). Reviewed By: sanjoy Differential Revision: https://reviews.llvm.org/D29034 llvm-svn: 293064	2017-01-25 16:00:44 +00:00
Chad Rosier	072e70b365	[AArch64] Minor code refactoring. NFC. llvm-svn: 293063	2017-01-25 15:56:59 +00:00
Artur Pilipenko	5eade5cba8	NFC. Make ScalarEvolution::isMonotonicPredicate public Will be used by the upcoming LoopPredication optimization. llvm-svn: 293062	2017-01-25 15:07:55 +00:00
Artur Pilipenko	b85f7a5d99	[InstCombine] Canonicalize guards for NOT OR condition This is a partial fix for Bug 31520 - [guards] canonicalize guards in instcombine Reviewed By: apilipenko Differential Revision: https://reviews.llvm.org/D29075 Patch by Maxim Kazantsev. llvm-svn: 293061	2017-01-25 14:45:12 +00:00
Simon Pilgrim	6f6b279109	[InstCombine][SSE] Add support for PACKSS/PACKUS constant folding Differential Revision: https://reviews.llvm.org/D28949 llvm-svn: 293060	2017-01-25 14:37:24 +00:00
Martin Bohme	8396e14e7f	[ARM] GlobalISel: Fix stack-use-after-scope bug. Summary: Lifetime extension wasn't triggered on the result of BuildMI because the reference was non-const. However, instead of adding a const, I've removed the reference entirely as RVO should kick in anyway. Reviewers: rovka, bkramer Reviewed By: bkramer Subscribers: aemerson, rengolin, dberris, llvm-commits, kristof.beyls Differential Revision: https://reviews.llvm.org/D29124 llvm-svn: 293059	2017-01-25 14:28:19 +00:00
Artur Pilipenko	4df4c4a4aa	[InstCombine] Canonicalize guards for AND condition This is a partial fix for Bug 31520 - [guards] canonicalize guards in instcombine Reviewed By: apilipenko Differential Revision: https://reviews.llvm.org/D29074 Patch by Maxim Kazantsev. llvm-svn: 293058	2017-01-25 14:20:52 +00:00
Krzysztof Parzyszek	520e51d951	[compiler-rt] Fix xray compilation errors: errno and size_t Include errno.h, and use size_t instead of std::size_t, since stddef.h was included (and not cstddef). llvm-svn: 293057	2017-01-25 14:20:30 +00:00
Artur Pilipenko	e812ca00bb	[InstCombine] Allow InstrCombine to remove one of adjacent guards if they are equivalent This is a partial fix for Bug 31520 - [guards] canonicalize guards in instcombine Reviewed By: majnemer, apilipenko Differential Revision: https://reviews.llvm.org/D29071 Patch by Maxim Kazantsev. llvm-svn: 293056	2017-01-25 14:12:12 +00:00
Krasimir Georgiev	91834227a3	[clang-format] Implement comment reflowing. Summary: This presents a version of the comment reflowing with less mutable state inside the comment breakable token subclasses. The state has been pushed into the driving breakProtrudingToken method. For this, the API of BreakableToken is enriched by the methods getSplitBefore and getLineLengthAfterSplitBefore. Reviewers: klimek Reviewed By: klimek Subscribers: djasper, klimek, mgorny, cfe-commits, ioeric Differential Revision: https://reviews.llvm.org/D28764 llvm-svn: 293055	2017-01-25 13:58:58 +00:00
George Rimar	f242ffa095	[ELF] - Implemented support for R_386_PC8/R_386_8 relocations. These relocations are used in linux kernel. Differential revision: https://reviews.llvm.org/D28094 llvm-svn: 293054	2017-01-25 13:36:49 +00:00
Michal Gorny	a56833b0e7	[test] Add HAVE_LIBZ to canonicalized booleans Canonicalize HAVE_LIBZ as well to fix buildbot failures. llvm-svn: 293053	2017-01-25 13:31:53 +00:00
Michal Gorny	638ac70a92	[test] Port clang tests to canonicalized booleans Use the new llvm_canonicalize_cmake_booleans() function to canonicalize booleans for lit tests. Replace the duplicate ENABLE_CLANG* variables used to hold canonicalized values with in-place canonicalization. Use implicit logic in Python code to avoid overrelying on exact 0/1 values. Differential Revision: https://reviews.llvm.org/D28529 llvm-svn: 293052	2017-01-25 13:11:45 +00:00
Martin Bohme	ac93aa7022	[Driver] Prevent no-arc-exception-silence.m test from writing output. Summary: This enables the test to run on systems where output cannot be written. Reviewers: compnerd Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D29123 llvm-svn: 293051	2017-01-25 12:55:53 +00:00
Anastasia Stulova	d1f390ef99	[OpenCL] Diagnose write_only image3d when extension is disabled Prior to OpenCL 2.0, image3d_t can only be used with the write_only access qualifier when the cl_khr_3d_image_writes extension is enabled, see e.g. OpenCL 1.1 s6.8b. Require the extension for write_only image3d_t types and guard uses of write_only image3d_t in the OpenCL header. Patch by Sven van Haastregt! Review: https://reviews.llvm.org/D28860 llvm-svn: 293050	2017-01-25 12:18:50 +00:00
Arpith Chacko Jacob	7ecc0b7f3d	[OpenMP] Support for thread_limit-clause on the 'target teams' directive. The thread_limit-clause on the combined directive applies to the 'teams' region of this construct. We modify the ThreadLimitClause class to capture the clause expression within the 'target' region. Reviewers: ABataev Differential Revision: https://reviews.llvm.org/D29087 llvm-svn: 293049	2017-01-25 11:44:35 +00:00
Arpith Chacko Jacob	bc126344e1	[OpenMP] Support for num_teams-clause on the 'target teams' directive. The num_teams-clause on the combined directive applies to the 'teams' region of this construct. We modify the NumTeamsClause class to capture the clause expression within the 'target' region. Reviewers: ABataev Differential Revision: https://reviews.llvm.org/D29085 llvm-svn: 293048	2017-01-25 11:28:18 +00:00
Pavel Labath	46897a46ee	include Host/Time.h in Cocoa.cpp Time.h contains the necessary magic to enable timegm on all android targets. llvm-svn: 293047	2017-01-25 11:19:49 +00:00
Pavel Labath	8abd34f015	NPL: Compartmentalize arm64 single step workaround better The main motivation for me doing this is being able to build an arm android lldb-server against api level 9 headers, but it seems like a good cleanup nonetheless. The entirety of the cpu_set_t dance now resides in SingleStepCheck.cpp, which is only built on arm64. llvm-svn: 293046	2017-01-25 11:19:45 +00:00
Pavel Labath	2d0c5b0297	Replace chdir() usage with the llvm equivalent. This removes a hack in PosixApi.h, which tends to produce strange compile errors when it's included in the wrong order. llvm-svn: 293045	2017-01-25 11:10:52 +00:00
Peter Smith	9694376a93	[ELF] Add local mapping symbols to ARM PLT entries Mapping symbols allow a mapping symbol aware disassembler to correctly disassemble the PLT when the code immediately prior to the PLT is Thumb. To implement this we add a function to add symbols with local binding to be defined in SyntheticSymbols. Differential Revision: https://reviews.llvm.org/D28956 llvm-svn: 293044	2017-01-25 10:31:16 +00:00
Artem Dergachev	55705955ce	[analyzer] Fix MacOSXAPIChecker fp with static locals seen from nested blocks. This is an attempt to avoid new false positives caused by the reverted r292800, however the scope of the fix is significantly reduced - some variables are still in incorrect memory spaces. Relevant test cases added. rdar://problem/30105546 rdar://problem/30156693 Differential revision: https://reviews.llvm.org/D28946 llvm-svn: 293043	2017-01-25 10:21:45 +00:00
Alexey Bataev	d28ab559a7	[SLP] Improve horizontal vectorization for non-power-of-2 number of instructions. If number of instructions in horizontal reduction list is not power of 2 then only PowerOf2Floor(NumberOfInstructions) last elements are actually vectorized, other instructions remain scalar. Patch tries to vectorize the remaining elements either. Differential Revision: https://reviews.llvm.org/D28959 llvm-svn: 293042	2017-01-25 09:54:38 +00:00
whitequark	16f1e5f1ca	Mark @llvm.powi.* as safe to speculatively execute. Floating point intrinsics in LLVM are generally not speculatively executed, since most of them are defined to behave the same as libm functions, which set errno. However, the @llvm.powi.* intrinsics do not correspond to any libm function, and lacks any defined error handling semantics in LangRef. It most certainly does not alter errno. llvm-svn: 293041	2017-01-25 09:32:30 +00:00
Mohammed Agabaria	20caee95e1	[X86] enable memory interleaving for X86\SLM arch. Differential Revision: https://reviews.llvm.org/D28547 llvm-svn: 293040	2017-01-25 09:14:48 +00:00
Artur Pilipenko	bc93452420	Fix buildbot failures introduced by 293036 Fix unused variable, specify types explicitly to make VC compiler happy. llvm-svn: 293039	2017-01-25 09:10:07 +00:00
Artur Pilipenko	41c0005aa3	[DAGCombiner] Match load by bytes idiom and fold it into a single load. Attempt #2 . The previous patch (https://reviews.llvm.org/rL289538) got reverted because of a bug. Chandler also requested some changes to the algorithm. http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20161212/413479.html This is an updated patch. The key difference is that collectBitProviders (renamed to calculateByteProvider) now collects the origin of one byte, not the whole value. It simplifies the implementation and allows to stop the traversal earlier if we know that the result won't be used. From the original commit: Match a pattern where a wide type scalar value is loaded by several narrow loads and combined by shifts and ors. Fold it into a single load or a load and a bswap if the targets supports it. Assuming little endian target: i8 a = ... i32 val = a[0] \| (a[1] << 8) \| (a[2] << 16) \| (a[3] << 24) => i32 val = ((i32)a) i8 a = ... i32 val = (a[0] << 24) \| (a[1] << 16) \| (a[2] << 8) \| a[3] => i32 val = BSWAP(((i32)a)) This optimization was discussed on llvm-dev some time ago in "Load combine pass" thread. We came to the conclusion that we want to do this transformation late in the pipeline because in presence of atomic loads load widening is irreversible transformation and it might hinder other optimizations. Eventually we'd like to support folding patterns like this where the offset has a variable and a constant part: i32 val = a[i] \| (a[i + 1] << 8) \| (a[i + 2] << 16) \| (a[i + 3] << 24) Matching the pattern above is easier at SelectionDAG level since address reassociation has already happened and the fact that the loads are adjacent is clear. Understanding that these loads are adjacent at IR level would have involved looking through geps/zexts/adds while looking at the addresses. The general scheme is to match OR expressions by recursively calculating the origin of individual bytes which constitute the resulting OR value. If all the OR bytes come from memory verify that they are adjacent and match with little or big endian encoding of a wider value. If so and the load of the wider type (and bswap if needed) is allowed by the target generate a load and a bswap if needed. Reviewed By: RKSimon, filcab, chandlerc Differential Revision: https://reviews.llvm.org/D27861 llvm-svn: 293036	2017-01-25 08:53:31 +00:00
Diana Picus	d83df5d372	[ARM] GlobalISel: Support i1 add and ABI extensions Add support for: * i1 add * i1 function arguments, if passed through registers * i1 returns, with ABI signext/zeroext Differential Revision: https://reviews.llvm.org/D27706 llvm-svn: 293035	2017-01-25 08:47:40 +00:00
Diana Picus	8b6c6bedcb	[ARM] GlobalISel: Support i8/i16 ABI extensions At the moment, this means supporting the signext/zeroext attribute on the return type of the function. For function arguments, signext/zeroext should be handled by the caller, so there's nothing for us to do until we start lowering calls. Note that this does not include support for other extensions (i8 to i16), those will be added later. Differential Revision: https://reviews.llvm.org/D27705 llvm-svn: 293034	2017-01-25 08:10:40 +00:00
Serge Pavlov	43a7759f4b	Do not verify dominator tree if it has no roots If dominator tree has no roots, the pass that calculates it is likely to be skipped. It occures, for instance, in the case of entities with linkage available_externally. Do not run tree verification in such case. Differential Revision: https://reviews.llvm.org/D28767 llvm-svn: 293033	2017-01-25 07:58:10 +00:00
Diana Picus	ac03b4b924	Revert "Use filename in linemarker when compiling preprocessed source" This reverts commit r293004 because it broke the buildbots with "unknown CPU" errors. I tried to fix it in r293026, but that broke on Green Dragon with this kind of error: error: expected string not found in input // CHECK: l{{ +}}df{{ +}}ABS{{ +}}{{0+}}{{.+}}preprocessed-input.c{{$}} ^ <stdin>:2:1: note: scanning from here /Users/buildslave/jenkins/sharedspace/incremental@2/clang-build/tools/clang/test/Frontend/Output/preprocessed-input.c.tmp.o: file format Mach-O 64-bit x86-64 ^ <stdin>:2:67: note: possible intended match here /Users/buildslave/jenkins/sharedspace/incremental@2/clang-build/tools/clang/test/Frontend/Output/preprocessed-input.c.tmp.o: file format Mach-O 64-bit x86-64 I suppose this means that llvm-objdump doesn't support Mach-O, so the test should indeed check for linux (but not for x86). I'll leave it to someone that knows better. llvm-svn: 293032	2017-01-25 07:27:05 +00:00
Dean Michael Berris	d09bf194fa	Implemented color coding and Vertex labels in XRay Graph Summary: A patch to enable the llvm-xray graph subcommand to color edges and vertices based on statistics and to annotate vertices with statistics. Depends on D27243 Reviewers: dblaikie, dberris Reviewed By: dberris Subscribers: mgorny, llvm-commits Differential Revision: https://reviews.llvm.org/D28225 llvm-svn: 293031	2017-01-25 07:14:43 +00:00
Coby Tayree	77807d93af	[X86]Enable the use of 'mov' with a 64bit GPR and a large immediate Enable the next form (intel style): "mov <reg64>, <largeImm>" which is should be available, where <largeImm> stands for immediates which exceed the range of a singed 32bit integer Differential Revision: https://reviews.llvm.org/D28988 llvm-svn: 293030	2017-01-25 07:09:42 +00:00
Diana Picus	1d8eaf4387	[ARM] GlobalISel: Bail out on Thumb. NFC Thumb is not supported yet, so bail out early. llvm-svn: 293029	2017-01-25 07:08:53 +00:00
Matt Arsenault	74a576e7d3	AMDGPU: Check nsz instead of unsafe math llvm-svn: 293028	2017-01-25 06:27:02 +00:00

... 4 5 6 7 8 ...

253170 Commits All Branches Search

253170 Commits

All Branches