llvm-project

Commit Graph

Author	SHA1	Message	Date
Simon Pilgrim	29a5a6eed0	Fix uninitialized variable warning. NFCI.	2019-11-13 14:40:21 +00:00
Simon Pilgrim	6ebc5089b2	Fix uninitialized variable warning. NFCI.	2019-11-13 14:40:20 +00:00
Simon Pilgrim	b3be859baa	Sparc - fix uninitialized variable warnings. NFCI.	2019-11-13 14:40:20 +00:00
Simon Pilgrim	66f2ed0746	PPCReduceCRLogicals - fix static analyzer warnings. NFC - Fix uninitialized variable warnings. - Fix null dereference warnings.	2019-11-13 14:40:20 +00:00
Simon Pilgrim	d1bd5e476b	SLPVectorizer - make comparison operators + isInSchedulingRegion const Fixes cppcheck warnings.	2019-11-13 14:40:19 +00:00
Kadir Cetinkaya	16bdcc809c	[clang][Tooling] Filter flags that generate output in SyntaxOnlyAdjuster Summary: Flags that generate output could result in failures when creating syntax only actions. This patch introduces initial logic for filtering out those. The first such flag is "save-temps", which saves intermediate files(bitcode, assembly, etc.) into a specified directory. Fixes https://github.com/clangd/clangd/issues/191 Reviewers: hokein Subscribers: ilya-biryukov, usaxena95, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D70173	2019-11-13 15:03:30 +01:00
Haojian Wu	33e882d5ad	[clangd] Add bool return type to Index::refs API. Summary: Similar to fuzzyFind, the bool indicates whether there are more xref results. Reviewers: ilya-biryukov Reviewed By: ilya-biryukov Subscribers: merge_guards_bot, MaskRay, jkorous, arphaman, kadircet, usaxena95, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D70139	2019-11-13 14:42:30 +01:00
Florian Hahn	f7499011ca	[InstCombine] Avoid moving ops that do restrict undef across shuffles. I think we have to be a bit more careful when it comes to moving ops across shuffles, if the op does restrict undef. For example, without this patch, we would move 'and %v, <0, 0, -1, -1>' over a 'shufflevector %a, undef, <undef, undef, 1, 2>'. As a result, the first 2 lanes of the result are undef after the combine, but they really should be 0, unless I am missing something. For ops that do fold to undef on undef operands, the current behavior should be fine. I've added a conservative check OpDoesRestrictUndef, maybe there's a better existing utility? Reviewers: spatel, RKSimon, lebedev.ri Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D70093	2019-11-13 13:40:34 +00:00
Luís Marques	c5b56caa32	Revert "[RISCV] Fix wrong CFI directives" test/DebugInfo/RISCV/relax-debug-frame.ll wasn't properly updated.	2019-11-13 13:28:33 +00:00
Florian Hahn	70cc355f2f	[InstCombine] Precommit shuffle tests for D70093.	2019-11-13 13:25:28 +00:00
Sjoerd Meijer	d90804d26b	[ARM][MVE] canTailPredicateLoop This implements TTI hook 'preferPredicateOverEpilogue' for MVE. This is a first version and it operates on single block loops only. With this change, the vectoriser will now determine if tail-folding scalar remainder loops is possible/desired, which is the first step to generate MVE tail-predicated vector loops. This is disabled by default for now. I.e., this depends on option -disable-mve-tail-predication, which is off by default. I will follow up on this soon with a patch for the vectoriser to respect loop hint 'vectorize.predicate.enable'. I.e., with this loop hint set to Disabled, we don't want to tail-fold and we shouldn't query this TTI hook, which is done in D70125. Differential Revision: https://reviews.llvm.org/D69845	2019-11-13 13:24:33 +00:00
Luís Marques	a5ce8bd715	[RISCV] Fix wrong CFI directives Summary: Removes CFI CFA directives that could incorrectly propagate beyond the basic block they were intended for. Specifically it removes the epilogue CFI directives. See the branch_and_tail_call test for an example of the issue. Should fix the stack unwinding issues caused by the incorrect directives. Reviewers: asb, lenary, shiva0217 Reviewed By: lenary Tags: #llvm Differential Revision: https://reviews.llvm.org/D69723	2019-11-13 13:06:15 +00:00
Simon Tatham	a12f588ebb	[ARM,MVE] Add intrinsics for contiguous load/stores. This patch adds the ACLE intrinsics for all the MVE load and store instructions not already handled by D69791. These ones don't need new IR intrinsics, because they can be implemented in terms of standard LLVM IR constructions. Some of the load and store instructions access less than 128 bits of memory, sign/zero extending each value to a wider vector lane on load or truncating it on store. These are represented in IR by a load of a shorter vector followed by a zext/sext, and conversely, a trunc followed by a short store. Existing ISel patterns already recognize those combinations and turn them into the right MVE instructions. The predicated forms of all these instructions are represented in the same way, except that the ordinary load/store operation is replaced with the existing intrinsics @llvm.masked.{load,store}. These are currently only code-generated as predicated MVE load/store instructions if you give LLVM the `-enable-arm-maskedldst` option; so I've done that in the LLVM codegen test. When we make that the default, that option can be removed. In the Tablegen backend, I've had to add a handful of extra support features: * We need to be able to make clang::Address objects out of a pointer and an alignment (previously we only needed these when the user passed us an existing one). * We can now specify vector types that aren't 128 bits wide (for use in those intermediate values in IR), the parametrized type system can make one starting from two existing vector types (using the lane count of one and the element type of the other). * I've added support for code generation of pointer casts, and for specifying LLVM types as operands to IRBuilder operations (for zext and sext, though I think they'll come in useful again). * Now not all IR construction operations need to be specified as Builder.CreateFoo; some don't involve a Builder at all, and one passes it as a parameter to a tiny static helper function in CGBuiltin.cpp. Reviewers: ostannard, MarkMurrayARM, dmgreen Subscribers: kristof.beyls, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D70088	2019-11-13 12:47:00 +00:00
Simon Pilgrim	4d0e7b628a	[X86][AVX] Add plausible schedule classes to MASKPAIR/VP2INTERSECT/VDPBF16PS instructions These are really just placeholders that use approximately the right resources - once we have CPU scheduler models that support these instructions they will need revisiting. In the meantime this means that all instructions have a class of some kind, meaning models can be more easily flagged as complete.	2019-11-13 12:02:01 +00:00
JonChesterfield	fd9fa9995c	[libomptarget] Move supporti.h to support.cu Summary: [libomptarget] Move supporti.h to support.cu Reimplementation of D69652, without the unity build and refactors. Will need a clean build of libomptarget as the cmakelists changed. Reviewers: ABataev, jdoerfert Reviewed By: jdoerfert Subscribers: mgorny, jfb, openmp-commits Tags: #openmp Differential Revision: https://reviews.llvm.org/D70131	2019-11-13 11:36:46 +00:00
Hans Wennborg	6ea4775900	Revert `57dd4b0` "[ValueTracking] Allow context-sensitive nullness check for non-pointers" This caused miscompiles of Chromium (https://crbug.com/1023818). The reduced repro is small enough to fit here: $ cat /tmp/a.c unsigned char f(unsigned char *p) { unsigned char result = 0; for (int shift = 0; shift < 1; ++shift) result |= p[0] << (shift * 8); return result; } $ bin/clang -O2 -S -o - /tmp/a.c | grep -A4 f: f: # @f .cfi_startproc # %bb.0: # %entry xorl %eax, %eax retq That's nicely optimized, but I don't think it's the right result :-) > Same as D60846 but with a fix for the problem encountered there which > was a missing context adjustment in the handling of PHI nodes. > > The test that caused D60846 to be reverted was added in `e15ab8f277`. > > Reviewers: nikic, nlopes, mkazantsev,spatel, dlrobertson, uabelho, hakzsam > > Subscribers: hiraditya, bollu, llvm-commits > > Tags: #llvm > > Differential Revision: https://reviews.llvm.org/D69571 This reverts commit `57dd4b03e4`.	2019-11-13 12:19:02 +01:00
Mirko Brkusanin	fed17867cd	[Mips] Add rematerialization support for ldi.fmt Instruction ldi.fmt can be considered cheap enough to avoid spill and restore of value that it produces since it's loaded from immediate. Differential Revision: https://reviews.llvm.org/D69898	2019-11-13 11:33:52 +01:00
Simon Atanasyan	068db2ed4d	[mips] Show an error if 64-bit target triple provided with 32-bit CPU When a 64-bit triple is used emit an error if the CPU only supports 32-bit code. Patch by Miloš Stojanović. Differential Revision: https://reviews.llvm.org/D70018	2019-11-13 13:32:39 +03:00
Simon Atanasyan	b3853d8526	[mips][test] Add Mips CPU tests. NFC Adding tests check all available CPUs on Mips. Patch by Miloš Stojanović. Differential Revision: https://reviews.llvm.org/D70017	2019-11-13 13:32:39 +03:00
Sven van Haastregt	2fe674baa3	[OpenCL] Add remaining vector data builtin functions Add the remaining half (fp16) vector data load and store builtin functions from the OpenCL C specification. Patch by Pierre Gondois and Sven van Haastregt.	2019-11-13 10:16:33 +00:00
Daniil Suchkov	cba4a27745	Temporarily revert "[InstCombine] Fold PHIs with equal incoming pointers" Revert due to sanitizer-windows buildbot failure. This reverts commit `bbb29738b5`.	2019-11-13 17:14:11 +07:00
David Stenberg	5e646ff530	[DebugInfo] Avoid creating entry values for clobbered registers Summary: Entry values are considered for parameters that have register-described DBG_VALUEs in the entry block (along with other conditions). If a parameter's value has been propagated from the caller to the callee, then the parameter's DBG_VALUE in the entry block may be described using a register defined by some instruction, and entry values should not be emitted for the parameter, which can currently occur. One such case was seen in the attached test case, in which the second parameter, which is described by a redefinition of the first parameter's register, would incorrectly get an entry value using the first parameter's register. This commit intends to solve such cases by keeping track of register defines, and ignoring DBG_VALUEs in the entry block that are described by such registers. In a RelWithDebInfo build of clang-8, the average size of the set was 27, and in a RelWithDebInfo+ASan build it was 30. Reviewers: djtodoro, NikolaPrica, aprantl, vsk Reviewed By: djtodoro, vsk Subscribers: hiraditya, llvm-commits Tags: #debug-info, #llvm Differential Revision: https://reviews.llvm.org/D69889	2019-11-13 11:10:47 +01:00
David Stenberg	4fec44cd61	[DebugInfo] Add helper for finding entry value candidates [NFC] Summary: The conditions that are used to determine if entry values should be emitted for a parameter are quite many, and will grow slightly in a follow-up commit, so move those to a helper function, as was suggested in the code review for D69889. Reviewers: djtodoro, NikolaPrica Reviewed By: djtodoro Subscribers: probinson, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D69955	2019-11-13 11:10:47 +01:00
Sander de Smalen	3367686b4d	[AArch64] Extend storeRegToStackSlot to spill SVE registers. This patch allows the register allocator to spill SVE registers to the stack. Reviewers: ostannard, efriedma, rengolin, cameron.mcinally Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D70082	2019-11-13 10:09:32 +00:00
Daniil Suchkov	bbb29738b5	[InstCombine] Fold PHIs with equal incoming pointers In case when all incoming values of a PHI are equal pointers, this transformation inserts a definition of such a pointer right after definition of the base pointer and replaces with this value both PHI and all its incoming pointers. Primary goal of this transformation is canonicalization of this pattern in order to enable optimizations that can't handle PHIs. Non-inbounds pointers aren't currently supported. Reviewers: spatel, RKSimon, lebedev.ri, apilipenko Reviewed By: apilipenko Tags: #llvm Subscribers: hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D68128	2019-11-13 17:00:34 +07:00
Sander de Smalen	9a1c243aa5	[AArch64][SVE] Allocate locals that are scalable vectors. This patch adds a target interface to set the StackID for a given type, which allows scalable vectors (e.g. `<vscale x 16 x i8>`) to be assigned a 'sve-vec' StackID, so it is allocated in the SVE area of the stack frame. Reviewers: ostannard, efriedma, rengolin, cameron.mcinally Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D70080	2019-11-13 09:45:24 +00:00
Simon Tatham	5b9e4daef0	[ARM,MVE] Use VMOV.{S8,S16} for sign-extended extractelement. MVE includes instructions that extract an 8- or 16-bit lane from a vector and sign-extend it into the output 32-bit GPR. `ARMInstrMVE.td` already included isel patterns to select those instructions in response to the `ARMISD::VGETLANEs` selection-DAG node type. But `ARMISD::VGETLANEs` was never actually generated, because the code that creates it was conditioned on NEON only. It's an easy fix to enable the same code for integer MVE, and now IR that sign-extends the result of an extractelement (whether explicitly or as part of the function call ABI) will use `vmov.s8` instead of `vmov.u8` followed by `sxtb`. Reviewers: SjoerdMeijer, dmgreen, ostannard Subscribers: kristof.beyls, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D70132	2019-11-13 09:08:41 +00:00
David Zarzycki	1d55c9e59e	[libcxx testing] Fix -Wtautological-overlap-compare bug	2019-11-13 10:55:19 +02:00
joanlluch	d384ad6b63	[TargetLowering][DAGCombine][MSP430] Shift Amount Threshold in DAGCombine (4) Summary: Replaces ``` unsigned getShiftAmountThreshold(EVT VT) ``` by ``` bool shouldAvoidTransformToShift(EVT VT, unsigned amount) ``` thus giving more flexibility for targets to decide whether particular shift amounts must be considered expensive or not. Updates the MSP430 target with a custom implementation. This continues D69116, D69120, D69326 and updates them, so all of them must be committed before this. Existing tests apply, a few more have been added. Reviewers: asl, spatel Reviewed By: spatel Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D70042	2019-11-13 09:23:08 +01:00
Craig Topper	a4b7613a49	[X86] Remove setOperationAction for FP_TO_SINT v8i16. This is no longer needed after widening legalization as we custom legalize v8i8 ourselves. Added entries to the cost model, but bumped the cost slightly to account for the truncate shuffle that wasn't costed before.	2019-11-12 22:45:52 -08:00
Michael Kruse	7be6ec5fa2	[GPGPU] Fix regression test after 395124. Commit 395124 "NVPTX: Don't insert an extra empty line at the end of the last section" changed the length of the kernel payload. Update the regression test to the new binary size.	2019-11-13 06:20:17 +00:00
Jonas Devlieghere	7ba28644a1	[Reproducer] Discard reproducer directory if not generated. If lldb was run in capture mode, but no reproducer was generated, make sure we clean up the reproducer directory.	2019-11-12 20:16:33 -08:00
Francesco Petrogalli	d8b6b11143	[VFABI] Add LLVM internal mangling for vector functions. Summary: This patch adds a custom ISA for vector functions for internal use in LLVM. The <isa> token is set to "_LLVM_", and it is not attached to any specific instruction Vector ISA, or Vector Function ABI. The ISA is used as a token for handling Vector Function ABI-style vectorization for those vector functions that are not directly associated to any existing Vector Function ABI (for example, some of the vector functions exposed by TargetLibraryInfo). The demangling function for this ISA in a Vector Function ABI context is set to be the same as the common one shared between X86 and AArch64. Reviewers: jdoerfert, sdesmalen, simoll Subscribers: kristof.beyls, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D70089	2019-11-13 03:26:39 +00:00
Justin Hibbits	bc4bc5aa0d	Add 8548 CPU definition and attributes 8548 CPU is GCC's name for the e500v2, so accept this in clang. The e500v2 doesn't support lwsync, so define __NO_LWSYNC__ for this as well, as GCC does. Differential Revision: https://reviews.llvm.org/D67787	2019-11-12 20:34:34 -06:00
Matt Arsenault	9d7bccab66	AMDGPU: Extend add x, (ext setcc) combine to sub This is the same as the add case, but inverts the operation type. This avoids regressions in a future patch.	2019-11-13 07:13:58 +05:30
Matt Arsenault	4b47213951	AMDGPU: Switch backend default max workgroup size to 1024 Previously this would default to 256, not the maximum supported size of 1024. Using a maximum lower than the hardware maximum requires language runtimes to enforce this limit for correctness, which no language has correctly done. Switch the default to the conservatively correct maximum, and force frontends to opt-in to the more optimal 256 default maximum. I don't really understand why the changes in occupancy-levels.ll increased the computed occupancy, which I expected to decrease. I'm not sure if these tests should be forcing the old maximum.	2019-11-13 07:11:02 +05:30
Matt Arsenault	25c5da5a42	AMDGPU Reduce reported maximum group size to 1024 While some targets allow encoding 2048, this was never tested or supported.	2019-11-13 06:34:28 +05:30
Alina Sbirlea	793b42a454	[GlobalsAA] Reenable test.	2019-11-12 16:53:28 -08:00
Muhammad Omair Javaid	9b95835698	[LLDB] Add core definition for armv8l and armv7l This patch adds core definitions in lldb ArchSpecs for armv8l and armv7l cores. This was needed because on Linux running on 32-bit Arm v8 we are returned armv8l in case we are running 32-bit sysroot on 64bit kernel. In case of 32-bit kernel and 32-bit sysroot running on arm v8 hardware we are returned armv7l. This is quite common when we run 32 bit arm using docker container. Signed-off-by: Muhammad Omair Javaid <omair.javaid@linaro.org> Differential Revision: https://reviews.llvm.org/D69904	2019-11-13 05:40:09 +05:00
Richard Smith	5ad6f279f2	Don't assume that the clang binary's resolved name includes the string 'clang'. This is not true in practice in some content-addressed file systems.	2019-11-12 16:37:05 -08:00
Leonard Chan	e278c138a9	[Sema] Add MacroQualified case for FunctionTypeUnwrapper This is a fix for PR43315. An assertion error is hit for this minimal example: ``` //clang -cc1 -triple x86_64-- -S tstVMStructRC-min.cpp int (a b)(); // Assertion `Chunk.Kind == DeclaratorChunk::Function' failed. ``` This is because we do not cover the case in the FunctionTypeUnwrapper where it receives a MacroQualifiedType. We have not run into this earlier because this is a unique case where the __attribute__ contains both __cdecl__ and __regparm__ (in that order), and we are compiling for x86_64. Changing the architecture or the order of __cdecl__ and __regparm__ does not raise the assertion. Differential Revision: https://reviews.llvm.org/D67992	2019-11-12 16:22:13 -08:00
Alina Sbirlea	92611da5bf	Temporarily disable test.	2019-11-12 15:57:51 -08:00
Eric Christopher	7a3ad48d6d	Temporarily Revert "Reapply [LVI] Normalize pointer behavior" as it's broken python 3.6. Reverting to figure out if it's a problem in python or the compiler for now. This reverts commit `885a05f48a`.	2019-11-12 15:51:51 -08:00
Jonas Devlieghere	056c319769	[LLDB] Only set FRAMEWORK when we're actually building a framework.	2019-11-12 15:42:07 -08:00
Jonas Devlieghere	34ca6e1fbe	[LLDB] Remove debug message in AddLLDB.cmake	2019-11-12 15:33:03 -08:00
Douglas Yung	7ebde1bf67	Add a shim for setenv on PS4 since it does not exist. A few years back a similar change was made for getenv since neither function is supported on the PS4 platform. Recently, commit `d889d1e` added a call to setenv in compiler-rt which was causing linking errors because the symbol was not found. This fixes that issue by putting in a shim similar to how we previously dealt with the lack of getenv. Differential Revision: https://reviews.llvm.org/D70033	2019-11-12 15:05:45 -08:00
Craig Topper	3e1aee2ba7	[X86] Don't consider v64i1 as a legal type unless v64i8 is also a legal type. This avoids some nasty issues with argument passing and lowering of arbitrary v64i8 shuffles.	2019-11-12 14:56:02 -08:00
Craig Topper	0f04ffc073	[X86] Only pass v64i8/v32i16 as v16i32 on non-avx512bw targets if the v16i32 type won't be split by prefer-vector-width=256 Otherwise just let the v64i8/v32i16 types be split to v32i8/v16i16. In reality this shouldn't happen because it means we have a 512-bit vector argument, but min-legal-vector-width says a value less than 512. But a 512-bit argument should have been factored into the preferred vector width.	2019-11-12 14:56:01 -08:00
Sterling Augustine	38c356176b	Fix include guard and properly order __deregister_frame_info. Summary: This patch fixes two problems with the crtbegin.c as written: 1. In do_init, register_frame_info is not guarded by a #define, but in do_fini, deregister_frame_info is guarded by #ifndef CRT_HAS_INITFINI_ARRAY. Thus when CRT_HAS_INITFINI_ARRAY is not defined, frames are registered but then never deregistered. The frame registry mechanism builds a linked-list from the .so's static variable do_init.object, and when the .so is unloaded, this memory becomes invalid and should be deregistered. Further, libgcc's crtbegin treats the frame registry as independent from the initfini array mechanism. This patch fixes this by adding a new #define, "EH_USE_FRAME_INFO_REGISTRY", which is set by the cmake option COMPILER_RT_CRT_USE_EH_FRAME_REGISTRY Currently, do_init calls register_frame_info, and then calls the binary's constructors. This allows constructors to safely use libunwind. However, do_fini calls deregister_frame_info and then calls the binary's destructors. This prevents destructors from safely using libunwind. This patch also switches that ordering, so that destructors can safely use libunwind. As it happens, this is a fairly common scenario for thread sanitizer.	2019-11-12 14:54:41 -08:00
Weverything	9740f9f0b6	Add -Wtautological-compare to -Wall Some warnings in -Wtautological-compare subgroups are DefaultIgnore. Adding this group to -Wmost, which is part of -Wall, will aid in their discoverability. Differential Revision: https://reviews.llvm.org/D69292	2019-11-12 14:36:57 -08:00
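
The reduced reproducer quoted in the revert of `57dd4b0` (commit 6ea4775900) runs its loop exactly once with shift equal to 0, so a correct compile should hand back the byte at p[0] rather than the constant 0 that the quoted assembly returns. A minimal sketch of that expectation, assuming the parameter is `unsigned char *p` and the shift amount is `shift * 8`; `f_returns_input` is a hypothetical helper added here for illustration and is not part of the commit:

```c
/* Reduced reproducer from the revert message (assumed form: pointer
   parameter, shift amount of shift * 8). */
unsigned char f(unsigned char *p) {
  unsigned char result = 0;
  /* The loop body executes once, with shift == 0. */
  for (int shift = 0; shift < 1; ++shift)
    result |= p[0] << (shift * 8);
  return result;
}

/* Hypothetical helper: returns 1 when f() gives back the byte it was
   handed, which is what a correct compile should produce. */
int f_returns_input(unsigned char byte) {
  return f(&byte) == byte;
}
```

Building this at -O2 with an affected compiler and calling `f_returns_input` with a nonzero byte would be one way to observe the miscompile described in the commit message.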

331882 Commits