llvm-project

Commit Graph

Author	SHA1	Message	Date
Craig Topper	03aa967c0d	[CostModel][X86][ARM] Teach getCastInstrCost to include the splitting factor when handling operations that type legalize to the same number of subvectors or scalar components Previously, we just always returned 1. But that ignores that we have to do the operation for each subvector or scalar component. Differential Revision: https://reviews.llvm.org/D78824	2020-04-24 13:36:26 -07:00
Alexandre Ganea	0e13a0331f	[llvm-cov] Prevent llvm-cov from using too many threads As reported here: https://reviews.llvm.org/D75153#1987272 Before, each instance of llvm-cov was creating one thread per hardware core, which wasn't needed probably because the number of inputs was small. This was probably causing a thread rlimit issue on large core count systems. After this patch, the previous behavior is restored (to what was before rG8404aeb5): If --num-threads is not specified, we create one thread per input, up to num.cores. When specified, --num-threads indicates any number of threads, with no upper limit. Differential Revision: https://reviews.llvm.org/D78408	2020-04-24 15:28:25 -04:00
Tyker	42431da895	[AssumeBundles] Use assume bundles in isKnownNonZero Summary: Use nonnull and dereferenceable from an assume bundle in isKnownNonZero Reviewers: jdoerfert, nikic, lebedev.ri, reames, fhahn, sstefan1 Reviewed By: jdoerfert Subscribers: fhahn, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76149	2020-04-24 20:41:51 +02:00
Vedant Kumar	c0fa447e02	AArch64: Remove reversedInstructionsWithoutDebug helper When using reversedInstructionsWithoutDebug to construct a range from a pair of MachineInstrBundleIterators, the range unexpectedly leaves out an element. This results in mis-optimization as @mstorsjo points out in https://reviews.llvm.org/D78157. The problem is that when we convert a MachineInstrBundleIterator to a reverse iterator, the result gets incremented: MachineInstrBundleIterator(++I.getReverse()) The comment there explains that the "resulting iterator will dereference ... to the previous node, which is somewhat unexpected; but converting the two endpoints in a range will give the same range in reverse". This makes it hard to understand what reversedInstructionsWithoutDebug will do: I've removed the helper to prevent similar mistakes in the future.	2020-04-24 11:28:17 -07:00
Mircea Trofin	fdbf493a70	[llvm][NFC][CallSite] Remove {Immutable}CallSite and CallSiteBase Reviewers: dblaikie, craig.topper Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78794	2020-04-24 11:03:35 -07:00
Christopher Tetreault	947be4a024	[SVE] Do not store a bool for Scalable in VectorType Summary: - Whether or not a vector is scalable is a function of its type. Since all instances of ScalableVectorType will have true for this value and all instances of FixedVectorType will have false for this value, there is no need to store it as a class member. Reviewers: efriedma, fpetrogalli, kmclaughlin Reviewed By: fpetrogalli Subscribers: tschuett, hiraditya, rkruppe, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78601	2020-04-24 10:36:26 -07:00
Craig Topper	4cf73a3fc6	[CostModel][X86] Account for splitting cost when vector zext/sext type legalize to the same size vector.	2020-04-24 09:59:23 -07:00
Alexandre Ganea	e98f73a629	[MC] Fix quadratic behavior in addPendingLabel() Differential Revision: https://reviews.llvm.org/D78775	2020-04-24 12:48:54 -04:00
Mircea Trofin	c3770c5d6d	[llvm][NFC] Factor out inlining pipeline as a module pipeline. Summary: This simplifies testing in scenarios where we want to set up module-wide analyses for inlining. The patch enables treating inlining and its function cleanups, as a module pass. The alternative would be for tests to describe the pipeline, which is tedious and adds maintenance overhead. Reviewers: davidxl, dblaikie, jdoerfert, sstefan1 Subscribers: hiraditya, steven_wu, dexonsmith, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78512	2020-04-24 09:24:12 -07:00
Luke Geeson	740a1dd050	[ARM] Armv8.6-a Matrix Mul cmd line support This patch upstreams support for the Armv8.6-a Matrix Multiplication Extension. A summary of the features can be found here: https://community.arm.com/developer/ip-products/processors/b/processors-ip-blog/posts/arm-architecture-developments-armv8-6-a This patch includes: - Command line options to enable these features with +i8mm, +f32mm, or +f64mm Note: +f32mm and +f64mm are optional and so are not enabled by default This is part of a patch series, starting with BFloat16 support and the other components in the armv8.6a extension (in previous patches linked in phabricator) Based on work by: - Luke Geeson - Oliver Stannard - Luke Cheeseman Reviewers: t.p.northover, DavidSpickett Reviewed By: DavidSpickett Subscribers: DavidSpickett, ostannard, kristof.beyls, danielkiss, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D77875	2020-04-24 15:54:06 +01:00
Luke Geeson	7da1905125	[AArch32] Armv8.6-a Matrix Mult Assembly + Intrinsics This patch upstreams support for the Armv8.6-a Matrix Multiplication Extension. A summary of the features can be found here: https://community.arm.com/developer/ip-products/processors/b/processors-ip-blog/posts/arm-architecture-developments-armv8-6-a This patch includes: - Assembly support for AArch32 - Intrinsics Support for AArch32 Neon Intrinsics for Matrix Multiplication Note: these extensions are optional in the 8.6a architecture and so have to be enabled by default No additional IR types or C Types are needed for this extension. This is part of a patch series, starting with BFloat16 support and the other components in the armv8.6a extension (in previous patches linked in phabricator) Based on work by: - Luke Geeson - Oliver Stannard - Luke Cheeseman Reviewers: t.p.northover, miyuki Reviewed By: miyuki Subscribers: miyuki, ostannard, kristof.beyls, hiraditya, danielkiss, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D77872	2020-04-24 15:54:06 +01:00
Luke Geeson	832cd74913	[AArch64] Armv8.6-a Matrix Mult Assembly + Intrinsics This patch upstreams support for the Armv8.6-a Matrix Multiplication Extension. A summary of the features can be found here: https://community.arm.com/developer/ip-products/processors/b/processors-ip-blog/posts/arm-architecture-developments-armv8-6-a This patch includes: - Assembly support for AArch64 only (no SVE or Neon) - Intrinsics Support for AArch64 Armv8.6a Matrix Multiplication Instructions (No bfloat16 matrix multiplication) No IR types or C Types are needed for this extension. This is part of a patch series, starting with BFloat16 support and the other components in the armv8.6a extension (in previous patches linked in phabricator) Based on work by: - Luke Geeson - Oliver Stannard - Luke Cheeseman Reviewers: ostannard, t.p.northover, rengolin, kmclaughlin Reviewed By: kmclaughlin Subscribers: kmclaughlin, kristof.beyls, hiraditya, danielkiss, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D77871	2020-04-24 15:54:06 +01:00
Benjamin Kramer	7aaff8fd2d	[ADT] Move allocate_buffer to MemAlloc.h and out of line There's an ABI breakage here if LLVM is compiled in C++14 without aligned allocation and a user tries to use the result with aligned allocation. If DenseMap or unique_function is used across that ABI boundary it will break (PR45413). Moving it out of line is a bit of a band-aid and LLVM doesn't really give ABI guarantees at this level, but given the number of complaints I've received over this it still seems worth fixing.	2020-04-24 13:32:50 +02:00
Simon Pilgrim	0517255a28	PassAnalysisSupport.h - reduce StringRef.h include to forward declaration. NFC.	2020-04-24 12:17:11 +01:00
Max Kazantsev	9cd4debd5a	[LoopVectorize] Preserve CFG analyses if CFG wasn't modified One of the transforms the loop vectorizer makes is LCSSA formation. In some cases it is the only transform it makes. We should not drop CFG analyses if only LCSSA was formed and no actual CFG changes were made. We should think of expanding this logic to other passes as well, and maybe make it a part of PM framework. Reviewed By: Florian Hahn Differential Revision: https://reviews.llvm.org/D78360	2020-04-24 17:22:24 +07:00
Simon Atanasyan	0eec6662f6	[MC][mips] Replace setRType## methods by single setRTypes function. NFC MCELFObjectWriter::setRType## methods are always used altogether to build complete MIPS N64 ABI "chain" of relocations. Using single function for this task makes code less verbose.	2020-04-24 12:13:27 +03:00
Johannes Doerfert	1dfc473177	Revert "[Attributor][NFC] Encode IRPositions in the bits of a single pointer" A dependent patch has been reverted [0]. Until it goes back in this one has to stay out. [0] `ebdb893994` This reverts commit `d254b50b2b`.	2020-04-24 02:53:51 -05:00
Johannes Doerfert	ebdb893994	Revert "[Attributor][NFC] Let AbstractAttribute be an IRPosition" It seems this breaks the windows builds: http://lab.llvm.org:8011/builders/llvm-clang-win-x-aarch64/builds/7454/steps/build-llvm-project/logs/stdio This reverts commit `6782635e90`.	2020-04-24 02:24:15 -05:00
Johannes Doerfert	d254b50b2b	[Attributor][NFC] Encode IRPositions in the bits of a single pointer This reduces memory consumption for IRPositions by eliminating the vtable pointer and the `KindOrArgNo` integer. Since each abstract attribute has an associated IRPosition, the 12-16 bytes we save add up quickly. No functional change is intended. --- Single run of the Attributor module and then CGSCC pass (oldPM) for SPASS/clause.c (~10k LLVM-IR loc): Before: ``` calls to allocation functions: 469545 (260135/s) temporary memory allocations: 77137 (42735/s) peak heap memory consumption: 30.50MB peak RSS (including heaptrack overhead): 119.50MB total memory leaked: 269.07KB ``` After: ``` calls to allocation functions: 468999 (274108/s) temporary memory allocations: 77002 (45004/s) peak heap memory consumption: 28.83MB peak RSS (including heaptrack overhead): 118.05MB total memory leaked: 269.07KB ``` Difference: ``` calls to allocation functions: -546 (5808/s) temporary memory allocations: -135 (1436/s) peak heap memory consumption: -1.67MB peak RSS (including heaptrack overhead): 0B total memory leaked: 0B ``` --- CTMark 15 runs Metric: compile_time Program lhs rhs diff test-suite...:: CTMark/sqlite3/sqlite3.test 25.07 24.09 -3.9% test-suite...Mark/mafft/pairlocalalign.test 14.58 14.14 -3.0% test-suite...-typeset/consumer-typeset.test 21.78 21.58 -0.9% test-suite :: CTMark/SPASS/SPASS.test 21.95 22.03 0.4% test-suite :: CTMark/lencod/lencod.test 25.43 25.50 0.3% test-suite...ark/tramp3d-v4/tramp3d-v4.test 23.88 23.83 -0.2% test-suite...TMark/7zip/7zip-benchmark.test 60.24 60.11 -0.2% test-suite :: CTMark/kimwitu++/kc.test 15.69 15.69 -0.0% test-suite...:: CTMark/ClamAV/clamscan.test 25.43 25.42 -0.0% test-suite :: CTMark/Bullet/bullet.test 37.63 37.62 -0.0% Geomean difference -0.8% --- Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D78722	2020-04-24 01:58:47 -05:00
Johannes Doerfert	6782635e90	[Attributor][NFC] Let AbstractAttribute be an IRPosition Since every AbstractAttribute so far, and for the foreseeable future, corresponds to a single IRPosition we can simplify the class structure. We already did this for IRAttribute but there is no reason to stop there.	2020-04-24 01:58:47 -05:00
Johannes Doerfert	2891b007e3	[Attributor][NFC] Add `const` and missing state constructors	2020-04-24 01:08:23 -05:00
Mircea Trofin	b8960b5d81	[llvm][NFC][CallSite] Remove remaining {Immutable}CallSite uses Reviewers: dblaikie, craig.topper Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78789	2020-04-23 22:19:39 -07:00
James Y Knight	01097dc644	Remove InvokeInst methods which duplicate those of its superclass CallBase.	2020-04-23 19:11:11 -04:00
Christopher Tetreault	5d0c3a8026	[SVE] Remove VectorType::isScalable() Summary: * This is a property of the instance of VectorType. If isa<ScalableVectorType>(T) is true, then T->isScalable() would have returned true and vice-versa. Most code that checks this function uses the result to bail out if a vector is a scalable vector. This code will be cleaner if it just calls isa<ScalableVectorType>(T) Reviewers: efriedma, craig.topper, huntergr, sdesmalen Reviewed By: sdesmalen Subscribers: tschuett, rkruppe, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77690	2020-04-23 15:39:59 -07:00
aartbik	907871d9ad	[llvm] [CodeGen] Fixed vector halving bug for masked load Summary: Given a VL=14 that is enveloped by a proper VL=16, splitting the masked load using the enveloping halving VL=8/8 should eventually yield V=8/6. This fixes various assert failures in getHalfNumVectorElementsVT() and IncrementMemoryAddress(). Note, I suspect similar fixes will be needed for other masked operations, but for now I send out a fix for masked load only. Bugzilla issue 45563 https://bugs.llvm.org/show_bug.cgi?id=45563 Reviewers: craig.topper, mehdi_amini, nicolasvasilache Reviewed By: craig.topper Subscribers: hiraditya, dmgreen, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78608	2020-04-23 15:12:44 -07:00
Puyan Lotfi	9721fbf85b	[NFC] Refactoring PropertyAttributeKind for ObjCPropertyDecl and ObjCDeclSpec. This is a code clean up of the PropertyAttributeKind and ObjCPropertyAttributeKind enums in ObjCPropertyDecl and ObjCDeclSpec that are exactly identical. This non-functional change consolidates these enums into one. The changes are to many files across clang (and comments in LLVM) so that everything refers to the new consolidated enum in DeclObjCCommon.h. 2nd Landing Attempt... Differential Revision: https://reviews.llvm.org/D77233	2020-04-23 17:21:25 -04:00
Christopher Tetreault	3ecced163f	[SVE] Remove calls to isScalable from IR Reviewers: efriedma, sdesmalen, dexonsmith, dblaikie Reviewed By: sdesmalen Subscribers: tschuett, hiraditya, rkruppe, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77691	2020-04-23 11:51:22 -07:00
Mircea Trofin	201498c6f3	[llvm][NFC] Factor out cost-model independent inlining decision Summary: llvm::getInlineCost starts off by determining whether inlining should happen or not because of user directives or easily determinable unviability. This CL refactors this functionality as a reusable API. Reviewers: davidxl, eraman Reviewed By: davidxl, eraman Subscribers: hiraditya, haicheng, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73825	2020-04-23 10:58:43 -07:00
Christopher Tetreault	3d178581ac	[SVE] Make VectorType::getNumElements() complain for scalable vectors Summary: Piggy-back off of TypeSize's STRICT_FIXED_SIZE_VECTORS flag and: - if it is defined, assert that the vector is not scalable - if it is not defined, complain if the vector is scalable Reviewers: efriedma, sdesmalen, c-rhodes Reviewed By: sdesmalen Subscribers: hiraditya, mgorny, tschuett, rkruppe, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78576	2020-04-23 10:47:38 -07:00
Vedant Kumar	517f0f14bf	MachineBasicBlock: Avoid copy in skipDebugInstructions{Forward,Backward}, NFC	2020-04-23 10:22:28 -07:00
Mircea Trofin	cea6f4d5f8	[llvm][NFC][CallSite] Remove CallSite from TypeMetadataUtils & related Reviewers: craig.topper, dblaikie Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78666	2020-04-23 08:23:16 -07:00
Simon Pilgrim	7c5fc40060	XCOFF.h - replace StringRef.h include with forward declaration. NFC. Move StringRef.h include to XCOFF.cpp	2020-04-23 13:52:48 +01:00
River Riddle	7f85adb54d	[mlir][Standard] Allow select to use an i1 for vector and tensor values It currently requires that the condition match the shape of the selected value, but this is only really useful for things like masks. This revision allows for the use of i1 to mean that all of the vector/tensor is selected. This also matches the behavior of LLVM select. A benefit of this change is that transformations that want to generate selects, like those on the CFG, don't have to special case vector/tensor. Previously the only way to generate a select from an i1 was to use a splat, but that doesn't support dynamically shaped/unranked tensors. Differential Revision: https://reviews.llvm.org/D78690	2020-04-23 04:50:09 -07:00
River Riddle	af331bc52d	[mlir][Standard] Add a canonicalization to simplify cond_br when the successors are identical This revision adds support for canonicalizing the following: ``` cond_br %cond, ^bb1(A, ..., N), ^bb1(A, ..., N) br ^bb1(A, ..., N) ``` If the operands to the successor are different and the cond_br is the only predecessor, we emit selects for the branch operands. ``` cond_br %cond, ^bb1(A), ^bb1(B) %select = select %cond, A, B br ^bb1(%select) ``` Differential Revision: https://reviews.llvm.org/D78682	2020-04-23 04:42:02 -07:00
Serguei Katkov	c0d2bbb1d4	[CaptureTracking] Replace hardcoded constant to option. NFC. The motivation is to be able to play with the option and change if it is required. Reviewers: fedor.sergeev, apilipenko, rnk, jdoerfert Reviewed By: fedor.sergeev Subscribers: hiraditya, dantrushin, llvm-commits Differential Revision: https://reviews.llvm.org/D78624	2020-04-23 18:23:35 +07:00
Sander de Smalen	a5e0389b2a	[AArch64] Define ACLE FP conversion intrinsics with more specific predicate. This patch changes the FP conversion intrinsics to take a predicate that matches the number of lanes for the vector with the widest element type as opposed to using <vscale x 16 x i1>. For example: ```<vscale x 4 x float> @llvm.aarch64.sve.fcvt.f32f16(<vscale x 4 x float>, <vscale x 4 x i1>, <vscale x 8 x half>)``` now uses <vscale x 4 x i1> instead of <vscale x 16 x i1> And similar for: ```<vscale x 4 x float> @llvm.aarch64.sve.fcvt.f32f64(<vscale x 4 x float>, <vscale x 2 x i1>, <vscale x 2 x double>)``` where the predicate now matches the wider type, so <vscale x 2 x i1>. Reviewers: efriedma, SjoerdMeijer, paulwalker-arm, rengolin Reviewed By: efriedma Tags: #clang Differential Revision: https://reviews.llvm.org/D78402	2020-04-23 10:53:23 +01:00
Kazuaki Ishizaki	0312b9f550	[llvm] NFC: Fix trivial typo in rst and td files Differential Revision: https://reviews.llvm.org/D77469	2020-04-23 14:26:32 +09:00
Puyan Lotfi	bbf386f02b	Revert "[NFC] Refactoring PropertyAttributeKind for ObjCPropertyDecl and ObjCDeclSpec." This reverts commit `2aa044ed08`. Reverting due to bot failure in lldb.	2020-04-23 00:05:08 -04:00
Puyan Lotfi	2aa044ed08	[NFC] Refactoring PropertyAttributeKind for ObjCPropertyDecl and ObjCDeclSpec. This is a code clean up of the PropertyAttributeKind and ObjCPropertyAttributeKind enums in ObjCPropertyDecl and ObjCDeclSpec that are exactly identical. This non-functional change consolidates these enums into one. The changes are to many files across clang (and comments in LLVM) so that everything refers to the new consolidated enum in DeclObjCCommon.h. Differential Revision: https://reviews.llvm.org/D77233	2020-04-22 23:27:06 -04:00
Vedant Kumar	5c04274dab	[GIsel][CombinerHelper] Don't consider debug insts in dominance queries [3/14] Summary: This fixes several issues where the presence of debug instructions could disable certain combines, due to dominance queries finding uses/defs that don't actually exist. Reviewers: dsanders, fhahn, paquette, aemerson Subscribers: hiraditya, arphaman, aprantl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78253	2020-04-22 17:03:40 -07:00
Vedant Kumar	10ce1bc8d0	[MachineBasicBlock] Add helpers for skipping debug instructions [1/14] Summary: These helpers are exercised by follow-up commits in this patch series, which is all about removing CodeGen differences with vs. without debug info in the AArch64 backend. Reviewers: fhahn, aprantl, jpaquette, paquette Subscribers: kristof.beyls, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78260	2020-04-22 17:03:39 -07:00
Mark Lacey	328bb446dd	Add a policy to enable computing SchedDFSResult. Summary: Make GenericScheduler compute SchedDFSResult on initialization if the policy is set. This makes it possible to create classes that extend GenericScheduler and rely on the results of SchedDFSResult, e.g. to perform subtree scheduling. NFC unless the policy is set. Subscribers: MatzeB, hiraditya, javed.absar, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78432	2020-04-22 16:36:11 -07:00
Juneyoung Lee	aca335955c	[ValueTracking] Let analyses assume a value cannot be partially poison Summary: This is RFC for fixes in poison-related functions of ValueTracking. These functions assume that a value can be poison bitwisely, but the semantics of bitwise poison is not clear at the moment. Allowing a value to have bitwise poison adds complexity to reasoning about correctness of optimizations. This patch makes the analysis functions simply assume that a value is either fully poison or not, which has been used to understand the correctness of a few previous optimizations. The bitwise poison semantics seems to be only used by these functions as well. In terms of implementation, using value-wise poison concept makes existing functions do more precise analysis, which is what this patch contains. Reviewers: spatel, lebedev.ri, jdoerfert, reames, nikic, nlopes, regehr Reviewed By: nikic Subscribers: fhahn, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78503	2020-04-23 08:08:53 +09:00
Juneyoung Lee	5ceef26350	Revert "RFC: [ValueTracking] Let analyses assume a value cannot be partially poison" This reverts commit `80faa8c3af`.	2020-04-23 08:07:09 +09:00
Juneyoung Lee	80faa8c3af	RFC: [ValueTracking] Let analyses assume a value cannot be partially poison Summary: This is RFC for fixes in poison-related functions of ValueTracking. These functions assume that a value can be poison bitwisely, but the semantics of bitwise poison is not clear at the moment. Allowing a value to have bitwise poison adds complexity to reasoning about correctness of optimizations. This patch makes the analysis functions simply assume that a value is either fully poison or not, which has been used to understand the correctness of a few previous optimizations. The bitwise poison semantics seems to be only used by these functions as well. In terms of implementation, using value-wise poison concept makes existing functions do more precise analysis, which is what this patch contains. Reviewers: spatel, lebedev.ri, jdoerfert, reames, nikic, nlopes, regehr Reviewed By: nikic Subscribers: fhahn, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78503	2020-04-23 07:57:12 +09:00
Lang Hames	3ceea67c09	[JITLink] Fix edge removal iterator invalidation. This patch changes Block::removeEdge to return a valid iterator to the new next element, and uses this to update the edge removal algorithm in LinkGraph::splitBlock.	2020-04-22 14:16:46 -07:00
Craig Topper	be04aba6fc	[CallSite removal][ValueTracking] Use CallBase instead of ImmutableCallSite for getIntrinsicForCallSite. NFC Differential Revision: https://reviews.llvm.org/D78613	2020-04-22 12:06:58 -07:00
Alexander Shaposhnikov	c19c3293d3	[ObjectYAML][MachO] Add support for relocations Add support for relocations for MachO to ObjectYAML / yaml2obj / obj2yaml. Test plan: make check-all Differential revision: https://reviews.llvm.org/D77844	2020-04-22 11:50:55 -07:00
Johannes Doerfert	68a27587c2	[OpenMP][FIX] Do not use InaccessibleMemOrArgMemOnly for barrier and flush This was reported as PR45635, committed first as `72a9e7c926`, reverted by `188f5cde96`, and now recommitted with the test change.	2020-04-22 11:10:54 -05:00
Christopher Tetreault	2dea3f1298	[SVE] Add new VectorType subclasses Summary: Introduce new types for fixed width and scalable vectors. Does not remove getNumElements yet so as to not break code during transition period. Reviewers: deadalnix, efriedma, sdesmalen, craig.topper, huntergr Reviewed By: sdesmalen Subscribers: jholewinski, arsenm, jvesely, nhaehnle, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, csigg, arpith-jacob, mgester, lucyrfox, liufengdb, kerbowa, Joonsoo, grosul1, frgossen, lldb-commits, tschuett, hiraditya, rkruppe, psnobl, llvm-commits Tags: #llvm, #lldb Differential Revision: https://reviews.llvm.org/D77587	2020-04-22 08:59:01 -07:00
Johannes Doerfert	188f5cde96	Revert "[OpenMP][FIX] Do not use InaccessibleMemOrArgMemOnly for barrier and flush" Forgot to include test changes :( This reverts commit `72a9e7c926`.	2020-04-22 10:36:54 -05:00
Johannes Doerfert	72a9e7c926	[OpenMP][FIX] Do not use InaccessibleMemOrArgMemOnly for barrier and flush This was reported as PR45635.	2020-04-22 10:18:46 -05:00
Kerry McLaughlin	17f6e18acf	[AArch64][SVE] Add SVE intrinsic for LD1RQ Summary: Adds the following intrinsic for contiguous load & replicate: - @llvm.aarch64.sve.ld1rq The LD1RQ intrinsic only needs the SImmS16XForm added by this patch. The others (SImmS2XForm, SImmS3XForm & SImmS4XForm) were added for consistency. Reviewers: andwar, sdesmalen, efriedma, cameron.mcinally, dancgr, rengolin Reviewed By: sdesmalen Subscribers: tschuett, kristof.beyls, hiraditya, rkruppe, psnobl, danielkiss, cfe-commits, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76929	2020-04-22 11:29:27 +01:00
Benjamin Kramer	b198f1f86c	Make some static class members constexpr This allows them to be ODR used in C++17 mode. NFC.	2020-04-22 12:25:01 +02:00
Craig Topper	05a11974ae	[CallSite removal] Remove unneeded includes of CallSite.h. NFC	2020-04-22 00:07:13 -07:00
Craig Topper	daadb48553	[CallSite removal][TargetTransformInfoImpl] Replace CallSite with CallBase. NFC	2020-04-21 22:49:30 -07:00
Andrew Browne	a30e7ea88e	Make SmallVector assert if it cannot grow. Context: /// Double the size of the allocated memory, guaranteeing space for at /// least one more element or MinSize if specified. void grow(size_t MinSize = 0) { this->grow_pod(MinSize, sizeof(T)); } void push_back(const T &Elt) { if (LLVM_UNLIKELY(this->size() >= this->capacity())) this->grow(); memcpy(reinterpret_cast<void *>(this->end()), &Elt, sizeof(T)); this->set_size(this->size() + 1); } When grow is called in push_back() without a MinSize specified, this is relying on the guarantee of space for at least one more element. There is an edge case bug where the SmallVector is already at its maximum size and push_back() calls grow() with default MinSize of zero. Grow is unable to provide space for one more element, but push_back() assumes the additional element will be available. This can result in silent memory corruption, as this->end() will be an invalid pointer and the program may continue executing. An alternative fix would be to remove the default argument from grow(), which would mean changing grow() to grow(this->size()+1) in several places. No test case added because it would require allocating ~4GB. Reviewers: echristo Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77601	2020-04-21 17:53:39 -07:00
Amy Huang	2360933147	Reland "Implement some functions in NativeSession." with fixes so that the tests pass on Linux. Summary: This change implements readFromExe, and calculating VA and RVA, which are some of the functionalities that will be used for native PDB reading for llvm symbolizer. bug: https://bugs.llvm.org/show_bug.cgi?id=41795	2020-04-21 16:35:27 -07:00
Amy Huang	507d80fbd2	Revert "Implement some NativeSession functions" along with some followup fixes. This reverts commits `a6d8a055e9` `4927ae0858` `1e1f5eb7c9`	2020-04-21 14:20:13 -07:00
Christopher Tetreault	8bec33c096	[SVE] Remove VectorType::getBitWidth() Summary: * VectorType::getBitWidth() is just an unsafe version of getPrimitiveSizeInBits() that assumes all vectors are fixed width. Reviewers: efriedma, sdesmalen, huntergr, craig.topper Reviewed By: efriedma Subscribers: tschuett, hiraditya, rkruppe, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77833	2020-04-21 13:33:46 -07:00
Johannes Doerfert	46b7ed0e6f	[Attributor] Remove dependence edges eagerly If we have a dependence between an abstract attribute A to an abstract attribute B such that changes in A should trigger an update of B, we do not need to keep the dependence around once the update was triggered. If the dependence is still required the update will reinsert it into the dependence map, if it is not we avoid triggering B in the future. This replaces the "recompute interval" mechanism we used before to prune stale dependences. Number of required iterations is generally down, compile time for the module pass (not really the CGSCC pass) is down quite a bit. There is one test change which looks like an artifact in the undefined behavior AA that needs to be looked at.	2020-04-21 15:22:10 -05:00
Johannes Doerfert	c5794f77eb	[Attributor][PM] Introduce `-attributor-enable={none,cgscc,module,all}` The old command line option `-attributor-disable` was too coarse grained as we want to measure the effects of the module or cgscc pass without the other as well. Since `none` is the default there is no real functional change. Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D78571	2020-04-21 15:22:10 -05:00
Amy Huang	a6d8a055e9	Implement some functions in NativeSession. Summary: This change implements readFromExe, and calculating VA and RVA, which are some of the functionalities that will be used for native PDB reading for llvm symbolizer. bug: https://bugs.llvm.org/show_bug.cgi?id=41795 Reviewers: hans, amccarth, rnk Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78128	2020-04-21 11:48:40 -07:00
Ana Pazos	66590e1e9e	[MC][PGO][PGSO] Cleanup unused MBFI in AsmPrinter Summary: Machine Block Frequency Info (MBFI) is being computed but unused in AsmPrinter. MBFI computation was introduced with PGO change D71149 and then its use was removed in D71106. No need to keep computing it. Reviewers: MaskRay, jyknight, skan, yamauchi, davidxl, efriedma, huihuiz Reviewed By: MaskRay, skan, yamauchi Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78526	2020-04-21 10:01:56 -07:00
Fangrui Song	5771c98562	[XRay] Change xray_instr_map sled addresses from absolute to PC relative for x86-64 xray_instr_map contains absolute addresses of sleds, which are relocated by `R_*_RELATIVE` when linked in -pie or -shared mode. By making these addresses relative to PC, we can avoid the dynamic relocations and remove the SHF_WRITE flag from xray_instr_map. We can thus save VM pages containing xray_instr_map (because they are not modified). This patch changes x86-64 and bumps the sled version to 2. Subsequent changes will change powerpc64le and AArch64. Reviewed By: dberris, ianlevesque Differential Revision: https://reviews.llvm.org/D78082	2020-04-21 09:36:09 -07:00
Johannes Doerfert	177c065e50	[Attributor] Use a pointer value type for the OpcodeInstMap This reduces memory consumption and the need to copy complex data structures repeatedly. No functional change is intended. --- Single run of the Attributor module and then CGSCC pass (oldPM) for SPASS/clause.c (~10k LLVM-IR loc): Before: ``` calls to allocation functions: 490390 (320725/s) temporary memory allocations: 84601 (55330/s) peak heap memory consumption: 41.70MB peak RSS (including heaptrack overhead): 131.18MB total memory leaked: 269.04KB ``` After: ``` calls to allocation functions: 489359 (301144/s) temporary memory allocations: 82983 (51066/s) peak heap memory consumption: 36.76MB peak RSS (including heaptrack overhead): 126.48MB total memory leaked: 269.04KB ``` Difference: ``` calls to allocation functions: -1031 (-10739/s) temporary memory allocations: -1618 (-16854/s) peak heap memory consumption: -4.94MB peak RSS (including heaptrack overhead): 0B total memory leaked: 0B ``` ---	2020-04-21 11:20:09 -05:00
Johannes Doerfert	99662c22cd	[Attributor] Use a pointer value type for the QueryMap This reduces memory consumption and the need to copy complex data structures repeatedly. No functional change is intended. --- Single run of the Attributor module and then CGSCC pass (oldPM) for SPASS/clause.c (~10k LLVM-IR loc): Before: ``` calls to allocation functions: 596180 (374484/s) temporary memory allocations: 84979 (53378/s) peak heap memory consumption: 52.14MB peak RSS (including heaptrack overhead): 139.79MB total memory leaked: 269.04KB ``` After: ``` calls to allocation functions: 489200 (303285/s) temporary memory allocations: 83406 (51708/s) peak heap memory consumption: 41.70MB peak RSS (including heaptrack overhead): 131.76MB total memory leaked: 269.04KB ``` Difference: ``` calls to allocation functions: -106980 (-5094285/s) temporary memory allocations: -1573 (-74904/s) peak heap memory consumption: -10.44MB peak RSS (including heaptrack overhead): 0B total memory leaked: 0B ---	2020-04-21 11:20:04 -05:00
Johannes Doerfert	40f3baeb20	[Attributor] Pass the Attributor to the AbstractAttribute constructors AbstractAttribute::initialize is used to initialize the deduction and the object we do not always call it. To make sure we have the option to initialize the object even if initialize is not called we pass the Attributor to AbstractAttribute constructors now.	2020-04-21 11:20:02 -05:00
Johannes Doerfert	91a6c88349	[Attributor] Use a pointer value type for the AAMap This reduces memory consumption and the need to copy complex data structures repeatedly. No functional change is intended. --- Single run of the Attributor module and then CGSCC pass (oldPM) for SPASS/clause.c (~10k LLVM-IR loc): Before: ``` calls to allocation functions: 613353 (376521/s) temporary memory allocations: 83636 (51341/s) peak heap memory consumption: 75.64MB peak RSS (including heaptrack overhead): 162.97MB total memory leaked: 269.04KB ``` After: ``` calls to allocation functions: 616575 (349929/s) temporary memory allocations: 83650 (47474/s) peak heap memory consumption: 72.15MB peak RSS (including heaptrack overhead): 159.81MB total memory leaked: 269.04KB ``` Difference: ``` calls to allocation functions: 3222 (24225/s) temporary memory allocations: 14 (105/s) peak heap memory consumption: -3.49MB peak RSS (including heaptrack overhead): 0B total memory leaked: 0B ---	2020-04-21 11:19:58 -05:00
Pavel Labath	cc0acda782	[DWARFDataExtractor] Add a "truncating" constructor Summary: This constructor allows us to create a new DWARFDataExtractor which will only present a subrange of an entire debug section. Since debug sections typically consist of multiple contributions, it is expected that one will create a new data extractor for each contribution in order to avoid unexpectedly running off into the next one. This is very useful for unifying the flows for detecting parse errors. Without it, the code needs to consider two very different scenarios: 1. If there is another contribution after the current one, the DataExtractor functions will just start reading from there. This is detectable by comparing the current offset against the known end-of-contribution offset. 2. If this is the last contribution, the data extractor will just start returning zeroes (or other default values). This situation can not be detected by checking the parsing offset, as this will not be advanced in case of errors. Using a truncated data extractor simplifies the code (and reduces cognitive load) by making these two cases behave identically -- a running off the end of a contribution will _always_ produce an EOF error (if one uses error-aware parsing methods) or return default values. Reviewers: dblaikie, probinson, jhenderson, ikudrin Subscribers: aprantl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77556	2020-04-21 16:48:09 +02:00
Simon Pilgrim	75aeb53485	SHA1.h - remove unnecessary ArrayRef.h/StringRef.h includes. NFC. By moving the update(StringRef) wrapper into SHA1.cpp we can depend just on system headers.	2020-04-21 15:12:17 +01:00
Sam Parker	32c0561e0c	Attempting fix buildbot getUserCost is faulting for some builders.	2020-04-21 11:42:21 +01:00
Sam Parker	ee959ddc5e	[TTI] Remove getOperationCost This API call has been used recently with the very valid expectation that it would do something useful, but it doesn't actually query any backend information. So, remove this method and merge its functionality into getUserCost. As well as that, also use getCastInstrCost to get a proper cost from the backend for the concerned instructions, though we only currently return the answer if it's considered free. The default implementation now also checks int/ptr conversions, as well as truncs and bitcasts. Differential Revision: https://reviews.llvm.org/D76124	2020-04-21 09:15:34 +01:00
Craig Topper	2cf3c033f3	[DenseMap] Don't capture the BucketEnd pointer before an operation that might change the number of buckets. This code was added in `887efa51c1` to fix reverse iteration. The call to InsertIntoBucket/InsertIntoBucketWithLookup can change the number of buckets which will invalidate the BucketEnd. So don't cache it and calculate it when creating the iterator.	2020-04-21 00:36:34 -07:00
Craig Topper	68b2e507e4	[Local] Update getOrEnforceKnownAlignment/getKnownAlignment to use Align/MaybeAlign. Differential Revision: https://reviews.llvm.org/D78443	2020-04-20 21:31:44 -07:00
Shengchen Kan	c031378ce0	[MC][NFC] Use camelCase style for functions in MCObjectStreamer	2020-04-20 20:09:20 -07:00
Shengchen Kan	8bb059ab63	[MC][Bugfix] Remove redundant parameter for relaxInstruction Summary: Before this patch, `relaxInstruction` takes three arguments, the first argument refers to the instruction before relaxation and the third argument is the output instruction after relaxation. There are two quite strange things: 1) The first argument's type is `const MCInst &`, the third argument's type is `MCInst &`, but they may be aliased to the same variable 2) The backends of ARM, AMDGPU, RISC-V, Hexagon assume that the third argument is a fresh uninitialized `MCInst` even if `relaxInstruction` may be called like `relaxInstruction(Relaxed, STI, Relaxed)` in a loop. In this patch, we drop the third argument, and let `relaxInstruction` directly modify the given instruction. Also, this patch fixes the bug https://bugs.llvm.org/show_bug.cgi?id=45580, which is introduced by D77851, and breaks the assumption of ARM, AMDGPU, RISC-V, Hexagon. Reviewers: Razer6, MaskRay, jyknight, asb, luismarques, enderby, rtaylor, colinl, bcain Reviewed By: Razer6, MaskRay, bcain Subscribers: bcain, nickdesaulniers, nathanchance, wuzish, annita.zhang, arsenm, dschuff, jyknight, dylanmckay, sdardis, nemanjai, jvesely, nhaehnle, tpr, sbc100, jgravelle-google, kristof.beyls, hiraditya, aheejin, kbarton, fedor.sergeev, asb, rbar, johnrusso, simoncook, sabuasal, niosHD, jrtc27, MaskRay, zzheng, edward-jones, atanasyan, rogfer01, MartinMosbeck, brucehoult, the_o, PkmX, jocewei, Jim, lenary, s.egerton, pzheng, sameer.abuasal, apazos, luismarques, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78364	2020-04-21 11:06:55 +08:00
Johannes Doerfert	87aa362985	[Attributor] Use the BumpPtrAllocator in InformationCache as well We now also use the BumpPtrAllocator from the Attributor in the InformationCache. The lifetime of objects in either is pretty much the same and it should result in consistently good performance regardless of the allocator. Doing so requires to call more constructors manually but so far that does not seem to be problematic or messy. --- Single run of the Attributor module and then CGSCC pass (oldPM) for SPASS/clause.c (~10k LLVM-IR loc): Before: ``` calls to allocation functions: 615359 (368257/s) temporary memory allocations: 83315 (49859/s) peak heap memory consumption: 75.64MB peak RSS (including heaptrack overhead): 163.43MB total memory leaked: 269.04KB ``` After: ``` calls to allocation functions: 613042 (359555/s) temporary memory allocations: 83322 (48869/s) peak heap memory consumption: 75.64MB peak RSS (including heaptrack overhead): 162.92MB total memory leaked: 269.04KB ``` Difference: ``` calls to allocation functions: -2317 (-68147/s) temporary memory allocations: 7 (205/s) peak heap memory consumption: 2.23KB peak RSS (including heaptrack overhead): 0B total memory leaked: 0B ---	2020-04-20 21:12:41 -05:00
Shengchen Kan	f0019d4ff2	[MC][NFC] Use camelCase style for function EmitInstToData	2020-04-20 18:53:39 -07:00
Chris Bieneman	887efa51c1	Fix DenseMap iterator asserts when shouldReverseIterate==true This patch gets the asserts working correctly when LLVM_REVERSE_ITERATION=On by fixing the iterators returned by the DenseMap::find* methods so that they return well-formed iterators that work with reverse iteration, and satisfy the assertions.	2020-04-20 19:31:32 -05:00
Chris Bieneman	2171fa63b3	Fixing bot breakage This should resolve the failures from `31282d399b`.	2020-04-20 17:44:17 -05:00
Chris Bieneman	31282d399b	Fix LLVM_REVERSE_ITERATION A recent change (`4e86e5eedc`), broke `LLVM_REVERSE_ITERATION` for DenseMaps by adding an assert. It is valid to de-reference and increment one step behind `End` when reverse iteration is enabled because `End` is actually the start of the pointer bucket.	2020-04-20 17:30:31 -05:00
Sriraman Tallam	365b60fc93	New pass to make internal linkage symbol names unique. With clang option -funique-internal-linkage-symbols, symbols with internal linkage get names with the module hash appended. Differential Revision: https://reviews.llvm.org/D78243	2020-04-20 15:05:22 -07:00
Christopher Tetreault	56e4888627	[SVE] Remove calls to getBitWidth from Analysis Reviewers: efriedma, sdesmalen, jnspaulsson, jonpa Reviewed By: efriedma Subscribers: tschuett, rkruppe, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77898	2020-04-20 13:39:45 -07:00
Andrew Litteken	1488bef8fc	[MachineOutliner] Annotation for outlined functions in AArch64 - Adding changes to support comments on outlined functions with outlining for the conditions through which it was outlined (e.g. Thunks, Tail calls) - Adapts the emitFunctionHeader to print out a comment next to the header if the target specifies it based on information in MachineFunctionInfo - Adds mir test for function annotation Differential Revision: https://reviews.llvm.org/D78062	2020-04-20 13:33:31 -07:00
Craig Topper	fcc9d70260	Revert "[Local] Update getOrEnforceKnownAlignment/getKnownAlignment to use Align/MaybeAlign." This is breaking the clang build. This reverts commit `897409fb56`.	2020-04-20 13:25:06 -07:00
Craig Topper	897409fb56	[Local] Update getOrEnforceKnownAlignment/getKnownAlignment to use Align/MaybeAlign. Differential Revision: https://reviews.llvm.org/D78443	2020-04-20 13:08:05 -07:00
Nikita Popov	b3f5472c2b	[ValueLattice] Add move constructor (NFC) Following the rule of five, declare move constructor and move assignment operator for ValueLatticeElement. This allows moving the ConstantRange rather than copying it. This does not matter in most cases, where we're dealing with APInts <= 64 bits. It does avoid unnecessary copies of allocations for larger APInts. Additionally we change the implementation approach to make the copy/move assignment operators make use of the copy/move constructors, rather than the other way around. The constructors are the more primitive operations. Differential Revision: https://reviews.llvm.org/D78425	2020-04-20 18:32:38 +02:00
Nikita Popov	54d01cbc15	[IPT] Don't use OrderedInstructions (NFC) Use Instruction::comesBefore() instead of OrderedInstructions inside InstructionPrecedenceTracking. This also removes the dominator tree dependency. Differential Revision: https://reviews.llvm.org/D78461	2020-04-20 18:25:31 +02:00
Konstantin Schwarz	12030494fc	[GlobalISel] Introduce InlineAsmLowering class Summary: Similar to the CallLowering class used for lowering LLVM IR calls to MIR calls, we introduce a separate class for lowering LLVM IR inline asm to MIR INLINEASM. There is no functional change yet, all existing tests should pass. Reviewers: arsenm, dsanders, aemerson, volkan, t.p.northover, paquette Reviewed By: aemerson Subscribers: gargaroff, wdng, mgorny, rovka, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78316	2020-04-20 15:10:18 +02:00
Georgii Rymar	76e0ab23f6	[FileCheck] - Refactor the code related to string arrays. NFCI. There are few `std::vector<std::string>` members in `FileCheckRequest`. This patch changes these arrays to `std::vector<StringRef>` and refactors the code related to cleanup/improve/simplify it. Differential revision: https://reviews.llvm.org/D78202	2020-04-20 14:54:49 +03:00
Sam Parker	e3056ae9a0	[NFC][TTI] Explicit use of VectorType The API for shuffles and reductions uses generic Type parameters, instead of VectorType, and so assertions and casts are used a lot. This patch makes those types explicit, which means that the clients can't be lazy, but results in less ambiguity, and that can only be a good thing. Bugzilla: https://bugs.llvm.org/show_bug.cgi?id=45562 Differential Revision: https://reviews.llvm.org/D78357	2020-04-20 09:16:52 +01:00
Bill Wendling	edcfc391e1	[Object] Use BFD name for little-endian PowerPC64 Summary: Little-endian PowerPC object files should report "elf64-powerpcle" instead of "elf64-powerpc". Reviewers: jhenderson, MaskRay, espindola, alexshap, rupprecht, #powerpc Reviewed By: MaskRay Subscribers: wuzish, emaste, nemanjai, shchenz, steven.zhang, llvm-commits Tags: #llvm, #powerpc Differential Revision: https://reviews.llvm.org/D78344	2020-04-19 20:10:05 -07:00
Craig Topper	252873879e	[CallSite removal][Analysis] Replace CallSite with CallBase in MemoryBuiltins. NFC Differential Revision: https://reviews.llvm.org/D78449	2020-04-19 18:32:48 -07:00
Simon Pilgrim	69062a1cf6	SectionMemoryManager.h - remove unnecessary StringRef.h include. NFC.	2020-04-19 21:25:10 +01:00
Florian Hahn	a7aaadc135	[TTI] Clean up includes (NFC). Remove some unnecessary includes, replace some with forward declarations. This also exposed a few places that were missing some includes.	2020-04-19 20:11:59 +01:00
Florian Hahn	32af48cdcf	[IVDescriptors] Clean up includes. Some includes are not required and forward declarations can be used instead. This also exposed a few places that were not directly including required files.	2020-04-19 20:07:47 +01:00
Florian Hahn	7a87e8f90b	[LoopUtils] Clean up includes, use forward decls if appropriate (NFC). Most of the includes in LoopUtils.h are not required in the header and they can be replaced by forward declarations. Unfortunately includes of TargetTransformInfo.h and IVDescriptors.h pull in a bunch of additional things, but there is no easy way to get rid of them at the moment I think.	2020-04-19 19:44:29 +01:00
Simon Pilgrim	330162c5a6	DependenceGraphBuilder.h - remove unused includes. NFC. Replace with forward declarations.	2020-04-19 17:58:17 +01:00
Fangrui Song	041a3557f0	[CMake] Delete HAVE_SCHED_GETAFFINITY and HAVE_CPU_COUNT sched_getaffinity (Linux specific) has been available * in glibc since 2002-08-08 (commit 972e719e8154eec5f543b027e2a08dfa285d55d5) * in musl since the initial check-in.	2020-04-19 08:50:23 -07:00
Florian Hahn	e01ae15066	[LAA] Remove unnecessary includes (NFC).	2020-04-19 15:16:29 +01:00
Simon Pilgrim	cbd790a443	DebugHandlerBase.h - reduce MachineInstr.h include to DebugLoc.h include. We were only including MachineInstr.h for DebugLoc.h. This exposes an implicit include dependency in BTFDebug.h where I've had to add the MachineInstr.h include.	2020-04-19 11:14:01 +01:00
Simon Pilgrim	9308dffc21	BuildLibCalls.h - remove unnecessary TargetLibraryInfo forward declaration. NFC We already have to include the TargetLibraryInfo.h header.	2020-04-19 11:14:00 +01:00
Simon Pilgrim	c96ca71a9f	TypeBasedAliasAnalysis.h - replace InstrTypes.h include with forward declaration. NFC.	2020-04-19 11:13:59 +01:00
Benjamin Kramer	ff54d1c897	Remove remaining callers of CreateShuffleVector with unsigned indices and mark it as deprecated No functionality change intended.	2020-04-19 11:48:28 +02:00
Simon Pilgrim	59b0e015fc	OMPConstants.h - replace StringRef.h include with forward declaration. NFC.	2020-04-19 10:29:48 +01:00
Florian Hahn	6ba0695c60	[ValueLattice] Add struct for merge options. This makes it easier to extend the merge options in the future and also reduces the risk of accidentally setting a wrong option. Reviewers: efriedma, nikic, reames, davide Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D78368	2020-04-19 09:03:16 +01:00
Craig Topper	2a58271158	[CallSite removal][PtrUseVisitor] Use visitCallBase instead of visitCallSite. NFC	2020-04-18 23:15:12 -07:00
Craig Topper	a310da85cb	[SyntheticCountsPropagation] Remove unnecessary includes and add a LLVM license header. NFC Noticed while looking for CallSite.h uses to remove.	2020-04-18 22:33:41 -07:00
Carl Ritson	ad0d3bbb27	[Dominators] Facilitate updates to MachinePostDominatorTree Summary: Add getBase accessor so that underlying tree can be manipulated in a similar manner to MachineDominatorTree. Reviewers: kuhar, arsenm, hliao, nhaehnle Reviewed By: kuhar Subscribers: lkail, mgorny, hiraditya, wdng, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77967	2020-04-19 10:04:35 +09:00
Lang Hames	702b3f01dd	[ORC] Add a convenience method to create a JITEvaluatedSymbol from a pointer. This can be used to reduce boilerplate code, especially when defining absolute symbols.	2020-04-18 14:16:54 -07:00
Lang Hames	c6ade39ba0	[ORC] Replace LLJIT::defineAbsolute with an LLJIT::define convenience method. LLJIT::defineAbsolute did not mangle its Name argument, which is inconsistent with the behavior of other LLJIT methods (e.g. lookup). Since it is currently unused anyway, this commit replaces it with a generic 'define' convenience method for adding MaterializationUnits to the main JITDylib. This simplifies use of the generic absoluteSymbols function (as well as the symbolAlias, reexports and other functions that generate MaterializationUnits) with LLJIT.	2020-04-18 14:16:54 -07:00
Simon Pilgrim	9719b638be	UnrollLoop.h - replace StringRef.h/ValueMapper.h includes with forward declarations. NFC.	2020-04-18 21:43:22 +01:00
Nikita Popov	c5c967c6a2	[ValueLattice] Remove unnecessary ConstVal nulling (NFC) ConstVal is not an owned pointer, so setting it to nullptr is not actually doing anything. If we switch to a state that does not use ConstVal, the value does not matter. Split out from D78425.	2020-04-18 22:36:38 +02:00
Nikita Popov	a42fd18d0f	[PredicateInfo] Factor out PredicateInfoBuilder (NFC) When running IPSCCP on a module with many small functions, memory usage is dominated by PredicateInfo, which is a huge structure (partially due to some unfortunate nested SmallVector use). However, most of it is actually only temporary state needed to build predicate info, and does not need to be retained after initial construction. This patch factors out the predicate building logic and state into a separate PredicateInfoBuilder, with the extra bonus that it does not need to live in the header anymore. Differential Revision: https://reviews.llvm.org/D78326	2020-04-18 22:34:38 +02:00
LemonBoy	aad3d578da	[DebugInfo] Change DIEnumerator payload type from int64_t to APInt This allows the representation of arbitrarily large enumeration values. See https://lists.llvm.org/pipermail/llvm-dev/2017-December/119475.html for context. Reviewed By: andrewrk, aprantl, MaskRay Differential Revision: https://reviews.llvm.org/D62475	2020-04-18 12:49:31 -07:00
Mircea Trofin	ec73ae11a3	[llvm][NFC][CallSite] Remove CallSite from ProfileSummary Summary: Depends on D78395. Reviewers: craig.topper, dblaikie, wmi, davidxl Subscribers: eraman, hiraditya, haicheng, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78414	2020-04-18 12:03:14 -07:00
vgxbj	ac00376a13	[Object] Change uint32_t getSymbolFlags() to Expected<uint32_t> getSymbolFlags(). This change enables getSymbolFlags() to return errors which benefit error reporting in clients. Differential Revision: https://reviews.llvm.org/D77860	2020-04-18 21:27:57 +08:00
Simon Pilgrim	9b95186c30	HeatUtils.h - remove unnecessary includes. NFC. Replace with BlockFrequencyInfo/Function forward declarations Move BlockFrequencyInfo.h include to HeatUtils.cpp	2020-04-18 13:37:06 +01:00
Florian Hahn	b7cdb138af	[ValueLattice] Use 8 bits for Tag. Suggested as follow-up in D78145 post-commit to be more machine friendly.	2020-04-18 13:31:17 +01:00
Simon Pilgrim	0b24215101	IRReader.h - remove unnecessary StringRef forward declaration. NFC. We need to include StringRef.h.	2020-04-18 12:31:42 +01:00
Nikita Popov	f005f6c234	Revert "ADT: SmallVector size/capacity use word-size integers when elements are small" This reverts commit `b8d08e961d`. This change causes a 1% compile-time and 1% memory usage regression: http://llvm-compile-time-tracker.com/compare.php?from=73b7dd1fb3c17a4ac4b1f1e603f26fa708009649&to=b8d08e961df1d229872c785ebdbc8367432e9752&stat=instructions http://llvm-compile-time-tracker.com/compare.php?from=73b7dd1fb3c17a4ac4b1f1e603f26fa708009649&to=b8d08e961df1d229872c785ebdbc8367432e9752&stat=max-rss	2020-04-18 11:46:58 +02:00
Florian Hahn	4ee45ab60f	[LV] Invalidate cost model decisions along with interleave groups. Cost-modeling decisions are tied to the compute interleave groups (widening decisions, scalar and uniform values). When invalidating the interleave groups, those decisions also need to be invalidated. Otherwise there is a mis-match during VPlan construction. VPWidenMemoryRecipes created initially are left around w/o converting them into VPInterleave recipes. Such a conversion indeed should not take place, and these gather/scatter recipes may in fact be right. The crux is leaving around obsolete CM_Interleave (and dependent) markings of instructions along with their costs, instead of recalculating decisions, costs, and recipes. Alternatively to forcing a complete recompute later on, we could try to selectively invalidate the decisions connected to the interleave groups. But we would likely need to run the uniform/scalar value detection parts again anyways and the extra complexity is probably not worth it. Fixes PR45572. Reviewers: gilr, rengolin, Ayal, hsaito Reviewed By: Ayal Differential Revision: https://reviews.llvm.org/D78298	2020-04-18 10:23:49 +01:00
Brad Moody	fb42d3afad	[ADT] Fix bug in BitVector and SmallBitVector DenseMap hashing. BitVectors and SmallBitVectors with equal contents but different capacities were getting different hashes. Reviewed By: aganea Differential Revision: https://reviews.llvm.org/D77038	2020-04-18 00:21:08 -05:00
Mircea Trofin	41ad8b7388	[llvm][NFC][CallSite] Remove CallSite from Evaluator. Reviewers: craig.topper, dblaikie Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78395	2020-04-17 19:11:17 -07:00
Craig Topper	cd28a4736a	[AbstractCallSite] Fix some doxygen comments I failed to update when ImmutableCallSite was replaced with CallBase. Also fix an 80 column violation.	2020-04-17 17:08:28 -07:00
Matt Arsenault	f463792506	AMDGPU: Remove custom node for RSQ_LEGACY Directly select from the intrinsic. This wasn't getting much value from the custom node.	2020-04-17 19:50:36 -04:00
Andrew Browne	b8d08e961d	ADT: SmallVector size/capacity use word-size integers when elements are small SmallVector currently uses 32bit integers for size and capacity to reduce sizeof(SmallVector). This limits the number of elements to UINT32_MAX. For a SmallVector<char>, this limits the SmallVector size to only 4GB. Buffering bitcode output uses SmallVector<char>, but needs >4GB output. This changes SmallVector size and capacity to conditionally use word-size integers if the element type is small (<4 bytes). For larger elements types, the vector size can reach ~16GB with 32bit size. Making this conditional on the element type provides both the smaller sizeof(SmallVector) for larger types which are unlikely to grow so large, and supports larger capacities for smaller element types. This change also includes a fix for the bug where a SmallVector with 32bit size has reached UINT32_MAX elements, and cannot provide guaranteed growth. Context: // Double the size of the allocated memory, guaranteeing space for at // least one more element or MinSize if specified. void grow(size_t MinSize = 0) { this->grow_pod(MinSize, sizeof(T)); } void push_back(const T &Elt) { if (LLVM_UNLIKELY(this->size() >= this->capacity())) this->grow(); memcpy(reinterpret_cast<void *>(this->end()), &Elt, sizeof(T)); this->set_size(this->size() + 1); } When grow is called in push_back() without a MinSize specified, this is relying on the guarantee of space for at least one more element. There is an edge case bug where the SmallVector is already at its maximum size and push_back() calls grow() with default MinSize of zero. Grow is unable to provide space for one more element, but push_back() assumes the additional element will be available. This can result in silent memory corruption, as this->end() will be an invalid pointer and the program may continue executing. An alternative to this fix would be to remove the default argument from grow(), which would mean changing grow() to grow(this->size()+1) in several places. No test case added because it would require allocating a large amount. Differential Revision: https://reviews.llvm.org/D77621	2020-04-17 16:11:13 -07:00
Christopher Tetreault	c858debebc	Remove asserting getters from base Type Summary: Remove asserting vector getters from Type in preparation for the VectorType refactor. The existence of these functions complicates the refactor while adding little value. Reviewers: dexonsmith, sdesmalen, efriedma Reviewed By: efriedma Subscribers: cfe-commits, hiraditya, llvm-commits Tags: #llvm, #clang Differential Revision: https://reviews.llvm.org/D77278	2020-04-17 14:03:31 -07:00
Bjorn Pettersson	4e7e414ec9	[Float2Int] Make iteration over Roots deterministic Summary: Use a SmallSetVector instead of a SmallPtrSet when collecting and storing Roots. The iteration order for a SmallPtrSet is not deterministic, so in the past the order of items inserted in the WorkList inside walkBackwards has been non-deterministic. This patch intends to make the order of rewrites done in Float2Int deterministic by changing the container for the Roots set. The semantics result of the transformation should not be any different afaict. But at least naming of IR variables (when outputting the result as an ll file) should be more stable now. Reviewers: craig.topper, spatel, cameron.mcinally Reviewed By: spatel Subscribers: mgrang, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74534	2020-04-17 21:40:12 +02:00
Craig Topper	5f6d93c7d3	[CallSite removal][Attributor] Replaces use of CallSite with CallBase. NFC Differential Revision: https://reviews.llvm.org/D78343	2020-04-17 10:44:31 -07:00
Craig Topper	0feaba683e	[CallSite removal][MemCpyOptimizer] Replace CallSite with CallBase. NFC There are also some adjustments to use MaybeAlign in here due to CallBase::getParamAlignment() being deprecated. It would be a little cleaner if getOrEnforceKnownAlignment was migrated to Align/MaybeAlign. Differential Revision: https://reviews.llvm.org/D78345	2020-04-17 10:32:45 -07:00
Craig Topper	8c94d616e1	Revert "[CallSite removal][MemCpyOptimizer] Replace CallSite with CallBase. NFC" There were extra changes that weren't supposed to be in there This reverts commit `b91f78db37`.	2020-04-17 10:11:22 -07:00
Craig Topper	b91f78db37	[CallSite removal][MemCpyOptimizer] Replace CallSite with CallBase. NFC There are also some adjustments to use MaybeAlign in here due to CallBase::getParamAlignment() being deprecated. It would be cleaner if getOrEnforceKnownAlignment was migrated to Align/MaybeAlign. Differential Revision: https://reviews.llvm.org/D78345	2020-04-17 10:07:20 -07:00
Nikita Popov	24cae17c28	[MI] Reduce MachineInstr size (NFC) Move CapOperands next to AsmPrinterFlags, to reduce size of MachineInstr by 8 bytes.	2020-04-17 18:30:56 +02:00
Nikita Popov	0f1678cd08	[PredicateInfo] Remove unused member (NFC) PredicateInfo takes up a large amount of memory during IPSCCP with many functions. And a large part of that space seems to be going completely to waste here...	2020-04-17 18:30:36 +02:00
Stefan Pintilie	b771c4a842	[PowerPC][Future] More support for PCRel addressing for global values Add initial support for PC Relative addressing for global values that require GOT indirect addressing. This patch adds PCRelative support for global addresses that may not be known at link time and may require access through the GOT. Differential Revision: https://reviews.llvm.org/D76064	2020-04-17 11:06:13 -05:00
Dominik Montada	55e3a7c6b2	[GlobalISel][AMDGPU] add legalization for G_FREEZE Summary: Copy the legalization rules from SelectionDAG: -widenScalar using anyext -narrowScalar using intermediate merges -scalarize/fewerElements using unmerge -moreElements using G_IMPLICIT_DEF and insert Add G_FREEZE legalization actions to AMDGPULegalizerInfo. Use the same legalization actions as G_IMPLICIT_DEF. Depends on D77795. Reviewers: dsanders, arsenm, aqjune, aditya_nandakumar, t.p.northover, lebedev.ri, paquette, aemerson Reviewed By: arsenm Subscribers: kzhuravl, yaxunl, dstuttard, tpr, t-tye, jvesely, nhaehnle, kerbowa, wdng, rovka, hiraditya, volkan, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78092	2020-04-17 16:44:46 +02:00
Florian Hahn	c245d3e033	[ValueLattice] Steal bits from Tag to track range extensions (NFC). Users of ValueLatticeElement currently have to ensure constant ranges are not extended indefinitely. For example, in SCCP, mergeIn goes to overdefined if a constantrange value is repeatedly merged with larger constantranges. This is a simple form of widening. In some cases, this leads to an unnecessary loss of information and things can be improved by allowing a small number of extensions in the hope that a fixed point is reached after a small number of steps. To make better decisions about widening, it is helpful to keep track of the number of range extensions. That state is tied directly to a concrete ValueLatticeElement and some unused bits in the class can be used. The current patch preserves the existing behavior by default: CheckWiden defaults to false and if CheckWiden is true, a single change to the range is allowed. Follow-up patches will slightly increase the threshold for widening. Reviewers: efriedma, davide, mssimpso Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D78145	2020-04-17 15:38:23 +01:00
jasonliu	77618cc237	[XCOFF][AIX] Fix getSymbol to return the correct qualname when necessary Summary: AIX symbols have a qualname and an unqualified name. The stock getSymbol could only return the unqualified name, which led us to patch many callers (lowerConstant, getMCSymbolForTOCPseudoMO). So we should address this problem on the callee side (getSymbol) and clean up the caller side instead. Note: this is a "mostly" NFC patch, with a fix for the original lowerConstant behavior. Differential Revision: https://reviews.llvm.org/D78045	2020-04-17 13:45:14 +00:00
Benjamin Kramer	166467e822	[VectorUtils] Create shufflevector masks as int vectors instead of Constants No functionality change intended.	2020-04-17 15:28:00 +02:00
Simon Pilgrim	de94715b64	UnifyFunctionExitNodes.h - remove unnecessary PassRegistry.h include. NFC	2020-04-17 12:17:59 +01:00
Simon Pilgrim	30725c2b35	SSAUpdaterBulk.h - remove unnecessary SmallPtrSet.h include. NFC	2020-04-17 12:17:59 +01:00
Simon Pilgrim	2c16ab746e	Scalar.h - remove unused forward declarations. NFC.	2020-04-17 12:17:58 +01:00
Max Kazantsev	72c13446ce	[NFC] Add missing 'const' notion to LCSSA-related functions These functions don't really do any changes to loop info or dominator tree. We should state this explicitly using 'const'.	2020-04-17 17:49:34 +07:00
Fraser Cormack	c819ef9653	Provide operand indices to adjustSchedDependency This allows targets to know exactly which operands are contributing to the dependency, which is required for targets with per-operand scheduling models. Differential Revision: https://reviews.llvm.org/D77135	2020-04-17 11:08:44 +01:00
Simon Pilgrim	bcd7f77713	MCObjectWriter.h - remove Endian.h/EndianStream.h/raw_ostream.h includes. NFC Push these includes down to the writers that actually need them, a number of which were implicitly relying on the MCObjectWriter.h.	2020-04-17 10:44:08 +01:00
Simon Pilgrim	29bfcbe832	ConstantPools.h - remove unused DenseMap.h include. NFC.	2020-04-17 10:44:07 +01:00
Simon Pilgrim	711cdd474f	MCStreamer.h - remove unused llvm::MCCodePaddingContext forward declaration. NFC.	2020-04-17 10:44:07 +01:00
Simon Pilgrim	a0ae3d55ae	MCWasmStreamer.h - cleanup includes and forward declarations. NFC. Remove unnecessary SmallPtrSet.h/SectionKind.h includes Remove unused MCAssembler/raw_ostream forward declarations	2020-04-17 10:44:07 +01:00
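
Note on `b8d08e961d` (SmallVector grow edge case): the message describes push_back() calling grow() with the default MinSize of zero on a vector that is already at its maximum size, so grow() cannot deliver the promised extra element. The following is a minimal sketch of that situation, not the actual patch. `TinyVec`, its 8-bit size type, and the choice to throw `std::length_error` are all assumptions made so that the limit can be reached cheaply; the real SmallVector uses 32-bit or word-size counters and its own error handling.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <limits>
#include <stdexcept>

// Hypothetical toy vector, not llvm::SmallVector. The size type is uint8_t so
// the maximum (255 elements) is easy to reach in a test.
class TinyVec {
  using SizeT = std::uint8_t;
  static constexpr std::size_t MaxSize = std::numeric_limits<SizeT>::max();

  char *Data = nullptr;
  SizeT Size = 0;
  SizeT Capacity = 0;

public:
  TinyVec() = default;
  TinyVec(const TinyVec &) = delete;
  TinyVec &operator=(const TinyVec &) = delete;
  ~TinyVec() { delete[] Data; }

  std::size_t size() const { return Size; }
  std::size_t capacity() const { return Capacity; }

  // Roughly double the capacity, guaranteeing room for at least MinSize
  // elements. Throws instead of returning when that guarantee cannot be met.
  void grow(std::size_t MinSize = 0) {
    if (MinSize > MaxSize)
      throw std::length_error("requested size exceeds the size type");
    // The default MinSize of 0 still promises one more element, so a vector
    // that is already at MaxSize has to be rejected here.
    if (Capacity == MaxSize)
      throw std::length_error("vector is already at maximum capacity");

    std::size_t NewCap = 2 * std::size_t(Capacity) + 1;
    if (NewCap < MinSize)
      NewCap = MinSize;
    if (NewCap > MaxSize)
      NewCap = MaxSize;

    char *NewData = new char[NewCap];
    for (std::size_t I = 0; I < Size; ++I)
      NewData[I] = Data[I];
    delete[] Data;
    Data = NewData;
    Capacity = static_cast<SizeT>(NewCap);
  }

  void push_back(char C) {
    if (Size >= Capacity)
      grow(); // Either makes room or throws; never returns without space.
    Data[Size] = C;
    Size = static_cast<SizeT>(Size + 1);
  }
};

// Fills a TinyVec to its limit and reports whether one more push_back throws
// while leaving the size unchanged.
bool overflowIsReported() {
  TinyVec V;
  for (int I = 0; I < 255; ++I)
    V.push_back('x');
  try {
    V.push_back('y');
  } catch (const std::length_error &) {
    return V.size() == 255;
  }
  return false;
}
```

Without the capacity check in grow(), the clamped new capacity would equal the old one and push_back() would write one element past the buffer, which is the silent corruption the message warns about.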


40625 Commits