llvm-project

Commit Graph

Author	SHA1	Message	Date
Kazushi (Jam) Marukawa	a72d384249	[VE] Change the default type of v64 register class Change the default type of v64 register class from v512i32 to v256f64. Add a regression test also. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D91301	2020-11-12 19:07:07 +09:00
Julian Gross	0313e3bfe6	[MLIR] Added documentation and manual to use bufferization features. Added documentation about the bufferization features. Furthermore, the usage of pre- and post-processing is described. This also includes information about optimization functionalities. Differential Revision: https://reviews.llvm.org/D90675	2020-11-12 10:43:05 +01:00
Kadir Cetinkaya	6484aa1add	[clangd] Simplify relations deserialization loop, NFC.	2020-11-12 10:33:39 +01:00
David Sherwood	3225fcf11e	[SVE] Deal with SVE tuple call arguments correctly when running out of registers When passing SVE types as arguments to function calls we can run out of hardware SVE registers. This is normally fine, since we switch to an indirect mode where we pass a pointer to a SVE stack object in a GPR. However, if we switch over part-way through processing a SVE tuple then part of it will be in registers and the other part will be on the stack. I've fixed this by ensuring that: 1. When we don't have enough registers to allocate the whole block we mark any remaining SVE registers temporarily as allocated. 2. We temporarily remove the InConsecutiveRegs flags from the last tuple part argument and reinvoke the autogenerated calling convention handler. Doing this prevents the code from entering an infinite recursion and, in combination with 1), ensures we switch over to the Indirect mode. 3. After allocating a GPR register for the pointer to the tuple we then deallocate any SVE registers we marked as allocated in 1). We also set the InConsecutiveRegs flags back how they were before. 4. I've changed the AArch64ISelLowering LowerCALL and LowerFormalArguments functions to detect the start of a tuple, which involves allocating a single stack object and doing the correct numbers of legal loads and stores. Differential Revision: https://reviews.llvm.org/D90219	2020-11-12 08:41:50 +00:00
David Green	1551d8dd48	[ARM] Remove unused check labels. NFC	2020-11-12 08:37:46 +00:00
Marek Kurdej	e331dfea70	[libc++] [P0340] [C++20] Update status page. NFC. This was implemented in 410b650e674496e61506fa88f3026759b8759d0f: "Implement P0340R3: Make 'underlying_type' SFINAE-friendly. Reviewed as https://reviews.llvm.org/D63574 llvm-svn: 364094"	2020-11-12 09:32:29 +01:00
MaheshRavishankar	5ca20851e4	[mlir][Linalg] Improve the logic to perform tile and fuse with better dependence tracking. This change does two main things 1) An operation might have multiple dependences to the same producer. Not tracking them correctly can result in incorrect code generation with fusion. To rectify this the dependence tracking needs to also have the operand number in the consumer. 2) Improve the logic used to find the fused loops making it easier to follow. The only constraint for fusion is that linalg ops (on buffers) have update semantics for the result. Fusion should be such that only one iteration of the fused loop (which is also a tiled loop) must touch only one (disjoint) tile of the output. This could be relaxed by allowing for recomputation that is the default when operands are tensors, or can be made legal with promotion of the fused view (in future). Differential Revision: https://reviews.llvm.org/D90579	2020-11-12 00:25:24 -08:00
Amara Emerson	ad376657c1	[AArch64][GlobalISel] Optimize G_PTR_ADD with a negated offset to be a G_SUB.	2020-11-11 22:46:53 -08:00
Max Kazantsev	2734a9ebf4	[NFC][SCEV] Generalize monotonicity check for full and limited iteration space A piece of logic of `isLoopInvariantExitCondDuringFirstIterations` is actually a generalized predicate monotonicity check. This patch moves it into the corresponding method and generalizes it a bit. Differential Revision: https://reviews.llvm.org/D90395 Reviewed By: apilipenko	2020-11-12 12:37:07 +07:00
Chuanqi Xu	cd89c4dbdd	[NFC][coroutines] remove unused argument in SemaCoroutine Test plan: check-llvm, check-clang Reviewers: lxfind, junparser Differential Revision: https://reviews.llvm.org/D91243	2020-11-12 13:22:20 +08:00
Xun Li	94a45a8098	Revert "[Coroutine] Allocas used by StoreInst does not always escape" This reverts commit `8bc7b9278e`, which landed by accident.	2020-11-11 21:09:39 -08:00
Aart Bik	0846659648	[mlir][sparse] export sparse tensor runtime support through header file Exposing the C versions of the methods of the sparse runtime support lib through header files will enable using the same methods in an MLIR program as well as a C++ program, which will simplify future benchmarking comparisons (e.g. comparing MLIR generated code with eigen for Matrix Market sparse matrices). Reviewed By: penpornk Differential Revision: https://reviews.llvm.org/D91316	2020-11-11 21:03:39 -08:00
Max Kazantsev	d6dd938589	[IndVars] IV user should not prevent use widening Sometimes the instruction we are trying to widen is used by the IV (which means the instruction is the IV increment). Currently this may prevent its widening. We should ignore such user because it will be dead once the transform is done anyways. Differential Revision: https://reviews.llvm.org/D90920 Reviewed By: fhahn	2020-11-12 12:02:01 +07:00
Xun Li	8bc7b9278e	[Coroutine] Allocas used by StoreInst does not always escape In the existing logic, for a given alloca, as long as its pointer value is stored into another location, it's considered as escaped. This is a bit too conservative. Specifically, in non-optimized build mode, it's common to have patterns of code that first store an alloca somewhere and then load it right away. These uses should be handled without conservatively marking them escaped. This patch tracks how the memory location where an alloca pointer is stored into is being used. As long as we only try to load from that location and nothing else, we can still consider the original alloca not escaping and keep it on the stack instead of putting it on the frame. Differential Revision: https://reviews.llvm.org/D91305	2020-11-11 20:53:51 -08:00
Max Kazantsev	2e01ceafaa	[IndVars] Recognize 'sub nuw' expressed as 'add' for widening InstCombine canonicalizes 'sub nuw' instructions to 'add' without the `nuw` flag. The typical case where we see it is decrementing induction variables. For them, IndVars fails to prove that it's legal to widen them, and inserts unprofitable `zext`'s. This patch adds recognition of such pattern using SCEV. Differential Revision: https://reviews.llvm.org/D89550 Reviewed By: fhahn, skatkov	2020-11-12 10:51:29 +07:00
Max Kazantsev	813781a923	[Test] Add Check statement	2020-11-12 10:47:34 +07:00
Richard Smith	2d4035e493	Fix structural comparison of template template arguments to compare the right union member. Should fix the armv8 buildbot.	2020-11-11 19:15:21 -08:00
Qiu Chaofan	2abc33683b	[PowerPC] [Clang] Define macros to identify quad-fp semantics We have option -mabi=ieeelongdouble to set current long double to IEEEquad semantics. Like what GCC does, we need to define __LONG_DOUBLE_IEEE128__ macro in this case, and __LONG_DOUBLE_IBM128__ if using PPCDoubleDouble. Reviewed By: steven.zhang Differential Revision: https://reviews.llvm.org/D90208	2020-11-12 10:26:13 +08:00
Lang Hames	c7e64df445	[docs] Fix formatting, clarify comment in ORCv2 doc	2020-11-12 13:11:01 +11:00
Richard Smith	856fd98a17	Generalize regex matching std::string variants to compensate for recent improvements to Clang's type printing.	2020-11-11 17:55:47 -08:00
Aart Bik	e1dbc25ee2	[mlir][sparse] integrate sparse annotation into generic linalg op This CL integrates the new sparse annotations (hereto merely added as fully transparent attributes) more tightly to the generic linalg op in order to add verification of the annotations' consistency as well as to make other passes more aware of their presence (in the long run, rewriting rules must preserve the integrity of the annotations). Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D91224	2020-11-11 17:26:30 -08:00
Joseph Huber	292e898c16	[OpenMP] Begin Adding OpenMP Tool to Gather OpenMP Information Summary: This patch begins to add support for a set of scripts that can be used to get information from OpenMP programs to better describe problems and eventually show the data to the user in formatted output. Right now the only support is for formatting the register and memory usage reports from ptxas and nvlink. This is simply done as a wrapper around clang and clang++. Reviewers: jdoerfert Differential Revision: https://reviews.llvm.org/D91085	2020-11-11 20:00:37 -05:00
Joachim Protze	25b3164bfb	[OpenMP][Tools][Tests] Fix ompt multiplex test With `6213ed0` the master callback was renamed to masked. The multiplex tests must check for masked now.	2020-11-12 01:43:49 +01:00
peter klausler	cc575dd2ce	[flang][MSVC] Use list<Message> rather than forward_list<> in Messages The implementation of Messages with forward_list<> makes some nonstandard assumptions about the validity of iterators that don't hold up with MSVC's implementation. Use list<> instead. The measured performance is comparable. This change obviated a distinction between two member functions of Messages, and the uses of one have been replaced with calls to the other. Similar usage in CharBuffer was also replaced for consistency. Differential revision: https://reviews.llvm.org/D91210	2020-11-11 16:38:38 -08:00
Sam McCall	1630e50874	[Syntax] Tablegen literal expressions. Non-mechanical changes: - Added FIXME to StringLiteral to cover multi-token string literals. - LiteralExpression::getLiteralToken() is gone. (It was never called) This is because we don't codegen methods in Alternatives It's conceptually suspect if we consider multi-token string literals, though. Differential Revision: https://reviews.llvm.org/D91277	2020-11-12 01:26:02 +01:00
Dave Lee	0783ad9e6a	[lldb] Switch expect to runCmd in TestRecursiveTypes (NFC) Following discussion in D91193, a change made in D88792 was not quite right. This restores the message argument, and switches from `expect` to `runCmd`. Differential Revision: https://reviews.llvm.org/D91206	2020-11-11 16:17:38 -08:00
Lang Hames	48ee1ea05c	[docs] Fix formatting in ORCv2.rst. Bold and fixed-width do not appear to mix well.	2020-11-12 11:08:58 +11:00
Lang Hames	984e87923f	[docs] Update ORCv2 design doc. Fixes some formatting and wording, and adds a roadmap section.	2020-11-12 10:33:29 +11:00
Arnold Schwaighofer	431337662e	[coro] Async coroutines: Allow more than 3 arguments in the dispatch function We need to be able to call function pointers. Inline the dispatch function. Also inline the context projection function. Transfer debug locations from the suspend point to the inlined functions. Use the function argument index instead of the function argument in coro.id.async. This solves any spurious use issues. Coerce the arguments of the tail call function at a suspend point. The LLVM optimizer seems to drop casts leading to a vararg intrinsic. rdar://70097093 Differential Revision: https://reviews.llvm.org/D91098	2020-11-11 15:25:28 -08:00
Arthur Eubanks	b6ccff3d5f	[NewPM] Provide method to run all pipeline callbacks, used for -O0 Some targets may add required passes via TargetMachine::registerPassBuilderCallbacks(). We need to run those even under -O0. As an example, BPFTargetMachine adds BPFAbstractMemberAccessPass, a required pass. This also allows us to clean up BackendUtil.cpp (and out-of-tree Rust usage of the NPM) by allowing us to share added passes like coroutines and sanitizers between -O0 and other optimization levels. Since callbacks may end up not adding passes, we need to check if the pass managers are empty before adding them, so PassManager now has an isEmpty() function. For example, polly adds callbacks but doesn't always add passes in those callbacks, so this is necessary to keep -debug-pass-manager tests' output from changing depending on if polly is enabled or not. Tests are a continuation of those added in https://reviews.llvm.org/D89083. Reviewed By: asbirlea, Meinersbur Differential Revision: https://reviews.llvm.org/D89158	2020-11-11 15:10:27 -08:00
Richard Smith	5f12f4ff90	Suppress printing of inline namespace names in diagnostics by default, except where they are necessary to disambiguate the target. This substantially improves diagnostics from the standard library, which are otherwise full of `::__1::` noise.	2020-11-11 15:05:51 -08:00
Richard Smith	e7f3e2103c	Suppress printing template arguments that match default template arguments of types by default. This somewhat improves the worst-case printing of types like std::string, std::vector, etc., where many irrelevant default arguments can be included in the type as printed if we've lost the type sugar.	2020-11-11 15:05:51 -08:00
Sam McCall	686d8a0911	[clangd] Add index server request logging - Add verbose logging of payloads - Add public logging of request summaries - fix non-logging of messages in request scopes (oops!) - add test for public/non-public logging, extending pipeline_helper a bit. We've accumulated quite a lot of duplication in the request handlers by now. I should factor that out, but not in this patch... Differential Revision: https://reviews.llvm.org/D90654	2020-11-11 23:58:18 +01:00
Ruslan Arutyunyan	e5ec94a1a0	[libc++] Implement P0919R3: heterogeneous lookup for unordered containers Implement heterogeneous lookup for unordered containers, including the refinement from P1690R1. Differential Revision: https://reviews.llvm.org/D87171	2020-11-11 17:44:42 -05:00
Baptiste Saleil	37c4ac8545	[PowerPC] Accumulator/Unprimed Accumulator register copy, spill and restore This patch adds support for accumulator/unprimed accumulator register copy, spill and restore for MMA. Authored By: Baptiste Saleil Reviewed By: #powerpc, bsaleil, amyk Differential Revision: https://reviews.llvm.org/D90616	2020-11-11 16:23:45 -06:00
Michael Kruse	c8a0e27cfb	[Polly][OpTree] Fix mid-processing change of access kind. Operand tree forwarding can cause the change of an access kind; in particular change from a scalar kind to an array kind if the scalar dependency is not necessary. Such an access cannot and doesn't need to be forwarded anymore. Fixes llvm.org/PR48034	2020-11-11 16:21:48 -06:00
Michael Kruse	c1cf51e777	[Polly][OpTree] Better report applied changes. Print to dbgs() any taken action. Also, read-only scalars do not require any action unless -polly-analyze-read-only-scalars=true is used. Better reflect this by using ForwardingAction::triviallyForwardable and thus not bumping the statistics.	2020-11-11 16:21:48 -06:00
Duncan P. N. Exon Smith	4c55c3b66d	Frontend: Change ComputePreambleBounds to take MemoryBufferRef, NFC Avoid requiring an actual MemoryBuffer in ComputePreambleBounds, when a MemoryBufferRef will do just fine. Differential Revision: https://reviews.llvm.org/D90890	2020-11-11 17:19:51 -05:00
Sam McCall	3c09103291	[clangd] Sanity-check array sizes read from disk before allocating them. Previously a corrupted index shard could cause us to resize arrays to an arbitrary int32. This tends to be a huge number, and can render the system unresponsive. Instead, cap this at the amount of data that might reasonably be read (e.g. the #bytes in the file). If the specified length is more than that, assume the data is corrupt. Differential Revision: https://reviews.llvm.org/D91258	2020-11-11 23:16:53 +01:00
Louis Dionne	69ca17a92c	[libc++] NFC: Simplify include of <cstdlib> We include <exception>, which includes <cstdlib> unconditionally anyway.	2020-11-11 17:04:32 -05:00
Louis Dionne	a7b6574144	[libc++abi] Reuse libc++'s refstring.h header instead of copying it This has been a long-standing TODO item, however we have now been requiring a monorepo layout to build libc++ and libc++abi for a while now. Hence, we can fix this code duplication issue now. Note that it's still not super pretty to reach into libc++ to include headers, but it's better than having duplicated code which can get out of sync.	2020-11-11 16:58:32 -05:00
peter klausler	67b13e9785	[flang] Fix CheckSpecificationExpr handling of associated names Avoid a spurious error message about a dummy procedure reference in a specification expression by restructuring the handling of use-associated and host-associated symbols. Updated to fix a circular dependence between shared library binaries that was introduced by the original patch. Differential revision: https://reviews.llvm.org/D91286	2020-11-11 13:51:42 -08:00
Arthur Eubanks	d9cbceb041	[CGSCC][Inliner] Handle new non-trivial edges in updateCGAndAnalysisManagerForPass Previously the inliner did a bit of a hack by adding ref edges for all new edges introduced by performing an inline before calling updateCGAndAnalysisManagerForPass(). This was because updateCGAndAnalysisManagerForPass() didn't handle new non-trivial call edges. This adds handling of non-trivial call edges to updateCGAndAnalysisManagerForPass(). The inliner called updateCGAndAnalysisManagerForFunctionPass() since it was handling adding newly introduced edges (so updateCGAndAnalysisManagerForPass() would only have to handle promotion), but now it needs to call updateCGAndAnalysisManagerForCGSCCPass() since updateCGAndAnalysisManagerForPass() is now handling the new call edges and function passes cannot add new edges. We follow the previous path of adding trivial ref edges then letting promotion handle changing the ref edges to call edges and the CGSCC updates. So this still does not allow adding call edges that result in an addition of a non-trivial ref edge. This is in preparation for better detecting devirtualization. Previously since the inliner itself would add ref edges, updateCGAndAnalysisManagerForPass() would think that promotion and thus devirtualization had happened after any sort of inlining. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D91046	2020-11-11 13:43:49 -08:00
Florian Hahn	1d5daed199	[Matrix] Update mangling to use parameterized vendor ext type syntax. The Itanium CXX ABI grammar has been extended to support parameterized vendor extended types [1]. This patch updates Clang's mangling for matrix types to use the new extension. [1] `b359d28971` Reviewed By: rjmccall Differential Revision: https://reviews.llvm.org/D91253	2020-11-11 21:39:22 +00:00
Louis Dionne	d4a1e03c5f	[libc++] NFC: Synchronize libc++abi and libc++ new definitions Some changes were made to the libc++abi new/delete definitions, but they were not copied back to the libc++ definition. It sucks that we have this duplication, but for now at least let's keep them in sync.	2020-11-11 16:35:25 -05:00
Jessica Paquette	7a70a2f04d	[AArch64][GlobalISel] Mark G_FCONSTANT as legal when there is full fp16 support When there is full fp16 support, there is no reason to widen 16-bit G_FCONSTANTs to 32 bits. Mark them as legal in this case. Also, we currently import a pattern for materializing a 16-bit 0.0. Add a testcase showing we select it. (All other 16-bit G_FCONSTANTS are not yet selected.) Differential Revision: https://reviews.llvm.org/D89164	2020-11-11 13:25:11 -08:00
Kazushi (Jam) Marukawa	6e0ae20f3b	[VE] Support vector register in inline asm Support a vector register constraint in inline asm of clang. Add a regression test also. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D91251	2020-11-12 06:18:35 +09:00
Duncan P. N. Exon Smith	4e9af3d478	Frontend: Skip namespace around createVFSFromCompilerInvocation definition, NFC Qualify definitions with `clang::` rather than opening/closing a namespace. Differential Revision: https://reviews.llvm.org/D90957	2020-11-11 16:15:06 -05:00
peter klausler	b781a04f6a	[flang] Allow labels on END statements. F18 clause 5.3.3 explicitly allows labels on program unit END statements. Label resolution code accounts for this for singleton program units, but incorrectly generates an error for host subprograms with internal subprograms. subroutine s(n) call s1(n) if (n == 0) goto 88 ! incorrect error print, 's' contains subroutine s1(n) if (n == 0) goto 77 ! ok print, 's1' 77 end subroutine s1 88 end Label resolution code makes a sequential pass over an entire file to collect label information for all subprograms, followed by a pass through that information for semantics checks. The problem is that END statements may be separated from prior subprogram code by internal subprogram definitions, so an END label can be associated with the wrong subprogram. There are several ways to fix this. Labels are always local to a subprogram. So the two separate passes over the entire file could probably instead be interleaved to perform analysis on a subprogram as soon as the end of the subprogram is reached, using a small stack. The stack structure would account for the "split" code case. This might work. It is possible that there is some not otherwise apparent advantage to the current full-file pass design. The parse tree has productions that provide access to a subprogram END statement "in advance". An alternative is to access this information to solve the problem. This PR implements this latter option. Differential revision: https://reviews.llvm.org/D91217	2020-11-11 13:06:19 -08:00
Jianzhou Zhao	0dd87825db	Add a flag to control whether to propagate labels from condition values to results Before the change, DFSan always does the propagation. W/o origin tracking, it is harder to understand such flows. After the change, the flag is off by default. Reviewed-by: morehouse Differential Revision: https://reviews.llvm.org/D91234	2020-11-11 20:41:42 +00:00
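
The bounds check described in commit 3c09103291 ([clangd] Sanity-check array sizes read from disk before allocating them) can be illustrated with a minimal sketch. This is not clangd's actual reader: `Reader`, `readU32`, and `readArray` are hypothetical names, and the little-endian, count-prefixed layout of 4-byte elements is an assumption made for the example. The idea is only that an element count read from untrusted data is rejected when it exceeds what the remaining bytes could possibly hold, before any allocation happens.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical cursor over an in-memory copy of a file (not clangd's reader).
struct Reader {
  const unsigned char *Data;
  std::size_t Size;
  std::size_t Pos;
  bool Err;

  Reader(const unsigned char *D, std::size_t S)
      : Data(D), Size(S), Pos(0), Err(false) {}

  // Number of unread bytes left in the buffer.
  std::size_t remaining() const {
    assert(Pos <= Size && "cursor ran past the end of the buffer");
    return Size - Pos;
  }

  // Reads a little-endian 32-bit value; sets Err and returns 0 on underflow.
  std::uint32_t readU32() {
    if (remaining() < 4) {
      Err = true;
      Pos = Size;
      return 0;
    }
    std::uint32_t V = static_cast<std::uint32_t>(Data[Pos]) |
                      (static_cast<std::uint32_t>(Data[Pos + 1]) << 8) |
                      (static_cast<std::uint32_t>(Data[Pos + 2]) << 16) |
                      (static_cast<std::uint32_t>(Data[Pos + 3]) << 24);
    Pos += 4;
    return V;
  }
};

// Reads a count-prefixed array of 32-bit values (assumed layout).
// Returns false and leaves Out untouched if the count cannot be valid.
bool readArray(Reader &R, std::vector<std::uint32_t> &Out) {
  std::uint32_t Count = R.readU32();
  if (R.Err)
    return false;
  // Each element occupies 4 bytes, so a larger count means corrupt data.
  // Checking before resize() avoids a huge allocation from a bogus length.
  if (Count > R.remaining() / 4) {
    R.Err = true;
    return false;
  }
  Out.resize(Count);
  for (std::size_t I = 0; I < Out.size(); ++I)
    Out[I] = R.readU32();
  return !R.Err;
}
```

The commit message mentions capping at "the #bytes in the file"; the sketch uses the slightly tighter bound of remaining bytes divided by the element size, which is a choice made here for illustration rather than something the commit states.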

371905 Commits All Branches Search