llvm-project

Commit Graph

Author	SHA1	Message	Date
Jay Foad	aa1e5fbc9b	[AMDGPU] Fix permissions on test files	2022-02-24 12:17:54 +00:00
Florian Hahn	0c5ea01b20	[AArch64] Add vector select test showing redundant operations. The tests show sub-optimal lowering of extend/cmp/select chains starting with v16i8 vectors.	2022-02-24 12:12:21 +00:00
Shao-Ce SUN	78b5f0fb05	[NFC][RISCV] Reuse ISD::NodeType in float extension Reviewed By: asb Differential Revision: https://reviews.llvm.org/D120412	2022-02-24 19:57:55 +08:00
Sven van Haastregt	88182e2dfd	[OpenCL] opencl-c.h: remove arg names for image builtins This simplifies completeness comparisons against OpenCLBuiltins.td and also makes the header no longer "claim" the identifiers "image", "image_array", "coord", "sampler", "sample", "gradientX", "gradientY", "lod", and "color". Continues the direction set out in D119560.	2022-02-24 11:52:32 +00:00
Pavel Labath	a85d3b66cb	[lldb] Fix macos build for D120425	2022-02-24 12:47:43 +01:00
Javier Setoain	cd0d21b47b	[mlir][LLVM] Allow scalable vectors in ShuffleVectorOp The current implementation of ShuffleVectorOp assumes all vectors are scalable. LLVM IR allows shufflevector operations on scalable vectors, and the current translation between LLVM Dialect and LLVM IR does the rigth thing when the shuffle mask is all zeroes. This is required to do a splat operation on a scalable vector, but it doesn't make sense for scalable vectors outside of that operation, i.e.: with non-all zero masks. Differential Revision: https://reviews.llvm.org/D118371	2022-02-24 11:24:34 +00:00
Simon Pilgrim	e41a138520	[X86] LowerShiftByScalarVariable - use getSplatSourceVector for vXi8 shift expansion Using getSplatValue causes poor codegen due to not always being able to remove the EXTRACT_VECTOR_ELT created inside getSplatValue. The vXi16 shifts/rotates are still showing occasional regressions but vXi8 is a definite improvement.	2022-02-24 11:24:06 +00:00
Nikita Popov	c7fe6f9c92	Revert "[RISCV] add the MC layer support of Zfinx extension" This reverts commit `7798ecca9c`. As reported in https://reviews.llvm.org/D93298#3331641 and following, this causes assertion failures with inline assembly.	2022-02-24 12:14:31 +01:00
Shao-Ce SUN	a8b4b9104c	[NFC][clang] Simplify `isOneOf` function Reviewed By: tmatheson Differential Revision: https://reviews.llvm.org/D117740	2022-02-24 19:12:34 +08:00
Pavel Labath	b5eeb8873a	[lldb] One more fix for the MonitorChildProcess patch (D120425)	2022-02-24 12:06:42 +01:00
Matthias Springer	25bc684603	[mlir][linalg][bufferize] Always bufferize in-place with "out" operands by default In D115022, we introduced an optimization where OpResults of a `linalg.generic` may bufferize in-place with an "in" OpOperand if the corresponding "out" OpOperand is not used in the computation. This optimization can lead to unexpected behavior if the newly chosen OpOperand is in the same alias set as another OpOperand (that is used in the computation). In that case, the newly chosen OpOperand must bufferize out-of-place. This can be confusing to users, as always choosing the "out" OpOperand (regardless of whether it is used) would be expected when having the notion of "destination-passing style" in mind. With this change, we go back to always bufferizing in-place with "out" OpOperands by default, but letting users override the behavior with a bufferization option. Differential Revision: https://reviews.llvm.org/D120182	2022-02-24 19:58:05 +09:00
Pavel Labath	c64dbb66d9	[lldb] Fix windows build for D120425	2022-02-24 11:50:54 +01:00
Andrzej Warzynski	19b6e70554	[flang][docs] Update the top "doc" page * Added a note about "Classic Flang" * sorted entries in ToC Differential Revision: https://reviews.llvm.org/D120067	2022-02-24 10:41:53 +00:00
Pavel Labath	12c9c4a885	[lldb/host] Remove monitor_signals argument from process monitoring functions All current callers set the argument to false. monitor_signals=true used to be used in the Process plugins (which needed to know when the debugged process gets a signal), but this implementation has several serious issues, which means that individual process plugins now orchestrate the monitoring of debugged processes themselves. This allows us to simplify the implementation (no need to play with process groups), and the interface (we only catch fatal events, so the callback is always called just once). Differential Revision: https://reviews.llvm.org/D120425	2022-02-24 11:12:59 +01:00
Benjamin Kramer	fb3a2d45cd	[Sema] Silence unused variable warning in Release builds. NFC.	2022-02-24 11:00:52 +01:00
rkayaith	e9db306dcd	[mlir][python] Support more types in IntegerAttr.value Previously only accessing values for `index` and signless int types would work; signed and unsigned ints would hit an assert in `IntegerAttr::getInt`. This exposes `IntegerAttr::get{S,U}Int` to the C API and calls the appropriate function from the python bindings. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D120194	2022-02-24 10:26:31 +01:00
Marek Kurdej	bfb4afee74	[clang-format] Avoid inserting space after C++ casts. Fixes https://github.com/llvm/llvm-project/issues/53876. This is a solution for standard C++ casts: const_cast, dynamic_cast, reinterpret_cast, static_cast. A general approach handling all possible casts is not possible without semantic information. Consider the code: ``` static_cast<T>(function_pointer_variable)(arguments); ``` vs. ``` some_return_type<T> (function_pointer_variable)(parameters); // Later used as: function_pointer_variable = &some_function; return function_pointer_variable(args); ``` In the latter case, it's not a cast but a variable declaration of a pointer to function. Without knowing what `some_return_type<T>` is (and clang-format does not know it), it's hard to distinguish between the two cases. Theoretically, one could check whether "parameters" are types (not a cast) and "arguments" are value/expressions (a cast), but that might be inefficient (needs lots of lookahead). Reviewed By: MyDeveloperDay, HazardyKnusperkeks, owenpan Differential Revision: https://reviews.llvm.org/D120140	2022-02-24 10:21:02 +01:00
Xiaodong Liu	56b5f00186	[sancov][sanitizer-common] Correct sanitizer coverage point Sanitizer coverage point should be the previous instruction PC of the caller and the offset to the previous instruction might be different on each CPU architecture. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D119233	2022-02-24 17:16:45 +08:00
Marek Kurdej	46f6c834d9	[clang-format] Fix QualifierOrder breaking the code with requires clause. Fixes https://github.com/llvm/llvm-project/issues/53962. Given the config: ``` BasedOnStyle: LLVM QualifierAlignment: Custom QualifierOrder: ['constexpr', 'type'] ``` The code: ``` template <typename F> requires std::invocable<F> constexpr constructor(); ``` was incorrectly formatted to: ``` template <typename F> requires constexpr std::invocable<F> constructor(); ``` because we considered `std::invocable<F> constexpr` as a type, not recognising the requires clause. This patch avoids moving the qualifier across the boundary of the requires clause (checking `ClosesRequiresClause`). Reviewed By: HazardyKnusperkeks, owenpan Differential Revision: https://reviews.llvm.org/D120309	2022-02-24 10:16:10 +01:00
Sander de Smalen	ff3f3a54e2	[AArch64][AsmParser] Arch directives should set implied features. When assembling for example an SVE instruction with the `.arch +sve2` directive, +sve should be implied by setting +sve2, similar to what would happen if one would pass the mattr=+sve2 flag on the command-line. The AsmParser doesn't set the implied features, meaning that the SVE instruction does not assemble. This patch fixes that. Note that the same does not hold when disabling a feature. For example, +nosve2 does not imply +nosve. Reviewed By: c-rhodes Differential Revision: https://reviews.llvm.org/D120259	2022-02-24 09:15:17 +00:00
Luis Penagos	dbc4d281bd	[clang-format] Do not insert space after new/delete keywords in C function declarations Fixes https://github.com/llvm/llvm-project/issues/46915. Reviewed By: curdeius, HazardyKnusperkeks Differential Revision: https://reviews.llvm.org/D120374	2022-02-24 10:06:40 +01:00
Nikolas Klauser	de462a43d3	[libc++] Remove <chrono> include from <future> Reviewed By: Quuxplusone, #libc Spies: libcxx-commits Differential Revision: https://reviews.llvm.org/D120436	2022-02-24 10:05:51 +01:00
Iain Sandoe	69350e569d	[C++20][Modules][3/8] Initial handling for module partitions. This implements the parsing and recognition of module partition CMIs and removes the FIXMEs in the parser. Module partitions are recognised in the base computation of visibility, however additional amendments to visibility follow in subsequent patches. Differential Revision: https://reviews.llvm.org/D118586	2022-02-24 09:01:09 +00:00
lian wang	e1d4d1c242	[RISCV] Add schedule class for Zbm and Zbe extension Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D119805	2022-02-24 08:49:25 +00:00
lian wang	05e82be7ea	[RISCV][NFC] Remove useless intrinsic function declare in test of Zbp extension Reviewed By: benshi001 Differential Revision: https://reviews.llvm.org/D120393	2022-02-24 08:43:13 +00:00
Martin Storsjö	5333732fce	[libcxx] [test] Fix the get/put long_double_ru_RU tests on Apple platforms This fixes issue #45084 (https://llvm.org/PR45739). Remove unnecessary trailing spaces after the "RUB" international currency symbol (and after the plain number in some parts of the put_long_double test). Both of these test files are `XFAIL: netbsd \|\| linux \|\| LIBCXX-WINDOWS-FIXME`, and then have some of their test cases commented out when `__APPLE__`. This patch comments-in those test cases and adjusts them all to work on Apple, while leaving the test `XFAIL`ed on NetBSD, Linux, and Windows. Differential Revision: https://reviews.llvm.org/D120316	2022-02-24 10:21:36 +02:00
Fangrui Song	7fd60ee6e0	[analyzer] Fix -Wunused-lambda-capture in -DLLVM_ENABLE_ASSERTIONS=off builds	2022-02-24 00:13:13 -08:00
Nikita Popov	a266af7211	[InstCombine] Canonicalize SPF to min/max intrinsics Now that integer min/max intrinsics have good support in both InstCombine and other passes, start canonicalizing SPF min/max to intrinsic min/max. Once this sticks, we can stop matching SPF min/max in various places, and can remove hacks we have for preventing infinite loops and breaking of SPF canonicalization. Differential Revision: https://reviews.llvm.org/D98152	2022-02-24 09:01:20 +01:00
Nikita Popov	aa551ad198	Revert "[InstCombine] Remove one-use limitation from X-Y==0 fold" This reverts commit `65dc78d63e`. This caused a major code-size regression on tramp3d-v4, revert until I can investigate.	2022-02-24 08:50:40 +01:00
Adrian Kuegel	c1d36e2304	[mlir][SCF] Apply suggested ClangTidy fixes.	2022-02-24 08:23:24 +01:00
Maksim Panchenko	4101aa130a	[BOLT] Support PC-relative relocations with addends PC-relative memory operand could reference a different object from the one located at the target address, e.g. when a negative offset is used. Check relocations for the real referenced object. Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D120379	2022-02-23 22:54:42 -08:00
eopXD	8b62f19f27	[Driver][RISCV] Add missing rv64 test case Add missing test case. Thanks @rogfer01 for spotting this out. Reviewed By: asb Differential Revision: https://reviews.llvm.org/D120297	2022-02-23 22:53:42 -08:00
Chenbing.Zheng	2aac00e4a6	[RISCV] Add more tests for vcpop and vfirst with VL=0 Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D120300	2022-02-24 05:59:12 +00:00
Chenbing.Zheng	2ae92e19eb	[RISCV][NFC] Add helper function isVectorConfigInstr to reduce Repeated code. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D119924	2022-02-24 05:59:12 +00:00
Fangrui Song	9f9ac3464e	[ELF] Symbols.h: remove #include "InputFiles.h"	2022-02-23 21:36:45 -08:00
Fangrui Song	8ca46bba23	[ELF] Move isUsedInRegularObj assignment from ctor to call sites. NFC This removes the tricky `isUsedInRegularObj(!file \|\| file->kind() == InputFile::ObjKind)` and the copy from `Symbol::mergeProperties`.	2022-02-23 21:32:50 -08:00
Craig Topper	5b7ac107b1	[RISCV] Use SelectionDAG::getFreeze to simplify some code. NFC	2022-02-23 21:13:01 -08:00
Fangrui Song	00b6d2106b	[ELF][test] Avoid race on a.out	2022-02-23 20:48:49 -08:00
Fangrui Song	38fbedab32	[ELF] Don't rely on Symbols.h's transitive inclusion of InputFiles.h. NFC	2022-02-23 20:44:34 -08:00
Amir Ayupov	af6e66f44c	[BOLT][NFC] Report errors from RewriteInstance `discoverStorage` and `run` Further improve error handling in BOLT by reporting `RewriteInstance` errors in a library and fuzzer-friendly way instead of exiting. Follow-up to D119658 Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D120224	2022-02-23 20:42:39 -08:00
Fangrui Song	ba061713d3	[ELF] Move TLS mismatch error from Symbol::replace to postParse * detect `def_tls.o undef_nontls.o` violation * place error checking code (checking duplicate symbol) together * allow `--defsym tls1=tls2 def_tls.o` As a degraded error checking, `--defsym tls1=42` violation will not be detected.	2022-02-23 20:34:48 -08:00
V Donaldson	338b478e70	[flang] Do not print format tabs As an extension, tabs are accepted in a format, but should be skipped, not printed.	2022-02-23 20:03:10 -08:00
Yaxun (Sam) Liu	9d899d8f01	[HIP] Support `-fgpu-default-stream` Introduce -fgpu-default-stream={legacy\|per-thread} option to support per-thread default stream for HIP runtime. When -fgpu-default-stream=per-thread, HIP kernels are launched through hipLaunchKernel_spt instead of hipLaunchKernel. Also HIP_API_PER_THREAD_DEFAULT_STREAM=1 is defined by the preprocessor to enable other per-thread stream API's. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D120298	2022-02-23 22:28:29 -05:00
Fangrui Song	b01430a04f	[ELF] Don't rely on Symbols.h's transitive inclusion of InputFiles.h. NFC	2022-02-23 19:18:24 -08:00
Bixia Zheng	c8ae8cfb5d	[mlir][sparse][taco] Add support for float32. Previously, we only support float64. We now support float32 and float64. When constructing a tensor without providing a data type, the default is float32. Fix the tests to data type consistency. All PyTACO application tests now use float32 to match the default data type of TACO. Other tests may use float32 or float64. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D120356	2022-02-23 18:24:22 -08:00
Fangrui Song	47d18be58b	[ELF] Remove SharedSymbol::getFile. NFC Symbol.h depends on InputFiles.h. This change moves us toward dropping the weird dependency. The call sites will become slightly uglier (`cast<SharedFile>(s->file)`), but the compromise is acceptable.	2022-02-23 17:57:52 -08:00
Mircea Trofin	7e3606f43c	[ScalarEvolution] Control flag for nonstrict inequalities in finite loops D118090 causes a pretty significant (19%) regression in some Eigen benchmarks. Investigating is a bit time consuming as the compilation unit where this occurs is large. Rather than revert, this patch adds a flag controlling that behavior (enabled by default).	2022-02-23 17:56:35 -08:00
Aart Bik	652b39b46f	[mlir][sparse][linalg] add linalg rewriting specific to sparse tensors Now that sparse tensor types are first-class citizens and the sparse compiler is taking shape, it is time to make sure other compiler optimizations compose well with sparse tensors. Mostly, this should be completely transparent (i.e., dense and sparse take the same path). However, in some cases, optimizations only make sense in the context of sparse tensors. This is a first example of such an optimization, where fusing a sampled elt-wise multiplication only makes sense when the resulting kernel has a potential lower asymptotic complexity due to the sparsity. As an extreme example, running SDDMM with 1024x1024 matrices and a sparse sampling matrix with only two elements runs in 463.55ms in the unfused case but just 0.032ms in the fused case, with a speedup of 14485x that is only possible in the exciting world of sparse computations! Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D120429	2022-02-23 17:29:41 -08:00
Med Ismail Bennani	3e3e79a9e4	[lldb/test] Fix TestProgressReporting.py race issue with the event listener This patch is a follow-up of D120100 to address some feedbacks from @labath. This should mainly fix the race issue with the even listener by moving the listener setup to the main thread. This also changes the SBDebugger::GetProgressFromEvent SWIG binding arguments to be output only, so the user don't have to provide them. Finally, this updates the test to check it the out arguments are returned in a tuple and re-enables the test on all platforms. Differential Revision: https://reviews.llvm.org/D120284 Signed-off-by: Med Ismail Bennani <medismail.bennani@gmail.com>	2022-02-23 16:45:28 -08:00
Matthias Braun	6a383369f9	PGOInstrumentation, GCOVProfiling: Split indirectbr critical edges regardless of PHIs The `SplitIndirectBrCriticalEdges` function was originally designed for `CodeGenPrepare` and skipped splitting of edges when the destination block didn't contain any `PHI` instructions. This only makes sense when reducing COPYs like `CodeGenPrepare`. In the case of `PGOInstrumentation` or `GCOVProfiling` it would result in missed counters and wrong result in functions with computed goto. Differential Revision: https://reviews.llvm.org/D120096	2022-02-23 16:27:37 -08:00

... 11 12 13 14 15 ...

416698 Commits All Branches Search

416698 Commits

All Branches