llvm-project

Commit Graph

Author	SHA1	Message	Date
Markus Böck	5522ec00bc	[mlir][NFC] Fix typos in DataLayoutInterfaces.td	2021-08-06 18:54:25 +02:00
Zheng Chen	30b0c455b1	[LoopCacheAnalysis]: handle mismatch type for Numerator and CacheLineSize fix an assertion due to mismatch type for Numerator and CacheLineSize in loop cache analysis pass. Reviewed By: bmahjour Differential Revision: https://reviews.llvm.org/D107618	2021-08-06 16:51:09 +00:00
Jon Roelofs	eae4a44c1d	[GlobalISel][KnownBits] Implement G_CTPOP Implementation copied almost verbatim from ValueTracking. Differential revision: https://reviews.llvm.org/D107606	2021-08-06 09:48:39 -07:00
Michael Liao	d1cacd5928	[MemCpyOpt] Teach memcpyopt to handle loads from the constant memory. - Loads from the constant memory (either explicit one or as the source of memory transfer intrinsics) won't alias any stores. Reviewed By: asbirlea, efriedma Differential Revision: https://reviews.llvm.org/D107605	2021-08-06 12:43:52 -04:00
Craig Topper	b2ca4dc935	[LegalizeTypes] Add a simple expansion for SMULO when a libcall isn't available. This isn't optimal, but prevents crashing when the libcall isn't available. It just calculates the full product and makes sure the high bits match the sign of the low half. Each of the pieces should go through their own type legalization. This can make D107420 unnecessary. Needs tests, but I wanted to start discussion about D107420. Reviewed By: FreddyYe Differential Revision: https://reviews.llvm.org/D107581	2021-08-06 09:43:01 -07:00
David Green	77e8f4eeee	[ARM] Define ComplexPatternFuncMutatesDAG Some of the Arm complex pattern functions call canExtractShiftFromMul, which can modify the DAG in-place. For this to be valid and handled successfully we need to define ComplexPatternFuncMutatesDAG. Differential Revision: https://reviews.llvm.org/D107476	2021-08-06 17:35:11 +01:00
Paul Robinson	f88ad8d00f	Speculative fix for MachO lld test after "Have REQUIRES support the target triple" See: http://45.33.8.238/macm1/15677/step_10.txt This is a test that has `REQUIRES: x86` which means it never ran before; I don't have a MachO environment but based on the FileCheck output it looks like it should be sufficient to remove one CHECK line.	2021-08-06 09:23:45 -07:00
Pirama Arumuga Nainar	16ebb7ab5c	[llvm-objcopy] [COFF] Do not patch debug entries if PointerToRawData is zero Fix an edge case missed by https://reviews.llvm.org/D78921. For e.g., the Repro debug entry (generated with the /Brepro linker flag) does not have a debug-directory payload. Do not attempt to patch Debug entries without a payload. Differential Revision: https://reviews.llvm.org/D107324	2021-08-06 09:23:25 -07:00
Paul Robinson	e4cc071e92	Disable a dataflow fuzz test after "Have REQUIRES support the target triple" See: https://lab.llvm.org/buildbot/#/builders/75/builds/8095/steps/8/logs/stdio which shows: unsupported option '-fsanitize=dataflow' for target 'i386-unknown-linux-gnu' The other dataflow tests in the same directory were already disabled, so I think it's fine to disable this one as well.	2021-08-06 09:14:39 -07:00
Jonas Devlieghere	825a08f898	[lldb] Fix TestFunctionStarts.py on AS The tests strips the binary which invalidates the code signature. Skip code signing for this test.	2021-08-06 09:03:01 -07:00
Geoffrey Martin-Noble	ca6baf1e1d	[MLIR][std] Introduce bitcast operation This patch introduces a bitcast operation to the standard dialect. RFC: https://llvm.discourse.group/t/rfc-introduce-a-bitcast-op/3774 Reviewed By: silvas Differential Revision: https://reviews.llvm.org/D105376	2021-08-06 08:47:51 -07:00
Kazu Hirata	276be84d0a	[CodeGen] Remove computeDefOperandLatency (NFC) The last use was removed on Oct 9, 2016 in commit `5c924d7117`.	2021-08-06 08:26:55 -07:00
Alex Zinenko	c4c1030976	[mlir] support collapsed loops in OpenMP-to-LLVM translation Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D105706	2021-08-06 17:13:12 +02:00
Paul Robinson	17e9732f48	Fix test failure found by "Have REQUIRES support the target triple"	2021-08-06 08:07:58 -07:00
Jake Egan	41bcfe8174	[AIX] Define _ARCH_PPC64 macro for 32-bit %%% The macro _ARCH_PPC64 is already defined for 64-bit, but this patch defines it for 32-bit on AIX to follow xlc. See: https://www.ibm.com/docs/en/xl-c-and-cpp-aix/13.1.0?topic=features-macros-related-architecture-settings Note: This change creates a discrepancy between GCC, which defines _ARCH_PPC64 only for 64-bit mode. Tested with SPEC. %%% Reviewed By: cebowleratibm Differential Revision: https://reviews.llvm.org/D107244	2021-08-06 10:42:44 -04:00
Jake Egan	869d07ee88	[AIX] Define __HOS_AIX__ macro %%% This patch defines __HOS_AIX__ macro for AIX in case of a cross compiler implementation. %%% Tested with SPEC. Reviewed By: cebowleratibm Differential Revision: https://reviews.llvm.org/D107242	2021-08-06 10:40:13 -04:00
Paul Robinson	100a7b6197	[lit] Have REQUIRES support the target triple Currently the UNSUPPORTED and XFAIL clauses support specifying substrings of the target triple; but REQUIRES does not, which can trip people up or lead to hacking config files to insert substitute feature names. Consistency across all three lit clauses seems preferable. Differential Revision: https://reviews.llvm.org/D107162	2021-08-06 07:31:15 -07:00
Corentin Jabot	131b4620ee	Implement P1937 consteval in unevaluated contexts In an unevaluated contexts, consteval functions should not be immediately evaluated.	2021-08-06 10:29:28 -04:00
Corentin Jabot	3c8e94bc20	Disallow narrowing conversions to bool in noexcept specififers Completes the support for P1401R5.	2021-08-06 10:26:39 -04:00
Jake Egan	3189dd205a	[AIX] Define __THW_PPC__ macro %%% This patch defines the macro __THW_PPC__ for AIX. %%% Tested with SPEC. Reviewed By: cebowleratibm Differential Revision: https://reviews.llvm.org/D107243	2021-08-06 09:52:26 -04:00
Jake Egan	420e1d4cf4	[AIX] Define __THW_BIG_ENDIAN__ macro %%% This patch defines the macro __THW_BIG_ENDIAN__ for AIX. %%% Tested with SPEC. Reviewed By: cebowleratibm Differential Revision: https://reviews.llvm.org/D107241	2021-08-06 09:46:59 -04:00
Jay Foad	57b9107e3f	[GlobalISel] Improve widening of cttz/cttz_zero_undef Differential Revision: https://reviews.llvm.org/D107631	2021-08-06 14:25:56 +01:00
Arthur O'Dwyer	f221d905b1	[libc++] IWYU to fix Modules complaints about _LIBCPP_ASSERT. NFCI. This fixes all places that used _LIBCPP_ASSERT without including <__debug>. git grep -l _LIBCPP_ASSERT \| xargs git grep -L __debug	2021-08-06 09:20:59 -04:00
Kadir Cetinkaya	79c2616d31	[clangd] Canonicalize inputs provided with `--` We already strip all the inputs provided without `--`, this patch also handles the cases with `--`. Differential Revision: https://reviews.llvm.org/D107637	2021-08-06 15:04:04 +02:00
Kadir Cetinkaya	3bf77980d9	[clangd] Strip mutliple arch options This patch strips all the arch options in case of multiple ones. As it results in multiple compiler jobs, which clangd cannot handle. It doesn't pick any over the others as it is unclear which one the user wants and defaulting to host architecture seems less surprising. Users also have the ability to explicitly specify the architecture they want via clangd config files. Fixes https://github.com/clangd/clangd/issues/827. Differential Revision: https://reviews.llvm.org/D107634	2021-08-06 15:04:04 +02:00
Dmitry Preobrazhensky	02b1c3f052	[AMDGPU][MC][NFC][DOC] Updated AMD GPU assembler syntax description. Corrected sendmsg description (bug https://bugs.llvm.org/show_bug.cgi?id=49648).	2021-08-06 15:52:26 +03:00
Jan Svoboda	4aafd5f00c	[clang] Remove misleading assertion in FullSourceLoc D31709 added an assertion was added to `FullSourceLoc::hasManager()` that ensured a valid `SourceLocation` is always paired with a `SourceManager`, and missing `SourceManager` is always paired with an invalid `SourceLocation`. This appears to be incorrect, since clients never cared about constructing `FullSourceLoc` to uphold that invariant, or always checking `isValid()` before calling `hasManager()`. The assertion started failing when serializing diagnostics pointing into an explicit module. Explicit modules don't have valid `SourceLocation` for the `import` statement, since they are "imported" from the command-line argument `-fmodule-name=x.pcm`. This patch removes the assertion, since `FullSourceLoc` was never intended to uphold any kind of invariants between the validity of `SourceLocation` and presence of `SourceManager`. Reviewed By: arphaman Differential Revision: https://reviews.llvm.org/D106862	2021-08-06 14:48:28 +02:00
Andrzej Warzynski	3709822d26	[flang][docs] Document the `flang` wrapper script Differential Revision: https://reviews.llvm.org/D107543	2021-08-06 12:45:32 +00:00
Rainer Orth	779714f89b	[profile] Only use NT_GNU_BUILD_ID if supported The Solaris buildbots have been broken for some time by the unconditional use of `NT_GNU_BUILD_ID`, e.g. Solaris/sparcv9 <https://lab.llvm.org/staging/#/builders/50/builds/4910> and Solaris/amd64 <https://lab.llvm.org/staging/#/builders/101/builds/3751>. Being a GNU extension, it is not defined in `<sys/elf.h>`. However, providing a fallback definition doesn't help because the code also relies on `__ehdr_start`, another unportable GNU extension that most likely never will be implemented in Solaris `ld`. Besides, there's reallly no point in supporting build ids since they aren't used on Solaris at all. This patch fixes this by making the relevant code conditional on the definition of `NT_GNU_BUILD_ID`. Tested on `amd64-pc-solaris2.11` and `sparcv9-sun-solaris2.11`. Differential Revision: https://reviews.llvm.org/D107556	2021-08-06 14:04:11 +02:00
Mircea Trofin	ae1a2a09e4	[NFC][MLGO] Make logging more robust 1) add some self-diagnosis (when asserts are enabled) to check that all features have the same nr of entries 2) avoid storing pointers to mutable fields because the proto API contract doesn't actually guarantee those stay fixed even if no further mutation of the object occurs. Differential Revision: https://reviews.llvm.org/D107594	2021-08-06 04:44:52 -07:00
Luna Kirkby	6385abd0c4	Split 'qualifier on reference type has no effect' out into a new flag This introduces a new flag ignored-reference-qualifiers for the existing "'A' qualifier on reference type B has no effect" diagnostic, as a child of ignored-qualifiers. Rationale: This particular diagnostic is enabled by default, but other parts of ignored-qualifiers are not. Anecdotally, a user may encounter this diagnostic in the wild, and, seeing it to be valuable, might try to raise it to error with -Werror=ignored-qualifiers, whereupon the other diagnostics the flag covers will also be raised, to the user's surprise and confusion. By splitting this diagnostic out into a separate flag, and marking it as a child of ignored-qualifiers, we allow the user more granular control of the diagnostics they care about, while maintaining backwards compatibility with existing build scripts.	2021-08-06 07:09:16 -04:00
Reshabh Sharma	5173854f19	[AMDGPU] Handle functions in llvm's global ctors and dtors list This patch introduces a new code object metadata field, ".kind" which is used to add support for init and fini kernels. HSAStreamer will use function attributes, "device-init" and "device-fini" to distinguish between init and fini kernels from the regular kernels and will emit metadata with ".kind" set to "init" and "fini" respectively. To reduce the number of init and fini kernels, the ctors and dtors present in the llvm's global.ctors and global.dtors lists are called from a single init and fini kernel respectively. Reviewed by: yaxunl Differential Revision: https://reviews.llvm.org/D105682	2021-08-06 15:53:33 +05:30
Simon Pilgrim	dbce6a8d9d	[ARM] Fold insert_subvector to concat_vectors D107068 fixed the same problem on aarch64 but the arm variant wasn't exposed in existing test coverage. I've copied the arm64-neon-copy tests (and stripped the intrinsic test from it) for testing on arm neon builds as well.	2021-08-06 11:21:31 +01:00
Simon Pilgrim	18e6a03b1a	[X86][AVX] Extract SUBV_BROADCAST constant bits from just the lower subvector range (PR51281) As reported on PR51281, an internal fuzz test encountered an issue when extracting constant bits from a SUBV_BROADCAST node from a constant pool source larger than the broadcasted subvector width. The getTargetConstantBitsFromNode was assuming that the Constant would the same size as the subvector, resulting in the incorrect packing of the per-element bits data. This patch attempts to solve this by using the SUBV_BROADCAST node to determine the subvector width, and then ensuring we extract only the lowest bits from Constant of that subvector bitsize. Differential Revision: https://reviews.llvm.org/D107158	2021-08-06 11:21:31 +01:00
Alexander Belyaev	aa2210a830	[linalg] Expose `rewriteAsPaddedOp` function. Differential Revision: https://reviews.llvm.org/D107629	2021-08-06 12:08:12 +02:00
Justas Janickas	a5a2f05dcc	[C++4OpenCL] Introduces __remove_address_space utility This change provides a way to conveniently declare types that have address space qualifiers removed. Since OpenCL adds address spaces implicitly even when they are not specified in source, it is useful to allow deriving address space unqualified types. Fixes llvm.org/PR45326 Differential Revision: https://reviews.llvm.org/D106785	2021-08-06 10:40:22 +01:00
Stefan Gränitz	9c63e5b415	[Orc][examples] Temporarily disable tests for the C API due to failures on sanitizer bots These tests were added while the OrcV2Example tests had been disabled: https://reviews.llvm.org/rGe5d8cfb2f134fcf0235ec1a35eec875a9cd36b21 Failures on sanitizer bots: https://green.lab.llvm.org/green/job/clang-stage2-cmake-RgSan/7992/testReport/	2021-08-06 11:33:01 +02:00
Cullen Rhodes	08bc441174	[AArch64] NFC: drop unnecessary llvm:: namespace prefix on MCInst	2021-08-06 09:23:18 +00:00
Sven van Haastregt	22fdf617b6	[OpenCL][Docs] Adding builtins requires adding to both now As we are trying to reach parity between opencl-c.h and -fdeclare-opencl-builtins, ensure the documentation mentions that new builtins should be added to both. Reviewed by: Anastasia Stulova	2021-08-06 10:21:26 +01:00
David Sherwood	3fd96e1b2e	[LoopVectorize] Improve vectorisation of some intrinsics by treating them as uniform This patch adds more instructions to the Uniforms list, for example certain intrinsics that are uniform by definition or whose operands are loop invariant. This list includes: 1. The intrinsics 'experimental.noalias.scope.decl' and 'sideeffect', which are always uniform by definition. 2. If intrinsics 'lifetime.start', 'lifetime.end' and 'assume' have loop invariant input operands then these are also uniform too. Also, in VPRecipeBuilder::handleReplication we check if an instruction is uniform based purely on whether or not the instruction lives in the Uniforms list. However, there are certain cases where calls to some intrinsics can be effectively treated as uniform too. Therefore, we now also treat the following cases as uniform for scalable vectors: 1. If the 'assume' intrinsic's operand is not loop invariant, then we are free to treat this as uniform anyway since it's only a performance hint. We will get the benefit for the first lane. 2. When the input pointers for 'lifetime.start' and 'lifetime.end' are loop variant then for scalable vectors we assume these still ultimately come from the broadcast of an alloca. We do not support scalable vectorisation of loops containing alloca instructions, hence the alloca itself would be invariant. If the pointer does not come from an alloca then the intrinsic itself has no effect. I have updated the assume test for fixed width, since we now treat it as uniform: Transforms/LoopVectorize/assume.ll I've also added new scalable vectorisation tests for other intriniscs: Transforms/LoopVectorize/scalable-assume.ll Transforms/LoopVectorize/scalable-lifetime.ll Transforms/LoopVectorize/scalable-noalias-scope-decl.ll Differential Revision: https://reviews.llvm.org/D107284	2021-08-06 10:13:15 +01:00
Vladislav Vinogradov	59f59d1c62	[mlir] Allow to override type/attr aliases from various hooks Use new return type for `OpAsmDialectInterface::getAlias`: * `AliasResult::NoAlias` if an alias was not provided. * `AliasResult::OverridableAlias` if an alias was provided, but it might be overriden by other hook. * `AliasResult::FinalAlias` if an alias was provided and it should be used (no other hooks will be checked). In that case `AsmPrinter` will use either the first alias with `FinalAlias` result or the last alias with `OverridableAlias` result (it depends on dialect array order). Used `OverridableAlias` result for `BuiltinOpAsmDialectInterface`. Use case: provide more informative alias for built-in attributes like `AffineMapAttr` instead of generic "map<N>". Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D107437	2021-08-06 12:05:31 +03:00
Chuanqi Xu	0fd03feb4b	[FuncSpec] Return changed if function is changed by tryToReplaceWithConstant The may get changed before specialization by RunSCCPSolver. In other words, the pass may change the function without specialization happens. Add test and comment to reveal this. And it may return No Changed if the function get changed by RunSCCPSolver before the specialization. It looks like a potential bug. Test Plan: check-all Reviewed By: https://reviews.llvm.org/D107622 Differential Revision: https://reviews.llvm.org/D107622	2021-08-06 17:00:17 +08:00
Esme-Yi	2919ac8971	[llvm-readobj][XCOFF] Warn about invalid offset Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D107398	2021-08-06 08:54:02 +00:00
David Sherwood	43a5c750d1	Revert "[LoopVectorize] Add support for replication of more intrinsics with scalable vectors" This reverts commit `95800da914`.	2021-08-06 09:48:16 +01:00
Jay Foad	83610d4eb0	[AMDGPU][GlobalISel] Better legalization of 32-bit ctlz/cttz Differential Revision: https://reviews.llvm.org/D107474	2021-08-06 09:40:48 +01:00
Jay Foad	24b67a9024	[AMDGPU][GlobalISel] Improve regbankselect for 64-bit VGPR ctlz_zero_undef/cttz_zero_undef We can improve on the generic splitting by using ffbh/ffbl, which have a defined result when the input is zero. Differential Revision: https://reviews.llvm.org/D107442	2021-08-06 09:40:48 +01:00
Jay Foad	d77b43c385	[AMDGPU][GlobalISel] Add G_AMDGPU_FFBL_B32 This is the counterpart to G_AMDGPU_FFBH_U32 which already exists. These instructions have a defined result of -1 when the input is zero. Differential Revision: https://reviews.llvm.org/D107441	2021-08-06 09:40:48 +01:00
Jay Foad	cd2594e1c6	[GlobalISel] Improve legalization of narrow CTTZ Differential Revision: https://reviews.llvm.org/D107457	2021-08-06 09:40:48 +01:00
Chuanqi Xu	62fc3e0ad6	[NFC] [FuncSpec] Remove unused variables in isArgumentInteresting	2021-08-06 16:38:20 +08:00
Chuanqi Xu	cc3f40bb41	[FuncSpec] Move invariant computation for spec cost out of loop (NFC-ish) Noticed that the computation for function specialization cost of a function wouldn't change during the traversal of the arguments for the function. We could hoist the computation out of the traversal. I observed about ~1% improvement on compile time for spec2017. But I guess it may not be precise. This should be NFC and fine. Reviewed By: Sjoerd Meijer Differential Revision: https://reviews.llvm.org/D107621	2021-08-06 15:43:05 +08:00

... 2 3 4 5 6 ...

396140 Commits All Branches Search

396140 Commits

All Branches