llvm-project

Commit Graph

Author	SHA1	Message	Date
Lang Hames	897eb579c6	[ORC-RT] ExecutorAddrDiff ergonomic improvements; contains and overlaps methods Renames StartAddress and EndAddress members to Start and End. Adds contains and overlap methods. Adds a constructor from an address and size. These changes are counterparts to LLVM commits `ef391df2b6`, `c0d889995e`, and `37f1b7a3f3`.	2021-09-25 12:01:42 -07:00
Fangrui Song	a892c0e49e	[ELF][test] Improve test coverage	2021-09-25 11:57:54 -07:00
Lang Hames	99951a5684	Revert "[ORC] Introduce EPCGenericRTDyldMemoryManager." This reverts commit `bef55a2b47` while I investigate failures on some bots. Also reverts "[lli] Add ChildTarget dependence on OrcTargetProcess library." (`7a219d801b`) which was a fallow-up to `bef55a2b47`.	2021-09-25 11:19:14 -07:00
Lang Hames	7a219d801b	[lli] Add ChildTarget dependence on OrcTargetProcess library. ChildTarget depends on OrcTargetProcess after `bef55a2b47`.	2021-09-25 10:51:29 -07:00
Lang Hames	bef55a2b47	[ORC] Introduce EPCGenericRTDyldMemoryManager. EPCGenericRTDyldMemoryMnaager is an EPC-based implementation of the RuntimeDyld::MemoryManager interface. It enables remote-JITing via EPC (backed by a SimpleExecutorMemoryManager instance on the executor side) for RuntimeDyld clients. The lli and lli-child-target tools are updated to use SimpleRemoteEPC and SimpleRemoteEPCServer (rather than OrcRemoteTargetClient/Server), and EPCGenericRTDyldMemoryManager for MCJIT tests. By enabling remote-JITing for MCJIT and RuntimeDyld-based ORC clients, EPCGenericRTDyldMemoryManager allows us to deprecate older remote-JITing support, including OrcTargetClient/Server, OrcRPCExecutorProcessControl, and the Orc RPC system itself. These will be removed in future patches.	2021-09-25 10:42:10 -07:00
Simon Pilgrim	18c8ed5416	[DAG] ReduceLoadOpStoreWidth - replace getABITypeAlign with allowsMemoryAccess (PR45116) One of the cases identified in PR45116 - we don't need to limit store narrowing to ABI alignment, we can use allowsMemoryAccess - which tests using getABITypeAlign, but also checks if a target permits (fast) misaligned memory access by checking allowsMisalignedMemoryAccesses as a fallback.	2021-09-25 18:35:57 +01:00
mydeveloperday	c2ec5dd209	[clang-format] Left/Right alignment fixer can cause false positive replacements when they don't actually change anything Earlier during the development of {D69764} I felt it was no longer necessary to ensure we were not trying to change code which didn't need to change and we felt this could be removed, however I'd like to bring this back for now as I am seeing some false positives in terms of the "replacements" What I see is the generation of a replacement which is a "No Op" on the original code, I think this comes about because of the merging of replacements: ``` static const a; -> const static a; -> static const a; ``` The replacements don't really merge, in such a way as to identify when we have gone back to the original Also remove the Penalty as I'm not using it (and it became marked as set and no used, I'd rather get rid of it if it means nothing) I think we need to do this step for now, as many people use the --output-replacements-xml to identify that the file "needs a clang-format" The same can be seen with the -n or --dry-run option as this uses the replacements to drive the error/warning output. Reviewed By: HazardyKnusperkeks Differential Revision: https://reviews.llvm.org/D110392	2021-09-25 17:35:41 +01:00
Simon Pilgrim	8c83bd3bd4	[CostModel][X86] Adjust vXi32 multiply costs if it can be performed using PMADDWD Update the costs to match the codegen from combineMulToPMADDWD - not only can we use PMADDWD is its zero-extended, but also if its a constant or sign-extended from a vXi16 (which can be replaced with a zero-extension).	2021-09-25 16:28:48 +01:00
Simon Pilgrim	eb7c78c2c5	[X86][SSE] combineMulToPMADDWD - mask off upper bits of sign-extended vXi32 constants If we are multiplying by a sign-extended vXi32 constant, then we can mask off the upper 16 bits to allow folding to PMADDWD and make use of its implicit sign-extension from i16	2021-09-25 15:50:45 +01:00
Simon Pilgrim	2a4fa0c27c	[X86][SSE] combineMulToPMADDWD - enable sext(v8i16) -> zext(v8i16) fold on sub-128 bit vectors	2021-09-25 15:50:45 +01:00
Kazu Hirata	44c401bdc3	[Mips] Remove redundant declarations (NFC) Note that identical declarations immediately precede what's being removed in this patch. Identified with readability-redundant-declaration.	2021-09-25 07:41:11 -07:00
Simon Pilgrim	f5a26ccae2	[X86][SSE] combineMulToPMADDWD - enable sext(v8i16) -> zext(v8i16) fold on pre-SSE41 targets We already do this on SSE41 targets where we have sext/zext instructions, now that combineShiftToPMULH handles SSE2 targets, we can enable this here as well.	2021-09-25 14:35:31 +01:00
Simon Pilgrim	4c72b10f0a	[X86] X86FastISel::fastMaterializeConstant - break if-else chain to fix llvm-else-after-return warning. NFCI All previous if-else cases return	2021-09-25 14:31:14 +01:00
Simon Pilgrim	a25f25c3b7	[X86] combineShiftToPMULH - relax from ISA from SSE41 to SSE2 With improved shuffle combines (in particular canonicalizeShuffleWithBinOps), we can now usefully perform this on any SSE2+ target. We should be able to remove this entirely and just use DAGCombiner's combineShiftToMULH if we can someday get it to support illegal (pre-widened) types.	2021-09-25 14:08:03 +01:00
Michał Górny	3a6ba36751	[lldb] Convert misc. StringConvert uses Replace misc. StringConvert uses with llvm::to_integer() and llvm::to_float(), except for cases where further refactoring is planned. The purpose of this change is to eliminate the StringConvert API that is duplicate to LLVM, and less correct in behavior at the same time. Differential Revision: https://reviews.llvm.org/D110447	2021-09-25 14:19:19 +02:00
Valentin Clement	2c28e3f008	[fir] Add desc to fir.array_load op and update operand name This patch is part of the upstreaming effort from fir-dev branch. Add a description for the fir.array_load opeartion and rename lenParams to typeparams. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D110393 Co-authored-by: Eric Schweitz <eschweitz@nvidia.com>	2021-09-25 14:10:35 +02:00
Simon Pilgrim	5a14edd8ed	[InstCombine] Ensure shifts are in range for (X << C1) / C2 -> X fold. We can get here before out of range shift amounts have been handled - limit to BW-2 for sdiv and BW-1 for udiv Fixes https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=38078	2021-09-25 12:57:43 +01:00
Markus Böck	0b61f43b60	[CMake] Consistently use the LibXml2::LibXml2 target instead of LIBXML2_LIBRARIES Linking against the LibXml2::LibXml2 target has the advantage of not only importing the library, but also adding the include path as well as any definitions the library requires. In case of a static build of libxml2, eg. a define is set on Windows to remove any DLL imports and export. LLVM already makes use of the target, but c-index-test and lldb were still linking against the library only. The workaround for Mac OS-X that I removed seems to have also been made redundant since https://reviews.llvm.org/D84563 I believe Differential Revision: https://reviews.llvm.org/D109975	2021-09-25 13:13:11 +02:00
Simon Pilgrim	ee267b1c7c	[IR] DIBuilder::createEnumerator - pass APSInt by const reference Avoid unnecessary copy by value.	2021-09-25 11:58:06 +01:00
Simon Pilgrim	6bd5b1b1ce	[DAG] combineShiftToMULH - move getValueType() inside assert. NFCI. Avoids an unnecessary (void).	2021-09-25 11:56:35 +01:00
Kunwar Shaanjeet Singh Grover	0f78ece169	[MLIR] Add functionality to remove redundant local variables This patch adds functionality to FlatAffineConstraints to remove local variables using equalities. This helps in keeping output representation of FlatAffineConstraints smaller. This patch is part of a series of patches aimed at generalizing affine dependence analysis. Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D110056	2021-09-25 16:10:43 +05:30
David Green	883758ed48	[ARM] Fix Arm block placement creating branches after jump tables. Given: - A jump table - Which jumps to the next block - The next block ends in a WLS - Where the WLS conditionally jumps to block earlier in the program. The Arm block placement pass would attempt to move the block containing the WLS earlier, as the WLS instruction can only branch forward. In doing so it would add a branch from the jumptable block to the WLS block, thinking it previously fell-through. This in itself would be fine, if a little inefficient, but the constant island pass expects all instructions after a jump-table branch to have been removed by analyzeBranch. So it gets confused and can assign the same labels to multiple jump table blocks. I've changed the condition to the same as used in analyzeBranch.	2021-09-25 11:32:25 +01:00
Dmitry Vyukov	b02938439d	tsan: uninline RacyStacks::operator== It's only used during race reporting. There is no point in polluting the main header file with it. Reviewed By: xgupta Differential Revision: https://reviews.llvm.org/D110470	2021-09-25 12:08:51 +02:00
Simon Pilgrim	993f3c61b3	[TTI] getUserCost - Ensure a vector insert/extract index is in unsigned 32-bit range Otherwise fallback to the generic 'unknown index' path Fixes https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=29050	2021-09-25 10:50:54 +01:00
Jim Lin	ed687c0211	[RISCV] Fix incorrect operand type of inst alias for InstR4 Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D110381	2021-09-25 11:25:12 +08:00
Matheus Izvekov	37adc4f957	[clang] set templates as invalid when any of the parameters are invalid See PR51872 for the original repro. This fixes a crash when converting a templated constructor into a deduction guide, in case any of the template parameters were invalid. Signed-off-by: Matheus Izvekov <mizvekov@gmail.com> Reviewed By: rsmith Differential Revision: https://reviews.llvm.org/D110460	2021-09-25 04:04:47 +02:00
Amara Emerson	f4cfda03d6	[AArch64][AMDGPU] Re-generate some tests with CHECK-NEXT to prepare for a patch.	2021-09-24 18:26:08 -07:00
Petr Hosek	d893692024	[CMake] Pass through CMAKE_READELF to subbuilds This matches handling of other CMake variables. Differential Revision: https://reviews.llvm.org/D110463	2021-09-24 18:20:30 -07:00
Jason Molenda	a2e1d68fa9	Add pragma to make it easier to find "image list" impl I couldn't find it; make this easier for next time.	2021-09-24 17:13:03 -07:00
David Blaikie	5cb210862b	DebugInfo: Use the signedness of the underlying enum when encoding enum non-type-template-parameters This improves the accuracy of the debug info and improves round tripping through -gsimple-template-names.	2021-09-24 17:02:55 -07:00
River Riddle	ef764eeeb9	[mlir:ElementsAttr] Avoid crash on empty contiguous ranges We currently, incorrectly, assume that a range always has at least one element when building a contiguous range. This commit adds a proper empty check to avoid crashing. Differential Revision: https://reviews.llvm.org/D110457	2021-09-24 23:48:51 +00:00
modimo	ce6ed64a69	[llvm-profdata] Extend support of --topn to sample profiles Reviewed By: wenlei Differential Revision: https://reviews.llvm.org/D110449	2021-09-24 16:42:46 -07:00
Nico Weber	4a45f95e76	[llvm] Remove LLVM_CHECK_ENABLED_PROJECTS again This reverts commit `55f0b33708` and follow-up reverts commit `e9ea03c62c`. LLVM_EXTERNAL_PROJECTS is sufficient, see https://reviews.llvm.org/D110016	2021-09-24 18:49:55 -04:00
Nico Weber	6ece82e900	Revert "[Driver] Correctly handle static C++ standard library" This reverts commit `03142c5f67`. Breaks check-asan if system ld doesn't support --push-state, even if lld was built and is used according to lit's output. See comments on https://reviews.llvm.org/D110128	2021-09-24 18:44:53 -04:00
Konrad Kleine	6deaebe5fe	[llvm] Improve export.sh with help and snapshot This change adds the ability to create source tarballs for unreleased or untagged code by providing the `--git-ref <GIT_REF>` flag to the `llvm/utils/release/export.sh` script. This is useful for creating daily snapshot tarballs that can easily be consumed by packagers who want to build a daily snapshot. The default behavior of `export.sh` hasn't changed. You may also provide a `--template` argument to say how the artifacts are supposed to be named (as suggested by @hans). The `-help` output of `export.sh` was changed quite significantly to look like this: ``` Export the Git sources and build tarballs from them. Usage: export.sh [-release\|--release <major>.<minor>.<patch>] [-rc\|--rc <num>] [-final\|--final] [-git-ref\|--git-ref <git-ref>] [-template\|--template <template>] Flags: -release \| --release <major>.<minor>.<patch> The version number of the release -rc \| --rc <num> The release candidate number -final \| --final When provided, this option will disable the rc flag -git-ref \| --git-ref <git-ref> (optional) Use <git-ref> to determine the release and don't export the test-suite files -template \| --template <template> (optional) Possible placeholders: $PROJECT $YYYYMMDD $GIT_REF $RELEASE $RC. Defaults to '${PROJECT}-${RELEASE}${RC}.src.tar.xz'. The following list shows the filenames (with <placeholders>) for the artifacts that are being generated (given that you don't touch --template). * llvm-<RELEASE><RC>.src.tar.xz * clang-<RELEASE><RC>.src.tar.xz * compiler-rt-<RELEASE><RC>.src.tar.xz * libcxx-<RELEASE><RC>.src.tar.xz * libcxxabi-<RELEASE><RC>.src.tar.xz * libclc-<RELEASE><RC>.src.tar.xz * clang-tools-extra-<RELEASE><RC>.src.tar.xz * polly-<RELEASE><RC>.src.tar.xz * lldb-<RELEASE><RC>.src.tar.xz * lld-<RELEASE><RC>.src.tar.xz * openmp-<RELEASE><RC>.src.tar.xz * libunwind-<RELEASE><RC>.src.tar.xz * flang-<RELEASE><RC>.src.tar.xz Additional files being generated: * llvm-project-<RELEASE><RC>.src.tar.xz (the complete LLVM source project) * test-suite-<RELEASE><RC>.src.tar.xz (only when not using --git-ref) To ease the creation of snapshot builds, we also provide these files * llvm-release-<YYYYMMDD>.txt (contains the <RELEASE> as a text) * llvm-rc-<YYYYMMDD>.txt (contains the rc version passed to the invocation of export.sh) * llvm-git-revision-<YYYYMMDD>.txt (contains the current git revision sha1) Example values for the placeholders: * <RELEASE> -> 13.0.0 * <YYYYMMDD> -> 20210414 * <RC> -> rc4 (will be empty when using --git-ref) In order to generate snapshots of the upstream main branch you could do this for example: export.sh --git-ref upstream/main --template '${PROJECT}-${YYYYMMDD}.src.tar.xz' ``` Reviewed By: tstellar Differential Revision: https://reviews.llvm.org/D101446	2021-09-25 00:38:19 +02:00
Wei Mi	80865f7579	Add "REQUIRES: zlib" in forward-compatible.test since it handles compressed file.	2021-09-24 15:35:07 -07:00
Wei Mi	e8b376547b	Fixed a bug in https://reviews.llvm.org/rG8eb617d719bdc6a4ed7773925d2421b9bbdd4b7a . For compressed profile when reading an unknown section, the data reader pointer adjustment was incorrect. This patch fixed that.	2021-09-24 15:23:45 -07:00
Craig Topper	715cf6ffb9	[RISCV] Add another isel optimization for (and (shl X, c2), c1). Where c1 is a shifted mask with 32-c2 leading zeros and c3 trailing zeros and c3>c2. We can select it as (slli (srliw X, c3-c2), c3).	2021-09-24 15:10:25 -07:00
Jonas Devlieghere	d0649320bf	[dsymutil] Update union-fwd-decl.test for Windows Remove path separators from CHECK-lines in union-fwd-decl.test	2021-09-24 15:07:22 -07:00
Jonas Devlieghere	c628875929	[lldb] Copy the system debugserver in LLDB.framework When using the system debugserver for testing, copy the binary in the LLDB.framework Resource directory instead of the build's bin directory. rdar://82998263	2021-09-24 15:07:22 -07:00
Lang Hames	37f1b7a3f3	[ORC] Allow construction of an ExecutorAddrRange from an addr and a size.	2021-09-24 14:35:50 -07:00
David Blaikie	9911af4b91	WIP: Verify -gsimple-template-names=mangled values Clang will encode names that should be able to be simplified as "_STNname\|<template, args>" (eg: "_STNt1\|<int>") - this verification mode will detect these names, decode them, create the original name ("t1<int>") and the simple name ("t1") - letting the simple name run through the usual rebuilding logic - then compare the two sources of the full name - the rebuilt and the _STN encoding. This helps ensure that -gsimple-template-names is lossless.	2021-09-24 14:28:18 -07:00
Jonas Devlieghere	62d6ff5e9e	[dsymutil] Track incompleteness across unions When determining the incompleteness of a DIE based on its children, make sure we propagate it across union types. See test case for an example. Without this patch we never emit the definition of Container_ivars. Differential revision: https://reviews.llvm.org/D110443	2021-09-24 14:26:37 -07:00
wlei	1422fa5fab	[llvm-profgen] Unify output format of different unsymbolized profiles Differential Revision: https://reviews.llvm.org/D110080	2021-09-24 14:18:00 -07:00
wlei	28277e9b48	[AutoFDO][llvm-profgen] Report zero count for unexecuted part of function code In order to be consistent with compiler that interprets zero count as unexecuted(cold), this change reports zero-value count for unexecuted part of function code. For the implementation, it leverages the range counter, initializes all the executed function range with the zero-value. After all ranges are merged and converted into disjoint ranges, the remaining zero count will indicates the unexecuted(cold) part of the function. This change also extends the current `findDisjointRanges` method which now can support adding zero-value range. Reviewed By: hoy, wenlei Differential Revision: https://reviews.llvm.org/D109713	2021-09-24 14:15:05 -07:00
Lei Zhang	b45476c94c	[mlir][tosa] Do not fold transpose with quantized types For such cases, the type of the constant DenseElementsAttr is different from the transpose op return type. Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D110446	2021-09-24 16:57:55 -04:00
wlei	d5f2013004	[AutoFDO][llvm-profgen] Profile generation for LBR(non-CS) sample This patch introduces non-CS AutoFDO profile generation into LLVM. The profile is supposed to be well consumed by compiler using `-fprofile-sample-use=[profile]`. After range and branch counters are extracted from the LBR sample, here we go through each addresses for symbolization, create FunctionSamples and populate its sub fields like TotalSamples, BodySamples and HeadSamples etc. For inlined code, as we need to map back to original code, so we always add body samples to the leaf frame's function sample. Reviewed By: wenlei, hoy Differential Revision: https://reviews.llvm.org/D109551	2021-09-24 13:55:34 -07:00
Diego Caballero	2a876a711d	[mlir] Create a generic reduction detection utility This patch introduces a generic reduction detection utility that works across different dialecs. It is mostly a generalization of the reduction detection algorithm in Affine. The reduction detection logic in Affine, Linalg and SCFToOpenMP have been replaced with this new generic utility. The utility takes some basic components of the potential reduction and returns: 1) the reduced value, and 2) a list with the combiner operations. The logic to match reductions involving multiple combiner operations disabled until we can properly test it. Reviewed By: ftynse, bondhugula, nicolasvasilache, pifon2a Differential Revision: https://reviews.llvm.org/D110303	2021-09-24 20:45:59 +00:00
wlei	a7cdcf25c1	[llvm-profgen] Ignore invalid perf line in LBR record Similar to https://reviews.llvm.org/D109637, there is a whole invalid line of message in perfscript. ``` warning: Invalid address in LBR record at line 14118674: Processed 14138923 events and lost 1 chunks! warning: Invalid address in LBR record at line 14118676: Check IO/CPU overload! ``` This only happened for LBR only perfscript, hybridperfscript have a check of " 0x" to make sure it's the LBR perf line. Reviewed By: hoy, wenlei Differential Revision: https://reviews.llvm.org/D110424	2021-09-24 13:44:57 -07:00
Stanislav Mekhanoshin	cf74ef134c	[AMDGPU] Limit promote alloca max size in functions Non-entry functions have 32 caller saved VGPRs available. If we promote alloca to consume more registers we will have to spill CSRs. There is no reason to eliminate scratch access to get another scratch access instead. Differential Revision: https://reviews.llvm.org/D110372	2021-09-24 13:38:39 -07:00

1 2 3 4 5 ...

399943 Commits All Branches Search

399943 Commits

All Branches