llvm-project

Commit Graph

Author	SHA1	Message	Date
Matthias Springer	d020dd2b21	[mlir] Migrate vector-to-loops.mlir to ProgressiveVectorToSCF Create a copy of vector-to-loops.mlir and adapt the test for ProgressiveVectorToSCF. Fix a small bug in getExtractOp() triggered by this test. Differential Revision: https://reviews.llvm.org/D102388	2021-05-13 22:48:20 +09:00
Krzysztof Parzyszek	2b20dee59b	Fix section title underlining in the release notes	2021-05-13 08:37:06 -05:00
Oliver Stannard	92260d7a18	Revert "[CMake][ELF] Add -fno-semantic-interposition and -Bsymbolic-functions" This reverts commit `3bf1acab5b`. This is causing the test `gcov-shared-flush.c` to fail on the 2-stage aarch64 buildbots (https://lab.llvm.org/buildbot/#/builders/7/builds/2720).	2021-05-13 14:31:17 +01:00
Krzysztof Parzyszek	4dea348731	Add entry about Hexagon V68 support to the release notes	2021-05-13 08:28:55 -05:00
Jinsong Ji	b1509d067e	[AIX] XFAIL CodeGen/Generic/externally_available.ll Globals with “available_externally” linkage should never be emitted into the object file corresponding to the LLVM module. However, the AIX system assembler by default prints an error for undefined references, so AIX chose to emit the available externally symbols into .s, so that users won't run into errors in situations like: clang -target powerpc-ibm-aix -xc -<<<$'extern inline __attribute__((__gnu_inline__)) void foo() {}\nvoid bar() { foo(); }' -O -Xclang -disable-llvm-passes Reviewed By: hubert.reinterpretcast Differential Revision: https://reviews.llvm.org/D102377	2021-05-13 13:24:48 +00:00
Matthias Springer	bf068e1077	[mlir] Do not use pass labels in unrolled ProgressiveVectorToSCF Do not rely on pass labels to detect if the pattern was already applied in the past (which allows for some extra optimizations to avoid extra InsertOps and ExtractOps). Instead, check if these optimizations can be applied on-the-fly. This also fixes a bug, where vector.insert and vector.extract ops sometimes disappeared in the middle of the pass because they get folded away, but the next application of the pattern expected them to be there. Differential Revision: https://reviews.llvm.org/D102206	2021-05-13 22:01:08 +09:00
Nico Weber	9310840cc2	[gn build] (manually) port `92f9852fc9`, clang-repl	2021-05-13 08:53:53 -04:00
Georgy Komarov	e07753c881	[clang-tidy] Fix test that requires Windows platform This commit fixes the cppcoreguidelines-pro-type-vararg test when it runs on a Windows host, but the toolchain targets a non-Windows platform. Reviewed By: njames93 Differential Revision: https://reviews.llvm.org/D102337	2021-05-13 15:51:53 +03:00
Stefan Pintilie	15051f0b4a	[PowerPC] Handle inline assembly clobber of link register This patch adds the handling of clobbers of the link register LR for inline assembly. This patch is to fix: https://bugs.llvm.org/show_bug.cgi?id=50147 Reviewed By: nemanjai, #powerpc Differential Revision: https://reviews.llvm.org/D101657	2021-05-13 07:43:37 -05:00
Florian Hahn	bdada7546e	[VPlan] Adjust assert in splitBlock to allow splitting at end. SplitAt should only be dereferenced in the assert if it does not point to the end of the block. This fixes a crash in the added test case.	2021-05-13 13:36:35 +01:00
Simon Pilgrim	ba0ec1be29	[X86] X86ExpandPseudo.cpp - try to pass DebugLoc by const-ref to avoid costly TrackingMDNodeRef copies. NFCI.	2021-05-13 13:31:54 +01:00
Simon Pilgrim	4956655640	[X86] X86InstrInfo.cpp - try to pass DebugLoc by const-ref to avoid costly TrackingMDNodeRef copies. NFCI.	2021-05-13 13:31:53 +01:00
Simon Pilgrim	9dfc4ac41c	[X86] VZeroUpperInserter::insertVZeroUpper - avoid DebugLoc creation by embedding in the BuildMI calls. NFCI. Try to pass DebugLoc by const-ref to avoid costly TrackingMDNodeRef copies.	2021-05-13 13:31:52 +01:00
Florian Hahn	ffc157ea82	[Passes] Use regex to match GlobalsAA line in test. On some platforms/compiler combinations, it appears the output is slightly different. Update the test to use a regex, as is done at other places in the new-pm-*default.ll tests to address buildbot failures.	2021-05-13 13:22:18 +01:00
Florian Hahn	860b37526a	[Passes] Run GlobalsAA before LICM during LTO in new PM. This patch adjusts the LTO pipeline in the new PM to run GlobalsAA before LICM to match the legacy PM. This fixes a regression where the new PM failed to vectorize loops that require hoisting/sinking by LICM depending on GlobalsAA info. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D102345	2021-05-13 13:11:18 +01:00
Fraser Cormack	797e580db9	[RISCV][NFC] Simplify test run lines Several tests had -verify-machineinstrs twice, and several tests were explicitly specifying the default FileCheck prefix of CHECK.	2021-05-13 12:41:00 +01:00
Florian Hahn	3eaf235855	[Passes] Use MemorySSA for LICM during LTO. Split off from D102345 to commit this separately from other changes in the patch. This aligns the behavior of the new PM with the legacy PM for LTO, with respect to running LICM. Together with the remaining changes in D102345, this fixes new PM regressions where we fail to vectorize loops that are vectorized with the legacy PM.	2021-05-13 12:16:41 +01:00
Vassil Vassilev	4624412367	[clang-repl] Fix ClangReplInterpreterTests unittest dependency.	2021-05-13 10:32:08 +00:00
David Spickett	0326d4667a	[Utils] Use whoami to get username for arcanist warning message `959eec1fdd` changed the message to show the local username with "$user" but this is not always set. Some systems will have USER/USERNAME/LOGNAME, so just use "whoami" instead.	2021-05-13 11:11:43 +01:00
Nemanja Ivanovic	39e4676ca7	[PowerPC] Provide doubleword vector predicate form comparisons on Power7 There are two reasons this shouldn't be restricted to Power8 and up: 1. For XL compatibility 2. Because clang will expand comparison operators to these intrinsics* *Without this patch, the following causes a selection error: int test(vector signed long a, vector signed long b) { return a < b; } This patch provides the handling for the intrinsics in the back end and removes the Power8 guards from the predicate functions (vec_{all|any}_{eq|ne|gt|ge|lt|le}).	2021-05-13 04:56:56 -05:00
Florian Hahn	e2759f110b	[SCEV] Apply guards to max with non-unitary steps. We already apply loop-guards when computing the maximum with unitary steps. This extends the code to also do so when dealing with non-unitary steps. This allows us to infer a tighter maximum in some cases. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D102267	2021-05-13 09:47:29 +01:00
Bruno Cardoso Lopes	fd184c062c	[TSAN] Honor failure memory orders in AtomicCAS LLVM has lifted strong requirements for CAS failure memory orders in `431e3138a` and `819e0d105e`. Add support for honoring them in `AtomicCAS`. https://github.com/google/sanitizers/issues/970 Differential Revision: https://reviews.llvm.org/D99434	2021-05-13 01:07:22 -07:00
Vassil Vassilev	b2186a69c1	[clang-repl] Add final set of missing library dependencies.	2021-05-13 08:06:58 +00:00
Kristina Bessonova	8a86787847	[libcxx] NFC. Fix misprint unodered -> unordered Differential Revision: https://reviews.llvm.org/D102354	2021-05-13 09:57:29 +02:00
Jason Molenda	afee09751d	[NFC] Add GetInferiorAddrSize method, unify code to compute MachProcess.mm has a sequence to get the address size in the inferior in three places; and I'm about to add a fourth in a future patch. Not a fan.	2021-05-13 00:47:58 -07:00
Jingu Kang	107d19eb01	Revert "[SimpleLoopUnswitch] Port partially invariant unswitch from LoopUnswitch to SimpleLoopUnswitch" This reverts commit `88b259c014`. The bugs below need to be fixed first. https://bugs.llvm.org/show_bug.cgi?id=50279 https://bugs.llvm.org/show_bug.cgi?id=50302	2021-05-13 08:40:49 +01:00
Serge Pavlov	12537ab772	[FPEnv][X86] Implement lowering of llvm.set.rounding Differential Revision: https://reviews.llvm.org/D74730	2021-05-13 14:30:38 +07:00
serge-sans-paille	6045cb89e5	Use an allow list on reserved macro identifiers The allow list is based on various official sources (see in-code comment). This fixes https://bugs.llvm.org/show_bug.cgi?id=50248 Differential Revision: https://reviews.llvm.org/D102168	2021-05-13 09:23:47 +02:00
Vassil Vassilev	3f4c518592	[clang-repl] Add exhaustive list of libInterpreter dependencies. This patch should appease the bots building with -DBUILD_SHARED_LIBS=On, resolving the regression introduced in `92f9852fc9`.	2021-05-13 07:18:01 +00:00
Vassil Vassilev	92f9852fc9	[clang-repl] Recommit "Land initial infrastructure for incremental parsing" Original commit message: In http://lists.llvm.org/pipermail/llvm-dev/2020-July/143257.html we have mentioned our plans to make some of the incremental compilation facilities available in llvm mainline. This patch proposes a minimal version of a repl, clang-repl, which enables interpreter-like interaction for C++. For instance: ./bin/clang-repl clang-repl> int i = 42; clang-repl> extern "C" int printf(const char*,...); clang-repl> auto r1 = printf("i=%d\n", i); i=42 clang-repl> quit The patch allows very limited functionality, for example, it crashes on invalid C++. The design of the proposed patch follows closely the design of cling. The idea is to gather feedback and gradually evolve both clang-repl and cling to what the community agrees upon. The IncrementalParser class is responsible for driving the clang parser and codegen and allows the compiler infrastructure to process more than one input. Every input adds to the “ever-growing” translation unit. That model is enabled by an IncrementalAction which prevents teardown when HandleTranslationUnit. The IncrementalExecutor class hides some of the underlying implementation details of the concrete JIT infrastructure. It exposes the minimal set of functionality required by our incremental compiler/interpreter. The Transaction class keeps track of the AST and the LLVM IR for each incremental input. That tracking information will be later used to implement error recovery. The Interpreter class orchestrates the IncrementalParser and the IncrementalExecutor to model interpreter-like behavior. It provides the public API which can be used (in future) when using the interpreter library. Differential revision: https://reviews.llvm.org/D96033	2021-05-13 06:30:29 +00:00
Matthias Springer	60da33c2d4	[mlir] Support masks in TransferOpReduceRank and TransferReadPermutationLowering These two patterns allow for more efficient codegen in VectorToSCF. Differential Revision: https://reviews.llvm.org/D102222	2021-05-13 15:08:08 +09:00
LLVM GN Syncbot	9bf1761975	[gn build] Port `d8b37de8a4`	2021-05-13 05:32:35 +00:00
Max Kazantsev	d8b37de8a4	[GC][NFC] Move GCStrategy from CodeGen to IR We want it to be available in analyses so that we could use the CodeGen notion in middle-end passes (for example, to check if a GC may free some particular pointer). This is a preparatory patch that simply moves the files around. Note: if this causes some build issues, this patch must just be reverted. Differential Revision: https://reviews.llvm.org/D100557 Reviewed By: reames	2021-05-13 12:31:59 +07:00
Lang Hames	2f21a272af	[JITLink] Expose x86-64 pointer jump stub block construction. This can be useful for clients who want to define their own symbol for the stub, or re-use some existing symbol.	2021-05-12 22:28:14 -07:00
Lang Hames	4b0f5edd36	[JITLink] Add a transferDefinedSymbol operation. The transferDefinedSymbol operation updates a Symbol's target block, offset, and size. This can be convenient when you want to redefine the content of some symbol(s) pointing at a block, while retaining the original block in the graph.	2021-05-12 22:28:14 -07:00
Jason Molenda	0c443e92d3	Add some warnings when debugserver is running in translation A debugserver launched x86_64 cannot control an arm64/arm64e process on an Apple Silicon system. Warn when this situation has happened and return an error for the most common case of attach. I think there will be refinements to this in the future, but start out by making it easy to spot the problem when it happens. rdar://76630595	2021-05-12 22:18:24 -07:00
Chuanqi Xu	c1359ef07e	[Coroutines] Salvage Debug.values Summary: The previous implementation of coro-split didn't collect values used by dbg instructions into the spills, which made a lot of debug info unavailable with optimization on. This patch tries to collect the values that are used by dbg.values. In this way, the debuggability of coroutines could be as good as that of normal functions with optimization on. To avoid enlarging the coroutine frame, this patch only collects a `dbg.value` whose value is already in the coroutine frame. This decision may make some debug info unavailable, but with optimization on, performance should be considered first. This patch improves the debuggability of coroutines without changing the layout of the frame. Test-plan: check-llvm Reviewed By: aprantl, lxfind Differential Revision: https://reviews.llvm.org/D97673	2021-05-13 13:06:33 +08:00
Rob Suderman	3f8aafd790	[mlir][tosa] Fix tosa.cast semantics to perform rounding/clipping Rounding to integers requires rounding (for floating points) and clipping to the min/max values of the destination range. Added this behavior and updated tests appropriately. Reviewed By: sjarus, silvas Differential Revision: https://reviews.llvm.org/D102375	2021-05-12 21:53:53 -07:00
Vassil Vassilev	f6907152db	Revert "[clang-repl] Land initial infrastructure for incremental parsing" This reverts commit `44a4000181`. We are seeing build failures due to missing dependency to libSupport and CMake Error at tools/clang/tools/clang-repl/cmake_install.cmake file INSTALL cannot find	2021-05-13 04:44:19 +00:00
Chuanqi Xu	6e5b8f489a	[Coroutines] Enable printing coroutine frame when dbg info is available Summary: This patch tries to build debug info for the coroutine frame in the middle end. Although the coroutine frame is constructed and maintained by the compiler, and by the design of C++20 coroutines the programmer shouldn't care about it, a lot of programmers told me that they strongly want to see the layout of the coroutine frame. Although C++ is designed as an abstract layer so that programmers shouldn't care about the actual memory in bits, many experienced C++ programmers are in fact used to inspecting the memory layout with an assembler and a debugger. After being told about 3 times that they want to see the coroutine frame, I think it is an actual and desired demand. However, the debug information is constructed in the front end and the coroutine frame is constructed in the middle end. This is a natural and clear gap. So I could only try to construct the debug information in the middle end after the coroutine frame is constructed. It is unusual, but we are in consensus that the approach is the best one. One hard part is that we need to construct the names for variables, since there isn't a map from llvm variables to DIVar. Here is the strategy this patch uses: - The names `__resume_fn`, `__destroy_fn` and `__coro_index` are constructed by the patch. - Then the name `__promise` comes from the dbg.variable of the corresponding dbg.declare of PromiseAlloca, which has the highest priority when constructing the debug information for a member of the coroutine frame. - Then if the member is a struct, we would try to get the name of the llvm struct directly, then replace ':' and '.' with '_' to make it printable for the debugger. - If the member is a basic type like integer or double, we would try to emit the corresponding name. - Then if the member is a Pointer Type, we would add `Ptr` after the corresponding pointee type. - Otherwise, we would name it with 'UnknownType'. Reviewed by: lxfind, aprantl, rjmcall, dblaikie Differential Revision: https://reviews.llvm.org/D99179	2021-05-13 12:43:08 +08:00
Anton Afanasyev	ab2c499d3a	[SLP] Add insertelement instructions to vectorizable tree Add new type of tree node for `InsertElementInst` chain forming vector. These instructions could be either removed, or replaced by shuffles during vectorization and we can add this node to cost model, so naturally estimating their cost, getting rid of `CompensateCost` tricks and reducing further work for InstCombine. This fixes PR40522 and PR35732 in a natural way. Also this patch is the first step towards revectorization of partially vectorized code (to fix PR42022 completely). After adding inserts to tree the next step is to add vector instructions there (for instance, to merge `store <2 x float>` and `store <2 x float>` to `store <4 x float>`). Fixes PR40522 and PR35732. Differential Revision: https://reviews.llvm.org/D98714	2021-05-13 07:41:45 +03:00
Anton Afanasyev	cd9090031c	[SLP][Test] Fix and precommit tests for D98714	2021-05-13 07:41:45 +03:00
Anton Afanasyev	00a0595b25	[SLP][Test] Fix and precommit tests for D98714	2021-05-13 07:41:06 +03:00
Vassil Vassilev	44a4000181	[clang-repl] Land initial infrastructure for incremental parsing In http://lists.llvm.org/pipermail/llvm-dev/2020-July/143257.html we have mentioned our plans to make some of the incremental compilation facilities available in llvm mainline. This patch proposes a minimal version of a repl, clang-repl, which enables interpreter-like interaction for C++. For instance: ./bin/clang-repl clang-repl> int i = 42; clang-repl> extern "C" int printf(const char*,...); clang-repl> auto r1 = printf("i=%d\n", i); i=42 clang-repl> quit The patch allows very limited functionality, for example, it crashes on invalid C++. The design of the proposed patch follows closely the design of cling. The idea is to gather feedback and gradually evolve both clang-repl and cling to what the community agrees upon. The IncrementalParser class is responsible for driving the clang parser and codegen and allows the compiler infrastructure to process more than one input. Every input adds to the “ever-growing” translation unit. That model is enabled by an IncrementalAction which prevents teardown when HandleTranslationUnit. The IncrementalExecutor class hides some of the underlying implementation details of the concrete JIT infrastructure. It exposes the minimal set of functionality required by our incremental compiler/interpreter. The Transaction class keeps track of the AST and the LLVM IR for each incremental input. That tracking information will be later used to implement error recovery. The Interpreter class orchestrates the IncrementalParser and the IncrementalExecutor to model interpreter-like behavior. It provides the public API which can be used (in future) when using the interpreter library. Differential revision: https://reviews.llvm.org/D96033	2021-05-13 04:23:24 +00:00
Matthias Springer	2a51e9ff2e	[mlir] Support memref layout maps in vector transfer ops Differential Revision: https://reviews.llvm.org/D102042	2021-05-13 13:22:21 +09:00
Matthias Springer	9b77be5583	[mlir] Unrolled progressive-vector-to-scf. Instead of an SCF for loop, these patterns generate fully unrolled loops with no temporary buffer allocations. Differential Revision: https://reviews.llvm.org/D101981	2021-05-13 13:08:48 +09:00
Matthias Springer	864adf399e	[mlir] Allow empty position in vector.insert and vector.extract Such ops are no-ops and are folded to their respective `source`/`vector` operand. Differential Revision: https://reviews.llvm.org/D101879	2021-05-13 12:54:18 +09:00
Matthias Springer	c52cbe63e4	[mlir] Fix masked vector transfer ops with broadcasts Broadcast dimensions of a vector transfer op have no corresponding dimension in the mask vector. E.g., a 2-D TransferReadOp, where one dimension is a broadcast, can have a 1-D `mask` attribute. This commit also adds a few additional transfer op integration tests for various combinations of broadcasts, masking, dim transposes, etc. Differential Revision: https://reviews.llvm.org/D101745	2021-05-13 12:46:03 +09:00
Chen Zheng	a0ca4c46ca	[Debug-Info] add -gstrict-dwarf support in backend Reviewed By: dblaikie, probinson Differential Revision: https://reviews.llvm.org/D100826	2021-05-12 23:00:52 -04:00
Matthias Springer	6555e53ab0	Revert "[mlir] Fix masked vector transfer ops with broadcasts" This reverts commit `c9087788f7`. Accidentally pushed old version of the commit.	2021-05-13 11:55:00 +09:00



388263 Commits

All Branches