llvm-project

Commit Graph

Author	SHA1	Message	Date
Jean Perier	2e65c8e8db	[flang] Allow write after non advancing read in IO runtime 1. To avoid overwriting the part of the record read in the non advancing read, the furtherPositionInRecord field must be set to the max of the furtherPositionInRecord and the positionInRecord at the beginning of the IO write. 2. To allow any further read to succeed after the write, the unit beganReadingRecord_ must be set to false when resetting the recordLength during the write, otherwise, recordLength will not be computed in further read and an assert is hit (at unit.cpp(398)). The added unit test exercises both of these scenarios. Differential Revision: https://reviews.llvm.org/D113740	2021-11-16 14:53:39 +01:00
Zbigniew Sarbinowski	422cf2b506	[SystemZ][z/OS] Fix warnings from unsigned int to long in 32-bit mode This patch fixes the warnings which shows up when libcxx library started to be compiled in 32-bit mode on z/OS. More specifically, the assignment from unsigned int to time_t aka long was flags as follows: ``` libcxx/include/c++/v1/__support/ibm/nanosleep.h:31:11: warning: implicit conversion changes signedness: 'unsigned int' to 'time_t' (aka 'long') [-Wsign-conversion] __sec = sleep(static_cast<unsigned int>(__sec)); ~ ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ libcxx/include/c++/v1/__support/ibm/nanosleep.h:36:36: warning: implicit conversion changes signedness: 'unsigned int' to 'long' [-Wsign-conversion] __rem->tv_nsec = __micro_sec * 1000; ~ ~~~~~~~~~~~~^~~~~~ libcxx/include/c++/v1/__support/ibm/nanosleep.h:47:36: warning: implicit conversion changes signedness: 'unsigned int' to 'long' [-Wsign-conversion] __rem->tv_nsec = __micro_sec * 1000; ~ ~~~~~~~~~~~~^~~~~~ 3 warnings generated. ``` Here is a small test case illustrating the issue: ``` typedef long time_t ; unsigned int sleep(unsigned int ); int main() { time_t sec = 0; #ifdef FIX sec = static_cast<time_t>(sleep(static_cast<unsigned int>(sec))); #else sec = sleep(static_cast<unsigned int>(sec)); #endif } ``` clang++ -c -Wsign-conversion -m32 t.C ``` t.C:8:9: warning: implicit conversion changes signedness: 'unsigned int' to 'time_t' (aka 'long') [-Wsign-conversion] sec = sleep(static_cast<unsigned int>(sec)); ~ ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Reviewed By: ldionne, #libc, Quuxplusone, Mordante Differential Revision: https://reviews.llvm.org/D112837	2021-11-16 13:51:35 +00:00
Andrzej Warzynski	6c3d7fd4c5	[flang][CodeGen] Transform `fir.boxchar_len` to a sequence of LLVM MLIR This patch extends the `FIRToLLVMLowering` pass in Flang by adding a hook to transform `fir.boxchar_len` to a sequence of LLVM MLIR instructions. This is part of the upstreaming effort from the `fir-dev` branch in [1]. [1] https://github.com/flang-compiler/f18-llvm-project Differential Revision: https://reviews.llvm.org/D113763 Originally written by: Co-authored-by: Eric Schweitz <eschweitz@nvidia.com>	2021-11-16 13:49:23 +00:00
Alexey Bataev	aa9bbb64be	[SLP]Adjust GEP indices types when trying to build entries. Need to adjust the types of GEPs indices when building the tree entries/operands. Otherwise some of the nodes might differ and vectorizer is unable to correctly find them and count their cost. Differential Revision: https://reviews.llvm.org/D113792	2021-11-16 05:44:33 -08:00
Alexey Bataev	51c0b6843a	[SLP][NFC]Add more tests for shuffles that can be optimized after SLP, NFC.	2021-11-16 05:42:18 -08:00
Valentin Clement	3124618704	[fir] Add fir.gentypedesc conversion Add conversion pattern for the GenTypeDescOp. Convert to a global constant with an addressof. This patch is part of the upstreaming effort from fir-dev branch. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D113766 Co-authored-by: Eric Schweitz <eschweitz@nvidia.com> Co-authored-by: Jean Perier <jperier@nvidia.com>	2021-11-16 14:41:00 +01:00
Joseph Huber	374cd0fb61	[OpenMP] Fix initializer not working on AMDGPU The RAII class used for debugging RTL entry used a shared variable to keep track of the current depth. This used a global initializer, which isn't supported on AMDGPU. This patch removes the initializer and instead sets it to zero when the state is initialized in the runtime. Reviewed By: jdoerfert, JonChesterfield Differential Revision: https://reviews.llvm.org/D113963	2021-11-16 08:17:15 -05:00
Christian Kühnel	d2da1a2f40	[NFC][clangd] cleaning up unused "using" Cleaning up unused "using" declarations. This patch was generated from automatically applyning clang-tidy fixes. Differential Revision: https://reviews.llvm.org/D113891	2021-11-16 13:09:49 +00:00
Florian Hahn	97b9b6f565	[llvm-reduce] Add new BitWriter dependency after `28d95a2610`.	2021-11-16 12:48:21 +00:00
Sander.DeSmalen@arm.com	305816ff1e	[IndVarSimplify] Reduce nondeterministic behaviour in visitIVCast. rGf39978b84f1d3a1da6c32db48f64c8daae64b3ad led to and/or exposed an issue with IndVarSimplification for a loop where a i32 phi node is no longer replaced by a widened (i64) phi node, because the SCEVs of a sign-extend no longer folded the same way. I'm unsure how to properly explain this because it's all rather complicated, but in short: SCEVs don't fold as nicely as they used to and this caused a difference. While investigating this, I found that IndVarSimplify can actually optimise the case in the way we want to if it chooses the widened IV to be 'signed' (the i32 IV is both sign and zero-extended). Oddly enough, there is some level of indeterminism in the way the algorithm works, it just picks the sign of the 'first' zext/sext user, where the order of the users-iterator is not guaranteed to be the same on each invocation of the pass (e.g. shown by first running loop-rotate, which puts the users in a different order). While I think the fix is valid in the sense that consistently picking _any_ order is better than having an nondeterministic order, I can use a bit of advice from people more familiar in this area of the code-base. For example, I'm not sure if this fix is hiding another issue where the IndVarSimplify pass could actually draw the same conclusions (i.e. that it only needs an i64 phi node) if it does a bit more work, regardless of whether it chooses the induction variable to be signed or unsigned. I'm also not sure if choosing signed is better than unsigned, or whether that just happens to be beneficial only in this individual case. Any feedback would be much appreciated! Reviewed By: reames Differential Revision: https://reviews.llvm.org/D112573	2021-11-16 12:41:04 +00:00
Florian Hahn	28d95a2610	[llvm-reduce] Allow writing temporary files as bitcode. Textual LLVM IR files are much bigger and take longer to write to disk. To avoid the extra cost incurred by serializing to text, this patch adds an option to save temporary files as bitcode instead. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D113858	2021-11-16 12:39:42 +00:00
Diana Picus	f1dfc0275c	[fir] Add fir.cmpc conversion This patch adds the codegen for fir.cmpc. The real and imaginary parts are extracted and compared separately. For the .EQ. predicate the results are AND'd, for the .NE. predicate the results are OR'd, and for other predicates we keep only the result on the real parts. This patch is part of the upstreaming effort from fir-dev. Differential Revision: https://reviews.llvm.org/D113976 Co-authored-by: Eric Schweitz <eschweitz@nvidia.com> Co-authored-by: Jean Perier <jperier@nvidia.com>	2021-11-16 12:26:27 +00:00
Fabian Wolff	738e7f1231	Fix false positive in `bugprone-throw-keyword-missing` check Fixes PR#52400. The tests for bugprone-throw-keyword-missing actually already contain exceptions as class members, but not as members with initializers, which was probably just an oversight.	2021-11-16 07:09:17 -05:00
Pavel Labath	098c01c132	[lldb] Refactor Platform::ResolveExecutable Module resolution is probably the most complex piece of lldb [citation needed], with numerous levels of abstraction, each one implementing various retry and fallback strategies. It is also a very repetitive, with only small differences between "host", "remote-and-connected" and "remote-but-not-(yet)-connected" scenarios. The goal of this patch (first in series) is to reduce the number of abstractions, and deduplicate the code. One of the reasons for this complexity is the tension between the desire to offload the process of module resolution to the remote platform instance (that's how most other platform methods work), and the desire to keep it local to the outer platform class (its easier to subclass the outer class, and it generally makes more sense). This patch resolves that conflict in favour of doing everything in the outer class. The gdb-remote (our only remote platform) implementation of ResolveExecutable was not doing anything gdb-specific, and was rather similar to the other implementations of that method (any divergence is most likely the result of fixes not being applied everywhere rather than intentional). It does this by excising the remote platform out of the resolution codepath. The gdb-remote implementation of ResolveExecutable is moved to Platform::ResolveRemoteExecutable, and the (only) call site is redirected to that. On its own, this does not achieve (much), but it creates new opportunities for layer peeling and code sharing, since all of the code now lives closer together. Differential Revision: https://reviews.llvm.org/D113487	2021-11-16 12:52:51 +01:00
Butygin	6c48f6aafe	[mlir][spirv] add AtomicFAddEXTOp Differential Revision: https://reviews.llvm.org/D113764	2021-11-16 14:24:22 +03:00
Florian Hahn	b7aec4f08e	[SCEV] Support rewriting ZExt expressions with loop guard info. So far, applying loop guard information has been restricted to SCEVUnknown. In a few cases, like PR40961 and PR52464, this leads to SCEV failing to determine tight upper bounds for the backedge taken count. This patch adjusts SCEVLoopGuardRewriter and applyLoopGuards to support re-writing ZExt expressions. This is a first step towards fixing PR40961 and PR52464. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D113577	2021-11-16 11:16:07 +00:00
Matt Devereau	f526c600c0	[AArch64][SVE] Instcombine SVE LD1/ST1 to stock LLVM IR InstCombine AArch64 LD1/ST1 to llvm.masked.load/llvm.masked.store and LD1/ST1 to load/store when a ptrue all predicate pattern operand is present. This allows existing IR optimizations such as dead-load removal to occur. Differential Revision: https://reviews.llvm.org/D113489	2021-11-16 11:10:23 +00:00
Frederik Gossen	3f3d4e8a15	Fix unused variable warning in LoadStoreOpt.cpp with (void)	2021-11-16 12:03:59 +01:00
Frederik Gossen	2bceb7c8da	Revert "Fix unused variable in llvm/lib/CodeGen/GlobalISel/LoadStoreOpt.cpp" This reverts commit `40a609aebe`.	2021-11-16 12:00:17 +01:00
Frederik Gossen	ecfe7a3404	Revert "Fix unused variable warning." This reverts commit `a062e2a8ca`.	2021-11-16 11:59:34 +01:00
Frederik Gossen	9a6817b7ed	Revert "Fix another unused variable error." This reverts commit `5b84ae7c48`.	2021-11-16 11:58:02 +01:00
Pavel Labath	669e57ebd1	[lldb] Simplify specifying of platform supported architectures The GetSupportedArchitectureAtIndex pattern forces the use of complicated patterns in both the implementations of the function and in the various callers. This patch creates a new method (GetSupportedArchitectures), which returns a list (vector) of architectures. The GetSupportedArchitectureAtIndex is kept in order to enable incremental rollout. Base Platform class contains implementations of both of these methods, using the other method as the source of truth. Platforms without infinite stacks should implement at least one of them. This patch also ports Linux, FreeBSD and NetBSD platforms to the new API. A new helper function (CreateArchList) is added to simplify the common task of creating a list of ArchSpecs with the same OS but different architectures. Differential Revision: https://reviews.llvm.org/D113608	2021-11-16 11:43:48 +01:00
Pavel Labath	33c0f93f6c	[lldb/test] Move gdb client utils into the packages tree This infrastructure has proven proven its worth, so give it a more prominent place. My immediate motivation for this is the desire to reuse this infrastructure for qemu platform testing, but I believe this move makes sense independently of that. Moving this code to the packages tree will allow as to add more structure to the gdb client tests -- currently they are all crammed into the same test folder as that was the only way they could access this code. I'm splitting the code into two parts while moving it. The first once contains just the generic gdb protocol wrappers, while the other one contains the unit test glue. The reason for that is that for qemu testing, I need to run the gdb code in a separate process, so I will only be using the first part there. Differential Revision: https://reviews.llvm.org/D113893	2021-11-16 11:35:56 +01:00
Adrian Kuegel	5b84ae7c48	Fix another unused variable error.	2021-11-16 11:32:44 +01:00
Butygin	526b71e44a	[mlir] spirv: Add scf.while spirv conversion * It works similar to scf.for coversion, but convert condition and yield ops as part of scf.whille pattern so it don't need to maintain external state Differential Revision: https://reviews.llvm.org/D113007	2021-11-16 13:19:34 +03:00
Adrian Kuegel	a062e2a8ca	Fix unused variable warning.	2021-11-16 11:17:33 +01:00
Adrian Kuegel	921d91f3ac	[mlir] Support multi-dimensional vectors in MathToLibm conversion. Differential Revision: https://reviews.llvm.org/D113969	2021-11-16 11:13:52 +01:00
Arnab Dutta	1402299271	[MLIR] Simplify semi-affine expressions using flattening For the semi affine expressions, whenever rhs of a floordiv, ceildiv, mod or product expression is a symbolic expression, we introduce a local variable representing the result, and store the floordiv/ceildiv, mod or product affine expression in LocalExprs. In this way the expression is flattened, and trivial addition and subtraction related simplifications are performed. Also rule based matching for detecting and transforming "expr - q * (expr floordiv q)" to "expr mod q", where q is a symbolic exxpression, in simplifyAdd function. Differential Revision: https://reviews.llvm.org/D112808	2021-11-16 15:42:22 +05:30
Frederik Gossen	40a609aebe	Fix unused variable in llvm/lib/CodeGen/GlobalISel/LoadStoreOpt.cpp	2021-11-16 11:05:18 +01:00
David Green	309f1e4ac8	[ARM] Add datalayout to costmodel tests. NFC This adds a sensible datalayout to the ARM cost model tests, to prevent the costs reported being incorrect for the size of pointers.	2021-11-16 09:49:42 +00:00
Groverkss	11462a82c5	[MLIR] FlatAffineConstraints: Allow extraction of explicit representation of local variables This patch extends the existing functionality of computing an explicit representation for local variables, to also get the explicit representation, instead of only the inequality pairs. This is required for a future patch to remove redundant local ids based on their explicit representation. Reviewed By: arjunp Differential Revision: https://reviews.llvm.org/D113814	2021-11-16 14:51:06 +05:30
Kai Luo	c0da8a4e40	[CGP][PowerPC] Pre-commit test case for D113872. NFC.	2021-11-16 09:18:49 +00:00
Amara Emerson	dcd8728d83	Remove unnecessary <any> include.	2021-11-16 00:50:30 -08:00
Carlos Galvez	9699c0fea3	[clang-tidy][NFC] Simplify ClangTidyStats - Use NSDMI and remove constructor. Differential Revision: https://reviews.llvm.org/D113847	2021-11-16 07:36:05 +00:00
jacquesguan	6405e8b584	[RISCV] Refactor some rvv instructions' definition with foreach. Simplify rvv instructions that use eew in their mnemonic and encoding with foreach. And fix a scheduling bug. Differential Revision: https://reviews.llvm.org/D113453	2021-11-16 15:20:45 +08:00
Dmitry Vyukov	87261caa55	tsan: use pthread_equal instead of direct pthread_t comparison man pthread_equal: The pthread_equal() function is necessary because thread IDs should be considered opaque: there is no portable way for applications to directly compare two pthread_t values. Depends on D113916. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D113919	2021-11-16 07:51:24 +01:00
Dmitry Vyukov	64b45399e5	tsan: speed up pthread_setname_np pthread_setname_np does linear search over all thread descriptors to map pthread_t to the thread descriptor. This has O(N^2) complexity and becomes much worse in the new tsan runtime that keeps all ever existed threads in the thread registry. Replace linear search with direct access if pthread_setname_np is called for the current thread (a very common case). Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D113916	2021-11-16 07:51:08 +01:00
LLVM GN Syncbot	b85f97bc00	[gn build] Port `2e6ae1d3f2`	2021-11-16 06:15:54 +00:00
Chuanqi Xu	2e6ae1d3f2	[libcxx] [Coroutine] Conform Coroutine Implementation Since coroutine is merged in C++ standard and the support for coroutine seems relatively stable. It's the time to move the implementation of coroutine out of the experimental directory and the std::experimental namespace. This patch creates header <coroutine> with conformed implementation with C++ standard. To avoid breaking user's code too fast, the <experimental/coroutine> header is remained. Note that <experimental/coroutine> is deprecated and it would be removed in LLVM15. Reviewed By: Quuxplusone, ldionne Differential Revision: https://reviews.llvm.org/D109433	2021-11-16 14:13:13 +08:00
Mehdi Amini	1585b13024	Revert "[MLIR][LLVM] Permit integer types in switch other than i32" This reverts commit `94992670fc`. Build is broken with: tools/mlir/include/mlir/Dialect/LLVMIR/LLVMOps.cpp.inc:23996:3: error: no matching function for call to 'printSwitchOpCases' printSwitchOpCases(_odsPrinter, *this, getValue().getType(), getCaseValuesAttr(), getCaseDestinations(), getCaseOperands(), getCaseOperands().getTypes()); ^~~~~~~~~~~~~~~~~~	2021-11-16 05:59:12 +00:00
Serguei Katkov	0ecb12a27f	[STATEPOINT] Force implicit-def for lr register. STATEPOINT instruction behavior is similar to call instruction. In aarch64 BL instruction implicitly define lr register, so STATEPOINT instruction should do the same. However STATEPOINT is a general pseudo instruction and I could not find a way to override list of implicit defs for specific target. So this patch post processes inserting STATEPOINT instruction by adding implisit dead def for lr. Reviewers: reames, loicottet, ostannard Reviewed By: reames Subscribers: danilaml, hiraditya, kristof.beyls, llvm-commits, yrouban Differential Revision: https://reviews.llvm.org/D111114	2021-11-16 12:52:00 +07:00
William S. Moses	94992670fc	[MLIR][LLVM] Permit integer types in switch other than i32 LLVM switchop currently only permits i32. Both LLVM IR and MLIR Standard switch permit other integer types leading to an illegal state when lowering an i8 switch from MLIR standard Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D113955	2021-11-16 00:46:25 -05:00
Kazu Hirata	7f00806a6a	[llvm] Use make_early_inc_range (NFC)	2021-11-15 21:28:46 -08:00
LLVM GN Syncbot	582352d488	[gn build] Port `dc84770d55`	2021-11-16 05:11:06 +00:00
Amara Emerson	dc84770d55	[GlobalISel] Add a store-merging optimization pass and enable for AArch64. This is a first attempt at a constant value consecutive store merging pass, a counterpart to the DAGCombiner's store merging optimization. The high level goals of this pass: * Have a simple and efficient algorithm. As close to linear time as we can get. Thus, prioritizing scalability of the algorithm over merging every corner case we can find. The DAGCombiner's store merging code has been the source of compile time and complexity issues in the past and I wanted to avoid that. * Don't introduce any new data structures for ordering memory operations. In MIR, we don't have the concept of chains like we do in the DAG, and the instruction order is stricter than enforcing ordering with graph edges. Although I considered adding something similar, I couldn't justify the overhead. The pass is current split into 3 main parts. The main store merging code focuses on identifying candidate stores and managing the candidate group that's under consideration for merging. Analyzing addressing of stores is a potentially complex part and for now there's just a basic implementation to identify easy cases. Finally, the other main bit of complexity is the alias analysis, which tries to follow the same logic as the DAG's AA. Currently this implementation only supports merging of constant stores. Stores of arbitrary variables are technically possible with a very small change, but the DAG chooses not to do this. Doing so here makes most code worse since there's extra overhead in merging values into wider registers. On AArch64 -Os, this optimization results in very minor savings on CTMark. Differential Revision: https://reviews.llvm.org/D109131	2021-11-15 21:10:39 -08:00
Wenlei He	f7976edc1e	[llvm-profgen] Add switch to allow use of first loadable segment for calculating offset Adding `-use-loadable-segment-as-base` to allow use of first loadable segment for calculating offset. By default first executable segment is used for calculating offset. The switch helps compatibility with unsymbolized profile generated from older tools. Differential Revision: https://reviews.llvm.org/D113727	2021-11-15 19:00:27 -08:00
Greg Clayton	dbd36e1e9f	Add the stop count to "statistics dump" in each target's dictionary. It is great to know how many times the target has stopped over its lifetime as each time the target stops, and possibly resumes without the user seeing it for things like shared library loading and signals that are not notified and auto continued, to help explain why a debug session might be slow. This is now included as "stopCount" inside each target JSON. Differential Revision: https://reviews.llvm.org/D113810	2021-11-15 18:59:09 -08:00
Craig Topper	391b0ba603	[RISCV] Override TargetLowering::hasAndNot for Zbb. Differential Revision: https://reviews.llvm.org/D113937	2021-11-15 18:44:07 -08:00
Craig Topper	d90eeab0ed	[RISCV] Add test cases to prepare for overring TargetLowering::hasAndNot. NFC These test files are copied directly from AArch64. Some of the cases may benefit from ANDN with the Zbb extension. Somes cases already improve use ANDN. selectcc-to-shiftand.ll also contains tests that test select->and conversion even when a ANDN isn't needed. I think this improves our coverage of these optimizations. Differential Revision: https://reviews.llvm.org/D113935	2021-11-15 18:44:07 -08:00
Fabian Wolff	b484fa8289	[X86] Fix crash with inline asm using wrong register name Fixes PR#48678. `X86TargetLowering::getRegForInlineAsmConstraint()` can adjust the register class to match the type, e.g. change `VR128X` to `VR256X` if the type needs 256 bits. However, the function currently returns the unadjusted register and the adjusted register class, e.g. `xmm15` and `VR256X`, which then causes an assertion failure later because the register class does not contain that register. This patch fixes this behavior. Reviewed By: pengfei Differential Revision: https://reviews.llvm.org/D113834	2021-11-16 10:38:12 +08:00

... 3 4 5 6 7 ...

405068 Commits All Branches Search

405068 Commits

All Branches