llvm-project

Commit Graph

Author	SHA1	Message	Date
Stelios Ioannou	936c777e2b	[AArch64] Adds a pre-indexed paired Load/Store optimization for LDR-STR. This patch merges STR<S,D,Q,W,X>pre-STR<S,D,Q,W,X>ui and LDR<S,D,Q,W,X>pre-LDR<S,D,Q,W,X>ui instruction pairs into a single STP<S,D,Q,W,X>pre and LDP<S,D,Q,W,X>pre instruction, respectively. For each pair, there is a MIR test that verifies this optimization. Differential Revision: https://reviews.llvm.org/D99272 Change-Id: Ie97a20c8c716c08492fe229c22e14e3c98ef08b7	2021-04-30 17:29:58 +01:00
Nathan Sidwell	e90792d8c7	[clang] Update comments on another libstdc++ HACK Document relevant gcc versions and dates. Differential Revision: https://reviews.llvm.org/D101530	2021-04-30 09:29:26 -07:00
Peter Steinfeld	8989268dae	[flang] Allow KIND type parameters to be used as LEN parameters of components When producing the runtime type information for a component of a derived type that had a LEN type parameter, we were not allowing a KIND parameter of the derived type. This was causing one of the NAG correctness tests to fail (.../hibiya/d5.f90). I added a test to our own test suite to check for this. Also, I fixed a typo in .../module/__fortran_type_info.f90. I allowed KIND type parameters to be used for the declarations of components that use LEN parameters by constant folding the value of the LEN parameter. To make the constant folding work, I had to put the semantics::DerivedTypeSpec of the associated derived type into the folding context. To get this semantics::DerivedTypeSpec, I changed the value of the semantics::Scope object that was passed to DescribeComponent() to be the derived type scope rather than the containing non-derived type scope. This scope change, in turn, caused differences in the symbol table output that is checked in typeinfo01.f90. Most of these differences were in the order that the symbols appeared in the dump. But one of them changed one of the values from "CHARACTER(2_8,1)" to "CHARACTER(1_8,1)". I'm not sure if these changes are significant. Please verify that the results of this test are still valid. Also, I wonder if there are other situations in this code where we should be folding constants. For example, what if the field of a component has a component whose type is a PDT with a LEN type parameter, and the component's declaration depends on the KIND type parameter of the current PDT. Here's an example: type string(stringkind) integer,kind :: stringkind character(stringkind) :: value end type string type outer(kindparam) integer,kind :: kindparam type(string(kindparam)) :: field end type outer I don't understand the code or what it's trying to accomplish well enough to figure out if such cases are correctly handled by my new code. Differential Revision: https://reviews.llvm.org/D101482	2021-04-30 09:05:05 -07:00
Vince Bridgers	a27af1d816	[analyzer] Fix assertion in SVals.h Fix assertion in SVals.h apparently caused by https://reviews.llvm.org/D89055. clang:clang/include/clang/StaticAnalyzer/Core/PathSensitive/SVals.h:596: clang::ento::loc::MemRegionVal::MemRegionVal(const clang::ento::MemRegion *): Assertion `r' failed. Backtrace: ... clang/include/clang/StaticAnalyzer/Core/PathSensitive/SVals.h:597:3 clang::QualType, clang::QualType) clang/lib/StaticAnalyzer/Core/SValBuilder.cpp:773:18 clang::QualType, clang::QualType) clang/lib/StaticAnalyzer/Core/SValBuilder.cpp:612:12 clang::QualType) clang/lib/StaticAnalyzer/Core/SValBuilder.cpp:587:12 namespace)::RegionBindingsRef const&, clang::ento::Loc, clang::QualType) clang/lib/StaticAnalyzer/Core/RegionStore.cpp:1510:24 ... Reviewed By: ASDenysPetrov Differential Revision: https://reviews.llvm.org/D101635	2021-04-30 11:00:43 -05:00
Bradley Smith	62e9c7601a	[AArch64][SVE] Remove unused function missed from D101302 The functionality in SVEIntrinsicOpts::isReinterpretToSVBool was moved in D101302, however the original now unused function was not removed (NFC). Differential Revision: https://reviews.llvm.org/D101642	2021-04-30 16:57:09 +01:00
David Spickett	44d0ad53af	[lldb] Change DumpDataExtractorTest function names to lldb style (NFC)	2021-04-30 16:55:34 +01:00
Tomas Matheson	c7df6b1223	Revert "[CodeGen][ARM] Implement atomicrmw as pseudo operations at -O0" This reverts commit `3338290c18`. Broke expensive checks on debian.	2021-04-30 16:53:14 +01:00
David Spickett	8da5d111a5	[lldb] DumpDataExtractor tests for item byte size errors Reviewed By: teemperor Differential Revision: https://reviews.llvm.org/D101631	2021-04-30 16:49:04 +01:00
Tomas Matheson	3338290c18	[CodeGen][ARM] Implement atomicrmw as pseudo operations at -O0 atomicrmw instructions are expanded by AtomicExpandPass before register allocation into cmpxchg loops. Register allocation can insert spills between the exclusive loads and stores, which invalidates the exclusive monitor and can lead to infinite loops. To avoid this, reimplement atomicrmw operations as pseudo-instructions and expand them after register allocation. Floating point legalisation: f16 ATOMIC_LOAD_FADD(f16, f16) is legalised to f32 ATOMIC_LOAD_FADD(i16, f32) and then eventually f32 ATOMIC_LOAD_FADD_16(*i16, f32) Differential Revision: https://reviews.llvm.org/D101164	2021-04-30 16:40:33 +01:00
Paul C. Anagnostopoulos	985ab6e1fa	[TableGen] Fix two bugs in 'defm' when complex 'assert' is involved. This patch fixes two bugs that arise when a 'defm' inherits from a multiclass and also from a class with assertions. Differential Revision: https://reviews.llvm.org/D101626	2021-04-30 11:31:06 -04:00
Konstantin Zhuravlyov	c9c4676a45	AMDGPU/llvm-readobj: Add missing tests for note parsing/displaying This is a follow up review/change for https://reviews.llvm.org/D95638 Add valid note tests for code object v2 notes: - NT_AMD_HSA_CODE_OBJECT_VERSION (required yaml2obj update) - NT_AMD_HSA_HSAIL (required yaml2obj update) - NT_AMD_HSA_ISA_VERSION (required yaml2obj update) - NT_AMD_HSA_METADATA - NT_AMD_HSA_ISA_NAME - NT_AMD_PAL_METADATA Add valid note tests for code object v3 notes: - NT_AMDGPU_METADATA Add invalid note tests for code object v2 notes: - NT_AMD_HSA_CODE_OBJECT_VERSION (required yaml2obj update) - NT_AMD_HSA_HSAIL (required yaml2obj update) - NT_AMD_HSA_ISA_VERSION (required yaml2obj update) Add invalid note tests for code object v3 notes: - NT_AMDGPU_METADATA Differential Revision: https://reviews.llvm.org/D101304	2021-04-30 11:19:16 -04:00
David Spickett	a86cbd4755	[lldb] More tests for DumpDataExtractor * Using a base address or skipping it with LLDB_INVALID_ADDRESS * Using a data offset, which does not effect the printed addresses * Not providing an output stream * Formatting a double sized HexFloat * Formatting over multiple lines Since address printing now has its own test, I've removed the base address from all the format type tests. The multi line tests still use a base address to check that it's incremented correctly for each new line. Reviewed By: teemperor Differential Revision: https://reviews.llvm.org/D101627	2021-04-30 16:16:38 +01:00
Jingu Kang	88b259c014	[SimpleLoopUnswitch] Port partially invariant unswitch from LoopUnswitch to SimpleLoopUnswitch Differential Revision: https://reviews.llvm.org/D99354	2021-04-30 15:55:56 +01:00
Amy Kwan	64d951be61	[PowerPC] Add new infrastructure to select load/store instructions, update P8/P9 load/store patterns. This patch introduces a new infrastructure that is used to select the load and store instructions in the PPC backend. The primary motivation is that the current implementation of selecting load/stores is dependent on the ordering of patterns in TableGen. Given this limitation, we are not able to easily and reliably generate the P10 prefixed load and stores instructions (such as when the immediates that fit within 34-bits). This refactoring is meant to provide us with more control over the patterns/different forms to exploit, as well as eliminating dependency of pattern declaration in TableGen. The idea of this refactoring is that it introduces a set of addressing modes that correspond to different instruction formats of a particular load and store instruction, along with a set of common flags that describes a load/store. Whenever a load/store instruction is being selected, we analyze the instruction and compute a set of flags for it. The computed flags are then used to select the most optimal load/store addressing mode. This patch is the first of a series of patches to be committed - it contains the initial implementation of the refactored load/store selection infrastructure and also updates P8/P9 patterns to adopt this infrastructure. The idea is that incremental patches will add more implementation and support, and eventually the old implementation will be removed. Differential Revision: https://reviews.llvm.org/D93370	2021-04-30 09:53:19 -05:00
Sidharth Baveja	70c433a184	[XCOFF][AIX] Add Global Variables Directly to TOC for 32 bit AIX Summary: This patch implements the backend implementation of adding global variables directly to the table of contents (TOC), rather than adding the address of the variable to the TOC. Currently, this patch will look for the "toc-data" attribute on symbols in the IR, and then add those symbols to the TOC. ATM, this is implemented for 32 bit AIX. Reviewers: sfertile Differential Revision: https://reviews.llvm.org/D101178	2021-04-30 14:48:02 +00:00
Adam Czachorowski	fbfcfdbf68	[clang] Fix assert() crash when checking undeduced arg alignment There already was a check for undeduced and incomplete types, but it failed to trigger when outer type (SubstTemplateTypeParm in test) looked fine, but inner type was not. Differential Revision: https://reviews.llvm.org/D100667	2021-04-30 16:24:33 +02:00
Jay Foad	e2a2df2a1e	[AMDGPU] Add test for set_gpr_idx removal with conditional branches	2021-04-30 15:01:32 +01:00
Dmitry Vyukov	92a3a2dc3e	sanitizer_common: introduce kInvalidTid/kMainTid Currently we have a bit of a mess related to tids: - sanitizers re-declare kInvalidTid multiple times - some call it kUnknownTid - implicit assumptions that main tid is 0 - asan/memprof claim their tids need to fit into 24 bits, but this does not seem to be true anymore - inconsistent use of u32/int to store tids Introduce kInvalidTid/kMainTid in sanitizer_common and use them consistently. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D101428	2021-04-30 15:58:05 +02:00
LLVM GN Syncbot	4978bf65ad	[gn build] Port `43bc584dc0`	2021-04-30 13:48:40 +00:00
Simon Moll	7a86645611	[VE] VP intrinsics are legal	2021-04-30 15:47:55 +02:00
Simon Moll	43bc584dc0	[VP,Integer,#2] ExpandVectorPredication pass This patch implements expansion of llvm.vp.* intrinsics (https://llvm.org/docs/LangRef.html#vector-predication-intrinsics). VP expansion is required for targets that do not implement VP code generation. Since expansion is controllable with TTI, targets can switch on the VP intrinsics they do support in their backend offering a smooth transition strategy for VP code generation (VE, RISC-V V, ARM SVE, AVX512, ..). Reviewed By: rogfer01 Differential Revision: https://reviews.llvm.org/D78203	2021-04-30 15:47:28 +02:00
Jay Foad	181c492ee7	[AMDGPU] Add implicit negative check for the set_gpr_idx tests The only effect of the optimization is to remove s_set_gpr_idx_* instructions, and update_mir_test_checks.py always inserts CHECK: rather than CHECK-NEXT: checks, so without this implicit negative check, the tests would always pass even if the optimization did nothing. Differential Revision: https://reviews.llvm.org/D101622	2021-04-30 14:45:12 +01:00
Anastasia Stulova	3ec82e5195	[OpenCL] Prevent adding vendor extensions for all targets Removed extension begin/end pragma as it has no effect and it is added unconditionally for all targets. Differential Revision: https://reviews.llvm.org/D92244	2021-04-30 14:42:51 +01:00
Nico Weber	a1a2a8e8ac	[lld/mac] Remove unused -L%t flags from tests No behavior change. Differential Revision: https://reviews.llvm.org/D101623	2021-04-30 09:37:02 -04:00
Pooja Yadav	cfb95f6f91	[docs]Added llvm/bindings section Added information about language bindings provided by LLVM. Reviewed By: xgupta, gandhi21299 Differential Revision: https://reviews.llvm.org/D101295	2021-04-30 19:05:22 +05:30
Nico Weber	4b456038e4	[lld/mac] Tweak two comments and fix style on one variable name Cosmetic, no behavior change.	2021-04-30 09:30:51 -04:00
Andrea Di Biagio	8bd4f3d547	[MCA] Fix CarryOver check in the DispatchStage (PR50174). Early exit from method DispatchStage::isAvailable() if the dispatch group is already full. Not all instructions declare at least one uOP. Fixes PR50174.	2021-04-30 14:26:46 +01:00
Florian Hahn	6c31295493	[clang] Refactor mustprogress handling, add it to all loops in c++11+. Currently Clang does not add mustprogress to inifinite loops with a known constant condition, matching C11 behavior. The forward progress guarantee in C++11 and later should allow us to add mustprogress to any loop (http://eel.is/c++draft/intro.progress#1). This allows us to simplify the code dealing with adding mustprogress a bit. Reviewed By: aaron.ballman, lebedev.ri Differential Revision: https://reviews.llvm.org/D96418	2021-04-30 14:13:47 +01:00
Jay Foad	66b8a16cc0	[AMDGPU] Fix inconsistent ---/... in MIR tests and regenerate checks In some cases the lack of --- or ... confused update_mir_test_checks.py into not adding any checks for a function.	2021-04-30 14:10:50 +01:00
Arthur O'Dwyer	6712534ebc	[libc++] [test] Run the clang-format and generated-output checks on the "service" queue As these jobs only run in a couple seconds, and block starting of other jobs, they can run on the "service" queue which doesn't get blocked by other long-running jobs. Differential Revision: https://reviews.llvm.org/D101437	2021-04-30 08:57:03 -04:00
Arthur O'Dwyer	5f51fb3421	[libc++] Minor cleanups in <iterator>. NFCI.	2021-04-30 08:52:58 -04:00
Tomas Matheson	b14a6f06cc	[ARM][MVE] vcreateq lane ordering for big endian Use of bitcast resulted in lanes being swapped for vcreateq with big endian. Fix this by using vreinterpret. No code change for little endian. Adds IR lit test. Differential Revision: https://reviews.llvm.org/D101606	2021-04-30 13:48:05 +01:00
Nathan James	6815037085	[clangd][NFC] Remove unnecessary string captures in lambdas. Due to a somewhat annoying, but necessary, shortfall in -Wunused-lambda-capture, These unused captures aren't warned about. Reviewed By: kadircet Differential Revision: https://reviews.llvm.org/D101611	2021-04-30 13:27:24 +01:00
Hans Wennborg	cbe62f2f2f	Require shell for lld/test/MachO/reproduce.s as a way of not running it on Windows, where the file paths when extracting repro2.tar can become longer than the maximum file length limit (depending on the build dir name) and cause the test to fail. (See https://crbug.com/1204463 for example test failure.)	2021-04-30 14:23:35 +02:00
Martin Probst	b2780cd744	clang-format: [JS] handle "off" in imports Previously, the JavaScript import sorter would ignore `// clang-format off` and `on` comments. This change fixes that. It tracks whether formatting is enabled for a stretch of imports, and then only sorts and merges the imports where formatting is enabled, in individual chunks. This means that there's no meaningful total order when module references are mixed with blocks that have formatting disabled. The alternative approach would have been to sort all imports that have formatting enabled in one group. However that raises the question where to insert the formatting-off block, which can also impact symbol visibility (in particular for exports). In practice, sorting in chunks probably isn't a big problem. This change also simplifies the general algorithm: instead of tracking indices separately and sorting them, it just sorts the vector of module references. And instead of attempting to do fine grained tracking of whether the code changed order, it just prints out the module references text, and compares that to the previous text. Given that source files typically have dozens, but not even hundreds of imports, the performance impact seems negligible. Differential Revision: https://reviews.llvm.org/D101515	2021-04-30 14:18:52 +02:00
Evgeniy Brevnov	7861cb600c	[NARY] Don't optimize min/max if there are side uses (part2) Previous attempt to fix infinite recursion in min/max reassociation was not fully successful (D100170). Newly discovered failing case is due to not properly handled when there is a single use. It should be processed separately from 2 uses case. Reviewed By: mkazantsev Differential Revision: https://reviews.llvm.org/D101359	2021-04-30 19:02:02 +07:00
Alexey Bader	76f84e7729	[Doc] Fix sphinx warnings about wrong code-block format Differential Revision: https://reviews.llvm.org/D101549	2021-04-30 11:40:10 +03:00
Florian Hahn	ed9df5bd2f	[Passes] Run sinking/hoisting in SimplifyCFG earlier. Hoisting and sinking instructions out of conditional blocks enables additional vectorization by: 1. Executing memory accesses unconditionally. 2. Reducing the number of instructions that need predication. After disabling early hoisting / sinking, we miss out on a few vectorization opportunities. One of those is causing a ~10% performance regression in one of the Geekbench benchmarks on AArch64. This patch tires to recover the regression by running hoisting/sinking as part of a SimplifyCFG run after LoopRotate and before LoopVectorize. Note that in the legacy pass-manager, we run LoopRotate just before vectorization again and there's no SimplifyCFG run in between, so the sinking/hoisting may impact the later run on LoopRotate. But the impact should be limited and the benefit of hosting/sinking at this stage should outweigh the risk of not rotating. Compile-time impact looks slightly positive for most cases. http://llvm-compile-time-tracker.com/compare.php?from=2ea7fb7b1c045a7d60fcccf3df3ebb26aa3699e5&to=e58b4a763c691da651f25996aad619cb3d946faf&stat=instructions NewPM-O3: geomean -0.19% NewPM-ReleaseThinLTO: geoman -0.54% NewPM-ReleaseLTO-g: geomean -0.03% With a few benchmarks seeing a notable increase, but also some improvements. Alternative to D101290. Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D101468	2021-04-30 12:23:57 +01:00
Jun Ma	b310dd1501	[AArch64][SVE] Lower index_vector to step_vector As discussed in D100107, this patch first convert index_vector to step_vector, and convert step_vector back to index_vector after LegalizeDAG. Differential Revision: https://reviews.llvm.org/D100816	2021-04-30 19:04:39 +08:00
Roman Lebedev	ba5b015b0d	[InlineCost] CallAnalyzer: use TTI info for extractvalue - they are free (PR50099) It seems incorrect to use TTI data in some places, and override it in others. In this case, TTI says that `extractvalue` are free, yet we bill them. While this doesn't address https://bugs.llvm.org/show_bug.cgi?id=50099 yet, it reduces the cost from 55 to 50 while the threshold is 45. Differential Revision: https://reviews.llvm.org/D101228	2021-04-30 13:55:11 +03:00
Neal (nealsid)	fd89af6880	Wrap edit line configuration calls into helper functions Currently we call el_set directly to configure the editor in the libedit wrapper. There are some cases in which this causes extra casting, but we pass captureless lambdas as function pointers, which should work out of the box. Since el_set takes varargs, if the cast is incorrect or if the cast is not present, it causes a run time failure rather than compile error. This change makes it so a few different types of configuration is done inside a helper function to provide type safety and eliminate that casting. I didn't do all edit line configuration because I'm not sure how important it was in other cases and it might require something more general keep up with libedit's signature. I'm open to suggestions, though. Reviewed By: teemperor, JDevlieghere Differential Revision: https://reviews.llvm.org/D101250	2021-04-30 12:32:29 +02:00
David Stuttard	a67a377014	[AMDGPU] Tidy up some simple expressions for clarity NFC Slight refactor for clarity. Change-Id: Ib25e7f4582c67a7c57f066cfd5382c1405d7d4c5 Differential Revision: https://reviews.llvm.org/D101610	2021-04-30 11:13:54 +01:00
David Stuttard	417b1164c2	[JITLink] Minor fix to avoid Windows compiler warning for static-cast Change-Id: Id0c1d5535b53e2aebe314151c0efa585e763f3f6 Differential Revision: https://reviews.llvm.org/D100093	2021-04-30 11:08:05 +01:00
Keith Walker	109bf25e2c	[AArch64] Change __ARM_FEATURE_FP16FML macro name to __ARM_FEATURE_FP16_FML The "Arm C Language extensions" document (the current version can be found at https://developer.arm.com/documentation/101028/0012/?lang=en) states that the name of the feature test macro for the FP16 FML extension is __ARM_FEATURE_FP16_FML. Differential Revision: https://reviews.llvm.org/D101532	2021-04-30 11:03:15 +01:00
David Spickett	8fdfc1d64c	[lldb] Add tests for DumpDataExtractor formats Covering basic cases where you have 1 item on 1 line. Apart from eFormatCharArray, where using multiple lines highlights the difference between it and eFormatVectorOfChar. Reviewed By: #lldb, teemperor Differential Revision: https://reviews.llvm.org/D101453	2021-04-30 10:29:05 +01:00
Fraser Cormack	1d85b24762	[RISCV][NFC] Merge RV32/RV64 test checks with a common prefix	2021-04-30 09:43:48 +01:00
Fraser Cormack	791766e6d2	[RISCV] Support STEP_VECTOR with a step greater than one DAGCombiner was recently taught how to combine STEP_VECTOR nodes, meaning the step value is no longer guaranteed to be one by the time it reaches the backend for lowering. This patch supports such cases on RISC-V by lowering to other step values to a multiply following the vid.v instruction. It includes a small optimization for common cases where the multiply can be expressed as a shift left. Reviewed By: rogfer01 Differential Revision: https://reviews.llvm.org/D100856	2021-04-30 09:36:18 +01:00
Timm Bäder	95157860ae	[llvm][Support][NFC] Fix fallthrough attribute indentation The attribute does not belong to the if statement before and trips up gcc's indentation checker.	2021-04-30 10:31:31 +02:00
Dmitry Vyukov	b6df852901	tsan: fix fork syscall test Arm64 builders failed with: error: use of undeclared identifier 'SYS_fork' https://lab.llvm.org/buildbot/#/builders/7/builds/2575 Indeed, not all arches have fork syscall. Implement fork via clone on these arches. Differential Revision: https://reviews.llvm.org/D101603	2021-04-30 10:23:34 +02:00
Dominik Montada	97ed1b6036	[GISel] Teach TableGen to check predicates of immediate operands in patterns Reviewed By: dsanders Differential Revision: https://reviews.llvm.org/D91703	2021-04-30 10:18:45 +02:00

... 3 4 5 6 7 ...

387313 Commits All Branches Search

387313 Commits

All Branches