llvm-project

Commit Graph

Author	SHA1	Message	Date
Florian Hahn	8daa338297	[SCEV] Avoid repeated proveNoUnsignedWrapViaInduction calls. At the moment, proveNoUnsignedWrapViaInduction may be called for the same AddRec a large number of times via getZeroExtendExpr. This can have a severe compile-time impact for very loop-heavy code. One one particular workload, LSR takes ~51s without this patch, almost exlusively in proveNoUnsignedWrapViaInduction. With this patch, the time in LSR drops to ~0.4s. If proveNoUnsignedWrapViaInduction failed to prove NUW the first time, it is unlikely to succeed on subsequent tries and the cost doesn't seem to be justified. Besides drastically improving compile-time in some excessive cases, this also has a slightly positive compile-time impact on CTMark: NewPM-O3: -0.07% NewPM-ReleaseThinLTO: -0.08% NewPM-ReleaseLTO-g: -0.06 https://llvm-compile-time-tracker.com/compare.php?from=b435da027d7774c24cdb8c88d09f6b771e07fb14&to=f2729e33e8284b502f6c35a43345272252f35d12&stat=instructions Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D130648	2022-07-28 10:02:19 +01:00
Haojian Wu	6f6c40a875	[pseudo] Eliminate the false `::` nested-name-specifier ambiguity The solution is to favor the longest possible nest-name-specifier, and drop other alternatives by using the guard, per per C++ [basic.lookup.qual.general]. Motivated cases: ``` Foo::Foo() {}; // the constructor can be parsed as: // - Foo ::Foo(); // where the first Foo is return-type, and ::Foo is the function declarator // + Foo::Foo(); // where Foo::Foo is the function declarator ``` ``` void test() { // a very slow parsing case when there are many qualifers! X::Y::Z; // The statement can be parsed as: // - X ::Y::Z; // ::Y::Z is the declarator // - X::Y ::Z; // ::Z is the declarator // + X::Y::Z; // a declaration without declarator (X::Y::Z is decl-specifier-seq) // + X::Y::Z; // a qualifed-id expression } ``` Differential Revision: https://reviews.llvm.org/D130511	2022-07-28 11:01:15 +02:00
Martin Storsjö	dc95d0c525	[clang-tidy] Add CLANG_TIDY_CONFUSABLE_CHARS_GEN cmake cache variable to avoid building when cross compiling This is similar to the LLVM_TABLEGEN, CLANG_TABLEGEN and CLANG_PSEUDO_GEN cmake cache variables. Differential Revision: https://reviews.llvm.org/D129799	2022-07-28 12:00:21 +03:00
Martin Storsjö	18b4a8bcf3	[clang-tidy] Rename the make-confusable-table executable Rename it to clang-tidy-confusable-chars-gen, to make its role clearer in a wider context. In cross builds, the caller might want to provide this tool externally (to avoid needing to rebuild it in the cross build). In such a case, having the tool properly namespaced makes its role clearer. This matches how the clang-pseudo-gen tool was renamed in `a43fef05d4` / D126725. Differential Revision: https://reviews.llvm.org/D129798	2022-07-28 12:00:20 +03:00
Alexander Belyaev	824954a8c9	[mlir] Small stylistic changes to Complex_NumberAttr Differential Revision: https://reviews.llvm.org/D130632	2022-07-28 10:59:52 +02:00
Kirill Okhotnikov	c78144e1c7	[libc][math] Improved performance of exp2f function. New exp2 function algorithm: 1) Improved performance: 8.176 vs 15.270 by core-math perf tool. 2) Improved accuracy. Only two special values left. 3) Lookup table size reduced twice. Differential Revision: https://reviews.llvm.org/D129005	2022-07-28 10:57:16 +02:00
David Spickett	a0ccba5e19	[llvm] Fix some test failures with EXPENSIVE_CHECKS and libstdc++ DebugLocEntry assumes that it either contains 1 item that has no fragment or many items that all have fragments (see the assert in addValues). When EXPENSIVE_CHECKS is enabled, _GLIBCXX_DEBUG is defined. On a few machines I've checked, this causes std::sort to call the comparator even if there is only 1 item to sort. Perhaps to check that it is implemented properly ordering wise, I didn't find out exactly why. operator< for a DbgValueLoc will crash if this happens because the optional Fragment is empty. Compiler/linker/optimisation level seems to make this happen or not. So I've seen this happen on x86 Ubuntu but the buildbot for release EXPENSIVE_CHECKS did not have this issue. Add an explicit check whether we have 1 item. Reviewed By: aprantl Differential Revision: https://reviews.llvm.org/D130156	2022-07-28 08:53:38 +00:00
Daniel Bertalan	d1e40f4d58	[lld-macho] Add LOH_ARM64_ADRP_ADD_LDR optimization hint support This hint instructs the linker to optimize an adrp+add+ldr sequence used for loading from a local symbol's address by loading directly if it's close enough, or with an adrp(p)+ldr sequence if it's not. This transformation is the same as what's done for ADRP_LDR_GOT_LDR when the symbol is local. The logic for acting on this hint is therefore moved to a new function which will be called from the existing applyAdrpLdrGotLdr() function. Differential Revision: https://reviews.llvm.org/D130505	2022-07-28 10:45:28 +02:00
Matthias Springer	c1e6caac70	[mlir][transform] Support results on ForeachOp Handles can be yielded from the ForeachOp. Differential Revision: https://reviews.llvm.org/D130640	2022-07-28 10:39:54 +02:00
Nikolas Klauser	d5a3cc1d88	[libc++] Fix merge-conflict in .clang-format	2022-07-28 10:32:02 +02:00
LLVM GN Syncbot	3f6c6e94d6	[gn build] Port `e01b4fe956`	2022-07-28 08:23:10 +00:00
Nikolas Klauser	e01b4fe956	[libc++] Fix unwrapping ranges with different iterators and sentinels Reviewed By: ldionne, huixie90, #libc Spies: arichardson, sstefan1, libcxx-commits, mgorny Differential Revision: https://reviews.llvm.org/D129040	2022-07-28 10:22:41 +02:00
Daniel Bertalan	f2c7f75f61	[lld-macho] Support creating N_SO stab for DWARF5 compile units In DWARF5, the `DW_AT_name` and `DW_AT_comp_dir` attributes are encoded using the `strx*` forms, which specify an index into `__debug_str_offs`. This commit adds that section to DwarfObject, so the debug info parser can resolve these references. The test case was manually adapted from stabs-icf.s. Fixes #51668 Differential Revision: https://reviews.llvm.org/D130559	2022-07-28 09:58:26 +02:00
LLVM GN Syncbot	7fac9c9141	[gn build] Port `8a61749f76`	2022-07-28 07:43:55 +00:00
Gaurav Shukla	7d6ef5caef	[mlir][tensor] Fold `tensor.cast` into `tensor.collapse_shape` op This commit folds a `tensor.cast` op into a `tensor.collapse_shape` op when following two conditions meet: 1. the `tensor.collapse_shape` op consumes result of the `tensor.cast` op. 2. `tensor.cast` op casts to a more dynamic version of the source tensor. This is added as a canonicalization pattern in `tensor.collapse_shape` op. Signed-Off-By: Gaurav Shukla <gaurav@nod-labs.com> Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D130650	2022-07-28 13:11:43 +05:30
Hui Xie	8a61749f76	[libc++][ranges] implement `std::ranges::inplace_merge` Differential Revision: https://reviews.llvm.org/D130627	2022-07-28 08:37:48 +01:00
Fangrui Song	1dc26b80b8	[Driver][PowerPC] Support -mtune= Reviewed By: #powerpc, nemanjai Differential Revision: https://reviews.llvm.org/D130526	2022-07-28 00:34:04 -07:00
Max Kazantsev	8e9e27ae90	[Test] Fix block name in test	2022-07-28 13:42:14 +07:00
Max Kazantsev	2d1c6e0b44	[LAA] Remove block order sensitivity in LAA algorithm. PR56672 As test in PR56672 shows, LAA produces different results which lead to either positive or negative vectorization decisions depending on the order of blocks in loop. The exact reason of this is not clear to me, however this makes investigation of related bugs extremely complex. Current order of blocks in the loop is arbitrary. It may change, for example, if loop info analysis is dropped and recomputed. Seems that it interferes with LAA's logic. This patch chooses fixed traversal order of blocks in loops, making it RPOT. Note: this is not a fix for bug with incorrect analysis result. It just makes the answer more robust to make the investigation easier. Differential Revision: https://reviews.llvm.org/D130482 Reviewed By: aeubanks, fhahn	2022-07-28 13:36:56 +07:00
Tom Stellard	d9e02a30b1	workflows: Use macos-11 runners macos-10.15 is deprecated and will be removed.	2022-07-27 23:25:58 -07:00
Christian Sigg	f983bdbdae	[MLIR] Fix bazel build after `7356404ace`.	2022-07-28 08:14:18 +02:00
Argyrios Kyrtzidis	a9ae2f2764	[ASTWriter] Replace `const std::string &OutputFile` with `StringRef OutputFile` in some of `ASTWriter` functions, NFC This is to make it consistent with LLVM's string parameter passing convention.	2022-07-27 23:02:33 -07:00
Phoebe Wang	726d9f8e8c	[X86][MC] Avoid emitting incorrect warning for complex FMUL We will insert a new operand which is identical to the Dest for complex FMUL with a mask. https://godbolt.org/z/eTEdnYv3q Complex FMA and FMUL with maskz don't have this problem. Reviewed By: LuoYuanke, skan Differential Revision: https://reviews.llvm.org/D130638	2022-07-28 13:58:34 +08:00
Austin Kerbow	ba0d079c7a	[AMDGPU] Aggressively schedule to reduce RP in occupancy limited regions By not clustering loads and adjusting heuristics to more aggressively reduce register pressure we may be able to increase occupancy for the function if it was dropped in a first pass scheduling. Similarly, try to reduce spilling if register usage exceeds lower bound occupancy. Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D130329	2022-07-27 22:34:37 -07:00
Amara Emerson	93e3aeb9a8	[AArch64][GlobalISel] Fix custom legalization of rotates using sext for shift vs zext. Rotates are defined according to DAG documentation as having unsigned shifts, so we need to zero-extend instead of sign-extend here. Fixes issue 56664	2022-07-27 22:10:42 -07:00
Amara Emerson	c16fa781f4	GlobalISel: update legalize-rotr-rotl.mir checks before change.	2022-07-27 22:10:04 -07:00
Sridhar Gopinath	f9a2f6b6ae	[clang-format] Fix the return code of git-clang-format In diff and diffstat modes, the return code is != 0 even when there are no changes between commits. This issue can be fixed by passing --exit-code to git-diff command that returns 0 when there are no changes and using that as the return code for git-clang-format. Fixes #56736. Differential Revision: https://reviews.llvm.org/D129311	2022-07-27 21:01:24 -07:00
Utkarsh Saxena	df537bef63	Use pseudoparser-based folding ranges in ClangdServer. Differential Revision: https://reviews.llvm.org/D130011	2022-07-28 05:43:17 +02:00
Chuanqi Xu	fe1887da36	[NFC] [C++20] [Modules] Add tests for merging redefinitions in modules Add tests for detecting redefinitions in C++20 modules. Some of these may be covered by other tests. But more tests should be always good.	2022-07-28 11:32:47 +08:00
Tom Stellard	b1dace63b1	workflows: Use correct access token when pushing to llvm-project-release-prs repo The checkout action will hard-code the default github actions token in the git config so that all pushes use it. We need to set persist-credentials=false so we can use a token that has permission to push to the llvm-project-release-prs repo.	2022-07-27 20:14:54 -07:00
Carl Ritson	dbda30e294	[AMDGPU][SIFoldOperands] Clear kills when folding COPY Clear all kill flags on source register when folding a COPY. This is necessary because the kills may now be out of order with the uses. Reviewed By: foad Differential Revision: https://reviews.llvm.org/D130622	2022-07-28 11:57:55 +09:00
Chris Bieneman	76e951e803	[Docs] Fix column ordering on clang attribute docs This patch just adjusts the ordering of the headings on the attribute docs to match the order of the column content.	2022-07-27 21:36:43 -05:00
Stella Laurenzo	7356404ace	[mlir] Delete most of the ops from the quant dialect. * https://discourse.llvm.org/t/rfc-removing-the-quant-dialect/3643/8 * Removes most ops. Leaves casts given final comment (can remove more in a followup). * There are a few uses in Tosa keeping some of the utilities alive. In a followup, I will probably elect to just move simplified versions of them into Tosa itself vs having this quasi-library dependency. Differential Revision: https://reviews.llvm.org/D120204	2022-07-27 17:50:42 -07:00
David Blaikie	4bb192b846	DebugInfo: Test vtable homing overriding ctor homing only on itanium since msvc ABI doesn't home vtables	2022-07-28 00:45:00 +00:00
Craig Topper	a304d70ee9	[RISCV] Reorder (and/or/xor (shl X, C1), C2) if we can form ANDI/ORI/XORI. InstCombine and DAGCombine prefer to keep shl before binops. This patch teaches isel to convert to (shl (and/or/xor X, C1 >> C2), C2) if (C1 >> C2) is a simm12. The idea was taken from X86's isel code. There's a special case implemented for a sext_inreg between the shift and the binop. Differential Revision: https://reviews.llvm.org/D130610	2022-07-27 17:35:26 -07:00
Craig Topper	8d87f71e54	[RISCV] Pre-commit tests for D130610. NFC	2022-07-27 17:35:17 -07:00
Craig Topper	1d1d8d6025	[RISCV] Reorder code in lowerFROUND to make the diff in D130659 cleaner. NFC	2022-07-27 17:13:04 -07:00
David Blaikie	4e719e0f16	DebugInfo: Prefer vtable homing over ctor homing. Vtables will be emitted in fewer places than ctors (every ctor references the vtable, so at worst it's the same places - but at best the type has a non-inline key function and the vtable is emitted in one place) Pulling this fix out of `517bbc64db` which was reverted in `4821508d4d`	2022-07-28 00:07:35 +00:00
Amaury Séchet	06da353748	[NFC] Automatically generate CodeGen/VE/Scalar/atomic.ll	2022-07-27 23:52:00 +00:00
Lei Zhang	067daa56a9	[mlir][spirv] Unify resources of different vector sizes This commit extends UnifyAliasedResourcePass to handle the case where aliased resources have different vector sizes. (It still requires all scalar types to be of the same bitwidth.) This is effectively reusing the code for handling different-bitwidth scalar types. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D130671	2022-07-27 19:22:50 -04:00
Lei Zhang	7668e58210	[mlir][spirv] Fix spv.CompositeConstruct assembly and validation This commit fixes spv.CompositeConstruct to assembly to list operand types to enable vector construction out of smaller vectors. Validation is also fixed to properly check the cases for vector construction. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D130669	2022-07-27 19:17:23 -04:00
Matt Arsenault	bfdca1535c	RegAllocGreedy: Fix nondeterminism in tryLastChanceRecoloring tryLastChanceRecoloring iterates over the set of LiveInterval pointers and used that to seed the recoloring stack, which was nondeterministic. Fixes a future test failing about 20% of the time. This just takes the order the interfering vreg was encountered. Not sure if we should try to order this more intelligently.	2022-07-27 19:02:06 -04:00
Shafik Yaghmour	28cd7f86ed	Revert "[Clang] Diagnose ill-formed constant expression when setting a non fixed enum to a value outside the range of the enumeration values" This reverts commit `a3710589f2`.	2022-07-27 15:31:41 -07:00
Jonas Devlieghere	ecda408178	[lldb] Read from the Rosetta shared cache with Xcode 14 Xcode 14 no longer puts the Rosetta expanded shared cache in a directory named "16.0". Instead, it includes the real version number (e.g. 13.0), the build string and the architecture, similar to the device support directory names for iOS, tvOS and watchOS. Currently, when there are multiple directories, we might end up picking the wrong one in GetSDKDirectoryForCurrentOSVersion. The problem is that without the build string we have no way to differentiate between multiple directories with the same version number. This patch fixes the problem by using GetOSBuildString which, as the name implies, returns the build string if known. This also adds a test for Rosetta debugging on Apple Silicon. Depending on whether the Rosetta expanded shared cache is present, the test ensures that there is or isn't a diagnostic about reading out of memory. rdar://97576121 Differential revision: https://reviews.llvm.org/D130540	2022-07-27 15:26:46 -07:00
Craig Topper	98647330bf	[RISCV] Add merge operand to RISCVISD::FCOPYSIGN_VL. Similar to what was done for VRGATHER*_VL recently. This will be used in D130659.	2022-07-27 15:25:34 -07:00
Jim Ingham	27893ff1ad	Call WatchpointList::RemoveAll in Target::Destroy. I noticed that the test TestSetWatchpoint.py was failing every so often on macOS. The failure was in the last assert, that after destroying the SBTarget containing it, the SBWatchpoint was still saying it was valid. IsValid in this case just meant the watchpoint weak pointer could be turned into a shared pointer. The watchpoint shared pointers have two strong references in general, one to the "Target::m_last_created_watchpoint", and one in the Target::m_watchpoint_list. Target::Destroy reset the last created watchpoint but neglected to call RemoveAll on the watchpoint list (it does the analogous work for the internal & external breakpoint lists...) This patch does the equivalent cleanup for the watchpoint list.	2022-07-27 15:15:05 -07:00
Shafik Yaghmour	a3710589f2	[Clang] Diagnose ill-formed constant expression when setting a non fixed enum to a value outside the range of the enumeration values DR2338 clarified that it was undefined behavior to set the value outside the range of the enumerations values for an enum without a fixed underlying type. We should diagnose this with a constant expression context. Differential Revision: https://reviews.llvm.org/D130058	2022-07-27 14:59:35 -07:00
LLVM GN Syncbot	a35596675b	[gn build] Port `6047deb7c2`	2022-07-27 21:44:47 +00:00
bixia1	66088afbc8	[mlir][sparse] Add arith-expand pass to the sparse-compiler pipeline. Modify an existing test to test the situation. Reviewed By: Peiming Differential Revision: https://reviews.llvm.org/D130658	2022-07-27 14:42:21 -07:00
Paul Kirth	6e9bab71b6	Revert "[llvm][NFC] Refactor code to use ProfDataUtils" This reverts commit `300c9a7881`. We will reland once these issues are ironed out.	2022-07-27 21:38:11 +00:00

... 2 3 4 5 6 ...

431492 Commits All Branches Search

431492 Commits

All Branches