This requires us to override the isTargetCanonicalConstantNode callback introduced in D128144, so we can recognise the various cases where a VBROADCAST_LOAD constant is being reused at different vector widths to prevent infinite loops.
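A minimal sketch of the shape such an override can take, assuming the hook signature introduced in D128144 (the set of opcodes handled here is illustrative, not the actual patch):

bool X86TargetLowering::isTargetCanonicalConstantNode(SDValue Op) const {
  // Treat broadcast-loaded constants as already canonical so DAG combines
  // do not keep rebuilding them at other vector widths.
  return Op.getOpcode() == X86ISD::VBROADCAST_LOAD ||
         TargetLowering::isTargetCanonicalConstantNode(Op);
}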
This reverts commit 9ffeaaa0ea.
This fixes debugging large executables with lldb and gdb.
When StringTableBuilder is used, the offset for any string can point
anywhere in the string table, while previously all strings were
inserted in order (without deduplication or tail merging).
For symbols, there are no complications in encoding the string offset;
the offset is encoded as a raw 32 bit binary number in half of the
symbol name field.
For sections, the string table offset is written as
"/<decimaloffset>", but if the decimal offset would be larger than
7 digits, it's instead written as "//<base64offset>". Tools that
operate on object files can handle the base64 offset format, but
apparently neither lldb nor gdb expects that syntax when locating the
debug information section. Prior to the reverted commit, all long
section names were located at the start of the string table, so
their offset never exceeded the range for the decimal syntax.
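For illustration, a self-contained sketch of the two encodings (a hypothetical helper, not lld's actual code):

#include <cstdint>
#include <string>

// Encode a string table offset into the 8-byte COFF section name field.
// Sketch only: offsets beyond 64^6-1 are not representable.
std::string encodeLongSectionName(uint64_t Offset) {
  if (Offset < 10000000) // '/' followed by at most 7 decimal digits
    return "/" + std::to_string(Offset);
  // GNU-style extension: "//" followed by 6 base64 digits,
  // most significant digit first.
  static const char Base64[] =
      "ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz0123456789+/";
  std::string Name = "//......";
  for (int I = 7; I >= 2; --I) {
    Name[I] = Base64[Offset % 64];
    Offset /= 64;
  }
  return Name;
}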
Just reverting this change for now, as the actual benefit from it
was fairly modest.
Longer term, lld could write all long section names unoptimized
at the start of the string table, followed by all the strings for
symbol names, with deduplication and tail merging. And lldb and
gdb could be fixed to handle sections with the base64 offset syntax.
This fixes https://github.com/mstorsjo/llvm-mingw/issues/289.
There was a copy-paste mistake in the embedded link:
`clang-tidy/checks/cppcoreguidelines-virtual-class-destructor`
->
`clang-tidy/checks/cppcoreguidelines/virtual-class-destructor`
Sphinx error:
/home/zbebnal/git/llvm-project/clang-tools-extra/docs/ReleaseNotes.rst:168:unknown document: clang-tidy/checks/cppcoreguidelines-virtual-class-destructor
Build bot: https://lab.llvm.org/buildbot#builders/115/builds/29805
Differential Revision: https://reviews.llvm.org/D126891
The `cppcoreguidelines-virtual-class-destructor` check is supposed to enforce
http://isocpp.github.io/CppCoreGuidelines/CppCoreGuidelines#c35-a-base-class-destructor-should-be-either-public-and-virtual-or-protected-and-non-virtual
Quote:
> A **base** class destructor should be either public and virtual, or
> protected and non-virtual
[emphasis mine]
However, the check still triggers in the following case:
class MostDerived final : public Base {
public:
MostDerived() = default;
~MostDerived() = default;
void func() final;
};
The `MostDerived` class is marked `final`, so it should not be
considered a **base** class. Consequently, the rule is satisfied, yet
the check still flags this code.
In this patch, I'm proposing to ignore `final` classes since they cannot
be **base** classes.
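For reference, deriving from a `final` class is ill-formed, so such a class can never act as a base:

class EvenMoreDerived : public MostDerived {}; // error: 'MostDerived' is marked 'final'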
Reviewed By: whisperity
Differential Revision: https://reviews.llvm.org/D126891
As a followup to D128144, this adds extract(DUP(C)) as a canonical
constant to prevent it being transformed back into a BUILD_VECTOR,
leading to an infinite loop.
This revision adds the necessary plumbing for canonicalizing scf::ForeachThread with the
`AffineOpSCFCanonicalizationPattern`.
In the process, the `loopMatcher` helper is updated to take OpFoldResult instead of just values.
This allows composing various scenarios without the need for an artificial builder.
Differential Revision: https://reviews.llvm.org/D128244
This updates StdLibraryFunctionsChecker to set the state of 'errno'
by using the new errno_modeling functionality.
The errno value is set in the PostCall callback. Setting it in call::Eval
did not work for some reason, and it would have required every function
to be EvalCallAsPure, which may be bad to do. Now the errno value and
state are not allowed to be checked in any PostCall checker callback,
because it is unspecified whether errno was set already or will be set
later by this checker.
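For context, this is the kind of standard C library usage the errno modeling concerns (plain user code, not checker code):

#include <cerrno>
#include <cstdlib>

long parseDecimal(const char *Str) {
  errno = 0;
  long Value = std::strtol(Str, nullptr, 10);
  if (errno == ERANGE) // the checker models whether errno is meaningful here
    return 0;
  return Value;
}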
Reviewed By: martong, steakhal
Differential Revision: https://reviews.llvm.org/D125400
This patch adds the omp.taskgroup operation according to OpenMP 5.0
section 2.17.6, along with tests for it.
Reviewed By: kiranchandramohan, peixin
Differential Revision: https://reviews.llvm.org/D127250
To accommodate the macOS universal configuration, include the assembly
files and `blake3_neon.c` without a CMake check, and instead guard their
source with architecture "#ifdef" checks.
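A minimal sketch of that guard pattern (the function name here is hypothetical):

#if defined(__aarch64__) || defined(_M_ARM64)
// Compiled only into the arm64 slice of a universal binary; for other
// architectures this translation unit is effectively empty, so the file
// can be listed in CMake unconditionally.
void blake3_neon_example(void) {}
#endif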
Differential Revision: https://reviews.llvm.org/D128132
The failure that caused the previous revert has been fixed
by https://reviews.llvm.org/D126048
Original commit message:
RVV makes heavy use of subregisters due to LMUL>1 and segment
load/store tuples. Enabling subregister liveness tracking improves the quality
of the register allocation.
I've added a command-line option that can be used to turn it off if it
causes compile-time or functional issues. I used the option to keep the
old behavior for one interesting test case that was testing register
allocation.
Reviewed By: kito-cheng
Differential Revision: https://reviews.llvm.org/D128016
There is no instruction to fold NZCV, so just do not do it. Without
the fix, the added test case crashes with the assert
"Mismatched register size in non subreg COPY".
Reviewed By: danilaml
Subscribers: llvm-commits
Differential Revision: https://reviews.llvm.org/D127294
This patch implements a new way to generate CTR loops. The intrinsics
inserted by the hardware loop pass are now mapped to pseudo
instructions, and these pseudo instructions are expanded to a CTR
loop or a normal compare+branch loop in this post-ISEL pass.
Reviewed By: lkail
Differential Revision: https://reviews.llvm.org/D122125
Use it in place of VSELECT_VL+VRGATHER*_VL.
This simplifies the isel patterns.
Overall, I think trying to match select+op to create masked instructions
in isel doesn't scale. We either need to do it in DAG combine, pre-isel
peephole, or post-isel peephole. I don't yet know which is the right
answer, but for this case it seemed best to be able to request the
masked form directly from lowering.
Reviewed By: frasercrmck
Differential Revision: https://reviews.llvm.org/D128023
For the case below, the virtual register is defined twice in the self loop. We
don't need to spill %0 after the third instruction `%0 = def (tied %0)`,
because it is defined in the second instruction `%0 = def`.
1 bb.1
2 %0 = def
3 %0 = def (tied %0)
4 ...
5 jmp bb.1
Reviewed By: MatzeB
Differential Revision: https://reviews.llvm.org/D125079
This fixes a useless FileCheck pattern and a wrong comment in
always-inline.ll. Testing was done using ninja check-llvm and
llvm-lit always-inline.ll --show-all.
Reviewed By: modimo, hoy
Differential Revision: https://reviews.llvm.org/D127815
Microsoft does not seem to document the flag. Ignoring it for now is probably
better than getting an unknown flag error.
Reviewed By: thakis
Differential Revision: https://reviews.llvm.org/D128231
This patch implements the VSCode DAP logpoints feature (also called
tracepoints in other debuggers, e.g. the Visual Studio debugger).
This provides a convenient way for users to do printf-style logging
debugging without pausing the debuggee.
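In DAP terms, a logpoint is an ordinary source breakpoint carrying a
logMessage, with expressions in {} interpolated. A sketch of the relevant
part of a setBreakpoints request (paths and names are illustrative):

{
  "source": { "path": "/path/to/main.cpp" },
  "breakpoints": [
    { "line": 42, "logMessage": "counter = {counter}" }
  ]
}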
Differential Revision: https://reviews.llvm.org/D127702
The error used to look like this:
ld64.lld: error: undefined symbol: _foo
>>> referenced by /path/to/bar.o:(symbol _baz+0x4)
If DWARF line information is available, we now show where in the source
the references are coming from:
ld64.lld: error: undefined symbol: _foo
>>> referenced by: bar.cpp:42 (/path/to/bar.cpp:42)
>>> /path/to/bar.o:(symbol _baz+0x4)
Differential Revision: https://reviews.llvm.org/D128184
Add missing intrinsics and tests for them. Macros expanding
_vel_pack_f32p to __builtin_ve_vl_pack_f32p, and others like it, are
already defined in clang/lib/Headers/velintrin.h.
Reviewed By: efocht
Differential Revision: https://reviews.llvm.org/D128120
The instructions that generate the source of a dual-source blend export
should run in strict-wqm. That is, if any lane in a quad is active,
we need to enable all four lanes of that quad so that the shuffling
operation performed before exporting to the dual-source blend target
works correctly.
Differential Revision: https://reviews.llvm.org/D127981
LDS_PARAM_LOAD and LDS_DIRECT_LOAD use EXEC per quad
(if any pixel is enabled in the quad, data is written
to all 4 pixels/threads in the quad).
Tag LDS_PARAM_LOAD and LDS_DIRECT_LOAD as using strict_wqm
to enforce this and avoid lane clobbering issues.
Note that only the instruction itself is tagged. The implicit uses of
these instructions do not need to be in WQM. This reduces unnecessary
WQM calculation of M0.
Differential Revision: https://reviews.llvm.org/D127977