llvm-project

Commit Graph

Author	SHA1	Message	Date
David Green	3f90df22f1	[ARM] MVE reverse shuffles. The vectorizer can sometimes make reverse shuffles from indices that count down. In MVE, we don't have a 128bit rev instruction, but we can select this to a VREV64 with some lane movs to swap the two halfs. Ideally this would use VMOVD's, but only gets as far as VMOVS's at the moment. Differential Revision: https://reviews.llvm.org/D69510	2021-09-20 13:48:01 +01:00
Alex Richardson	817e23d481	[update_mir_test_checks.py] Use -NEXT FileCheck directories Previously the script emitted output using plain CHECK directives. This can result in a test passing even if there are some instructions between CHECK directives that should have been removed. It also makes debugging tests that have the output in a different order more difficult since FileCheck can match with a later line and then complain about the "wrong" directive not being found. This will cause quite large diffs when updating existing tests, but I'm not sure we need an opt-in flag here. Depends on D109765 (pre-commit tests) Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D109767	2021-09-20 12:55:56 +01:00
Alex Richardson	7b68c0725d	pre-commit test for D109767 Differential Revision: https://reviews.llvm.org/D109765	2021-09-20 12:55:56 +01:00
Alex Richardson	6d7b3d6b3a	Fix CLANG_ENABLE_STATIC_ANALYZER=OFF building all analyzer source Since https://reviews.llvm.org/D87118, the StaticAnalyzer directory is added unconditionally. In theory this should not cause the static analyzer sources to be built unless they are referenced by another target. However, the clang-cpp target (defined in clang/tools/clang-shlib) uses the CLANG_STATIC_LIBS global property to determine which libraries need to be included. To solve this issue, this patch avoids adding libraries to that property if EXCLUDE_FROM_ALL is set. In case something like this comes up again: `cmake --graphviz=targets.dot` is quite useful to see why a target is included as part of `ninja all`. Reviewed By: thakis Differential Revision: https://reviews.llvm.org/D109611	2021-09-20 12:55:56 +01:00
Simon Pilgrim	7fc12b822c	MachOObjectFile - checkOverlappingElement - use const-ref to avoid unnecessary copies. NFCI. Reported by MSVC static analyzer.	2021-09-20 12:53:18 +01:00
Simon Pilgrim	4ab7c0d3fa	[X86] X86TargetTransformInfo - remove unnecessary if-else after early exit. NFCI. (style) Break the if-else chain as they all return.	2021-09-20 12:53:17 +01:00
Simon Pilgrim	ea17b15f2d	[MCA] InstructionTables::execute() - use const-ref iterator in for-range loop. NFCI. Avoid unnecessary copies, reported by MSVC static analyzer.	2021-09-20 12:53:17 +01:00
Valentin Clement	d6929aaa67	[mlir][openacc] Make use of the second counter extension in DataOp translation Make use of runtime extension for the second reference counter used in structured data region. This extension is implemented in D106510 and D106509. Differential Revision: https://reviews.llvm.org/D106517	2021-09-20 13:43:50 +02:00
Michał Górny	b1099120ff	[lldb] [gdb-remote] Always send PID when detaching w/ multiprocess Always send PID in the detach packet when multiprocess extensions are enabled. This is required by qemu's GDB server, as plain 'D' packet results in an error and the emulated system is not resumed. Differential Revision: https://reviews.llvm.org/D110033	2021-09-20 13:29:07 +02:00
Petar Avramovic	e4c46ddd91	[GlobalISel] Improve elimination of dead instructions in legalizer Add eraseInstr(s) utility functions. Before deleting an instruction collects its use instructions. After deletion deletes use instructions that became trivially dead. This patch clears all dead instructions in existing legalizer mir tests. Differential Revision: https://reviews.llvm.org/D109154	2021-09-20 13:00:58 +02:00
Bjorn Pettersson	c8cb7f611f	[NewPM] Make InlinerPass (aka 'inline') a parameterized pass In default pipelines the ModuleInlinerWrapperPass is adding the InlinerPass to the pipeline twice, once due to MandatoryFirst (passing true in the ctor) and then a second time with false as argument. To make it possible to bisect and reduce opt test cases for this part of the pipeline we need to be able to choose between the two different variants of the InlinerPass when running opt. This patch is changing 'inline' to a CGSCC_PASS_WITH_PARAMS in the PassRegistry, making it possible run opt with both -passes=cgscc(inline) and -passes=cgscc(inline<only-mandatory>). Reviewed By: aeubanks, mtrofin Differential Revision: https://reviews.llvm.org/D109877	2021-09-20 12:52:52 +02:00
Andy Wingo	eb3af1e773	[clang][NFC] Remove dead code Remove code that has no effect in SemaType.cpp:processTypeAttrs. Differential Revision: https://reviews.llvm.org/D108360	2021-09-20 11:33:34 +02:00
Alexey Bader	15feaaa359	Add myself as a code owner for SYCL support	2021-09-20 09:32:25 +03:00
Justas Janickas	ca3bebd844	[OpenCL] Supports optional writing to 3d images in C++ for OpenCL 2021 Adds support for a feature macro __opencl_c_3d_image_writes in C++ for OpenCL 2021 enabling a respective optional core feature from OpenCL 3.0. This change aims to achieve compatibility between C++ for OpenCL 2021 and OpenCL 3.0. Differential Revision: https://reviews.llvm.org/D109328	2021-09-20 10:07:38 +01:00
Tim Northover	13aa102e07	AArch64: use ldp/stp for 128-bit atomic load/store in v.84 onwards v8.4 says that normal loads/stores of 128-bytes are single-copy atomic if they're properly aligned (which all LLVM atomics are) so we no longer need to do a full RMW operation to guarantee we got a clean read.	2021-09-20 09:50:11 +01:00
Vladislav Vinogradov	798e4bfbed	[mlir] Fix integration tests failures introduced in D108505	2021-09-20 11:48:24 +03:00
David Spickett	92c9b28347	Revert "[AArch64][SVE] Teach cost model that masked loads/stores are cheap" This reverts commit `734708e04f`. Due to build failures on the 2 stage SVE VLS bot. https://lab.llvm.org/buildbot/#/builders/176/builds/908/steps/11/logs/stdio	2021-09-20 08:45:18 +00:00
Michał Górny	f6e0edc23e	[lldb] [gdb-remote] Recognize aarch64v type from gdbserver Differential Revision: https://reviews.llvm.org/D109899	2021-09-20 10:41:38 +02:00
Michał Górny	92904cc68f	[lldb] [gdb-remote] Remove unused arg from GDBRemoteRegisterContext::ReadRegisterBytes() Differential Revision: https://reviews.llvm.org/D110020	2021-09-20 10:24:01 +02:00
KareemErgawy-TomTom	bdcf4b9b96	[MLIR][Linalg] Make detensoring cost-model more flexible. So far, the CF cost-model for detensoring was limited to discovering pure CF structures. This means, if while discovering the CF component, the cost-model found any op that is not detensorable, it gives up on detensoring altogether. This patch makes it a bit more flexible by cleaning-up the detensorable component from non-detensorable ops without giving up entirely. Reviewed By: silvas Differential Revision: https://reviews.llvm.org/D109965	2021-09-20 10:21:31 +02:00
Florian Hahn	7f6a4826ac	[CaptureTracking] Allow passing LI to PointerMayBeCapturedBefore (NFC). isPotentiallyReachable can use LoopInfo to return earlier. This patch allows passing an optional LI to PointerMayBeCapturedBefore. Used in D109844. Reviewed By: nikic, asbirlea Differential Revision: https://reviews.llvm.org/D109978	2021-09-20 09:07:34 +01:00
Fangrui Song	4b80f0125a	[CMake] Add debuginfo-tests to LLVM_ALL_PROJECTS after D110016	2021-09-20 00:56:28 -07:00
Arjun P	76cb876563	[MLIR] Simplex::appendVariable: early return if count == 0	2021-09-20 13:16:56 +05:30
Vladislav Vinogradov	ec03bbe8a7	[mlir] Fix bug in partial dialect conversion The discussion on forum: https://llvm.discourse.group/t/bug-in-partial-dialect-conversion/4115 The `applyPartialConversion` didn't handle the operations, that were marked as illegal inside dynamic legality callback. Instead of reporting error, if such operation was not converted to legal set, the method just added it to `unconvertedSet` in the same way as unknown operations. This patch fixes that and handle dynamically illegal operations as well. The patch includes 2 fixes for existing passes: * `tensor-bufferize` - explicitly mark `std.return` as legal. * `convert-parallel-loops-to-gpu` - ugly fix with marking visited operations to avoid recursive legality checks. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D108505	2021-09-20 10:39:10 +03:00
Vladislav Vinogradov	9a2255dfa0	[mlir][NFC] Add explicit "::mlir" namespace to tblgen generated code Reviewed By: lattner, ftynse Differential Revision: https://reviews.llvm.org/D109223	2021-09-20 10:37:50 +03:00
Pavel Labath	966922320f	[lldb] Remove two #ifndef linux from Platform.cpp These have been here since r215992, guarding the calls to HostInfo, but their purpose unclear -- HostInfoLinux provides these functions and they work fine.	2021-09-20 08:30:02 +02:00
Siva Chandra Reddy	f5b8f1247c	[libc][obvious] Add inttypes.h and stdlib.h as deps to *div functions.	2021-09-20 05:54:32 +00:00
Siva Chandra Reddy	5252aa2981	[libc][obvious] Make abs and div functions buildable in default mode.	2021-09-20 05:44:49 +00:00
Max Kazantsev	e9d34c5429	[NFC] Add assert and test showing that revert of D109596 wasn't justified All transforms of IndVars have prerequisite requirement of LCSSA and LoopSimplify form and rely on it. Added test that shows that this actually stands.	2021-09-20 12:01:12 +07:00
Max Kazantsev	471217cff8	Revert "Revert "[IndVars] Replace PHIs if loop exits on 1st iteration"" This reverts commit `6fec6552f5`. The patch was reverted on incorrect claim that this patch may break LCSSA form when the loop is not in a simplify form. All IndVars' transform insure that the loop is in simplify and LCSSA form, so if it wasn't broken before this transform, it will also not be broken after it.	2021-09-20 12:01:10 +07:00
Siva Chandra Reddy	74670e79b0	[libc] Add implementations of div, ldiv, lldiv and imaxdiv. Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D109952	2021-09-20 04:51:42 +00:00
Max Kazantsev	def15c5fb6	[SCEV] Support negative values in signed/unsigned predicate reasoning There is a piece of logic that uses the fact that signed and unsigned versions of the same predicate are equivalent when both values are non-negative. It's also true when both of them are negative. Differential Revision: https://reviews.llvm.org/D109957 Reviewed By: nikic	2021-09-20 11:26:33 +07:00
David Blaikie	cb42bb3550	llvm-dwarfdump: pretty type printing: print fully qualified names in function type parameter types	2021-09-19 18:49:15 -07:00
David Blaikie	606ea0dd2a	llvm-dwarfdump: support for type printing "decltype(nullptr)" as "nullptr_t" This should probably be rendered as "std::nullptr_t" but for now clang uses the unqualified name (which is ambiguous with possible user defined name in the global namespace), so match that here.	2021-09-19 17:33:56 -07:00
David Blaikie	11e0b79b05	llvm-dwarfdump: Don't print even an empty string when a type is unprintable	2021-09-19 17:03:10 -07:00
David Blaikie	5bfe5207ef	llvm-dwarfdump: Pretty print names qualified/with scopes	2021-09-19 16:36:01 -07:00
Simon Pilgrim	0e89ff8195	[X86] SimplifyDemandedBits - only narrow a broadcast source if we only have one use. Helps with the regression noted on D109065 - don't truncate a broadcast source if the source has multiple uses.	2021-09-19 22:53:30 +01:00
Kazu Hirata	84b07c9b3a	[llvm] Use pop_back_val (NFC)	2021-09-19 13:44:23 -07:00
Chris Jackson	5ba8020326	[DebugInfo][LSR] Emit shorter expressions from scev-based salvaging The scev-based salvaging for LSR can sometimes produce unnecessarily verbose expressions. This patch adds logic to detect when the value to be recovered and the induction variable differ by only a constant offset. Then, the expression to derive the current iteration count can be omitted from the dbg.value in favour of the offset. Reviewed by: aprantl Differential Revision: https://reviews.llvm.org/D109044	2021-09-19 21:41:44 +01:00
David Blaikie	372e2c24b6	llvm-dwarfdump: Pretty printing types including a space between const and parenthesized references/pointers to arrays	2021-09-19 13:32:53 -07:00
Craig Topper	391fa371fd	[X86] Remove Commutable flag from mpsadbw intrinsics. Unlike psadbw, mpsadbw is not commutable because of how it operates on blocks. We already marked as not commutable for MachineIR, but had it commutable for the tablegened isel patterns. Fixes PR51908.	2021-09-19 13:22:22 -07:00
Craig Topper	2bde3dcd32	[X86] Add test cases for pr51908. NFC	2021-09-19 13:22:22 -07:00
David Blaikie	a51fb58c55	DWARFDie.cpp: Minor follow-up clang-format	2021-09-19 13:06:18 -07:00
David Blaikie	f09ca5c646	DWARFDie: Improve type printing for function and array types - with qualifiers (cv/reference) and pointers to them	2021-09-19 12:59:31 -07:00
Simon Pilgrim	f855ef2601	[X86][Atom] Fix FP uops + port usage Both ports are required in most cases. Update the uops counts + port usage based off the most recent llvm-exegesis captures (PR36895) and what Intel AoM / Agner / InstLatX64 reports as well. Noticed while trying to improve fp costs for vectorization via the D103695 helper script.	2021-09-19 20:39:20 +01:00
Simon Pilgrim	b7342e3137	[X86] Fold SHUFPS(shuffle(x),shuffle(y),mask) -> SHUFPS(x,y,mask') We can combine unary shuffles into either of SHUFPS's inputs and adjust the shuffle mask accordingly. Unlike general shuffle combining, we can be more aggressive and handle multiuse cases as we're not going to accidentally create additional shuffles.	2021-09-19 20:39:19 +01:00
David Blaikie	2ca637c976	llvm-dwarfdump: Refactor type pretty printing tests Move most type tests to a pre-generated assembly file to make it easier to add more weird cases without having to hand craft more DWARF. Move the novel array types that aren't reachable via clang-generated DWARF to a separate file for easy maintenance.	2021-09-19 09:30:38 -07:00
Simon Pilgrim	cf8fac7d07	[X86][Atom] Specific uops for all IMUL/IDIV instructions Based off a mixture of llvm-exegesis captures (PR36895) and Intel AoM / Agner / InstLatX64 reports.	2021-09-19 16:58:52 +01:00
Roman Lebedev	5f2fe48d06	[X86][TLI] SimplifyDemandedVectorEltsForTargetNode(): don't break apart broadcasts from which not just the 0'th elt is demanded Apparently this has no test coverage before D108382, but D108382 itself shows a few regressions that this fixes. It doesn't seem worthwhile breaking apart broadcasts, assuming we want the broadcasted value to be preset in several elements, not just the 0'th one. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D108411	2021-09-19 17:38:32 +03:00
Roman Lebedev	07f1d8f0ca	[X86] lowerShuffleAsDecomposedShuffleMerge(): if both inputs are broadcastable/identities, canonicalize broadcasts as such Split off from D108253. Broadcast is simpler than any other shuffle we might produce to do what we want to do here, so prefer it. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D108382	2021-09-19 17:35:37 +03:00

... 4 5 6 7 8 ...

399598 Commits All Branches Search

399598 Commits

All Branches