llvm-project

Commit Graph

Author	SHA1	Message	Date
Tobias Gysi	0fb364a97e	[mlir][linalg] Remove IndexedGenericOp support from LinalgToStandard... after introducing the IndexedGenericOp to GenericOp canonicalization (https://reviews.llvm.org/D101612). Differential Revision: https://reviews.llvm.org/D102236	2021-05-12 11:56:07 +00:00
Kristina Bessonova	96100f1508	[libcxx] NFC. Correct wordings of _LIBCPP_ASSERT debug messages Differential Revision: https://reviews.llvm.org/D102195	2021-05-12 13:49:57 +02:00
Simon Pilgrim	72e242a286	[X86][AVX] canonicalizeShuffleMaskWithHorizOp - improve support for 256/512-bit vectors Extend the HOP(HOP(X,Y),HOP(Z,W)) and SHUFFLE(HOP(X,Y),HOP(Z,W)) folds to handle repeating 256/512-bit vector cases. This allows us to drop the UNPACK(HOP(),HOP()) custom fold in combineTargetShuffle. This required isRepeatedTargetShuffleMask to be tweaked to support target shuffle masks taking more than 2 inputs.	2021-05-12 12:13:24 +01:00
gbreynoo	81900dc498	[llvm-readelf] Unhide short options to match the command guide The readelf command guide shows the short options used as aliases but these are not found in the help text unless --show-hidden is used, other tools show aliases with --help. This change fixes the help output to be consistent with the command guide. Differential Revision: https://reviews.llvm.org/D102173	2021-05-12 12:09:08 +01:00
gbreynoo	725bc3eb0d	[llvm-symbolizer] Place Mach-O options into the Mach-O option group. In the help output of other tools and in the symbolizer command guide, Mach-O specific options are in their own section. This change fixes the symbolizer help output to be consistent. Differential Revision: https://reviews.llvm.org/D102178	2021-05-12 12:04:54 +01:00
David Sherwood	b7a11274f9	[LoopVectorize] Fix scalarisation crash in widenPHIInstruction for scalable vectors In InnerLoopVectorizer::widenPHIInstruction there are cases where we have to scalarise a pointer induction variable after vectorisation. For scalable vectors we already deal with the case where the pointer induction variable is uniform, but we currently crash if not uniform. For fixed width vectors we calculate every lane of the scalarised pointer induction variable for a given VF, however this cannot work for scalable vectors. In this case I have added support for caching the whole vector value for each unrolled part so that we can always extract an arbitrary element. Additionally, we still continue to cache the known minimum number of lanes too in order to improve code quality by avoiding an extractelement operation. I have adapted an existing test `pointer_iv_mixed` from the file: Transforms/LoopVectorize/consecutive-ptr-uniforms.ll and added it here for scalable vectors instead: Transforms/LoopVectorize/AArch64/sve-widen-phi.ll Differential Revision: https://reviews.llvm.org/D101294	2021-05-12 11:02:11 +01:00
Peter Waller	6e6f9a636b	[AArch64][SVE] Improve sve.convert.to.svbool lowering The sve.convert.to.svbool lowering has the effect of widening a logical <M x i1> vector representing lanes into a physical <16 x i1> vector representing bits in a predicate register. In general, if converting to svbool, the contents of lanes in the physical register might not be known. For sve.convert.to.svbool the new lanes are specified to be zeroed, requiring 'and' instructions to mask off the new lanes. For lanes coming from a ptrue or a comparison, however, they are known to be zero. CodeGen Before: ptrue p0.s, vl16 ptrue p1.s ptrue p2.b and p0.b, p2/z, p0.b, p1.b ret After: ptrue p0.s, vl16 ret Differential Revision: https://reviews.llvm.org/D101544	2021-05-12 10:57:25 +01:00
Michał Górny	71e66da04c	[Process/elf-core] Read PID from FreeBSD prpsinfo Add a function to read NT_PRPSINFO note from FreeBSD core dumps. This is necessary to get the process ID (NT_PRSTATUS has only thread ID). Move the lp64 check from NT_PRSTATUS parsing to the parseFreeBSDNotes() to avoid repeating it. Differential Revision: https://reviews.llvm.org/D101893	2021-05-12 11:51:37 +02:00
Michał Górny	b6c0edb979	[lldb] [Process/elf-core] Fix reading FPRs from FreeBSD/i386 cores The FreeBSD coredumps from i386 systems contain only FSAVE-style NT_FPREGSET. Since we do not really support reading that kind of data anymore, just use NT_X86_XSTATE to get FXSAVE-style data when available. Differential Revision: https://reviews.llvm.org/D101086	2021-05-12 11:51:37 +02:00
Stephen Tozer	fdb055f4f1	Reapply "[DebugInfo] Fix updateDbgUsersToReg to support DBG_VALUE_LIST" Previous crashes caused by this patch were the result of machine subregisters being incorrectly handled in updateDbgUsersToReg; this has been fixed by using RegUnits to determine overlapping registers, instead of using the register values directly. Differential Revision: https://reviews.llvm.org/D101523 This reverts commit `7ca26c5fa2`.	2021-05-12 10:19:57 +01:00
Neal (nealsid)	5af3a6645f	Remove Windows editline from LLDB I don't mean to undo others' work but it looks like the hand-rolled EditLine for LLDB on Windows isn't used. It'd be easier to make changes to bring the other platforms' Editline wrapper up to date (e.g. simplifying char vs wchar_t) without modifying/testing this one too. Reviewed By: amccarth Differential Revision: https://reviews.llvm.org/D102208	2021-05-12 10:05:44 +01:00
Piotr Sobczak	68137ef568	[AMDGPU] Skip invariant loads when avoiding WAR conflicts No need to handle invariant loads when avoiding WAR conflicts, as there cannot be a vector store to the same memory location. Reviewed By: foad Differential Revision: https://reviews.llvm.org/D101177	2021-05-12 10:57:05 +02:00
Qiu Chaofan	cbd93cee9b	Revert "[PowerPC] [Clang] Enable float128 feature on VSX targets" This commit brought build break in some f128 related tests. But that's not the root cause. There exists some differences between Clang and GCC's definition for 128-bit float types on PPC, so macros/functions in glibc may not work with clang -mfloat128 well. We need to handle this carefully and reland it.	2021-05-12 16:51:52 +08:00
Tomas Matheson	34c098b780	[ARM] Prevent spilling between ldrex/strex pairs Based on the same for AArch64: `4751cadcca` At -O0, the fast register allocator may insert spills between the ldrex and strex instructions inserted by AtomicExpandPass when expanding atomicrmw instructions in LL/SC loops. To avoid this, expand to cmpxchg loops and therefore expand the cmpxchg pseudos after register allocation. Required a tweak to ARMExpandPseudo::ExpandCMP_SWAP to use the 4-byte encoding of UXT, since the pseudo instruction can be allocated a high register (R8-R15) which the 2-byte encoding doesn't support. However, the 4-byte encodings are not present for ARM v8-M Baseline. To enable this, two new pseudos are added for Thumb which are only valid for v8mbase, tCMP_SWAP_8 and tCMP_SWAP_16. The previously committed attempt in D101164 had to be reverted due to runtime failures in the test suites. Rather than spending time fixing that implementation (adding another implementation of atomic operations and more divergence between backends) I have chosen to follow the approach taken in D101163. Differential Revision: https://reviews.llvm.org/D101898 Depends on D101912	2021-05-12 09:43:21 +01:00
Tomas Matheson	edf9d88266	[ARM] Precommit test for D101898 Differential Revision: https://reviews.llvm.org/D101912	2021-05-12 09:43:21 +01:00
Alex Orlov	d8e65585f7	Fixed llvm-objcopy to add correct symbol table for ELF with program headers. This fixes the following bugs: https://bugs.llvm.org/show_bug.cgi?id=43935 Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D102258	2021-05-12 12:39:30 +04:00
Djordje Todorovic	44642505ce	[NFC][llvm-dwarfdump] Avoid passing std::string by value in collectStatsForDie()	2021-05-12 01:29:37 -07:00
Guillaume Chatelet	6351993da7	[libc] Simplifies multi implementations This is a roll forward of D101895 with two additional fixes: Original Patch description: > This is a follow up on D101524 which: > > - simplifies cpu features detection and usage, > - flattens target dependent optimizations so it's obvious which implementations are generated, > - provides an implementation targeting the host (march/mtune=native) for the mem* functions, > - makes sure all implementations are unittested (provided the host can run them). Additional fixes: - Fix uninitialized ALL_CPU_FEATURES - Use non pseudo microarch as it is only supported from Clang 12 on Differential Revision: https://reviews.llvm.org/D102233	2021-05-12 07:24:53 +00:00
Dmitry Vyukov	8aa7f28497	scudo: fix CheckFailed-related build breakage I was running: $ ninja check-sanitizer check-msan check-asan \ check-tsan check-lsan check-ubsan check-cfi \ check-profile check-memprof check-xray check-hwasan but missed check-scudo... Differential Revision: https://reviews.llvm.org/D102314	2021-05-12 09:10:34 +02:00
Ulysse Beaugnon	27b2bd7601	[MLIR] Enable conversion from llvm::SMLoc to mlir::Location with OpAsmParser. DialectAsmParser already allows converting an llvm::SMLoc location to a mlir::Location location. This commit adds the same functionality to OpAsmParser. Implementation is copied from DialectAsmParser. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D102165	2021-05-12 09:08:32 +02:00
Dumitru Potop	9a0ea5994b	[mlir] Support alignment in LLVM dialect GlobalOp First step in adding alignment as an attribute to MLIR global definitions. Alignment can be specified for global objects in LLVM IR. It can also be specified as a named attribute in the LLVMIR dialect of MLIR. However, this attribute has no standing and is discarded during translation from MLIR to LLVM IR. This patch does two things: First, it adds the attribute to the syntax of the llvm.mlir.global operation, and by doing this it also adds accessors and verifications. The syntax is "align=XX" (with XX being an integer), placed right after the value of the operation. Second, it allows transforming this operation to and from LLVM IR. It is checked whether the value is an integer power of 2. Reviewed By: ftynse, mehdi_amini Differential Revision: https://reviews.llvm.org/D101492	2021-05-12 09:07:20 +02:00
Dmitry Vyukov	1dc838717a	tsan: fix syscall test on aarch64 Add missing includes and use SYS_pipe2 instead of SYS_pipe as it's not present on some arches. Differential Revision: https://reviews.llvm.org/D102311	2021-05-12 09:00:51 +02:00
Martin Storsjö	382c505d9c	[COFF] Fix ARM and ARM64 REL32 relocations to be relative to the end of the relocation This matches how they are defined on X86. This should fix the relative lookup tables pass for COFF, allowing it to be reenabled. Differential Revision: https://reviews.llvm.org/D102217	2021-05-12 09:53:43 +03:00
Dmitry Vyukov	2721e27c3a	sanitizer_common: deduplicate CheckFailed We have some significant amount of duplication around CheckFailed functionality. Each sanitizer copy-pasted a chunk of code. Some got random improvements like dealing with recursive failures better. These improvements could benefit all sanitizers, but they don't. Deduplicate CheckFailed logic across sanitizers and let each sanitizer only print the current stack trace. I've tried to dedup stack printing as well, but this got me into cmake hell. So let's keep this part duplicated in each sanitizer for now. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D102221	2021-05-12 08:50:53 +02:00
Qiu Chaofan	febbe4b5a0	[PowerPC] [Clang] Enable float128 feature on VSX targets Reviewed By: nemanjai, steven.zhang Differential Revision: https://reviews.llvm.org/D92815	2021-05-12 14:33:41 +08:00
Kristina Bessonova	f8306647fa	[libcxx][test] Split more debug mode tests Split a few more debug mode tests missed in D100592. Differential Revision: https://reviews.llvm.org/D102194	2021-05-12 08:28:16 +02:00
Dmitry Vyukov	23596fece0	sanitizer_common: don't write into .rodata setlocale interceptor imitates a write into result, which may be located in .rodata section. This is the only interceptor that tries to do this and I think the intention was to initialize the range for msan. So do that instead. Writing into .rodata shouldn't happen (without crashing later on the actual write) and this traps on my local tsan experiments. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D102161	2021-05-12 07:54:06 +02:00
Vitaly Buka	85a96d82ca	[symbolizer] Fix leak after D96883	2021-05-11 22:51:36 -07:00
Dmitry Vyukov	53558ed8a0	sanitizer_common: fix SIG_DFL warning Currently we have: sanitizer_posix_libcdep.cpp:146:27: warning: cast between incompatible function types from ‘__sighandler_t’ {aka ‘void (*)(int)’} to ‘sa_sigaction_t’ 146 \| sigact.sa_sigaction = (sa_sigaction_t)SIG_DFL; We don't set SA_SIGINFO, so we need to assign to sa_handler. And SIG_DFL is meant for sa_handler, so this gets rid of both compiler warning, type cast and potential runtime misbehavior. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D102162	2021-05-12 07:23:20 +02:00
Dmitry Vyukov	8214764f35	tsan: declare annotations in test.h We already declare subset of annotations in test.h. But some are duplicated and declared in tests. Move all annotation declarations to test.h. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D102152	2021-05-12 07:22:39 +02:00
Qiu Chaofan	6d2df18163	[VectorComine] Restrict single-element-store index to inbounds constant Vector single element update optimization is landed in `2db4979`. But the scope needs restriction. This patch restricts the index to inbounds and vector must be fixed sized. In future, we may use value tracking to relax constant restrictions. Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D102146	2021-05-12 13:18:20 +08:00
Dmitry Vyukov	5dad3d1ba9	tsan: mark sigwait as blocking Add a test case reported in: https://github.com/google/sanitizers/issues/1401 and fix it. The code assumes sigwait will process other signals. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D102057	2021-05-12 06:56:18 +02:00
Dmitry Vyukov	04b2ada51c	tsan: add a simple syscall test Add a simple test that uses syscall annotations. Just to ensure at least basic functionality works. Also factor out annotated syscall wrappers into a separate header file as they may be useful for future tests. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D102223	2021-05-12 06:42:11 +02:00
Chia-hung Duan	f653313d4a	[mlir][AsmPrinter] Remove recursion while SSA naming Address the TODO of removing recursion while SSA naming. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D102226	2021-05-12 11:23:01 +08:00
Vitaly Buka	7d101e0f6a	[NFC][msan] Move setlocale test into sanitizer_common	2021-05-11 19:05:07 -07:00
Congzhe Cao	3f8be15f29	[LoopInterchange] Handle lcssa PHIs with multiple predecessors This is a bugfix in the transformation phase. If the original outer loop header branches to both the inner loop (header) and the outer loop latch, and if there is an lcssa PHI node outside the loop nest, then after interchange the new outer latch will have an lcssa PHI node inserted which has two predecessors, i.e., the original outer header and the original outer latch. Currently the transformation assumes it has only one predecessor (the original outer latch) and crashes, since the inserted lcssa PHI node does not take both predecessors as incoming BBs. Reviewed By: Whitney Differential Revision: https://reviews.llvm.org/D100792	2021-05-11 21:30:54 -04:00
Jim Ingham	10c309ad81	Removing test... Actually, I don't think this test is going to be stable enough to be worthwhile. Let me see if I can think of a better way to test this.	2021-05-11 18:27:37 -07:00
Matt Arsenault	cc79aaced0	AMDGPU: Fix SILoadStoreOptimizer for gfx90a This was hardcoding the register class to use for the newly created pointer registers, violating the aligned VGPR requirement.	2021-05-11 21:26:43 -04:00
Jim Ingham	0f2eb7e6e5	This test is failing on Linux, skip while I investigate. The gdb-remote tests are a bit artificial, depending on Python threading, and sleeps. So I'm not 100% surprised it doesn't work straight up on another XSsystem.	2021-05-11 18:13:56 -07:00
Sam Clegg	19cedd3cd3	[lld][WebAssembly] Fix for string merging + negative addends Don't include the relocation addend when calculating the virtual address of a symbol. Instead just pass the symbol's offset and add the addend afterwards. Without this fix we hit the `offset is outside the section` error in MergeInputSegment::getSegmentPiece. This fixes a real world error we were are seeing in emscripten. Differential Revision: https://reviews.llvm.org/D102271	2021-05-11 17:47:57 -07:00
Richard Smith	bb726383ac	Revert "Fix bad mangling of <data-member-prefix> for a closure in the initializer of a variable at global namespace scope." This reverts commit `697ac15a0f`, for which review was not complete. That change was accidentally pushed when an unrelated change was pushed.	2021-05-11 17:46:18 -07:00
Richard Smith	3978333b71	Add test for PR50039. I believe Clang's behavior is correct according to the standard here, but this is an unusual situation for which we had no test coverage, so I'm adding some.	2021-05-11 17:35:34 -07:00
Richard Smith	697ac15a0f	Fix bad mangling of <data-member-prefix> for a closure in the initializer of a variable at global namespace scope. This implements the direction proposed in https://github.com/itanium-cxx-abi/cxx-abi/pull/126. Differential Revision: https://reviews.llvm.org/D101968	2021-05-11 17:35:33 -07:00
Matt Arsenault	6f5ddf6731	GlobalISel: Don't hardcode varargs=false in resultsCompatible	2021-05-11 20:22:06 -04:00
Matt Arsenault	a15ed701ab	AMDGPU: Fix assert on constant load from addrspacecasted pointer This was trying to create a bitcast between different address spaces.	2021-05-11 20:12:20 -04:00
Matt Arsenault	6ecbdb761f	GlobalISel: Make constant fields const	2021-05-11 20:10:55 -04:00
Matt Arsenault	24e2e5df0e	GlobalISel: Split ValueHandler into assignment and emission classes Currently the ValueHandler handles both selecting the type and location for arguments, as well as inserting instructions needed to handle them. Split this so that the determination of the argument handling is independent of the function state. Currently the checks for tail call compatibility do not follow the full assignment logic, so it misses cases where arguments require nontrivial legalization. This should help avoid targets ending up in a buggy state where the argument evaluation may change in different contexts.	2021-05-11 19:50:12 -04:00
Matt Arsenault	2bdfcf0cac	GlobalISel: Move AArch64 AssignFnVarArg to base class We can handle the distinction easily enough in the generic code, and this makes it easier to abstract the selection of type/location from the code to insert code.	2021-05-11 19:50:12 -04:00
Jordan Rupprecht	fec2945998	Revert "[GVN] Clobber partially aliased loads." This reverts commit `6c57044231`. It causes assertion errors due to widening atomic loads, and potentially causes miscompile elsewhere too. Repro, also posted to D95543: ``` $ cat repro.ll ; ModuleID = 'repro.ll' source_filename = "repro.ll" target datalayout = "e-m:e-p270:32:32-p271:32:32-p272:64:64-i64:64-f80:128-n8:16:32:64-S128" target triple = "x86_64-unknown-linux-gnu" %struct.widget = type { i32 } %struct.baz = type { i32, %struct.snork } %struct.snork = type { %struct.spam } %struct.spam = type { i32, i32 } @global = external local_unnamed_addr global %struct.widget, align 4 @global.1 = external local_unnamed_addr global i8, align 1 @global.2 = external local_unnamed_addr global i32, align 4 define void @zot(%struct.baz* %arg) local_unnamed_addr align 2 { bb: %tmp = getelementptr inbounds %struct.baz, %struct.baz* %arg, i64 0, i32 1 %tmp1 = bitcast %struct.snork* %tmp to i64* %tmp2 = load i64, i64* %tmp1, align 4 %tmp3 = getelementptr inbounds %struct.baz, %struct.baz* %arg, i64 0, i32 1, i32 0, i32 1 %tmp4 = icmp ugt i64 %tmp2, 4294967295 br label %bb5 bb5: ; preds = %bb14, %bb %tmp6 = load i32, i32* %tmp3, align 4 %tmp7 = icmp ne i32 %tmp6, 0 %tmp8 = select i1 %tmp7, i1 %tmp4, i1 false %tmp9 = zext i1 %tmp8 to i8 store i8 %tmp9, i8* @global.1, align 1 %tmp10 = load i32, i32* @global.2, align 4 switch i32 %tmp10, label %bb11 [ i32 1, label %bb12 i32 2, label %bb12 ] bb11: ; preds = %bb5 br label %bb14 bb12: ; preds = %bb5, %bb5 %tmp13 = load atomic i32, i32* getelementptr inbounds (%struct.widget, %struct.widget* @global, i64 0, i32 0) acquire, align 4 br label %bb14 bb14: ; preds = %bb12, %bb11 br label %bb5 } $ opt -O2 repro.ll -disable-output opt: /home/rupprecht/src/llvm-project/llvm/lib/Transforms/Utils/VNCoercion.cpp:496: llvm::Value llvm::VNCoercion::getLoadValueForLoad(llvm::LoadInst , unsigned int, llvm::Type , llvm::Instruction , const llvm::DataLayout &): Assertion `SrcVal->isSimple() && "Cannot widen volatile/atomic load!"' failed. PLEASE submit a bug report to https://bugs.llvm.org/ and include the crash backtrace. Stack dump: 0. Program arguments: /home/rupprecht/dev/opt -O2 repro.ll -disable-output ... ```	2021-05-11 16:08:53 -07:00
Lang Hames	d63860a052	[JITLink] Fix bogus format string.	2021-05-11 16:04:00 -07:00

1 2 3 4 5 ...

388178 Commits All Branches Search

388178 Commits

All Branches