llvm-project

Commit Graph

Author	SHA1	Message	Date
Mogball	cb3aa49ec0	[MLIR][arith] fix references to std.constant in comments Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D111820	2021-10-14 20:38:47 +00:00
thomasraoux	afad0cdf31	[mlir][vector] Refactor linalg vectorization for reductions Emit reduction during op vectorization instead of doing it when creating the transfer write. This allow us to not broadcast output arguments for reduction initial value. Differential Revision: https://reviews.llvm.org/D111825	2021-10-14 13:37:56 -07:00
Philip Reames	8b31f07cdf	[tests] Add indvars tests showing missing transforms with small IVs This shows the transform side of D109457, but also lets us try other approaches to the same problem. The common trend to all is that we need to explicit reason about UB to disallow possibility of infinite loops.	2021-10-14 13:28:18 -07:00
David Green	e9e6266c70	[AArch64] Add extra tests for fptosisat vector variants	2021-10-14 21:26:24 +01:00
Roman Lebedev	3d7bf6625a	[X86][Costmodel] Improve cost modelling for not-fully-interleaved load While i've modelled most of the relevant tuples for AVX2, that only covered fully-interleaved groups. By definition, interleaving load of stride N means: load NVF elements, and shuffle them into N VF-sized vectors, with 0'th vector containing elements `[0, VF)stride + 0`, and 1'th vector containing elements `[0, VF)*stride + 1`. Example: https://godbolt.org/z/df561Me5E (i64 stride 4 vf 2 => cost 6) Now, not fully interleaved load, is when not all of these vectors is demanded. So at worst, we could just pretend that everything is demanded, and discard the non-demanded vectors. What this means is that the cost for not-fully-interleaved group should be not greater than the cost for the same fully-interleaved group, but perhaps somewhat less. Examples: https://godbolt.org/z/a78dK5Geq (i64 stride 4 (indices 012u) vf 2 => cost 4) https://godbolt.org/z/G91ceo8dM (i64 stride 4 (indices 01uu) vf 2 => cost 2) https://godbolt.org/z/5joYob9rx (i64 stride 4 (indices 0uuu) vf 2 => cost 1) As we have established over the course of last ~70 patches, (wow) `BaseT::getInterleavedMemoryOpCos()` is absolutely bogus, it is usually almost an order of magnitude overestimation, so i would claim that we should at least use the hardcoded costs of fully interleaved load groups. We could go further and adjust them e.g. by the number of demanded indices, but then i'm somewhat fearful of underestimating the cost. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D111174	2021-10-14 23:14:36 +03:00
Philip Reames	7f3861cfdb	autogen tests for ease of update	2021-10-14 13:04:22 -07:00
Craig Topper	79ae9562cc	[RISCV] Remove unused member variable. NFC	2021-10-14 12:56:47 -07:00
Nikita Popov	69853f9920	[IVUsers] Move preheader check into SCEVExpander Rather than checking for loop nest preheaders upfront in IVUsers, move this requirement into isSafeToExpand() from SCEVExpander. Historically, LSR did not check whether SCEVs are safe to expand and fully relied on IVUsers to validate this. Later, support for non-expandable SCEVs was added via rigid formulas. Checking this in isSafeToExpand() makes it more obvious what exactly this check is guarding against, and avoids the awkward loop nest scan. This is a followup to https://reviews.llvm.org/D111493#3055286. Differential Revision: https://reviews.llvm.org/D111681	2021-10-14 21:52:31 +02:00
Aaron Ballman	68157fe15b	Fix a crash on valid consteval code. Not all constants are emitted within the context of a function, so use the module's ASTContext instead because 1) that's the same as the current function ASTContext, and 2) the module can never be null. Fixes PR50787.	2021-10-14 15:48:10 -04:00
Raphael Isemann	482c53fa0d	[lldb] Move ~Platform to source file The called destructors of the members require the includes that are only in the source file.	2021-10-14 21:36:46 +02:00
Frederic Cambus	8ecbcd058f	[Driver][Darwin] Use T reference instead of getToolChain().getTriple(). Differential Revision: https://reviews.llvm.org/D111793	2021-10-14 21:30:39 +02:00
Craig Topper	3ff9cc01f2	[X86] Use CMOVNS for abs instead of CMOVGE. CMOVGE reads SF and OF. CMOVNS only reads SF. This matches with other recent changes to use a single flag where possible. It also matches gcc codegen. I believe this technically changes whether the conditioanl move happens on INT_MIN, but for INT_MIN both registers are the same so it doesn't matter. Differential Revision: https://reviews.llvm.org/D111826	2021-10-14 12:28:28 -07:00
Michael Kruse	19db33c06e	[Polly] Remove support for code generated by gfortran+DragonEgg. DragonEgg is not maintained anymore, hence there is no need for this functionality. Fixes llvm.org/PR52173	2021-10-14 14:12:06 -05:00
Michael Kruse	a5e52ce3f2	[Polly][docs] Fix itemize list for release notes. Make the changes top-level items, instead of subitems of the "Changes..." placeholder.	2021-10-14 13:50:18 -05:00
Aaron Ballman	b9941de0bf	Fix a rejects-valid with consteval on overloaded operators It seems that Clang 11 regressed functionality that was working in Clang 10 regarding calling a few overloaded operators in an immediate context. Specifically, we were not checking for immediate invocations of array subscripting and the arrow operators, but we properly handle the other overloaded operators. This fixes the two problematic operators and adds some test coverage to show they're equivalent to calling the operator directly. This addresses PR50779.	2021-10-14 14:47:29 -04:00
Raphael Isemann	e632e900ac	[lldb] Remove logging from Platform::~Platform Platform instances are stored in a function-local static list. However, the logging code involves locking a function-local static mutex. This only works on some implementations where the Log mutex is by accident destroyed after the Platform list is destroyed. This fixes randomly failing tests due to `recursive_mutex lock failed: Invalid argument`. Reviewed By: kastiglione Differential Revision: https://reviews.llvm.org/D111816	2021-10-14 20:42:45 +02:00
Rob Suderman	59dd418e89	[mlir][tosa] Fix tosa.cast UiToFp32 for tosa-to-linalg Part of the arith update broke UiToFp32. Fixed the lowering and included a new test to detect a regression. Differential Revision: https://reviews.llvm.org/D111772	2021-10-14 11:34:10 -07:00
Raphael Isemann	78e17e23aa	[lldb] Rewrite TestDiamond and document some bugs.	2021-10-14 20:32:07 +02:00
David Tenty	228b3b729d	[libc++][AIX] Add scripts and config for building with the libcxx CI infrastructure This initial change adds the AIX configuration to run-buildbot, an AIX CMake cache file, and appropriate compiler and linker flags for testing AIX to the lit "from scratch" configuration files. Either of the 32-bit or 64-bit configurations can be built by setting `OBJECT_MODE` in the build environment (as is typical for AIX). Reviewed By: ldionne, #libc, #libc_abi Differential Revision: https://reviews.llvm.org/D111244	2021-10-14 14:31:10 -04:00
Nikita Popov	5f05ff081f	[BasicAA] Improve scalable vector handling Currently, DecomposeGEP() bails out on the whole decomposition if it encounters a scalable GEP type anywhere. However, it is fine to still analyze other GEPs that we look through before hitting the scalable GEP. This does mean that the decomposed GEP base is no longer required to be the same as the underlying object. However, I don't believe this property is necessary for correctness anymore. This allows us to compute slightly more precise aliasing results for GEP chains containing scalable vectors, though my primary interest here is simplifying the code. Differential Revision: https://reviews.llvm.org/D110511	2021-10-14 20:23:50 +02:00
Daniel Sanders	0a869ef3a8	[llvm-mca][timeline] Indicate output was stopped due to cycle limit. It can be a bit confusing to stop with no explanation so we should indicate when further output was prevented by the cycle limit. Differential Revision: https://reviews.llvm.org/D111753	2021-10-14 11:10:09 -07:00
Kai Nacke	b050564d3e	[AIX] Ignore case when comparing output from od POSIX does not define the exact output from od tool. While most implementations use lower case characters in hex output, the z/OS USS implementation uses upper case characters. To avoid LIT failures, the FileCheck option to ignore the case must be used when checking hex bytes. Reviewed By: abhina.sreeskantharajan Differential Revision: https://reviews.llvm.org/D111427	2021-10-14 13:51:02 -04:00
Simon Pilgrim	871f773986	[TTI][X86] Merge getInterleavedMemoryOpCostAVX2 into getInterleavedMemoryOpCost. NFC This a NFC refactor patch to merge the AVX2 interleaved cost handling back into the getInterleavedMemoryOpCost base method - while getInterleavedMemoryOpCostAVX512 uses instruction and patterns very specific to AVX512+, much of the costs analysis for AVX2 can be reused for all SSE targets. This is the first step towards improving SSE and AVX1 costs that will reuse the relevant AVX2 costs by splitting some of the tables - for instance AVX1 has very similar costs for most vXi64/vXf64 interleave patterns and many sub-128bit vector costs are the same all the way down to SSE2 (or at least SSSE3). Differential Revision: https://reviews.llvm.org/D111822	2021-10-14 18:46:25 +01:00
Frederic Cambus	f7a3214306	[Driver][WebAssembly] Use ToolChain reference instead of getToolChain(). Differential Revision: https://reviews.llvm.org/D111786	2021-10-14 19:43:59 +02:00
Michael Kruse	5f668bba55	[Polly] Clean up Polly's getting started docs. This patch removes the broken bash scipt (polly.sh) and fixes the broken setup instructions in get_started.html. It also adds instructions for using Ninja and links to the LLVM getting started page. Reviewed By: Meinersbur, InnovativeInventor Differential Revision: https://reviews.llvm.org/D111685	2021-10-14 12:26:57 -05:00
Simon Pilgrim	fcbec7e668	[TTI][X86] Swap getInterleavedMemoryOpCostAVX2/getInterleavedMemoryOpCostAVX512 implementations. NFC. I have some upcoming refactoring for SSE/AVX1 interleaving cost support, and the diff is a lot nicer if the (unaltered) AVX512 implementation isn't stuck between getInterleavedMemoryOpCost and getInterleavedMemoryOpCostAVX2	2021-10-14 18:10:03 +01:00
Simon Pilgrim	13185f0154	[Transforms] eliminateDeadStores - remove unused variable. NFC. The initial MemoryAccess *Current assignment is never used, and all other uses are initialized/used within the worklist loop (and not across multiple iterations) - so move the variable internal to the loop. Fixes scan-build unused assignment warning.	2021-10-14 18:10:03 +01:00
Yitzhak Mandelbaum	b6c218d4fd	[libTooling] Add "switch"-like Stencil combinator Adds `selectBound`, a `Stencil` combinator that allows the user to supply multiple alternative cases, discriminated by bound node IDs. Differential Revision: https://reviews.llvm.org/D111708	2021-10-14 16:45:37 +00:00
Kevin P. Neal	727a891ec8	[FPEnv][InstSimplify] Fold fadd X, 0 ==> X, when we know X is not -0 Currently the fadd optimizations in InstSimplify don't know how to do this NoSignedZeros "X + 0.0 ==> X" fold when using the constrained intrinsics. This adds the support. This review is derived from D106362 with some improvements from D107285 and is a follow-on to D111085. Differential Revision: https://reviews.llvm.org/D111450	2021-10-14 12:32:45 -04:00
Craig Topper	f7ba572483	[RISCV] Update Zba, Zbb, Zbc, and Zbs version from 0.93 to 1.0. I've removed the Zbs W instructions that are not part of the frozen spec. References to B as an extension name have been removed. Tests are updated or split accordingly. Reviewed By: luismarques Differential Revision: https://reviews.llvm.org/D110669	2021-10-14 09:25:03 -07:00
Vitaly Buka	8282024a74	[sanitizer] Move out stack trace pointer from header StackDepot Trace pointers accessed very rarely and don't need to be in hot data. Depends on D111613. Reviewed By: dvyukov Differential Revision: https://reviews.llvm.org/D111614	2021-10-14 09:23:04 -07:00
Nikita Popov	a8e7d11aca	[ValueTracking] Simplify getKnowledgeValidInContext() call (NFC) This accepts an ArrayRef, there's no need to create a SmallVector.	2021-10-14 18:17:54 +02:00
Wenlei He	a316343e19	[llvm-profgen] Allow generating AutoFDO profile from CSSPGO binary Add `-use-dwarf-correlation` switch to allow llvm-profgen to generate AutoFDO profile for binaries built with CSSPGO (pseudo-probe). Differential Revision: https://reviews.llvm.org/D111776	2021-10-14 09:11:56 -07:00
Joe Loser	1fa27f2a10	[libc++] LWG3480: make (recursive_)directory_iterator C++20 ranges Implement LWG3480 which enables `directory_iterator` and `recursive_directory_iterator` to be both a `borrowed_range` and a `view`. Reviewed By: ldionne, #libc Differential Revision: https://reviews.llvm.org/D111644	2021-10-14 12:02:18 -04:00
Julien Pages	e4e48e2f02	[AMDGPU] Add more tests for build_vector Differential Revision: https://reviews.llvm.org/D111652	2021-10-14 11:54:17 -04:00
Gabor Marton	ac3edc5af0	[analyzer][solver] Handle simplification to ConcreteInt The solver's symbol simplification mechanism was not able to handle cases when a symbol is simplified to a concrete integer. This patch adds the capability. E.g., in the attached lit test case, the original symbol is `c + 1` and it has a `[0, 0]` range associated with it. Then, a new condition `c == 0` is assumed, so a new range constraint `[0, 0]` comes in for `c` and simplification kicks in. `c + 1` becomes `0 + 1`, but the associated range is `[0, 0]`, so now we are able to realize the contradiction. Differential Revision: https://reviews.llvm.org/D110913	2021-10-14 17:53:29 +02:00
Mark de Wever	25a3463c44	[libc++][NFC] Fixes placement of the return type.	2021-10-14 17:40:45 +02:00
Dave Lee	722a2fb7f9	[lldb] Fix 'frame diagnose' docstring typo	2021-10-14 08:32:20 -07:00
Nicolas Vasilache	82dd977baf	[mlir][Linalg] Tighten canonicalization of InsertSliceOp that triggers infinite loop I am unclear this is reproducible with correct IR but atm the verifier for InsertSliceOp is not powerful enough and this triggers an infinite loop that is worth fixing independently. Differential Revision: https://reviews.llvm.org/D111812	2021-10-14 15:26:03 +00:00
Nicolas Vasilache	0eeaad3012	[mlir][Linalg] Fix insertion point in comprehensive bufferization	2021-10-14 15:24:09 +00:00
luxufan	849b36bf6f	[JITLink][NFC] Add TableManager to replace PerGraph...Builder pass This patch add a TableManager which reponsible for fixing edges that need entries to reference the target symbol and constructing such entries. In the past, the PerGraphGOTAndPLTStubsBuilder pass was used to build GOT and PLT entry, and the PerGraphTLSInfoEntryBuilder pass was used to build TLSInfo entry. By generalizing the behavior of building entry, I added a TableManager which could be reused when built GOT, PLT and TLSInfo entries. If this patch makes sense and can be accepted, I will apply the TableManager to other targets(MachO_x86_64, MachO_arm64, ELF_riscv), and delete the file PerGraphGOTAndPLTStubsBuilder.h Reviewed By: lhames Differential Revision: https://reviews.llvm.org/D110383	2021-10-14 23:09:53 +08:00
Jinsong Ji	4fee8a1691	[NFC][compiler-rt][profile] Remove non-Posix -h option from test We are running `ls -lh` in gcov-execlp.c test in Posix folder. However `-h` is not a POSIX option,ls on some POSIX system (eg: AIX) may not support it. This patch remove this option to avoid break. Reviewed By: anhtuyen Differential Revision: https://reviews.llvm.org/D111807	2021-10-14 15:08:38 +00:00
Ben Shi	c2e5c95a14	[RISCV][test] Add tests of (add (shl r, c0), c1) Reviewed By: craig.topper, luismarques Differential Revision: https://reviews.llvm.org/D111116	2021-10-14 14:53:03 +00:00
Tobias Gysi	3f335ffffe	[mlir][linalg] Fix FusionOnTensors header and make local method static (NFC). Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D111798	2021-10-14 14:09:38 +00:00
Jeremy Morse	b5426ced71	[DebugInfo][InstrRef] Place variable-values PHI using LLVM utilities This patch is very similar to D110173 / `a3936a6c19`, but for variable values rather than machine values. This is for the second instr-ref problem, calculating the correct variable value on entry to each block. The previous lattice based implementation was broken; we now use LLVMs existing PHI placement utilities to work out where values need to merge, then eliminate un-necessary ones through value propagation. Most of the deletions here happen in vlocJoin: it was trying to pick a location for PHIs to happen in, badly, leading to an infinite loop in the MIR test added, where it would repeatedly switch between register locations. The new approach is simpler: either PHIs can be eliminated, or they can't, and the location of the value is a different problem. Various bits and pieces move to the header so that they can be tested in the unit tests. The DbgValue class grows a "VPHI" kind to represent variable value PHIS that haven't been eliminated yet. Differential Revision: https://reviews.llvm.org/D110630	2021-10-14 14:43:43 +01:00
Brian Cain	743e263e08	[hexagon] Add system register, transfer support This commit adds the system reg/regpair definitions and the corresponding register transfer instructions.	2021-10-14 06:37:04 -07:00
Andrew Savonichev	a567fd8a08	Fixup [NVPTX] Add VRFrame and VRFrameLocal to integer register classes	2021-10-14 16:32:16 +03:00
Andrew Savonichev	51eefa8164	[NVPTX] Add VRFrame and VRFrameLocal to integer register classes These registers are used as operands for instructions that expect an integer register, so they should be added to Int32Regs or Int64Regs register classes. Otherwise the machine verifier emits an error for the following LIT tests when LLVM_ENABLE_MACHINE_VERIFIER=1 environment variable is set: * Bad machine code: Illegal physical register for instruction * - function: kernel_func - basic block: %bb.0 entry (0x55c8903d5438) - instruction: %3:int64regs = LEA_ADDRi64 $vrframelocal, 0 - operand 1: $vrframelocal $vrframelocal is not a Int64Regs register. CodeGen/NVPTX/call-with-alloca-buffer.ll CodeGen/NVPTX/disable-opt.ll CodeGen/NVPTX/lower-alloca.ll CodeGen/NVPTX/lower-args.ll CodeGen/NVPTX/param-align.ll CodeGen/NVPTX/reg-types.ll DebugInfo/NVPTX/dbg-declare-alloca.ll DebugInfo/NVPTX/dbg-value-const-byref.ll Differential Revision: https://reviews.llvm.org/D110164	2021-10-14 16:19:03 +03:00
Florian Hahn	094faa5fca	[VectorCombine] Add test showing issue when running VectorCombine early. Running -vector-combine early can introduce new vector operations, blocking loop/SLP vectorization. The added test case could be better optimized by the SLPVectorizer if no new vector operations are added early.	2021-10-14 14:03:02 +01:00
Jonas Paulsson	c0d88613f2	[SystemZ] Remove some now unused ISD XXX_LOOP opcodes.	2021-10-14 14:55:44 +02:00

1 2 3 4 5 ...

401858 Commits All Branches Search

401858 Commits

All Branches