llvm-project

Commit Graph

Author	SHA1	Message	Date
David Green	560856d56a	[AArch64] Format the PerfectShuffle table. NFC	2022-04-06 18:27:57 +01:00
Nathan Sidwell	df4522feb7	[demangler] Fix undocumented Local encoding GCC emits [some] static symbols with an 'L' mangling, which we attempt to demangle. But the module mangling changes have exposed that we were doing so at the wrong level. Such manglings are outside of the ABI as they are internal-linkage, so a bit of reverse engineering was needed. This adjusts the demangler along the same lines as the existing gcc demangler (which is not yet module-aware). 'L' is part of an unqualified name. As before we merely parse the 'L', and then ignore it. Reviewed By: iains Differential Revision: https://reviews.llvm.org/D123138	2022-04-06 10:12:36 -07:00
Craig Topper	1ad36487e9	[LegalizeDAG] Use SelectionDAG::getBoolConstant to simplify some code. NFC	2022-04-06 10:08:11 -07:00
Craig Topper	e13a44b460	[RISCV] Add lowering for vp.sext and vp.zext. Including mask vector inputs. Reviewed By: frasercrmck, rogfer01 Differential Revision: https://reviews.llvm.org/D123150	2022-04-06 09:59:49 -07:00
Craig Topper	5b5f59428c	[DAGCombiner] Replace call getSExtOrTrunc with a truncate. NFC The extend case should never occur. The sign extend would be an arbitrary choice, remove it to avoid confusion.	2022-04-06 09:59:45 -07:00
Arthur Eubanks	1342b861a3	[test][DSE] Precommit test for D123162	2022-04-06 09:54:37 -07:00
Simon Pilgrim	4b569d30fd	[X86] Add Issue #42433 test case	2022-04-06 17:51:55 +01:00
Nico Weber	d0141ef232	[dsymutil] Fix a few TODOs about reporting errors to the user I saw the TODOs while reading this file and figured I'd do them. I haven't seen these happen in practice. No expected behavior change. Differential Revision: https://reviews.llvm.org/D123215	2022-04-06 12:41:02 -04:00
Nico Weber	c9bea51760	[dsymutil] Fix O(n^2) behavior when running on ld64.lld's current ICF output STABS information consists of a list of records in the linked binary that look like this: OSO: path/to/some.o SO: path/to/some.c FUN: sym1 FUN: sym2 ... The linked binary has one such set of records for every .o file linked into it. When dsymutil processes the binary's STABS information, it: 1. Reads the .o file mentioned in the OSO line 2. For each FUN entry after it in the main executable's STABS info: a) it looks up that symbol in the symbol of that .o file b) if it doesn't find it there, it goes through all symbols in the main binary at the same address and sees if any of those match With ICF, ld64.lld's STABS output claims that all identical functions that were folded are in the .o file of the one that's deemed the canonical one. Many small functions might be folded into a single function, so there are .o OSO entries that end up with many FUN lines, but almost none of them exist in the .o file's symbol table. Previously, dsymutil would do a full scan of all symbols in the main executable _for every of these entries_. This patch instead scans all aliases once and remembers them per name. This reduces the alias resolution complexity from O(number_of_aliases_in_o_file * number_of_symbols_in_main_executable) to O(number_of_aliases_in_o_file * log(number_of_aliases_in_o_file)). In practice, it reduces the time spent to run dsymutil on Chromium Framework from 26 min (after https://reviews.llvm.org/D89444) or 12 min (before https://reviews.llvm.org/D89444) to ~8m30s. We probably want to change how ld64.lld writes STABS entries when ICF is enabled, but making dsymutil not have pathological performance for this input seems like a good change as well. No expected behavior change (other than it's faster). I verified that for Chromium Framework, the generated .dSYM is identical with and without this patch. Differential Revision: https://reviews.llvm.org/D123218	2022-04-06 12:39:49 -04:00
Paul Walker	7d3af9ef0f	[DAGCombine] insert_subvector undef, (splat X), N2 -> splat X Differential Revision: https://reviews.llvm.org/D120328	2022-04-06 17:15:38 +01:00
Fraser Cormack	6be5e875be	[RISCV][VP] Add basic RVV codegen for vp.icmp This patch adds the minimum required to successfully lower vp.icmp via the new ISD::VP_SETCC node to RVV instructions. Regular ISD::SETCC goes through a lot of canonicalization which targets may rely on which has not hereto been ported to VP_SETCC. It also supports expansion of individual condition codes and a non-boolean return type. Support for all of that will follow in later patches. In the case of RVV this largely isn't a problem as the vector integer comparison instructions are plentiful enough that it can lower all VP_SETCC nodes on legal integer vectors except for boolean vectors, which regular SETCC folds away immediately into logical operations. Floating-point VP_SETCC operations aren't as well supported in RVV and the backend relies on condition code expansion, so support for those operations will come in later patches. Portions of this code were taken from the VP reference patches. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D122743	2022-04-06 16:51:22 +01:00
LLVM GN Syncbot	324ac838ae	[gn build] Port `d78624975b`	2022-04-06 15:52:20 +00:00
Matthias Springer	7e6de86ff6	[mlir][bufferize][NFC] Remove caller map and ordered func list from FuncAnalysisState These can be local variables. No need to store them in the struct. Differential Revision: https://reviews.llvm.org/D123210	2022-04-07 00:49:24 +09:00
Matthias Springer	cb186bc508	[mlir][bufferize][NFC] Rename ModuleAnalysisState to FuncAnalysisState This is for consistency reasons. `*AnalysisState` always starts with the name of the dialect. Differential Revision: https://reviews.llvm.org/D123209	2022-04-07 00:48:53 +09:00
Mark de Wever	3fb3770983	[libc++] Use cpp20_output_iterator in tests. Adds the new cpp20_output_iterator in the ranges::transform test. Reviewed By: philnik, #libc Differential Revision: https://reviews.llvm.org/D123139	2022-04-06 17:48:27 +02:00
Mark de Wever	d78624975b	[NFC][libc++] Modularize chrono's calendar. The is a followup of D116965 to split the calendar header. This is a preparation to add the formatters for the chrono header. The code is only moved no other changes have been made. Reviewed By: ldionne, #libc, philnik Differential Revision: https://reviews.llvm.org/D122995	2022-04-06 17:47:53 +02:00
Arjun P	1ba6043332	[MLIR][Presburger] Refactor subtraction in preparation for making it iterative Refactor the operation of subtraction by - removing the usage of SimplexRollbackScopeExit since this can't be used in the iterative version - reducing the number of stack variables to make the iterative version easier to follow Reviewed By: Groverkss Differential Revision: https://reviews.llvm.org/D123156	2022-04-06 16:35:28 +01:00
Roman Lebedev	9be6e7b0f2	[X86] `lowerBuildVectorAsBroadcast()`: with AVX512VL, allow i64->XMM broadcasts from constant pool Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D123221	2022-04-06 18:33:40 +03:00
Sam McCall	af89e4792d	[pseudo] Add crude heuristics to choose taken preprocessor branches. In files where different preprocessing paths are possible, our goal is to choose a preprocessed token sequence which we can parse that pins down as much of the grammatical structure as possible. This forms the "primary parse", and the not-taken branches get parsed later, and are constrained to be compatible with the primary parse. Concretely: int x = #ifdef // TAKEN 2 + 2 + 2 // determined during primary parse to be an expression #else 2 // constrained to be an expression during a secondary parse #endif ; Differential Revision: https://reviews.llvm.org/D121165	2022-04-06 17:22:35 +02:00
Matthias Springer	f4f1cf6c31	[mlir][bufferize] Better analysis for return values of CallOps Support returning arbitrary tensors from functions. Even those that are not equivalent. To that end, additional information is gathered during the analysis phase. In particular, which function args are aliasing with which return values. Also fix bugs in the current implementation when returning equivalent tensors. Various unit tests are added to ensure that we have better test coverage. Note: Returning non-equivalent tensors is only allowed when allowReturnAllocs is enabled. This functionality is useful for unit testing and compatibility with other bufferizations such as the sparse compiler. This is also towards using ModuleBufferization as a replacement for --func-bufferize. Differential Revision: https://reviews.llvm.org/D119120	2022-04-06 23:54:32 +09:00
Matthias Springer	cd7de446fd	[mlir][bufferize] Simplify ModuleBufferization driver * Bufferize FuncOp bodies and boundaries in the same loop. This is in preparation of moving FuncOp bufferization into an external model implementation. * As a side effect, stop bufferization earlier if there was an error. (Do not continue bufferization, fewer error messages.) * Run equivalence analysis of CallOps before the main analysis. This is needed so that equialvence info is propagated properly. Differential Revision: https://reviews.llvm.org/D123208	2022-04-06 23:53:07 +09:00
Matthias Springer	5ab34492d6	[mlir][bufferize] Fix dropped return type in ModuleBufferization Differential Revision: https://reviews.llvm.org/D123192	2022-04-06 23:48:15 +09:00
Paul Walker	1c307b9794	[NFC] Remove redundant IndexType canonicalisation from DAGTypeLegalizer::PromoteIntOp_MSCATTER Promotion does not affect the base element type and so the original index type will remain unchanged. This reflects the behaviour of DAGTypeLegalizer::PromoteIntOp_MGATHER with no tests affected.	2022-04-06 15:30:29 +01:00
Paul Walker	5e407f0887	[SVE] Add gather/scatter tests to highlight bugs in their generated code.	2022-04-06 15:30:29 +01:00
LLVM GN Syncbot	c59e833942	[gn build] Port `afa94306a8`	2022-04-06 14:24:39 +00:00
Sam McCall	afa94306a8	[clangd] Add code action to generate a constructor for a C++ class Differential Revision: https://reviews.llvm.org/D116514	2022-04-06 16:23:50 +02:00
LLVM GN Syncbot	bb47e1fe3d	[gn build] Port `68eac9a6e7`	2022-04-06 14:15:16 +00:00
Sam McCall	68eac9a6e7	[clangd] Code action to declare missing move/copy constructor/assignment Fixes https://github.com/clangd/clangd/issues/973 Differential Revision: https://reviews.llvm.org/D116490	2022-04-06 16:14:42 +02:00
Shengchen Kan	05535f3d07	[X86][tablgen] Add one entry manually into the memory folding table ``` {"MMX_MOVD64grr", "MMX_MOVD64mr"} ``` This pair has different opcodes.	2022-04-06 22:06:15 +08:00
chenglin.bi	87f0d55304	[AArch64] Fold lsr+bfi in tryBitfieldInsertOpFromOr In tryBitfieldInsertOpFromOr, if the new created LSR Node's source is LSR with Imm shift, try to fold them. Fixes https://github.com/llvm/llvm-project/issues/54696 Reviewed By: efriedma, benshi001 Differential Revision: https://reviews.llvm.org/D122915	2022-04-06 22:02:31 +08:00
Nikita Popov	1dc1d5a0d2	[SimplifyLibCalls] Use KnownBits helper APIs (NFC) Use helper APIs for isNonNegative() and getMaxValue() instead of flipping the zero value and having a long comment explaining why that is necessary.	2022-04-06 16:01:24 +02:00
Paul Robinson	31c971145f	[PS4] clang-format PS4CPU.cpp/.h	2022-04-06 06:52:29 -07:00
Augie Fackler	33b1f41914	MemoryBuiltins: getAllocAlignment is now useful for non-allocator funcs This has been true since `dba73135c8`, but didn't matter until now because clang wasn't emitting allocalign attributes. Differential Revision: https://reviews.llvm.org/D121640	2022-04-06 09:51:38 -04:00
Jay Foad	538c77172a	[AMDGPU] Fix unused variable warning after D117484	2022-04-06 14:45:38 +01:00
Jean Perier	c58c64d05c	[flang] Add runtime API to catch unit number out of range Unit numbers must fit on a default integer. It is however possible that the user provides the unit number in UNIT with a wider integer type. In such case, lowering was previously silently narrowing the value and passing the result to the BeginXXX runtime entry points. Cases where the conversion caused overflow were not reported/caught. Most existing compilers catch these errors and raise an IO error. Add a CheckUnitNumberInRange runtime API to do the same in f18. This runtime API has its own error management interface (i.e., does not use GetIoMsg, EndIo, and EnableHandlers) because the usual error management requires BeginXXX to be called to set up the error management. But in this case, the BeginXXX cannot be called since the bad unit number that would be provided to it overflew (and in the worst case scenario, the narrowed value could point to a different valid unit already in use). Hence I decided to make an API that must be called before the BeginXXX and should trigger the whole BeginXXX/.../EndIoStatement to be skipped in case the unit number is too big and the user enabled error recovery. Note that CheckUnitNumberInRange accepts negative numbers (as long as they can fit on a default integer), because unit numbers may be negative if they were created by NEWUNIT. Differential Revision: https://reviews.llvm.org/D123157	2022-04-06 15:38:13 +02:00
Shengchen Kan	f4661b5a55	[X86] Fold MMX_MOVD64from64rr + store to MMX_MOVQ64mr instead of MMX_MOVD64from64mr in auto-generated table This is a follow-up patch for D122241.	2022-04-06 21:33:57 +08:00
zhongyunde	9a2d5cc1da	[SVE][AArch64] Enable first active true vector combine for INTRINSIC_WO_CHAIN WHILELO/LS insn is used very important for SVE loop, and itself is a flag-setting operation, so add it. Reviewed By: paulwalker-arm, david-arm Differential Revision: https://reviews.llvm.org/D122796	2022-04-06 21:01:37 +08:00
Hansang Bae	e4ac11beb7	[OpenMP] Add support for ompt_callback_dispatch This change adds support for ompt_callback_dispatch with the new dispatch chunk type introduced in 5.2. Definitions of the new ompt_work_loop types were also added in the header file. Differential Revision: https://reviews.llvm.org/D122107	2022-04-06 08:01:02 -05:00
zhongyunde	19e5235147	[AArch64][InstCombine] Fold MLOAD and zero extensions into MLOAD Accord the discussion in D122281, we missing an ISD::AND combine for MLOAD because it relies on BuildVectorSDNode is fails for scalable vectors. This patch is intend to handle that, so we can circle back the type MVT::nxv2i32 Reviewed By: paulwalker-arm Differential Revision: https://reviews.llvm.org/D122703	2022-04-06 20:50:42 +08:00
Louis Dionne	e27a122b3a	[libc++] Support arrays in make_shared and allocate_shared (P0674R1) This patch implements P0674R1, i.e. support for arrays in std::make_shared and std::allocate_shared. Co-authored-by: Zoe Carver <z.zoelec2@gmail.com> Differential Revision: https://reviews.llvm.org/D62641	2022-04-06 08:42:55 -04:00
Shengchen Kan	eddd399c98	[X86][tablgen] Add three entries manually into the memory folding table ``` {X86::MOVLHPSrr,X86::MOVHPSrm} {X86::VMOVLHPSZrr,X86::VMOVHPSZ128rm} {X86::VMOVLHPSrr,X86::VMOVHPSrm} ``` Each of the three pairs has different mnemonic, so we have to add it manually. This is a follow-up patch for D122477.	2022-04-06 20:37:39 +08:00
Nico Weber	edddf384c2	[gn build] (manually) port `83a798d4b0` (abi_breaking_checks in tests)	2022-04-06 08:31:20 -04:00
Simon Pilgrim	3681292294	[AMDGPU] Regenerate shared-op-cycle.ll test	2022-04-06 12:23:17 +01:00
Simon Pilgrim	f743159037	[AMDGPU] Regenerate pv-packing.ll test	2022-04-06 12:23:17 +01:00
Roman Lebedev	34ce9fd864	[TLI] `TargetLowering::SimplifyDemandedVectorElts()`: narrowing bitcast: fill known zero elts from known src bits E.g. in ``` %i0 = zext <2 x i8> to <2 x i16> %i1 = bitcast <2 x i16> to <4 x i8> ``` the `%i0`'s zero bits are known to be `0xFF00` (upper half of every element is known zero), but no elements are known to be zero, and for `%i1`, we don't know anything about zero bits, but the elements under `0b1010` mask are known to be zero (i.e. the odd elements). But, we didn't perform such a propagation. Noticed while investigating more aggressive `vpmaddwd` formation. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D123163	2022-04-06 14:19:31 +03:00
Daniil Kovalev	83a798d4b0	[CodeGen] Place SDNode debug ID declaration under appropriate #if Place PersistentId declaration under #if LLVM_ENABLE_ABI_BREAKING_CHECKS to reduce memory usage when it is not needed. Differential Revision: https://reviews.llvm.org/D120714	2022-04-06 14:09:32 +03:00
Alex Zinenko	82c18dd9ad	[mlir] Fix DialectRegistry::addExtension compile error It appears that the DialectRegistry::addExtension template was never instantiated because it contains an obvious compilation error. Fix it. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D123199	2022-04-06 13:00:34 +02:00
Nathan Sidwell	ba4482f481	[clang][NFC] Add specificity to compatibility hack Add specific dates and versions to note about source_location handling. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D123119	2022-04-06 03:57:36 -07:00
Jeremy Morse	fb6596f1ec	[DebugInfo][InstrRef] Avoid a crash from mixed variable location modes Variable locations now come in two modes, instruction referencing and DBG_VALUE. At -O0 we pick DBG_VALUE to allow fast construction of variable information. Unfortunately, SelectionDAG edits the optimisation level in the presence of opt-bisect-limit, meaning different passes have different views of what variable location mode we should use. That causes assertions when they're mixed. This patch plumbs through a boolean in SelectionDAG from start to instruction emission, so that we don't rely on the current optimisation level for correctness. Differential Revision: https://reviews.llvm.org/D123033	2022-04-06 11:55:38 +01:00
Sven van Haastregt	77c74fd877	[OpenCL] Remove argument names from math builtins This simplifies completeness comparisons against OpenCLBuiltins.td and also makes the header no longer "claim" the argument name identifiers. Continues the direction set out in D119560.	2022-04-06 11:43:59 +01:00

1 2 3 4 5 ...

420339 Commits All Branches Search

420339 Commits

All Branches