llvm-project

Commit Graph

Author	SHA1	Message	Date
Arthur Eubanks	98574aa1d2	[test] Remove references to -fno-legacy-pass-manager in tests This has been the default for a while and we're in the process of removing the legacy PM optimization pipeline.	2022-04-11 13:40:27 -07:00
Fangrui Song	982247dce5	Value::isTransitiveUsedByMetadataOnly: Don't repeatedly add an element to the worklist. NFC	2022-04-11 13:35:25 -07:00
Arthur Eubanks	61d418f971	[test] Remove references to -fexperimental-new-pass-manager in tests This has been the default for a while and we're in the process of removing the legacy PM optimization pipeline.	2022-04-11 13:29:08 -07:00
Richard	d563c2d0e5	[clang-tidy] Support parenthesized literals in modernize-macro-to-enum When scanning a macro expansion to examine it as a candidate enum, first strip off arbitrary matching parentheses from the outside in, then examine what remains to see if it is Lit, +Lit, -Lit or ~Lit. If not, reject it as a possible enum candidate. Differential Revision: https://reviews.llvm.org/D123479 Fixes #54843	2022-04-11 14:06:48 -06:00
Jonas Devlieghere	cbcb3bcee3	[lldb] Don't report progress in the REPL Don't report progress events in the REPL. Most of the progress events are debugger specific which are useful when you're debugging, but not so much when you're waiting for the next line to be executed in the REPL. This patch disables reporting of progress events when in REPL mode. rdar://91502950 Differential revision: https://reviews.llvm.org/D123426	2022-04-11 13:06:40 -07:00
Biplob Mishra	d06fb9045b	AArch64 adding more tests to show the simple scenarios for or/and combine	2022-04-11 20:54:12 +01:00
Sanjay Patel	1206a18d41	[InstCombine] guard against splat-mul corner case The test is already simplified, and I'm not sure how to write a test to exercise the new clause. But it protects the 2-bit pattern from miscompiling as noted in D123453. https://alive2.llvm.org/ce/z/QPyVfv (If we managed to fall into the mul transform, it would wrongly create a zero on this pattern.)	2022-04-11 15:50:13 -04:00
Arjun P	aafb428237	[MLIR][Presburger][Simplex] symbolic lexmin: add some normalization heuristics Normalize some of the division and inequality expressions used, which can improve performance. Also deduplicate some of the normalization functionality throughout the Presburger library. Reviewed By: Groverkss Differential Revision: https://reviews.llvm.org/D123314	2022-04-11 20:46:30 +01:00
Jez Ng	1cff723ff5	[lld-macho][nfc] Use includeInSymtab for all symtab-skipping logic {D123302} got me looking deeper at `includeInSymtab`. I thought it was a little odd that there were excluded (live) symbols for which `includeInSymtab` was false; we shouldn't have so many different ways to exclude a symbol. As such, this diff makes the `L`-prefixed-symbol exclusion code use `includeInSymtab` too. (Note that as part of our support for `__eh_frame`, we will also be excluding all `__eh_frame` symbols from the symtab in a future diff.) Another thing I noticed is that the `emitStabs` code never has to deal with excluded symbols because `SymtabSection::finalize()` already filters them out. As such, I've updated the comments and asserts from {D123302} to reflect this. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D123433	2022-04-11 15:45:46 -04:00
Arjun P	a2b212bafc	[MLIR][Presburger] subtract: fix bug in the non-recursive implementation When making the subtract implementation non-recursive, tail calls were implemented by incrementing the level but not pushing a frame, and returning was implemented as returning to the level corresponding to the number of frames in the stack. This is incorrect, as there could be a case where we tail-recurse at `level`, and then recurse at `level + 1`, pushing a frame. However, because the previous frame was missing, this new frame would be interpreted as corresponding to `level` and not `level + 1`. Fix this by removing the special handling of tail calls and just doing them as normal recursion, as this is the simplest correct implementation and handling them specifically would be a premature optimization. The impact of this bug is only on performance as this can only lead to unnecessary subtractions of the same disjuncts multiples times. As subtraction is idempotent, and rationally empty disjuncts are always discarded, this does not affect the output, so this patch does not include a regression test. (This also does not affect termination.) Reviewed By: Groverkss Differential Revision: https://reviews.llvm.org/D123327	2022-04-11 20:45:29 +01:00
Fangrui Song	aefa4b60ce	[Driver] Simplify hasFlag pattern with addOptInFlag/addOptOutFlag helpers Reviewed By: dexonsmith Differential Revision: https://reviews.llvm.org/D123468	2022-04-11 12:29:25 -07:00
Nicolai Hähnle	4df4922da6	AMDGPU/SDAG: Custom SETCC (i.e. ballot) is always uniform The AMDGPUISD::SETCC node is like ISD::SETCC, but returns a lane mask instead of a per-lane boolean. The lane mask is uniform. This improves instruction selection for code patterns like ctpop(ballot(x)), which can now use an S_BCNT1_* instruction instead of V_BCNT_*. GlobalISel already selects scalar instructions (an earlier commit added a test case).. Differential Revision: https://reviews.llvm.org/D123432	2022-04-11 14:04:21 -05:00
Mogball	15a65594c1	[mlir][ods] ODS-level Attribute Optimizations This patch contains several ODS-level optimizations to attribute getters and getting. 1. OpAdaptors, when provided a DictionaryAttr, will instantiate an OperationName so that adaptor attribute getters can used cached identifiers. 2. Verifiers will take advantage of attributes stored in sorted order to get all required (non-optional, non-default valued, and non-derived) attributes in one pass over the attribute dictionary and verify that they are present. 3. ODS-generated attribute getters will use "subrange" lookup. Because the attributes are stored in sorted order and ODS knows which attributes are required, the number of required attributes less than and greater than each attribute can be computed. When searching for an attribute, the ends of the search range can be dropped. Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D122430	2022-04-11 18:34:07 +00:00
Whitney Tsang	80304c5f88	[LoopUnroll] Always respect user unroll pragma IMO when user provide unroll pragma, compiler should always respect it. It is not clear to me why loop unroll pass currently ensure that the unrolled loop size is limited by PragmaUnrollThreshold. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D119148	2022-04-11 14:33:24 -04:00
Daniel Grumberg	236b6a0eb4	[clang][extract-api] Emit "functionSignature" in SGF for ObjC methods. - Split GlobalRecord into two distinct types to be able to introduce has_function_signature type trait. - Add has_function_signature type trait. - Serialize function signatures as part of serializeAPIRecord for records that are known to have a function signature. Differential Revision: https://reviews.llvm.org/D123304	2022-04-11 19:05:24 +01:00
zijunzhao	2d0475e371	[libcxx] locale_bionic.h: skip ndk-version.h on Android platform The Android platform does not have ndk-version.h, but it will always have up-to-date libc headers, so it does not need any compatibility code intended for past versions of NDK_MAJOR. If ndk-version.h is missing, assume NDK_MAJOR is (conceptually) infinite Bug: https://buganizer.corp.google.com/issues/222341313 Test: None	2022-04-11 17:51:12 +00:00
Fraser Cormack	cab1ecf251	[TableGen][NFC] Reflow Record accessor comments	2022-04-11 18:33:26 +01:00
Fraser Cormack	74dd95face	[TableGen][NFC] Fix copy/paste error in comment	2022-04-11 18:29:29 +01:00
Nico Weber	75196b99fb	[llvm-lib] Add /WX, warn by default on empty inputs, add opt-out lib.exe by default exits successfully without writing an output file when no inputs are passed. llvm-lib has the same behavior, for compatibility. This behavior interacts poorly with build systems: If a static library target had no inputs, llvm-lib would not produce an output file, causing ninja (or make, or a similar system) to successfully run that step, but then re-run it on the next build. After this patch, llvm-lib emits a warning in this case, that with /WX can be turned into an error. That way, ninja (or make, or...) will mark the initial build as failed. People who don't like the warning can use /ignore:emptyoutput to suppress it. The warning also points out the existing flag /llvmlibempty which forces creation of an empty .lib file (this is an extension to lib.exe). Differential Revision: https://reviews.llvm.org/D123517	2022-04-11 13:15:30 -04:00
Craig Topper	76192182d0	[RISCV] Remove riscv-v-fixed-length-vector-elen-max command line option. This was added before Zve extensions were defined. I think users should use Zve32x or Zve32f now. Though we will lose support for limiting ELEN to 16 or 8, but I hope no one was using that. Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D123418	2022-04-11 10:14:48 -07:00
Craig Topper	c266e50430	[RISCV] Remove ExtZvl enum from RISCVSubtarget. NFC Having an enum with names that contain the string representation of their value doesn't add any value. We can just use the numbers. Reviewed By: kito-cheng, frasercrmck Differential Revision: https://reviews.llvm.org/D123417	2022-04-11 10:01:17 -07:00
Martin Storsjö	373d08adb4	[lldb] Silence warnings about unused static variables in RegisterInfos_arm64.h Move them to the only source file that included RegisterInfos_arm64.h that actually used these variables. This silences warnings like these: In file included from lldb/source/Plugins/Instruction/ARM64/EmulateInstructionARM64.cpp:42: lldb/source/Plugins/Process/Utility/RegisterInfos_arm64.h:790:35: warning: ‘g_register_infos_mte’ defined but not used [-Wunused-variable] 790 \| static lldb_private::RegisterInfo g_register_infos_mte[] = { \| ^~~~~~~~~~~~~~~~~~~~ lldb/source/Plugins/Process/Utility/RegisterInfos_arm64.h:787:35: warning: ‘g_register_infos_pauth’ defined but not used [-Wunused-variable] 787 \| static lldb_private::RegisterInfo g_register_infos_pauth[] = { \| ^~~~~~~~~~~~~~~~~~~~~~ Differential Revision: https://reviews.llvm.org/D123206	2022-04-11 19:50:48 +03:00
Craig Topper	28cb508195	[TargetLowering][RISCV] Allow truncation when checking if the arguments of a setcc are splats. We're just trying to canonicalize here and won't be using the constant value returned. The attached test changes are because we were previously commuting a seteq X, (splat_vector 0) because we also have (sub 0, X). The 0 is larger than the element type so we don't detect it as a splat without the AllowTruncation flag. By preventing the commute we are able to match it to the vmseq.vx instruction during isel. We only look for constants on the RHS in isel. Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D123256	2022-04-11 09:49:36 -07:00
Martin Storsjö	64e4dd329c	[libcxx] [test] Fix back-to-back use of get_temp_file_name() on Windows On non-Windows platforms, get_temp_file_name() uses `mkstemp()`, which picks a unique name and creates a file atomically. The Windows implementation uses `_mktemp_s()`, which doesn't create the file. The documentation of `_mktemp_s()` also says that by design, the function uses the same pattern within a process, as long as that file doesn't exist. Thus previously, two consecutive calls to `get_temp_file_name()` on Windows returned the same file name. Try to create the suggested temp file with `_O_EXCL` (marking the file name as already used for future calls to `_mktemp_s`) and retry if we weren't able to exclusively create the file. This fixes the test failures on Windows observed in D122257. Differential Revision: https://reviews.llvm.org/D122612	2022-04-11 19:46:40 +03:00
Louis Dionne	b9ca1e5a5a	[libc++][NFC] Use noexcept instead of _NOEXCEPT for code compiled into the library We build the library with C++20 anyways, so we can use noexcept directly.	2022-04-11 12:30:38 -04:00
Siva Chandra Reddy	0258f56646	[libc] Add a definition of pthread_attr_t and its getters and setters. Not all attributes have been added to phtread_attr_t in this patch. They will be added gradually in future patches. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D123423	2022-04-11 16:08:49 +00:00
Stephen Tozer	1c8366f9f2	[Dexter] Collate penalties of the same type into a single line for each Currently in Dexter, every step at which a DexExpectWatchValue/Type does not have the correct value is printed on a separate line. This patch reduces the size of the text output by instead printing each incorrect result (i.e. each incorrect value seen, 'Variable optimized out', and so on) on its own line, alongside a list of the steps at which that result was seen. This makes for much less spam in the output when watches are missing or wrong for many steps. Differential Revision: https://reviews.llvm.org/D120716	2022-04-11 17:01:40 +01:00
Vy Nguyen	1477964413	[lld][macho]Fix test to sort symbol table before dumping Details: The test previously expected a specific order of those symbols, which is not guaranteed (could change simply due to hashing changes, etc). So we change it to explicitly sort the symbols before checking contents. PR/53026 Differential Revision: https://reviews.llvm.org/D116813	2022-04-11 12:01:04 -04:00
Sanjay Patel	7783db55af	[InstCombine] try to fold low-mask of ashr to lshr With one-use, we handle this via demanded-bits. But We need to handle extra uses to improve issue #54750. https://alive2.llvm.org/ce/z/aDYkPv	2022-04-11 11:56:40 -04:00
Sanjay Patel	141892d481	[InstCombine] add tests for low-mask of ashr; NFC	2022-04-11 11:56:40 -04:00
Florian Hahn	1ddc719680	Revert "[LICM] Only create load in pre-header when promoting load." This reverts commit `42229b96bf`. This appears to cause crashes on multiple bots.	2022-04-11 17:37:23 +02:00
Kadir Cetinkaya	001e88ac83	[clangd] Performance improvements and cleanup - Inline SymbolID hashing to header - Don't collect references for symbols without a SymbolID - Store referenced symbols, rather than separately storing decls and macros. - Don't defer ref collection to end of translation unit - Perform const_cast when updating reference counts (~0.5% saving) - Introduce caching for getSymbolID in SymbolCollector. (~30% saving) - Don't modify symbolslab if there's no definition location - Don't lex the whole file to deduce spelled tokens, just lex the relevant piece (~8%) Overall this achieves ~38% reduction in time spent inside SymbolCollector compared to baseline (on my machine :)). I'd expect the last optimization to affect dynamic index a lot more, I was testing with clangd-indexer on clangd subfolder of LLVM. As clangd-indexer runs indexing of whole TU at once, we indeed see almost every token from every source included in the TU (hence lexing full files vs just lexing referenced tokens are almost the same), whereas during dynamic indexing we mostly index main file symbols, but we would touch the files defining/declaring those symbols, and lex complete files for nothing, rather than just the token location. The last optimization is also a functional change (added test), previously we used raw tokens from syntax::tokenize, which didn't canonicalize trigraphs/newlines in identifiers, wheres Lexer::getSpelling canonicalizes them. Differential Revision: https://reviews.llvm.org/D122894	2022-04-11 17:15:25 +02:00
LLVM GN Syncbot	5ef0ed7d5a	[gn build] Port `c292b6066c`	2022-04-11 14:58:55 +00:00
Simon Pilgrim	bbbb71ac4d	[AMDGPU] Regenerate insert_vector_dynelt.ll	2022-04-11 15:52:16 +01:00
Nikita Popov	9af8cc8d17	[SimplifyLibCalls] Remove unnecessary inbounds check Even if the GEP is not inbounds, the GEP will have provenance of the global, and accessing past the extent of the global would be undefined behavior.	2022-04-11 16:51:09 +02:00
Nikita Popov	154609b1df	[InstCombine] Add strlen of gep test without inbounds (NFC)	2022-04-11 16:51:09 +02:00
Louis Dionne	c292b6066c	[libc++] Implement P1007R3: std::assume_aligned This supersedes and incoroporates content from both D108906 and D54966, and also some original content. Co-Authored-by: Marshall Clow <mclow.lists@gmail.com> Co-Authored-by: Gonzalo Brito Gadeschi Differential Revision: https://reviews.llvm.org/D118938	2022-04-11 10:46:52 -04:00
Florian Hahn	42229b96bf	[LICM] Only create load in pre-header when promoting load. When only a store is sunk, there is no need to create a load in the pre-header, as the result of the load will never get used. The dead load can can introduce UB, if the function is marked as writeonly. Fixes #51248. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D123473	2022-04-11 16:45:18 +02:00
Louis Dionne	2c6217f252	[libc++] Make .version.pass.cpp tests be compile-only tests We don't really need to run them.	2022-04-11 10:40:04 -04:00
Groverkss	8fc0e89d0b	[MLIR][Presburger] Make PWMAFunction inheritence from space private This patch makes inheritence from PresburgerSpace for PWMAFunction private. The reasoning for this patch is to prevent implicit conversion to PresburgerSpace from PWMAFunction and to not expose all functions exposed by PresburgerSpace in PWMAFunction. Reviewed By: arjunp Differential Revision: https://reviews.llvm.org/D123076	2022-04-11 20:01:38 +05:30
gysit	973dbe20f6	[mlir][tensor] Add pattern to fold ExtractSliceOp, PadOp chains. The pattern folds chains of tensor::ExtractSliceOp, tensor::PadOp pairs if they pad different dimensions. Repeated tiling and padding of the tiled dimensions may introduce such chains. This canonicalization pattern folds these chains to a single tensor::ExtractSliceOp, tensor::PadOp pair that pads all dimensions at once, which simplifies vectorization and bufferization. Example: ```mlir %0 = tensor.extract_slice %input[16, 0] [%sz0, 64] [1, 1] : tensor<64x64xf32> to tensor<?x64xf32> %1 = tensor.pad %0 low[0, 0] high[%pw0, 0] { ... } : tensor<?x64xf32> to tensor<8x64xf32> %2 = tensor.extract_slice %1[0, 4] [8, %sz1] [1, 1] : tensor<8x64xf32> to tensor<8x?xf32> %res = tensor.pad %2 nofold low[0, 0] high[0, %pw1] { ... } : tensor<8x?xf32> to tensor<8x4xf32> ``` folds into: ```mlir %0 = tensor.extract_slice %input[16, 4] [%sz0, %sz1] [1, 1] : tensor<64x64xf32> to tensor<?x?xf32> %res = tensor.pad %0 nofold low[0, 0] high[%pw0, %pw1] { ... } : tensor<?x?xf32> to tensor<8x4xf32> ``` Reviewed By: nicolasvasilache, hanchung Differential Revision: https://reviews.llvm.org/D122722	2022-04-11 14:28:59 +00:00
Hans Wennborg	40ad667013	[dllexport] odr-use constexpr default args for constructor closures InstantiateDefaultCtorDefaultArgs() is supposed to mark default constructor args as odr-used, since those args will be used when emitting the constructor closure. However, constexpr vars were not getting odr-used since DoMarkVarDeclReferenced() defers them in MaybeODRUseExprs, and the code was calling CleanupVarDeclMarking() which discarded those uses instead of processing them. (This came up in Chromium, crbug.com/1312086) Differential revision: https://reviews.llvm.org/D123405	2022-04-11 16:23:01 +02:00
Ulrich Weigand	f018a5f6e0	[compiler-rt][SystemZ] Skip fuzzer/coverage.test This test is currently marked as XFAIL on s390x, but it is randomly passing, causing build bot issues. Setting as UNSUPPORTED for now.	2022-04-11 16:18:09 +02:00
Nikita Popov	918c5a9260	[Clang] Avoid legacy PM in some tests (NFC) Either remove legacy PM run lines or change them to use new PM.	2022-04-11 16:15:57 +02:00
Nikolas Klauser	732920d847	[libc++] Remove the usage of __init in operator+ `operator+` currently calls `__init`. This patch removes the usage of implementation details. Reviewed By: ldionne, Mordante, #libc Spies: libcxx-commits Differential Revision: https://reviews.llvm.org/D123058	2022-04-11 15:59:52 +02:00
David Spickett	55b6a3186c	[llvm][AArch64] Generate getExtensionFeatures from the list of extensions This takes the AARCH64_ARCH_EXT_NAME in AArch64TargetParser.def and uses it to generate all the "if bit is set add this feature name" code. Which gives us a bunch that we were missing. I've updated testing to include those and reordered them to match the order in the .def. The final part of the test will catch any missing extensions if we somehow manage to not generate an if block for them. This has changed the order of cc1's "-target-feature" output so I've updated some tests in clang to reflect that. Reviewed By: tmatheson Differential Revision: https://reviews.llvm.org/D123296	2022-04-11 13:42:24 +00:00
LLVM GN Syncbot	f5d6d13d72	[gn build] Port `b4ad28da19`	2022-04-11 12:47:08 +00:00
Guoxiong Li	d29f8a5f83	[Clang] Override method ModuleImportRead in MultiplexASTDeserializationListener Fixes https://llvm.org/PR54521 Differential Revision: https://reviews.llvm.org/D123452	2022-04-11 08:38:37 -04:00
Momchil Velikov	b4ad28da19	[CodeGen] Async unwind - add a pass to fix CFI information This pass inserts the necessary CFI instructions to compensate for the inconsistency of the call-frame information caused by linear (non-CGA aware) nature of the unwind tables. Unlike the `CFIInstrInserer` pass, this one almost always emits only `.cfi_remember_state`/`.cfi_restore_state`, which results in smaller unwind tables and also transparently handles custom unwind info extensions like CFA offset adjustement and save locations of SVE registers. This pass takes advantage of the constraints taht LLVM imposes on the placement of save/restore points (cf. `ShrinkWrap.cpp`): * there is a single basic block, containing the function prologue * possibly multiple epilogue blocks, where each epilogue block is complete and self-contained, i.e. CSR restore instructions (and the corresponding CFI instructions are not split across two or more blocks. * prologue and epilogue blocks are outside of any loops Thus, during execution, at the beginning and at the end of each basic block the function can be in one of two states: - "has a call frame", if the function has executed the prologue, or has not executed any epilogue - "does not have a call frame", if the function has not executed the prologue, or has executed an epilogue These properties can be computed for each basic block by a single RPO traversal. From the point of view of the unwind tables, the "has/does not have call frame" state at beginning of each block is determined by the state at the end of the previous block, in layout order. Where these states differ, we insert compensating CFI instructions, which come in two flavours: - CFI instructions, which reset the unwind table state to the initial one. This is done by a target specific hook and is expected to be trivial to implement, for example it could be: ``` .cfi_def_cfa <sp>, 0 .cfi_same_value <rN> .cfi_same_value <rN-1> ... ``` where `<rN>` are the callee-saved registers. - CFI instructions, which reset the unwind table state to the one created by the function prologue. These are the sequence: ``` .cfi_restore_state .cfi_remember_state ``` In this case we also insert a `.cfi_remember_state` after the last CFI instruction in the function prologue. Reviewed By: MaskRay, danielkiss, chill Differential Revision: https://reviews.llvm.org/D114545	2022-04-11 13:27:26 +01:00
Christian Sigg	a0e1c13cae	Remove deprecated `parseSourceFile/String()` overloads. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D123490	2022-04-11 14:11:08 +02:00

1 2 3 4 5 ...

420717 Commits All Branches Search

420717 Commits

All Branches