Currently, FPSCR is not modeled, so in some early passes (such as
early-cse) the intrinsics that read or set FPSCR may be simplified
incorrectly.
Reviewed By: jsji
Differential Revision: https://reviews.llvm.org/D112380
This expands the lookup table statically and avoids routing through methods that
contain asserts (like StringRef/std::string element accessors and drop_front)
such that performance is more predictable across compilation environments. This
was primarily driven by slow debug mode performance but has a large benefit in
release builds as well.
```
ssd_mobilenet_v2_face_float (42MB .mlir)
Debug/MSVC (old): 5.22s
Debug/MSVC (new): 0.16s
Release/MSVC (old): 0.81s
Release/MSVC (new): 0.02s
huggingface_minilm (536MB .mlir)
Debug/MSVC (old): 65.31s
Debug/MSVC (new): 2.03s
Release/MSVC (old): 9.93s
Release/MSVC (new): 0.27s
```
In debug builds the time is now split evenly between lexString, tryGetFromHex, and
element attrs hashing; the next step toward making this faster would be to combine
the work (incremental hashing during conversion, etc.), but this is at least in
the right order of magnitude and retains the original API surface.
I have not profiled a build with clang, but this is strictly less code with simpler
data structures, so I'd expect improvements there as well.
This also fixes a bug where 0xFF bytes in the input would read out of bounds.
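As a rough sketch of the general technique (hypothetical helper names, not the
actual MLIR code), the hot path can index a statically built 256-entry nibble
table with unsigned bytes, which both avoids assert-checked accessors and keeps
0xFF input bytes in bounds:
```
#include <array>
#include <cstdint>
#include <optional>
#include <string>
#include <vector>

namespace {
// Map every possible byte to its hex value, or -1 for non-hex bytes.
constexpr std::array<int8_t, 256> buildHexTable() {
  std::array<int8_t, 256> table{};
  for (auto &entry : table)
    entry = -1;
  for (int i = 0; i < 10; ++i)
    table['0' + i] = static_cast<int8_t>(i);
  for (int i = 0; i < 6; ++i) {
    table['a' + i] = static_cast<int8_t>(10 + i);
    table['A' + i] = static_cast<int8_t>(10 + i);
  }
  return table;
}
constexpr auto kHexValue = buildHexTable();
} // namespace

// Decode a hex string into raw bytes without routing through assert-bearing
// element accessors; indexing with uint8_t keeps 0xFF bytes in bounds.
std::optional<std::vector<uint8_t>> tryDecodeHex(const std::string &hex) {
  if (hex.size() % 2 != 0)
    return std::nullopt;
  std::vector<uint8_t> bytes;
  bytes.reserve(hex.size() / 2);
  for (size_t i = 0; i < hex.size(); i += 2) {
    int8_t hi = kHexValue[static_cast<uint8_t>(hex[i])];
    int8_t lo = kHexValue[static_cast<uint8_t>(hex[i + 1])];
    if (hi < 0 || lo < 0)
      return std::nullopt;
    bytes.push_back(static_cast<uint8_t>((hi << 4) | lo));
  }
  return bytes;
}
```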
Reviewed By: dblaikie, stellaraccident
Differential Revision: https://reviews.llvm.org/D112105
There is no real source location for code inside the prologue, as it is
generated by the compiler, but source locations were being added to code
inside the prologue as a side effect of https://reviews.llvm.org/D99269
because buildSpillLoadStore() uses the source location of a real
instruction in the basic block, if there is one.
Fixes: SWDEV-307590
Reviewed By: scott.linder, sebastian-ne
Differential Revision: https://reviews.llvm.org/D113100
By default, `llvm::seq` would happily iterate over enums, which may be unsafe if the enum values are not contiguous. This patch disables enum iteration with `llvm::seq` and `llvm::seq_inclusive` and adds two new functions: `enum_seq` and `enum_seq_inclusive`.
To make sure enum iteration is safe, we require users to declare their enum types as iterable by specializing `enum_iteration_traits<SomeEnum>`. Because it's not always possible to add these traits next to the enum definition (e.g., for enums defined in external libraries), we provide an escape hatch to allow iteration on a per-call-site basis by passing `force_iteration_on_noniterable_enum`.
The main benefit of this approach is that the trait-based global declarations can appear right next to enum definitions, making it easy to spot when enums are mislabeled, e.g., after introducing new enum values, whereas `force_iteration_on_noniterable_enum` should stand out and be easy to grep for.
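As a usage sketch (the example enums here are hypothetical; the functions, tag value, and traits follow the names above):
```
#include "llvm/ADT/Sequence.h"

enum class Channel { R, G, B, A };           // our own enum: annotate it as iterable
enum class LibFormat { A8, R8, RG8, RGBA8 }; // e.g., defined in an external library

namespace llvm {
// Declared right next to the Channel definition; its values are contiguous.
template <> struct enum_iteration_traits<Channel> {
  static constexpr bool is_iterable = true;
};
} // namespace llvm

void iterate() {
  // Half-open range: visits R, G, B.
  for (Channel C : llvm::enum_seq(Channel::R, Channel::A))
    (void)C;

  // Escape hatch for an enum we cannot annotate at its definition.
  for (LibFormat F : llvm::enum_seq_inclusive(
           LibFormat::A8, LibFormat::RGBA8,
           llvm::force_iteration_on_noniterable_enum))
    (void)F;
}
```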
This emerged from a discussion with gchatelet@ about reusing llvm's `Sequence.h` in lieu of https://github.com/GPUOpen-Drivers/llpc/blob/dev/lgc/interface/lgc/EnumIterator.h.
Reviewed By: dblaikie, gchatelet, aaron.ballman
Differential Revision: https://reviews.llvm.org/D107378
For each selector encountered in the source code, we need to load
selectors from the imported modules and check that we are calling a
selector with compatible types.
At the moment, for each module we are storing the methods declared in the
headers belonging to that module plus the methods from the transitive
closure of its imported modules. When a module is imported by several other
modules, methods from the shared module are duplicated in each importer. As
a result, we can end up with lots of identical methods that we try to add
to the global method pool. Doing this duplicate work is useless and
relatively expensive.
Avoid processing duplicate methods by storing in each module only its
own methods and not storing methods from dependencies. Collect methods
from dependencies by walking the graph of module dependencies.
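A minimal sketch of the idea with assumed types (not Clang's actual data
structures): each module records only its own methods, and a visited set
ensures shared dependencies are collected exactly once while walking the graph.
```
#include <unordered_set>
#include <vector>

struct Method { /* selector, parameter types, ... */ };

struct Module {
  std::vector<Method> OwnMethods; // only methods declared in this module
  std::vector<Module *> Imports;  // direct dependencies
};

// Collect methods from M and its transitive dependencies, visiting each
// module at most once so shared dependencies are not processed repeatedly.
void collectMethods(Module *M, std::unordered_set<Module *> &Visited,
                    std::vector<Method> &Out) {
  if (!Visited.insert(M).second)
    return;
  Out.insert(Out.end(), M->OwnMethods.begin(), M->OwnMethods.end());
  for (Module *Dep : M->Imports)
    collectMethods(Dep, Visited, Out);
}
```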
The issue was discovered and reported by Richard Howell. He has done the
hard work for this fix as he has investigated and provided a detailed
explanation of the performance problem.
Differential Revision: https://reviews.llvm.org/D110123
Rename str_conv_utils to str_to_integer to be more
in line with str_to_float.
Reviewed By: sivachandra, lntue
Differential Revision: https://reviews.llvm.org/D113061
Not sure these are correct. I think I missed a case when porting this from the original SCEV change to the IndVar changes. I may end up reapplying this later with a comment about how this is correct, but in case the current bad feeling turns out to be justified, I'm removing it from the tree while investigating further.
- Use formatv to print the addresses.
- Add a check for 0x0, which is treated as an invalid address.
- Use an address that's less likely to be interpreted as a real
  tagged pointer.
We already make sure to properly clear analyses for deleted functions.
This makes investigating some future potential compile time improvements easier.
Reviewed By: asbirlea
Differential Revision: https://reviews.llvm.org/D113032
Change RISCVSubtarget.hasVInstructionAnyF() to call hasVInstructionsF32
so that any changes to hasVInstructionsF32 are reflected.
The files were missed in D112496.
This is a re-commit of e2c7ee0743 which
was reverted in a2a58d91e8. This includes
a fix to consistently check for EFLAGS being live-out. See phabricator
review.
Original Summary:
This extends `optimizeCompareInstr` to re-use previous comparison
results if the previous comparison was with an immediate that was 1
bigger or smaller. Example:
CMP x, 13
...
CMP x, 12 ; can be removed if we change the SETg
SETg ... ; x > 12 changed to `SETge` (x >= 13) removing CMP
Motivation: This often happens because SelectionDAG canonicalization
tends to add/subtract 1 when optimizing for fallthrough blocks. For
example, for `x > C` the fallthrough optimization switches the true/false
blocks with `!(x > C)` --> `x <= C`, and canonicalization turns this into
`x < C + 1`.
Differential Revision: https://reviews.llvm.org/D110867
This change enables selection of bit-field extract patterns to s_bfe_u32
or v_bfe_u32, depending on the divergence of the pattern's root node.
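For illustration, the source-level shape of such a pattern is a shift-and-mask
(hypothetical code, not from the patch); if the root of the pattern is uniform
across the wave it can select to the scalar s_bfe_u32, while a divergent root
(e.g., one depending on the lane) must use the vector v_bfe_u32:
```
#include <cstdint>

// Classic unsigned bit-field extract: take `width` bits (width < 32)
// starting at bit `offset`. This shift-and-mask shape maps onto BFE.
uint32_t extractField(uint32_t x, uint32_t offset, uint32_t width) {
  return (x >> offset) & ((1u << width) - 1u);
}
```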
Reviewed By: rampitec
Differential Revision: https://reviews.llvm.org/D110950
This takes care of cleaning up the temp files on crashes. It doesn't
handle cleanup when the process is explicitly killed, though.
Differential Revision: https://reviews.llvm.org/D112710
This change looks for cases where we can prove that an exit test of a loop can be performed in a narrower bitwidth, and that by doing so we can replace a loop-varying extend with a loop-invariant truncate.
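A source-level illustration of the idea (hand-written, not pass output): when the wide bound is known to fit in the narrow type, comparing in the narrow type replaces a per-iteration extend of the IV with a single loop-invariant truncate of the bound.
```
#include <cstdint>

// Before: the exit test sign-extends the 32-bit IV on every iteration.
void before(int64_t bound, int32_t *a) {
  for (int32_t i = 0; static_cast<int64_t>(i) < bound; ++i)
    a[i] = 0;
}

// After: assuming the bound is known to fit in 32 bits, the truncate is
// hoisted out of the loop and the narrow IV's trip count is analyzable.
void after(int64_t bound, int32_t *a) {
  int32_t bound32 = static_cast<int32_t>(bound);
  for (int32_t i = 0; i < bound32; ++i)
    a[i] = 0;
}
```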
The motivation here is that doing this unblocks the trip count analysis for narrow IVs involved in extended compare exit tests. It also has the nice side effect of simply making the code faster, even if we gain no other benefit from the improved analysis ability.
I've noted a few places this could be extended, but I think this stands reasonably on its own as well.
Differential Revision: https://reviews.llvm.org/D112262
I'm not sure what the history is here, but this test passes on macOS
today. It seems like we should unify these tests if they need to run
cross-platform.
Reviewed By: #lld-macho, int3
Differential Revision: https://reviews.llvm.org/D113085
The main benefits of this change are faster access to operands
(no need to compute the offset, as the operand storage is now right
after the operation) and simpler code (no need to manage a lot of the
"is the operand storage trailing" logic we had before). The major
downside, though, is that operand-holding operations now grow in size
by one word (no matter how we make this change, there will need to be
some additional bookkeeping).
Differential Revision: https://reviews.llvm.org/D111695
On our large iOS project this took a link from 1 minute 45 seconds to 45
seconds. For reference, ld64 does the same link in ~20 seconds.
Reviewed By: #lld-macho, int3
Differential Revision: https://reviews.llvm.org/D113063
A quick grep for NDEBUG in MLIR revealed a use in DebugActions.h that breaks ABI. This patch changes the use of NDEBUG to LLVM_ENABLE_ABI_BREAKING_CHECKS, which has the advantage of being independent of whether clients build their own app in debug or release mode, as it depends purely on how MLIR itself was built.
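A sketch of the pattern with a hypothetical class and member (not the actual DebugActions.h contents):
```
#include "llvm/Config/abi-breaking.h"

class SomeManager {
#if LLVM_ENABLE_ABI_BREAKING_CHECKS
  // Present iff LLVM/MLIR itself was configured with ABI-breaking checks,
  // so a client built with a different NDEBUG setting still sees the same
  // object layout. Guarding on NDEBUG instead would tie the layout to the
  // client's own debug/release mode and break ABI.
  bool extraValidationEnabled = false;
#endif
};
```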
Differential Revision: https://reviews.llvm.org/D113088
Read each pointer in the argv and envp arrays before dereferencing
it; this correctly marks an error when these pointers point into
memory that has been freed.
Differential Revision: https://reviews.llvm.org/D113046
This patch documents the llvm/utils/update_* Python scripts that are used to
generate assertions in many of the LLVM regression test cases.
Reviewed By: MaskRay
Differential Revision: https://reviews.llvm.org/D112936
The check for whether a zero extension was needed was subtly wrong: it
saw a value that was already 64 bits wide and therefore did not extend.
Fixes PR52357.
Reviewed By: RKSimon
Differential Revision: https://reviews.llvm.org/D112860