llvm-project

Commit Graph

Author	SHA1	Message	Date
Zakk Chen	07c3854a75	[RISCV][Clang] Add RVV merge intrinsic functions. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Zakk Chen <zakk.chen@sifive.com> Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D99963	2021-04-11 19:19:01 -07:00
Zakk Chen	01fa222b6d	[RISCV][Clang] Add RVV Type-Convert intrinsic functions. Fix extension macro condition. Support below instructions: 1. Single-Width Floating-Point/Integer Type-Convert Instructions 2. Widening Floating-Point/Integer Type-Convert Instructions 3. Narrowing Floating-Point/Integer Type-Convert Instructions Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D99742	2021-04-11 19:19:01 -07:00
Zakk Chen	5f7739b60e	[RISCV][Clang] Add some RVV Floating-Point intrinsic functions. Support vfclass, vfmerge, vfrec7, vfrsqrt7, vfsqrt instructions. Reviewed By: craig.topper Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Zakk Chen <zakk.chen@sifive.com> Differential Revision: https://reviews.llvm.org/D99741	2021-04-11 19:19:01 -07:00
Zakk Chen	98a3ff9d05	[RISCV][Clang] Add more RVV Floating-Point intrinsic functions. Support below instructions. 1. Vector Widening Floating-Point Add/Subtract Instructions 2. Vector Widening Floating-Point Multiply 3. Vector Single-Width Floating-Point Fused Multiply-Add Instructions 4. Vector Widening Floating-Point Fused Multiply-Add Instructions 5. Vector Floating-Point Compare Instructions Reviewed By: craig.topper, HsiangKai Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Zakk Chen <zakk.chen@sifive.com> Differential Revision: https://reviews.llvm.org/D99669	2021-04-11 19:19:01 -07:00
Zakk Chen	007ea0e736	[RISCV][Clang] Add some RVV Floating-Point intrinsic functions. Support the following instructions which have the same class. 1. Vector Single-Width Floating-Point Subtract Instructions 2. Vector Single-Width Floating-Point Multiply/Divide Instructions 3. Vector Floating-Point MIN/MAX Instructions 4. Vector Floating-Point Sign-Injection Instructions Reviewed By: craig.topper Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Zakk Chen <zakk.chen@sifive.com> Differential Revision: https://reviews.llvm.org/D99668	2021-04-11 19:19:01 -07:00
Zakk Chen	ccc624bfd4	[RISCV][Clang] Add RVV Widening Integer Add/Subtract intrinsic functions. Reviewed By: craig.topper Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Zakk Chen <zakk.chen@sifive.com> Differential Revision: https://reviews.llvm.org/D99526	2021-04-11 19:19:01 -07:00
Jim Lin	a3bfddbb6a	[RISCV][NFC] Remove unneeded explict XLenVT type on codegen patterns Customized SDNode has been specified the explict XLenVT type. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D100190	2021-04-12 10:16:06 +08:00
Craig Topper	cb4c793e46	[RISCV] Update computeKnownBitsForTargetNode to treat READ_VLENB as being 16 byte aligned. According to the 0.10 spec, VLEN is at least 128 bits and is a power of 2.	2021-04-11 17:54:23 -07:00
Craig Topper	ff902080a9	[RISCV] Use SLLI/SRLI instead of SLLIW/SRLIW for (srl (and X, 0xffff), C) custom isel on RV64. We don't need the sign extending behavior here and SLLI/SRLI are able to compress to C.SLLI/C.SRLI.	2021-04-11 13:59:51 -07:00
Roman Lebedev	8fc8c745cf	[NFCI][SimplifyCFG] PerformValueComparisonIntoPredecessorFolding(): improve Dominator Tree updating Same as with previous patches.	2021-04-11 23:56:23 +03:00
Roman Lebedev	13fca9d816	[NFCI][SimplifyCFG] mergeEmptyReturnBlocks(): improve Dominator Tree updating Same as with previous patches.	2021-04-11 23:56:23 +03:00
Roman Lebedev	0699da1569	[NFCI][Local] MergeBasicBlockIntoOnlyPred(): improve Dominator Tree updating Same as with TryToSimplifyUncondBranchFromEmptyBlock()/MergeBlockIntoPredecessor() patch.	2021-04-11 23:56:23 +03:00
Roman Lebedev	e5692a564a	[NFCI][BasicBlockUtils] MergeBlockIntoPredecessor(): improve Dominator Tree updating Same as with TryToSimplifyUncondBranchFromEmptyBlock() patch.	2021-04-11 23:56:23 +03:00
Roman Lebedev	2def9c3d8e	[NFCI][Local] TryToSimplifyUncondBranchFromEmptyBlock(): improve Dominator Tree updating First, we don't need vector-ness for the predecessor lists. Secondly, like elsewhere, do insertions before deletions. Lastly, the check that we actually need to insert an edge, that it doesn't exist already, is backwards. Instead of looking at successors of every single 'PredOfBB', just always look at predecessors of the 'Succ'. The result is always the same, but we avoid really inefficient code.	2021-04-11 23:56:22 +03:00
Roman Lebedev	6d44b3c56d	[NFCI][DomTreeUpdater] applyUpdates(): reserve space for updates first While, indeed, we may end up pushing less updates that we'd reserve space for, self-dominating updates aren't often enough for that to matter. But this should matter for normal updates.	2021-04-11 23:56:22 +03:00
Florian Hahn	816cf41462	[LoopUnroll] Add AArch64 test case with large vector ops. Add test case to illustrate over-eager unrolling on AArch64, due to the cost-model not estimating the size of vector loads/stores accurately.	2021-04-11 21:39:52 +01:00
Florian Hahn	3219d981d4	[VectorCombine] Add tests for load/extract scalarization. Add tests where scalarizing a vector load + extract is profitable.	2021-04-11 21:39:48 +01:00
Simon Pilgrim	231b87618b	[X86][AVX512] Fold not(kmov(x)) -> kmov(not(x)) and not(widen_subvector(x)) -> widen_subvector(not(x)) Improve AVX512 mask inversion, rG38c799bce801 exposed some missing opportunities to move scalar not() back onto the boolvector types for folding with setcc etc.	2021-04-11 20:07:09 +01:00
Thomas Lively	ea8dd3ee2e	[WebAssembly] Update v128.any_true In the final SIMD spec, there is only a single v128.any_true instruction, rather than one for each lane interpretation because the semantics do not depend on the lane interpretation. Differential Revision: https://reviews.llvm.org/D100241	2021-04-11 11:13:16 -07:00
Simon Pilgrim	13bdac5709	[X86] combineXor - Pull out repeated getOperand() calls. NFCI.	2021-04-11 19:01:59 +01:00
Simon Pilgrim	38c799bce8	[X86] Fold cmpeq/ne(and(X,Y),Y) --> cmpeq/ne(and(~X,Y),0) Followup to D100177, handle an similar (demorgan inverse style) case from PR47797 as well The AVX512 test cases could be further improved if we folded not(iX bitcast(vXi1)) -> (iX bitcast(not(vXi1))) Alive2: https://alive2.llvm.org/ce/z/AnA_-W	2021-04-11 18:42:01 +01:00
Craig Topper	3ae71226ef	[RISCV] Drop earlyclobber constraint from vwadd(u).wx, vwsub(u).wx, vfwadd.wf and vfwsub.wf. The first source has the same EEW as the destination and the other source is a scalar so the overlap constraints don't apply to the unmasked version. For the masked version we have a constraint that the destination can't be V0 so that covers the only overlap issue there. Reviewed By: khchen Differential Revision: https://reviews.llvm.org/D100217	2021-04-11 10:19:45 -07:00
Craig Topper	bc0e052730	[RISCV] Teach targetShrinkDemandedConstant to preserve (and X, 0xffff) when zext.h is supported. Similar to what we do for zext.w. Disable the (srl (and X, 0xffff), C) custom isel when zext.h is available.	2021-04-11 10:03:35 -07:00
Craig Topper	48d69edade	[RISCV] Add i8 and i16 srli and srai tests to Zbb/Zbp test files. NFC These require the input to be zero or sign extended. If we have sext.b, sext.h or zext.h instructions we can use them. Otherwise we need to use a pair of shifts to accomplish the zero/sign extend and the final shift. We don't currently use zext.h when it is available.	2021-04-11 10:00:38 -07:00
Roman Lebedev	91248e2db9	[InstCombine] Improve "get low bit mask upto and including bit X" pattern https://alive2.llvm.org/ce/z/3u-48R	2021-04-11 18:08:08 +03:00
Roman Lebedev	07c13b3b5f	[NFC][InstCombine] Add tests for "get low bit mask upto and including bit X" pattern	2021-04-11 18:08:08 +03:00
Roman Lebedev	a36bb7fd76	[InstCombine] (X \| Op01C) + Op1C --> X + (Op01C + Op1C) iff the or is actually an add https://alive2.llvm.org/ce/z/Coc5yf	2021-04-11 18:08:08 +03:00
Roman Lebedev	811167b1f6	[NFC][InstCombine] Add a few test of adding to add-like or	2021-04-11 18:08:08 +03:00
Roman Lebedev	d1ebdbff12	[NFC][LoopVectorize] Autogenerate interleaved-accesses.ll	2021-04-11 18:08:08 +03:00
Roman Lebedev	005881e96e	[LoopIdiom] left-shift-until-bittest: set all allowed no-wrap flags on add/sub I've checked each one of these with alive2, and this is both correct and precise.	2021-04-11 18:08:07 +03:00
Roman Lebedev	0ac1920d03	[NFC][LoopIdiom] left-shift-until-bittest: add small-bitwidth tests	2021-04-11 18:08:07 +03:00
Roman Lebedev	ee6a17eb9f	[NFC][LoopIdiom] Regenerate left-shift-until-bittest.ll	2021-04-11 18:08:07 +03:00
Mark de Wever	ae103003b2	[libc++] [CI] Validate the output of the generated scripts. This adds a CI job validating that the output of utils/generate_feature_test_macro_components.py, libcxx/utils/generate_header_inclusion_tests.py, and utils/generate_header_tests.py are up to date. The validation method has been copied from the Format job. Differential Revision: https://reviews.llvm.org/D99862	2021-04-11 15:35:23 +02:00
Zhang Qing Shan	37388b8ad1	Update personal info in CREDITS.TXT	2021-04-11 19:25:02 +08:00
Sushma Unnibhavi	002c6c1187	Typo fix Reviewed By: dsanders Differential Revision: https://reviews.llvm.org/D100254	2021-04-11 12:24:27 +05:30
Sushma Unnibhavi	e8b0542078	Missing syntax highlighting for LLVM IR in Langref Added syntax highlighting Differential Revision: https://reviews.llvm.org/D100125	2021-04-11 12:19:58 +05:30
Arthur Eubanks	c88b87f9ce	Revert "Remove "Rewrite Symbols" from codegen pipeline" This reverts commit `6210261ecb`. addr-label.ll crashes on armv7.	2021-04-10 23:28:16 -07:00
Arthur Eubanks	6210261ecb	Remove "Rewrite Symbols" from codegen pipeline It breaks up the function pass manager in the codegen pipeline. With empty parameters, it looks at the -mllvm flag -rewrite-map-file. This is likely not in use. Add a check that we only have one function pass manager in the codegen pipeline. This required reverting commit 9583a3f2625818b78c0cf6d473cdedb9f23ad82c: "[AsmPrinter] Delete dead takeDeletedSymbsForFunction()". This was not NFC as initially thought. By coalescing two function psas managers, this exposed the reverted code as necessary. addr-label.ll was crashing due to an emitted blockaddress's block being removed but the label not emitted. Some tests relied on the fact that we had a module pass somewhere in the codegen pipeline. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D99707	2021-04-10 22:38:44 -07:00
patacca	82fbc5d45b	[Polly] Partially refactoring of IslAstInfo and IslNodeBuilder to use isl++. NFC. Polly use algorithms from the Integer Set Library (isl), which is a library written in C and which is incompatible with the rest of the LLVM as it is written in C++. Changes made: - Refactoring the following methods of class IslAstInfo - isParallel() isExecutedInParallel() isReductionParallel() getSchedule() getMinimalDependenceDistance() getBrokenReductions() - Refactoring the following methods of class IslNodeBuilder - getReferencesInSubtree() getScheduleForAstNode() - Refactoring function getBrokenReductionsStr() - Fixed the mismatching function declaration for getScheduleForAstNode() Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D99971	2021-04-10 21:28:02 -05:00
Roman Lebedev	9829f5e6b1	[CVP] @llvm.[us]{min,max}() intrinsics handling If we can tell that either one of the arguments is taken, bypass the intrinsic. Notably, we are indeed fine with non-strict predicate: * UL: https://alive2.llvm.org/ce/z/69qVW9 https://alive2.llvm.org/ce/z/kNFTKf https://alive2.llvm.org/ce/z/AvaPw2 https://alive2.llvm.org/ce/z/oxo53i * UG: https://alive2.llvm.org/ce/z/wxHeGH https://alive2.llvm.org/ce/z/Lf76qx * SL: https://alive2.llvm.org/ce/z/hkeTGS https://alive2.llvm.org/ce/z/eR_b-W * SG: https://alive2.llvm.org/ce/z/wEqRm7 https://alive2.llvm.org/ce/z/FpAsVr Much like with all other comparison handling in CVP, while we could sort-of handle two Value's, at least for plain ICmpInst it does not appear to be worthwhile. This only fires 78 times on test-suite + dt + rs, but we don't canonicalize to these yet. (only SCEV produces them)	2021-04-11 00:33:47 +03:00
Roman Lebedev	d610f05104	[NFC][CVP] Add tests for @llvm.[us]{min,max}() intrinsics	2021-04-11 00:33:47 +03:00
Nikita Popov	8de2f1ff79	[IVUsers] Check LoopSimplify cache earlier (NFC) Check the cache before calling isLoopSimplifyForm(). Otherwise we'd always perform the check for the innermost loop and only skip it for dominating loops.	2021-04-10 22:58:13 +02:00
Wenlei He	00ef28ef21	[CSSPGO] Fix dangling context strings and improve profile order consistency and error handling This patch fixed the following issues along side with some refactoring: 1. Fix bugs where StringRef for context string out live the underlying std::string. We now keep string table in profile generator to hold std::strings. We also do the same for bracketed context strings in profile writer. 2. Make sure profile output strictly follow (total sample, name) order. Previously, there's inconsistency between ProfileMap's key and FunctionSamples's name, leading to inconsistent ordering. This is now fixed by introducing context profile canonicalization. Assertions are also added to make sure ProfileMap's key and FunctionSamples's name are always consistent. 3. Enhanced error handling for profile writing to make sure we bubble up errors properly for both llvm-profgen and llvm-profdata when string table is not populated correctly for extended binary profile. 4. Keep all internal context representation bracket free. This avoids creating new strings for context trimming, merging and preinline. getNameWithContext API is now simplied accordingly. 5. Factor out the code for context trimming and merging into SampleContextTrimmer in SampleProf.cpp. This enables llvm-profdata to use the trimmer when merging profiles. Changes in llvm-profgen will be in separate patch. Differential Revision: https://reviews.llvm.org/D100090	2021-04-10 12:39:10 -07:00
Roman Lebedev	f041757e9c	[NFC][JumpThreading] Increment 'NumFolds' statistic all places terminator becomes uncond	2021-04-10 21:24:29 +03:00
Roman Lebedev	a407738def	[NFC][CVP] Add statistic for function pointer argument non-null-ness deduction	2021-04-10 21:23:20 +03:00
Roman Lebedev	fe7b3ad8d5	[CVP] LVI: Use in-block values when checking value signedness domain This has a huge positive impact on all the folds that use these helpers, as it can be seen on vanilla test-suite + rawspeed + darktable: correlated-value-propagation.NumSRems +75.68% (+ 28) correlated-value-propagation.NumAShrs +63.87% (+198) correlated-value-propagation.NumSDivs +49.42% (+127) correlated-value-propagation.NumSExt + 8.85% (+593) correlated-value-propagation.NumUDivURemsNarrowed + 8.65% (+34) ... while having pretty minimal compile-time impact: https://llvm-compile-time-tracker.com/compare.php?from=e8c7f43e2c2c6f3581ec1c6489ec21ad9f98958a&to=4cd197711e58ee1b2faeee0c35eea54540185569&stat=instructions	2021-04-10 21:10:59 +03:00
Roman Lebedev	257eda0794	[NFC][LVI] getPredicateAt(): drop default value for UseBlockValue The default is likely wrong. Out of all the callees, only a single one needs to pass-in false (JumpThread), everything else either already passes true, or should pass true. Until the default is flipped, at least make it harder to unintentionally add new callees with UseBlockValue=false.	2021-04-10 20:46:01 +03:00
Roman Lebedev	03225969e3	[NFC] Rename LimitingIntrinsic into MinMaxIntrinsic As requested in post-commit review	2021-04-10 20:46:01 +03:00
peter klausler	8f16101c70	[flang] Accept & fold IEEE_SELECTED_REAL_KIND F18 supports the standard intrinsic function SELECTED_REAL_KIND but not its synonym in the standard module IEEE_ARITHMETIC named IEEE_SELECTED_REAL_KIND until this patch. Differential Revision: https://reviews.llvm.org/D100066	2021-04-10 10:01:14 -07:00
Whisperity	8fa3975247	[libtooling][clang-tidy] Fix off-by-one rendering issue with SourceRanges There was an off-by-one issue with calculating the exact end location of token ranges (as given by SomeDecl->getSourceRange()) which resulted in: xxx(something) ^~~~~~~~ // Note the missing ~ under the last character. In addition, a test is added to keep the behaviour in check in the future. This patch hotfixes commit `3b677b81ce`.	2021-04-10 18:52:55 +02:00

1 2 3 4 5 ...

385195 Commits All Branches Search

385195 Commits

All Branches