llvm-project

Commit Graph

Author	SHA1	Message	Date
Craig Topper	4eb59f0179	[SelectionDAG][RISCV] Make RegsForValue::getCopyToRegs explicitly zero_extend constants. ComputePHILiveOutRegInfo assumes that constant incoming values to Phis will be zero extended if they aren't a legal type. To guarantee that we should zero_extend rather than any_extend constants. This fixes a bug for RISCV where any_extend of constants can be treated as a sign_extend. Differential Revision: https://reviews.llvm.org/D122053	2022-03-19 18:43:14 -07:00
Craig Topper	268371cf7b	[RISCV] Add test case for miscompile caused by treating ANY_EXTEND of constants as SIGN_EXTEND. The code that inserts AssertZExt based on predecessor information assumes constants are zero extended for phi incoming values this allows AssertZExt to be created in blocks consuming a Phi. SelectionDAG::getNode treats any_extend of i32 constants as sext for RISCV. The code that creates phi incoming values in the predecessors creates an any_extend for the constants which then gets treated as a sext by getNode. This makes the AssertZExt incorrect and can cause zexts to be incorrectly removed. This bug was introduced by D105918 Differential Revision: https://reviews.llvm.org/D122052	2022-03-19 18:43:14 -07:00
Mohammed Nurul Hoque	7afa44f5f5	[RISCV] Add more sign-extending ops to MIR sext.w pass. This patch adds single-bit and bit-counting ops to list of sign-extending ops. A single-bit write propagates sign-extendedness if it's not in the sign-bits. Bit extraction and bit counting always outputs a small number, so sign-extended. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D121152	2022-03-18 18:21:17 +08:00
Craig Topper	bbd2ecf9f0	[RISCV] Add +experimental-zvfh extension to cover half types in vectors. Currently we allow half types in vectors if the scalar Zfh extension is enabled. This behavior is not inline with the vector spec. For f32 and f64 types, the Zve32f, Zve64f, Zve64d, and V explicitly control the availablity of floating point types in vectors. In order to make our compiler compliant, we either need to remove all support for half in vectors or we need an extension to control it. Draft spec here https://github.com/riscv/riscv-v-spec/pull/780 Reviewed By: kito-cheng Differential Revision: https://reviews.llvm.org/D121345	2022-03-17 10:04:02 -07:00
Lian Wang	214afc7116	[RISCV] Add patterns for vnsrl.wi and vnsra.wi instructions Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D121675	2022-03-17 07:22:32 +00:00
Craig Topper	74cf8575f7	[RISCV] Remove stale FIXME from a test. NFC	2022-03-16 14:55:11 -07:00
Craig Topper	2e10671ec7	[RISCV] Improve detection of when to skip (and (srl x, c2) c1) -> (srli (slli x, c3-c2), c3) isel. We have a special case to skip this transform if c1 is 0xffffffff and x is sext_inreg in order to use sraiw+zext.w. But we were only checking that we have a sext_inreg opcode, not how many bits are being sign extended. This commit adds a check that it is a sext_inreg from i32 so we know for sure that an sraiw can be created.	2022-03-16 14:54:34 -07:00
Jessica Clarke	659363c0cc	[RISCV] Ensure PseudoLA* can be hoisted Since we mark the pseudos as mayLoad but do not provide any MMOs, isSafeToMove conservatively returns false, stopping MachineLICM from hoisting the instructions. PseudoLA_TLS_GD does not actually expand to a load, so stop marking that as mayLoad to allow it to be hoisted, and for the others make sure to add MMOs during lowering to indicate they're GOT loads and thus can be freely moved. Fixes https://github.com/llvm/llvm-project/issues/54372 Reviewed By: MaskRay, arichardson Differential Revision: https://reviews.llvm.org/D121654	2022-03-16 18:45:36 +00:00
Jessica Clarke	883f755639	[NFC][RISCV] Pre-commit tests for hoisting of PseudoLLA/PseudoLA* Only PseudoLLA is currently hoisted; this will be fixed in a subsequent commit.	2022-03-16 18:45:19 +00:00
Haocong.Lu	6a54776fe0	[RISCV] Select SRLI+SLLI for AND with leading ones mask Select SRLI+SLLI for and i64 %x, imm if the imm is a leading ones mask. It's useful in RV64 when the mask exceeds simm32 (cannot be generated by LUI). Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D121598	2022-03-16 02:10:57 +00:00
Craig Topper	1bf4bbc492	[LegalizeTypes][RISCV][WebAssembly] Expand ABS in PromoteIntRes_ABS if it will expand to sra+xor+sub later. If we promote the ABS and then Expand in LegalizeDAG, then both the sra and the xor will have their inputs sign extended. This generates extra code on RISCV which lacks an i8 or i16 sign extend instructon. If we expand during type legalization, then only the sra will get its input sign extended. RISCV is able to combine this with the sra by doing a shift left followed by an sra. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D121664	2022-03-15 08:27:39 -07:00
Craig Topper	ad94dfb9a0	[DAGCombiner][RISCV] Adjust (aext (and (trunc x), cst)) -> (and x, cst) to sext cst based on target preference RISCV strong prefers i32 values be sign extended to i64. This combine was always zero extending the constant using APInt methods. This adjusts the code so that it calls getNode using ISD::ANY_EXTEND instead. getNode will call TLI.isSExtCheaperThanZExt to decide how to handle the constant. Tests were copied from D121598 where I noticed that we were creating constants that were hard to materialize. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D121650	2022-03-15 08:26:47 -07:00
Fraser Cormack	a44aeab526	[RISCV] Add MIR tests exposing missed InstAliases The InstAlias framework cannot match registers against zero_reg, which RVV uses to encode unmasked operations. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D92228	2022-03-14 17:53:07 +00:00
Lehua Ding	1648852c98	[RISCV][RVV] Fix vslide1up/down intrinsics overflow bug for SEW=64 on RV32 Reviewed By: craig.topper, kito-cheng Differential Revision: https://reviews.llvm.org/D120899	2022-03-13 18:06:09 +08:00
Craig Topper	fd4d584d6b	[RISCV] Add DAGCombine to fold (bitreverse (bswap X)) to brev8 with Zbkb. If the type is less than XLenVT, type legalization will turn this into (srl (bitreverse (bswap (srl (bswap X), C))), C). We can't completely recover from these shifts. They introduce zeros into the upper bits of the result and we can't easily tell if they are needed. By doing a DAG combine early, we avoid introducing these shifts.	2022-03-12 16:39:39 -08:00
Craig Topper	b55a77d222	[RISCV] Add Zbp command lines to bswap-bitreverse.ll. NFC	2022-03-12 16:23:42 -08:00
Craig Topper	43f668b98e	[RISCV] Move GORCIW/GREVIW formation to isel patterns. Type legalize narrow RISCVISD::GREV/GORC with constant to a larger type without switching to W. Detect sext_inreg+gorci/grevi with a uimm5 immediate during isel to emit GREVIW/GORCIW. This allows us to better propagate known bits information through extended bits after type legalization. It will also simplify a change I'm considering for BREV8 with Zbkb. A future patch will add computeKnownBits support for GORC. A further improvement here would be to use hasAllWUsers and doPeepholeSExtW like we do for SLLIW, but I don't think we have the test coverage for that yet.	2022-03-11 18:02:47 -08:00
Craig Topper	fa62c5326a	[RISCV] Add test cases that show that we're too aggressive about using greviw/gorciw. NFC We currently type legalize to the W form, but type legalization doesn't place any requirements on the extended bits. So we are ok to use GREVI/GORCI for type legalization as long as the control doesn't cross any bits from the extended bits into the lower bits. This can allow us to recognize cases where the extended bits end up being all zeros and we can propagate that information through. My plan is to move greviw/gorciw formation to isel patterns similar to slliw.	2022-03-11 18:02:38 -08:00
Craig Topper	d0969e485c	[RISCV] Optimize vfmv.s.f intrinsic with scalar 0.0 to vmv.s.x with x0. We already do this for RISCVISD::VFMV_S_F_VL and the vfmv.v.f intrinsic. Reviewed By: kito-cheng Differential Revision: https://reviews.llvm.org/D121429	2022-03-11 10:05:43 -08:00
Craig Topper	e0e8edf823	[RISCV] Add isel patterns for masked RISCVISD::FMA_VL with RISCVISD::FNEG_VL. This helps us form vfnmsub, vfnmadd, and vfmusb from masked VP intrinsics. I've used "srcvalue" for the mask parameter in the fneg nodes. We can't match "V0" because that doesn't ensure the mask the is the same. Instead it matches two different nodes and generates two copies to V0 of those separate values. Reviewed By: rogfer01 Differential Revision: https://reviews.llvm.org/D120287	2022-03-10 10:05:42 -08:00
Luke	0803dba7dd	[RISCV] Add fixed-length vector instrinsics for segment load Inspired by reviews.llvm.org/D107790. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D119834	2022-03-10 16:23:40 +08:00
Craig Topper	7cd78da8da	[RISCV] Add tests showing the optimization pipeline for O0 and O3. Other targets like ARM, AArch64, and X86 have similar tests. Reviewed By: asb Differential Revision: https://reviews.llvm.org/D120840	2022-03-09 21:42:09 -08:00
Craig Topper	f7a63bca17	[RISCV] Switch undef -> poison in fixed-vector RVV tests	2022-03-09 13:38:36 -08:00
Craig Topper	29511ec7da	[LegalizeTypes][VP] Add widening and splitting support for VP_FMA. Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D120854	2022-03-08 09:59:59 -08:00
Craig Topper	c392b9924e	[LegalizeTypes][VP] Add splitting and widening support for VP_FNEG. Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D120785	2022-03-08 09:59:34 -08:00
eopXD	550b2eaaa6	[RISCV] Add combination crypto extensions in ISAInfo The crypto extension have several shorthand extensions that don't consist of any extra instructions. Take `zk` for example, while the extension would imply `zkn, zkr, zkt`. The 3 extensions should also combine back into `zk` to maintain the canonical order in isa strings. This patch addresses the above. Reviewed By: VincentWu Differential Revision: https://reviews.llvm.org/D119530	2022-03-08 09:52:38 -08:00
jacquesguan	e55b9b0d0a	[RISCV] Add patterns for vector widening floating-point reduction instructions. Add patterns for vector widening floating-point reduction instructions. Differential Revision: https://reviews.llvm.org/D120390	2022-03-08 10:53:49 +08:00
Zakk Chen	3be907621f	[RISCV] Fix incorrect optimization for masked vmsgeu.vi with 0 immediate. vmsgeu.vi with 0 is always true, but in the masked with mask undisturbed policy, we still need to keep inactive elelemt which come from maskedoff. We could return mask directly if it's mask agnostic policy in the future. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D121080	2022-03-06 19:22:35 -08:00
Craig Topper	bd5f124716	[RISCV] Add SimplifyDemandedBits support for FSR/FSL/FSRW/FSLW.	2022-03-05 21:26:51 -08:00
Zakk Chen	33b61c5678	[RISCV] Fix incorrect codegen introduced by D119688. We should not emit a tail agnostic vlse for a tail undisturbed vmv.s.x In D119688: - if (IsScalarMove && !Node->getOperand(0).isUndef()) + bool HasPassthruOperand = Node->getOpcode() != ISD::SPLAT_VECTOR; + if (HasPassthruOperand && !IsScalarMove && !Node->getOperand(0).isUndef()) break; The IsScalarMove check in the if statement had been changed. Differential Revision: https://reviews.llvm.org/D120963	2022-03-05 06:10:26 -08:00
Zakk Chen	3de970718c	[RISCV][NFC] Precommit test cases for D120963.	2022-03-05 06:10:25 -08:00
Craig Topper	1e569e3b7b	[RISCV] Add CMOV isel pattern for (select (setgt X, -1), Y, Z) setgt X, -1 is the canonical form of setge X, 0. We can swap the select operands and use setlt X, X0 when selecting CMOV. This avoid materializing the -1 in a register.	2022-03-04 22:35:13 -08:00
Craig Topper	0b75b39a70	[RISCV] Merge more rv32/rv64 vector intrinsic tests that contain the same content. Use sed to convert iXLen to i32/i64 before running the test.	2022-03-04 16:34:59 -08:00
Craig Topper	3d4e83f17d	[RISCV] With Zbb, fold (sext_inreg (abs X)) -> (max X, (negw X)) With Zbb, abs is expanded to (max X, neg) by default. If X has 33 or more sign bits, we can expand it a little early using negw instead of neg to save a sext_inreg. If X started as a 32 bit value, type legalization would have inserted a sext before the abs so X having 33 sign bits should always be true. Note: I've used ISD::FREEZE here since we increase the number of uses. Our default expansion for ABS doesn't do that, but I think that's a bug. We can't do this with custom type legalization because ISD::FREEZE doesn't propagate sign bits so later DAG combine won't expand be able to see optmize it. Alives2 https://alive2.llvm.org/ce/z/Gx3RNe Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D120597	2022-03-03 15:42:29 -08:00
jacquesguan	44a430354d	[RISCV] Fold store of vmv.f.s to a vse with VL=1. This patch support the FP part of D109482. Differential Revision: https://reviews.llvm.org/D120235	2022-03-03 16:35:19 +08:00
Craig Topper	6cb42cd666	[RISCV] More correctly ignore Zfinx register classes in getRegForInlineAsmConstraint. Until Zfinx is supported in CodeGen we need to convert all Zfinx register classes to GPR. Remove the zfinx-types.ll test which didn't test anything meaningful since -mattr=zfinx isn't implemented completely in llc. Follow up to D93298.	2022-03-02 11:22:46 -08:00
Craig Topper	ab7a7cc1dd	Revert "[LegalizeTypes][VP] Add splitting and widening support for VP_FNEG." This reverts commit `ac93f95861`. Committed by accident.	2022-03-02 10:00:22 -08:00
Craig Topper	324c0a7206	[SelectionDAG][RISCV] Emit a canonical sign bit test from ExpandIntRes_ABS. Instead of emitting 0 > Hi, emit Hi < 0. If Hi needs to be expanded again this will allow the special case for sign bit tests in ExpandIntOp_SETCC to trigger. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D120761	2022-03-02 09:47:26 -08:00
Craig Topper	a1f8349d77	[RISCV] Don't combine ROTR ((GREV x, 24), 16)->(GREV x, 8) on RV64. This miscompile was introduced in D119527. This was a special pattern for rotate+bswap on RV32. It doesn't work for RV64 since the rotate needs to be half the bitwidth. The equivalent pattern for RV64 is ROTR ((GREV x, 56), 32) so match that instead. This could be generalized further as noted in the new FIXME. Reviewed By: Chenbing.Zheng Differential Revision: https://reviews.llvm.org/D120686	2022-03-02 09:47:06 -08:00
Craig Topper	ac93f95861	[LegalizeTypes][VP] Add splitting and widening support for VP_FNEG. Differential Revision: https://reviews.llvm.org/D120785	2022-03-02 09:47:05 -08:00
Shao-Ce SUN	0e38b29543	[RISCV] add the MC layer support of Zfinx extension This patch added the MC layer support of Zfinx extension. Authored-by: StephenFan Co-Authored-by: Shao-Ce Sun Reviewed By: asb Differential Revision: https://reviews.llvm.org/D93298	2022-03-02 14:25:19 +08:00
Craig Topper	0853ed2b52	[RISCV] Remove accidental negate from recently added i64 abs test. NFC I copied the tests from neg-abs.ll and thought I removed all the negations.	2022-03-01 15:07:51 -08:00
Craig Topper	626ecef1fc	[RISCV] Add more test case for absolute value. NFC This adds tests for i8 through i128 with intrinsic and select forms. Covering rv32 and rv64 with the base ISA, Zbb, and Zbt. Some Zbb tests already covered part of this, but not all. FIXMEs have been added for some obviously suboptimal codegen.	2022-03-01 12:02:44 -08:00
Craig Topper	1f4bb9c69f	[RISCV] Fix the indentation of 'ret' in rvzb-intrinsic.ll tests. NFC Many of these test cases had a single space before 'ret' while every other instruction had two space indentation. I did not audit any other tests for this problem.	2022-03-01 11:37:49 -08:00
Craig Topper	b9d6e8c441	[RISCV] Lower VECTOR_SPLICE to RVV instructions. This lowers VECTOR_SPLICE of scalable vectors to a slidedown follow by a slideup. Fixed vectors are encouraged to use shufflevector instruction. The equivalent patch for fixed vectors is D119039. I've used a tail agnostic slidedown and limited the VL to only the elements that will not be overwritten by the slideup. The slideup uses VLMax for its VL. It unfortunately uses tail undisturbed policy but it isn't required as there is no tail. We just need the merge operand to carry the bits for the lower portion of the result. Care was taken to ensure that either the slideup or slidedown will be able to use a .vi instruction when the immediate is small. Which one uses the immediate depends on the sign of the immediate. Reviewed By: frasercrmck, ABataev Differential Revision: https://reviews.llvm.org/D119303	2022-03-01 10:10:13 -08:00
Craig Topper	bf8054644d	[DAGCombiner] Don't expand (neg (abs x)) if the abs has an additional user. If the types aren't legal, the expansions may get type legalized in a different way preventing code sharing. If the type is legal, we will share some instructions between the two expansions, but we will need an extra register. Since we don't appear to fold (neg (sub A, B)) if the sub has an additional user, I think it makes sense not to expand NABS. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D120513	2022-03-01 07:32:07 -08:00
Craig Topper	c752eb4ae1	[RISCV] Add test cases miscompile of (rotl (grevi X, 24), 16) on RV64. NFC This pattern was moved from isel to DAG combine in D119527, but it lost the RV32 qualification in the process.	2022-03-01 07:32:07 -08:00
Craig Topper	f46890711f	[RISCV] Custom type legalize i32 ISD::ABS on RV64 without Zbb. Default type legalization will create sext_inreg+abs, but we may not be able to remove the sext_inreg. Instead this patch expands abs during type legalization to Y = sraiw X, 31; subw(xor X, Y), Y) which doesn't require the input to be sign extended. This gives a big improvement for some neg-abs tests where the abs is used more than the the neg. Previously the abs was expanded a different way before and after type legalization. Now they are expanded in a similar way enabling more CSE. Reviewed By: asb Differential Revision: https://reviews.llvm.org/D120636	2022-02-28 09:30:27 -08:00
Chenbing Zheng	7f811ce127	[RISCV] Optimize (sext.w, srli) to sraiw with Zba. In this patch, we add a more narrower exclusion for zeroext (srl x) -> srli (slli x), so that it provides an opportunity for the selection of sraiw. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D120467	2022-02-28 10:34:35 +08:00
Craig Topper	5e33bd804b	[RISCV] Remove tab character from test. Autogenerate CHECK lines. NFC This was a test for an infinite loop so the CHECK lines don't really matter, but they'd get generated the next time someone runs the script on the file so might as well do it while I'm touching it.	2022-02-25 11:37:27 -08:00

1 2 3 4 5 ...

1458 Commits