llvm-project

Commit Graph

Author	SHA1	Message	Date
Fraser Cormack	37b41bd087	[RISCV] Add scalable vector fcmp ISel patterns Original patch by @rogfer01. All ordered comparisons except ONE are supported natively, and all unordered comparisons except UNE are expanded into sequences involving explicit NaN checks and mask arithmetic. Additionally, we expand GT,OGT,GE,OGE to their swapped-operand versions, and pattern-match those back to the "original", swapping operands once more. This way we catch both operations and both "vf" and "fv" forms with fewer patterns. Also add support for floating-point splat_vector, with an optimization for splatting fpimm0. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Fraser Cormack <fraser@codeplay.com> Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D94242	2021-01-11 19:38:56 +00:00
Craig Topper	131ce834e4	[RISCV] Clear isCodeGenOnly flag on VMSGE(U) pseudo instructions. Remove InstAliases that duplicate the asm strings in the pseudos. The Pseudo class sets isCodeGenOnly=1 which causes the asm strings in the pseudos to be ignored. I think this is why the aliases are needed at all. Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D94024	2021-01-10 23:39:08 -08:00
Craig Topper	5cf73dca77	[RISCV] Convert most of the information about RVV Pseudos into bits in TSFlags. This patch moves all but the BaseInstr to bits in TSFlags. For the index fields, we can just use a bit to indicate their presence. The locations of the operands are well defined. This reduces the llc binary by about 32K on my build. It also removes the binary search of the table from the custom inserter. Instead we just check that the SEW op is present. Reviewed By: rogfer01 Differential Revision: https://reviews.llvm.org/D94375	2021-01-10 19:15:45 -08:00
Craig Topper	6fc7a92eee	[RISCV] Change ConstraintMask in RISCVII enum to be shifted left. NFC This makes the mask align with the position of the bits in TSFlags which is a little more logical. I might be adding more fields to TSFlags and some might be single bits where just ANDing with mask to test the bit would make sense. While there rename TargetFlags in validateInstruction to reflect that it's just the constraint bits.	2021-01-09 20:22:07 -08:00
Craig Topper	59908fc06a	[RISCV] Use uint16_t instead of unsigned for opcodes in the RVV pseudo instruction table. We currently have about 7000 opcodes in the RISCVGenInstrInfo.inc enum. We can use uint16_t to store these values. We would need to grow by nearly 9x before we run out of space so this should last for a little while. This reduces the llc binary by 32K.	2021-01-09 19:26:32 -08:00
Fraser Cormack	b02eab9058	[RISCV] Add scalable vector icmp ISel patterns Original patch by @rogfer01. The RVV integer comparison instructions are defined in such a way that many LLVM operations are defined by using the "opposite" comparison instruction and swapping the operands. This is done in this patch in most cases, except for the mappings where the immediate range must be adjusted to accomodate: va < i --> vmsle{u}.vi vd, va, i-1, vm va >= i --> vmsgt{u}.vi vd, va, i-1, vm That is left for future optimization; this patch supports all operations but in the case of the missing mappings the immediate will be moved to a scalar register first. Since there are so many condition codes and operand cases to check, it was decided to reduce the test burden by only testing the "vscale x 8" vector types. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Fraser Cormack <fraser@codeplay.com> Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D94168	2021-01-09 20:54:34 +00:00
Fraser Cormack	de373ef779	[SelectionDAG] Extend immAll(Ones\|Zeros)V to handle ISD::SPLAT_VECTOR The TableGen immAllOnesV and immAllZerosV helpers implicitly wrapped the ISD::isBuildVectorAll(Ones\|Zeros) helper functions. This was inhibiting their use for targets such as RISC-V which use ISD::SPLAT_VECTOR. In particular, RISC-V had to define its own 'vnot' fragment. In order to extend the scope of these nodes to include support for ISD::SPLAT_VECTOR, two new ISD predicate functions have been introduced: ISD::isConstantSplatVectorAll(Ones\|Zeros). These effectively supersede the older "isBuildVector" predicates, which are now simple wrappers for the new functions. They pass a defaulted boolean toggle which preserves the old behaviour. It is hoped that in time all call-sites can be ported to the "isConstantSplatVector" functions. While the use of ISD::isBuildVectorAll(Ones\|Zeros) has not changed, the behaviour of the TableGen immAll(Ones\|Zeros)V has. To test the new functionality, the custom RISC-V TableGen fragment has been removed and replaced with the built-in 'vnot'. To test their use as pattern-roots, two splat patterns have been updated accordingly. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D94223	2021-01-09 17:05:31 +00:00
Roger Ferrer Ibanez	524d8fa9a5	[RISCV] Do not grow the stack a second time when we need to realign the stack This is a first change needed to fix a crash in which the emergency spill splot ends being out of reach. This happens when we run the register scavenger after we have eliminated the frame indexes. The fix for the actual crash will come in a later change. This change removes an extra stack size increase we do in RISCVFrameLowering::determineFrameLayout. We don't have to change the size of the stack here as PEI::calculateFrameObjectOffsets is already doing this with the right size accounting the extra alignment. Differential Revision: https://reviews.llvm.org/D89237	2021-01-09 16:51:09 +00:00
Ben Shi	55f0a1b066	[RISCV] Optimize multiplication with constant 1. Break MUL with specific constant to a SLLI and an ADD/SUB on riscv32 with the M extension. 2. Break MUL with specific constant to two SLLI and an ADD/SUB, if the constant needs a pair of LUI/ADDI to construct. Reviewed by: craig.topper Differential Revision: https://reviews.llvm.org/D93619	2021-01-09 10:37:21 +08:00
Craig Topper	0875a9da2a	[RISCV] Cleanup a few section comments in RISCVInstrInfoVPseudos.td. NFC	2021-01-08 11:36:31 -08:00
Evandro Menezes	946bc50e4c	[RISCV] Define the vfsqrt RVV intrinsics Define the `vfsqrt` IR intrinsics for the respective V instructions. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Evandro Menezes <evandro.menezes@sifive.com> Differential Revision: https://reviews.llvm.org/D93745	2021-01-07 17:29:29 -06:00
Fraser Cormack	c9154e8fa3	[RISCV] Add vector mask arithmetic ISel patterns The patterns that want to use 'vnot' use a custom PatFrag. This is because 'vnot' uses immAllOnesV which implicitly uses BUILD_VECTOR rather than SPLAT_VECTOR. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D94078	2021-01-07 09:43:25 +00:00
Craig Topper	7a8ced43d7	[RISCV] Fix a few section number comments in RISCVInstrInfoVPseudos.td to match the V extension 1.0 draft spec. NFC The majority of the comments use the 1.0 draft spec section numbers.	2021-01-06 16:38:30 -08:00
Craig Topper	c68faed041	[RISCV] Return a vXi1 vector type from getSetCCResultType if V extension is enabled. nvxXi1 types are legal with V extension and that's the result vmseq/vmsne/vmslt/etc instructions return. No test cases yet because the setcc isel patterns aren't in and we'll need more than basic tests to observe this. I locally tested that this plus D947078, D94168, D94142, and D94149 was enough to be able to handle the overflow result from llvm.sadd.overflow.	2021-01-06 11:50:15 -08:00
Fraser Cormack	e130dea92a	[RISCV] Add vector integer mul/mulh/div/rem ISel patterns There is no test coverage for the mulhs or mulhu patterns as I can't get the DAGCombiner to generate them for scalable vectors. There are a few places in that still need updating for that to work. I left the patterns in regardless as they are correct. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D94073	2021-01-06 09:24:07 +00:00
Christudasan Devadasan	d68458bd56	[GlobalISel] Base implementation for sret demotion. If the return values can't be lowered to registers SelectionDAG performs the sret demotion. This patch contains the basic implementation for the same in the GlobalISel pipeline. Furthermore, targets should bring relevant changes during lowerFormalArguments, lowerReturn and lowerCall to make use of this feature. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D92953	2021-01-06 10:30:50 +05:30
Craig Topper	7b5a0e2f88	[RISCV] Move shift ComplexPatterns and custom isel to PatFrags with predicates ComplexPatterns are kind of weird, they don't call any of the predicates on their operands. And their "complexity" used for tablegen ordering purposes in the matcher table is hand specified. This started as an attempt to just use sext_inreg + SLOIPat to implement SLOIW just to have one less Select function. The matching for the or+shl is the same as long as you know the immediate is less than 32 for SLOIW. But that didn't work out because using uimm5 with SLOIPat didn't do anything if it was a ComplexPattern. I realized I could just use a PatFrag with the opcodes I wanted to match and an immediate predicate would then evaluate correctly. This also computes the complexity just like any other pattern does. Then I just needed to check the constraints on the immediates in the predicate. Conveniently the predicate is evaluated after the fragment has been matched. So the structure has already been checked, we just need to find the constants. I'll note that this is unusual, I didn't find any other targets looking through operands in PatFrag predicate. There is a PredicateCodeUsesOperands feature that can be used to collect the operands into an array that is used by AMDGPU/VOP3Instructions.td. I believe that feature exists to handle commuted matching, but since the nodes here use constants, they aren't ever commuted Differential Revision: https://reviews.llvm.org/D91901	2021-01-05 11:37:48 -08:00
Craig Topper	210bc3dc0e	[RISCV] Don't parse 'vmsltu.vi v0, v1, 0' as 'vmsleu.vi v0, v1, -1' vmsltu.vi v0, v1, 0 is always false there is no unsigned number less than 0. vmsleu.vi v0, v1, -1 on the other hand is always true since -1 will be considered unsigned max and all numbers are <= unsigned max. A similar problem exists for vmsgeu.vi v0, v1, 0 which is always true, but becomes vmsgtu.vi v0, v1, -1 which is always false. To match the GNU assembler we'll emit vmsne.vv and vmseq.vv with the same register for these cases instead. I'm using AsmParserOnly pseudo instructions here because we can't match an explicit immediate in an InstAlias. And we can't use a AsmOperand for the zero because the output we want doesn't use an immediate so there's nowhere to name the AsmOperand we want to use. To keep the implementations similar I'm also handling signed with pseudo instructions even though they don't have this issue. This way we can avoid the special renderMethod that decremented by 1 so the immediate we see for the pseudo instruction in processInstruction is 0 and not -1. Another option might have been to have a different simm5_plus1 operand for the unsigned case or just live with the immediate being pre-decremented. I felt this way was clearer, but I'm open to other opinions. Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D94035	2021-01-05 10:59:30 -08:00
Craig Topper	249d7de119	[RISCV] Don't print zext.b alias. This alias for andi x, 255 was recently added to the spec. If we print it, code we output can't be compiled with -fno-integrated-as unless the GNU assembler is also a version that supports alias. Reviewed By: lenary Differential Revision: https://reviews.llvm.org/D93826	2021-01-05 10:41:08 -08:00
Craig Topper	c707716c04	[RISCV] Match vmslt(u).vx intrinsics with a small immediate to vmsle(u).vx. There are vmsle(u).vx and vmsle(u).vi instructions, but there is only vmslt(u).vx and no vmslt(u).vi. vmslt(u).vi can be emulated for some immediates by decrementing the immediate and using vmsle(u).vi. To avoid the user needing to know about this, this patch does this conversion. The assembler does the same thing for vmslt(u).vi and vmsge(u).vi pseudoinstructions. There is no vmsge(u).vx intrinsic or instruction so this patch is limited to vmslt(u). Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D94070	2021-01-05 10:20:21 -08:00
Fraser Cormack	1d4411e9ea	[RISCV] Add vector integer min/max ISel patterns Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D94012	2021-01-05 09:15:50 +00:00
Craig Topper	fe597efc30	[RISCV] Remove unused method RISCVInstPrinter::printSImm5Plus1. NFC simm5_plus1 is only used by InstAliases so should never be printed.	2021-01-04 12:21:35 -08:00
Craig Topper	dc9ac0e820	[RISCV] Replace i32 with XLenVT in (add AddrFI, simm12) isel patterns. With the i32 these patterns will only fire on RV32, but they don't look RV32 specific. Reviewed By: lenary Differential Revision: https://reviews.llvm.org/D93843	2021-01-04 10:53:27 -08:00
Matt Arsenault	d8938c8bb5	CodeGen: Use Register	2021-01-04 12:53:06 -05:00
Craig Topper	94257d12cb	[RISCV] Remove unused method isUImm5NonZero() from RISCVAsmParser.cpp. NFC The operand predicate that used this has been gone for a while.	2021-01-04 00:17:39 -08:00
Hsiangkai Wang	e4337159e3	[NFC][RISCV] Move vmsge{u}.vx processing to RISCVAsmParser. We could expand vmsge{u}.vx pseudo instructions in RISCVAsmParser. It is more appropriate to expand it before encoding. Differential Revision: https://reviews.llvm.org/D93968	2021-01-02 08:42:53 +08:00
Monk Chiang	1d04cbeb43	[RISCV] Define vector single-width type-convert intrinsic. Define intrinsics: 1. vfcvt.xu.f.v/vfcvt.x.f.v 2. vfcvt.rtz.xu.f.v/vfcvt.rtz.x.f.v 3. vfcvt.f.xu.v/vfcvt.f.x.v We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Monk Chiang <monk.chiang@sifive.com> Differential Revision: https://reviews.llvm.org/D93933	2020-12-31 11:49:30 +08:00
Monk Chiang	2aed9bc98a	[RISCV] Define vector narrowing type-convert intrinsic. Define intrinsics: 1. vfncvt.xu.f.w/vfncvt.x.f.w 2. vfncvt.rtz.xu.f.w/vfncvt.rtz.x.f.w 3. vfncvt.f.xu.w/vfncvt.f.x.w 4. vfncvt.f.f.w/vfncvt.rod.f.f.w We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Monk Chiang <monk.chiang@sifive.com> Differential Revision: https://reviews.llvm.org/D93932	2020-12-31 11:48:28 +08:00
Monk Chiang	fdd30faae5	[RISCV] Define vector widening type-convert intrinsic. Define intrinsics: 1. vfwcvt.xu.f.v/vfwcvt.x.f.v 2. vfwcvt.rtz.xu.f.v/vfwcvt.rtz.x.f.v 3. vfwcvt.f.xu.v/vfwcvt.f.x.v 4. vfwcvt.f.f.v We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Monk Chiang <monk.chiang@sifive.com> Differential Revision: https://reviews.llvm.org/D93855	2020-12-31 11:48:09 +08:00
ShihPo Hung	096b02ebbf	[RISCV] Add intrinsics for vcompress instruction This patch defines vcompress intrinsics and lower to V instructions. We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: ShihPo Hung <shihpo.hung@sifive.com> Differential revision: https://reviews.llvm.org/D93809	2020-12-29 18:38:15 -08:00
Zakk Chen	6da0033624	[RISCV] Define vsext/vzext intrinsics. Define vsext/vzext intrinsics.and lower to V instructions. Define new fraction register class fields in LMULInfo and a NoReg to present invalid LMUL register classes. Authored-by: ShihPo Hung <shihpo.hung@sifive.com> Co-Authored-by: Zakk Chen <zakk.chen@sifive.com> Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D93893	2020-12-29 16:50:53 -08:00
Fraser Cormack	f7f09e2b1c	[RISCV] Fill out basic integer RVV ISel patterns This complements the existing RVV ISel patterns for arithmetic, bitwise and shifts with the remaining operations in those categories: sub, and, xor, sra. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D93852	2020-12-29 19:32:18 +00:00
Craig Topper	79cbb003c5	[RISCV] Don't use tail agnostic policy on instructions where destination is tied to source If the destination is tied, then user has some control of the register used for input. They would have the ability to control the value of any tail elements. By using tail agnostic we take this option away from them. Its not clear that the intrinsics are defined such that this isn't supposed to work. And undisturbed is a valid implementation for agnostic so code wouldn't even fail to work on all systems if we always used agnostic. The vcompress intrinsic is defined to require tail undisturbed so at minimum we need this for that instruction or need to redefine the intrinsic. I've made an exception here for vmv.s.x/fmv.s.f and reduction instructions which only write to element 0 regardless of the tail policy. This allows us to keep the agnostic policy on those which should allow better redundant vsetvli removal. An enhancement would be to check for undef input and keep the agnostic policy, but we don't have good test coverage for that yet. Reviewed By: khchen Differential Revision: https://reviews.llvm.org/D93878	2020-12-29 10:37:58 -08:00
Craig Topper	2ae760e27e	[RISCV] Add earlyclobber of destination register to vmsbf.m/vmsif.m/vmsof.m instructions The spec for these instructions include this note. "The destination register cannot overlap either the source register or the mask register ('v0') if the instruction is masked." So we need earlyclobber to enforce this constraint. I've regenerated the tests with update_llc_test_checks.py to show the effects of the earlyclobber. Reviewed By: khchen, frasercrmck Differential Revision: https://reviews.llvm.org/D93867	2020-12-29 10:00:04 -08:00
Fraser Cormack	aebb4a6052	[RISCV] Rewrite and simplify helper function. NFC. Reviewed By: luismarques Differential Revision: https://reviews.llvm.org/D93851	2020-12-29 11:29:44 +00:00
Zakk Chen	f3f9ce3b79	[RISCV] Define vmclr.m/vmset.m intrinsics. Define vmclr.m/vmset.m intrinsics and lower to vmxor.mm/vmxnor.mm. Ideally all rvv pseudo instructions could be implemented in C header, but those two instructions don't take an input, codegen can not guarantee that the source register becomes the same as the destination. We expand pseduo-v-inst into corresponding v-inst in RISCVExpandPseudoInsts pass. Reviewed By: craig.topper, frasercrmck Differential Revision: https://reviews.llvm.org/D93849	2020-12-28 18:57:17 -08:00
Zakk Chen	e673d40199	[RISCV] Define vmsbf.m/vmsif.m/vmsof.m/viota.m/vid.v intrinsics. Define those intrinsics and lower to V instructions. Use update_llc_test_checks.py for viota.m tests to check earlyclobber is applied correctly. mask viota.m tests uses the same argument as input and mask for avoid dependency of D93364. We work with @rogfer01 from BSC to come out this patch. Reviewed By: HsiangKai Differential Revision: https://reviews.llvm.org/D93823	2020-12-28 05:54:18 -08:00
Fraser Cormack	d85a198e85	[RISCV] Pattern-match more vector-splatted constants This patch extends the pattern-matching capability of vector-splatted constants. When illegally-typed constants are legalized they are canonically sign-extended to XLenVT. This preserves the sign and allows us to match simm5. If they were zero-extended for whatever reason we'd lose that ability: e.g. `(i8 -1) -> (XLenVT 255)` would not be matched under the current logic. To address this we first manually sign-extend the splatted constant from the vector element type to int64_t. This preserves the semantics while removing any implicitly-truncated bits. The corresponding logic for uimm5 was not updated, the rationale being that neither sign- nor zero-extending a legal uimm5 immediate should change that (unless we expect actual "garbage" upper bits). Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D93837	2020-12-28 07:11:10 +00:00
Craig Topper	76202f09b5	[RISCV] Improve VMConstraint checking on more unary and nullary instructions. We weren't consistently marking unary instructions as OneInput and vid.v is really ZeroInput but we had no way to mark that. This patch improves this by removing the error prone OneInput constraint. Instead we just always look for the mask in the last operand. It appears that the "CheckReg" variable used for the check on the broken instruction was unitialized or garbage because it was also used for VS1/VS2 constraints. I've scoped the variable locally to each check now. I've gone through and set NoConstraint on instructions that don't have a real VMConstraint and don't have a mask as the last operand. I've also removed the unused enum values in RISCVBaseInfo.h. We never use them in C++ and we have separate versions in a td file. Reviewed By: HsiangKai Differential Revision: https://reviews.llvm.org/D93784	2020-12-26 18:47:59 -08:00
Monk Chiang	622ea9cf74	[RISCV] Define vector widening reduction intrinsic. Define vwredsumu/vwredsum/vfwredosum/vfwredsum We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Zakk Chen <zakk.chen@sifive.com> Differential Revision: https://reviews.llvm.org/D93807	2020-12-26 21:42:30 +08:00
Zakk Chen	da4a637e99	[RISCV] Define vpopc/vfirst intrinsics. Define vpopc/vfirst intrinsics and lower to V instructions. We work with @rogfer01 from BSC to come out this patch. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D93795	2020-12-24 19:44:34 -08:00
Kazu Hirata	d6ff5cf995	[Target] Use llvm::any_of (NFC)	2020-12-24 19:43:26 -08:00
Zakk Chen	351c216f36	[RISCV] Define vector mask-register logical intrinsics. Define vector mask-register logical intrinsics and lower them to V instructions. Also define pseudo instructions vmmv.m and vmnot.m. We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Zakk Chen <zakk.chen@sifive.com> Differential Revision: https://reviews.llvm.org/D93705	2020-12-24 18:59:05 -08:00
ShihPo Hung	912740a864	[RISCV] Add intrinsics for vrgather instruction This patch defines vrgather intrinsics and lower to V instructions. We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: ShihPo Hung <shihpo.hung@sifive.com> Differential revision: https://reviews.llvm.org/D93797	2020-12-24 18:16:02 -08:00
Monk Chiang	afd03cd335	[RISCV] Define vector single-width reduction intrinsic. integer group: vredsum/vredmaxu/vredmax/vredminu/vredmin/vredand/vredor/vredxor float group: vfredosum/vfredsum/vfredmax/vfredmin We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Zakk Chen <zakk.chen@sifive.com> Differential Revision: https://reviews.llvm.org/D93746	2020-12-25 09:56:01 +08:00
Fraser Cormack	1a7ac29a89	[RISCV] Add ISel support for RVV vector/scalar forms This patch extends the SDNode ISel support for RVV from only the vector/vector instructions to include the vector/scalar and vector/immediate forms. It uses splat_vector to carry the scalar in each case, except when XLEN<SEW (RV32 SEW=64) when a custom node `SPLAT_VECTOR_I64` is used for type-legalization and to encode the fact that the value is sign-extended to SEW. When the scalar is a full 64-bit value we use a sequence to materialize the constant into the vector register. The non-intrinsic ISel patterns have also been split into their own file. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Fraser Cormack <fraser@codeplay.com> Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D93312	2020-12-23 20:16:18 +00:00
Craig Topper	e0110a4740	[RISCV] Add intrinsics for vfmv.v.f Also include a special case pattern to use vmv.v.x vd, zero when the argument is 0.0. Reviewed By: khchen Differential Revision: https://reviews.llvm.org/D93672	2020-12-23 10:50:48 -08:00
ShihPo Hung	6301871d06	[RISCV] Add intrinsics for vfwmacc, vfwnmacc, vfwmsac, vfwnmsac instructions This patch defines vfwmacc, vfwnmacc, vfwmsc, vfwnmsac intrinsics and lower to V instructions. We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: ShihPo Hung <shihpo.hung@sifive.com> Differential Revision: https://reviews.llvm.org/D93693	2020-12-23 00:42:04 -08:00
Zakk Chen	032600b9ae	[RISCV] Define vmerge/vfmerge intrinsics. Define vmerge/vfmerge intrinsics and lower to V instructions. Include support for vector-vector vfmerge by vmerge.vvm. We work with @rogfer01 from BSC to come out this patch. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D93674	2020-12-23 00:07:09 -08:00
Evandro Menezes	4d47944393	[RISCV] Define the vfmin, vfmax RVV intrinsics Define the vfmin, vfmax IR intrinsics for the respective V instructions. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Evandro Menezes <evandro.menezes@sifive.com> Differential Revision: https://reviews.llvm.org/D93673	2020-12-23 00:27:38 -06:00
ShihPo Hung	ad0a7ad950	[RISCV] Add intrinsics for vf[n]macc/vf[n]msac/vf[n]madd/vf[n]msub instructions This patch defines vfmadd/vfnmacc, vfmsac/vfnmsac, vfmadd/vfnmadd, and vfmsub/vfnmsub lower to V instructions. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: ShihPo Hung <shihpo.hung@sifive.com> Differential Revision: https://reviews.llvm.org/D93691	2020-12-22 18:34:00 -08:00
ShihPo Hung	4268783998	[RISCV] Add intrinsics for vwmacc[u\|su\|us] instructions This patch defines vwmacc[u\|su\|us] intrinsics and lower to V instructions. We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: ShihPo Hung <shihpo.hung@sifive.com> Differential Revision: https://reviews.llvm.org/D93675	2020-12-22 18:17:39 -08:00
ShihPo Hung	c8874464b5	[RISCV] Add intrinsics for vslide1up/down, vfslide1up/down instruction This patch adds intrinsics for vslide1up, vslide1down, vfslide1up, vfslide1down. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: ShihPo Hung <shihpo.hung@sifive.com> Differential Revision: https://reviews.llvm.org/D93608	2020-12-22 18:14:22 -08:00
Craig Topper	53deef9e0b	[RISCV] Remove unneeded !eq comparing a single bit value to 0/1 in RISCVInstrInfoVPseudos.td. NFC Instead we can either use the bit directly. If it was checking for 0 we need to swap the operands or use !not.	2020-12-22 11:57:16 -08:00
Nandor Licker	0586f048d7	[RISCV] Basic jump table lowering This patch enables jump table lowering in the RISC-V backend. In addition to the test case included, the new lowering was tested by compiling the OCaml runtime and running it under qemu. Differential Revision: https://reviews.llvm.org/D92097	2020-12-22 15:05:54 +00:00
Hsiangkai Wang	9a8ef927df	[RISCV] Define vector compare intrinsics. Define vector compare intrinsics and lower them to V instructions. We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Hsiangkai Wang <kai.wang@sifive.com> Differential Revision: https://reviews.llvm.org/D93368	2020-12-22 14:08:18 +08:00
Zakk Chen	7a2c8be641	[RISCV] Define vleff intrinsics. Define vleff intrinsics and lower to V instructions. We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Zakk Chen <zakk.chen@sifive.com> Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D93516	2020-12-21 22:05:38 -08:00
ShihPo Hung	b15ba2cf6f	[RISCV] Add intrinsics for vmacc/vnmsac/vmadd/vnmsub instructions This defines vmadd, vmacc, vnmsub, and vnmsac intrinsics and lower to V instructions. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: ShihPo Hung <shihpo.hung@sifive.com> Differential Revision: https://reviews.llvm.org/D93632	2020-12-21 17:37:20 -08:00
Evandro Menezes	ed73a78924	[RISCV] Define the vand, vor and vxor RVV intrinsics Define the `vand`, `vor` and `vxor` IR intrinsics for the respective V instructions. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Evandro Menezes <evandro.menezes@sifive.com> Differential Revision: https://reviews.llvm.org/D93574	2020-12-21 16:20:26 -06:00
Fangrui Song	d9a0c40bce	[MC] Split MCContext::createTempSymbol, default AlwaysAddSuffix to true, and add comments CanBeUnnamed is rarely false. Splitting to a createNamedTempSymbol makes the intention clearer and matches the direction of reverted r240130 (to drop the unneeded parameters). No behavior change.	2020-12-21 14:04:13 -08:00
Monk Chiang	3183add534	[RISCV] Define the remaining vector fixed-point arithmetic intrinsics. This patch base on D93366, and define vector fixed-point intrinsics. 1. vaaddu/vaadd/vasubu/vasub 2. vsmul 3. vssrl/vssra 4. vnclipu/vnclip We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: ShihPo Hung <shihpo.hung@sifive.com> Differential Revision: https://reviews.llvm.org/D93508	2020-12-20 22:57:07 -08:00
Kazu Hirata	966f1431de	[Target] Use llvm::erase_if (NFC)	2020-12-20 17:43:22 -08:00
ShihPo Hung	d86a00d8fe	[RISCV] Define vslideup/vslidedown intrinsics Differential Revision: https://reviews.llvm.org/D93286	2020-12-20 05:08:15 -08:00
Hsiangkai Wang	41ab45d662	[RISCV] Define vector vfwmul intrinsics. Define vector vfwmul intrinsics and lower them to V instructions. We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Hsiangkai Wang <kai.wang@sifive.com> Differential Revision: https://reviews.llvm.org/D93584	2020-12-20 17:39:20 +08:00
Hsiangkai Wang	f86e61d886	[RISCV] Define vector vfwadd/vfwsub intrinsics. Define vector vfwadd/vfwsub intrinsics and lower them to V instructions. We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Hsiangkai Wang <kai.wang@sifive.com> Differential Revision: https://reviews.llvm.org/D93583	2020-12-20 17:39:13 +08:00
Hsiangkai Wang	bd576ac8d4	[RISCV] Define vector vfsgnj/vfsgnjn/vfsgnjx intrinsics. Define vector vfsgnj/vfsgnjn/vfsgnjx intrinsics and lower them to V instructions. We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Hsiangkai Wang <kai.wang@sifive.com> Differential Revision: https://reviews.llvm.org/D93581	2020-12-20 17:39:04 +08:00
Hsiangkai Wang	62c94f0678	[RISCV] Define vector vfmul/vfdiv/vfrdiv intrinsics. Define vector vfmul/vfdiv/vfrdiv intrinsics and lower them to V instructions. We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Hsiangkai Wang <kai.wang@sifive.com> Differential Revision: https://reviews.llvm.org/D93580	2020-12-20 17:38:57 +08:00
Zakk Chen	9cf3b1b666	[RISCV] Define vlxe/vsxe/vsuxe intrinsics. Define vlxe/vsxe intrinsics and lower to vlxei<EEW>/vsxei<EEW> instructions. We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Zakk Chen <zakk.chen@sifive.com> Differential Revision: https://reviews.llvm.org/D93471	2020-12-19 06:50:20 -08:00
Fraser Cormack	7948cd11d1	[RISCV] Address clang-tidy warnings in RISCVTargetMachine. NFC.	2020-12-18 21:50:55 +00:00
Fraser Cormack	d4ed253d0b	[RISCV] Assume no-op addrspacecasts by default To support OpenCL, which typically uses SPIR as an IR, non-zero address spaces must be accounted for. This patch makes the RISC-V target assume no-op address space casts across the board, which effectively removes the need to support addrspacecast instructions in the backend. For a RISC-V implementation with different configurations or specialized address spaces where casts aren't no-ops, the function can be adjusted as required. Reviewed By: jrtc27 Differential Revision: https://reviews.llvm.org/D93536	2020-12-18 21:03:37 +00:00
Craig Topper	69c8d121f7	[RISCV] Add intrinsics for vsetvli instruction This patch adds two IR intrinsics for vsetvli instruction. One to set the vector length to a user specified value and one to set it to vlmax. The vlmax uses the X0 source register encoding. Clang builtins will follow in a separate patch Differential Revision: https://reviews.llvm.org/D92973	2020-12-18 12:10:09 -08:00
Craig Topper	09468a9148	[RISCV] Sign extend constant arguments to V intrinsics when promoting to XLen. The default behavior for any_extend of a constant is to zero extend. This occurs inside of getNode rather than allowing type legalization to promote the constant which would sign extend. By using sign extend with getNode the constant will be sign extended. This gives a better chance for isel to find a simm5 immediate since all xlen bits are examined there. For instructions that use a uimm5 immediate, this change only affects constants >= 128 for i8 or >= 32768 for i16. Constants that large already wouldn't have been eligible for uimm5 and would need to use a scalar register. If the instruction isn't able to use simm5 or the immediate is too large, we'll need to materialize the immediate in a register. As far as I know constants with all 1s in the upper bits should materialize as well or better than all 0s. Longer term we should probably have a SEW aware PatFrag to ignore the bits above SEW before checking simm5. I updated about half the test cases in some tests to use a negative constant to get coverage for this. Reviewed By: evandro Differential Revision: https://reviews.llvm.org/D93487	2020-12-18 11:43:38 -08:00
Craig Topper	1c3a6671c2	Recommit "[RISCV] Add intrinsics for vfmv.f.s and vfmv.s.f" This time with tests. Original message: Similar to D93365, but for floating point. No need for special ISD opcodes though. We can directly isel these from intrinsics. I had to use anyfloat_ty instead of anyvector_ty in the intrinsics to make LLVMVectorElementType not crash when imported into the -gen-dag-isel tablegen backend. Differential Revision: https://reviews.llvm.org/D93426	2020-12-18 11:19:05 -08:00
Craig Topper	cd3e811864	Revert "[RISCV] Add intrinsics for vfmv.f.s and vfmv.s.f" This reverts commit `46a40c4bc1`. I forgot to git add the tests.	2020-12-18 11:16:36 -08:00
Craig Topper	46a40c4bc1	[RISCV] Add intrinsics for vfmv.f.s and vfmv.s.f Similar to D93365, but for floating point. No need for special ISD opcodes though. We can directly isel these from intrinsics. I had to use anyfloat_ty instead of anyvector_ty in the intrinsics to make LLVMVectorElementType not crash when imported into the -gen-dag-isel tablegen backend. Differential Revision: https://reviews.llvm.org/D93426	2020-12-18 11:11:15 -08:00
Craig Topper	86d282baed	[RISCV] Add intrinsics for vmv.x.s and vmv.s.x This adds intrinsics for vmv.x.s and vmv.s.x. I've used stricter type constraints on these intrinsics than what we've been doing on the arithmetic intrinsics so far. This will allow us to not need to pass the scalar type to the Intrinsic::getDeclaration call when creating these intrinsics. A custom ISD is used for vmv.x.s in order to implement the change in computeNumSignBitsForTargetNode which can remove sign extends on the result. I also modified the MC layer description of these instructions to show the tied source/dest operand. This is different than what we do for masked instructions where we drop the tied source operand when converting to MC. But it is a more accurate description of the instruction. We can't do this for masked instructions since we use the same MC instruction for masked and unmasked. Tools like llvm-mca operate in the MC layer and rely on ins/outs and Uses/Defs for analysis so I don't know if we'll be able to maintain the current behavior for masked instructions. So I went with the accurate description here since it was easy. Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D93365	2020-12-18 10:30:48 -08:00
Craig Topper	fc7b7fc066	[RISCV] Add intrinsics for vmv.v.v, vmv.v.x, and vmv.x.i We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Craig Topper <craig.topper@sifive.com> Differential Revision: https://reviews.llvm.org/D93514	2020-12-18 09:49:07 -08:00
Hsiangkai Wang	7087ae7be9	[RISCV] Remove NoVReg to avoid compile warning messages.	2020-12-18 11:37:47 +08:00
Monk Chiang	ee2cb90e3b	[RISCV] Define vsadd/vsaddu/vssub/vssubu intrinsics. We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: ShihPo Hung <shihpo.hung@sifive.com> Co-Authored-by: Monk Chiang <monk.chiang@sifive.com> Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D93366	2020-12-18 10:24:24 +08:00
Zakk Chen	4b07c515ef	[RISCV] Define vlse/vsse intrinsics. Define vlse/vsse intrinsics and lower to V instructions. We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Zakk Chen <zakk.chen@sifive.com> Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D93445	2020-12-17 17:00:01 -08:00
Hsiangkai Wang	a5e4a513b0	[RISCV] Define vector widening mul intrinsics. Define vector widening mul intrinsics and lower them to V instructions. We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Hsiangkai Wang <kai.wang@sifive.com> Differential Revision: https://reviews.llvm.org/D93381	2020-12-17 11:50:33 +08:00
Hsiangkai Wang	dd5281e7cc	[RISCV] Define vector mul/div/rem intrinsics. Define vector mul/div/rem intrinsics and lower them to V instructions. We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Hsiangkai Wang <kai.wang@sifive.com> Differential Revision: https://reviews.llvm.org/D93380	2020-12-17 11:50:17 +08:00
Hsiangkai Wang	f03609b5c7	[RISCV] V does not imply F. If users want to use vector floating point instructions, they need to specify 'F' extension additionally. Differential Revision: https://reviews.llvm.org/D93282	2020-12-17 10:57:36 +08:00
Zakk Chen	c1d6d461aa	[RISCV] Define vle/vse intrinsics. Define vle/vse intrinsics and lower to V instructions. We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Zakk Chen <zakk.chen@sifive.com> Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D93359	2020-12-16 18:08:15 -08:00
Zakk Chen	15ce0ab7ac	[RISCV] Refine vector load/store tablegen pattern, NFC. Refine tablegen pattern for vector load/store, and follow D93012 to separate masked and unmasked definitions for pseudo load/store instructions. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D93284	2020-12-15 18:55:55 -08:00
Hsiangkai Wang	c1dac6bac5	[RISCV] Define vfadd/vfsub/vfrsub intrinsics. Define vfadd/vfsub/vfrsub intrinsics and lower to V instructions. We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Hsiangkai Wang <kai.wang@sifive.com> Differential Revision: https://reviews.llvm.org/D93291	2020-12-16 06:31:47 +08:00
Hsiangkai Wang	903f295009	[RISCV] Define vmin/vminu/vmax/vmaxu intrinsics. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Hsiangkai Wang <kai.wang@sifive.com> Differential Revision: https://reviews.llvm.org/D93218	2020-12-16 06:31:47 +08:00
Hsiangkai Wang	fd27164563	[RISCV] Define vnsrl/vnsra intrinsics. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Hsiangkai Wang <kai.wang@sifive.com> Differential Revision: https://reviews.llvm.org/D93207	2020-12-16 06:31:47 +08:00
Hsiangkai Wang	95795e7a65	[RISCV] Define vsll/vsrl/vsra intrinsics. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Hsiangkai Wang <kai.wang@sifive.com> Differential Revision: https://reviews.llvm.org/D93193	2020-12-16 06:31:47 +08:00
Hsiangkai Wang	19db6a652b	[RISCV] Define vadc/vmadc/vsbc/vmsbc intrinsics. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Hsiangkai Wang <kai.wang@sifive.com> Differential Revision: https://reviews.llvm.org/D93175	2020-12-16 06:31:47 +08:00
Craig Topper	028efac2d7	[RISCV] Only custom legalize i32 arguments to vector intrinsics on RV64.	2020-12-15 13:54:41 -08:00
Hsiangkai Wang	db48a6de77	[RISCV] Define vwadd/vwaddu/vwsub/vwsubu intrinsics. Define vwadd/vwaddu/vwsub/vwsubu intrinsics and lower to V instructions. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Hsiangkai Wang <kai.wang@sifive.com> Differential Revision: https://reviews.llvm.org/D93108	2020-12-15 20:15:06 +08:00
Hsiangkai Wang	14a91d676b	[RISCV][NFC] Define scalable vectors for half types. This is a preperation work for vfadd intrinsics. Differential Revision: https://reviews.llvm.org/D93275	2020-12-15 16:23:22 +08:00
Hsiangkai Wang	a6805a0e02	[RISCV] Define vadd/vsub/vrsub intrinsics and lower to V instructions. This patch is based on the proposal from Roger Ferrer Ibanez. http://lists.llvm.org/pipermail/llvm-dev/2020-October/145850.html Differential Revision: https://reviews.llvm.org/D93013	2020-12-15 12:56:49 +08:00
Craig Topper	b094eaa392	[RISCV] Prevent assertion in the assembler if vmerge or vfmerge are given a V0 destination.	2020-12-14 17:22:55 -08:00
Craig Topper	2cf12ae0cc	[RISCV] Handle Match_InvalidSImm5 in RISCVAsmParser::MatchAndEmitInstruction	2020-12-14 17:22:55 -08:00
Craig Topper	413596ee45	[RISCV] Teach debug output from assembly parser to print register names instead of enum values.	2020-12-14 17:22:55 -08:00
Craig Topper	045304701b	[RISCV] Move vtype decoding and printing from RISCVInstPrinter to RISCVBaseInfo. Share with the assembly parser's debug output This moves the vtype decoding and printing to RISCVBaseInfo. This keeps all of the decoding code in the same area as the encoding code. This will make it easier to change the decoding for the 1.0 spec in the future. We're now sharing the printing with the debug output for operands in the assembler. This also fixes that debug output to include the tail and mask agnostic bits. Since the printing code works on the vtype immediate value, we now encode the immediate during parsing and store just the immediate in the operand.	2020-12-14 10:50:26 -08:00
Craig Topper	b577d2df7b	[RISCV] Add a pass to remove duplicate VSETVLI instructions in a basic block. Add simple pass for removing redundant vsetvli instructions within a basic block. This handles the case where the AVL register and VTYPE immediate are the same and no other instructions that change VTYPE or VL are between them. There are going to be more opportunities for improvement in this space as we development more complex tests. Differential Revision: https://reviews.llvm.org/D92679	2020-12-11 10:35:37 -08:00
Hsiangkai Wang	5aa584ec71	[RISCV] Separate masked and unmasked definitions for pseudo instructions. Differential Revision: https://reviews.llvm.org/D93012	2020-12-11 14:02:56 +08:00
Craig Topper	b90e2d850e	[RISCV] Use tail agnostic policy for vsetvli instruction emitted in the custom inserter The compiler is making no effort to preserve upper elements. To do so would require another source operand tied with the destination and a different intrinsic interface to give control of this source to the programmer. This patch changes the tail policy to agnostic so that the CPU doesn't need to make an effort to preserve them. This is consistent with the RVV intrinsic spec here https://github.com/riscv/rvv-intrinsic-doc/blob/master/rvv-intrinsic-rfc.md#configuration-setting Differential Revision: https://reviews.llvm.org/D93080	2020-12-10 19:48:03 -08:00
Craig Topper	e2006ed0f7	[RISCV] Simplify vector instruction handling in RISCVMCInstLower.cpp. Use RegisterClass::contains instead of going through getMinimalPhysRegClass and hasSuperClassEq. Remove the special case for NoRegister. It's identical to the handling for any other regsiter that isn't VRM2/M4/M8.	2020-12-10 13:40:00 -08:00
Sam Elliott	12406ade06	[RISCV] Add (Proposed) Assembler Extend Pseudo-Instructions There is an in-progress proposal for the following pseudo-instructions in the assembler, to complement the existing `sext.w` rv64i instruction: - sext.b - sext.h - zext.b - zext.h - zext.w The `.b` and `.h` variants are available with rv32i and rv64i, and `zext.w` is only available with `rv64i`. These are implemented primarily as pseudo-instructions, as these instructions expand to multiple real instructions. In the case of `zext.b`, this expands to a single rv32/64i instruction, so it is implemented with an InstAlias (like `sext.w` is on rv64i). The proposal is available here: https://github.com/riscv/riscv-asm-manual/pull/61 Reviewed By: asb Differential Revision: https://reviews.llvm.org/D92793	2020-12-10 19:25:51 +00:00
Craig Topper	a1ae3c6ac9	[RISCV][LegalizeDAG] Expand SETO and SETUO comparisons. Teach LegalizeDAG to expand SETUO expansion when UNE isn't legal. If SETUNE isn't legal, UO can use the NOT of the SETO expansion. Removes some complex isel patterns. Most of the test changes are from using XORI instead of SEQZ. Differential Revision: https://reviews.llvm.org/D92008	2020-12-10 09:15:52 -08:00
Fraser Cormack	af5fd65895	[RISCV] Fix missing def operand when creating VSETVLI pseudos The register operand was not being marked as a def when it should be. No tests for this in the main branch as there are not yet any pseudos without a non-negative VLIndex. Also change the type of a virtual register operand from unsigned to Register and adjust formatting. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D92823	2020-12-09 09:35:28 +00:00
Craig Topper	aaa925795f	[RISCV] Use SDLoc created early in RISCVDAGToDAGISel::Select instead of recreating it in multiple cases in the switch. NFC	2020-12-08 21:13:25 -08:00
Craig Topper	846f576bea	[RISCV] Add a table showing the layout of the fields in VTYPE. Rename MaskedOffAgnostic->MaskAgnostic. NFC	2020-12-08 20:41:57 -08:00
Craig Topper	a64998be99	[RISCV] Share VTYPE encoding code between the assembler and the CustomInserter for adding VSETVLI before vector instructions This merges the SEW and LMUL enums that each used into singles enums in RISCVBaseInfo.h. The patch also adds a new encoding helper to take SEW, LMUL, tail agnostic, mask agnostic and turn it into a vtype immediate. I also stopped storing the Encoding in the VTYPE operand in the assembler. It is easy to calculate when adding the operand which should only happen once per instruction. Differential Revision: https://reviews.llvm.org/D92813	2020-12-08 16:04:20 -08:00
Craig Topper	98bca0a605	[RISCV] Add isel patterns for SBCLRI/SBSETI/SBINVI(W) instruction We can use these instructions for single bit immediates that are too large for ANDI/ORI/CLRI. The _10 test cases are to make sure that we still use ANDI/ORI/CLRI for small immediates. Differential Revision: https://reviews.llvm.org/D92262	2020-12-08 12:22:40 -08:00
Craig Topper	fb5b611af9	[RISCV] Detect more errors when parsing vsetvli in the assembler -Reject an "mf1" lmul -Make sure tail agnostic is exactly "tu" or "ta" not just that it starts with "tu" or "ta" -Make sure mask agnostic is exactly "mu" or "ma" not just that it starts with "mu" or "ma" Differential Revision: https://reviews.llvm.org/D92805	2020-12-08 11:25:39 -08:00
Craig Topper	88e58939dc	[RISCV] When parsing vsetvli in the assembler, use StringRef::getAsInteger instead of APInt's string constructor APInt's string constructor asserts on error. Since this is the parser and we don't yet know if the string is a valid integer we shouldn't use that. Instead use StringRef::getAsInteger which returns a bool to indicate success or failure. Since we no longer need APInt, use 'unsigned' instead. Differential Revision: https://reviews.llvm.org/D92801	2020-12-08 11:25:39 -08:00
Craig Topper	3e86fbc971	[RISCV] Replace custom isel code for RISCVISD::READ_CYCLE_WIDE with isel pattern This node returns 2 results and uses a chain. As long as we use a DAG as part of the pseudo instruction definition where we can use the "set" operator, it looks like tablegen can handle use a pattern for this without a problem. I believe the original implementation was copied from PowerPC. This also fixes the pseudo instruction so that it is marked as having side effects to match the definition of CSRRS and the RV64 instruction. And we don't need to explicitly clear mayLoad/mayStore since those can be inferred now. Differential Revision: https://reviews.llvm.org/D92786	2020-12-08 10:23:37 -08:00
Craig Topper	5c819eb389	[RISCV] Form GORCI from (or (rotl/rotr X, Bitwidth/2), X). A rotate by half the bitwidth swaps the bottom and top half which is the same as one of the MSB GREVI stage. We have to do this as a special combine because we prefer to keep (rotl/rotr X, BitWidth/2) as a rotate rather than a single stage GREVI. Differential Revision: https://reviews.llvm.org/D92286	2020-12-07 10:28:04 -08:00
Craig Topper	5fc8f90f0a	[RISCV] Replace a custom SDTypeProfile with SDTIntBinOp which should be sufficient here. On the surface this would be slightly less optimal for the isel table, but due to a tablegen issue with HW mode this ends up generating a smaller isel table.	2020-12-05 20:18:22 -08:00
Hsiangkai Wang	3c12307c7a	[RISCV] Formatting for easier reading (NFC) Authored-by: Hsiangkai Wang <kai.wang@sifive.com>	2020-12-04 23:11:36 -06:00
Craig Topper	03fc4f2e9a	[RISCV] Use fcvt.h/d/f.w if the input is an assertsexti32 not just when the input is sext_inreg.	2020-12-04 18:40:02 -08:00
Craig Topper	5baef6353e	[RISCV] Initial infrastructure for code generation of the RISC-V V-extension The companion RFC (http://lists.llvm.org/pipermail/llvm-dev/2020-October/145850.html) gives lots of details on the overall strategy, but we summarize it here: LLVM IR involving vector types is going to be selected using pseudo instructions (only MachineInstr). These pseudo instructions contain dummy operands to represent the vector type being operated and the vector length for the operation. These two dummy operands, as set by instruction selection, will be used by the custom inserter to prepend every operation with an appropriate vsetvli instruction that ensures the vector architecture is properly configured for the operation. Not in this patch: later passes will remove the redundant vsetvli instructions. Register classes of tuples of vector registers are used to represent vector register groups (LMUL > 1). Those pseudos are eventually lowered into the actual instructions when emitting the MCInsts. About the patch: Because there is a bit of initial infrastructure required, this is the minimal patch that allows us to select instructions for 3 LLVM IR instructions: load, add and store vectors of integers. LLVM IR operations have "whole-vector" semantics (as in they generate values for all the elements). Later patches will extend the information represented in TableGen. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Evandro Menezes <evandro.menezes@sifive.com> Co-Authored-by: Craig Topper <craig.topper@sifive.com> Differential Revision: https://reviews.llvm.org/D89449	2020-12-04 11:39:30 -08:00
Craig Topper	ad923edfc1	[RISCV] Add support for printing pcrel immediates as absolute addresses in llvm-objdump This makes the llvm-objdump output much more readable and closer to binutils objdump. This builds on D76591 It requires changing the OperandType for certain immediates to "OPERAND_PCREL" so tablegen will generate code to pass the instruction's address. This means we can't do the generic check on these instructions in verifyInstruction any more. Should I add it back with explicit opcode checks? Or should we add a new operand flag to control the passing of address instead of matching the name? Differential Revision: https://reviews.llvm.org/D92147	2020-12-04 10:34:12 -08:00
Craig Topper	3fcdf9ca78	[RISCV] Rename FPCCToExtend->FPOpToExpand and FPOpToExtend->FPOpToExpand. NFC These are used to call setOperationAction/setCondCodeAction with the Expand action so it seems that Expand is a better name than Extend.	2020-12-03 16:00:49 -08:00
Craig Topper	a18d5e3e9f	[RISCV] Merge FMV_H_X_RV32/FMV_H_X_RV64 into a single opcode. Same with FMV_X_ANYEXTH_RV32/RV64 Rather than having a different opcode for RV32 and RV64. Let's just say the integer type is XLenVT and use a single opcode for both modes. Differential Revision: https://reviews.llvm.org/D92538	2020-12-03 11:12:40 -08:00
Craig Topper	92c0d5d958	[RISCV] Remove RISCVMergeBaseOffsetOpt from the -O0 pass pipeline. Internally the pass skips any function with the optnone attribute. But that still requires checking each function. If the opt level is set to None we might as well just skip putting in the pipeline at all. This what is already done for many of the passes added by TargetPassConfig. Differential Revision: https://reviews.llvm.org/D92511	2020-12-03 09:58:25 -08:00
Craig Topper	e52a91e156	[RISCV] Add f16 to isFMAFasterThanFMulAndFAdd now that the Zfh extension is supported	2020-12-02 20:31:43 -08:00
Craig Topper	8b403243a8	[RISCV] Initialize MergeBaseOffsetOptPass so it will work with print-before/after-all. If its not in the PassRegistry it's not recognized as a pass when we print before/after. Happened to notice while I was working on a new pass.	2020-12-02 18:04:22 -08:00
Hsiangkai Wang	f7bc7c2981	[RISCV] Support Zfh half-precision floating-point extension. Support "Zfh" extension according to https://github.com/riscv/riscv-isa-manual/blob/zfh/src/zfh.tex Differential Revision: https://reviews.llvm.org/D90738	2020-12-03 09:16:33 +08:00
Fangrui Song	e27e3ba9c9	[RISCVAsmParser] Allow a SymbolRef operand to be a complex expression So that instructions like `lla a5, (0xFF + end) - 4` (supported by GNU as) can be parsed. Add a missing test that an operand like `foo + foo` is not allowed. Reviewed By: jrtc27 Differential Revision: https://reviews.llvm.org/D92293	2020-12-01 16:08:09 -08:00
Craig Topper	40659cd2c6	[RISCV] Rename RISCVGenSystemOperands.inc to RISCVGenSearchableTables.inc to prepare for more tables. NFC D89449 adds more tables so renaming as a pre-commit for that.	2020-11-30 20:47:58 -08:00
Craig Topper	bfc4f29f46	[RISCV] Combine (GORCI (GORCI x, C2), C1) -> (GORCI x, C1\|C2). Unlike GREVI, GORCI stages can't be undone, but they are redundant if done more than once. Differential Revision: https://reviews.llvm.org/D92295	2020-11-30 08:42:46 -08:00
Craig Topper	76d1026b59	[RISCV] Custom legalize bswap/bitreverse to GREVI with Zbp extension to enable them to combine with other GREVI instructions This enables bswap/bitreverse to combine with other GREVI patterns or each other without needing to add more special cases to the DAG combine or new DAG combines. I've also enabled the existing GREVI combine for GREVIW so that it can pick up the i32 bswap/bitreverse on RV64 after they've been type legalized to GREVIW. Differential Revision: https://reviews.llvm.org/D92253	2020-11-30 08:30:40 -08:00
Craig Topper	cbbd7021f1	[RISCV] Only combine (or (GREVI x, shamt), x) -> GORCI if shamt is a power of 2. GORCI performs an OR between each stage. So we need to ensure only one stage is active before doing this combine. Initial attempts at finding a test case for this failed due to the order things get combined. It's most likely that we'll form one stage of GREVI then combine to GORCI before the two stages of GREVI are able to be formed and combined with each other to form a multi stage GREVI. Differential Revision: https://reviews.llvm.org/D92289	2020-11-30 08:10:39 -08:00
Fangrui Song	e6db1416ae	[RISCV] Remove unused Addend parameter from classifySymbolRef. NFC It is confusing as well since in the case of A - B + Cst, the returned Addend is not Cst.	2020-11-29 19:17:59 -08:00
Craig Topper	84aad9b5da	[RISCV] Change predicate on InstAliases for GORCI/GREVI/SHFLI/UNSHFLI to HasStdExtZbp instead of HasStdExtZbbOrZbp. This matches the predicate on the instructions. Though I think some specific encodings are valid in Zbb, but not all of them.	2020-11-29 11:23:23 -08:00
Craig Topper	6ee22ca6ce	[RISCV] Add tests for existing (rotr (bswap X), (i32 16))->grevi pattern for RV32. Extend same pattern to rotl and GREVIW. Not sure why bswap was treated specially. This also applies to bitreverse or generic grevi. We can improve this in future patches. For now I just wanted to get the consistency and the test coverage as I plan to make some other changes around bswap.	2020-11-27 18:09:01 -08:00
Craig Topper	8709d9d872	[RISCV] Replace getSimpleValueType() with getValueType() in DAG combines to prevent asserts with weird types.	2020-11-27 12:49:12 -08:00
Craig Topper	f325b4bbce	[RISCV] Replace sexti32/zexti32 in isel patterns where only one part of their PatFrags can match. NFCI We had an zexti32 after a sign_extend_inreg. The AND X, 0xffffffff part of the zexti32 should never occur since SimplifyDemandedBits from the sign_extend_inreg would have removed it. We also had sexti32 as the root node of a pattern, but SelectionDAGISel matches assertsext early before the tablegen based patterns are evaluated.	2020-11-27 11:37:25 -08:00
Craig Topper	e0481048ab	[RISCV] Don't remove (and X, 0xffffffff) from inputs when matching RISCVISD::DIVUW/REMUW to 64-bit DIVU/REMU. These patterns are using zexti32 which matches either assertzexti32 or (and X, 0xffffffff). But if we match (and X, 0xffffffff) it will remove the AND and the inputs may no longer have the zero bits needed to guarantee the result has enough zeros. This commit changes the patterns to only match assertzexti32. I'm not sure how to test the broken case since the DIVUW/REMUW nodes are created during type legalization, but type legalization won't create an (and X, 0xfffffffff) directly on the inputs. I've also changed the zexti32 on the root of the pattern to just checking for AND. We were previously also matching assertzexti32, but I doubt that pattern would ever occur.	2020-11-26 23:15:41 -08:00
Craig Topper	5836e52063	[RISCV] Add isel patterns to use SBSET for (1 << X) by using X0 as the input.	2020-11-26 15:35:13 -08:00
Craig Topper	d9500c2e23	[RISCV] Add isel patterns for sbsetw/sbclrw/sbinvw with sext_inreg as the root. This handles cases were the input isn't known to be sign extended.	2020-11-26 02:03:06 -08:00
Craig Topper	2254e014a9	[RISCV] Add isel pattern to match (i64 (sra (shl X, 32), C)) to SRAIW if C > 32.	2020-11-25 21:57:48 -08:00
Craig Topper	f78ad68b6d	[RISCV] Remove unused PatFrag argument from the tablegen class used for c.beqz/c.bnez. NFC	2020-11-25 20:35:23 -08:00
Craig Topper	ed95cafbc5	[RISCV] Add an implementation of isFMAFasterThanFMulAndFAdd Start with an assumption that FMA is faster than Fmul+FAdd. If thats not true on some particular implementation we can add a tuning parameter in the future. I've update the fmuladd test cases and added new test cases for fast math flag based contraction. Differential Revision: https://reviews.llvm.org/D91987	2020-11-25 15:07:34 -08:00
Craig Topper	751b0d970e	[RISCV] Make SMIN/SMAX/UMIN/UMAX legal with Zbb extension. This is the logically correct thing to do. But it generates worse code for i32 umin/umax on the rv64 due to type legalize requesting zext even though the arguments are sext. Maybe we can teach type legalizer to use sext for umin/umax for RISCV. It's also producing possibly worse code on i64 on RV32 since we still end up with selects that become branches. But this seems like something we could improve in type legalization or DAG combine. Hopefully this makes D92095 work for RISCV with Zbb.	2020-11-25 12:48:43 -08:00
Craig Topper	c26e8697d7	[RISCV] Custom type legalize i32 fshl/fshr on RV64 with Zbt. This adds custom opcodes for FSLW/FSRW so we can type legalize fshl/fshr without needing to match a sign_extend_inreg. I've used the operand order from fshl/fshr to make the isel pattern similar to the non-W form. It was also hard to decide another order since the register instruction has the shift amount as the second operand, but the immediate instruction has it as the third operand. Differential Revision: https://reviews.llvm.org/D91479	2020-11-25 10:01:47 -08:00
Luís Marques	a8dc2110cd	[RISCV] Add GHC calling convention This is a special calling convention to be used by the GHC compiler. Patch by Andreas Schwab (schwab) Differential Revision: https://reviews.llvm.org/D89788	2020-11-24 22:35:23 +00:00
Luís Marques	e4d9380245	Revert "[RISCV] Add GHC calling convention" This reverts commit `f8317bb256` due to lack of proper attribution.	2020-11-24 22:34:20 +00:00
Luís Marques	f8317bb256	[RISCV] Add GHC calling convention This is a special calling convention to be used by the GHC compiler. Differential Revision: https://reviews.llvm.org/D89788	2020-11-24 21:56:28 +00:00
Fraser Cormack	ca1f2f2716	[RISCV] Combine GREVI sequences This combine step performs the following type of transformation: rev.p a0, a0 # grevi a0, a0, 0b01 rev2.n a0, a0 # grevi a0, a0, 0b10 --> rev.n a0, a0 # grevi a0, a0, 0b11 Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D91877	2020-11-24 12:07:13 +00:00
Craig Topper	03dab46d7f	[RISCV] Remove unused VM register class Nothing references this class today so it looks like some leftover. Differential Revision: https://reviews.llvm.org/D91977	2020-11-23 14:17:06 -08:00
Craig Topper	4252f7773a	[SelectionDAG][ARM][AArch64][Hexagon][RISCV][X86] Add SDNPCommutative to fma and fmad nodes in tablegen. Remove explicit commuted patterns from targets. X86 was already specially marking fma as commutable which allowed tablegen to autogenerate commuted patterns. This moves it to the target independent definition and fix up the targets to remove now unneeded patterns. Unfortunately, the tests change because the commuted version of the patterns are generating operands in a different than the explicit patterns. Differential Revision: https://reviews.llvm.org/D91842	2020-11-23 10:09:20 -08:00
Craig Topper	84b8222705	[RISCV] Use separate Lo and Hi MemOperands when expanding BuildPairF64Pseudo and SplitF64Pseudo. We generate two 4 byte loads or two stores as part of the expansion. Previously the MemOperand was set the same for both to cover the full 8 bytes. Now we set a separate 4 byte mem operand for each with a 4 byte offset for the high part.	2020-11-22 00:46:12 -08:00
Craig Topper	9211da4215	[RISCV] Put RV32 before RV64 in the ValueTypeByHwMode and RegInfoByHwMode lists in RISCVRegisterInfo.td Addresses post-commit feedback from `77e25b5bc8`	2020-11-20 12:10:21 -08:00
Craig Topper	77e25b5bc8	[RISCV] Remove RV32 HwMode. Use DefaultMode for RV32 Prior to this the DefaultMode was never selected, but RISCVGenDAGISel.inc, RISCVGenRegisterInfo.inc, RISCVGenGlobalISel.inc all ended up with extra table entries for that mode. This patch removes the RV32 and uses DefaultMode for RV32. This impressively reduces the size of my release+asserts llc binary by about 270K. About 15K from RISCVGenDAGISel.inc, 1-2K from RISCVGenRegisterInfo.inc, but the vast majority from RISCVGenGlobalISel.inc. Differential Revision: https://reviews.llvm.org/D90973	2020-11-20 11:16:06 -08:00
Craig Topper	6a1d8b91ed	[RISCV] Custom type legalize i32 bswap/bitreverse to GREVIW on RV64 with Zbp extension Previously we required a sra to pattern match these properly in isel. If the consumer didn't need the result sign extended we'll have an srl instead of sra and fail to match. This patch switches to custom legalizing to GREVIW using portions of D91259. Differential Revision: https://reviews.llvm.org/D91457	2020-11-20 10:41:01 -08:00
Craig Topper	78767b7f8e	[RISCV] Add RISCVISD::ROLW/RORW use those for custom legalizing i32 rotl/rotr on RV64IZbb. This should result in better utilization of RORIW since we don't need to look for a SIGN_EXTEND_INREG that may not exist. Also remove rotl/rotr isel matching to GREVI and just prefer RORI. This is to keep consistency so we don't have to match ROLW/RORW to GREVIW as well. I imagine RORI/RORIW performance will be the same or better than GREVI. Differential Revision: https://reviews.llvm.org/D91449	2020-11-20 10:25:47 -08:00
Fraser Cormack	1ac9b54831	[RISCV] Lower GREVI and GORCI as custom nodes This moves the recognition of GREVI and GORCI from TableGen patterns into a DAGCombine. This is done primarily to match "deeper" patterns in the future, like (grevi (grevi x, 1) 2) -> (grevi x, 3). TableGen is not best suited to matching patterns such as these as the compile time of the DAG matchers quickly gets out of hand due to the expansion of commutative permutations. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D91259	2020-11-19 18:11:42 +00:00
Craig Topper	6b0fc1f3c1	[RISCV] Add MemOperand to the instruction created by storeRegToStackSlot/loadRegFromStackSlot Differential Revision: https://reviews.llvm.org/D91730	2020-11-18 19:20:03 -08:00
Hsiangkai Wang	44cd03ad04	[RISCV] Use register class VR for V instruction operands directly. @tangxingxin1008 found a bug that regard vadd.vv v1, v3, a0 as a valid V instruction. We should remove the VRegAsmOperand operand class and use VR register class directly. Patched by: tangxingxin1008, Hsiangkai Differential Revision: https://reviews.llvm.org/D91712	2020-11-19 05:59:46 +08:00
Florian Hahn	b2f4c5fddc	[AsmWriter] Factor out mnemonic generation to accessible getMnemonic. This patch factors out the part of printInstruction that gets the mnemonic string for a given MCInst. This is intended to be used subsequently for the instruction-mix remarks to display the final mnemonic (D90040). Unfortunately making `getMnemonic` available to the AsmPrinter seems to require making it virtual. Not sure if there's a way around that with the current layering of the AsmPrinters. Reviewed By: Paul-C-Anagnostopoulos Differential Revision: https://reviews.llvm.org/D90039	2020-11-17 09:47:38 +00:00
Craig Topper	124c93c528	[RISCV] When matching SROIW, check all 64 bits of the OR mask We need to make sure the upper 32 bits are all ones to ensure the result is properly sign extended. Previously we only checked the lower 32 bits of the mask. I've also added a check that the shift amount is less than 32. Without that the original code asserts inside maskLeadingOnes if the SROI check is removed or the SROIW pattern is checked first. I've refactored the code to use early outs to reduce nesting. I've also updated SLOIW matching with the same changes, but I couldn't find a broken test case with the existing code. Differential Revision: https://reviews.llvm.org/D90961	2020-11-16 10:08:15 -08:00
Fraser Cormack	fe9dc2e54a	[RISCV] Use a macro to simplify getTargetNodeName Similar to the X86 and AMDGPU targets, this uses a macro to cut down on repetitive and error-prone code when converting RISCVISD node names to strings in getTargetNodeName. Reviewed By: asb Differential Revision: https://reviews.llvm.org/D91414	2020-11-16 09:33:47 +00:00
serge-sans-paille	9218ff50f9	llvmbuildectomy - replace llvm-build by plain cmake No longer rely on an external tool to build the llvm component layout. Instead, leverage the existing `add_llvm_componentlibrary` cmake function and introduce `add_llvm_component_group` to accurately describe component behavior. These function store extra properties in the created targets. These properties are processed once all components are defined to resolve library dependencies and produce the header expected by llvm-config. Differential Revision: https://reviews.llvm.org/D90848	2020-11-13 10:35:24 +01:00
Craig Topper	0add5f9122	[RISCV] Don't include CodeGen layer files in MC layer -Use MCRegister instead of Register in MC layer. -Move some enums from RISCVInstrInfo.h to RISCVBaseInfo.h to be with other TSFlags bits. Differential Revision: https://reviews.llvm.org/D91114	2020-11-12 07:45:38 -08:00
Craig Topper	9ca02d6fe1	[RISCV] Add an ANDI to shift amount of FSL/FSR instructions The fshl and fshr intrinsics are defined to modulo their shift amount by the bitwidth of one of their inputs. The FSR/FSL instructions read one extra bit from the shift amount. If that bit is set the inputs are swapped. In order to preserve the semantics of the llvm intrinsics we need to make sure that the extra bit isn't set. DAG combine or instcombine may have removed any mask that was originally present. We could be smarter here and try to use computeKnownBits to check if the bit is known zero, but wanted to start with correctness. Differential Revision: https://reviews.llvm.org/D90905	2020-11-12 07:33:40 -08:00
Craig Topper	637f19c36b	[RISCV] Remove traces of Glue from RISCVISD::SELECT_CC We were creating RISCVISD::SELECT_CC nodes with Glue output that was never being used, and the tablegen SDNode had the SDNPInGlue flag instead of the SDNPOutGlue flag. Since we don't seem to need the Glue just get rid of it from both places. Differential Revision: https://reviews.llvm.org/D91199	2020-11-11 09:30:48 -08:00
Craig Topper	70b481e8db	[RISCV] Add missing copyright header to RISCVBaseInfo.cpp. NFC	2020-11-10 11:33:08 -08:00
Craig Topper	5d3fd3df94	[RISCV] Make ctlz/cttz cheap to speculatively execute so CodeGenPrepare won't insert a zero check. Add additional isel patterns for ctzw/clzw instructions. Differential Revision: https://reviews.llvm.org/D91040	2020-11-09 10:13:45 -08:00
Craig Topper	a59076006b	[RISCV] Add isel patterns for using PACK for zext.h and zext.w. Differential Revision: https://reviews.llvm.org/D91024	2020-11-09 10:13:45 -08:00
Craig Topper	4265cbaa34	[RISCV] Make SIGN_EXTEND_INREG from i8/i16 legal when Zbb extension is enabled. This produces better code for sign extend to i64 on RV32 target. Differential Revision: https://reviews.llvm.org/D91023	2020-11-09 10:13:45 -08:00
Craig Topper	c0dd22e44a	[RISCV] Add isel patterns to match sbset/sbclr/sbinv/sbext even if the shift amount isn't masked. This uses the shiftop PatFrags to handle the masked shift amount and unmasked shift amount cases. That also checks XLen as part of the masked amount check so we don't need separate RV32 and RV64 patterns. Differential Revision: https://reviews.llvm.org/D91016	2020-11-09 09:55:26 -08:00
Craig Topper	19313ed580	[RISCV] Remove assertsexti32 from a couple B extension isel patterns that don't demanded the sign extended bits.	2020-11-07 22:43:16 -08:00
Craig Topper	c72358b77f	[RISCV] Use (not X) in instead of (xor X, -1) in isel patterns to improve readability. NFC	2020-11-07 11:50:52 -08:00
Craig Topper	741b04b0b7	[RISCV] Only enable GPR<->FPR32 bitconvert isel patterns on RV32. NFCI Bitconvert requires the bitwidth to match on both sides. On RV64 the GPR size is i64 so bitconvert between f32 isn't possible. The node should never be generated so the pattern won't ever match, but moving the patterns under IsRV32 makes it more obviously impossible. It also moves it to a similar location to the patterns for the custom nodes we use for RV64.	2020-11-05 16:15:25 -08:00
Craig Topper	defe11866a	[RISCV] Add isel patterns for fnmadd/fnmsub with an fneg on the second operand instead of the first. The multiply part of FMA is commutable, but TargetSelectionDAG.td doesn't have it marked as commutable so tablegen won't automatically create the additional patterns. So manually add commuted patterns.	2020-11-05 14:00:25 -08:00
Craig Topper	ce5f4f22e9	[RISCV] Use the 'si' lib call for (double (fp_to_sint/uint i32 X)) when F extension is enabled. D80526 added custom lowering to pick the si lib call on RV64, but this custom handling is only enabled when the F and D extension are both disabled. This prevents the si library call from being used for double when F is enabled but D is not. This patch changes the behavior so we always enable the Custom hook on RV64 and decide in ReplaceNodeResults if we should emit a libcall based on whether the FP type should be softened or not. Differential Revision: https://reviews.llvm.org/D90817	2020-11-05 10:46:45 -08:00
Craig Topper	ce1270fc7e	[RISCV] Remove shadow register list passed to AllocateReg when allocating FP registers for calling convention The _F and _D registers are already sub/super registers. When one gets allocated all its aliases are already marked as allocated. We don't need to explicitly shadow it too. I believe shadow is for calling conventions like 64-bit Windows on X86 where have rules like this CCIfType<[i32], CCAssignToRegWithShadow<[ECX , EDX , R8D , R9D ], [XMM0, XMM1, XMM2, XMM3]>> For that calling convention the argument number determines which register is used regardless of how many scalars or vectors came before it. Removing this removes a question I had in D90738. Differential Revision: https://reviews.llvm.org/D90801	2020-11-05 09:49:42 -08:00
Craig Topper	c623584b6f	[RISCV] Add isel patterns for fshl with immediate to select FSRI/FSRIW There is no FSLI instruction, but we can emulate it using FSRI by swapping operands and subtracting the immediate from the bitwidth. Differential Revision: https://reviews.llvm.org/D90826	2020-11-05 09:37:43 -08:00
Sander de Smalen	d57bba7cf8	[SVE] Return StackOffset for TargetFrameLowering::getFrameIndexReference. To accommodate frame layouts that have both fixed and scalable objects on the stack, describing a stack location or offset using a pointer + uint64_t is not sufficient. For this reason, we've introduced the StackOffset class, which models both the fixed- and scalable sized offsets. The TargetFrameLowering::getFrameIndexReference is made to return a StackOffset, so that this can be used in other interfaces, such as to eliminate frame indices in PEI or to emit Debug locations for variables on the stack. This patch is purely mechanical and doesn't change the behaviour of how the result of this function is used for fixed-sized offsets. The patch adds various checks to assert that the offset has no scalable component, as frame offsets with a scalable component are not yet supported in various places. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D90018	2020-11-05 11:02:18 +00:00
Craig Topper	cc3bf27077	[RISCV] Remove assertsexti32 from fslw/fsrw isel patterns. The operations in these patterns shouldn't be effected by sign bits. And the pattern is starting from a sign_extend_inreg so we aren't expecting sign bits to be passed through either. Differential Revision: https://reviews.llvm.org/D90739	2020-11-04 11:37:58 -08:00
Craig Topper	d47300f503	[RISCV] Correct the operand order for fshl/fshr to fsl/fsr instructions. fsl/fsr take their shift amount in $rs2 or an immediate. The sources are $rs1 and $rs3. fshl/fshr ISD opcodes both concatenate operand 0 in the high bits and operand 1 in the lower bits. fshl returns the high bits after shifting and fshr returns the low bits. So a shift amount of 0 returns operand 0 for fshl and operand 1 for fshr. fsl/fsr concatenate their operands in different orders such that $rs1 will be returned for a shift amount of 0. So $rs1 needs to come from operand 0 of fshl and operand 1 of fshr. Differential Revision: https://reviews.llvm.org/D90735	2020-11-04 11:13:25 -08:00
Craig Topper	0122a4ea66	[RISCV] Remove assertsexti32 from inputs to riscv_sllw/srlw nodes in B extension isel patterns. riscv_sllw/srlw only reads the lower 32 bits of the first operand. And the lower 5 bits of the second operands. Whether the upper 32 bits of the input are sign bits or not doesn't matter. Also use ineg and not to shorten the patterns. Differential Revision: https://reviews.llvm.org/D90668	2020-11-04 10:35:05 -08:00
Craig Topper	857563eaf0	[RISCV] Check all 64-bits of the mask in SelectRORIW. We need to ensure the upper 32 bits of the mask are zero. So that the srl shifts zeroes into the lower 32 bits. Differential Revision: https://reviews.llvm.org/D90585	2020-11-04 10:15:30 -08:00
Craig Topper	3701e33a22	[RISCV] Remove custom isel for (srl (shl val, 32), imm). Use pattern instead. NFCI We don't need custom matching, we just a need a predicate to check the immediate is greater than 32. We can use the existing ImmSub32 to adjust the immediate. I've also used the new predicate in the other location that used ImmSub32. I tried to create a test case where we would break without the greater than 32 check on that pattern, but DAG combine defeated me. Still seemed safer to have it. Differential Revision: https://reviews.llvm.org/D90546	2020-11-04 09:59:14 -08:00
Craig Topper	00eff96e1d	[RISCV] Add missing patterns for rotr with immediate for Zbb/Zbp extensions. DAGCombine doesn't canonicalize rotl/rotr with immediate so we need patterns for both. Remove the custom matcher for rotl to RORI and just use a SDNodeXForm to convert the immediate instead. Doing this gives priority to the rev32/rev16 versions of grevi over rori since an explicit immediate is more precise than any immediate. I also added rotr patterns for rev32/rev16. And removed the (or (shl), (shr)) patterns that should be combined to rotl by DAG combine. There is at least one other grev pattern that probably needs a another rotr pattern, but we need more test coverage first. Differential Revision: https://reviews.llvm.org/D90575	2020-11-03 10:04:52 -08:00
Craig Topper	46e91f6701	[RISCV] Remove isel patterns for fshl/fshr with same inputs. NFC These were being selected to ROL/ROR, but DAG combine should canonicalize fshl/fshr with same inputs to rotl/rotr which we also have patterns for.	2020-11-02 23:12:18 -08:00
Jessica Clarke	7601a21738	[RISCV] Only return DestSourcePair from isCopyInstrImpl for registers ADDI often has a frameindex in operand 1, but consumers of this interface, such as MachineSink, tend to call getReg() on the Destination and Source operands, leading to the following crash when building FreeBSD after this implementation was added in 8cf6778d30: ``` clang: llvm/include/llvm/CodeGen/MachineOperand.h:359: llvm::Register llvm::MachineOperand::getReg() const: Assertion `isReg() && "This is not a register operand!"' failed. PLEASE submit a bug report to https://bugs.llvm.org/ and include the crash backtrace, preprocessed source, and associated run script. Stack dump: #0 0x00007f4286f9b4d0 llvm::sys::PrintStackTrace(llvm::raw_ostream&, int) llvm/lib/Support/Unix/Signals.inc:563:0 #1 0x00007f4286f9b587 PrintStackTraceSignalHandler(void) llvm/lib/Support/Unix/Signals.inc:630:0 #2 0x00007f4286f9926b llvm::sys::RunSignalHandlers() llvm/lib/Support/Signals.cpp:71:0 #3 0x00007f4286f9ae52 SignalHandler(int) llvm/lib/Support/Unix/Signals.inc:405:0 #4 0x00007f428646ffd0 (/lib/x86_64-linux-gnu/libc.so.6+0x3efd0) #5 0x00007f428646ff47 raise /build/glibc-2ORdQG/glibc-2.27/signal/../sysdeps/unix/sysv/linux/raise.c:51:0 #6 0x00007f42864718b1 abort /build/glibc-2ORdQG/glibc-2.27/stdlib/abort.c:81:0 #7 0x00007f428646142a __assert_fail_base /build/glibc-2ORdQG/glibc-2.27/assert/assert.c:89:0 #8 0x00007f42864614a2 (/lib/x86_64-linux-gnu/libc.so.6+0x304a2) #9 0x00007f428d4078e2 llvm::MachineOperand::getReg() const llvm/include/llvm/CodeGen/MachineOperand.h:359:0 #10 0x00007f428d8260e7 attemptDebugCopyProp(llvm::MachineInstr&, llvm::MachineInstr&) llvm/lib/CodeGen/MachineSink.cpp:862:0 #11 0x00007f428d826442 performSink(llvm::MachineInstr&, llvm::MachineBasicBlock&, llvm::MachineInstrBundleIterator<llvm::MachineInstr, false>, llvm::SmallVectorImpl<llvm::MachineInstr>&) llvm/lib/CodeGen/MachineSink.cpp:918:0 #12 0x00007f428d826e27 (anonymous namespace)::MachineSinking::SinkInstruction(llvm::MachineInstr&, bool&, std::map<llvm::MachineBasicBlock, llvm::SmallVector<llvm::MachineBasicBlock, 4u>, std::less<llvm::MachineBasicBlock>, std::allocator<std::pair<llvm::MachineBasicBlock const, llvm::SmallVector<llvm::MachineBasicBlock*, 4u> > > >&) llvm/lib/CodeGen/MachineSink.cpp:1073:0 #13 0x00007f428d824a2c (anonymous namespace)::MachineSinking::ProcessBlock(llvm::MachineBasicBlock&) llvm/lib/CodeGen/MachineSink.cpp:410:0 #14 0x00007f428d824513 (anonymous namespace)::MachineSinking::runOnMachineFunction(llvm::MachineFunction&) llvm/lib/CodeGen/MachineSink.cpp:340:0 ``` Thus, check that operand 1 is also a register in the condition. Reviewed By: arichardson, luismarques Differential Revision: https://reviews.llvm.org/D89090	2020-11-03 03:55:47 +00:00
Craig Topper	9ac2910093	[RISCV] Make SelectRORIW handle the commutability of OR. The SHL and SRL could be in opposite order so account for that. Differential Revision: https://reviews.llvm.org/D90586	2020-11-02 09:32:54 -08:00
Craig Topper	7142ec3aaf	[RISCV] When matching RORIW, make sure the same input is given to both shifts. The code is looking for (sext_inreg (or (shl X, C2), (shr (and Y, C3), C1))). We need to ensure X and Y are the same. Differential Revision: https://reviews.llvm.org/D90580	2020-11-02 09:12:40 -08:00
Simon Pilgrim	36920d5f9d	[RISCV] Avoid std::pair<> in FPReg StringSwitch to avoid MSVC compile failures. NFCI. As discussed on D90322, some MSVC builds are failing with is_trivially_copyable static asserts (see D86126) - we can avoid this by not using the std::pair<unsigned,unsigned> which held both the FP+DP Registers, just handle the FP register and convert to DP on the fly.	2020-11-02 11:30:57 +00:00
Craig Topper	e57237f198	Recommit "[RISCV] Remove include of RISCVRegisterInfo.h from RISCVBaseInfo.h. NFCI" This reverts `781917254d` and recommits `781917254d`. I've changed getRegForInlineAsmConstraint to not use a std::pair of Register in a previous commit. Hopefully that fixes the reported issue with expensive checks on Windows. I'm still not sure exactly why this commit removing an include affected a different file. Original message: RISCVRegisterInfo.h is part of the CodeGen layer. The Utils library is intended to be shared with the MC layer so shouldn't use files from the CodeGen layer. The register enum names are already available from RISCVMCTargetDesc.h. It appears what was coming from this include was a transitive include of the Register class which I've replaced with MCRegister. Register has a constructor from MCRegister so it should be convertible.	2020-11-01 10:35:37 -08:00
Craig Topper	a76cd10fcd	[RISCV] Use 'unsigned' instead of Register in getRegForInlineAsmConstraint. NFC The return value of this interface still uses an 'unsigned' on all targets. So we convert Register back to unsigned at the end. I'm hoping this will prevent the issue that caused the revert of D90322.	2020-11-01 10:16:52 -08:00
Craig Topper	6915c76e10	[RISCV] Don't use DCI.CombineTo to replace a single result. NFCI Just return the new node, which is the standard practice. I also noticed what appeared to be an unnecessary attempt at creating an ANY_EXTEND where the type should already be correct. I replace with an assert to verify the type. Differential Revision: https://reviews.llvm.org/D90444	2020-10-30 10:46:32 -07:00
Simon Pilgrim	781917254d	Revert rG22c383763456 "[RISCV] Remove include of RISCVRegisterInfo.h from RISCVBaseInfo.h" This reverts commit `22c3837634`. This is causing a build failure with MSVC - reported on D90322	2020-10-30 11:59:37 +00:00
Craig Topper	74b078294f	[RISCV] Improve worklist management in the DAG combine for SLLW/SRLW/SRAW This combine makes two calls to SimplifyDemandedBits, one for the LHS and one for the RHS. If the LHS call returns true, we don't make the RHS call. When SimplifyDemandedBits makes a change, it will add the nodes around the change to the DAG combiner worklist. If the simplification happens on the first recursion step, the N will get added to the worklist. But if the simplification happens deeper in the recursion, then N will not be revisited until the next time the DAG combiner runs. This patch explicitly addes N to the worklist anytime a Simplification is made. Without this we might miss additional simplifications on the LHS or never simplify the RHS. Special care also needs to be taken to not add N if it has been CSEd by the simplification. There are similar examples in DAGCombiner and the X86 target, but I don't have a test for it for RISC-V. I've also returned SDValue(N, 0) instead of SDValue() so DAGCombiner knows a change was made and will update its Statistic variable. The test here was constructed so that 2 simplifications happen to the LHS. Without this fix one happens in the post type legalization DAG combine and the other happens after LegalizeDAG. This prevents the RHS from ever being simplified causing the left and right shift to clear the upper 32 bits of the RHS to be left behind. Differential Revision: https://reviews.llvm.org/D90339	2020-10-29 14:52:53 -07:00
Craig Topper	22c3837634	[RISCV] Remove include of RISCVRegisterInfo.h from RISCVBaseInfo.h RISCVRegisterInfo.h is part of the CodeGen layer. The Utils library is intended to be shared with the MC layer so shouldn't use files from the CodeGen layer. The register enum names are already available from RISCVMCTargetDesc.h. It appears what was coming from this include was a transitive include of the Register class which I've replaced with MCRegister. Register has a constructor from MCRegister so it should be convertible.	2020-10-29 11:39:19 -07:00
Evandro Menezes	fe9a7d9627	[RISCV] Use the commercial name for scheduling model (NFC) Use the commercial name for the scheduling model for the SiFive 7 Series.	2020-10-23 16:33:27 -05:00
Kito Cheng	cfa7094e49	[RISCV] Add -mtune support - The goal of this patch is improve option compatible with RISCV-V GCC, -mcpu support on GCC side will sent patch in next few days. - -mtune only affect the pipeline model and non-arch/extension related target feature, e.g. instruction fusion; in td file it called TuneFeatures, which is introduced by X86 back-end[1]. - -mtune accept all valid option for -mcpu and extra alias processor option, e.g. `generic`, `rocket` and `sifive-7-series`, the purpose is option compatible with RISCV-V GCC. - Processor alias for -mtune will resolve according the current target arch, rv32 or rv64, e.g. `rocket` will resolve to `rocket-rv32` or `rocket-rv64`. - Interaction between -mcpu and -mtune: * -mtune has higher priority than -mcpu for pipeline model and TuneFeatures. [1] https://reviews.llvm.org/D85165 Reviewed By: luismarques Differential Revision: https://reviews.llvm.org/D89025	2020-10-16 13:55:08 +08:00
sunshaoce	2de693756f	[RISCV] fix a mistake in RISCVInstrInfoV.td A commit of VALUVVNoVm was wrong, fixed it. Reviewed By: HsiangKai Differential Revision: https://reviews.llvm.org/D88142	2020-10-15 23:16:53 +08:00
Evandro Menezes	5d6d8a2769	[RISCV] Add SiFive cores to the CPU option Add the SiFive cores E76 and U74 using the SiFive 7 series microarchitecture. Differential Revision: https://reviews.llvm.org/D88759	2020-10-05 15:50:57 -05:00
Evandro Menezes	ed88d96295	[RISCV] Use the extensions in the canonical order (NFC) Fix a mistake in the ordering.	2020-10-05 15:50:57 -05:00
Hsiangkai Wang	067add7b5f	[RISCV] Support vmsge.vx and vmsgeu.vx pseudo instructions in RVV. Implement vmsge{u}.vx pseudo instruction. According to RISC-V V specification, there are different scenarios for this pseudo instruction. I list them below. unmasked va >= x pseudoinstruction: vmsge{u}.vx vd, va, x expansion: vmslt{u}.vx vd, va, x; vmnand.mm vd, vd, vd masked va >= x, vd != v0 pseudoinstruction: vmsge{u}.vx vd, va, x, v0.t expansion: vmslt{u}.vx vd, va, x, v0.t; vmxor.mm vd, vd, v0 masked va >= x, vd == v0 pseudoinstruction: vmsge{u}.vx vd, va, x, v0.t, vt expansion: vmslt{u}.vx vt, va, x; vmandnot.mm vd, vd, vt Use pseudo instruction to model vmsge{u}.vx. The pseudo instruction will convert to different expansion according to the condition. Differential Revision: https://reviews.llvm.org/D84732	2020-10-02 17:20:34 +08:00
Evandro Menezes	c6b18cf967	[RISCV] Use the extensions in the canonical order (NFC) Use the ISA extensions for specific processors in the conventional canonical order.	2020-09-29 20:03:02 -05:00
Michael Collison	764c1b7a4d	[RISCV] Scheduler description for Bullet Add the pipeline model for the RISC-V Bullet micro architecture. Co-authored-by: Evandro Menezes <evandro.menezes@sifive.com>	2020-09-25 18:36:53 -05:00
Evandro Menezes	0291c471aa	[RISCV] Fix formatting (NFC)	2020-09-25 18:15:04 -05:00
Evandro Menezes	1e66e723eb	[RISCV] Merge the pipeline models for Rocket Merge the 32 and 64 bit pipeline models for Rocket into a single file. Differential Revision: https://reviews.llvm.org/D87873	2020-09-24 17:30:40 -05:00
Meera Nakrani	a3d0dce260	[ARM][TTI] Prevents constants in a min(max) or max(min) pattern from being hoisted when in a loop Changes TTI function getIntImmCostInst to take an additional Instruction parameter, which enables us to be able to check it is part of a min(max())/max(min()) pattern that will match SSAT. We can then mark the constant used as free to prevent it being hoisted so SSAT can still be generated. Required minor changes in some non-ARM backends to allow for the optional parameter to be included. Differential Revision: https://reviews.llvm.org/D87457	2020-09-22 11:54:10 +00:00
Evandro Menezes	394d020167	[RISCV] Do not mandate scheduling for CSR instructions Scheduling information is of little value when they may disrupt the pipeline. This patch allows omitting the scheduling information for CSR instructions while still setting `SchedMachineModel::CompleteModel`. For specific cases, any scheduling information added will be used by the scheduler. Differential revision: https://reviews.llvm.org/D85366	2020-09-21 18:24:53 -05:00
Alex Richardson	8cf6778d30	[RISC-V] Implement RISCVInstrInfo::isCopyInstrImpl() This does not result in changes for any of the current tests, but it might improve debug information in some cases. Reviewed By: luismarques Differential Revision: https://reviews.llvm.org/D86522	2020-09-21 10:21:11 +01:00
Zhaoshi Zheng	1c466477ad	[RISCV] Support Shadow Call Stack Currenlty assume x18 is used as pointer to shadow call stack. User shall pass flags: "-fsanitize=shadow-call-stack -ffixed-x18" Runtime supported is needed to setup x18. If SCS is desired, all parts of the program should be built with -ffixed-x18 to maintain inter-operatability. There's no particuluar reason that we must use x18 as SCS pointer. Any register may be used, as long as it does not have designated purpose already, like RA or passing call arguments. Differential Revision: https://reviews.llvm.org/D84414	2020-09-17 16:02:35 -07:00
Amara Emerson	e5784ef8f6	[GlobalISel] Enable usage of BranchProbabilityInfo in IRTranslator. We weren't using this before, so none of the MachineFunction CFG edges had the branch probability information added. As a result, block placement later in the pipeline was flying blind. This is enabled only with optimizations enabled like SelectionDAG. Differential Revision: https://reviews.llvm.org/D86824	2020-09-09 14:31:12 -07:00
Simon Pilgrim	0dacf3b5ac	RISCVMatInt.h - remove unnecessary includes. NFCI. Add APInt forward declaration and move include to RISCVMatInt.cpp	2020-09-08 18:25:24 +01:00
Ben Shi	c5716447c1	[NFC][RISCV] Simplify pass arg of RISCVMergeBaseOffsetOpt Reviewed By: lenary, asb Differential Revision: https://reviews.llvm.org/D87069	2020-09-03 20:01:23 +08:00
Craig Topper	aab90384a3	[Attributes] Add a method to check if an Attribute has AttrKind None. Use instead of hasAttribute(Attribute::None) There's a special case in hasAttribute for None when pImpl is null. If pImpl is not null we dispatch to pImpl->hasAttribute which will always return false for Attribute::None. So if we just want to check for None its sufficient to just check that pImpl is null. Which can even be done inline. This patch adds a helper for that case which I hope will speed up our getSubtargetImpl implementations. Differential Revision: https://reviews.llvm.org/D86744	2020-08-28 13:23:45 -07:00
Alex Richardson	5ba4d0365b	[RISC-V] fmv.s/fmv.d should be as cheap as a move Since the canonical floatig-point move is fsgnj rd, rs, rs, we should handle this case in RISCVInstrInfo::isAsCheapAsAMove(). Reviewed By: lenary Differential Revision: https://reviews.llvm.org/D86518	2020-08-27 10:32:23 +01:00
Alex Richardson	a11eeb4d4a	[RISC-V] Mark C_MV as a move instruction Reviewed By: luismarques Differential Revision: https://reviews.llvm.org/D86517	2020-08-27 10:32:23 +01:00
Alex Richardson	2259ce8c91	[RISC-V] ADDI/ORI/XORI x, 0 should be as cheap as a move The isTriviallyRematerializable hook is only called for instructions that are tagged as isAsCheapAsAMove. Since ADDI 0 is used for "mv" it should definitely be marked with "isAsCheapAsAMove". This change avoids one stack spill in most of the atomic-rmw.ll tests functions. It also avoids stack spills in two of our out-of-tree CHERI tests. ORI/XORI with zero may or may not be the same as a move micro-architecturally, but since we are already doing it for register == x0, we might as well do the same if the immediate is zero. Reviewed By: luismarques Differential Revision: https://reviews.llvm.org/D86480	2020-08-27 10:32:22 +01:00
luxufan	888c02deee	[RISCV] add the MC layer support of riscv vector Zvamo extension Implements the assemble and disassemble support of RISCV Vector extension zvamo instructions, base on the 0.9 spec version. Reviewed by HsiangKai Differential Revision: https://reviews.llvm.org/D85069	2020-08-27 14:11:38 +08:00
lewis-revill	9e6c09c0d9	[RISCV] Fix inaccurate annotations on PseudoBRIND PseudoBRIND had seemingly inherited incorrect annotations denoting it as a call instruction and that it defines X1/ra. This caused excess save/restore code to be emitted for ra. Differential Revision: https://reviews.llvm.org/D86286	2020-08-21 11:38:42 +01:00
Jay Foad	0819a6416f	[SelectionDAG] Better legalization for FSHL and FSHR In SelectionDAGBuilder always translate the fshl and fshr intrinsics to FSHL and FSHR (or ROTL and ROTR) instead of lowering them to shifts and ORs. Improve the legalization of FSHL and FSHR to avoid code quality regressions. Differential Revision: https://reviews.llvm.org/D77152	2020-08-21 10:32:49 +01:00
Jessica Clarke	3149ec07c0	[RISCV] Enable MCCodeEmitter instruction predicate verifier This ensures that we never encode an instruction which is unavailable, such as if we explicitly insert a forbidden instruction when lowering. This is particularly important on RISC-V given its high degree of modularity, and will become increasingly important as new standard extensions appear. Reviewed By: asb, lenary Differential Revision: https://reviews.llvm.org/D85015	2020-08-20 18:36:54 +01:00
luxufan	6c5039a10f	[RISCV] add the assemble and disassemble support of Zvlsseg instructions This implements the assemble and disassemble support of RISCV Vector extension Zvlsseg instructions, base on the 0.9 spec version. Reviewed by HsiangKai Differential Revision: https://reviews.llvm.org/D84416	2020-08-19 16:22:25 +08:00
Sam Elliott	3f7068ad98	[RISCV] Enable the use of the old mucounteren name The RISC-V Privileged Specification 1.11 defines `mcountinhibit`, which has the same numeric CSR value as `mucounteren` from 1.09.1. This patch enables the use of the old `mucounteren` name. Patch by Yuichi Sugiyama. Reviewed By: lenary, jrtc27, pzheng Differential Revision: https://reviews.llvm.org/D85067	2020-08-17 13:11:49 +01:00
Sam Elliott	5f9ecc5d85	[RISCV] Indirect branch generation in position independent code This fixes the "Unable to insert indirect branch" fatal error sometimes seen when generating position-independent code. Patch by msizanoen1 Reviewed By: jrtc27 Differential Revision: https://reviews.llvm.org/D84833	2020-08-17 13:09:26 +01:00
Craig Topper	c7a0b2684f	[X86][MC][Target] Initial backend support a tune CPU to support -mtune This patch implements initial backend support for a -mtune CPU controlled by a "tune-cpu" function attribute. If the attribute is not present X86 will use the resolved CPU from target-cpu attribute or command line. This patch adds MC layer support a tune CPU. Each CPU now has two sets of features stored in their GenSubtargetInfo.inc tables . These features lists are passed separately to the Processor and ProcessorModel classes in tablegen. The tune list defaults to an empty list to avoid changes to non-X86. This annoyingly increases the size of static tables on all target as we now store 24 more bytes per CPU. I haven't quantified the overall impact, but I can if we're concerned. One new test is added to X86 to show a few tuning features with mismatched tune-cpu and target-cpu/target-feature attributes to demonstrate independent control. Another new test is added to demonstrate that the scheduler model follows the tune CPU. I have not added a -mtune to llc/opt or MC layer command line yet. With no attributes we'll just use the -mcpu for both. MC layer tools will always follow the normal CPU for tuning. Differential Revision: https://reviews.llvm.org/D85165	2020-08-14 15:31:50 -07:00
StephenFan	a96921afa7	[RISCV] eliminate the repetition declare of SDLoc DL Differential revision: https://reviews.llvm.org/D85002	2020-08-03 10:24:30 +08:00
Hsiangkai Wang	47a4a27f47	Upgrade MC to v0.9. Differential revision: https://reviews.llvm.org/D80802	2020-08-01 07:42:06 +08:00
Yuanfang Chen	ca1e69a675	[NFC] remove unused includes of SelectionDAGISel.h	2020-07-20 10:43:29 -07:00
Zakk Chen	294d1eae75	[RISCV] Add support for -mcpu option. Summary: 1. gcc uses `-march` and `-mtune` flag to chose arch and pipeline model, but clang does not have `-mtune` flag, we uses `-mcpu` to chose both infos. 2. Add SiFive e31 and u54 cpu which have default march and pipeline model. 3. Specific `-mcpu` with rocket-rv[32\|64] would select pipeline model only, and use the driver's arch choosing logic to get default arch. Reviewers: lenary, asb, evandro, HsiangKai Reviewed By: lenary, asb, evandro Tags: #llvm, #clang Differential Revision: https://reviews.llvm.org/D71124	2020-07-16 11:46:22 -07:00
lewis-revill	c9c955ada8	[RISCV] Add matching of codegen patterns to RISCV Bit Manipulation Zbt asm instructions This patch provides optimization of bit manipulation operations by enabling the +experimental-b target feature. It adds matching of single block patterns of instructions to specific bit-manip instructions from the ternary subset (zbt subextension) of the experimental B extension of RISC-V. It adds also the correspondent codegen tests. This patch is based on Claire Wolf's proposal for the bit manipulation extension of RISCV: https://github.com/riscv/riscv-bitmanip/blob/master/bitmanip-0.92.pdf Differential Revision: https://reviews.llvm.org/D79875	2020-07-15 12:19:34 +01:00
lewis-revill	d4be33374c	[RISCV] Add matching of codegen patterns to RISCV Bit Manipulation Zbs asm instructions This patch provides optimization of bit manipulation operations by enabling the +experimental-b target feature. It adds matching of single block patterns of instructions to specific bit-manip instructions from the single-bit subset (zbs subextension) of the experimental B extension of RISC-V. It adds also the correspondent codegen tests. This patch is based on Claire Wolf's proposal for the bit manipulation extension of RISCV: https://github.com/riscv/riscv-bitmanip/blob/master/bitmanip-0.92.pdf Differential Revision: https://reviews.llvm.org/D79874	2020-07-15 12:19:34 +01:00
lewis-revill	6144f0a1e5	[RISCV] Add matching of codegen patterns to RISCV Bit Manipulation Zbbp asm instructions This patch provides optimization of bit manipulation operations by enabling the +experimental-b target feature. It adds matching of single block patterns of instructions to specific bit-manip instructions belonging to both the permutation and the base subsets of the experimental B extension of RISC-V. It adds also the correspondent codegen tests. This patch is based on Claire Wolf's proposal for the bit manipulation extension of RISCV: https://github.com/riscv/riscv-bitmanip/blob/master/bitmanip-0.92.pdf Differential Revision: https://reviews.llvm.org/D79873	2020-07-15 12:19:34 +01:00
lewis-revill	31b52b4345	[RISCV] Add matching of codegen patterns to RISCV Bit Manipulation Zbp asm instructions This patch provides optimization of bit manipulation operations by enabling the +experimental-b target feature. It adds matching of single block patterns of instructions to specific bit-manip instructions from the permutation subset (zbp subextension) of the experimental B extension of RISC-V. It adds also the correspondent codegen tests. This patch is based on Claire Wolf's proposal for the bit manipulation extension of RISCV: https://github.com/riscv/riscv-bitmanip/blob/master/bitmanip-0.92.pdf Differential Revision: https://reviews.llvm.org/D79871	2020-07-15 12:19:34 +01:00
lewis-revill	e2692f0ee7	[RISCV] Add matching of codegen patterns to RISCV Bit Manipulation Zbb asm instructions This patch provides optimization of bit manipulation operations by enabling the +experimental-b target feature. It adds matching of single block patterns of instructions to specific bit-manip instructions from the base subset (zbb subextension) of the experimental B extension of RISC-V. It adds also the correspondent codegen tests. This patch is based on Claire Wolf's proposal for the bit manipulation extension of RISCV: https://github.com/riscv/riscv-bitmanip/blob/master/bitmanip-0.92.pdf Differential Revision: https://reviews.llvm.org/D79870	2020-07-15 12:19:34 +01:00
Jessica Clarke	2dc16fbdf0	[RISCV] Duplicate pseudo expansion comment to RISCVMCCodeEmitter Follow-on from D77443. Although we're not fixing any of these pseudo-instructions, the potential for them to be out of sync still exists.	2020-07-15 10:52:42 +01:00
Jessica Clarke	3382c243ba	[RISCV] Fix RISCVInstrInfo::getInstSizeInBytes for atomics pseudos Summary: Without these, the generic branch relaxation pass will underestimate the range required for branches spanning these and we can end up with "fixup value out of range" errors rather than relaxing the branches. Some of the instructions in the expansion may end up being compressed but exactly determining that is awkward, and these conservative values should be safe, if slightly suboptimal in rare cases. Reviewers: asb, lenary, luismarques, lewis-revill Reviewed By: asb, luismarques Subscribers: hiraditya, rbar, johnrusso, simoncook, sabuasal, niosHD, kito-cheng, shiva0217, MaskRay, zzheng, edward-jones, rogfer01, MartinMosbeck, brucehoult, the_o, rkruppe, jfb, PkmX, jocewei, psnobl, benna, Jim, s.egerton, pzheng, sameer.abuasal, apazos, evandro, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77443	2020-07-15 10:50:55 +01:00
Roger Ferrer Ibanez	0cbdd2a82a	[RISCV] Fix isStoreToStackSlot Because of the layout of stores (that don't have a destination operand) this check is exactly the same as the one in RISCVInstrInfo::isLoadFromStackSlot. Differential Revision: https://reviews.llvm.org/D81805	2020-07-14 12:36:42 +00:00
Sam Elliott	1d15bbb9d9	Revert "[RISCV] Avoid Splitting MBB in RISCVExpandPseudo" This reverts commit `97106f9d80`. This is based on feedback from https://reviews.llvm.org/D82988#2147105	2020-07-14 11:15:01 +01:00
Matt Arsenault	db091e12b2	RISCV: Avoid GlobalISel build break in a future patch The GlobalISelEmitter is stricter about matching timm instruction outputs to timm inputs (although in an accidental sort of way that doesn't hit a proper import failure error). Also, apparently no intrinsic patterns were importing since the ID enum declaration was missing.	2020-07-13 14:01:57 -04:00
Fangrui Song	4d5fd0ee5e	[MC][RISCV] Set UseIntegratedAssembler to true to align with most other targets. Also, -fintegrated-as is the default for clang -target riscv*.	2020-07-12 21:04:48 -07:00
Zakk Chen	04b9a46c84	[RISCV] Refactor FeatureRVCHints to make ProcessorModel more intuitive Reviewers: luismarques, asb, evandro Reviewed By: asb, evandro Tags: #llvm Differential Revision: https://reviews.llvm.org/D77030	2020-07-09 23:07:39 -07:00
Sam Elliott	97106f9d80	[RISCV] Avoid Splitting MBB in RISCVExpandPseudo Since the `RISCVExpandPseudo` pass has been split from `RISCVExpandAtomicPseudo` pass, it would be nice to run the former as early as possible (The latter has to be run as late as possible to ensure correctness). Running earlier means we can reschedule these pairs as we see fit. Running earlier in the machine pass pipeline is good, but would mean teaching many more passes about `hasLabelMustBeEmitted`. Splitting the basic blocks also pessimises possible optimisations because some optimisations are MBB-local, and others are disabled if the block has its address taken (which is notionally what `hasLabelMustBeEmitted` means). This patch uses a new approach of setting the pre-instruction symbol on the AUIPC instruction to a temporary symbol and referencing that. This avoids splitting the basic block, but allows us to reference exactly the instruction that we need to. Notionally, this approach seems more correct because we do actually want to address a specific instruction. This then allows the pass to be moved much earlier in the pass pipeline, before both scheduling and register allocation. However, to do so we must leave the MIR in SSA form (by not redefining registers), and so use a virtual register for the intermediate value. By using this virtual register, this pass now has to come before register allocation. Reviewed By: luismarques, asb Differential Revision: https://reviews.llvm.org/D82988	2020-07-09 13:54:13 +01:00
Ben Shi	1e9d0811c9	[RISCV] optimize addition with a pair of (addi imm) For an addition with an immediate in specific ranges, a pair of addi-addi can be generated instead of the ordinary lui-addi-add serial. Reviewed By: MaskRay, luismarques Differential Revision: https://reviews.llvm.org/D82262	2020-07-07 18:57:28 -07:00
Ben Shi	cb82de2960	[RISCV] Optimize multiplication by constant ... to shift/add or shift/sub. Do not enable it on riscv32 with the M extension where decomposeMulByConstant may not be an optimization. Reviewed By: luismarques, MaskRay Differential Revision: https://reviews.llvm.org/D82660	2020-07-07 18:50:24 -07:00
Luís Marques	61c2a0bb82	[RISCV] Fold ADDIs into load/stores with nonzero offsets We can often fold an ADDI into the offset of load/store instructions: (load (addi base, off1), off2) -> (load base, off1+off2) (store val, (addi base, off1), off2) -> (store val, base, off1+off2) This is possible when the off1+off2 continues to fit the 12-bit immediate. We remove the previous restriction where we would never fold the ADDIs if the load/stores had nonzero offsets. We now do the fold the the resulting constant still fits a 12-bit immediate, or if off1 is a variable's address and we know based on that variable's alignment that off1+offs2 won't overflow. Differential Revision: https://reviews.llvm.org/D79690	2020-07-06 17:32:57 +01:00
Pengxuan Zheng	d36f2c6a6c	[RISCV] Add mcountinhibit CSR Summary: The mcountinhibit CSR is defined in the ratified 1.11 version of the privileged spec. Reviewers: apazos, asb, lenary, luismarques Reviewed By: asb Subscribers: hiraditya, rbar, johnrusso, simoncook, sabuasal, niosHD, kito-cheng, shiva0217, jrtc27, MaskRay, zzheng, edward-jones, rogfer01, MartinMosbeck, brucehoult, the_o, rkruppe, PkmX, jocewei, psnobl, benna, Jim, s.egerton, sameer.abuasal, evandro, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82913	2020-07-01 08:27:00 -07:00
Luís Marques	b2aa546b07	[RISCV] Temporarily move riscv-expand-pseudo pass to PreEmitPass2 The pass to split atomic and non-atomic RISC-V pseudo-instructions was itself split into two passes in D79635 / commit rG2cb0644f90b7, with the splitting of non-atomic instructions being moved to the PreSched2 phase. A comment was added to D79635 detailing a case where this caused problems, so this commit moves the non-atomic split pass back to the PreEmitPass2 phase. This allows the bulk of the changes from D79635 to remain committed, while addressing the the reported problem (the pass split is now almost NFC). Once the root problem is fixed we can move the (non-atomic) instruction splitting pass back to earlier in the pipeline.	2020-07-01 16:26:02 +01:00
Luís Marques	a61fa1a4b9	Revert "[RISCV] Temporarily move riscv-expand-pseudo pass to PreEmitPass2" This reverts commit `05a20a9e9a`.	2020-07-01 16:01:40 +01:00
Luís Marques	05a20a9e9a	[RISCV] Temporarily move riscv-expand-pseudo pass to PreEmitPass2 The pass to split atomic and non-atomic RISC-V pseudo-instructions was itself split into two passes in D79635 / commit rG2cb0644f90b7, with the splitting of non-atomic instructions being moved to the PreSched2 phase. A comment was added to D79635 detailing a case where this caused problems, so this commit moves the non-atomic split pass back to the PreEmitPass2 phase. This allows the bulk of the changes from D79635 to remain committed, while addressing the the reported problem (the pass split is now almost NFC). Once the root problem is fixed we can move the (non-atomic) instruction splitting pass back to earlier in the pipeline.	2020-07-01 15:42:18 +01:00
Sam Elliott	7dc892661e	[RISCV] Implement Hooks to avoid chaining SELECT Summary: This implements two hooks that attempt to avoid control flow for RISC-V. RISC-V will lower SELECTs into control flow, which is not a great idea. The hook `hasMultipleConditionRegisters()` turns off the following DAGCombiner folds: select(C0\|C1, x, y) <=> select(C0, x, select(C1, x, y)) select(C0&C1, x, y) <=> select(C0, select(C1, x, y), y) The second hook `setJumpIsExpensive` controls a flag that has a similar purpose and is used in CodeGenPrepare and the SelectionDAGBuilder. Both of these have the effect of ensuring more logic is done before fewer jumps. Note: with the `B` extension, we may be able to lower select into a conditional move instruction, so at some point these hooks will need to be guarded based on enabled extensions. Reviewed By: luismarques Differential Revision: https://reviews.llvm.org/D79268	2020-07-01 11:56:31 +01:00
Guillaume Chatelet	28de229bc6	[Alignment][NFC] Migrate MachineFrameInfo::CreateStackObject to Align This patch is part of a series to introduce an Alignment type. See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2019-July/133851.html See this patch for the introduction of the type: https://reviews.llvm.org/D64790 Differential Revision: https://reviews.llvm.org/D82894	2020-07-01 07:28:11 +00:00
Matt Arsenault	08649f0a9d	RISCV: Don't store function in RISCVMachineFunctionInfo Targets should not depend on the MachineFunction state during the MachineFunctionInfo construction.	2020-06-30 16:08:51 -04:00
Luís Marques	2cb0644f90	[RISCV] Split the pseudo instruction splitting pass Extracts the atomic pseudo-instructions' splitting from `riscv-expand-pseudo` / `RISCVExpandPseudo` into its own pass, `riscv-expand-atomic-pseudo` / `RISCVExpandAtomicPseudo`. This allows for the expansion of atomic operations to continue to happen late (the new pass is added in `addPreEmitPass2`, so those expansions continue to happen in the same place), while the remaining pseudo-instructions can now be expanded earlier and benefit from more optimization passes. The nonatomics pass is now added in `addPreSched2`. Differential Revision: https://reviews.llvm.org/D79635	2020-06-29 14:35:57 +01:00

... 3 4 5 6 7 ...

959 Commits