llvm-project

Commit Graph

Author	SHA1	Message	Date
Hsiangkai Wang	e72b22a40b	[RISCV] Define different pseudo instructions for different FPR. When spilling, the spill size will depend on the size of register class. For .vf vector instructions, it may spill the floating point scalar argument. In order to use the correct load/store instructions for spilling, we need to provide the correct floating point register class for the .vf vector pseudo instructions. In this commit, we define the .vf pseudo instructions as three different kinds of pseudo instructions for half/float/double. For example, PseudoVFADD_M1 will become as PseudoVFADD_F16_M1, PseudoVFADD_F32_M1, and PseudoVFADD_F64_M1. Differential Revision: https://reviews.llvm.org/D95234	2021-01-26 15:48:35 +08:00
Hsiangkai Wang	b69932b550	[RISCV] Implement vlsegff intrinsics. Differential Revision: https://reviews.llvm.org/D95303	2021-01-26 12:02:43 +08:00
Hsiangkai Wang	66a49aef69	[RISCV] Implement vsoxseg/vsuxseg intrinsics. Define vsoxseg/vsuxseg intrinsics and pseudo instructions. Lower vsoxseg/vsuxseg intrinsics to pseudo instructions in RISCVDAGToDAGISel. Differential Revision: https://reviews.llvm.org/D94940	2021-01-23 08:54:56 +08:00
Hsiangkai Wang	97e33feb08	[RISCV] Implement vloxseg/vluxseg intrinsics. Define vloxseg/vluxseg intrinsics and pseudo instructions. Lower vloxseg/vluxseg intrinsics to pseudo instructions in RISCVDAGToDAGISel. Differential Revision: https://reviews.llvm.org/D94903	2021-01-23 08:54:56 +08:00
Craig Topper	f8f1b20e6b	[RISCV] Don't create LMUL=8 pseudo instructions for ternary widening arithmetic instructions These instructions produce 2*SEW result so the input can't have an LMUL=8 or the result would need a non-existant LMUL=16. So only create pseudos for LMUL up to 4. Differential Revision: https://reviews.llvm.org/D95189	2021-01-21 19:29:02 -08:00
ShihPo Hung	9667750331	[RISCV] Add intrinsics for RVV1.0 VFRSQRTE7 & VFRECE7 Reviewed By: craig.topper, frasercrmck Differential Revision: https://reviews.llvm.org/D95113	2021-01-21 18:38:49 -08:00
ShihPo Hung	976cf53cc7	[RISCV] Add intrinsics for vector unordered indexed load in RVV 1.0 Add unordered indexed load: vluxei Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D95028	2021-01-21 18:38:49 -08:00
ShihPo Hung	bea661d9a5	[RISCV] Add intrinsics for RVV 1.0 vrgatherei16 Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D95014	2021-01-21 18:38:49 -08:00
Craig Topper	3b5430eb0d	[RISCV] Add a VL output to vleff intrinsics. The fault-only-first-load instructions can reduce VL if an element other than element 0 triggers a memory fault. This can be used to vectorize loops with data dependent exit conditions like strcmp or strlen. This patch adds a VL output to these intrinsics so that the new VL value can be captured by software. This will be expanded to 'csrr gpr, vl' after the vleff instruction during SelectionDAG. By doing this with one intrinsic we are able to guarantee that the csrr reads the VL value produced by the vleff instruction. Having it as a separate intrinsic would make it impossible to guarantee ordering without making every other vector intrinsic have side effects. The intrinsics are expanded during lowering into two ISD nodes that are glued together. These ISD nodes will go through isel separately, but should maintain the glue so that they get emitted adjacently by InstrEmitter. I've only ran the chain through the vleff instruction, allowing the READ_VL to be deleted if it is unused. Reviewed By: HsiangKai Differential Revision: https://reviews.llvm.org/D94286	2021-01-21 17:19:58 -08:00
Hsiangkai Wang	b7ab6726b6	[RISCV] New vector load/store in V extension v1.0 Upgrade RISC-V V extension to v1.0-08a0b46. Indexed load/store have ordered and unordered form. New whole vector load/store. Differential Revision: https://reviews.llvm.org/D93614	2021-01-22 07:30:09 +08:00
Hsiangkai Wang	a8b96eadfd	[RISCV] Implement vssseg intrinsics. Define vlsseg intrinsics and pseudo instructions. Lower vlsseg intrinsics to pseudo instructions in RISCVDAGToDAGISel. Differential Revision: https://reviews.llvm.org/D94863	2021-01-21 11:51:35 +08:00
Hsiangkai Wang	e5e329023b	[RISCV] Implement vlsseg intrinsics. Define vlsseg intrinsics and pseudo instructions. Lower vlsseg intrinsics to pseudo instructions in RISCVDAGToDAGISel. Differential Revision: https://reviews.llvm.org/D94763	2021-01-21 11:51:35 +08:00
Hsiangkai Wang	47228f7854	[RISCV] Implement vsseg intrinsics. Define vsseg intrinsics and pseudo instructions. Lower vsseg intrinsics to pseudo instructions in RISCVDAGToDAGISel. Differential Revision: https://reviews.llvm.org/D94688	2021-01-21 11:51:35 +08:00
Hsiangkai Wang	8ca4b174d7	[RISCV] Implement vlseg intrinsics. For Zvlsseg, we need continuous vector registers for the values. We need to define new register classes for the different combinations of (number of fields and LMUL). For example, when the number of fields(NF) = 3, LMUL = 2, the values will be assigned to (V0M2, V2M2, V4M2), (V2M2, V4M2, V6M2), (V4M2, V6M2, V8M2), ... We define the vlseg intrinsics with multiple outputs. There is no way to describe the codegen patterns with multiple outputs in the tablegen files. We do the codegen in RISCVISelDAGToDAG and use EXTRACT_SUBREG to extract the values of output. The multiple scalable vector values will be put into a struct. This patch is depended on the support for scalable vector struct. Differential Revision: https://reviews.llvm.org/D94229	2021-01-20 14:26:04 +08:00
ShihPo Hung	4dae2247fd	[RISCV] refactor VPatBinary (NFC) Make it easier to reuse for intrinsic vrgatherei16 which needs to encode both LMUL & EMUL in the instruction name, like PseudoVRGATHEREI16_VV_M1_M1 and PseudoVRGATHEREI16_VV_M1_M2. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D94951	2021-01-19 19:09:56 -08:00
Fraser Cormack	15fd6bae0e	[RISCV] Extend RVV VType info with the type's AVL (NFC) This patch factors out the "VLMax" operand passed to most scalable-vector ISel patterns into a property of each VType. This is seen as a preparatory change to allow RVV in the future to more easily support fixed-length vector types with constrained vector lengths, with the AVL operand set to the length of the fixed-length vector. It has no effect on the scalable code generation path. Reviewed By: HsiangKai Differential Revision: https://reviews.llvm.org/D94594	2021-01-19 15:46:56 +00:00
ShihPo Hung	9cf511aa08	[RISCV] Add intrinsics for vector AMO operations Add vamoswap, vamoadd, vamoxor, vamoand, vamoor, vamomin, vamomax, vamominu, vamomaxu intrinsics. Reviewed By: craig.topper, khchen Differential Revision: https://reviews.llvm.org/D94589	2021-01-18 23:11:10 -08:00
Craig Topper	dfc1901d51	[RISCV] Custom lower ISD::VSCALE. This patch custom lowers ISD::VSCALE into a csrr vlenb followed by a shift right by 3 followed by a multiply by the scale amount. I've added computeKnownBits support to indicate that the csrr vlenb always produces 3 trailng bits of 0s so the shift right is "exact". This allows the shift and multiply sequence to be nicely optimized into a single shift or removed completely when the scale amount is a power of 2. The non power of 2 case multiplying by 24 is still producing suboptimal code. We could remove the right shift and use a multiply by 3. Hopefully we can improve DAG combine to fix that since it's not unique to this sequence. This replaces D94144. Reviewed By: HsiangKai Differential Revision: https://reviews.llvm.org/D94249	2021-01-13 17:14:49 -08:00
Craig Topper	1730b0f66a	[RISCV] Remove '.mask' from vcompress intrinsic name. NFC It has a mask argument, but isn't a masked instruction. It doesn't use the mask policy of or the v0.t syntax.	2021-01-12 14:46:16 -08:00
Craig Topper	a14040bd4d	[RISCV] Use vmerge.vim for llvm.riscv.vfmerge with a 0.0 scalar operand. We can use a 0 immediate to avoid needing to materialize 0 into an FPR first. Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D94459	2021-01-12 11:08:26 -08:00
Evandro Menezes	7470017f24	[RISCV] Define the vfclass RVV intrinsics Define the `vfclass` IR intrinsics for the respective V instructions. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Evandro Menezes <evandro.menezes@sifive.com> Differential Revision: https://reviews.llvm.org/D94356	2021-01-11 17:40:09 -06:00
Craig Topper	278a3ea1b2	[RISCV] Use vmv.v.i vd, 0 instead of vmv.v.x vd, x0 for llvm.riscv.vfmv.v.f with 0.0 This matches what we use for integer 0. It's also consistent with the scalar 'mv' pseudo that uses addi rather than add with x0.	2021-01-11 15:08:05 -08:00
Craig Topper	5cf73dca77	[RISCV] Convert most of the information about RVV Pseudos into bits in TSFlags. This patch moves all but the BaseInstr to bits in TSFlags. For the index fields, we can just use a bit to indicate their presence. The locations of the operands are well defined. This reduces the llc binary by about 32K on my build. It also removes the binary search of the table from the custom inserter. Instead we just check that the SEW op is present. Reviewed By: rogfer01 Differential Revision: https://reviews.llvm.org/D94375	2021-01-10 19:15:45 -08:00
Craig Topper	0875a9da2a	[RISCV] Cleanup a few section comments in RISCVInstrInfoVPseudos.td. NFC	2021-01-08 11:36:31 -08:00
Evandro Menezes	946bc50e4c	[RISCV] Define the vfsqrt RVV intrinsics Define the `vfsqrt` IR intrinsics for the respective V instructions. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Evandro Menezes <evandro.menezes@sifive.com> Differential Revision: https://reviews.llvm.org/D93745	2021-01-07 17:29:29 -06:00
Craig Topper	7a8ced43d7	[RISCV] Fix a few section number comments in RISCVInstrInfoVPseudos.td to match the V extension 1.0 draft spec. NFC The majority of the comments use the 1.0 draft spec section numbers.	2021-01-06 16:38:30 -08:00
Craig Topper	c707716c04	[RISCV] Match vmslt(u).vx intrinsics with a small immediate to vmsle(u).vx. There are vmsle(u).vx and vmsle(u).vi instructions, but there is only vmslt(u).vx and no vmslt(u).vi. vmslt(u).vi can be emulated for some immediates by decrementing the immediate and using vmsle(u).vi. To avoid the user needing to know about this, this patch does this conversion. The assembler does the same thing for vmslt(u).vi and vmsge(u).vi pseudoinstructions. There is no vmsge(u).vx intrinsic or instruction so this patch is limited to vmslt(u). Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D94070	2021-01-05 10:20:21 -08:00
Monk Chiang	1d04cbeb43	[RISCV] Define vector single-width type-convert intrinsic. Define intrinsics: 1. vfcvt.xu.f.v/vfcvt.x.f.v 2. vfcvt.rtz.xu.f.v/vfcvt.rtz.x.f.v 3. vfcvt.f.xu.v/vfcvt.f.x.v We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Monk Chiang <monk.chiang@sifive.com> Differential Revision: https://reviews.llvm.org/D93933	2020-12-31 11:49:30 +08:00
Monk Chiang	2aed9bc98a	[RISCV] Define vector narrowing type-convert intrinsic. Define intrinsics: 1. vfncvt.xu.f.w/vfncvt.x.f.w 2. vfncvt.rtz.xu.f.w/vfncvt.rtz.x.f.w 3. vfncvt.f.xu.w/vfncvt.f.x.w 4. vfncvt.f.f.w/vfncvt.rod.f.f.w We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Monk Chiang <monk.chiang@sifive.com> Differential Revision: https://reviews.llvm.org/D93932	2020-12-31 11:48:28 +08:00
Monk Chiang	fdd30faae5	[RISCV] Define vector widening type-convert intrinsic. Define intrinsics: 1. vfwcvt.xu.f.v/vfwcvt.x.f.v 2. vfwcvt.rtz.xu.f.v/vfwcvt.rtz.x.f.v 3. vfwcvt.f.xu.v/vfwcvt.f.x.v 4. vfwcvt.f.f.v We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Monk Chiang <monk.chiang@sifive.com> Differential Revision: https://reviews.llvm.org/D93855	2020-12-31 11:48:09 +08:00
ShihPo Hung	096b02ebbf	[RISCV] Add intrinsics for vcompress instruction This patch defines vcompress intrinsics and lower to V instructions. We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: ShihPo Hung <shihpo.hung@sifive.com> Differential revision: https://reviews.llvm.org/D93809	2020-12-29 18:38:15 -08:00
Zakk Chen	6da0033624	[RISCV] Define vsext/vzext intrinsics. Define vsext/vzext intrinsics.and lower to V instructions. Define new fraction register class fields in LMULInfo and a NoReg to present invalid LMUL register classes. Authored-by: ShihPo Hung <shihpo.hung@sifive.com> Co-Authored-by: Zakk Chen <zakk.chen@sifive.com> Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D93893	2020-12-29 16:50:53 -08:00
Craig Topper	79cbb003c5	[RISCV] Don't use tail agnostic policy on instructions where destination is tied to source If the destination is tied, then user has some control of the register used for input. They would have the ability to control the value of any tail elements. By using tail agnostic we take this option away from them. Its not clear that the intrinsics are defined such that this isn't supposed to work. And undisturbed is a valid implementation for agnostic so code wouldn't even fail to work on all systems if we always used agnostic. The vcompress intrinsic is defined to require tail undisturbed so at minimum we need this for that instruction or need to redefine the intrinsic. I've made an exception here for vmv.s.x/fmv.s.f and reduction instructions which only write to element 0 regardless of the tail policy. This allows us to keep the agnostic policy on those which should allow better redundant vsetvli removal. An enhancement would be to check for undef input and keep the agnostic policy, but we don't have good test coverage for that yet. Reviewed By: khchen Differential Revision: https://reviews.llvm.org/D93878	2020-12-29 10:37:58 -08:00
Craig Topper	2ae760e27e	[RISCV] Add earlyclobber of destination register to vmsbf.m/vmsif.m/vmsof.m instructions The spec for these instructions include this note. "The destination register cannot overlap either the source register or the mask register ('v0') if the instruction is masked." So we need earlyclobber to enforce this constraint. I've regenerated the tests with update_llc_test_checks.py to show the effects of the earlyclobber. Reviewed By: khchen, frasercrmck Differential Revision: https://reviews.llvm.org/D93867	2020-12-29 10:00:04 -08:00
Zakk Chen	f3f9ce3b79	[RISCV] Define vmclr.m/vmset.m intrinsics. Define vmclr.m/vmset.m intrinsics and lower to vmxor.mm/vmxnor.mm. Ideally all rvv pseudo instructions could be implemented in C header, but those two instructions don't take an input, codegen can not guarantee that the source register becomes the same as the destination. We expand pseduo-v-inst into corresponding v-inst in RISCVExpandPseudoInsts pass. Reviewed By: craig.topper, frasercrmck Differential Revision: https://reviews.llvm.org/D93849	2020-12-28 18:57:17 -08:00
Zakk Chen	e673d40199	[RISCV] Define vmsbf.m/vmsif.m/vmsof.m/viota.m/vid.v intrinsics. Define those intrinsics and lower to V instructions. Use update_llc_test_checks.py for viota.m tests to check earlyclobber is applied correctly. mask viota.m tests uses the same argument as input and mask for avoid dependency of D93364. We work with @rogfer01 from BSC to come out this patch. Reviewed By: HsiangKai Differential Revision: https://reviews.llvm.org/D93823	2020-12-28 05:54:18 -08:00
Monk Chiang	622ea9cf74	[RISCV] Define vector widening reduction intrinsic. Define vwredsumu/vwredsum/vfwredosum/vfwredsum We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Zakk Chen <zakk.chen@sifive.com> Differential Revision: https://reviews.llvm.org/D93807	2020-12-26 21:42:30 +08:00
Zakk Chen	da4a637e99	[RISCV] Define vpopc/vfirst intrinsics. Define vpopc/vfirst intrinsics and lower to V instructions. We work with @rogfer01 from BSC to come out this patch. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D93795	2020-12-24 19:44:34 -08:00
Zakk Chen	351c216f36	[RISCV] Define vector mask-register logical intrinsics. Define vector mask-register logical intrinsics and lower them to V instructions. Also define pseudo instructions vmmv.m and vmnot.m. We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Zakk Chen <zakk.chen@sifive.com> Differential Revision: https://reviews.llvm.org/D93705	2020-12-24 18:59:05 -08:00
ShihPo Hung	912740a864	[RISCV] Add intrinsics for vrgather instruction This patch defines vrgather intrinsics and lower to V instructions. We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: ShihPo Hung <shihpo.hung@sifive.com> Differential revision: https://reviews.llvm.org/D93797	2020-12-24 18:16:02 -08:00
Monk Chiang	afd03cd335	[RISCV] Define vector single-width reduction intrinsic. integer group: vredsum/vredmaxu/vredmax/vredminu/vredmin/vredand/vredor/vredxor float group: vfredosum/vfredsum/vfredmax/vfredmin We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Zakk Chen <zakk.chen@sifive.com> Differential Revision: https://reviews.llvm.org/D93746	2020-12-25 09:56:01 +08:00
Fraser Cormack	1a7ac29a89	[RISCV] Add ISel support for RVV vector/scalar forms This patch extends the SDNode ISel support for RVV from only the vector/vector instructions to include the vector/scalar and vector/immediate forms. It uses splat_vector to carry the scalar in each case, except when XLEN<SEW (RV32 SEW=64) when a custom node `SPLAT_VECTOR_I64` is used for type-legalization and to encode the fact that the value is sign-extended to SEW. When the scalar is a full 64-bit value we use a sequence to materialize the constant into the vector register. The non-intrinsic ISel patterns have also been split into their own file. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Fraser Cormack <fraser@codeplay.com> Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D93312	2020-12-23 20:16:18 +00:00
Craig Topper	e0110a4740	[RISCV] Add intrinsics for vfmv.v.f Also include a special case pattern to use vmv.v.x vd, zero when the argument is 0.0. Reviewed By: khchen Differential Revision: https://reviews.llvm.org/D93672	2020-12-23 10:50:48 -08:00
ShihPo Hung	6301871d06	[RISCV] Add intrinsics for vfwmacc, vfwnmacc, vfwmsac, vfwnmsac instructions This patch defines vfwmacc, vfwnmacc, vfwmsc, vfwnmsac intrinsics and lower to V instructions. We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: ShihPo Hung <shihpo.hung@sifive.com> Differential Revision: https://reviews.llvm.org/D93693	2020-12-23 00:42:04 -08:00
Zakk Chen	032600b9ae	[RISCV] Define vmerge/vfmerge intrinsics. Define vmerge/vfmerge intrinsics and lower to V instructions. Include support for vector-vector vfmerge by vmerge.vvm. We work with @rogfer01 from BSC to come out this patch. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D93674	2020-12-23 00:07:09 -08:00
Evandro Menezes	4d47944393	[RISCV] Define the vfmin, vfmax RVV intrinsics Define the vfmin, vfmax IR intrinsics for the respective V instructions. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Evandro Menezes <evandro.menezes@sifive.com> Differential Revision: https://reviews.llvm.org/D93673	2020-12-23 00:27:38 -06:00
ShihPo Hung	ad0a7ad950	[RISCV] Add intrinsics for vf[n]macc/vf[n]msac/vf[n]madd/vf[n]msub instructions This patch defines vfmadd/vfnmacc, vfmsac/vfnmsac, vfmadd/vfnmadd, and vfmsub/vfnmsub lower to V instructions. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: ShihPo Hung <shihpo.hung@sifive.com> Differential Revision: https://reviews.llvm.org/D93691	2020-12-22 18:34:00 -08:00
ShihPo Hung	4268783998	[RISCV] Add intrinsics for vwmacc[u\|su\|us] instructions This patch defines vwmacc[u\|su\|us] intrinsics and lower to V instructions. We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: ShihPo Hung <shihpo.hung@sifive.com> Differential Revision: https://reviews.llvm.org/D93675	2020-12-22 18:17:39 -08:00
ShihPo Hung	c8874464b5	[RISCV] Add intrinsics for vslide1up/down, vfslide1up/down instruction This patch adds intrinsics for vslide1up, vslide1down, vfslide1up, vfslide1down. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: ShihPo Hung <shihpo.hung@sifive.com> Differential Revision: https://reviews.llvm.org/D93608	2020-12-22 18:14:22 -08:00
Craig Topper	53deef9e0b	[RISCV] Remove unneeded !eq comparing a single bit value to 0/1 in RISCVInstrInfoVPseudos.td. NFC Instead we can either use the bit directly. If it was checking for 0 we need to swap the operands or use !not.	2020-12-22 11:57:16 -08:00

1 2

83 Commits