llvm-project

Commit Graph

Author	SHA1	Message	Date
Fangrui Song	1e46b6f401	[test] Fix CodeGen/VE/Scalar tests	2021-03-02 15:30:44 -08:00
Kazushi (Jam) Marukawa	24faa87075	[VE] Update VELIntrinsic tests Update comment and style of regression tests for VELIntrinsic Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D94490	2021-01-13 00:12:50 +09:00
Kazushi (Jam) Marukawa	d02de13932	[VE] Support additional VMRGW and VMV intrinsic instructions Support missing VMRGW and VMV intrinsic instructions and add regression tests. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D94300	2021-01-11 20:50:31 +09:00
Kazushi (Jam) Marukawa	b72ca79982	[VE] Support intrinsic to isnert/extract_subreg of v512i1 Support insert/extract_subreg intrinsic instructions for v512i1 registers and add regression tests. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D94298	2021-01-11 20:40:10 +09:00
Kazushi (Jam) Marukawa	5ead757f1d	[VE] Support pack_f32p and pack_f32a intrinsic instructions Support pack_f32p and pack_f32a intrinsic instructions and regression tests. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D94296	2021-01-08 22:59:11 +09:00
Simon Moll	611d3c63f3	[VP] ISD helper functions [VE] isel for vp_add, vp_and This implements vp_add, vp_and for the VE target by lowering them to the VVP_* layer. We also add helper functions for VP SDNodes (isVPSDNode, getVPMaskIdx, getVPExplicitVectorLengthIdx). Reviewed By: kaz7 Differential Revision: https://reviews.llvm.org/D93766	2021-01-08 14:29:45 +01:00
Simon Moll	eeba70a463	[VE] Expand single-element BUILD_VECTOR to INSERT_VECTOR_ELT We do this mostly to be able to test the insert_vector_elt isel patterns. As long as we don't, most single element insertions show up as `BUILD_VECTOR` in the backend. Reviewed By: kaz7 Differential Revision: https://reviews.llvm.org/D93759	2021-01-08 11:48:01 +01:00
Simon Moll	d1b606f897	[VE] Extract & insert vector element isel Isel and tests for extract_vector_elt and insert_vector_elt. Reviewed By: kaz7 Differential Revision: https://reviews.llvm.org/D93687	2021-01-08 11:46:59 +01:00
Kazushi (Jam) Marukawa	12167632bc	[VE] Add SVOB intrinsic instruction Add SVOB intrinsic instruction and a regression test. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D94279	2021-01-08 18:49:17 +09:00
Kazushi (Jam) Marukawa	f784be0777	[VE] Support SJLJ exception related instructions Support EH_SJLJ_LONGJMP, EH_SJLJ_SETJMP, and EH_SJLJ_SETUP_DISPATCH for SjLj exception handling. NC++ uses SjLj exception handling, so implement it first. Add regression tests also. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D94071	2021-01-05 20:19:15 +09:00
Kazushi (Jam) Marukawa	2654f33c47	[VE] Support llvm.eh.sjlj.lsda In order to support SJLJ exception, implement llvm.eh.sjlj.lsda first. Add regression test also. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D93811	2021-01-05 18:06:14 +09:00
Kazushi (Jam) Marukawa	c287f90ccd	[VE] Change default CPU name to "generic" Change default CPU name of SX-Aurora VE from "ve" to "generic" similar to other architectures. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D93836	2021-01-04 20:09:57 +09:00
Simon Moll	c3acda0798	[VE] Vector 'and' isel and tests Reviewed By: kaz7 Differential Revision: https://reviews.llvm.org/D93709	2020-12-23 13:29:29 +01:00
Kazushi (Jam) Marukawa	8c2ad9e85f	[VE] Correct VMP allocation in calling conv VE used to allocate VM1, VM2, VMP2 (VM4+VM5), and VM3. This patch corrects to allocate VM1, VM2, VMP2 (VM4+VM5), and VM6. Also add a regression test. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D93570	2020-12-21 22:42:24 +09:00
Kazushi (Jam) Marukawa	a3a896d1cd	[VE] Optimize LEA combinations Change to optimize references of elements of aggregate data. Also add regression tests. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D93627	2020-12-21 22:21:10 +09:00
Kazushi (Jam) Marukawa	5e273b845b	[VE] Support STACKSAVE and STACKRESTORE Change to use default expanded code. Add regression tests also. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D93539	2020-12-21 20:15:50 +09:00
Kazushi (Jam) Marukawa	d99e4a4840	[VE] Support RETURNADDR Implement RETURNADDR for VE. Add a regression test also. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D93545	2020-12-21 20:06:03 +09:00
Kazushi (Jam) Marukawa	af83b74dc2	[VE] Support copy of vector mask registers Support VM and VMP registers in copyPhysReg() function. Also add regression tests. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D93547	2020-12-19 09:16:43 +09:00
Kazushi (Jam) Marukawa	697226550e	[VE] Support FRAMEADDR Implement FRAMEADDR for VE. Add a regression test also. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D93295	2020-12-15 23:31:19 +09:00
Kazushi (Jam) Marukawa	a2eb07aa55	[VE] Support atomic exchange instructions Support atomic exchange and atomic compare and exchange instructions. Change CAS and TS1AM instructions for ISel patterns. Add selectADDRzi pattern for them. Add TS1AM pseudo instruction also for better ISel. Add shouldExpandAtomicRMWInIR() function to expand all atomicrmw instructions except atomicrmw xchg. Add custom lower for i8/i16 atomicrmw xchg. Modify replaceFI to support CAS/TS1AM instructions which use "reg+disp" operands instead of "reg+imm+disp" operands. And, add several regression tests to check the correctness. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D93161	2020-12-15 17:43:11 +09:00
Kazushi (Jam) Marukawa	aefedb1707	[VE] Add logical mask intrinsic instructions Add andm, orm, xorm, eqvm, nndm, negm, pcvm, lzvm, and tovm intrinsic instructions, a few pseudo instructions to expand logical intrinsic using VM512, a mechnism to expand such pseudo instructions, and regression tests. Also, assign vector mask types and vector mask register classes correctly. This is required to use VM512 registers as function arguments. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D93093	2020-12-15 01:34:31 +09:00
Kazushi (Jam) Marukawa	87f308ab3d	[VE] Add vgt and vsc intrinsic instructions Add vgt and vsc intrinsic instructions and regression tests. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D93032	2020-12-11 18:23:43 +09:00
Kazushi (Jam) Marukawa	4b1e329255	[VE] Add vector reduce intrinsic instructions Add vrmax, vrmin, vfrmax, vfrmin, vrand, vror, and vrxor intrinsic instructions and regression tests. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D92941	2020-12-10 22:21:17 +09:00
Kazushi (Jam) Marukawa	e954ba28bc	[VE][NFC] Disable VP tests VP tests recently added don't work on Release mode. They work on Debug mode, so I disable them on Release mode to make tests work.	2020-12-10 15:13:05 +09:00
Kazushi (Jam) Marukawa	1a2147fead	[VE] Add vsum and vfsum intrinsic instructions Add vsum and vfsum intrinsic instructions and regression tests. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D92938	2020-12-10 01:11:53 +09:00
Kazushi (Jam) Marukawa	398f29fbb0	[VE] Add vfmk intrinsic instructions Add vfmk intrinsic instructions, a few pseudo instructions to expand vfmk intrinsic using VM512 correctly, and regression tests. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D92758	2020-12-10 00:08:20 +09:00
Simon Moll	3ffbc79357	[VP] Build VP SDNodes Translate VP intrinsics to VP_* SDNodes. The tests check whether a matching vp_* SDNode is emitted. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D91441	2020-12-09 11:36:51 +01:00
Kazushi (Jam) Marukawa	95ea50e4ad	[VE] Correct LVLGen (LVL instruction insert pass) SX Aurora VE uses an intermediate representation similar to VP as its MIR. VE itself uses invidiual VL register as its own vector length register at the hardware level. So, LLVM needs to insert load VL (LVL) instruction just before vector instructions if the value of VL is changed. This LVLGen pass generates LVL instructions for such purpose. Previously, a bug is pointed out in D91416. This patch correct this bug and add a regression test. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D92716	2020-12-09 06:33:53 +09:00
Kazushi (Jam) Marukawa	9d4501e2b4	[VE] Add vcp and vex intrinsic instructions Add vcp and vex intrinsic instructions and regression tests. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D92752	2020-12-07 22:56:55 +09:00
Kazushi (Jam) Marukawa	03898b79fb	[VE] Add vrcp, vrsqrt, vcvt, vmrg, and vshf intrinsic instructions Add vrcp, vrsqrt, vcvt, vmrg, and vshf intrinsic instructions and regression tests. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D92750	2020-12-07 20:30:12 +09:00
Kazushi (Jam) Marukawa	67dbc8195d	[VE] Add vfmad, vfmsb, vfnmad, and vfnmsb intrinsic instructions Add vfmad, vfmsb, vfnmad, and vfnmsb intrinsic instructions and regression tests. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D92697	2020-12-07 19:28:17 +09:00
Kazushi (Jam) Marukawa	23034a4a63	[VE] Add vfsqrt, vfcmp, vfmax, and vfmin intrinsic instructions Add vfsqrt, vfcmp, vfmax, and vfmin intrinsic instructions and regression tests. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D92651	2020-12-05 07:52:14 +09:00
Kazushi (Jam) Marukawa	e936d1e113	[VE] Add vfadd, vfsub, vfmul, and vfdiv intrinsic instructions Add vfadd, vfsub, vfmul, and vfdiv intrinsic instructions and regression tests. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D92649	2020-12-04 21:58:51 +09:00
Kazushi (Jam) Marukawa	1365718778	[VE] Add vsll, vsrl, vsla, vsra, and vsfa intrinsic instructions Add vsll, vsrl, vsla, vsra, and vsfa intrinsic instructions and regression tests. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D92550	2020-12-03 23:19:58 +09:00
Kazushi (Jam) Marukawa	b91238173d	[VE] Add veqv and vseq intrinsic instructions Add veqv and vseq intrinsic instructions and regression tests. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D92527	2020-12-03 17:39:24 +09:00
Kazushi (Jam) Marukawa	dd0159bd81	[VE] Add vand, vor, and vxor intrinsic instructions Add vand, vor, and vxor intrinsic instructions and regression tests. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D92454	2020-12-02 22:52:54 +09:00
Kazushi (Jam) Marukawa	c1762bcf0a	[VE] Add vcmp, vmax, and vmin intrinsic instructions Add vcmp, vmax, and vmin intrinsic instructions and regression tests. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D92387	2020-12-02 11:16:52 +09:00
Kazushi (Jam) Marukawa	10b164d2f7	[VE] Add vmul and vdiv intrinsic instructions Add vmul and vdiv intrinsic instructions and regression tests. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D92377	2020-12-01 23:03:49 +09:00
Kazushi (Jam) Marukawa	c3fe6ea22e	[VE] Add vadd and vsub intrinsic instructions Add vadd and vsub intrinsic instructions and regression tests. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D92332	2020-12-01 19:57:22 +09:00
Kazushi (Jam) Marukawa	6834b3d6d5	[VE] Optimize prologue/epilogue instructions about GOT Optimize prologue/epilogue instructions if a given function use GOT but do not call other functions by eliminating FP. Previously, we had wrong implementations taken from other architectures. Update regression tests also. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D92313	2020-12-01 02:22:31 +09:00
Kazushi (Jam) Marukawa	6fe610535f	[VE] Clean check routines of branch types Previously, these check routines accepted non-generatble instructions. This time, I clean them and add assert for those non-generatable instructions. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D92254	2020-12-01 02:19:37 +09:00
Kazushi (Jam) Marukawa	686988a50f	[VE] Optimize prologue/epilogue instructions Optimize eliminate FP mechanism. This time optimize a function which has no call but fixed stack objects. LLVM eliminates FP on such functions now. Also, optimize GOT/PLT registers save/restore instructions if a given function doesn't uses them. In addition, remove generating mechanism of `.cfi` instructions since those are taken from other architectures and not inspected yet. Update regression tests, also. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D92251	2020-11-30 22:22:33 +09:00
Kazushi (Jam) Marukawa	44a679eaa4	[VE] Change the behaviour of truncate Change the way to truncate i64 to i32 in I64 registers. VE assumed sext values previously. Change it to zext values this time to make it match to the LLVM behaviour. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D92226	2020-11-30 22:12:45 +09:00
Kazushi (Jam) Marukawa	3bd78b7cc0	[VE] Optimize emitSPAdjustment function Optimize emitSPAdjustment function to generate as small as possible instructions to adjust SP. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D92174	2020-11-28 08:06:31 +09:00
Kazushi (Jam) Marukawa	c2b49b2fb4	[VE] Add comprehensive stackframe tests Add comprehensive stackframe regression tests as a preparation of VEFrameLowering.cpp optimizations. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D92170	2020-11-26 22:12:09 +09:00
Simon Moll	b955c7e630	[VE] VE Vector Predicated SDNode, vector add isel and tests VE Vector Predicated (VVP) SDNodes form an intermediate layer between VE vector instructions and the initial SDNodes. We introduce 'vvp_add' with isel and tests as the first of these VVP nodes. VVP nodes have a mask and explicit vector length operand, which we will make proper use of later. Reviewed By: kaz7 Differential Revision: https://reviews.llvm.org/D91802	2020-11-23 17:17:07 +01:00
Kazushi (Jam) Marukawa	02b2bcd940	[VE] Correct types of return/argument values for getAdjustedFrameSize() A getAdjustedFrameSize function may need to handle larger than 32 bits integer, so change int to uint64_t. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D91862	2020-11-21 16:08:20 +09:00
Kazushi (Jam) Marukawa	a2dc4ac86b	[VE][NFC] Update missing bulk update tests to use typed sret	2020-11-21 13:11:25 +09:00
Matt Arsenault	20c43d6bd5	OpaquePtr: Bulk update tests to use typed sret	2020-11-20 17:58:26 -05:00
Kazushi (Jam) Marukawa	42389f1e96	[VE] Change threshold for jump table generation Implement getMinimumJumpTableEntries() to specify threshold for jump table genaration. We use 8 for the case of PIC mode to relieve the impact of PIC calculation required to implement PIC mode jump table. Update jump table regression test also. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D91785	2020-11-20 21:27:18 +09:00


128 Commits