llvm-project

Commit Graph

Author	SHA1	Message	Date
Kazushi (Jam) Marukawa	686988a50f	[VE] Optimize prologue/epilogue instructions Optimize eliminate FP mechanism. This time optimize a function which has no call but fixed stack objects. LLVM eliminates FP on such functions now. Also, optimize GOT/PLT registers save/restore instructions if a given function doesn't uses them. In addition, remove generating mechanism of `.cfi` instructions since those are taken from other architectures and not inspected yet. Update regression tests, also. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D92251	2020-11-30 22:22:33 +09:00
Kazushi (Jam) Marukawa	44a679eaa4	[VE] Change the behaviour of truncate Change the way to truncate i64 to i32 in I64 registers. VE assumed sext values previously. Change it to zext values this time to make it match to the LLVM behaviour. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D92226	2020-11-30 22:12:45 +09:00
Kazushi (Jam) Marukawa	3bd78b7cc0	[VE] Optimize emitSPAdjustment function Optimize emitSPAdjustment function to generate as small as possible instructions to adjust SP. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D92174	2020-11-28 08:06:31 +09:00
Kazushi (Jam) Marukawa	c2b49b2fb4	[VE] Add comprehensive stackframe tests Add comprehensive stackframe regression tests as a preparation of VEFrameLowering.cpp optimizations. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D92170	2020-11-26 22:12:09 +09:00
Simon Moll	b955c7e630	[VE] VE Vector Predicated SDNode, vector add isel and tests VE Vector Predicated (VVP) SDNodes form an intermediate layer between VE vector instructions and the initial SDNodes. We introduce 'vvp_add' with isel and tests as the first of these VVP nodes. VVP nodes have a mask and explicit vector length operand, which we will make proper use of later. Reviewed By: kaz7 Differential Revision: https://reviews.llvm.org/D91802	2020-11-23 17:17:07 +01:00
Kazushi (Jam) Marukawa	02b2bcd940	[VE] Correct types of return/argument values for getAdjustedFrameSize() A getAdjustedFrameSize function may need to handle larger than 32 bits integer, so change int to uint64_t. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D91862	2020-11-21 16:08:20 +09:00
Kazushi (Jam) Marukawa	a2dc4ac86b	[VE][NFC] Update missing bulk update tests to use typed sret	2020-11-21 13:11:25 +09:00
Matt Arsenault	20c43d6bd5	OpaquePtr: Bulk update tests to use typed sret	2020-11-20 17:58:26 -05:00
Kazushi (Jam) Marukawa	42389f1e96	[VE] Change threshold for jump table generation Implement getMinimumJumpTableEntries() to specify threshold for jump table genaration. We use 8 for the case of PIC mode to relieve the impact of PIC calculation required to implement PIC mode jump table. Update jump table regression test also. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D91785	2020-11-20 21:27:18 +09:00
Simon Moll	ffe6c97f6b	[VE] VEC_BROADCAST, lowering and isel This defines the vec_broadcast SDNode along with lowering and isel code. We also remove unused type mappings for the vector register classes (all vector MVTs that are not used in the ISA go). We will implement support for short vectors later by intercepting nodes with illegal vector EVTs before LLVM has had a chance to widen them. Reviewed By: kaz7 Differential Revision: https://reviews.llvm.org/D91646	2020-11-19 09:44:56 +01:00
Kazushi (Jam) Marukawa	132d6d73ea	[VE] Add vmv intrinsic instructions Add vmv intrinsic instructions and regression tests. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D91700	2020-11-19 08:05:35 +09:00
Kazushi (Jam) Marukawa	f4517bbd73	[VE] Implement JumpTable Implement JumpTable to make BRIND work on VE. Update an existing br_jt regression test also. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D91582	2020-11-17 22:43:10 +09:00
Kazushi (Jam) Marukawa	3a5c0ea895	[VE] Add vbrd intrinsic instructions Add vbrd intrinsic instructions and a regression test. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D91569	2020-11-17 19:04:18 +09:00
Kazushi (Jam) Marukawa	38621c45a8	[VE] Add lvm/svm intrinsic instructions Add lvm/svm intrinsic instructions and a regression test. Change RegisterInfo to specify that VM0/VMP0 are constant and reserved registers. This modifies a vst regression test, so update it. Also add pseudo instructions for VM512 register classes and mechanism to expand them after register allocation. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D91541	2020-11-17 07:05:36 +09:00
Kazushi (Jam) Marukawa	44a4f93925	[VE] Optimize leaf functions Optimize leaf functions by not generating save/restore for callee saved registers. Update regression tests also. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D91539	2020-11-17 00:38:01 +09:00
Simon Moll	a598c08ac8	[VE] fastcc and vreg-to-vreg copy This defines a 'fastcc' for the VE target and implements vreg-to-vreg copy for parameter passing. The 'fastcc' extends the standard CC for SX-Aurora with register passing of vector-typed parameters and return values. Reviewed By: kaz7 Differential Revision: https://reviews.llvm.org/D90842	2020-11-16 16:24:22 +01:00
Kazushi (Jam) Marukawa	37e7a80aed	[VE] Add lsv/lvs intrinsic instructions Add lsv/lvs intrinsic instructions and a regression test. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D91526	2020-11-16 23:42:51 +09:00
Kazushi (Jam) Marukawa	e0c92c6c03	[VE] Add pfchv intrinsic instructions Add pfchv intrinsic instructions and a regression test. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D91522	2020-11-16 20:10:44 +09:00
Simon Moll	1c00d096a6	[VE] LVLGen sets VL before vector insts The VE backend represents vector instructions with an explicit 'i32' vector length operand. In the VE ISA, the vector length is always read from the VL hardware register. The LVLGen pass inserts 'lvl' instructions as necessary to set VL to the right value before each vector instruction. Reviewed By: kaz7 Differential Revision: https://reviews.llvm.org/D91416	2020-11-16 09:19:14 +01:00
Kazushi (Jam) Marukawa	02ab46ef73	[VE] Add vst intrinsic instructions Add vst intrinsic instructions and a regression test. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D91406	2020-11-13 19:11:57 +09:00
Kazushi (Jam) Marukawa	410626c9b5	[VE] Support vld intrinsics Add intrinsics for vector load instructions. Add a regression test also. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D91332	2020-11-13 07:34:42 +09:00
Kazushi (Jam) Marukawa	a72d384249	[VE] Change the default type of v64 register class Change the default type of v64 register class from v512i32 to v256f64. Add a regression test also. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D91301	2020-11-12 19:07:07 +09:00
Kazushi (Jam) Marukawa	dd6f607ea8	[VE] Implement FoldImmediate Implement FoldImmediate for only integer aritihmetic operations. Add regression tests also. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D91150	2020-11-11 08:08:32 +09:00
Kazushi (Jam) Marukawa	c84b2c49be	[VE] Support inline assembly with vector regsiters Support inline assembly with vector registers. Add a regression test also. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D91146	2020-11-10 20:55:38 +09:00
Kazushi (Jam) Marukawa	b65ef65b22	[VE] Support inline assembly Support inline assembly with scalar registers. Add a regression test also. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D91119	2020-11-10 18:56:22 +09:00
Kazushi (Jam) Marukawa	43df29e206	[VE] Optimize address calculation Optimize address calculations using LEA/LEASL instructions. Update comments in VEISelLowering.cpp also. Update an existing regression test optimized by this modification. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D90878	2020-11-06 19:46:59 +09:00
Kazushi (Jam) Marukawa	b8745751f1	[VE][NFC] Update rem.ll regression test `Replace ISD::SREM handling with KnownBits::srem to reduce code duplication` (`bf04e34383`) changed the result of rem.ll regression test. So, updating it.	2020-11-06 10:44:29 +09:00
Kazushi (Jam) Marukawa	f0e585d585	[VE] Add isReMaterializable and isAsCheapAsAMove flags Add isReMaterializable and isCheapAsAMove flags to integer instructions which cost cheap. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D90833	2020-11-06 06:09:10 +09:00
Simon Moll	351c10cc72	[VE] Add +vpu attribute `+vpu` controls whether VEISelLowering adds any vregs. This defaults to `-vpu` to have scalar code generation out of the box. We bring up vector isel under the `+vpu` flag. Once vector isel is stable we switch to `+vpu` and advertise vregs and vops in TTI. Reviewed By: kaz7 Differential Revision: https://reviews.llvm.org/D90465	2020-11-04 12:42:00 +01:00
Simon Moll	1943b7f15d	[VE][NFC] move scalar tests to Scalar/ Restructure lit tests to follow downstream Reviewed By: kaz7 Differential Revision: https://reviews.llvm.org/D90468	2020-10-30 16:44:32 +01:00
Kazushi (Jam) Marukawa	b22e32a9c8	[VE] Specify to expand BRIND and BR_JT BRIND and BR_JT are not implmented yet, so expand them atm. Add regression tests too. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D90283	2020-10-28 18:50:20 +09:00
Kazushi (Jam) Marukawa	cfefef50c1	[VE] Support atomic store Support atomic store instructions and add a regression test. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D90137	2020-10-27 00:28:11 +09:00
Kazushi (Jam) Marukawa	f32992ad24	[VE] Support atomic load Support atomic load instruction and add a regression test. VE uses release consitency, so need to insert fence around atomic instructions. This patch enable AtomicExpandPass and use emitLeadingFence and emitTrailingFence mechanism for such purpose. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D90135	2020-10-26 18:02:45 +09:00
Kazushi (Jam) Marukawa	52f03fe115	[VE] Support atomic fence Support atomic fence instruction and add a regression test. Add MEMBARRIER pseudo insturction also to use it as a barrier against to the compiler optimizations. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D90112	2020-10-26 17:03:09 +09:00
Kazushi (Jam) Marukawa	6bb60d3e26	[VE] Add setcc for fp128 Add setcc for fp128 and clean existing ISel patterns. Also add a regression test. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D89683	2020-10-19 21:36:57 +09:00
Kazushi (Jam) Marukawa	fb2bb6fad4	[VE] Add cast to/from fp128 patterns Add cast to/from fp128 patterns. Clean other cast patterns too. Update a regression test by adding missing tests. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D89682	2020-10-19 21:35:27 +09:00
Kazushi (Jam) Marukawa	8796746b2a	[VE] Support select_cc Add missing ISel patterns related to select_cc DAG nodes. Add regression test of all combination of possible scalar types. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D89672	2020-10-19 18:54:25 +09:00
Kazushi (Jam) Marukawa	25955cbae4	[VE] Support br_cc comparing fp128 Support br_cc instruction comparing fp128 values. Add a br_cc.ll regression test for all kind of br_cc instructions. And, clean existing branch regression tests, this time. Clean a brcond.ll regression test for brcond instruction. Remove mixed branch1.ll regression test. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D89627	2020-10-19 18:29:39 +09:00
Kazushi (Jam) Marukawa	af8b444de3	[VE] Update ISel patterns for select instruction Add an ISel pattern for fp128 select instruction and optimize generated code for other types' select. instructions. Add a regression test also. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D89509	2020-10-19 18:28:21 +09:00
Kazushi (Jam) Marukawa	410e5b17cf	[VE] Support fabs/fcos/fsin/fsqrt math functions VE doesn't have instruction for fabs/fcos/fsin/fsqrt, so expand them. Add regression tests also. Update fcopysign regression test, also. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D89457	2020-10-16 06:27:38 +09:00
Kazushi (Jam) Marukawa	8e7b108e80	[VE] Change to expand SHL_PARTS/SRA_PARTS/SRL_PARTS VE doesn't have SHL_PARTS/SRA_PARTS/SRL_PARTS instructions, so need to expand them. Add regression tests too. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D89396	2020-10-15 09:04:34 +09:00
Kazushi (Jam) Marukawa	66be2e00ef	[VE] Support copysign math function VE doesn't have instruction for copysign, so expand it. Add a regression test also. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D89228	2020-10-12 21:06:19 +09:00
Kazushi (Jam) Marukawa	9d6d4b07a2	[VE] Support fneg and frem VE doesn't have fneg or frem instruction, so change them to expand. Add regression tests also. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D89205	2020-10-12 19:19:29 +09:00
Kazushi (Jam) Marukawa	6c32bc4875	[VE] Change to expand BRCOND VE doesn't have BRCOND instruction, so need to expand it. Also add a regression test. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D89173	2020-10-12 19:18:37 +09:00
Kazushi (Jam) Marukawa	5b5e78a431	[VE] Support register and frame-index pair correctly Support register and frame-index pair correctly as operands of generic load/store instrucitons, e.g. LD1BZXrri, STLrri, and etc. Add regression tests also. Differential Revision: https://reviews.llvm.org/D88779	2020-10-05 18:36:53 +09:00
Kazushi (Jam) Marukawa	1034262e0a	[VE] Support TargetBlockAddress Change to handle TargetBlockAddress and add a regression test for it. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D88576	2020-10-01 00:48:21 +09:00
Kazushi (Jam) Marukawa	40f1e7e804	[VE] Support f128 Support f128 using VE instructions. Update regression tests. I've noticed there is no load or store i128 test, so I add them too. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D86035	2020-08-17 17:26:52 +09:00
Kazushi (Jam) Marukawa	5d549219df	[VE] Change to promote i32 AND/OR/XOR operations VE has only 64 bits AND/OR/XOR instructions. We pretended that VE has 32 bits instructions also, but doing it increase the number of generated instructions. Therefore, we decide to promote 32 bits operations and use only 64 bits instructions in back end. We also avoid pretending that VE has 32 bits LEA instruction. Update regression tests also. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D85726	2020-08-12 16:23:50 +09:00
Kazushi (Jam) Marukawa	59703f1736	[VE] Update bit operations Change bitreverse/bswap/ctlz/ctpop/cttz regression tests to support i128 and signext/zeroext i32 types. This patch also change the way to support i32 types using 64 bits VE instructions. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D85712	2020-08-11 19:42:12 +09:00
Kazushi (Jam) Marukawa	63bc5d7863	[VE] Change to expand multiply related instructions Change to expand MULHU/MULHS/UMUL_LOHI/SMUL_LOHI for i32 and i64 since those instructions are not available on Aurora SX VE. Some of them are used in expansion of i128 multiply, so need to modify them to support i128. Then, update basic arithmetic regression tests of i128 and signed/unsigned i32 typed integer values. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D85490	2020-08-07 18:22:25 +09:00

1 2

87 Commits