llvm-project

Commit Graph

Author	SHA1	Message	Date
Francesco Petrogalli	e2cc12e412	[SveEmitter] Builtins for SVE matrix multiply `mmla`. Summary: Guarded by __ARM_FEATURE_SVE_MATMUL_INT8: * svmmla_u32 * svmmla_s32 * svusmmla_s32 Guarded by __ARM_FEATURE_SVE_MATMUL_FP32: * svmmla_f32 Guarded by __ARM_FEATURE_SVE_MATMUL_FP64: * svmmla_f64 Reviewers: sdesmalen, kmclaughlin, efriedma, rengolin Subscribers: tschuett, kristof.beyls, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D79639	2020-05-18 22:02:19 +00:00
Sander de Smalen	91cb13f90d	[SveEmitter] Add builtins for svqadd, svqsub and svdot This patch adds builtins for saturating add/sub instructions: - svqadd, svqadd_n - svqsub, svqsub_n and builtins for dot product instructions: - svdot, svdot_lane	2020-05-07 12:28:18 +01:00
Sander de Smalen	3cb8b4c193	[SveEmitter] Add builtins for SVE2 Polynomial arithmetic This patch adds builtins for: - sveorbt - sveortb - svpmul - svpmullb, svpmullb_pair - svpmullt, svpmullt_pair The svpmullb and svpmullt builtins are expressed using the svpmullb_pair and svpmullt_pair LLVM IR intrinsics, respectively. Reviewers: SjoerdMeijer, efriedma, rengolin Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D79480	2020-05-07 11:53:04 +01:00
Sander de Smalen	5ba329059f	[SveEmitter] Add builtins for svreinterpret The reinterpret builtins are generated separately because they need the cross product of all types, 121 functions in total, which is inconvenient to specify in the arm_sve.td file. Reviewers: SjoerdMeijer, efriedma, ctetreau, rengolin Reviewed By: efriedma Tags: #clang Differential Revision: https://reviews.llvm.org/D78756	2020-05-05 13:04:44 +01:00
Sander de Smalen	aed6bd6f42	Reland D78750: [SveEmitter] Add builtins for svdupq and svdupq_lane Edit: Changed a few CHECK lines into CHECK-DAG lines. This reverts commit `90f3f62cb0`.	2020-05-05 10:42:11 +01:00
Sander de Smalen	90f3f62cb0	Revert "[SveEmitter] Add builtins for svdupq and svdupq_lane" It seems this patch broke some buildbots, so reverting until I have had a chance to investigate. This reverts commit `6b90a6887d`.	2020-05-04 21:31:55 +01:00
Sander de Smalen	6b90a6887d	[SveEmitter] Add builtins for svdupq and svdupq_lane * svdupq builtins that duplicate scalars to every quadword of a vector are defined using builtins for svld1rq (load and replicate quadword). * svdupq builtins that duplicate boolean values to fill a predicate vector are defined using `svcmpne`. Reviewers: SjoerdMeijer, efriedma, ctetreau Reviewed By: efriedma Tags: #clang Differential Revision: https://reviews.llvm.org/D78750	2020-05-04 20:38:47 +01:00
Sander de Smalen	334931f54b	[SveEmitter] Add builtins for shifts. This patch adds builtins for: - svasrd - svlsl - svlsr	2020-05-01 22:27:24 +01:00
Sander de Smalen	1a720d49dc	[SveEmitter] Add builtins for various FP operations Unary: - svexpa, svtmad, svtsmul, svtssel, svscale, svrecpe, svrecps, svrsqrte, svrsqrts Binary: - svabd, svadd, svdiv, svdivr, svmin, svmax, svminnm, svmaxnm, svmul, svmulx, svsub, svsubr, svmul_lane Complex: - svcadd, svcmla	2020-05-01 17:37:43 +01:00
Sander de Smalen	42a56bf63f	[SveEmitter] Add builtins for gather prefetches Patch by Andrzej Warzynski Reviewed By: efriedma Tags: #clang Differential Revision: https://reviews.llvm.org/D78677	2020-04-29 11:52:49 +01:00
Sander de Smalen	03f419f3eb	[SveEmitter] IsInsertOp1SVALL and builtins for svqdec[bhwd] and svqinc[bhwd] Some ACLE builtins leave out the argument to specify the predicate pattern, which is expected to be expanded to an SV_ALL pattern. This patch adds the flag IsInsertOp1SVALL to insert SV_ALL as the second operand. Reviewers: efriedma, SjoerdMeijer Reviewed By: SjoerdMeijer Tags: #clang Differential Revision: https://reviews.llvm.org/D78401	2020-04-27 11:45:10 +01:00
Sander de Smalen	0ddb2034c1	[SveEmitter] Add builtins for compares and ReverseCompare flag. The IsReverseCompare flag tells CGBuiltin to swap the operands, so that LT/LE intrinsics can be expressed in terms of GE/GT intrinsics. This patch also adds builtins for the wide variants of the compares. Reviewers: SjoerdMeijer, efriedma, ctetreau Reviewed By: efriedma Tags: #clang Differential Revision: https://reviews.llvm.org/D78747	2020-04-24 14:33:47 +01:00
Sander de Smalen	823e2a670a	[SveEmitter] Add builtins for contiguous prefetches This patch also adds the enum `sv_prfop` for the prefetch operation specifier and checks to ensure the passed enum values are valid. Reviewers: SjoerdMeijer, efriedma, ctetreau Reviewed By: efriedma Tags: #clang Differential Revision: https://reviews.llvm.org/D78674	2020-04-24 11:35:59 +01:00
Sander de Smalen	a5e0389b2a	[AArch64] Define ACLE FP conversion intrinsics with more specific predicate. This patch changes the FP conversion intrinsics to take a predicate that matches the number of lanes for the vector with the widest element type as opposed to using <vscale x 16 x i1>. For example: ```<vscale x 4 x float> @llvm.aarch64.sve.fcvt.f32f16(<vscale x 4 x float>, <vscale x 4 x i1>, <vscale x 8 x half>)``` now uses <vscale x 4 x i1> instead of <vscale x 16 x i1> And similar for: ```<vscale x 4 x float> @llvm.aarch64.sve.fcvt.f32f64(<vscale x 4 x float>, <vscale x 2 x i1>, <vscale x 2 x double>)``` where the predicate now matches the wider type, so <vscale x 2 x i1>. Reviewers: efriedma, SjoerdMeijer, paulwalker-arm, rengolin Reviewed By: efriedma Tags: #clang Differential Revision: https://reviews.llvm.org/D78402	2020-04-23 10:53:23 +01:00
Sander de Smalen	002164461b	[SveEmitter] Add builtins for FP conversions This adds the flag IsOverloadCvt which tells CGBuiltin to use the result type and the type of the last operand as the overloaded types for the LLVM IR intrinsic. This also adds the flag IsFPConvert, which is needed to avoid converting the predicate of the operation from svbool_t to a predicate with fewer lanes, as the LLVM IR intrinsics use the <vscale x 16 x i1> as the predicate. Reviewers: SjoerdMeijer, efriedma Reviewed By: efriedma Tags: #clang Differential Revision: https://reviews.llvm.org/D78239	2020-04-23 10:49:06 +01:00
Sander de Smalen	662cbaf647	[SveEmitter] Add IsOverloadNone flag and builtins for svpfalse and svcnt[bhwd]_pat Add the IsOverloadNone flag to tell CGBuiltin that it does not have an overloaded type. This is used for e.g. svpfalse which does not take any arguments and always returns a svbool_t. This patch also adds builtins for svcntb_pat, svcnth_pat, svcntw_pat and svcntd_pat, as those don't require custom codegen. Reviewers: SjoerdMeijer, efriedma, rovka Reviewed By: efriedma Tags: #clang Differential Revision: https://reviews.llvm.org/D77596	2020-04-22 16:42:08 +01:00
Sander de Smalen	41d52662d5	[SveEmitter] Add support for _n form builtins The ACLE has builtins that take a scalar value that is to be expanded into a vector by the operation. While the ISA may have an instruction that takes an immediate or a scalar to represent this, the LLVM IR intrinsic may not, so Clang will have to splat the scalar value. This patch also adds the _n forms for svabd, svadd, svdiv, svdivr, svmax, svmin, svmul, svmulh, svsub and svsubr. Reviewers: SjoerdMeijer, efriedma, rovka Reviewed By: SjoerdMeijer Tags: #clang Differential Revision: https://reviews.llvm.org/D77594	2020-04-22 14:23:54 +01:00
Andrzej Warzynski	72f565899d	[SveEmitter] Implement builtins for gathers/scatters This patch adds builtins for: * regular, first-faulting and non-temporal gather loads * regular and non-temporal scatter stores Differential Revision: https://reviews.llvm.org/D77735	2020-04-22 13:21:39 +01:00
Sander de Smalen	fc64539749	[SveEmitter] Add immediate checks for lanes and complex imms Adds another bunch of intrinsics that take immediates with varying ranges, some being complex rotation immediates, which are a set of allowed immediates rather than a range. svmla_lane: lane immediate ranging 0..(128/(1*sizeinbits(elt)) - 1) svcmla_lane: lane immediate ranging 0..(128/(2*sizeinbits(elt)) - 1) svdot_lane: lane immediate ranging 0..(128/(4*sizeinbits(elt)) - 1) svcadd: complex rotate immediate [90, 270] svcmla, svcmla_lane: complex rotate immediate [0, 90, 180, 270] Reviewers: efriedma, SjoerdMeijer, rovka Reviewed By: efriedma Tags: #clang Differential Revision: https://reviews.llvm.org/D76680	2020-04-20 15:10:54 +01:00
Sander de Smalen	515020c091	[SveEmitter] Add more immediate operand checks. This patch adds a number of intrinsics that take immediates with varying ranges based on the element size of one of the operands. svext: immediate ranging 0 to (2048/sizeinbits(elt) - 1) svasrd: immediate ranging 1..sizeinbits(elt) svqshlu: immediate ranging 1..sizeinbits(elt)/2 ftmad: immediate ranging 0..(sizeinbits(elt) - 1) Reviewers: efriedma, SjoerdMeijer, rovka, rengolin Reviewed By: SjoerdMeijer Tags: #clang Differential Revision: https://reviews.llvm.org/D76679	2020-04-20 14:41:58 +01:00
Eric Fiselier	af2968e37f	[clang] Fix invalid comparator in tablegen Summary: The current version of the comparator does not introduce a strict weak ordering. Reviewers: fowles, bkramer, sdesmalen Reviewed By: sdesmalen Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D78323	2020-04-16 18:38:32 -04:00
Christopher Tetreault	464a0697e3	[SVE] Fix unsigned is always >= 0 Reviewers: efriedma, sdesmalen Reviewed By: sdesmalen Subscribers: tschuett, rkruppe, psnobl, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D78131	2020-04-15 15:23:49 -07:00
Sander de Smalen	c8a5b30bac	[SveEmitter] Add range checks for immediates and predicate patterns. Summary: This patch adds a mechanism to easily add range checks for a builtin's immediate operands. This patch is tested with the qdech intrinsic, which takes both an enum for the predicate pattern, as well as an immediate for the multiplier. Reviewers: efriedma, SjoerdMeijer, rovka Reviewed By: efriedma, SjoerdMeijer Subscribers: mgorny, tschuett, mgrang, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76678	2020-04-14 16:49:32 +01:00
Sander de Smalen	f6ea026f17	[SveEmitter] Fix encoding/decoding of SVETypeFlags Summary: This issue was introduced when reworking D75861. The bug isn't actually hit with current unit tests because the contiguous loads/stores infer the EltType and the MemEltType from the pointer and result, rather than using the flags. But it will be needed for other intrinsics, such as gather/scatter. Reviewers: SjoerdMeijer, Andrzej Reviewed By: SjoerdMeijer Subscribers: andwar, tschuett, cfe-commits, llvm-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76617	2020-04-14 15:48:28 +01:00
Sander de Smalen	17a68c61a9	[SveEmitter] Implement builtins for contiguous loads/stores This adds builtins for all contiguous loads/stores, including non-temporal, first-faulting and non-faulting. Reviewers: efriedma, SjoerdMeijer Reviewed By: SjoerdMeijer Tags: #clang Differential Revision: https://reviews.llvm.org/D76238	2020-04-14 15:24:57 +01:00
Benjamin Kramer	4065e92195	Upgrade some instances of std::sort to llvm::sort. NFC.	2020-03-28 19:23:29 +01:00
Sander de Smalen	981f0802b3	[SVE] Generate overloaded functions for ACLE intrinsics. The SVE ACLE allows using a short-form for the intrinsics, e.g. the following two declarations generate the same code: svuint32_t svld1(svbool_t, uint32_t const *); svuint32_t svld1_u32(svbool_t, uint32_t const *); using the attribute: __clang_arm_builtin_alias so that any call to svld1(svbool_t, uint32_t const *) will map to __builtin_sve_svld1_u32. Reviewers: SjoerdMeijer, miyuki, efriedma, simon_tatham, rengolin Reviewed By: SjoerdMeijer Tags: #clang Differential Revision: https://reviews.llvm.org/D75861	2020-03-19 09:36:23 +00:00
Sander de Smalen	c5b81466c2	Reland D75470 [SVE] Auto-generate builtins and header for svld1. Reworked the patch to avoid sharing a header (SVETypeFlags.h) between include/clang/Basic and utils/TableGen/SveEmitter.cpp. Now the patch generates the enum/flags which is included in TargetBuiltins.h. Also renamed one of the SveEmitter options to be in line with MVE. Summary: This is a first patch in a series for the SveEmitter to generate the arm_sve.h header file and builtins. I've tried my best to strip down this patch as best as I could, but there are still a few changes that are not necessarily exercised by the load intrinsics in this patch, mostly around the SVEType class which has some common logic to represent types from a type and prototype string. I thought it didn't make much sense to remove that from this patch and split it up.	2020-03-18 11:16:28 +00:00
Sander de Smalen	6ce537ccfc	Revert "[SVE] Auto-generate builtins and header for svld1." This reverts commit `8b409eabaf`. Reverting this patch for now because it breaks some buildbots.	2020-03-16 15:22:15 +00:00
Sander de Smalen	8b409eabaf	[SVE] Auto-generate builtins and header for svld1. This is a first patch in a series for the SveEmitter to generate the arm_sve.h header file and builtins. I've tried my best to strip down this patch as best as I could, but there are still a few changes that are not necessarily exercised by the load intrinsics in this patch, mostly around the SVEType class which has some common logic to represent types from a type and prototype string. I thought it didn't make much sense to remove that from this patch and split it up. Reviewers: efriedma, rovka, SjoerdMeijer, rsandifo-arm, rengolin Reviewed By: SjoerdMeijer Tags: #clang Differential Revision: https://reviews.llvm.org/D75470	2020-03-16 10:52:37 +00:00
Benjamin Kramer	5cc9dea78a	[tblgen] Remove unused private field. NFC.	2020-03-15 16:51:22 +01:00
Sander de Smalen	5087ace651	[Clang][SVE] Parse builtin type string for scalable vectors This patch adds 'q' to mean 'scalable vector' in the builtin type string, and for SVE will return the matching builtin type as defined in the C/C++ language extensions for SVE. This patch also adds some scaffolding to generate the arm_sve.h header file, and some builtin definitions (+CodeGen) to be able to implement some simple masked load intrinsics that use the ACLE types, such as: svint8_t test_svld1_s8(svbool_t pg, const int8_t *base) { return svld1_s8(pg, base); } Reviewers: efriedma, rjmccall, rovka, rsandifo-arm, rengolin Reviewed By: efriedma Tags: #clang Differential Revision: https://reviews.llvm.org/D75298	2020-03-15 14:34:52 +00:00
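
The lane-immediate ranges and rotation sets quoted in fc64539749 can be checked by hand with a small sketch. The helper names below (`maxLaneIndex`, `isValidLane`, `isValidRotation`, `expectValidLane`) are hypothetical and are not Clang's implementation; only the formula 0..(128/(N*sizeinbits(elt)) - 1) and the allowed rotation values come from the commit message.

```cpp
#include <cassert>

// Hypothetical helper: inclusive upper bound of a lane immediate, following
// the formula 128 / (multiplier * sizeinbits(elt)) - 1 from commit fc64539749.
// multiplier is 1 for svmla_lane, 2 for svcmla_lane and 4 for svdot_lane.
constexpr int maxLaneIndex(int eltBits, int multiplier) {
  return 128 / (multiplier * eltBits) - 1;
}

// A lane immediate is valid when it lies in 0..maxLaneIndex.
constexpr bool isValidLane(int lane, int eltBits, int multiplier) {
  return lane >= 0 && lane <= maxLaneIndex(eltBits, multiplier);
}

// Complex rotations are a set of allowed values rather than a range:
// {90, 270} for svcadd, {0, 90, 180, 270} for svcmla and svcmla_lane.
constexpr bool isValidRotation(int rot, bool isCadd) {
  return isCadd ? (rot == 90 || rot == 270)
                : (rot == 0 || rot == 90 || rot == 180 || rot == 270);
}

// Runtime variant for values that are not compile-time constants.
inline void expectValidLane(int lane, int eltBits, int multiplier) {
  assert(isValidLane(lane, eltBits, multiplier) && "lane immediate out of range");
}

// Worked values for 32-bit and 8-bit elements.
static_assert(maxLaneIndex(32, 1) == 3, "multiplier 1, 32-bit elements");
static_assert(maxLaneIndex(32, 2) == 1, "multiplier 2, 32-bit elements");
static_assert(maxLaneIndex(8, 4) == 3, "multiplier 4, 8-bit elements");
```

In Clang itself these checks are applied to the builtin's immediate operands at compile time; the sketch only reproduces the arithmetic.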

32 Commits
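
Commit af2968e37f fixes a comparator that did not induce a strict weak ordering, which `std::sort` and `llvm::sort` both require. The commit message does not show the comparator, so what follows is a generic sketch under assumptions: the `Entry` record, its fields, and the function names are hypothetical and are not taken from the TableGen emitter.

```cpp
#include <algorithm>
#include <cassert>
#include <string>
#include <tuple>
#include <vector>

// Hypothetical record with two sort keys.
struct Entry {
  std::string guard;
  std::string name;
};

// Lexicographic comparison via std::tie gives a strict weak ordering:
// entryLess(a, a) is always false, and if entryLess(a, b) is true then
// entryLess(b, a) is false.
bool entryLess(const Entry &a, const Entry &b) {
  return std::tie(a.guard, a.name) < std::tie(b.guard, b.name);
}

// Sorts a copy of the input and checks the post-condition.
std::vector<Entry> sortEntries(std::vector<Entry> entries) {
  std::sort(entries.begin(), entries.end(), entryLess);
  assert(std::is_sorted(entries.begin(), entries.end(), entryLess));
  return entries;
}
```

A common way to break the requirement is to compare with `<=` or to compare the second key without first checking that the first keys are equal; passing such a comparator to a sort is undefined behavior.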