llvm-project

Commit Graph

Author	SHA1	Message	Date
Sander de Smalen	e951b045bf	[AArch64][SVE] Regression test all ACLE tests with C++ We found issues with a number of intrinsics when building them with C++, so it makes sense to guard these tests with some extra RUN lines to build the tests in C++ mode.	2021-04-22 13:24:04 +01:00
Francesco Petrogalli	d54e4dded7	[sve][acle] Enable feature macros for SVE ACLE extensions. Summary: The following feature macros have been added: __ARM_FEATURE_SVE_BF16 __ARM_FEATURE_SVE_MATMUL_INT8 __ARM_FEATURE_SVE_MATMUL_FP32 __ARM_FEATURE_SVE_MATMUL_FP64 The driver has been updated to enable them accordingly to the value of the target feature passed at command line. The SVE ACLE tests using the macros have been modified to work with the target feature instead of passing the macro at command line. Reviewers: sdesmalen, efriedma, c-rhodes, kmclaughlin, SjoerdMeijer, rengolin Subscribers: tschuett, kristof.beyls, rkruppe, psnobl, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D82623	2020-06-30 18:33:03 +00:00
Cullen Rhodes	d45cf9105b	[AArch64][SVE2] Guard while intrinsics on scalar bfloat feature macro Summary: `svwhilerw_bf16` and `svwhilewr_bf16` intrinsics use the scalar `bfloat16_t` type which is predicated on `__ARM_FEATURE_BF16_SCALAR_ARITHMETIC`. This patch changes the feature guard from `__ARM_FEATURE_SVE_BF16` to the scalar bfloat feature macro. The verify tests for `+bf16` are also removed in this patch. The purpose of these checks was to match the SVE2 ACLE tests that look for an implicit declaration warning if the feature isn't set. They worked when the intrinsics were guarded on `__ARM_FEATURE_SVE_BF16` as the `bfloat16_t` was guarded on a different macro, but with both the type and intrinsic guarded on the same macro an earlier error is triggered in the ACLE regarding the type and we don't get a warning as we do for SVE2. Reviewers: sdesmalen, fpetrogalli, kmclaughlin, rengolin, efriedma Reviewed By: sdesmalen, fpetrogalli Differential Revision: https://reviews.llvm.org/D82578	2020-06-26 10:25:42 +00:00
Francesco Petrogalli	7200fa38a9	[sve][acle] Add some C intrinsics for brain float types. Summary: The following intrinsics has been added: svuint16_t svcnt[_bf16]_m(svuint16_t inactive, svbool_t pg, svbfloat16_t op) svuint16_t svcnt[_bf16]_x(svbool_t pg, svbfloat16_t op) svuint16_t svcnt[_bf16]_z(svbool_t pg, svbfloat16_t op) svbfloat16_t svtbl[_bf16](svbfloat16_t data, svuint16_t indices) svbfloat16_t svtbl2[_bf16](svbfloat16x2_t data, svuint16_t indices) svbfloat16_t svtbx[_bf16](svbfloat16_t fallback, svbfloat16_t data, svuint16_t indices) Reviewers: c-rhodes, kmclaughlin, efriedma, sdesmalen, ctetreau Subscribers: tschuett, hiraditya, rkruppe, psnobl, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D82429	2020-06-25 16:31:01 +00:00
Sander de Smalen	fabe67728e	[AArch64][SVE] Enable __ARM_FEATURE_SVE macros. This patch enables the following macros when their corresponding target attributes are set: __ARM_FEATURE_SVE (+sve) __ARM_FEATURE_SVE2 (+sve2) __ARM_FEATURE_SVE2_AES (+sve2-aes) __ARM_FEATURE_SVE2_BITPERM (+sve2-bitperm) __ARM_FEATURE_SVE2_SHA3 (+sve2-sha3) __ARM_FEATURE_SVE2_SM4 (+sve2-sm4) This implies that the base SVE and SVE2 ACLE (00bet2) are now feature complete, meaning that all intrinsics are implemented in LLVM and Clang. Disclaimer: To implement the ACLE we have had to fix up many parts of LLVM to make it support scalable vectors. We have also used many target-specific intrinsics to reduce reliance on parts of LLVM where we know scalable vectors may not yet be handled properly (e.g. some transformation might drop the 'scalable' flag on a vector type). While we've done a best effort with the limited testing that is available to us, we're still working to improve the stability of the implementation. Additionally, Clang may print warnings that code may have miscompiled. We find this often to be a false alarm where the wrong interfaces have been used in LLVM and where resulting code is not actually incorrect. However, this warrants a bug report and investigation. If you find any bugs or issues, please raise them on bugs.llvm.org and let us know! Reviewers: rengolin, efriedma, david-arm, SjoerdMeijer Reviewed By: SjoerdMeijer Differential Revision: https://reviews.llvm.org/D81725	2020-06-25 08:14:19 +01:00
Cullen Rhodes	05e10ee0ae	[AArch64][SVE2] Add bfloat16 support to whilerw/whilewr intrinsics Reviewed By: fpetrogalli Differential Revision: https://reviews.llvm.org/D82399	2020-06-24 10:06:31 +00:00
Sander de Smalen	e51c1d06a9	[SveEmitter] Add builtins for svtbl2 Reviewers: david-arm, efriedma, c-rhodes Reviewed By: c-rhodes Tags: #clang Differential Revision: https://reviews.llvm.org/D81462	2020-06-17 09:41:38 +01:00
Sander de Smalen	4cad97595f	[SveEmitter] Add builtins for svmovlb and svmovlt These builtins are expanded in CGBuiltin to use intrinsics for (signed/unsigned) shift left long top/bottom. Reviewers: efriedma, SjoerdMeijer Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D79579	2020-05-11 09:41:58 +01:00
Sander de Smalen	96a581d0f0	[SveEmitter] Add builtins for SVE2 svtbx (extended table lookup) This patch adds builtins for: - svtbx	2020-05-07 16:15:57 +01:00
Sander de Smalen	e46043bba7	[SveEmitter] Add builtins for SVE2 Optional extensions (AES, SHA3, SM4, BITPERM) This patch adds various builtins under their corresponding feature macros: Defined under __ARM_FEATURE_SVE2_AES: - svaesd - svaese - svaesimc - svaesmc - svpmullb_pair - svpmullt_pair Defined under __ARM_FEATURE_SVE2_SHA3: - svrax1 Defined under __ARM_FEATURE_SVE2_SM4: - svsm4e - svsm4ekey Defined under __ARM_FEATURE_SVE2_BITPERM: - svbdep - svbext - svbgrp	2020-05-07 16:15:57 +01:00
Sander de Smalen	f22cdc3cc3	[SveEmitter] Add builtins for SVE2 Character match instructions This patch adds builtins for: - svmatch - svnmatch	2020-05-07 16:15:57 +01:00
Sander de Smalen	ae652241bd	[SveEmitter] Add builtins for SVE2 Vector histogram count instructions This patch adds builtins for: - svhistcnt - svhistseg	2020-05-07 16:15:57 +01:00
Sander de Smalen	fa0371f4fd	[SveEmitter] Add builtins for SVE2 Floating-point integer binary logarithm instructions This patch adds builtins for: - svlogb	2020-05-07 16:15:57 +01:00
Sander de Smalen	086722c18e	[SveEmitter] Add builtins for SVE2 Floating-point widening multiply-accumulate This patch adds builtins for: - svmlalb, svmlalb_lane - svmlalt, svmlalt_lane - svmlslb, svmlslb_lane - svmlslt, svmlslt_lane	2020-05-07 16:15:57 +01:00
Sander de Smalen	e76256e7c1	[SveEmitter] Add builtins for SVE2 Complex integer dot product This patch adds builtins for: - svcdot, svcdot_lane	2020-05-07 16:09:31 +01:00
Sander de Smalen	867bfae93f	[SveEmitter] Add builtins for SVE2 Widening complex integer arithmetic This patch adds builtins for: - svaddlbt - svqdmlalbt - svqdmlslbt - svsublbt - svsubltb	2020-05-07 16:09:31 +01:00
Sander de Smalen	f525820755	[SveEmitter] Add builtins for SVE2 Narrowing DSP operations This patch adds builtins for: - svaddhnb - svaddhnt - svqrshrnb - svqrshrnt - svqrshrunb - svqrshrunt - svqshrnb - svqshrnt - svqshrunb - svqshrunt - svqxtnb - svqxtnt - svqxtunb - svqxtunt - svraddhnb - svraddhnt - svrshrnb - svrshrnt - svrsubhnb - svrsubhnt - svshrnb - svshrnt - svsubhnb - svsubhnt	2020-05-07 16:09:31 +01:00
Sander de Smalen	b0b658e7fc	[SveEmitter] Add builtins for SVE2 Widening DSP operations This patch adds builtins for: - svabalb - svabalt - svabdlb - svabdlt - svaddlb - svaddlt - svaddwb - svaddwt - svmlalb, svmlalb_lane - svmlalt, svmlalt_lane - svmlslb, svmlslb_lane - svmlslt, svmlslt_lane - svmullb, svmullb_lane - svmullt, svmullt_lane - svqdmlalb, svqdmlalb_lane - svqdmlalt, svqdmlalt_lane - svqdmlslb, svqdmlslb_lane - svqdmlslt, svqdmlslt_lane - svqdmullb, svqdmullb_lane - svqdmullt, svqdmullt_lane - svshllb - svshllt - svsublb - svsublt - svsubwb - svsubwt	2020-05-07 16:09:31 +01:00
Sander de Smalen	ce7f50c2ce	[SveEmitter] Add builtins for SVE2 Uniform complex integer arithmetic This patch adds builtins for: - svcadd - svqcadd - svcmla - svcmla_lane - svqrdcmlah - svqrdcmlah_lane	2020-05-07 16:09:31 +01:00
Sander de Smalen	5e9bc21eea	[SveEmitter] Add builtins for SVE2 Multiplication by indexed elements This patch adds builtins for: - svmla_lane - svmls_lane - svmul_lane	2020-05-07 15:21:37 +01:00
Sander de Smalen	60615cfb43	[SveEmitter] Add builtins for SVE2 Large integer arithmetic This patch adds builtins for: - svadclb - svadclt - svsbclb - svsbclt	2020-05-07 15:21:37 +01:00
Sander de Smalen	36aab0c055	[SveEmitter] Add builtins for SVE2 Bitwise ternary logical instructions This patch adds builtins for: - svbcax - svbsl - svbsl1n - svbsl2n - sveor3 - svnbsl - svxar	2020-05-07 15:21:37 +01:00
Sander de Smalen	b0348af108	[SveEmitter] Add builtins for SVE2 widening pairwise arithmetic This patch adds builtins for: - svadalp	2020-05-07 15:21:37 +01:00
Sander de Smalen	7ff05002d0	[SveEmitter] Add builtins for SVE2 Non-widening pairwise arithmetic This patch adds builtins for: - svaddp - svmaxnmp - svmaxp - svminnmp - svminp	2020-05-07 15:21:37 +01:00
Sander de Smalen	0d22076531	[SveEmitter] Add builtins for SVE2 uniform DSP operations This patch adds builtins for: - svqdmulh, svqdmulh_lane - svqrdmlah, svqrdmlah_lane - svqrdmlsh, svqrdmlsh_lane - svqrdmulh, svqrdmulh_lane	2020-05-07 13:31:46 +01:00
Sander de Smalen	5fa0eeec6e	[SveEmitter] Add more SVE2 builtins for shift operations This patch adds builtins for: - svqshlu - svrshr - svrsra - svsli - svsra - svsri	2020-05-07 13:31:46 +01:00
Sander de Smalen	dc2986f9dc	[SveEmitter] Add builtins for SVE2 saturating shift left and addition This patch adds builtins for: - svqrshl - svqshl - svsqadd - svuqadd	2020-05-07 13:31:46 +01:00
Sander de Smalen	b32d14c30e	[SveEmitter] Add builtins for SVE2 uniform DSP operations This patch adds builtins for: - svqadd, svhadd, svrhadd - svqsub, svhsub, svqusbr, svhsubr - svqabs - svqneg - svrecpe - svrsqrte	2020-05-07 13:31:46 +01:00
Sander de Smalen	3cb8b4c193	[SveEmitter] Add builtins for SVE2 Polynomial arithmetic This patch adds builtins for: - sveorbt - sveortb - svpmul - svpmullb, svpmullb_pair - svpmullt, svpmullt_pair The svpmullb and svpmullt builtins are expressed using the svpmullb_pair and svpmullt_pair LLVM IR intrinsics, respectively. Reviewers: SjoerdMeijer, efriedma, rengolin Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D79480	2020-05-07 11:53:04 +01:00
Sander de Smalen	a5e0389b2a	[AArch64] Define ACLE FP conversion intrinsics with more specific predicate. This patch changes the FP conversion intrinsics to take a predicate that matches the number of lanes for the vector with the widest element type as opposed to using <vscale x 16 x i1>. For example: ```<vscale x 4 x float> @llvm.aarch64.sve.fcvt.f32f16(<vscale x 4 x float>, <vscale x 4 x i1>, <vscale x 8 x half>)``` now uses <vscale x 4 x i1> instead of <vscale x 16 x i1> And similar for: ```<vscale x 4 x float> @llvm.aarch64.sve.fcvt.f32f64(<vscale x 4 x float>, <vscale x 2 x i1>, <vscale x 2 x double>)``` where the predicate now matches the wider type, so <vscale x 2 x i1>. Reviewers: efriedma, SjoerdMeijer, paulwalker-arm, rengolin Reviewed By: efriedma Tags: #clang Differential Revision: https://reviews.llvm.org/D78402	2020-04-23 10:53:23 +01:00
Sander de Smalen	002164461b	[SveEmitter] Add builtins for FP conversions This adds the flag IsOverloadCvt which tells CGBulitin to use the result type and the type of the last operand as the overloaded types for the LLVM IR intrinsic. This also adds the flag IsFPConvert, which is needed to avoid converting the predicate of the operation from svbool_t to a predicate with fewer lanes, as the LLVM IR intrinsics use the <vscale x 16 x i1> as the predicate. Reviewers: SjoerdMeijer, efriedma Reviewed By: efriedma Tags: #clang Differential Revision: https://reviews.llvm.org/D78239	2020-04-23 10:49:06 +01:00
Sander de Smalen	2d1baf606a	[SveEmitter] Add builtins for svwhilerw/svwhilewr This also adds the IsOverloadWhileRW flag which tells CGBuiltin to use the result predicate type and the first pointer type as the overloaded types for the LLVM IR intrinsic. Reviewers: SjoerdMeijer, efriedma Reviewed By: efriedma Tags: #clang Differential Revision: https://reviews.llvm.org/D78238	2020-04-22 21:49:18 +01:00
Sander de Smalen	1559485e60	[SveEmitter] Add builtins for svwhile This also adds the IsOverloadWhile flag which tells CGBuiltin to use both the default type (predicate) and the type of the second operand (scalar) as the overloaded types for the LLMV IR intrinsic. Reviewers: SjoerdMeijer, efriedma, rovka Reviewed By: efriedma Tags: #clang Differential Revision: https://reviews.llvm.org/D77595	2020-04-22 21:47:47 +01:00
Andrzej Warzynski	72f565899d	[SveEmitter] Implement builtins for gathers/scatters This patch adds builtins for: * regular, first-faulting and non-temporal gather loads * regular and non-temporal scatter stores Differential Revision: https://reviews.llvm.org/D77735	2020-04-22 13:21:39 +01:00
Sander de Smalen	515020c091	[SveEmitter] Add more immediate operand checks. This patch adds a number of intrinsics that take immediates with varying ranges based on the element size one of the operands. svext: immediate ranging 0 to (2048/sizeinbits(elt) - 1) svasrd: immediate ranging 1..sizeinbits(elt) svqshlu: immediate ranging 1..sizeinbits(elt)/2 ftmad: immediate ranging 0..(sizeinbits(elt) - 1) Reviewers: efriedma, SjoerdMeijer, rovka, rengolin Reviewed By: SjoerdMeijer Tags: #clang Differential Revision: https://reviews.llvm.org/D76679	2020-04-20 14:41:58 +01:00

35 Commits