llvm-project

Commit Graph

Author	SHA1	Message	Date
Paul Walker	7e06474f3c	[Clang] Remove bogus "REQUIRES arm-registered-target" from SVE ACLE tests. Many of the SVE ACLE tests have gained entries as follows: REQUIRES: aarch64-registered-target || arm-registered-target which can cause test failures when only arm-registered-target is available because only aarch64-registered-target supports SVE.	2021-12-01 18:45:38 +00:00
Saleem Abdulrasool	c17d9b4b12	headers: optionalise some generated resource headers This splits out the generated headers and conditionalises them upon the target being enabled. The motivation here is that the RISCV header alone added 10MB to the resource directory, which was previously at 10MB, increasing the build size and time. This header is contributing ~50% of the size of the resource headers (~10MB). The ARM generated headers are contributing about ~10% or 1MB. This could be extended further adding only the static resource headers for the targets that the LLVM build supports. The changes to the tests for ARM mirror what the RISCV target already did and rnk identified as a possible issue. Testing: cmake -G Ninja -D LLVM_TARGETS_TO_BUILD=X86 -D LLVM_ENABLE_PROJECTS="clang;lld" ../clang ninja check-clang Differential Revision: https://reviews.llvm.org/D112890 Reviewed By: craig.topper	2021-11-09 22:30:29 +00:00
Sander de Smalen	fabe67728e	[AArch64][SVE] Enable __ARM_FEATURE_SVE macros. This patch enables the following macros when their corresponding target attributes are set: __ARM_FEATURE_SVE (+sve) __ARM_FEATURE_SVE2 (+sve2) __ARM_FEATURE_SVE2_AES (+sve2-aes) __ARM_FEATURE_SVE2_BITPERM (+sve2-bitperm) __ARM_FEATURE_SVE2_SHA3 (+sve2-sha3) __ARM_FEATURE_SVE2_SM4 (+sve2-sm4) This implies that the base SVE and SVE2 ACLE (00bet2) are now feature complete, meaning that all intrinsics are implemented in LLVM and Clang. Disclaimer: To implement the ACLE we have had to fix up many parts of LLVM to make it support scalable vectors. We have also used many target-specific intrinsics to reduce reliance on parts of LLVM where we know scalable vectors may not yet be handled properly (e.g. some transformation might drop the 'scalable' flag on a vector type). While we've done a best effort with the limited testing that is available to us, we're still working to improve the stability of the implementation. Additionally, Clang may print warnings that code may have miscompiled. We find this often to be a false alarm where the wrong interfaces have been used in LLVM and where resulting code is not actually incorrect. However, this warrants a bug report and investigation. If you find any bugs or issues, please raise them on bugs.llvm.org and let us know! Reviewers: rengolin, efriedma, david-arm, SjoerdMeijer Reviewed By: SjoerdMeijer Differential Revision: https://reviews.llvm.org/D81725	2020-06-25 08:14:19 +01:00
Sander de Smalen	1a720d49dc	[SveEmitter] Add builtins for various FP operations Unary: - svexpa, svtmad, svtsmul, svtssel, svscale, svrecpe, svrecps, svrsqrte, svrsqrts, Binary: - svabd, svadd, svdiv, svdivr, svmin, svmax, svminnm, svmaxnm, svmul, svmulx, svsub, svsubr, svmul_lane Complex: - svcadd, svcmla	2020-05-01 17:37:43 +01:00
Sander de Smalen	fc64539749	[SveEmitter] Add immediate checks for lanes and complex imms Adds another bunch of intrinsics that take immediates with varying ranges based, some being a complex rotation immediate which are a set of allowed immediates rather than a range. svmla_lane: lane immediate ranging 0..(128/(1*sizeinbits(elt)) - 1) svcmla_lane: lane immediate ranging 0..(128/(2*sizeinbits(elt)) - 1) svdot_lane: lane immediate ranging 0..(128/(4*sizeinbits(elt)) - 1) svcadd: complex rotate immediate [90, 270] svcmla: svcmla_lane: complex rotate immediate [0, 90, 180, 270] Reviewers: efriedma, SjoerdMeijer, rovka Reviewed By: efriedma Tags: #clang Differential Revision: https://reviews.llvm.org/D76680	2020-04-20 15:10:54 +01:00
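
The lane and rotation ranges listed in the message of fc64539749 can be worked through with a small sketch. This is an illustration in Python, not the SveEmitter implementation: the function names and the `GROUP_SIZE` and `ROTATIONS` tables are hypothetical, and which element type `sizeinbits(elt)` refers to for a given intrinsic is left to the caller.

```python
# Sketch of the immediate ranges described in the commit message.
# All names here are hypothetical; only the formulas and the allowed
# rotation sets are taken from the message itself.

SEGMENT_BITS = 128  # the constant 128 in the commit's formula

# Multiplier applied to sizeinbits(elt) for each lane-indexed intrinsic.
GROUP_SIZE = {"svmla_lane": 1, "svcmla_lane": 2, "svdot_lane": 4}

# Complex rotation immediates form a set of allowed values, not a range.
ROTATIONS = {
    "svcadd": {90, 270},
    "svcmla": {0, 90, 180, 270},
    "svcmla_lane": {0, 90, 180, 270},
}


def max_lane_index(intrinsic: str, elt_bits: int) -> int:
    """Return the upper bound 128 / (k * sizeinbits(elt)) - 1."""
    group_bits = GROUP_SIZE[intrinsic] * elt_bits
    if group_bits <= 0 or SEGMENT_BITS % group_bits != 0:
        raise ValueError(f"{group_bits} bits does not divide {SEGMENT_BITS}")
    return SEGMENT_BITS // group_bits - 1


def is_valid_lane(intrinsic: str, elt_bits: int, lane: int) -> bool:
    """Check that a lane immediate lies in 0..max_lane_index."""
    return 0 <= lane <= max_lane_index(intrinsic, elt_bits)


def is_valid_rotation(intrinsic: str, rotation: int) -> bool:
    """Check that a rotation immediate is one of the allowed values."""
    return rotation in ROTATIONS[intrinsic]
```

For example, with 32-bit elements `max_lane_index("svmla_lane", 32)` evaluates to 3 and `max_lane_index("svcmla_lane", 32)` to 1, while `is_valid_rotation("svcadd", 180)` is false because svcadd accepts only 90 and 270.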

5 Commits