llvm-project

Commit Graph

Author	SHA1	Message	Date
Kevin Qin	ad64f6d4e5	[AArch64] Change int64_t from 'long long int' to 'long int' for AArch64 target. Most 64-bit targets define int64_t as long int, and AArch64 should make same definition to follow LP64 model. In GNU tool chain, int64_t is defined as long int for 64-bit target. So to get consistent with GNU, it's better Changing int64_t from 'long long int' to 'long int', otherwise clang will get different name mangling suffix compared with g++. llvm-svn: 202004	2014-02-24 02:45:03 +00:00
Tim Northover	db3e5e2408	AArch64: look up EmitAArch64Scalar support before calling. This fixes one immediate bug where an expression with side-effects could be emitted twice during a NEON call. It also prepares the way for folding CodeGen for many of the SISD intrinsics into a table, reducing code size and hopefully increasing performance eventually ("binary search + few switch cases" should be better than "lots of switch cases"). llvm-svn: 201667	2014-02-19 11:55:06 +00:00
Tim Northover	2163a0e497	ARM & AArch64: move struct definition outside function. Apparently it's not True C++. rdar://problem/16035743 still. llvm-svn: 201663	2014-02-19 10:56:23 +00:00
Tim Northover	544e79eb30	ARM NEON: use more flexible TableGen field for defs. We used to have special handling for isCrypto and isA64 bits in the NeonEmitter.cpp file (it knew the former was predicated on __ARM_FEATURE_CRYPTO and the latter on __aarch64__ and went through various contortions to make sure the correct intrinsics were emitted under the correct guard. This is ugly and has obvious scalability problems (e.g. vcvtX intrinsics are needed, which are ARMv8 only but available on both, yet another category). This patch moves the #if predicate into the arm_neon.td file directly and makes NeonEmitter.cpp agnostic about what goes in there. It also deduplicates arm_neon.td so that each desired intrinsic is mentioned in just one place (necessary because of the new mechanism for creating arm_neon.h). rdar://problem/16035743 llvm-svn: 201660	2014-02-19 10:37:09 +00:00
Tim Northover	12670418a3	ARM & AArch64: merge the semantic checking of NEON intrinsics There are two kinds of automatically generated tests for NEON intrinsics, both of which can be merged without adversely affecting users. 1. We check that a valid kind of __builtin_neon_XYZ overload is requested (e.g. we're not asking for a float32x4_t version when it only accepts integers. Since the __builtin_neon_XYZ intrinsics should only be used in arm_neon.h, relaxing this test and permitting AArch64 types for AArch32 should not cause a problem. The extra arm_neon.h definitions should be #ifdefed out anyway. 2. We check that intrinsics which take immediates are actually given compile-time constants within range. Since all NEON intrinsics should be backwards compatible, these tests should be identical on AArch64 and AArch32 anyway. This patch, therefore, merges the separate AArch64 and 32-bit checks. rdar://problem/16035743 llvm-svn: 201659	2014-02-19 10:37:05 +00:00
Tim Northover	26562486ad	Whitespace cleanup (mostly stray tabs, a few not-quite-empty lines). llvm-svn: 201234	2014-02-12 12:56:48 +00:00
Tim Northover	3402dc7d52	ARM NEON: fix range checking on immediates. Previously, range checking on the __builtin_neon_XYZ_v Clang intrinsics didn't take account of the type actually passed to the call, which meant a request like "vext_s16(a, b, 7)" was allowed through (TableGen was conservative and allowed 0-7 for all types). This caused an assert in the backend because the lane doesn't make sense. llvm-svn: 201232	2014-02-12 12:04:59 +00:00
Ana Pazos	9883d6d2b5	[AArch64] Fixed vget/vset_lane_f16 implementation Replaced cast and vreinterepret operations with code to reinterpret bitwise the types float16_t and int16_t. llvm-svn: 201112	2014-02-10 21:20:53 +00:00
Tim Northover	02e38609e7	ARM: implement support for crypto intrinsics in arm_neon.h llvm-svn: 200708	2014-02-03 17:28:04 +00:00
Tim Northover	c322f838bc	ARM & AArch64: share the BI__builtin_neon enum defs. llvm-svn: 200470	2014-01-30 14:47:51 +00:00
Jiangning Liu	bb59b3daa9	For AArch64 Neon, fix intrinsics implementation using nested macros. llvm-svn: 200114	2014-01-26 03:38:42 +00:00
Kevin Qin	fb79d7f843	[AArch64 NEON] Support poly128_t and implement relevant intrinsic. llvm-svn: 196888	2013-12-10 06:49:01 +00:00
Ana Pazos	6a8b8b5f0d	Implemented vget/vset_lane_f16 intrinsics llvm-svn: 196535	2013-12-05 21:13:24 +00:00
Hao Liu	a5246fde90	[AArch64]Add missing floating point convert, round and misc intrinsics. E.g. int64x1_t vcvt_s64_f64(float64x1_t a) -> FCVTZS Dd, Dn llvm-svn: 196211	2013-12-03 06:07:13 +00:00
Hao Liu	4b850c5e0d	revert r196152. This is a duplicate implementation. E.g. this patch defines: float64_t vabd_f64(float64_t a, float64_t b) But there is already a similar intrinsic "vabdd_f64" with the same types. Also, this intrinsic will be conflicted to the vector type intrinsic as following(Which is implemented by me and will be committed to trunk): float64x1_t vabd_f64(float64x1_t a, float64x1_t b). Two functions shouldn't have a same name in arm_neon.h. According to ARM ACLE document, such vabd_f64 with float64_t is not existing. So I revert this commit. llvm-svn: 196205	2013-12-03 05:35:17 +00:00
Jiangning Liu	cc1da2c938	Add some missing AArch64 Neon intrinsics like vmull_high_n_s16 and friends. llvm-svn: 196189	2013-12-03 01:28:55 +00:00
Chad Rosier	b0574f3bf7	[AArch64] Add missing NEON scalar floating-point to integer convert ACLEs. llvm-svn: 196152	2013-12-02 21:07:24 +00:00
Hao Liu	46a6ed9e64	AArch64: Two intrinsics are expected to return float64 not float32 in arm_neon.h llvm-svn: 195943	2013-11-29 02:31:42 +00:00
Hao Liu	8a0099e02c	Fix the problem that the range check for scalar narrow shift is too wide. E.g. the immediate value of vshrns_n_s16 is [1,16], which should be [1,8]. llvm-svn: 195942	2013-11-29 02:13:17 +00:00
Jiangning Liu	ee3e08799c	Fix the AArch64 NEON bug exposed by checking constant integer argument range of ACLE intrinsics. llvm-svn: 195844	2013-11-27 14:02:55 +00:00
Alp Toker	965f882588	Remove a whole lot of unused variables There are about 30 removed in this patch, generated by a new FixIt I haven't got round to submitting yet. llvm-svn: 195814	2013-11-27 05:22:15 +00:00
Tim Northover	5bb34ca4df	ARM: define & use __ARM_NEON on ARM32 (as per ACLE) There seem to be quite a few references to the old macro __ARM_NEON__ on the internet, so I don't think it's a good idea to remove it entirely (at least yet), but the canonical name does not have the trailing underscores so we should use that ourselves. llvm-svn: 195353	2013-11-21 12:36:34 +00:00
Ana Pazos	2b02688fd9	Implemented Neon scalar by element intrinsics. Intrinsics implemented: vqdmull_lane, vqdmulh_lane, vqrdmulh_lane, vqdmlal_lane, vqdmlsl_lane scalar Neon intrinsics. llvm-svn: 195326	2013-11-21 07:36:33 +00:00
Jiangning Liu	3311f374a8	Add predicate for AArch64 crypto instructions. llvm-svn: 195069	2013-11-19 01:38:19 +00:00
Jiangning Liu	c8b0a1ad95	Clean up predefined macros for AArch64 to follow ACLE 2.0. llvm-svn: 195068	2013-11-19 01:33:17 +00:00
Hao Liu	5e4ce1ae9d	Implement the newly added AArch64 ACLE functions for ld1/st1 with 2/3/4 vectors. The functions are like: vst1_s8_x2 ... llvm-svn: 194991	2013-11-18 06:33:43 +00:00
Ana Pazos	6f2a47a9e5	Implemented aarch64 Neon scalar vmulx_lane intrinsics Implemented aarch64 Neon scalar vfma_lane intrinsics Implemented aarch64 Neon scalar vfms_lane intrinsics Implemented legacy vmul_n_f64, vmul_lane_f64, vmul_laneq_f64 intrinsics (v1f64 parameter type) using Neon scalar instructions. Implemented legacy vfma_lane_f64, vfms_lane_f64, vfma_laneq_f64, vfms_laneq_f64 intrinsics (v1f64 parameter type) using Neon scalar instructions. llvm-svn: 194889	2013-11-15 23:33:31 +00:00
Kevin Qin	caac85e612	[AArch64 neon] support poly64 and relevant intrinsic functions. llvm-svn: 194660	2013-11-14 03:29:16 +00:00
Kevin Qin	1718af6f0a	Implement aarch64 neon instruction class misc. llvm-svn: 194657	2013-11-14 02:45:18 +00:00
Jiangning Liu	18b707cb3f	Implement AArch64 NEON instruction set AdvSIMD (table). llvm-svn: 194649	2013-11-14 01:57:55 +00:00
Chad Rosier	249c714bb4	[AArch64] Add support for NEON scalar floating-point convert to fixed-point instructions. llvm-svn: 194395	2013-11-11 18:04:22 +00:00
Jiangning Liu	c628af66c7	Implement AArch64 Neon instruction set Perm. llvm-svn: 194124	2013-11-06 03:35:53 +00:00
Kevin Qin	9eece7b5e0	Implemented aarch64 neon intrinsic vcopy_lane with float type. llvm-svn: 194042	2013-11-05 02:05:44 +00:00
Chad Rosier	bdca387884	[AArch64] Add support for NEON scalar shift immediate instructions. llvm-svn: 193791	2013-10-31 19:29:05 +00:00
Chad Rosier	4d55e6e0a4	[AArch64] Add support for NEON scalar floating-point compare instructions. llvm-svn: 193692	2013-10-30 15:20:07 +00:00
Chad Rosier	3c03dee1d1	[AArch64] Add support for NEON scalar extract narrow instructions. llvm-svn: 192971	2013-10-18 14:03:36 +00:00
Kevin Qin	f22bf50443	Implemented aarch64 SIMD copy related ACLE intrinsic : vget_lane, vset_lane, vcopy_lane, vcreate, vdup_n, vdup_lane, vmov_n. llvm-svn: 192411	2013-10-11 02:34:30 +00:00
Chad Rosier	0babda4b9c	[AArch64] Add support for NEON scalar signed/unsigned integer to floating-point convert instructions. llvm-svn: 192232	2013-10-08 20:43:46 +00:00
Jiangning Liu	b96ebac02b	Implement aarch64 neon instruction set AdvSIMD (Across). llvm-svn: 192029	2013-10-05 08:22:55 +00:00
Jiangning Liu	4617e9dc85	Implement aarch64 neon instruction set AdvSIMD (3V elem). llvm-svn: 191945	2013-10-04 09:21:17 +00:00
Jiangning Liu	1bda93a252	Implement aarch64 neon instruction set AdvSIMD (3V Diff), covering the following 26 instructions, SADDL, UADDL, SADDW, UADDW, SSUBL, USUBL, SSUBW, USUBW, ADDHN, RADDHN, SABAL, UABAL, SUBHN, RSUBHN, SABDL, UABDL, SMLAL, UMLAL, SMLSL, UMLSL, SQDMLAL, SQDMLSL, SMULL, UMULL, SQDMULL, PMULL llvm-svn: 190289	2013-09-09 02:21:08 +00:00
Hao Liu	b1852eed38	Inplement aarch64 neon instructions in AdvSIMD(shift). About 24 shift instructions: sshr,ushr,ssra,usra,srshr,urshr,srsra,ursra,sri,shl,sli,sqshlu,sqshl,uqshl,shrn,sqrshr$ and 4 convert instructions: scvtf,ucvtf,fcvtzs,fcvtzu llvm-svn: 189926	2013-09-04 09:29:13 +00:00
Kevin Qin	c076d0682b	mangle aarch64 Neon ACLE scalar instrinsic name with BHSD suffix. llvm-svn: 189574	2013-08-29 07:55:15 +00:00
Hao Liu	4efa1402fe	Clang and AArch64 backend patches to support shll/shl and vmovl instructions and ACLE functions llvm-svn: 188452	2013-08-15 08:26:30 +00:00
Tim Northover	2fe823a6c3	AArch64: initial NEON support Patch by Ana Pazos - Completed implementation of instruction formats: AdvSIMD three same AdvSIMD modified immediate AdvSIMD scalar pairwise - Completed implementation of instruction classes (some of the instructions in these classes belong to yet unfinished instruction formats): Vector Arithmetic Vector Immediate Vector Pairwise Arithmetic - Initial implementation of instruction formats: AdvSIMD scalar two-reg misc AdvSIMD scalar three same - Intial implementation of instruction class: Scalar Arithmetic - Initial clang changes to support arm v8 intrinsics. Note: no clang changes for scalar intrinsics function name mangling yet. - Comprehensive test cases for added instructions To verify auto codegen, encoding, decoding, diagnosis, intrinsics. llvm-svn: 187568	2013-08-01 09:23:19 +00:00
Michael Gottesman	1169a52c83	[NeonIntrinsicTestEmitter] vld1/vst1 do not require the :64 hint. llvm-svn: 184786	2013-06-24 21:25:39 +00:00
Michael Gottesman	c6b5e56c19	[NeonIntrinsicTestEmitter] Fix incorrect FileCheck pattern where we were expecting a ',' prefix to alignment hints. llvm-svn: 184785	2013-06-24 21:25:37 +00:00
Michael Gottesman	d95c49a91c	[NeonIntrinsicTestEmitter] Add requirement to arm neon intrinsic tests for the feature long_tests. This will prevent the tests from running on normal make check. You will need to actually pass in --param run_long_tests=true to LIT in order to run these. llvm-svn: 184784	2013-06-24 21:25:34 +00:00
Jim Grosbach	d10f1c04aa	ARM: Improve codegen for vget_low_* and vget_high_ intrinsics. These intrinsics use the __builtin_shuffle() function to extract the low and high half, respectively, of a 128-bit NEON vector. Currently, they're defined to use bitcasts to simplify the emitter, so we get code like: uint16x4_t vget_low_u32(uint16x8_t __a) { return (uint32x2_t) __builtin_shufflevector((int64x2_t) __a, (int64x2_t) __a, 0); } While this works, it results in those bitcasts going all the way through to the IR, resulting in code like: %1 = bitcast <8 x i16> %in to <2 x i64> %2 = shufflevector <2 x i64> %1, <2 x i64> undef, <1 x i32> %zeroinitializer %3 = bitcast <1 x i64> %2 to <4 x i16> We can instead easily perform the operation directly on the input vector like: uint16x4_t vget_low_u16(uint16x8_t __a) { return __builtin_shufflevector(__a, __a, 0, 1, 2, 3); } Not only is that much easier to read on its own, it also results in cleaner IR like: %1 = shufflevector <8 x i16> %in, <8 x i16> undef, <4 x i32> <i32 0, i32 1, i32 2, i32 3> This is both easier to read and easier for the back end to reason about effectively since the operation is obfuscating the source with bitcasts. rdar://13894163 llvm-svn: 181865	2013-05-15 02:40:04 +00:00
Michael Gottesman	3508389233	[neonemitter tests] Change triple of emitted tests to thumbv7s to match the target cpu being swift. Also specify the target-abi to apcs-gnu. llvm-svn: 180233	2013-04-25 00:10:14 +00:00
Michael Gottesman	6cd3e560fd	[6/6] ARM Neon Intrinsic Tablegen Test Generator. Added GenerateChecksForIntrinsic method to generate FileCheck patterns for generated arm neon tests. Reviewed by Bob Wilson. llvm-svn: 179644	2013-04-16 23:00:26 +00:00
Michael Gottesman	1d712fe52d	[5/6] ARM Neon Intrinsic Tablegen Test Generator. Changed the test generation target cpu type from cortex-a9 to swift. Reviewed by Bob Wilson. llvm-svn: 179642	2013-04-16 22:55:01 +00:00
Michael Gottesman	d44c8f7d20	[4/6] ARM Neon Intrinsic Tablegen Test Generator. Added code to NeonEmitter::runTests so that GenTest gets all of the needed arguments to invoke the neon test generation methods. Reviewed by Bob Wilson. llvm-svn: 179640	2013-04-16 22:48:52 +00:00
Michael Gottesman	095c58f1c4	[3/6] ARM Neon Intrinsic Tablegen Test Generator. Refactored out the method InstructionTypeCode from MangleName for use in further patches which perform neon tablegen test generation. Reviewed by Bob Wilson. llvm-svn: 179636	2013-04-16 22:07:30 +00:00
Michael Gottesman	fc89cc2a91	[2/6] ARM Neon Intrinsic Tablegen Test Generator. This patch causes OpInst records to be silently identified with their Non-Op inst counterparts so that the same test generation infrastructure can be used to generate tests. Reviewed by Bob Wilson. llvm-svn: 179628	2013-04-16 21:18:42 +00:00
Bob Wilson	2b59395d0e	Define Neon intrinsics as "static inline" to avoid warning. rdar://13108414 We had been defining Neon intrinsics as "static" with always_inline attributes. If you use them from an extern inline function, you get a warning, e.g.: static function 'vadd_u8' is used in an inline function with external linkage This change simply adds the inline keyword to avoid that warning. llvm-svn: 179406	2013-04-12 20:17:20 +00:00
Joerg Sonnenberger	691a16b444	Don't throw exceptions in clang-tblgen by switching to PrintFatalError. Add locations in a number of places, where they are available for free. llvm-svn: 166691	2012-10-25 16:37:08 +00:00
Richard Smith	f44b8ee52e	Placate the mingw32 buildbot by suffixing 64-bit constants with ULL. llvm-svn: 161831	2012-08-14 03:55:16 +00:00
Richard Smith	7d6d47b862	Fix undefined behavior (and wrong code, as far as I can tell) in NEON builtin tablegen code, found by -fcatch-undefined-behavior. I would appreciate if someone more familiar with the NEON code could point me in the direction of how to write a test for this. We appear to have essentially no test coverage whatsoever for these builtins. llvm-svn: 161827	2012-08-14 01:28:02 +00:00
Jim Grosbach	cc6b1816fd	TableGen: Remove extraneous \ character from arm_neon.h definitions. llvm-svn: 161244	2012-08-03 17:30:46 +00:00
Jakob Stoklund Olesen	995e0e13fa	Make clang-tblgen backends functions instead of TableGenBackends. Get rid of a bunch of header files. TableGen output should be unaffected. Patch by Sean Silva! llvm-svn: 158388	2012-06-13 05:12:41 +00:00
Jim Grosbach	6acd46f5e9	TableGen: Remove extraneous '\' at EOL in generated tests. llvm-svn: 157700	2012-05-30 18:18:29 +00:00
Jim Grosbach	6f855e3024	ARM: Support marking intrinsic definitions as 'unavailable' llvm-svn: 156490	2012-05-09 18:17:30 +00:00
David Blaikie	8a40f700e6	Remove unreachable code in Clang. (replace with llvm_unreachable where appropriate or when GCC requires it) llvm-svn: 148292	2012-01-17 06:56:22 +00:00
Bob Wilson	bd646de67f	Relax type checking for a few Neon intrinsics. <rdar://problem/10538555> Not long ago, I tightened up the type checking for pointer arguments of Neon intrinsics to match the specifications provided by ARM. One consequence was that it became impossible to access the unaligned versions of a few Neon load/store operations. Since there are just a few of these intrinsics where it makes a difference, I think it's better to relax the type checking than to either introduce new non-standard unaligned intrinsics or to disallow intrinsics for the unaligned operations. llvm-svn: 146963	2011-12-20 06:16:48 +00:00
Bob Wilson	89d14247ff	Fix Neon builtin pointer argument checking for "sret" builtins. The code for checking Neon builtin pointer argument types was assuming that there would only be one pointer argument. But, for vld2-4 builtins, the first argument is a special sret pointer where the result will be stored. So, instead of scanning all the arguments to find a pointer, have TableGen figure out the index of the pointer argument that needs checking. That's better than scanning all the arguments regardless. <rdar://problem/10448804> llvm-svn: 144834	2011-11-16 21:32:23 +00:00
Bob Wilson	e4d7723b87	Check pointer types for arguments of Neon load/store macros. rdar://9958031 The Neon load/store intrinsics need to be implemented as macros to avoid hiding alignment attributes on the pointer arguments, and the macros can only evaluate those pointer arguments once (in case they have side effects), so it has been hard to get the right type checking for those pointers. I tried various alternatives in the arm_neon.h header, but it's much more straightforward to just check directly in Sema. llvm-svn: 144075	2011-11-08 05:04:11 +00:00
Bob Wilson	98bc98caa8	Clean up type flags for overloaded Neon builtins. No functional change. This patch just adds a simple NeonTypeFlags class to replace the various hardcoded constants that had been used until now. Unfortunately I couldn't figure out a good way to avoid duplicating that class between clang and TableGen, but since it's small and rarely changes, that's not so bad. llvm-svn: 144054	2011-11-08 01:16:11 +00:00
Bob Wilson	3b476aec6d	Add __nodebug__ attribute to functions in arm_neon.h This matches what we do for Intel vector intrinsics. <rdar://problem/10280207> llvm-svn: 141958	2011-10-14 16:55:33 +00:00
Peter Collingbourne	bee583fd6e	Add the Clang tblgen backends to Clang, and flip the switch to cause the build systems to use clang-tblgen. llvm-svn: 141291	2011-10-06 13:03:08 +00:00

1 2 3

120 Commits