llvm-project

Commit Graph

Author	SHA1	Message	Date
vhscampos	f6e11a36c4	[ARM][AArch64] Implement __cls, __clsl and __clsll intrinsics from ACLE Summary: Writing support for three ACLE functions: unsigned int __cls(uint32_t x) unsigned int __clsl(unsigned long x) unsigned int __clsll(uint64_t x) CLS stands for "Count number of leading sign bits". In AArch64, these two intrinsics can be translated into the 'cls' instruction directly. In AArch32, on the other hand, this functionality is achieved by implementing it in terms of clz (count number of leading zeros). Reviewers: compnerd Reviewed By: compnerd Subscribers: kristof.beyls, hiraditya, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D69250	2019-10-28 11:06:58 +00:00
vhscampos	5d35b7d9e1	[ARM][AArch64] Implement __arm_rsrf, __arm_rsrf64, __arm_wsrf & __arm_wsrf64 Summary: Adding support for ACLE intrinsics. Patch by Michael Platings. Reviewers: chill, t.p.northover, efriedma Reviewed By: chill Subscribers: kristof.beyls, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D69297	2019-10-28 10:59:18 +00:00
Roman Lebedev	6fcd4e080f	[Clang][Codegen] Disable arm_acle.c test. This test is broken by design. Clang codegen tests should not depend on llvm middle-end behaviour, they should only test clang codegen. Yet this test runs whole optimization pipeline. I've really tried to fix it, but there isn't just a few things that depend on passes, but everything there does. llvm-svn: 372015	2019-09-16 17:46:08 +00:00
JF Bastien	dbc0a5df8d	Allow prefetching from non-zero address spaces Summary: This is useful for targets which have prefetch instructions for non-default address spaces. <rdar://problem/42662136> Subscribers: nemanjai, javed.absar, hiraditya, kbarton, jkorous, dexonsmith, cfe-commits, llvm-commits, RKSimon, hfinkel, t.p.northover, craig.topper, anemet Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D65254 llvm-svn: 367032	2019-07-25 16:11:57 +00:00
Kyrylo Tkachov	eb72138340	[AArch64] Implement __jcvt intrinsic from Armv8.3-A The jcvt intrinsic defined in ACLE [1] is available when ARM_FEATURE_JCVT is defined. This change introduces the AArch64 intrinsic, wires it up to the instruction and a new clang builtin function. The __ARM_FEATURE_JCVT macro is now defined when an Armv8.3-A or higher target is used. I've implemented the target detection logic in Clang so that this feature is enabled for architectures from armv8.3-a onwards (so -march=armv8.4-a also enables this, for example). make check-all didn't show any new failures. [1] https://developer.arm.com/docs/101028/latest/data-processing-intrinsics Differential Revision: https://reviews.llvm.org/D64495 llvm-svn: 366197	2019-07-16 09:27:39 +00:00
Leonard Chan	f66309203e	[clang][NewPM] Add -fno-experimental-new-pass-manager to tests As per the discussion on D58375, we disable test that have optimizations under the new PM. This patch adds -fno-experimental-new-pass-manager to RUNS that: - Already run with optimizations (-O1 or higher) that were missed in D58375. - Explicitly test new PM behavior along side some new PM RUNS, but are missing this flag if new PM is enabled by default. - Specify -O without the number. Based on getOptimizationLevel(), it seems the default is 2, and the IR appears to be the same when changed to -O2, so update the test to explicitly say -O2 and provide -fno-experimental-new-pass-manager`. Differential Revision: https://reviews.llvm.org/D63156 llvm-svn: 364066	2019-06-21 16:03:06 +00:00
Sam Parker	b9ea36f9c1	[ARM] ACLE Chapter 9 intrinsics Implemented the remaining integer data processing intrinsics from the ARM ACLE v2.1 spec, such as parallel arithemtic and DSP style multiplications. Differential Revision: https://reviews.llvm.org/D32282 llvm-svn: 302131	2017-05-04 08:37:59 +00:00
Chad Rosier	c22abb3820	[ARM] Use generic bitreverse intrinsic, rather than ARM specific rbit. The backend already supports lowering this intrinsic to a rbit instruction. llvm-svn: 291582	2017-01-10 18:55:11 +00:00
Chad Rosier	5a4a1be690	[AArch64] Use generic bitreverse intrinsic, rather than AArch64 specific. Differential Revision: https://reviews.llvm.org/D28400 llvm-svn: 291574	2017-01-10 17:20:28 +00:00
Oliver Stannard	7aa90f5735	[ARM,AArch64] Fix __rev16l and __rev16ll intrinsics These two intrinsics are defined in arm_acle.h. __rev16l needs to rotate by 16 bits, bit it was actually rotating by 2 bits. For AArch64, where long is 64 bits, this would still be wrong. __rev16ll was incorrect, it reversed the bytes in each 32-bit word, rather than each 16-bit halfword. The correct implementation is to apply __rev16 to the top and bottom words of the 64-bit value. For AArch32 targets, these get compiled down to the hardware rev16 instruction at -O1 and above. For AArch64 targets, the 64-bit ones get compiled to two 32-bit rev16 instructions, because there is not currently a pattern for the 64-bit rev16 instruction. Differential Revision: http://reviews.llvm.org/D14609 llvm-svn: 253211	2015-11-16 14:58:50 +00:00
Ranjeet Singh	e8accef866	[ARM] Replace hard coded metadata arguments in tests with a regex. Differential Revision: http://reviews.llvm.org/D10507 llvm-svn: 239932	2015-06-17 19:56:30 +00:00
Reid Kleckner	717820faa0	Wildcard out some SSA value names from the ACLE intrinsic test case llvm-svn: 239757	2015-06-15 20:55:43 +00:00
Luke Cheeseman	59b2d83909	This patch implements clang support for the ACLE special register intrinsics in section 10.1, __arm_{w,r}sr{,p,64}. This includes arm_acle.h definitions with builtins and codegen to support these, the intrinsics are implemented by generating read/write_register calls which get appropriately lowered in the backend based on the register string provided. SemaChecking is also implemented to fault invalid parameters. Differential Revision: http://reviews.llvm.org/D9697 llvm-svn: 239737	2015-06-15 17:51:01 +00:00
Yi Kong	623393f31e	arm_acle: Implement data processing intrinsics Summary: ACLE 2.0 section 9.2 defines the following "miscellaneous data processing intrinsics": `__clz`, `__cls`, `__ror`, `__rev`, `__rev16`, `__revsh` and `__rbit`. `__clz` has already been implemented in the arm_acle.h header file. The rest are not supported yet. This patch completes ACLE data processing intrinsics. Reviewers: t.p.northover, rengolin Reviewed By: rengolin Subscribers: aemerson, mroth, llvm-commits Differential Revision: http://reviews.llvm.org/D4983 llvm-svn: 216658	2014-08-28 09:44:07 +00:00
Yi Kong	6891746cd8	arm_acle: Add mappings for dbg intrinsic This completes all ACLE hint intrinsics. llvm-svn: 216453	2014-08-26 12:48:11 +00:00
Yi Kong	0705e0065e	arm_acle: Implement swap intrinsic Insert the LDREX/STREX instruction sequence specified in ARM ACLE 2.0, as SWP instruction is deprecated since ARMv6. llvm-svn: 216446	2014-08-26 09:50:54 +00:00
Yi Kong	45a09319bf	ARM: Add mappings for ACLE prefetch intrinsics Implement __pld, __pldx, __pli and __plix builtin intrinsics as specified in ARM ACLE 2.0. llvm-svn: 215599	2014-08-13 23:20:15 +00:00
Kevin Qin	110db6f2ad	[AArch64] Implement Clang CLI interface proposal about "-march". 1. Revert "Add default feature for CPUs on AArch64 target in Clang" at r210625. Then, all enabled feature will by passed explicitly by -target-feature in -cc1 option. 2. Get "-mfpu" deprecated. 3. Implement support of "-march". Usage is: -march=armv8-a+[no]feature For instance, "-march=armv8-a+neon+crc+nocrypto". Here "armv8-a" is necessary, and CPU names are not acceptable. Candidate features are fp, neon, crc and crypto. Where conflicting feature modifiers are specified, the right-most feature is used. 4. Implement support of "-mtune". Usage is: -march=CPU_NAME For instance, "-march=cortex-a57". This option will ONLY get micro-architectural feature enabled specifying to target CPU, like "+zcm" and "+zcz" for cyclone. Any architectural features WON'T be modified. 5. Change usage of "-mcpu" to "-mcpu=CPU_NAME+[no]feature", which is an alias to "-march={feature of CPU_NAME}+[no]feature" and "-mtune=CPU_NAME" together. Where this option is used in conjunction with -march or -mtune, those options take precedence over the appropriate part of this option. llvm-svn: 213353	2014-07-18 07:03:22 +00:00
Yi Kong	28d7b02687	ARM: Add ACLE memory barrier intrinsic mapping llvm-svn: 213261	2014-07-17 12:45:17 +00:00
Yi Kong	472e521cec	ARM: Add NOP intrinsic mapping in arm_acle.h llvm-svn: 212950	2014-07-14 15:32:29 +00:00
Saleem Abdulrasool	3b165e7dbb	tests: use a more precise target for tests llvm-svn: 212892	2014-07-12 23:40:53 +00:00
Saleem Abdulrasool	572250d60a	CodeGen: support hint intrinsics from ACLE on AArch64 This adds support for the ACLE hint intrinsics on AArch64 similar to ARM. This is required to properly support ACLE on AArch64. llvm-svn: 212890	2014-07-12 23:27:22 +00:00
Yi Kong	4e00ce7d0c	Improve comments of ARM ACLE header file and tests Include section number in ARM ACLE specification for easier navigation. llvm-svn: 212887	2014-07-12 22:48:13 +00:00
Renato Golin	47843efcf6	Add the __qdbl intrinsic to the arm_acle.h header Patch by: Moritz Roth llvm-svn: 212264	2014-07-03 10:14:52 +00:00
Alp Toker	f76e6d8e6b	Get arm_acle tests from r211962 working llvm-svn: 211979	2014-06-28 06:51:27 +00:00
Yi Kong	a44c4d7173	Introduce arm_acle.h supporting existing LLVM builtin intrinsics Summary: This patch introduces ACLE header file, implementing extensions that can be directly mapped to existing Clang intrinsics. It implements for both AArch32 and AArch64. Reviewers: t.p.northover, compnerd, rengolin Reviewed By: compnerd, rengolin Subscribers: rnk, echristo, compnerd, aemerson, mroth, cfe-commits Differential Revision: http://reviews.llvm.org/D4296 llvm-svn: 211962	2014-06-27 21:25:42 +00:00

26 Commits