llvm-project

Commit Graph

Author	SHA1	Message	Date
Alexey Samsonov	4c1a96f519	Propagate SanitizerKind into CodeGenFunction::EmitCheck() call. Make sure CodeGenFunction::EmitCheck() knows which sanitizer it emits check for. Make CheckRecoverableKind enum an implementation detail and move it away from header. Currently CheckRecoverableKind is determined by the type of sanitizer ("unreachable" and "return" are unrecoverable, "vptr" is always-recoverable, all the rest are recoverable). This will change in future if we allow to specify which sanitizers are recoverable, and which are not by -fsanitize-recover= flag. No functionality change. llvm-svn: 221635	2014-11-10 22:27:30 +00:00
Alexey Samsonov	edf99a92c0	Introduce a SanitizerKind enum to LangOptions. Use the bitmask to store the set of enabled sanitizers instead of a bitfield. On the negative side, it makes syntax for querying the set of enabled sanitizers a bit more clunky. On the positive side, we will be able to use SanitizerKind to eventually implement the new semantics for -fsanitize-recover= flag, that would allow us to make some sanitizers recoverable, and some non-recoverable. No functionality change. llvm-svn: 221558	2014-11-07 22:29:38 +00:00
Reid Kleckner	06ea7d6213	Lower __builtin_fabs* to @llvm.fabs.* mingw64's headers implement fabs by calling __builtin_fabs, so using the library call results in an infinite loop. If the backend legalizes @llvm.fabs as a call to fabs later, things should work out, as the crt provides a definition. llvm-svn: 221206	2014-11-03 23:52:09 +00:00
Reid Kleckner	4cad00abf3	Remove dead AST type argument to EmitFAbs llvm-svn: 221205	2014-11-03 23:51:40 +00:00
Alexey Samsonov	035462c1cf	Get rid of SanitizerOptions::Disabled global. NFC. SanitizerOptions is not even a POD now, so having global variable of this type, is not nice. Instead, provide a regular constructor and clear() method, and let each CodeGenFunction has its own copy of SanitizerOptions it uses. llvm-svn: 220920	2014-10-30 19:33:44 +00:00
Saleem Abdulrasool	a25fbef088	CodeGen: add __readfsdword builtin The Windows NT SDK uses __readfsdword and declares it as a compiler provided builtin (#pragma intrinsic(__readfsword). Because intrin.h is not referenced by winnt.h, it is not possible to provide an out-of-line definition for the intrinsic. Provide a proper compiler builtin definition. llvm-svn: 220859	2014-10-29 16:35:41 +00:00
Matt Arsenault	2174a9dc28	R600: Update for div_fmas intrinsic change llvm-svn: 220339	2014-10-21 22:21:41 +00:00
Hal Finkel	d2208b59cf	Add __sync_fetch_and_nand (again) Prior to GCC 4.4, __sync_fetch_and_nand was implemented as: { tmp = ptr; ptr = ~tmp & value; return tmp; } but this was changed in GCC 4.4 to be: { tmp = ptr; ptr = ~(tmp & value); return tmp; } in response to this change, support for sync_fetch_and_nand (and sync_nand_and_fetch) was removed in r99522 in order to avoid miscompiling code depending on the old semantics. However, at this point: 1. Many years have passed, and the amount of code relying on the old semantics is likely smaller. 2. Through the work of many contributors, all LLVM backends have been updated such that "atomicrmw nand" provides the newer GCC 4.4+ semantics (this process was complete July of 2014 (added to the release notes in r212635). 3. The lack of this intrinsic is now a needless impediment to porting codes from GCC to Clang (I've now seen several examples of this). It is true, however, that we still set GNUC_MINOR to 2 (corresponding to GCC 4.2). To compensate for this, and to address the original concern regarding code relying on the old semantics, I've added a warning that specifically details the fact that the semantics have changed and that we provide the newer semantics. Fixes PR8842. llvm-svn: 218905	2014-10-02 20:53:50 +00:00
Jan Vesely	b4379f9c2c	CGBuiltin: Use frem instruction rather than libcall to implement fmod AFAICT the semantics of frem match libm's fmod. Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Tom Stellard <tom@stellard.net> llvm-svn: 218488	2014-09-26 01:19:41 +00:00
Hal Finkel	bcc06085a8	Add __builtin_assume and __builtin_assume_aligned using @llvm.assume. This makes use of the recently-added @llvm.assume intrinsic to implement a __builtin_assume(bool) intrinsic (to provide additional information to the optimizer). This hooks up __assume in MS-compatibility mode to mirror __builtin_assume (the semantics have been intentionally kept compatible), and implements GCC's __builtin_assume_aligned as assume((p - o) & mask == 0). LLVM now contains special logic to deal with assumptions of this form. llvm-svn: 217349	2014-09-07 22:58:14 +00:00
James Molloy	163b1ba471	[ARMv8] Add support for 32-bit MIN/MAXNM and directed rounding. This patch adds support for the 32bit numeric max/min and directed round-to-integral NEON intrinsics that were added as part of v8, along with unit tests. Patch by Graham Hunter! llvm-svn: 217242	2014-09-05 13:50:34 +00:00
Tom Stellard	c4e0c1075b	CGBuiltin: Use @llvm.fabs rather than fabs libcall when emitting builtins Using the intrinsic allows the SelectionDAGBuilder to turn this call into the FABS Node and also the intrinsic is something the vectorizer knows how to vectorize. This patch also sets the readnone attribute on this call, which should enable additional optmizations. llvm-svn: 217042	2014-09-03 15:24:29 +00:00
Craig Topper	5fc8fc2d31	Simplify creation of a bunch of ArrayRefs by using None, makeArrayRef or just letting them be implicitly created. llvm-svn: 216528	2014-08-27 06:28:36 +00:00
Yi Kong	1d268af094	ARM: Add dbg builtin intrinsic llvm-svn: 216452	2014-08-26 12:48:06 +00:00
Hal Finkel	6208251923	Implement __builtin_signbitl for PowerPC PowerPC uses the special PPC_FP128 type for long double on Linux, which is composed of two 64-bit doubles. The higher-order double (which contains the overall sign) comes first, and so the __builtin_signbitl implementation requires special handling to extract the sign bit. Fixes PR20691. llvm-svn: 216341	2014-08-24 03:47:06 +00:00
Alexey Samsonov	70b9c01bd4	Pass expressions instead of argument ranges to EmitCall/EmitCXXConstructorCall. Summary: This is a first small step towards passing generic "Expr" instead of ArgBeg/ArgEnd pair into EmitCallArgs() family of methods. Having "Expr" will allow us to get the corresponding FunctionDecl and its ParmVarDecls, thus allowing us to alter CodeGen depending on the function/parameter attributes. No functionality change. Test Plan: regression test suite Reviewers: rnk Reviewed By: rnk Subscribers: aemerson, cfe-commits Differential Revision: http://reviews.llvm.org/D4915 llvm-svn: 216214	2014-08-21 20:26:47 +00:00
Matt Arsenault	dbb84916d9	R600: Add ldexp intrinsic llvm-svn: 215738	2014-08-15 17:44:32 +00:00
Yi Kong	a5548431a5	AArch64: Prefetch intrinsic llvm-svn: 215569	2014-08-13 19:18:20 +00:00
Yi Kong	26d104a9ec	ARM: Prefetch intrinsics llvm-svn: 215568	2014-08-13 19:18:14 +00:00
Yi Kong	1083eb5c11	AArch64: Resolve some FIXMEs in CGBuiltin left over from backend merge Merge vrshr_n_v and vqshlu_n_v with ARM. Remove FIXME comments for others as they can't actually be shared. NFC. Differential Revision: http://reviews.llvm.org/D4697 llvm-svn: 214173	2014-07-29 09:25:17 +00:00
Tim Northover	40956e64f2	AArch64: update Clang for merged arm64/aarch64 triples. The main subtlety here is that the Darwin tools still need to be given "-arch arm64" rather than "-arch aarch64". Fortunately this already goes via a custom function to handle weird edge-cases in other architectures, and it tested. I removed a few arm64_be tests because that really isn't an interesting thing to worry about. No-one using big-endian is also referring to the target as arm64 (at least as far as toolchains go). Mostly they date from when arm64 was a separate target and we did need a parallel name simply to test it at all. Now aarch64_be is sufficient. llvm-svn: 213744	2014-07-23 12:32:58 +00:00
Alexey Samsonov	24cad99307	[UBSan] Add !nosanitize metadata to the code generated by UBSan. This is used to mark the instructions emitted by Clang to implement variety of UBSan checks. Generally, we don't want to instrument these instructions with another sanitizers (like ASan). Reviewed in http://reviews.llvm.org/D4544 llvm-svn: 213291	2014-07-17 18:46:27 +00:00
Hal Finkel	3e49fda0d4	Add basic (noop) CodeGen support for __assume Clang supports __assume, at least at the semantic level, when MS extensions are enabled. Unfortunately, trying to actually compile code using __assume would result in this error: error: cannot compile this builtin function yet __assume is an optimizer hint, and can be ignored at the IR level. Until LLVM supports assumptions at the IR level, a noop lowering is valid, and that is what is done here. llvm-svn: 213206	2014-07-16 22:44:54 +00:00
Matt Arsenault	8587711164	Add codegen for more R600 builtins llvm-svn: 213079	2014-07-15 17:23:46 +00:00
Yi Kong	4d5e23f53a	ARM: Implement __builtin_arm_nop intrinsic This patch implements __builtin_arm_nop intrinsic for AArch32 and AArch64, which generates hint 0x0, the alias of NOP instruction. This intrinsic is necessary to implement ACLE __nop intrinsic. Differential Revision: http://reviews.llvm.org/D4495 llvm-svn: 212947	2014-07-14 15:20:09 +00:00
Saleem Abdulrasool	572250d60a	CodeGen: support hint intrinsics from ACLE on AArch64 This adds support for the ACLE hint intrinsics on AArch64 similar to ARM. This is required to properly support ACLE on AArch64. llvm-svn: 212890	2014-07-12 23:27:22 +00:00
Reid Kleckner	ed5d4adb36	MS extension: Make __noop be the integer zero, not void We still don't accept '__noop;', and we don't consider __noop to be the integer literal zero. More work is needed. llvm-svn: 212839	2014-07-11 20:22:55 +00:00
Saleem Abdulrasool	e700cab4e9	CodeGen: add support for a few MSVC ARM intrinsics This adds support for simple MSVC compatibility mode intrinsics. These intrinsics are simple in that they are either directly passed through to the annotated MSBuiltin intrinsic or they mirror existing GCC builtins. llvm-svn: 212378	2014-07-05 20:10:05 +00:00
Saleem Abdulrasool	96bfda8dbc	CodeGen: add support for MSBuiltin aliases This completes the infrastructure for the new MSBuiltin aliases in the instruction definitions. These behave similar to the GCCBuiltin in that they can be implicitly constructed without special handling unless needed. With this change it is possible to annotate an LLVM intrinsic in the backend instruction definitions and indicate it as a builtin in the Builtin*.def files in clang via LANGBUILTIN. That will automatically pass through the instruction much as a GCCBuiltin. Note that there is no need for the special handling for ensuring that the compatibility flag is enabled since the filtering on the LANGBUILTIN will automatically prevent the intrinsic from bleeding into non-MS compatible compiler invocations. llvm-svn: 212359	2014-07-04 21:49:39 +00:00
Saleem Abdulrasool	ece7217f70	ARM: rename ARM builtins to use __builtin_arm prefix This corrects SVN r212196's naming change to use the proper prefix of `__builtin_arm_` instead of `__builtin_`. Thanks to Yi Kong for pointing out the incorrect naming! llvm-svn: 212253	2014-07-03 02:43:20 +00:00
Saleem Abdulrasool	4bddd9d400	CodeGen: make target builtins support languages This extends the target builtin support to allow language specific annotations (i.e. LANGBUILTIN). This is to allow MSVC compatibility whilst retaining the ability to have EABI targets use a __builtin_ prefix. This is merely to allow uniformity in the EABI case where the unprefixed name is provided as an alias in the header. llvm-svn: 212196	2014-07-02 17:41:27 +00:00
Tim Northover	3acd6bd0b6	ARM: add support for v8 ldaex/stlex builtins. ARMv8 adds (to both AArch32 and AArch64) acquiring and releasing variants of the exclusive operations, in line with the C++11 memory model. This adds support for two new intrinsics to expose them to C & C++ developers directly: __builtin_arm_ldaex and __builtin_arm_stlex, in direct analogy with the versions with no implicit barrier. rdar://problem/15885451 llvm-svn: 212175	2014-07-02 12:56:02 +00:00
Craig Topper	00bbdcf9b3	Remove llvm:: from uses of ArrayRef. llvm-svn: 211987	2014-06-28 23:22:23 +00:00
Matt Arsenault	56f008d538	Add R600 builtin codegen. llvm-svn: 211631	2014-06-24 20:45:01 +00:00
Tim Northover	6ea28bdef5	ARM: remove dead CodeGen functions. These two are no longer being used by NEON codegen. llvm-svn: 211586	2014-06-24 12:07:44 +00:00
Jim Grosbach	e59c43dc21	Fix spelling. s/overloaed/overloaded/ llvm-svn: 211530	2014-06-23 20:28:43 +00:00
Saleem Abdulrasool	114efe0dc8	CodeGen: improve ms instrincics support Add support for _InterlockedCompareExchangePointer, _InterlockExchangePointer, _InterlockExchange. These are available as a compiler intrinsic on ARM and x86. These are used directly by the Windows SDK headers without use of the intrin header. llvm-svn: 211216	2014-06-18 20:51:10 +00:00
Jim Grosbach	79140826bc	AArch64: Support for __builtin_arm_rbit() and __builtin_arm_rbit64(). __builtin_arm_rbit() and __builtin_arm_rbit64(). rdar://9283021 llvm-svn: 211060	2014-06-16 21:56:02 +00:00
Jim Grosbach	171ec34544	ARM: Support for __builtin_arm_rbit() intrinsic. Reverse the bits in a word. Maps to the RBIT instruction. rdar://9283021 llvm-svn: 211059	2014-06-16 21:55:58 +00:00
Tim Northover	b49b04bbe0	IR-change: cmpxchg operations now return { iN, i1 }. This is a minimal fix for clang. I'll soon add support for generating weak variants when requested, but that's not really necessary for the LLVM change in isolation. llvm-svn: 210907	2014-06-13 14:24:59 +00:00
Richard Smith	760520bcb7	Add __builtin_operator_new and __builtin_operator_delete, which act like calls to the normal non-placement ::operator new and ::operator delete, but allow optimizations like new-expressions and delete-expressions do. llvm-svn: 210137	2014-06-03 23:27:44 +00:00
Michael J. Spencer	5ce26687f2	[CodeGen] Don't use SizeTy for EmitNeonSplat. llvm-svn: 210042	2014-06-02 19:48:59 +00:00
Michael J. Spencer	dd59775f06	[CodeGen] Don't cast and use SizeTy instead of Int32Ty when constructing {extract,insert} vector element instructions. llvm-svn: 209942	2014-05-31 00:22:12 +00:00
Tim Northover	573cbee543	AArch64/ARM64: rename ARM64 components to AArch64 This keeps Clang consistent with backend naming conventions. llvm-svn: 209579	2014-05-24 12:52:07 +00:00
Tim Northover	25e8a6754e	AArch64/ARM64: update Clang after AArch64 removal. A few (mostly CodeGen) parts of Clang were tightly coupled to the AArch64 backend. Now that it's gone, they will not even compile. I've also deduplicated RUN lines in many of the AArch64 tests. This might improve "make check-all" time noticably: some of those NEON tests were monsters. llvm-svn: 209578	2014-05-24 12:51:25 +00:00
Craig Topper	8a13c4180e	[C++11] Use 'nullptr'. CodeGen edition. llvm-svn: 209272	2014-05-21 05:09:00 +00:00
Hao Liu	9f9492b657	[ARM64]Fix the bug right shift uint64_t by 64 generates incorrect result. llvm-svn: 208761	2014-05-14 08:59:30 +00:00
Saleem Abdulrasool	956c2ec532	CodeGen: complete ARM ACLE hint 8.4 support Add support for the remaining hints from the ACLE. Although __dbg is listed as a hint, it is handled different, so it is not covered by this change. llvm-svn: 207930	2014-05-04 02:52:25 +00:00
Saleem Abdulrasool	38ed6de3a0	CodeGen: rename __builtin_arm_sevl to __sevl ACLE adds the __sevl() extension. Rename the hint from a custom name to the ACLE specified name. llvm-svn: 207829	2014-05-02 06:53:57 +00:00
James Molloy	fa40368d9d	[ARM64] Add arm64_be where it was accidentally missed from a bunch of if-conditions. I think this is the last commit for ARM64 big endian in clang. This commit makes arm_neon.h compile correctly. llvm-svn: 207624	2014-04-30 10:11:40 +00:00
Hao Liu	a19a2e2da6	[ARM64]Fix a bug cannot select UQSHL/SQSHL with constant i64 shift amount. llvm-svn: 207401	2014-04-28 07:36:12 +00:00
Saleem Abdulrasool	2b99f2f7a4	CodeGen: remove an unused variable llvm-svn: 207390	2014-04-28 02:29:11 +00:00
Sylvestre Ledru	464902589e	remove useless code llvm-svn: 207360	2014-04-27 14:57:31 +00:00
Saleem Abdulrasool	b9f07e3dbc	CodeGen: add __yield intrinsic for ARM The __yield intrinsic generates a hint instruction to indicate that the thread is not performing any useful operations at the moment. This is for compatibility with MSVC, although, the intrinsic is also part of the ACLE, and is enabled globally as a result. llvm-svn: 207275	2014-04-25 21:13:29 +00:00
Saleem Abdulrasool	0fd930e86c	CodeGen: replace use of @llvm.arm.sevl with @llvm.arm.hint Use the new generic @llvm.arm.hint hint intrinsic rather than the specialised @llvm.arm.sevl hint instruction. llvm-svn: 207243	2014-04-25 17:25:46 +00:00
Tim Northover	b17f9a4609	ARM64: add a few bits of polynomial intrinsic codegen. llvm-svn: 205303	2014-04-01 12:23:08 +00:00
Tim Northover	74b2def0c5	ARM64: add missing ldN/stN intrinsics and enable tests. llvm-svn: 205296	2014-04-01 10:37:47 +00:00
Tim Northover	0c68faa455	ARM64: enable aarch64-neon-intrinsics.c test This adds support for the various NEON intrinsics used by aarch64-neon-intrinsics.c (originally written for AArch64) and enables the test. My implementations are designed to be semantically correct, the actual code quality looks like its a wash between the two backends, and is frequently different (hence the large number of CHECK changes). llvm-svn: 205210	2014-03-31 15:47:09 +00:00
Dmitri Gribenko	f461da5306	Remove unused variable llvm-svn: 205169	2014-03-31 07:52:35 +00:00
Tim Northover	6166ec21be	ARM64: remove currently trivial switch statement llvm-svn: 205167	2014-03-31 07:20:13 +00:00
Tim Northover	97606edd75	ARM64: Fix GCC warning in CGBuiltin.cpp llvm-svn: 205104	2014-03-29 15:26:07 +00:00
Tim Northover	a2ee433c8d	ARM64: initial clang support commit. This adds Clang support for the ARM64 backend. There are definitely still some rough edges, so please bring up any issues you see with this patch. As with the LLVM commit though, we think it'll be more useful for merging with AArch64 from within the tree. llvm-svn: 205100	2014-03-29 15:09:45 +00:00
Christian Pirker	f01cd6f57b	Add ARM big endian Target (armeb, thumbeb) Reviewed at http://llvm-reviews.chandlerc.com/D3096 llvm-svn: 205008	2014-03-28 14:40:46 +00:00
Reid Kleckner	597e81dea1	-fms-extensions: Add __va_start builtin, which is used for x64 The main difference between __va_start and __builtin_va_start is that the address of the va_list has already been taken, and the va_list is always a char*. __va_end and __va_arg are not needed. llvm-svn: 204821	2014-03-26 15:38:33 +00:00
Renato Golin	c491a8d457	Add support for __builtin___clear_cache in Clang Adding the mapping between __builtin___clear_cache into @llvm.clear_cache llvm-svn: 204820	2014-03-26 15:36:05 +00:00
Timur Iskhodzhanov	f7af2e6de8	Fix a compile-time warning lib/CodeGen/CGBuiltin.cpp:3136:12: warning: variable ‘TblPos’ set but not used [-Wunused-but-set-variable] llvm-svn: 204599	2014-03-24 11:09:01 +00:00
Arnaud A. de Grandmaison	6756a497a1	Cleanup dead assignments reported by scan-build llvm-svn: 204569	2014-03-23 20:28:07 +00:00
Tim Northover	0622b3a67a	Update for IR: add a second AtomicOrdering to cmpxchg insts. rdar://problem/15996804 llvm-svn: 203560	2014-03-11 10:49:03 +00:00
Ted Kremenek	90097491ed	Remove 'break' dominated by 'return' in 'EmitBuiltinExpr'. llvm-svn: 203080	2014-03-06 05:37:38 +00:00
Tim Northover	b44e080dbb	AArch64: use less cluttered intrinsic for vtbl/vtbx The table is always 128-bit so there's no reason to specify it every time we want the intrinsic. llvm-svn: 202259	2014-02-26 11:55:15 +00:00
Tim Northover	2df47cedeb	AArch64: use different type modifier in arm_neon.td The 'f' modifier is designed for integer type arguments really (according to its documentation). It's better to use the "half width, same number" modifier. Should be no user-visible change. llvm-svn: 202152	2014-02-25 13:53:01 +00:00
Christian Pirker	9b019ae899	Add AArch64 big endian Target (aarch64_be) llvm-svn: 202151	2014-02-25 13:51:00 +00:00
Warren Hunt	20e4a5d2af	Reapply 201734 but with appropriate gcc compatibility Because GCC incorrectly defines _mm_prefetch to take anything that casts to void, people have started using that behavior. The previous patch that made _mm_prefetch actually take a const char broke compatibility with existing code. This update to the patch leaves the macro that defines _mm_prefetch with the (void*) cast when _MSC_VER is not defined. llvm-svn: 201901	2014-02-21 23:08:53 +00:00
Tim Northover	a0c95eb2d6	Remove commas at the end of lists (C++11 again) llvm-svn: 201849	2014-02-21 12:16:59 +00:00
Tim Northover	8fe03d6111	ARM & AArch64: use table for EmitCommonNeonBuiltinExpr This extends the intrinsic lookup table format slightly, and adds entries for use the shared ARM/AArch64 definitions. The benefit is currently smaller than for the SISD intrinsics (there's more custom code implementing this set), but a few lines are saved and there's scope for future expansion. llvm-svn: 201848	2014-02-21 11:57:24 +00:00
Tim Northover	2d83796860	AArch64: refactor table-driven NEON lookup. This extracts the table-driven intrinsic lookup phase into a separate function, to be used by EmitCommonNeonBuiltinExpr soon. It also simplifies the logic used in that lookup, since VectorCastArgN and ScalarArgN were actually identical. llvm-svn: 201847	2014-02-21 11:57:20 +00:00
Daniel Jasper	2f0f297bdb	Revert r201734 and r201742. This breaks backwards compatibility with existing code. Previously, this was defined as #define _mm_prefetch(a, sel) (__builtin_prefetch((void )(a), 0, (sel))) Which basically accepts any pointer. Changing this to char simply breaks a lot of existing code. I have tried changing char* to "const void*", which seems to be the right thing as per Intel specification this should work on basically any pointer. However, apparently this breaks windows compatibility (because of a conflicting declaration in windows.h). So, we probably need to #ifdef this based on whether clang is compiling for windows. According to Chandler, this might be done by introducing an additional symbol to a fake type in BuiltinsX86.def and then condition the type expansion on the platform. llvm-svn: 201775	2014-02-20 11:10:48 +00:00
Warren Hunt	40d6f29ad8	Add _mm_prefetch and some others as MS builtins This patch adds several built-ins that are required for ms compatibility. _mm_prefetch must be a built-in because it takes a compile-time constant argument and our prior approach of using a #define to the current built-in doesn't work in the presence of re-declaration of _mm_prefetch. The others can be obtained by including the windows system headers. If a user includes the windows system headers but not intrin.h they still need to work and therefore must be built-in because we don't get a chance to implement them in intrin.h in this case. llvm-svn: 201734	2014-02-19 23:20:20 +00:00
Tim Northover	db3e5e2408	AArch64: look up EmitAArch64Scalar support before calling. This fixes one immediate bug where an expression with side-effects could be emitted twice during a NEON call. It also prepares the way for folding CodeGen for many of the SISD intrinsics into a table, reducing code size and hopefully increasing performance eventually ("binary search + few switch cases" should be better than "lots of switch cases"). llvm-svn: 201667	2014-02-19 11:55:06 +00:00
Tim Northover	0f6c9d0a9b	ARM NEON: add vcvtX (with rounding mode) intrinsics to v8 ARM. These instructions (well, the f32 ones) are supported on 32-bit ARMv8, not just AArch64. Now that the arm_neon.td refactoring is complete, adding them is surprisingly simple. rdar://problem/16035743 llvm-svn: 201661	2014-02-19 10:37:13 +00:00
Tim Northover	1994fa7d3d	ARM & AArch64 NEON: share the vabs implementation. This changes ARM to use @llvm.fabs for floating-point vabs. Patterns already existed in the backend, and it might help mid-end phases since it's more likely to be understood than @llvm.arm.neon.vabs. llvm-svn: 201313	2014-02-13 10:44:17 +00:00
Tim Northover	02b438754c	AArch64: share slgihtly more NEON implementation with ARM. The s64/u64 vcvt conversion operations are actually pretty much identical to the s32/u32 ones in implementation, and can be shared with just one extra variable. llvm-svn: 201145	2014-02-11 11:27:44 +00:00
Tim Northover	d23fc6cceb	ARM: move vshll NEON implementation to common code Now that both ARM backends use the same implementation for vshll operations, the code can be shared. This is also a necessary LLVM/Clang interface update. llvm-svn: 201094	2014-02-10 16:20:36 +00:00
Tim Northover	a2e0a27d26	ARM: implement vshrn NEON intrinsic in terms of shr/trunc Now the backend supports the natural LLVM IR, we can shamelessly steal the AArch64 front-end code to implement the vshrn intrinsic on 32-bit ARM. llvm-svn: 201086	2014-02-10 14:04:12 +00:00
Tim Northover	7ffb2c5523	ARM & AArch64: combine implementation of vcaXYZ intrinsics Now that the back-end intrinsics are more regular, there's no need for the special handling these got in the front-end, so they can be moved to EmitCommonNeonBuiltinExpr. llvm-svn: 200769	2014-02-04 14:55:52 +00:00
Tim Northover	02e38609e7	ARM: implement support for crypto intrinsics in arm_neon.h llvm-svn: 200708	2014-02-03 17:28:04 +00:00
Tim Northover	51ab388266	AArch64: use new non-polymorphic crypto intrinsics The LLVM backend now has invariant types on the various crypto-intrinsics, because in all cases there's only really one interpretation. llvm-svn: 200707	2014-02-03 17:28:00 +00:00
Tim Northover	5309111c22	ARM & AArch64: unify the rest of the completely shared NEON implementations This should be the last routine patch: AArch64 does still delegate to EmitARMBuiltinExpr, but the remaining instances have complications of one sort or another so some more cunning thought will be needed. llvm-svn: 200528	2014-01-31 10:46:52 +00:00
Tim Northover	ba1e344d90	ARM & AArch64: another block of miscellaneous NEON sharing. llvm-svn: 200527	2014-01-31 10:46:49 +00:00
Tim Northover	027b4ee607	ARM & AArch64: move shared vld/vst intrinsics to common implementation. llvm-svn: 200526	2014-01-31 10:46:45 +00:00
Tim Northover	9d3ab5fe9f	ARM & AArch64: more instructions into common block llvm-svn: 200525	2014-01-31 10:46:41 +00:00
Tim Northover	61fc835d6e	ARM & AArch64: merge another NEON block completely. llvm-svn: 200524	2014-01-31 10:46:36 +00:00
Tim Northover	58c4474dea	ARM & AArch64: extend shared NEON implementation to first block. This extends the refactoring to the whole of the first block of trivial correspondences (as a fairly arbitrary boundary). llvm-svn: 200472	2014-01-30 14:48:01 +00:00
Tim Northover	ac85c341ae	ARM & AArch64: fully share NEON implementation of permutation intrinsics As a starting point, this moves the CodeGen for NEON permutation instructions (vtrn, vzip, vuzp) into a new shared function. llvm-svn: 200471	2014-01-30 14:47:57 +00:00
Tim Northover	c322f838bc	ARM & AArch64: share the BI__builtin_neon enum defs. llvm-svn: 200470	2014-01-30 14:47:51 +00:00
Kevin Qin	ce1f0e85ba	[AArch64 NEON] Fix a bug about vcles_f32 and vcled_f64. As vcles_f32() and vcled_f64 are implemented by FCMGE, operands should make a swap. llvm-svn: 199866	2014-01-23 03:42:06 +00:00
Hao Liu	f96fd37888	[AArch64]The compare to zero intrinsics should be implemented by 'icmp/fcmp' and 'sext' not 'zext'. Modify the implementation by replacing zext with sext. llvm-svn: 197898	2013-12-23 02:44:00 +00:00
Chad Rosier	6030c84a2f	[AArch64] Refactor NEON floating-point Max/Min/Maxnm/Minnm across vector AArch64 intrinsics to use f32 types, rather than their vector equivalents. llvm-svn: 197091	2013-12-11 23:21:39 +00:00
Chad Rosier	c520fce72d	[AArch64] Add NEON scalar floating-point compare LLVM AArch64 intrinsics that use f32/f64 types, rather than their vector equivalents. llvm-svn: 197071	2013-12-11 21:03:56 +00:00
Chad Rosier	edd4403510	[AArch64] Refactor the NEON scalar floating-point reciprocal step and floating-point reciprocal square root step LLVM AArch64 intrinsics to use f32/f64 types, rather than their vector equivalents. llvm-svn: 197070	2013-12-11 21:03:54 +00:00

1 2 3 4 5 ...

595 Commits