llvm-project

Commit Graph

Author	SHA1	Message	Date
Bob Wilson	363bd1a815	PGO: rename FileCheck variable to follow the existing convention. I added this "STF" variable without noticing that all the other counter names end with a "C". Renaming it to "STC" for consistency. llvm-svn: 203165	2014-03-06 21:35:59 +00:00
Reid Kleckner	8d4a16ec3a	Add tests for MS inline asm change r203146 llvm-svn: 203147	2014-03-06 19:19:36 +00:00
Bob Wilson	9c86656d62	Run -fprofile-instr tests with %clang_cc1. This should help avoid problems like the buildbot fallout from my change in r203085. I left the CodeGenCXX tests alone for now. llvm-svn: 203131	2014-03-06 17:18:34 +00:00
Bob Wilson	da1ebedeea	PGO: Use the main file name to help distinguish functions with local linkage. In addition, for all functions, use the name from the llvm::Function to identify the function in the profile data. Compute that "function name", including the file name for local functions, once when assigning the PGO counters and store it in the CodeGenPGO class. Move the code to add InlineHint and Cold attributes out of StartFunction(), because the "function name" string isn't available at that point. llvm-svn: 203075	2014-03-06 04:55:41 +00:00
Raul E. Silvera	57a9850961	Update clang test to cover for new treatment of intrinsics as readnone. llvm-svn: 203056	2014-03-06 01:37:10 +00:00
Tim Northover	926a235fea	AArch64: convert NEON tests to use CHECK-LABEL. llvm-svn: 202703	2014-03-03 11:34:36 +00:00
Hal Finkel	f7a07a5010	Add a PPC inline asm constraint type for single CR bits This adds support for the PPC "wc" inline asm constraint (used for allocating individual CR bits). Support for this constraint type was recently added to the LLVM PowerPC backend. Although gcc does not currently support allocating individual CR bits, this identifier choice has been coordinated with the gcc PowerPC team, and will be marked as reserved for this purpose in the gcc constraints.md file. Prior to this change, none of the multi-character PPC constraints were handled correctly (the '^' escape character was not being added as required by the parsing code in LLVM). This should now be fixed. I'll add tests for these other constraints as support is added for them in the backend. llvm-svn: 202658	2014-03-02 18:24:18 +00:00
Warren Hunt	fed55979b1	Fixed an assertion failure related to bitfield lowering. When lowering a bitfield, CGRecordLowering would assign the wrong storage type to a bitfield in some cases and trigger an assertion. In these cases the layout was still correct, just the bitfield info was wrong. llvm-svn: 202562	2014-03-01 00:38:40 +00:00
Bob Wilson	1e3f3bf950	Add a testcase for r202437. llvm-svn: 202468	2014-02-28 05:57:14 +00:00
Tim Northover	efe7a5e1c8	ARM NEON: fix tests after r202137 llvm-svn: 202143	2014-02-25 11:48:25 +00:00
Tim Northover	3d4575cc1b	AArch64 NEON: add 64-bit scalar intrinsics for _f64 mla/mls etc. These seem to be supported by GCC, and do make sense architecturally so we should probably have them. llvm-svn: 202138	2014-02-25 11:13:49 +00:00
Tim Northover	87da936164	ARM NEON: add _f16 support to a couple of vector-shuffling intrinsics. llvm-svn: 202137	2014-02-25 11:13:42 +00:00
Roman Divacky	bd01646489	Add a test for r202059. llvm-svn: 202064	2014-02-24 19:24:15 +00:00
Aaron Ballman	7c19ab17c7	Exposing the noduplicate attribute within Clang, which marks functions so that the optimizer does not duplicate code. Patch thanks to Marcello Maggioni! llvm-svn: 201941	2014-02-22 16:59:24 +00:00
Warren Hunt	fb00c88703	Complete Rewrite of CGRecordLayoutBuilder CGRecordLayoutBuilder was aging, complex, multi-pass, and shows signs of existing before ASTRecordLayoutBuilder. It redundantly performed many layout operations that are now performed by ASTRecordLayoutBuilder and asserted that the results were the same. With the addition of support for the MS-ABI, such as placement of vbptrs, vtordisps, different bitfield layout and a variety of other features, CGRecordLayoutBuilder was growing unwieldy in its redundancy. This patch re-architects CGRecordLayoutBuilder to not perform any redundant layout but rather, as directly as possible, lower an ASTRecordLayout to an llvm::type. The new architecture is significantly smaller and simpler than the CGRecordLayoutBuilder and contains fewer ABI-specific code paths. It's also one pass. The architecture of the new system is described in the comments. For the most part, the new system simply takes all of the fields and bases from an ASTRecordLayout, sorts them, inserts padding and dumps a record. Bitfields, unions and primary virtual bases make this process a bit more complicated. See the inline comments. In addition, this patch updates a few lit tests due to the fact that the new system computes more accurate llvm types than CGRecordLayoutBuilder. Each change is commented individually in the review. Differential Revision: http://llvm-reviews.chandlerc.com/D2795 llvm-svn: 201907	2014-02-21 23:49:50 +00:00
Rafael Espindola	33ebd2171e	Accept -no-integrated-as in -cc1 and forward it to llvm. llvm-svn: 201837	2014-02-21 03:14:07 +00:00
Rafael Espindola	f9e1e5e9a3	Remove really old "APPLE LOCAL" markers. llvm-svn: 201791	2014-02-20 14:09:04 +00:00
Daniel Jasper	2f0f297bdb	Revert r201734 and r201742. This breaks backwards compatibility with existing code. Previously, this was defined as #define _mm_prefetch(a, sel) (__builtin_prefetch((void )(a), 0, (sel))) Which basically accepts any pointer. Changing this to char simply breaks a lot of existing code. I have tried changing char* to "const void*", which seems to be the right thing as per Intel specification this should work on basically any pointer. However, apparently this breaks windows compatibility (because of a conflicting declaration in windows.h). So, we probably need to #ifdef this based on whether clang is compiling for windows. According to Chandler, this might be done by introducing an additional symbol to a fake type in BuiltinsX86.def and then condition the type expansion on the platform. llvm-svn: 201775	2014-02-20 11:10:48 +00:00
Warren Hunt	7281928be6	Updated to r201734. Removed unused declaration from lit test. Also updating lit test to be more roboust (changing fixed offsets to flexible offsets) llvm-svn: 201742	2014-02-19 23:57:54 +00:00
Warren Hunt	40d6f29ad8	Add _mm_prefetch and some others as MS builtins This patch adds several built-ins that are required for ms compatibility. _mm_prefetch must be a built-in because it takes a compile-time constant argument and our prior approach of using a #define to the current built-in doesn't work in the presence of re-declaration of _mm_prefetch. The others can be obtained by including the windows system headers. If a user includes the windows system headers but not intrin.h they still need to work and therefore must be built-in because we don't get a chance to implement them in intrin.h in this case. llvm-svn: 201734	2014-02-19 23:20:20 +00:00
Tim Northover	db3e5e2408	AArch64: look up EmitAArch64Scalar support before calling. This fixes one immediate bug where an expression with side-effects could be emitted twice during a NEON call. It also prepares the way for folding CodeGen for many of the SISD intrinsics into a table, reducing code size and hopefully increasing performance eventually ("binary search + few switch cases" should be better than "lots of switch cases"). llvm-svn: 201667	2014-02-19 11:55:06 +00:00
Tim Northover	0f6c9d0a9b	ARM NEON: add vcvtX (with rounding mode) intrinsics to v8 ARM. These instructions (well, the f32 ones) are supported on 32-bit ARMv8, not just AArch64. Now that the arm_neon.td refactoring is complete, adding them is surprisingly simple. rdar://problem/16035743 llvm-svn: 201661	2014-02-19 10:37:13 +00:00
Bob Wilson	bf854f0f53	Change PGO instrumentation to compute counts in a separate AST traversal. Previously, we made one traversal of the AST prior to codegen to assign counters to the ASTs and then propagated the count values during codegen. This patch now adds a separate AST traversal prior to codegen for the -fprofile-instr-use option to propagate the count values. The counts are then saved in a map from which they can be retrieved during codegen. This new approach has several advantages: 1. It gets rid of a lot of extra PGO-related code that had previously been added to codegen. 2. It fixes a serious bug. My original implementation (which was mailed to the list but never committed) used 3 counters for every loop. Justin improved it to move 2 of those counters into the less-frequently executed breaks and continues, but that turned out to produce wrong count values in some cases. The solution requires visiting a loop body before the condition so that the count for the condition properly includes the break and continue counts. Changing codegen to visit a loop body first would be a fairly invasive change, but with a separate AST traversal, it is easy to control the order of traversal. I've added a testcase (provided by Justin) to make sure this works correctly. 3. It improves the instrumentation overhead, reducing the number of counters for a loop from 3 to 1. We no longer need dedicated counters for breaks and continues, since we can just use the propagated count values when visiting breaks and continues. To make this work, I needed to make a change to the way we count case statements, going back to my original approach of not including the fall-through in the counter values. This was necessary because there isn't always an AST node that can be used to record the fall-through count. Now case statements are handled the same as default statements, with the fall-through paths branching over the counter increments. While I was at it, I also went back to using this approach for do-loops -- omitting the fall-through count into the loop body simplifies some of the calculations and make them behave the same as other loops. Whenever we start using this instrumentation for coverage, we'll need to add the fall-through counts into the counter values. llvm-svn: 201528	2014-02-17 19:21:09 +00:00
Adrian Prantl	549c514799	Revert "Debug info: Make DWARF4 the default for Darwin, too." I'm holding this change to give maintainers of Darwin buildbots more time to update their toolchains. This reverts commit r201375. llvm-svn: 201520	2014-02-17 17:40:52 +00:00
Nico Rieck	e6a1582595	Fix broken CHECK lines llvm-svn: 201477	2014-02-16 07:29:41 +00:00
Manman Ren	f1a6a2d930	PGO: fix a bug in parsing pgo data. When a function has a single counter, we will offset the pointer by 1 when parsing the next function. If a function has multiple counters, we are okay after skipping rest of the counters. llvm-svn: 201456	2014-02-15 01:29:02 +00:00
Adrian Prantl	27edf47bc0	Debug info: Make DWARF4 the default for Darwin, too. llvm-svn: 201375	2014-02-14 00:29:33 +00:00
Daniel Sanders	753e17629d	Re-commit: Demote EmitRawText call in AsmPrinter::EmitInlineAsm() and remove hasRawTextSupport() call Summary: AsmPrinter::EmitInlineAsm() will no longer use the EmitRawText() call for targets with mature MC support. Such targets will always parse the inline assembly (even when emitting assembly). Targets without mature MC support continue to use EmitRawText() for assembly output. The hasRawTextSupport() check in AsmPrinter::EmitInlineAsm() has been replaced with MCAsmInfo::UseIntegratedAs which when true, causes the integrated assembler to parse inline assembly (even when emitting assembly output). UseIntegratedAs is set to true for targets that consider any failure to parse valid assembly to be a bug. Target specific subclasses generally enable the integrated assembler in their constructor. The default value can be overridden with -no-integrated-as. All tests that rely on inline assembly supporting invalid assembly (for example, those that use mnemonics such as 'foo' or 'hello world') have been updated to disable the integrated assembler. Changes since review (and last commit attempt): - Fixed test failures that were missed due to configuration of local build. (fixes crash.ll and a couple others). - Fixed tests that happened to pass because the local build was on X86 (should fix 2007-12-17-InvokeAsm.ll) - mature-mc-support.ll's should no longer require all targets to be compiled. (should fix ARM and PPC buildbots) - Object output (-filetype=obj and similar) now forces the integrated assembler to be enabled regardless of default setting or -no-integrated-as. (should fix SystemZ buildbots) Reviewers: rafael Reviewed By: rafael CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D2686 llvm-svn: 201333	2014-02-13 14:44:26 +00:00
John McCall	76e1818a2b	ms_struct layout replaces platform-specific behavior like useBitFieldTypeAlignment() and appears to ignore the special bit-packing semantics of __attribute__((packed)). Further flesh out an already-extensive comment. llvm-svn: 201282	2014-02-13 00:50:08 +00:00
John McCall	5d4d61f64f	Change testcase to use FileCheck. llvm-svn: 201281	2014-02-13 00:50:02 +00:00
Daniel Sanders	abe212a3b8	Revert r201237+r201238: Demote EmitRawText call in AsmPrinter::EmitInlineAsm() and remove hasRawTextSupport() call It introduced multiple test failures in the buildbots. llvm-svn: 201241	2014-02-12 15:39:20 +00:00
Daniel Sanders	2f235aebdb	Arcanist failed to commit the two clang test corrections that should have been in r201237. llvm-svn: 201238	2014-02-12 14:46:15 +00:00
David Blaikie	68ccb3b6de	Remove bad debug info test. This test case doesn't belong in Clang (it's testing IndVarSimplify) but in an effort to reproduce the test case this was intended to cover (by essentially reverting r134441) I wasn't able to reproduce the failure this test case should've produced. So I haven't ported this down to LLVM, instead I'm just deleting it. I suspect the test is just underconstrained, but I've no great interest in trying hard to fix it right now - if anyone else wants to, I'd be more than welcome to that. llvm-svn: 201178	2014-02-11 21:16:44 +00:00
Robert Lytton	15abd1881f	XCore target: add section information. Xcore target ABI requires const data that is externally visible to be handled differently if it has C-language linkage rather than C++ language linkage. llvm-svn: 201142	2014-02-11 10:34:51 +00:00
Oliver Stannard	405bdeddd1	AAPCS: Do not split structs after CPRC allocated on stack According to the AAPCS, we can split structs between GPRs and the stack, except for when an argument has already been allocated on the stack. This can occur when a large number of floating-point arguments fill up the VFP registers, and are alllocated on the stack before the general-purpose argument registers are full. llvm-svn: 201137	2014-02-11 09:25:50 +00:00
Josh Magee	e0fc1a80cb	[stackprotector] Add command line option -fstack-protector-strong This option has the following effects: * It adds the sspstrong IR attribute to each function within the CU. * It defines the macro __SSP_STRONG__ with the value of 2. Differential Revision: http://llvm-reviews.chandlerc.com/D2717 llvm-svn: 201120	2014-02-11 01:35:14 +00:00
Ana Pazos	9883d6d2b5	[AArch64] Fixed vget/vset_lane_f16 implementation Replaced cast and vreinterepret operations with code to reinterpret bitwise the types float16_t and int16_t. llvm-svn: 201112	2014-02-10 21:20:53 +00:00
Oliver Stannard	5e8558fce0	Fix AAPCS compliance for HFAs containing doubles and long doubles An HFA is defined as a struct containing floating point values of the same machine type. In the 32-bit ABI, double and long double have the same machine type, so a struct with a mixture of these types must be an HFA (assuming it meets the other criteria). llvm-svn: 200971	2014-02-07 11:25:57 +00:00
Manman Ren	215893317b	Try to fix ppc bot failure. llvm-svn: 200880	2014-02-05 21:40:10 +00:00
Manman Ren	67a28136ad	PGO: instrumentation based profiling sets function attributes. We collect a maximal function count among all functions in the pgo data file. For functions that are hot, we set its InlineHint attribute. For functions that are cold, we set its Cold attribute. We currently treat functions with >= 30% of the maximal function count as hot and functions with <= 1% of the maximal function count are treated as cold. These two numbers are from preliminary tuning on SPEC. This commit should not affect non-PGO builds and should boost performance on instrumentation based PGO. llvm-svn: 200874	2014-02-05 20:40:15 +00:00
Tim Northover	02e38609e7	ARM: implement support for crypto intrinsics in arm_neon.h llvm-svn: 200708	2014-02-03 17:28:04 +00:00
Timur Iskhodzhanov	ad47776d90	Use an Itanium triple in DWARF debug info tests This should fix the clang part of the breakage in r200340. llvm-svn: 200435	2014-01-30 01:01:36 +00:00
Artyom Skrobov	e72a6f7a70	Cortex-M3 and Cortex-M4 should not enable hwdiv-arm (committing again, with an updated test) llvm-svn: 200385	2014-01-29 09:43:07 +00:00
John McCall	30268ca2e0	Extensively comment bitfield layout, rearrange some code for legibility, and fix a bug with bitfields in packed ms_structs. rdar://15926990 llvm-svn: 200379	2014-01-29 07:53:44 +00:00
Amara Emerson	9dc7878ac5	[ARM] Fix AAPCS-VFP non-compliance when returning HFA from variadic functions. Arguments and return values must always be marshalled as for the base AAPCS when the callee is a variadic function. Patch by Oliver Stannard! llvm-svn: 200307	2014-01-28 10:56:36 +00:00
Reid Kleckner	020acd88ec	Test case for clobbers on cpuid in ms inline asm Tests r200279 in LLVM. llvm-svn: 200280	2014-01-28 02:09:28 +00:00
Robert Lytton	1a2292614c	XCore target exception handling Implement __builtin_eh_return_data_regno() llvm-svn: 200231	2014-01-27 17:56:25 +00:00
Jiangning Liu	bb59b3daa9	For AArch64 Neon, fix intrinsics implementation using nested macros. llvm-svn: 200114	2014-01-26 03:38:42 +00:00
Justin Bogner	d8740b6e72	test/CodeGen: Finish fixing the typo in r199862 llvm-svn: 199910	2014-01-23 17:34:24 +00:00
Serge Pavlov	09f9924acf	Fix to PR8880 (clang dies processing a for loop) Due to statement expressions supported as GCC extension, it is possible to put 'break' or 'continue' into a loop/switch statement but outside its body, for example: for ( ; ({ if (first) { first = 0; continue; } 0; }); ) This code is rejected by GCC if compiled in C mode but is accepted in C++ code. GCC bug 44715 tracks this discrepancy. Clang used code generation that differs from GCC in both modes: only statement of the third expression of 'for' behaves as if it was inside loop body. This change makes code generation more close to GCC, considering 'break' or 'continue' statement in condition and increment expressions of a loop as it was inside the loop body. It also adds error for the cases when 'break'/'continue' appear outside loop due to this syntax. If code generation differ from GCC, warning is issued. Differential Revision: http://llvm-reviews.chandlerc.com/D2518 llvm-svn: 199897	2014-01-23 15:05:00 +00:00
Kevin Qin	ce1f0e85ba	[AArch64 NEON] Fix a bug about vcles_f32 and vcled_f64. As vcles_f32() and vcled_f64 are implemented by FCMGE, operands should make a swap. llvm-svn: 199866	2014-01-23 03:42:06 +00:00
Justin Bogner	be614c735c	CodeGen: Fix tracking of PGO counters for the logical or operator This adds tests for both logical or and for logical and, which was already correct. llvm-svn: 199865	2014-01-23 02:54:30 +00:00
Justin Bogner	fdac0cad1f	test/CodeGen: Fix a typo llvm-svn: 199862	2014-01-23 02:54:20 +00:00
Mark Seaborn	74020868ee	Handle va_arg on struct types for the le32 target (PNaCl and Emscripten) PNaCl and Emscripten can both handle va_arg IR instructions with struct type. Also add a test to cover generating a va_arg IR instruction from va_arg in C on le32 (as already handled by VisitVAArgExpr() in CGExprScalar.cpp), which was not covered by a test before. (This fixes https://code.google.com/p/nativeclient/issues/detail?id=2381) Differential Revision: http://llvm-reviews.chandlerc.com/D2539 llvm-svn: 199830	2014-01-22 20:11:01 +00:00
Adrian Prantl	3eff225a44	Debug info: use the file a typedef is defined in as its decl_file instead of the current compilation unit. As a side effect this enables many more LTO uniquing opportunities. This reapplies r199757 with a better testcase. llvm-svn: 199760	2014-01-21 18:42:27 +00:00
Adrian Prantl	cb6e1257ff	revert 199757 for buildbot breakage. llvm-svn: 199758	2014-01-21 18:23:43 +00:00
Adrian Prantl	83788519a5	Debug info: use the file a typedef is defined in as its decl_file instead of the current compilation unit. As a side effect this enables many more LTO uniquing opportunities. rdar://problem/15851206 llvm-svn: 199757	2014-01-21 18:20:52 +00:00
Rafael Espindola	e1bd71fea4	Use private linkage for utf-16 objc strings too. llvm-svn: 199709	2014-01-21 02:57:56 +00:00
Rafael Espindola	6839d23be7	Now that r199688 avoids the real issue, use private linkage for objc strings. llvm-svn: 199705	2014-01-21 01:50:12 +00:00
Rafael Espindola	d19f80a0b4	Give explicit sections for string constants used in NSStrings. Without them they can be merged with non unnamed_addr constants during LTO. The resulting constant is not unnamed_addr and goes in a different section, which causes ld64 to crash. A testcase that would crash before: * file1.mm: void g(id notification) { [notification valueForKey:@"name"]; } * file2.cpp: extern const char js_name_str[] = "name"; * file3.cpp extern bool JS_GetProperty(const char *name); extern const char js_name_str[]; bool js_ReportUncaughtException() { JS_GetProperty(js_name_str); } run clang file1.mm -o file1.o -c -w -emit-llvm clang file2.cpp -o file2.o -c -w -emit-llvm clang file3.cpp -o file3.o -c -w ld -dylib -o XUL file1.o file2.o file3.o -undefined dynamic_lookup. llvm-svn: 199688	2014-01-20 20:33:18 +00:00
Jakob Stoklund Olesen	497332c05f	SPARCv9 implements long double as an IEEE quad. llvm-svn: 199399	2014-01-16 16:43:19 +00:00
Jan Wen Voung	1f9c4ee464	Ensure i686-nacl long long is aligned 8 bytes (like malign-double) Set NaCl OSTargetInfo to have LongLongAlign = 64. Otherwise, it will pick up the setting of 32 from X86_32TargetInfo. llvm-svn: 199335	2014-01-15 21:42:41 +00:00
Roman Divacky	dd9bfb2c1a	Make -fno-inline attach NoInline attribute to all functions that are not marked as AlwaysInline or ForceInline. This moves us to what gcc does with -fno-inline. The attribute approach was discussed to be better than switching to InlineAlways inliner in presence of LTO. llvm-svn: 199324	2014-01-15 19:07:16 +00:00
Chandler Carruth	b653131345	Move a bunch of tests to directly use the CC1 layer. This at least saves a subprocess invocation which is pretty significant on Windows. It also likely saves a bunch of thrashing the host machine needlessly. Finally it makes the tests much more predictable and less dependent on the host. For example 'header_lookup1.c' was passing '-fno-ms-extensions' just to thwart the host detection adding it into the compilation. By runnig CC1 directly we don't have to deal with such oddities. llvm-svn: 199308	2014-01-15 09:08:07 +00:00
Hans Wennborg	c9bd88e681	Remove the -cxx-abi command-line flag. This makes the C++ ABI depend entirely on the target: MS ABI for -win32 triples, Itanium otherwise. It's no longer possible to do weird combinations. To be able to run a test with a specific ABI without constraining it to a specific triple, new substitutions are added to lit: %itanium_abi_triple and %ms_abi_triple can be used to get the current target triple adjusted to the desired ABI. For example, if the test suite is running with the i686-pc-win32 target, %itanium_abi_triple will expand to i686-pc-mingw32. Differential Revision: http://llvm-reviews.chandlerc.com/D2545 llvm-svn: 199250	2014-01-14 19:35:09 +00:00
Tim Northover	8799445065	Darwin: add __sinpi (etc) and __exp10 libbuiltins These functions have the same constness properties of the normal libm functions, which allows LLVM to optimise code better in general. There are also a couple of specific optimisations that only trigger when these are properly marked. rdar://problem/13729466 llvm-svn: 199249	2014-01-14 19:26:03 +00:00
Nico Rieck	bb0554f959	Update CodeGen to use DLL storage class for dllimport/dllexport With the old linkage types removed, set the linkage to external for both dllimport and dllexport to reflect what's currently supported. llvm-svn: 199220	2014-01-14 15:23:53 +00:00
Jakob Stoklund Olesen	899b4f3624	This test is passing on SPARC. llvm-svn: 199189	2014-01-14 06:19:29 +00:00
Jakob Stoklund Olesen	6e1aaf27c1	Puny 24-byte structs are returned by value on SPARC. Pad these structs up so they are sret-returned even on that architecture. llvm-svn: 199188	2014-01-14 06:19:26 +00:00
Hans Wennborg	9125b08b52	Update tests in preparation for using the MS ABI for Win32 targets In preparation for making the Win32 triple imply MS ABI mode, make all tests pass in this mode, or make them use the Itanium mode explicitly. Differential Revision: http://llvm-reviews.chandlerc.com/D2401 llvm-svn: 199130	2014-01-13 19:48:13 +00:00
Benjamin Kramer	06e0dadede	test case hygiene. llvm-svn: 199017	2014-01-11 21:22:35 +00:00
Rafael Espindola	26d0f7ce7d	Use 'w' instead of 'c' to represent the win32 mangling. This change was requested to avoid confusion if we ever support non windows coff systems. llvm-svn: 198939	2014-01-10 13:42:17 +00:00
Justin Bogner	ef512b9929	CodeGen: Initial instrumentation based PGO implementation llvm-svn: 198640	2014-01-06 22:27:43 +00:00
Rafael Espindola	c418ae93a8	Update for llvm's DataLayout including mangling information. llvm-svn: 198439	2014-01-03 19:22:05 +00:00
Rafael Espindola	961728064e	Remove the now unused 's' specifications. llvm-svn: 198308	2014-01-02 14:06:59 +00:00
Jiangning Liu	94b0f0278e	For AArch64 Neon, simplify scalar dup by lane0 for fp. llvm-svn: 198195	2013-12-30 02:45:09 +00:00
Jiangning Liu	38799b1471	Add some missing test cases for ACLE intrinsics of AArch64 NEON. llvm-svn: 197994	2013-12-25 01:23:43 +00:00
Hao Liu	f96fd37888	[AArch64]The compare to zero intrinsics should be implemented by 'icmp/fcmp' and 'sext' not 'zext'. Modify the implementation by replacing zext with sext. llvm-svn: 197898	2013-12-23 02:44:00 +00:00
Rafael Espindola	9ec8d08eb1	Small simplification: p0 is the same as p. llvm-svn: 197700	2013-12-19 16:54:10 +00:00
Matt Arsenault	8ba4882c4b	Update SI datalayout for 32-bit private pointers llvm-svn: 197660	2013-12-19 05:33:14 +00:00
Rafael Espindola	dc265edb3b	On spacv8 f128 is only aligned to 64 bits. LLVM already got this right. Found on "Figure 3-1: Scalar Types" on http://sparc.com/standards/psABI3rd.pdf. llvm-svn: 197651	2013-12-19 03:03:04 +00:00
Rafael Espindola	1c09b264e3	Fix the DataLayout string produced by clang for NaCl. Reviewed by Derek Schuff. llvm-svn: 197628	2013-12-18 23:41:04 +00:00
Rafael Espindola	ea03a1ff1c	Add a test for mipsel-nacl too. llvm-svn: 197617	2013-12-18 22:40:42 +00:00
Rafael Espindola	0ea96eba43	Add -f64:32:64 to the darwin ppc32 DataLayout. A f64 inside a struct can be 32 bit aligned on darwin. llvm-svn: 197577	2013-12-18 15:16:50 +00:00
Rafael Espindola	754207bc5c	Use arm-nacl-gnueabi instead of arm-nacl to match the previous tests. llvm-svn: 197550	2013-12-18 04:53:17 +00:00
Rafael Espindola	667c576169	Split this test into one per supporter nacl arch. Right now clang produces the same DataLayout for all of them, but it could, for example, add 'n' specifications when the end architecture is given. No functionality change, this should just make future changes easier to read. llvm-svn: 197549	2013-12-18 04:35:56 +00:00
Rafael Espindola	4960968509	Print the 'p' specification before the 'i' specification. No functionality change. llvm-svn: 197548	2013-12-18 04:14:53 +00:00
Rafael Espindola	c2e60f52ae	Add a 's' specifications to AArch64. This has no functionality change as clang adds explicit alignment info for byval arguments. The only difference is that now the clang produced DataLayout string for AArch64 is identical to the LLVM produced one. llvm-svn: 197538	2013-12-17 23:30:58 +00:00
Rafael Espindola	07eeb29386	Use triples that match the -target-abi option. llvm-svn: 197522	2013-12-17 21:01:22 +00:00
Rafael Espindola	f034b6e4c2	Remove -f128:128 from the DataLayout strings. It is the default. llvm-svn: 197504	2013-12-17 16:07:35 +00:00
Rafael Espindola	12256302cf	The PS3 is a ppc64 and has 64 bit registers. Update DataLayout accordingly. llvm-svn: 197502	2013-12-17 15:40:00 +00:00
Rafael Espindola	26c67b7879	Remove -f16:16:32 from the XCore DataLayout string. This makes it identical to the string llvm produces. llvm-svn: 197500	2013-12-17 14:34:42 +00:00
Rafael Espindola	8ddf8bce91	Reorder these DataLayout entries to match the order LLVM uses. This completes the cleanup/refactoring of DataLayout on the clang side. Next is figuring out the differences between the llvm and clang produced strings llvm-svn: 197442	2013-12-17 00:04:48 +00:00
Rafael Espindola	2da3532aba	The preferred alignment defaults to the ABI one. Omit it if it is the same. llvm-svn: 197440	2013-12-16 23:27:41 +00:00
Rafael Espindola	91b0cbf3fc	Remove another default I missed before. llvm-svn: 197437	2013-12-16 23:03:23 +00:00
Rafael Espindola	04c685b5e4	Clang DataLayout string cleanup: don't print other defaults. I missed these in previous commits. llvm-svn: 197435	2013-12-16 22:50:41 +00:00
Rafael Espindola	7f53473de7	Remove dead data. The f80:128:128 was followed by a f80:32:32 and so never used. Looks like this was there since r91746. llvm-svn: 197433	2013-12-16 22:15:35 +00:00
Rafael Espindola	47debc0136	Clang DataLayout string cleanup: don't print the pointer defaults. llvm-svn: 197430	2013-12-16 21:59:14 +00:00
Rafael Espindola	61a69257a4	Clang DataLayout string cleanup: don't print the aggregate defaults. llvm-svn: 197429	2013-12-16 21:51:30 +00:00
Rafael Espindola	8a91f2fd85	Clang DataLayout string cleanup: don't print the vector defaults. llvm-svn: 197427	2013-12-16 21:38:22 +00:00
Rafael Espindola	20b0d92767	Clang DataLayout string cleanup: don't print the FP defaults. llvm-svn: 197422	2013-12-16 20:34:33 +00:00
Rafael Espindola	32083d503b	Clang DataLayout string cleanup: don't print the integer defaults. llvm-svn: 197421	2013-12-16 20:21:07 +00:00
Rafael Espindola	c4d672a49d	Misc test cleanups. * tbaa-struct.cpp always has a 64 bit pointer. * f32:32:32, f64:64:64 and f128:128:128 are defaults, don't assume they are printed. llvm-svn: 197415	2013-12-16 19:53:26 +00:00
Chad Rosier	75df5680fe	[AArch64] Fix v1fx patterns for Floating-point Multiply Extend and Floating-point Compare to Zero. llvm-svn: 197403	2013-12-16 18:29:54 +00:00
Rafael Espindola	ee4b398828	Add tests for all DescriptionString in Targets.cpp. These right now just test that the same string is present in two files, but will become more useful as clang's handling of DataLayout is refactored. llvm-svn: 197347	2013-12-15 17:53:44 +00:00
Rafael Espindola	a8df53f4ad	Consolidate DataLayout string testing in one file. llvm-svn: 197276	2013-12-13 21:49:53 +00:00
Rafael Espindola	46ea763e7c	Convert test to FileCheck. llvm-svn: 197269	2013-12-13 20:11:02 +00:00
Rafael Espindola	4c50d46e0c	Convert test to FileCheck llvm-svn: 197267	2013-12-13 19:44:40 +00:00
Rafael Espindola	f62bcc0d9c	Use a: and s: instead of a0: and s0: in the DataLayout strings. They are equivalent and the size of 'a' and 's' is unused. llvm-svn: 197256	2013-12-13 18:40:15 +00:00
Rafael Espindola	14c57ebed5	Convert test to FileCheck and make it more strict. llvm-svn: 197248	2013-12-13 17:47:34 +00:00
Rafael Espindola	984c9d86cb	Add a clang side test for pr18235 too. llvm-svn: 197242	2013-12-13 16:11:31 +00:00
Kevin Qin	daaae418d8	Fix Incorrect CHECK message [0-31]+ in test case. In regular expression, [0-31]+ equals to [0-3]+, not the number from 0 to 31. So change it to [0-9]+. llvm-svn: 197112	2013-12-12 02:17:35 +00:00
Reid Kleckner	5dc20b13e7	Update clang MS inline asm tests for r196939 llvm-svn: 196940	2013-12-10 18:27:51 +00:00
Daniel Sanders	c309be2f1f	[mips][msa] Correct sld and sldi builtins. Summary: The result register of these instructions is also the first operand. Reviewers: jacksprat, dsanders Reviewed By: dsanders Differential Revision: http://llvm-reviews.chandlerc.com/D2362 Differential Revision: http://llvm-reviews.chandlerc.com/D2363 llvm-svn: 196910	2013-12-10 11:37:00 +00:00
Kevin Qin	fb79d7f843	[AArch64 NEON] Support poly128_t and implement relevant intrinsic. llvm-svn: 196888	2013-12-10 06:49:01 +00:00
Hao Liu	844a7da243	[AArch64]Add missing pair intrinsics such as: int32_t vminv_s32(int32x2_t a) which should be compiled into SMINP Vd.2S,Vn.2S,Vm.2S llvm-svn: 196750	2013-12-09 03:52:22 +00:00
Ana Pazos	6a8b8b5f0d	Implemented vget/vset_lane_f16 intrinsics llvm-svn: 196535	2013-12-05 21:13:24 +00:00
Alp Toker	f6a24ce40f	Fix a tranche of comment, test and doc typos llvm-svn: 196510	2013-12-05 16:25:25 +00:00
Alp Toker	d473363876	Correct hyphenations in comments and assert messages This patch tries to avoid unrelated changes other than fixing a few hyphen-related ambiguities in nearby lines. llvm-svn: 196466	2013-12-05 04:47:09 +00:00
Reid Kleckner	739756c0f9	[ms-cxxabi] Construct and destroy call arguments in the correct order Summary: MSVC destroys arguments in the callee from left to right. Because C++ objects have to be destroyed in the reverse order of construction, Clang has to construct arguments from right to left and destroy arguments from left to right. This patch fixes the ordering by reversing the order of evaluation of all call arguments under the MS C++ ABI. Fixes PR18035. Reviewers: rsmith Differential Revision: http://llvm-reviews.chandlerc.com/D2275 llvm-svn: 196402	2013-12-04 19:23:12 +00:00
Richard Sandiford	cdd86884a4	[SystemZ] Fix handling of pass-by-pointer arguments I'd misunderstood getIndirect() to mean that the argument should be passed as a pointer at the ABI level, with the ByVal argument choosing caller-copy semantics over no-caller-copy (callee-copy-on-write) semantics. But getIndirect(x) actually means that x is passed by pointer at the IR level but (at least on all other targets I looked at) directly at the ABI level. getIndirect(x, false) selects a pointer to a caller-made copy, which is what SystemZ was aiming for. This fixes a miscompilation of c-index-test. Structure arguments were being passed by pointer, but no copy was being made, so a write in the callee stomped over a caller's local variable. llvm-svn: 196370	2013-12-04 09:59:57 +00:00
Kevin Qin	ad53b87c70	[AArch64 NEON] Add ACLE intrinsic vceqz_f64. llvm-svn: 196361	2013-12-04 08:02:11 +00:00
Kevin Qin	8903f8df4b	[AArch64 NEON] Add missing compare intrinsics. llvm-svn: 196359	2013-12-04 07:53:09 +00:00
NAKAMURA Takumi	0acd8a7561	clang/test: REQUIRES: s/x86-64-registered-target/x86-registered-target/ llvm-svn: 196350	2013-12-04 03:41:33 +00:00
NAKAMURA Takumi	7cbe30fc43	clang/test: REQUIRES: s/ppc{32\|64}-registered-target/powerpc-registered-target/ llvm-svn: 196349	2013-12-04 03:41:15 +00:00
NAKAMURA Takumi	a1d1388a2b	clang/test/CodeGen/builtins-nvptx.c: Prune "REQUIRES: nvptx64-registered-target". "nvptx" should imply it. llvm-svn: 196348	2013-12-04 03:41:02 +00:00
Hao Liu	a5246fde90	[AArch64]Add missing floating point convert, round and misc intrinsics. E.g. int64x1_t vcvt_s64_f64(float64x1_t a) -> FCVTZS Dd, Dn llvm-svn: 196211	2013-12-03 06:07:13 +00:00
Hao Liu	38658a8186	AArch64: add missing ACLE intrinsics mapping to general arithmetic operation from VFP instructions. E.g. float64x1_t vadd_f64(float64x1_t a, float64x1_t b) -> FADD Dd, Dn, Dm. llvm-svn: 196209	2013-12-03 05:58:49 +00:00
Jiangning Liu	e82327ddd3	Patch by Ana Pazos. Fixed vcopy_laneq_f64 intrinsic implementation. llvm-svn: 196206	2013-12-03 05:36:55 +00:00
Hao Liu	4b850c5e0d	revert r196152. This is a duplicate implementation. E.g. this patch defines: float64_t vabd_f64(float64_t a, float64_t b) But there is already a similar intrinsic "vabdd_f64" with the same types. Also, this intrinsic will be conflicted to the vector type intrinsic as following(Which is implemented by me and will be committed to trunk): float64x1_t vabd_f64(float64x1_t a, float64x1_t b). Two functions shouldn't have a same name in arm_neon.h. According to ARM ACLE document, such vabd_f64 with float64_t is not existing. So I revert this commit. llvm-svn: 196205	2013-12-03 05:35:17 +00:00
Hao Liu	ce258820ca	AArch64: Add missing scalar pair intrinsics. E.g. "float32_t vaddv_f32(float32x2_t a)" to be matched into "faddp s0, v1.2s". llvm-svn: 196199	2013-12-03 03:40:08 +00:00
Jiangning Liu	2d31f6f610	Add some missing AArch64 Neon intrinsics like vuqadd_s64 and friends. llvm-svn: 196191	2013-12-03 01:33:16 +00:00
Jiangning Liu	cc1da2c938	Add some missing AArch64 Neon intrinsics like vmull_high_n_s16 and friends. llvm-svn: 196189	2013-12-03 01:28:55 +00:00
Chad Rosier	25052adf21	[AArch64] Implemented vcopy_lane patterns using scalar DUP instruction. Patch by Ana Pazos! llvm-svn: 196153	2013-12-02 21:07:27 +00:00
Chad Rosier	b0574f3bf7	[AArch64] Add missing NEON scalar floating-point to integer convert ACLEs. llvm-svn: 196152	2013-12-02 21:07:24 +00:00
Hao Liu	8a0099e02c	Fix the problem that the range check for scalar narrow shift is too wide. E.g. the immediate value of vshrns_n_s16 is [1,16], which should be [1,8]. llvm-svn: 195942	2013-11-29 02:13:17 +00:00
Jiangning Liu	24173dd4b1	Add missing intrinsic function vbsl_f64 for AArch64 NEON. llvm-svn: 195940	2013-11-29 01:38:49 +00:00
Jiangning Liu	c8a9d762d3	Add missing intrinsic function vcombine_f64 for AArch64 NEON. llvm-svn: 195937	2013-11-29 01:29:57 +00:00
Jiangning Liu	ee3e08799c	Fix the AArch64 NEON bug exposed by checking constant integer argument range of ACLE intrinsics. llvm-svn: 195844	2013-11-27 14:02:55 +00:00
Chad Rosier	9e59285cc8	[AArch64] Add support for NEON scalar floating-point absolute difference. llvm-svn: 195804	2013-11-27 01:46:19 +00:00
Chad Rosier	52e31b20cb	[AArch64] Add support for NEON scalar floating-point to integer convert instructions. llvm-svn: 195789	2013-11-26 22:17:51 +00:00
Manman Ren	4b7f23d885	Debug Info: add a "Debug Info Version" module flag to output the current debug info version number. Will error out when modules have different version numbers. llvm-svn: 195495	2013-11-22 19:42:45 +00:00
Justin Bogner	7fa2eb9f49	Revert r193994 and part of r193995 Not long ago I made the CodeGen of for loops simplify the condition at -O0 in the same way we do for if and conditionals. Unfortunately this ties how loops and simple conditions work together too tightly, which makes features such as instrumentation based PGO awkward. Ultimately, we should find a more general way to simplify the logic in a given condition, but for now we'll just avoid using EmitBranchOnBool for loops, like we already do for while and do loops. llvm-svn: 195438	2013-11-22 10:20:43 +00:00
Artyom Skrobov	be36c42df6	Deleting three tests that are redundant with test/Preprocessor/arm-target-features.c and test/Driver/arm-cortex-cpus.c llvm-svn: 195430	2013-11-22 09:21:51 +00:00
Jiangning Liu	72b624ac16	For AArch64, intrinsic vget_low_xxx can be optimized away. llvm-svn: 195409	2013-11-22 02:46:20 +00:00
Ana Pazos	dbd1a22496	Implemented Neon scalar vdup_lane intrinsics. Fixed scalar dup alias and added test case. llvm-svn: 195329	2013-11-21 08:15:01 +00:00
Ana Pazos	2b02688fd9	Implemented Neon scalar by element intrinsics. Intrinsics implemented: vqdmull_lane, vqdmulh_lane, vqrdmulh_lane, vqdmlal_lane, vqdmlsl_lane scalar Neon intrinsics. llvm-svn: 195326	2013-11-21 07:36:33 +00:00
Justin Holewinski	f9329ff650	[NVPTX] Update ABI handling For PTX, we want the target to handle struct returns directly. llvm-svn: 195268	2013-11-20 20:35:34 +00:00
Reid Kleckner	cc99e26475	Add a mangler entry point for TBAA rather than using RTTI directly Summary: RTTI is not yet implemented for the Microsoft C++ ABI and isn't expected soon. We could easily add the mangling, but the error is what prevents us from silently miscompiling code that expects RTTI. Instead, add a new mangleTypeName entry point that simply forwards to mangleName or mangleType to produce a string that isn't part of the ABI. Itanium can continue to use RTTI names to avoid unecessary test breakage. This also seems like the right design. The fact that TBAA names happen to be RTTI names is now an implementation detail of the mangler, rather than part of TBAA. Differential Revision: http://llvm-reviews.chandlerc.com/D2153 llvm-svn: 195168	2013-11-19 23:23:00 +00:00
Hao Liu	171cedf61e	Implement AArch64 neon instructions class SIMD lsone and SIMD lone-post. llvm-svn: 195079	2013-11-19 02:17:31 +00:00
Jiangning Liu	fe916e20f2	Implement AArch64 SISD intrinsics for vget_high and vget_low. llvm-svn: 195073	2013-11-19 01:46:34 +00:00
Jiangning Liu	3311f374a8	Add predicate for AArch64 crypto instructions. llvm-svn: 195069	2013-11-19 01:38:19 +00:00
Hao Liu	5e4ce1ae9d	Implement the newly added AArch64 ACLE functions for ld1/st1 with 2/3/4 vectors. The functions are like: vst1_s8_x2 ... llvm-svn: 194991	2013-11-18 06:33:43 +00:00
Hao Liu	9e49704f59	Implement vreinterpret ACLE functions in Clang. llvm-svn: 194954	2013-11-17 09:32:59 +00:00
Ana Pazos	6f2a47a9e5	Implemented aarch64 Neon scalar vmulx_lane intrinsics Implemented aarch64 Neon scalar vfma_lane intrinsics Implemented aarch64 Neon scalar vfms_lane intrinsics Implemented legacy vmul_n_f64, vmul_lane_f64, vmul_laneq_f64 intrinsics (v1f64 parameter type) using Neon scalar instructions. Implemented legacy vfma_lane_f64, vfms_lane_f64, vfma_laneq_f64, vfms_laneq_f64 intrinsics (v1f64 parameter type) using Neon scalar instructions. llvm-svn: 194889	2013-11-15 23:33:31 +00:00
Chad Rosier	7fa60db4a9	These ACLE tests no longer need to cast the return value. llvm-svn: 194854	2013-11-15 21:28:24 +00:00
Chad Rosier	7aaee48bf0	[AArch64] Add support for legacy AArch32 NEON scalar shift right by immediate and accumulate instructions. llvm-svn: 194732	2013-11-14 22:02:24 +00:00
Kevin Qin	3058bf4533	Remove a test failure. llvm-svn: 194678	2013-11-14 07:00:00 +00:00
Kevin Qin	91ac11387c	Add test case for AArch64 NEON poly64 intrinsic. llvm-svn: 194674	2013-11-14 06:49:00 +00:00
Kevin Qin	9e255dd532	Add test cases for AArch64 NEON instruction set misc. llvm-svn: 194672	2013-11-14 06:44:42 +00:00
Jiangning Liu	18b707cb3f	Implement AArch64 NEON instruction set AdvSIMD (table). llvm-svn: 194649	2013-11-14 01:57:55 +00:00
Reid Kleckner	59e4a6f5e2	-fms-extensions: Recognize _alloca as an alias for the alloca builtin Differential Revision: http://llvm-reviews.chandlerc.com/D1989 llvm-svn: 194617	2013-11-13 22:58:53 +00:00
Reid Kleckner	cf8933d1b7	Only provide MS builtins when -fms-extensions is on We already have builtins that are only available in GNU mode, so this mirrors that. Reviewers: rsmith Differential Revision: http://llvm-reviews.chandlerc.com/D2128 llvm-svn: 194615	2013-11-13 22:47:22 +00:00
Chad Rosier	e714a962b5	[AArch64] Tests for legacy AArch32 NEON scalar shift by immediate instructions. A number of non-overloaded intrinsics have been replaced by thier overloaded counterparts. llvm-svn: 194599	2013-11-13 20:05:44 +00:00
Weiming Zhao	87bb4920e9	add intrinsics: __builtin_arm_{dmb,dsb} for ARM llvm-svn: 194513	2013-11-12 21:42:50 +00:00
Daniel Sanders	8b59af15ed	[mips][msa] Enable inlinse assembly for MSA. Like GCC, this re-uses the 'f' constraint and a new 'w' print-modifier: asm ("ldi.w %w0, 1", "=f"(result)); Unlike GCC, the 'w' print-modifer is not _required_ to produce the intended output. This is a consequence of differences in the internal handling of the registers in each compiler. To be source-compatible between the compilers, users must use the 'w' print-modifier. MSA registers (including control registers) are supported in clobber lists. llvm-svn: 194476	2013-11-12 12:56:01 +00:00
Daniel Sanders	9626df1eb2	[mips] Added fpu register tests to tests/CodeGen/mips-clobber-reg.c llvm-svn: 194474	2013-11-12 11:38:20 +00:00
Daniel Sanders	923af19f29	[mips] Small fixes to test/CodeGen/mips-clobber-reg.c Fixed the following: - Whitespace at end of most lines - $11 test actually testing $10 llvm-svn: 194473	2013-11-12 11:15:48 +00:00
Robert Lytton	eaf6f36e6d	XCore target requires preferred alignment. The xcore llvm backend does not handle 8 byte alignment viz: "%BadAlignment = alloca i64, align 8" So getPreferredTypeAlign() must never overalign. llvm-svn: 194462	2013-11-12 10:09:34 +00:00
Akira Hatanaka	c4baedd71d	[mips] Partially revert r193640. Stack alignment should not be determined by the floating point register mode. llvm-svn: 194426	2013-11-11 22:10:46 +00:00
Chad Rosier	09f5251c4d	[AArch64] The shift right/left and insert immediate builtins expect 3 source operands, a vector, an element to insert, and a shift amount. llvm-svn: 194407	2013-11-11 19:11:19 +00:00
Chad Rosier	249c714bb4	[AArch64] Add support for NEON scalar floating-point convert to fixed-point instructions. llvm-svn: 194395	2013-11-11 18:04:22 +00:00
Will Dietz	949ec546c4	ubsan: Only emit constants for filenames and type descriptors once. Produces neater IR in significantly less time. (~18% faster -O0 compile time for sqlite3 with -fsanitize=undefined) llvm-svn: 194231	2013-11-08 01:09:22 +00:00
Jiangning Liu	c628af66c7	Implement AArch64 Neon instruction set Perm. llvm-svn: 194124	2013-11-06 03:35:53 +00:00
Jiangning Liu	37f5bb1b28	Implement AArch64 Neon instruction set Bitwise Extract. llvm-svn: 194119	2013-11-06 02:26:12 +00:00
Jiangning Liu	34a7109b47	Implement AArch64 Neon Crypto instruction classes AES, SHA, and 3 SHA. llvm-svn: 194086	2013-11-05 17:42:24 +00:00
Kevin Qin	9eece7b5e0	Implemented aarch64 neon intrinsic vcopy_lane with float type. llvm-svn: 194042	2013-11-05 02:05:44 +00:00
Justin Bogner	e0ccdb1a28	CodeGen: Test that simple expressions are simplified at -O0 llvm-svn: 193995	2013-11-04 16:13:23 +00:00
Bob Wilson	2c82c3d033	OS X 10.9+ and iOS 7+ support load/store of big atomic objects. rdar://13973577 Patch by Fariborz Jahanian. llvm-svn: 193935	2013-11-02 23:27:49 +00:00
Chad Rosier	74329d6cff	[AArch64] Add support for NEON scalar fixed-point convert to floating-point instructions. llvm-svn: 193817	2013-10-31 22:37:08 +00:00
Chad Rosier	bdca387884	[AArch64] Add support for NEON scalar shift immediate instructions. llvm-svn: 193791	2013-10-31 19:29:05 +00:00
Daniel Sanders	d5f554f0bb	[mips][msa] Correct definition of bins[lr] and CHECK-DAG-ize related tests llvm-svn: 193695	2013-10-30 15:45:42 +00:00
Daniel Sanders	ab94b537d7	[mips][msa] Added support for matching bmnz, bmnzi, bmz, and bmzi from normal IR (i.e. not intrinsics) Also corrected the definition of the intrinsics for these instructions (the result register is also the first operand), and added intrinsics for bsel and bseli to clang (they already existed in the backend). These four operations are mostly equivalent to bsel, and bseli (the difference is which operand is tied to the result). As a result some of the tests changed as described below. bitwise.ll: - bsel.v test adapted so that the mask is unknown at compile-time. This stops it emitting bmnzi.b instead of the intended bsel.v. - The bseli.b test now tests the right thing. Namely the case when one of the values is an uimm8, rather than when the condition is a uimm8 (which is covered by bmnzi.b) compare.ll: - bsel.v tests now (correctly) emits bmnz.v instead of bsel.v because this is the same operation (see MSA.txt). i8.ll - CHECK-DAG-ized test. - bmzi.b test now (correctly) emits equivalent bmnzi.b with swapped operands because this is the same operation (see MSA.txt). - bseli.b still emits bseli.b though because the immediate makes it distinguishable from bmnzi.b. vec.ll: - CHECK-DAG-ized test. - bmz.v tests now (correctly) emits bmnz.v with swapped operands (see MSA.txt). - bsel.v tests now (correctly) emits bmnz.v with swapped operands (see MSA.txt). llvm-svn: 193693	2013-10-30 15:20:38 +00:00
Chad Rosier	4d55e6e0a4	[AArch64] Add support for NEON scalar floating-point compare instructions. llvm-svn: 193692	2013-10-30 15:20:07 +00:00
Daniel Sanders	d74b130cc9	[mips][msa] Added support for matching bins[lr]i.[bhwd] from normal IR (i.e. not intrinsics) This required correcting the definition of the bins[lr]i intrinsics because the result is also the first operand. It also required removing the (arbitrary) check for 32-bit immediates in MipsSEDAGToDAGISel::selectVSplat(). Currently using binsli.d with 2 bits set in the mask doesn't select binsli.d because the constant is legalized into a ConstantPool. Similar things can happen with binsri.d with more than 10 bits set in the mask. The resulting code when this happens is correct but not optimal. llvm-svn: 193687	2013-10-30 14:45:14 +00:00
Akira Hatanaka	618b29813a	[mips] Align the stack to 16-bytes for -mfp64. llvm-svn: 193640	2013-10-29 19:00:35 +00:00
Richard Smith	426a47bddb	Fix a parser crash when there are #pragmas in a context which requires a single statement (after a case label, if, etc). Patch by Olivier Goffart! llvm-svn: 193545	2013-10-28 22:04:30 +00:00
Alp Toker	a933f94c92	FileCheckize llvm-svn: 193474	2013-10-26 15:43:55 +00:00
Alp Toker	1774954274	Quote wildcard in test's grep argument The * could otherwise cause shell pathname expansion. llvm-svn: 193473	2013-10-26 14:52:48 +00:00
Manman Ren	c94122e05b	Intrinsics: fix extract & insert when index is out of bound. Now, all extract & insert intrinsics should have the correct and operation to ignore higher bits. rdar://15250497 llvm-svn: 193267	2013-10-23 20:33:14 +00:00
Daniel Sanders	ffd8df29b6	[mips][msa] Add intrinsics that map to pseudo-instructions. Unlike the previously added intrinsics, these do not map to a single instruction on MIPS32. They are provided for regularity (to round out the .[bhw] variants of the same operation) and compatibility with GCC. Includes: copy_[us].d, fill.d, insert.d, insve.d llvm-svn: 193237	2013-10-23 10:12:44 +00:00
Richard Smith	6b53e224eb	Split -fsanitize=bounds to -fsanitize=array-bounds (for the frontend-inserted check using the ubsan runtime) and -fsanitize=local-bounds (for the middle-end check which inserts traps). Remove -fsanitize=local-bounds from -fsanitize=undefined. It does not produce useful diagnostics and has false positives (PR17635), and is not a good compromise position between UBSan's checks and ASan's checks. Map -fbounds-checking to -fsanitize=local-bounds to restore Clang's historical behavior for that flag. llvm-svn: 193205	2013-10-22 22:51:04 +00:00
Rafael Espindola	d53ffa0a70	Treat aliases as definitions. This fixes pr17639. Before this patch clang would consider void foo(void) __attribute((alias("__foo"))); a declaration. It now correctly handles it as a definition. Initial patch by Alp Toker. I added support for variables. llvm-svn: 193200	2013-10-22 21:39:03 +00:00
Manman Ren	be38b9e15f	_mm_extract_epi16: use "& 7" when index is out of bound. This is in line with implementation of _mm_extract_pi16. rdar://15250497 llvm-svn: 193187	2013-10-22 19:24:42 +00:00
Chad Rosier	c2a0b13c25	[AArch64] Add the constraint to NEON scalar mla/mls instructions. llvm-svn: 193118	2013-10-21 20:12:01 +00:00
Rafael Espindola	156f634aa1	Make this test pass -verify. Instead of using not, just drop the fastcall attribute which was causing an warning: calling convention 'fastcall' ignored for this target llvm-svn: 193110	2013-10-21 19:48:28 +00:00
Matheus Almeida	70fbf77546	[mips][msa] Fix definition of SLD instruction. The second parameter of the SLD intrinsic is the number of columns (GPR) to slide left the source array. llvm-svn: 193076	2013-10-21 11:47:56 +00:00
Chad Rosier	3c03dee1d1	[AArch64] Add support for NEON scalar extract narrow instructions. llvm-svn: 192971	2013-10-18 14:03:36 +00:00
Chad Rosier	e7465644c6	[AArch64] Add support for NEON scalar three register different instruction class. The instruction class includes the signed saturating doubling multiply-add long, signed saturating doubling multiply-subtract long, and the signed saturating doubling multiply long instructions. llvm-svn: 192909	2013-10-17 18:12:50 +00:00
Daniel Sanders	c835a9fe80	[mips][msa] Added most of the remaining builtins Includes: and.v, bmnz.v, bmz.v, bnz.[bhwdv], bz.[bhwdv], cfcmsa, ctcmsa, fcaf, fcor, fcueq, fcul[et], fcun, fcune, fsaf, fsueq, fsul[et], fsun, fsune, ftrunc hadd_[su].[hwd], hsub_[su].[hwd], insert.[bhw], insve.[bhw], ld.[bhwd], move.v, nor.v, or.v, srar.[bhwd], srari.[bhwd], srlr.[bhwd], srlri.[bhwd], st.[bhwd], subsus_u.[bhwd], subsuu_s.[bhwd], vshf.[bhwd], xor.v llvm-svn: 192896	2013-10-17 13:57:25 +00:00

... 2 3 4 5 6 ...

2504 Commits