operators where one type is a C complex type, and to emit both the
efficient and correct implementation for complex arithmetic according to
C11 Annex G using this extra information.
For both multiply and divide the old code was writing a long-hand
reduced version of the math without any of the special handling of inf
and NaN recommended by the standard here. Instead of putting more
complexity here, this change does what GCC does which is to emit
a libcall for the fully general case.
However, the old code also failed to do the proper minimization of the
set of operations when there was a mixed complex and real operation. In
those cases, C provides a spec for much more minimal operations that are
valid. Clang now emits the exact suggested operations. This change isn't
*just* about performance though; without minimizing these operations, we
again lose the correct handling of infinities and NaNs. It is critical
that this happen in the frontend based on asymmetric type operands to
complex math operations.
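As an illustration (a hypothetical snippet, not taken from this change), a mixed
operation like this one should now lower to just two real multiplies, with no
inf/NaN fixup and no libcall:
#include <complex.h>
double complex scale(double complex z, double r) {
  return z * r;  // complex * real: Annex G allows the reduced form
                 // (creal(z)*r) + (cimag(z)*r)*I, keeping inf/NaN handling correct
}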
The performance implications of this change aren't trivial either. I've
run a set of benchmarks in Eigen, an open source mathematics library
that makes heavy use of complex. While a few have slowed down due to the
libcall being introduced, most sped up and some by a huge amount: up to
100% and 140%.
In order to make all of this work, also match the algorithm in the
constant evaluator to the one in the runtime library. Currently it is
a broken port of the simplifications from C's Annex G to the long-hand
formulation of the algorithm.
Splitting this patch up is very hard because none of this works without
the AST change to preserve non-complex operands. Sorry for the enormous
change.
Follow-up changes will include support for sinking the libcalls onto
cold paths in common cases and fastmath improvements to allow more
aggressive backend folding.
Differential Revision: http://reviews.llvm.org/D5698
llvm-svn: 219557
Make it possible to pass NULL through variadic functions on 64-bit
Windows targets. The Visual C++ headers define NULL to 0, when they
should define it to 0LL on Win64 so that NULL is a pointer-sized
integer.
Fixes PR20949.
Reviewers: thakis, rsmith
Differential Revision: http://reviews.llvm.org/D5480
llvm-svn: 219456
We already add the align parameter attribute for function parameters that have
the align_value attribute (or those with a typedef type having that attribute),
which is an important special case, but does not handle pointers with value
alignment assumptions that come into scope in any other way. To handle the
general case, emit an @llvm.assume-based alignment assumption whenever we load
the pointer-typed lvalue of an align_value-attributed variable (except for
function parameters, which we already deal with at entry).
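As a rough sketch of the general case this now covers (hypothetical names, not
from the tests):
typedef double *aligned_double_ptr __attribute__((align_value(64)));
aligned_double_ptr P;          // value-alignment assumption attached to the variable
double first(void) {
  double *p = P;               // loading P's pointer-typed lvalue now also emits an
                               // @llvm.assume-based alignment assumption on the value
  return p[0];
}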
I'll also note that this is more general than Intel's described support in:
https://software.intel.com/en-us/articles/data-alignment-to-assist-vectorization
which states that the compiler inserts __assume_aligned directives in response
to align_value-attributed variables only for function parameters and for the
initializers of local variables. I think that we can make the optimizer deal
with this more-general scheme (which could lead to a lot of calls to
@llvm.assume inside of loop bodies, for example), but if not, I'll rework this
to be less aggressive.
llvm-svn: 219052
This reverts commit r218917, effectively reapplying r218913. Original
commit message follows.
--
Update debug info testcases for an LLVM metadata schema change to fold
metadata constant operands into a single `MDString`.
Part of PR17891.
llvm-svn: 219011
This test includes stdint.h (via stdatomic.h), which might include system
headers (and that might not work, depending on the system configuration).
Attempting to fix llvm-clang-lld-x86_64-debian-fast.
llvm-svn: 218960
Adds a Clang-specific implementation of C11's stdatomic.h header. On systems,
such as FreeBSD, where a stdatomic.h header is already provided, we defer to
that header instead (using our __has_include_next technology). Otherwise, we
provide an implementation in terms of our __c11_atomic_* intrinsics (that were
created for this purpose).
C11 7.1.4p1 requires function declarations for atomic_thread_fence,
atomic_signal_fence, atomic_flag_test_and_set,
atomic_flag_test_and_set_explicit, and atomic_flag_clear, and requires that
they have external linkage. Accordingly, we provide these declarations, but if
a user elides the shadowing macros and uses them, then they must have a libc
(or similar) that actually provides definitions.
atomic_flag is implemented using _Bool as the underlying type. This is
consistent with the implementation provided by FreeBSD and also GCC 4.9 (at
least when __GCC_ATOMIC_TEST_AND_SET_TRUEVAL == 1).
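A minimal usage sketch of the header (hypothetical example, not one of the new
tests):
#include <stdatomic.h>
static atomic_int counter = ATOMIC_VAR_INIT(0);
static atomic_flag lock = ATOMIC_FLAG_INIT;
void bump(void) {
  while (atomic_flag_test_and_set_explicit(&lock, memory_order_acquire))
    ;  // spins on the underlying _Bool via the __c11_atomic_* intrinsics
  atomic_fetch_add_explicit(&counter, 1, memory_order_relaxed);
  atomic_flag_clear_explicit(&lock, memory_order_release);
}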
Patch by Richard Smith (rebased and slightly edited by me -- Richard said I
should drive at this point).
llvm-svn: 218957
Update debug info testcases for an LLVM metadata schema change to fold
metadata constant operands into a single `MDString`.
Part of PR17891.
llvm-svn: 218913
This adds support for the align_value attribute. This attribute is supported by
Intel's compiler (versions 14.0+), and several of my HPC users have requested
support in Clang. It specifies an alignment assumption on the values to which a
pointer points, and is used by numerical libraries to encourage efficient
generation of vector code.
Of course, we already have an aligned attribute that can specify enhanced
alignment for a type, so why is this additional attribute important? The
problem is that if you want to specify that an input array of T is, say,
64-byte aligned, you could try this:
typedef double aligned_double __attribute__((aligned(64)));
void foo(aligned_double *P) {
double x = P[0]; // This is fine.
double y = P[1]; // What alignment did those doubles have again?
}
The access here to P[1] causes problems. P was specified as a pointer to type
aligned_double, and any object of type aligned_double must be 64-byte aligned.
But if P[0] is 64-byte aligned, then P[1] cannot be, and this access causes
undefined behavior. Getting round this problem requires a lot of awkward
casting and hand-unrolling of loops, all of which is bad.
With the align_value attribute, we can accomplish what we'd like in a well
defined way:
typedef double *aligned_double_ptr __attribute__((align_value(64)));
void foo(aligned_double_ptr P) {
double x = P[0]; // This is fine.
double y = P[1]; // This is fine too.
}
This attribute does not create a new type (and so is not part of the type
system), and so will only "propagate" through templates, auto, etc. by
optimizer deduction after inlining. This seems consistent with Intel's
implementation (thanks to Alexey for confirming the various Intel-compiler
behaviors).
As a final note, I would have chosen to call this aligned_value, not
align_value, for better naming consistency with the aligned attribute, but I
think it would be more useful to users to adopt Intel's name.
llvm-svn: 218910
Prior to GCC 4.4, __sync_fetch_and_nand was implemented as:
{ tmp = *ptr; *ptr = ~tmp & value; return tmp; }
but this was changed in GCC 4.4 to be:
{ tmp = *ptr; *ptr = ~(tmp & value); return tmp; }
In response to this change, support for __sync_fetch_and_nand (and
__sync_nand_and_fetch) was removed in r99522 in order to avoid miscompiling code
depending on the old semantics. However, at this point:
1. Many years have passed, and the amount of code relying on the old
semantics is likely smaller.
2. Through the work of many contributors, all LLVM backends have been updated
such that "atomicrmw nand" provides the newer GCC 4.4+ semantics (this process
was complete July of 2014 (added to the release notes in r212635).
3. The lack of this intrinsic is now a needless impediment to porting codes
from GCC to Clang (I've now seen several examples of this).
It is true, however, that we still set __GNUC_MINOR__ to 2 (corresponding to GCC
4.2). To compensate for this, and to address the original concern regarding
code relying on the old semantics, I've added a warning that specifically
details the fact that the semantics have changed and that we provide the newer
semantics.
Fixes PR8842.
llvm-svn: 218905
In addition to __builtin_assume_aligned, GCC also supports an assume_aligned
attribute which specifies the alignment (and optional offset) of a function's
return value. Here we implement support for the assume_aligned attribute by making
use of the @llvm.assume intrinsic.
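For example (hypothetical declarations, assuming a 64-byte-aligned allocator):
void *my_alloc(unsigned long size) __attribute__((assume_aligned(64)));
void *my_alloc_off(unsigned long size) __attribute__((assume_aligned(64, 16)));
// Calls to these now produce an @llvm.assume asserting that the returned pointer
// is 64-byte aligned (with a 16-byte offset in the second case).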
llvm-svn: 218500
AFAICT the semantics of frem match libm's fmod.
Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
Reviewed-by: Tom Stellard <tom@stellard.net>
llvm-svn: 218488
Fixes PR21027. The MIDL compiler produces code that does this.
If we wanted to improve the warning, I think we could do this:
void __stdcall f(); // Don't warn without -Wstrict-prototypes.
void g() {
f(); // Might warn, the user probably meant for f to take no args.
f(1, 2, 3); // Warn, we have no idea what args f takes.
f(1); // Error, this is insane, one of these calls is broken.
}
Reviewers: thakis
Differential Revision: http://reviews.llvm.org/D5481
llvm-svn: 218394
Summary:
Vectors are normally 16-byte aligned, however the O32 ABI enforces a
maximum alignment of 8-bytes since the base of the stack is 8-byte aligned.
Previously, this was enforced on the caller side, but not on the callee side.
This fixes the output of OpenCL's printf when given vectors.
Reviewers: atanasyan
Reviewed By: atanasyan
Subscribers: llvm-commits, pekka.jaaskelainen
Differential Revision: http://reviews.llvm.org/D5433
llvm-svn: 218248
Summary:
This fixes PR20023. In order to implement this scoping rule, we piggy
back on the existing LabelDecl machinery, by creating LabelDecl's that
will carry the "internal" name of the inline assembly label, which we
will rewrite the asm label to.
Reviewers: rnk
Subscribers: cfe-commits
Differential Revision: http://reviews.llvm.org/D4589
llvm-svn: 218230
According to lore, we used to verifier-fail on:
void __thiscall f();
int main() { f(1); }
So that's fixed now. System headers use prototype-less __stdcall functions,
so make that a warning that's DefaultError -- then it fires on regular code
but is suppressed in system headers.
Since it's used in system headers, we have codegen tests for this; massage
them slightly so that they still compile.
llvm-svn: 218166
They are added to adxintrin.h but outside the __ADX__ block.
These intrinsics generate adc and sbb respectively, which were available before ADX.
llvm-svn: 218118
Summary:
This patch implements a new UBSan check, which verifies
that function arguments declared to be nonnull with __attribute__((nonnull))
are actually nonnull in runtime.
To implement this check, we pass FunctionDecl to CodeGenFunction::EmitCallArgs
(where applicable) and if function declaration has nonnull attribute specified
for a certain formal parameter, we compare the corresponding RValue to null as
soon as it's calculated.
Test Plan: regression test suite
Reviewers: rsmith
Reviewed By: rsmith
Subscribers: cfe-commits, rnk
Differential Revision: http://reviews.llvm.org/D5082
llvm-svn: 217389
This makes use of the recently-added @llvm.assume intrinsic to implement a
__builtin_assume(bool) intrinsic (to provide additional information to the
optimizer). This hooks up __assume in MS-compatibility mode to mirror
__builtin_assume (the semantics have been intentionally kept compatible), and
implements GCC's __builtin_assume_aligned as assume((p - o) & mask == 0). LLVM
now contains special logic to deal with assumptions of this form.
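A small sketch of how this looks from user code (hypothetical function):
double sum(double *p, int n) {
  __builtin_assume(n % 16 == 0);         // lowered to @llvm.assume
  p = __builtin_assume_aligned(p, 64);   // roughly assume(((uintptr_t)p & 63) == 0)
  double s = 0;
  for (int i = 0; i < n; ++i) s += p[i];
  return s;
}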
llvm-svn: 217349
made the 8-bit masks actually 8-bit arguments to these intrinsics.
These builtins are a mess. Many were missing the I qualifier which
I added where obviously correct. Most aren't tested, but I've updated
the relevant tests. I've tried to catch all the things that should
become 'c' in this round.
It's also frustrating because the set of these is really ad-hoc and
doesn't really map that cleanly to the set supported by either GCC or
LLVM. Oh well...
llvm-svn: 217311
This patch adds support for the 32bit numeric max/min and directed round-to-integral NEON intrinsics that were added as part of v8, along with unit tests.
Patch by Graham Hunter!
llvm-svn: 217242
For naked functions with parameters, Clang would still emit stores in the prologue
that would clobber the stack, because LLVM doesn't set up a stack frame. (This
shows up in -O0 compiles, because the stores are optimized away otherwise.)
For example:
__attribute__((naked)) int f(int x) {
asm("movl $42, %eax");
asm("retl");
}
Would result in:
_Z1fi:
movl 12(%esp), %eax
movl %eax, (%esp) <--- Oops.
movl $42, %eax
retl
Differential Revision: http://reviews.llvm.org/D5183
llvm-svn: 217198
If control falls off the end of a function after an __asm block, MSVC
assumes that the inline assembly filled the EAX and possibly EDX
registers with an appropriate return value. This functionality is used
in inline functions returning 64-bit integers in system headers, so we
need some amount of compatibility.
This is implemented in Clang by adding extra output constraints to every
inline asm block, and storing the resulting output registers into the
return value slot. If we see an asm block somewhere in the function
body, we emit a normal epilogue instead of marking the end of the
function with a return type unreachable.
Normal returns in functions not using this functionality will overwrite
the return value slot, and in most cases LLVM should be able to
eliminate the dead stores.
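For instance (a hypothetical function, in MS inline-asm mode), this now stores
EDX:EAX to the return slot instead of ending in unreachable:
unsigned __int64 read_tsc(void) {
  __asm rdtsc   // leaves the 64-bit result in EDX:EAX
  // control falls off the end; the registers are taken as the return value
}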
Fixes PR17201.
Reviewed By: majnemer
Differential Revision: http://reviews.llvm.org/D5177
llvm-svn: 217187
Summary:
This allows us to easily find them in the backend after the aggregates have
been lowered to other types. This is important on big-endian targets using
the N32/N64 ABIs, since these ABIs must shift small structures into the
upper bits of the register.
Reviewers: atanasyan
Reviewed By: atanasyan
Subscribers: llvm-commits
Differential Revision: http://reviews.llvm.org/D5005
llvm-svn: 217160
Summary:
They are returned indirectly which causes the other arguments to move to
the next argument slot.
With this, utils/ABITest does not discover any failing cases in the first
500 attempts on big/little endian for O32. Previously some of these failed.
Also tested N32/N64 little endian (big endian has other known issues) with
no issues.
Reviewers: atanasyan
Reviewed By: atanasyan
Subscribers: atanasyan, cfe-commits
Differential Revision: http://reviews.llvm.org/D4811
llvm-svn: 217147
Using the intrinsic allows the SelectionDAGBuilder to turn this call
into the FABS Node and also the intrinsic is something the vectorizer knows
how to vectorize.
This patch also sets the readnone attribute on this call, which should
enable additional optimizations.
llvm-svn: 217042
Previously, EnterStructPointerForCoercedAccess used Alloc size when determining how to convert. This was problematic, because there were situations where the alloc size was larger than the store size. For example, if the first element of a structure were i24 and the destination type were i32, the old code would generate a GEP and a load i24. The code should compare store sizes to ensure the whole object is loaded. I have attached a test case.
This patch modifies the output of arm64-be-bitfield.c test case, but the new IR seems to be equivalent, and after -O3, the compiler generates identical ARM assembly. (asr x0, x0, #54)
Patch by Thomas Jablin!
llvm-svn: 216722
Summary:
We did a great job getting this wrong:
- We messed up which LLVM IR types to use for arguments and return values.
The optimized libcalls use integer types for values.
Clang attempted to use the IR type which corresponds to the value
passed in instead of using an appropriately sized integer type. This
would result in violations of the ABI for, as an example, floating
point types.
- We didn't bother recording the result of the atomic libcall in the
destination memory.
Instead, call the functions with arguments matching the type of the
libcall prototype's parameters.
This fixes PR20780.
Differential Revision: http://reviews.llvm.org/D5098
llvm-svn: 216714
Summary:
The current implementation of asan cookie is incorrect:
we add nosanitize metadata to the cookie load, but the metadata may be lost
and we will instrument the load from poisoned memory.
This change replaces the load with a call to __asan_load_cxx_array_cookie (r216692)
Reviewers: rsmith
Reviewed By: rsmith
Subscribers: cfe-commits
Differential Revision: http://reviews.llvm.org/D5111
llvm-svn: 216702
For the following code:
__declspec(dllimport) int f(int x);
int user(int x) {
return f(x);
}
int f(int x) { return 1; }
Clang will drop the dllimport attribute in the AST, but CodeGen would have
already put it on the LLVM::Function, and that would never get updated.
(The same thing happens for global variables.)
This makes Clang check for a dropped DLL attribute each time the LLVM object
is referenced.
This isn't perfect, because we will still get it wrong if the function is
never referenced by codegen after the attribute is dropped, but this handles
the common cases and makes us not fail in the verifier.
llvm-svn: 216699
Summary:
ACLE 2.0 section 9.2 defines the following "miscellaneous data processing intrinsics": `__clz`, `__cls`, `__ror`, `__rev`, `__rev16`, `__revsh` and `__rbit`.
`__clz` has already been implemented in the arm_acle.h header file. The rest are not supported yet. This patch completes ACLE data processing intrinsics.
Reviewers: t.p.northover, rengolin
Reviewed By: rengolin
Subscribers: aemerson, mroth, llvm-commits
Differential Revision: http://reviews.llvm.org/D4983
llvm-svn: 216658
ACLE 2.0 allows __fp16 to be used as a function argument or return
type. This enables this for AArch64.
This also fixes an existing bug that causes clang to not allow
homogeneous floating-point aggregates with a base type of __fp16. This
is valid for AAPCS64, but not for AAPCS-VFP.
llvm-svn: 216558
This tidies up some ARM-specific code added by r208417 to move it out
of the target-independent parts of clang into TargetInfo.cpp. This
also has the advantage that we can now flatten struct arguments to
variadic AAPCS functions.
llvm-svn: 216535
This time though, preserve the extension for bool types since that's compatible
with what MSVC expects.
See http://reviews.llvm.org/D4380
llvm-svn: 216507
Summary:
MSVC doesn't extend integer types smaller than 64bit, so to preserve
binary compatibility, clang shouldn't either.
For example, the following C code built with MSVC:
unsigned test(unsigned v);
unsigned foobar(unsigned short);
int main() { return test(0xffffffff) + foobar(28); }
Produces the following:
0000000000000004: B9 FF FF FF FF mov ecx,0FFFFFFFFh
0000000000000009: E8 00 00 00 00 call test
000000000000000E: 89 44 24 20 mov dword ptr [rsp+20h],eax
0000000000000012: 66 B9 1C 00 mov cx,1Ch
0000000000000016: E8 00 00 00 00 call foobar
And as you can see, when setting up the call to foobar, only cx is overwritten.
If foobar is compiled with clang, then the zero extension added by clang means
the rest of the register, which contains garbage, could be used.
For example if foobar is:
unsigned foobar(unsigned short v) {
return v;
}
Compiled with clang -fomit-frame-pointer -O3 gives the following assembly:
foobar:
0000000000000000: 89 C8 mov eax,ecx
0000000000000002: C3 ret
And that function would return garbage because the 16 most significant bits of
ecx still contain garbage from the first call.
With this change, the code for that function is now:
foobar:
0000000000000000: 0F B7 C1 movzx eax,cx
0000000000000003: C3 ret
Reviewers: chapuni, rnk
Reviewed By: rnk
Subscribers: majnemer, cfe-commits
Differential Revision: http://reviews.llvm.org/D4380
llvm-svn: 216491
lowering of the intrinsics.
Prior to this commit, most of the copy-related intrinsics could be optimized
away. The situation is still not ideal as there are several possibilities to
lower a given intrinsic. Currently, we match LLVM behavior.
llvm-svn: 216474
feature is C11: nested struct declarations must have a
struct-declarator-list. Without this change, code
which was meant for C99 breaks. rdar://18125536
llvm-svn: 216469
Summary:
PR19838
When operator new[] is called and an array cookie is created
we want asan to detect buffer overflow bugs that touch the cookie.
For that we need to
a) poison the shadow for the array cookie (call __asan_poison_cxx_array_cookie).
b) ignore the legal accesses to the cookie generated by clang (add 'nosanitize' metadata)
Reviewers: timurrrr, samsonov, rsmith
Reviewed By: rsmith
Subscribers: cfe-commits
Differential Revision: http://reviews.llvm.org/D4774
llvm-svn: 216434
PowerPC uses the special PPC_FP128 type for long double on Linux, which is
composed of two 64-bit doubles. The higher-order double (which contains the
overall sign) comes first, and so the __builtin_signbitl implementation
requires special handling to extract the sign bit.
Fixes PR20691.
llvm-svn: 216341
Moreover, rework some patterns to actually check the emitted instructions
instead of matching unrelated strings!
E.g.,
some of the "// CHECK: vmov" were matching stuff like ".globl
funcname_with_vmov" instead of actual instructions.
llvm-svn: 216275
It fits better with LLVM's memory model to try to do this in the
backend. Specifically, narrowing wide loads in the backends should be
relatively straightforward and is generally valuable, whereas widening
loads tends to be very constrained.
Discussion here:
http://lists.cs.uiuc.edu/pipermail/cfe-commits/Week-of-Mon-20140811/112581.html
This reverts commit r215614.
llvm-svn: 215648
Currently when laying out bitfields that don't need any padding, we
represent them as a wide enough int to contain all of the bits. This
can be hard on the backend since we'll do things like represent stores
to a few bits as loading an i144, masking it with a large constant,
and storing it back.
This turns up in less pathological cases where we load and mask a 64 bit
word on a 32 bit platform when we actually only need to access 32 bits.
This leads to bad code being generated in most of our 32 bit backends.
In practice, there are often natural breaks in bitfields, and it's a
fairly simple and effective heuristic to split these fields into legal
integer sized chunks when it will be equivalent (ie, it won't force us
to add any extra padding).
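As a sketch of the heuristic (hypothetical struct), a run like this has a
natural break after 32 bits and can now be accessed as two legal i32 chunks
instead of one wide integer:
struct S {
  unsigned a : 16;
  unsigned b : 16;   // natural break: the first 32 bits end here
  unsigned c : 20;
  unsigned d : 12;
};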
llvm-svn: 215614
Similar approach to the set1 intrinsics is used: implement in terms of vector
initializers and then ensure with an LLVM test that a broadcast is generated
at the end.
Part of <rdar://problem/17688758>
llvm-svn: 215486
Summary:
This patch adds a runtime check verifying that functions
annotated with "returns_nonnull" attribute do in fact return nonnull pointers.
It is based on suggestion by Jakub Jelinek:
http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20140623/223693.html.
Test Plan: regression test suite
Reviewers: rsmith
Reviewed By: rsmith
Subscribers: cfe-commits
Differential Revision: http://reviews.llvm.org/D4849
llvm-svn: 215485
Due to the possible presence of return-by-out parameters, using the LLVM
argument number count when numbering debug info arguments can end up
off-by-one. This could produce two arguments with the same number, which
would in turn cause LLVM to emit only one of those arguments (whichever
it found last) or assert (r215157).
llvm-svn: 215227
Note that similar to palignr, we could further optimize these to emit
shufflevector when the shift count is <=64. This however does not
change the overall design that unlike palignr we would still need the LLVM
intrinsic corresponding to this instruction to handle the >64 cases. (palignr
uses the psrldq intrinsic in this case.)
llvm-svn: 214891
My original LE implementation of the vsldoi instruction, with its
altivec.h interfaces vec_sld and vec_vsldoi, produces incorrect
shufflevector operations in the LLVM IR. Correct code is generated
because the back end handles the incorrect shufflevector in a
consistent manner.
This patch and a companion patch for LLVM correct this problem by
removing the fixup from altivec.h and the corresponding fixup from the
PowerPC back end. Several test cases are also modified to reflect the
now-correct LLVM IR.
The vec_sums and vec_vsumsws interfaces in altivec.h are also fixed,
because they used vec_perm calls intended to be recognized as vsldoi
instructions. These vec_perm calls are now replaced with code that
more clearly shows the intent of the transformation.
llvm-svn: 214801
Instead of creating global variables for source locations and global names,
just create metadata nodes and strings. They will be transformed into actual
globals in the instrumentation pass (if necessary). This approach is more
flexible:
1) we don't have to ensure that our custom globals survive all the optimizations
2) if globals are discarded for some reason, we will simply ignore metadata for them
and won't have to erase corresponding globals
3) metadata for source locations can be reused for other purposes: e.g. we may
attach source location metadata to alloca instructions and provide better descriptions
for stack variables in ASan error reports.
No functionality change.
llvm-svn: 214604
These tests seem like an exception to the rule against assembly emitting
tests in clang. I made an LLVM side change that can only be tested by
setting up the inline assembly machinery that is only implemented by
Clang.
llvm-svn: 214552
It appears that the backend does not handle all cases that were handled by clang.
In particular, it does not handle structs as used in
SingleSource/UnitTests/2003-05-07-VarArgs.
llvm-svn: 214512
Summary:
This patch causes clang to emit va_arg instructions to the backend instead of
expanding them into an implementation itself. The backend already implements
va_arg since this is necessary for NaCl so this patch is removing redundant
code.
Together with the llvm patch (D4556) that accounts for the effect of endianness
on the expansion of va_arg, this fixes PR19612.
Depends on D4556
Reviewers: sstankovic, dsanders
Reviewed By: dsanders
Subscribers: rnk, cfe-commits
Differential Revision: http://reviews.llvm.org/D4742
llvm-svn: 214497
Note that it's not clear whether this is the right behavior, please see
the review for the discussion.
Reviewers: rnk
Subscribers: cfe-commits
Differential Revision: http://reviews.llvm.org/D4577
llvm-svn: 214401
or a class derived from T. We already supported this when initializing
_Atomic(T) from T for most (and maybe all) other reasonable values of T.
llvm-svn: 214390
(Dropped the byte and word variants from the patch. Turns out these are not
part of AVX512F but only AVX512BW/VL.)
Part of <rdar://problem/17688758>
llvm-svn: 214314
This broke the following gdb tests:
gdb.base__annota1.exp
gdb.base__consecutive.exp
gdb.python__py-symtab.exp
gdb.reverse__consecutive-precsave.exp
gdb.reverse__consecutive-reverse.exp
I will look into this.
This reverts commit 214162.
llvm-svn: 214163
This allows us to give more precise diagnostics.
Diego kindly tested the impact on debug info size: "The increase on average
debug sizes is 0.1%. The total file size increase is ~0%."
llvm-svn: 214162
While Clang now supports both ELFv1 and ELFv2 ABIs, their use is currently
hard-coded via the target triple: powerpc64-linux is always ELFv1, while
powerpc64le-linux is always ELFv2.
These are of course the most common scenarios, but in principle it is
possible to support the ELFv2 ABI on big-endian or the ELFv1 ABI on
little-endian systems (and GCC does support that), and there are some
special use cases for that (e.g. certain Linux kernel versions could
only be built using ELFv1 on LE).
This patch implements the Clang side of supporting this, based on the
LLVM commit 214072. The command line options -mabi=elfv1 or -mabi=elfv2
select the desired ABI if present. (If not, Clang uses the same default
rules as now.)
Specifically, the patch implements the following changes based on the
presence of the -mabi= option:
In the driver:
- Pass the appropriate -target-abi flag to the back-end
- Select the correct dynamic loader version (/lib64/ld64.so.[12])
In the preprocessor:
- Define _CALL_ELF to the appropriate value (1 or 2)
In the compiler back-end:
- Select the correct ABI in TargetInfo.cpp
- Select the desired ABI for LLVM via feature (elfv1/elfv2)
llvm-svn: 214074
Summary:
This patch extends the __asm parser to make it keep parsing input tokens
as inline assembly if a single-line __asm line is followed by another line
starting with __asm too. It also makes sure that we correctly keep
matching braces in such situations by separating the notions of how many
braces we are matching and whether we are in single-line asm block mode.
Reviewers: rnk
Subscribers: cfe-commits
Differential Revision: http://reviews.llvm.org/D4598
llvm-svn: 213916
arm64_be doesn't really exist; it was useful for testing while AArch64 and
ARM64 were separate, but now the only real way to refer to the system is
aarch64_be.
llvm-svn: 213747
In addition to enabling ELFv2 homogeneous aggregate handling,
LLVM support to pass array types directly also enables a performance
enhancement. We can now pass (non-homogeneous) aggregates that fit
fully in registers as direct integer arrays, using an element type
to encode the alignment requirement (that would otherwise go to the
"byval align" field).
This is preferable since "byval" forces the back-end to write the
aggregate out to the stack, even if it could be passed fully in
registers. This is particularly annoying on ELFv2, if there is
no parameter save area available, since we then need to allocate
space on the callee's stack just to hold those aggregates.
Note that to implement this optimization, this patch does not attempt
to fully anticipate register allocation rules as (defined in the
ABI and) implemented in the back-end. Instead, the patch is simply
passing *any* aggregate passed by value using the array mechanism
if its size is up to 64 bytes. This means that some of those will
end up being passed in stack slots anyway, but the generated code
shouldn't be any worse either. (*Large* aggregates remain passed
using "byval" to enable optimized copying via memcpy etc.)
llvm-svn: 213495
This patch implements clang support for the PowerPC ELFv2 ABI.
Together with a series of companion patches in LLVM, this makes
clang/LLVM fully usable on powerpc64le-linux.
Most of the ELFv2 ABI changes are fully implemented on the LLVM side.
On the clang side, we only need to implement some changes in how
aggregate types are passed by value. Specifically, we need to:
- pass (and return) "homogeneous" floating-point or vector aggregates in
FPRs and VRs (this is similar to the ARM homogeneous aggregate ABI)
- return aggregates of up to 16 bytes in one or two GPRs
The second piece is trivial to implement in any case. To implement
the first piece, this patch makes use of infrastructure recently
enabled in the LLVM PowerPC back-end to support passing array types
directly, where the array element type encodes properties needed to
handle homogeneous aggregates correctly.
Specifically, the array element type encodes:
- whether the parameter should be passed in FPRs, VRs, or just
GPRs/stack slots (for float / vector / integer element types,
respectively)
- what the alignment requirements of the parameter are when passed in
GPRs/stack slots (8 for float / 16 for vector / the element type
size for integer element types) -- this corresponds to the
"byval align" field
With this support in place, the clang part simply needs to *detect*
whether an aggregate type implements a float / vector homogeneous
aggregate as defined by the ELFv2 ABI, and if so, pass/return it
as array type using the appropriate float / vector element type.
llvm-svn: 213494
In C99, an array parameter declarator might have the form:
direct-declarator '[' 'static' type-qual-list[opt] assign-expr ']'
where the static keyword indicates that the caller will always provide a
pointer to the beginning of an array with at least the number of elements
specified by the assignment expression. For constant sizes, we can use the
new dereferenceable attribute to pass this information to the optimizer. For
VLAs, we don't know the size, but (for addrspace(0)) do know that the pointer
must be nonnull (and so we can use the nonnull attribute).
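For example (hypothetical signatures):
void sum8(double a[static 8]);          // constant size: the pointer parameter gets
                                        // dereferenceable(64) in the IR
void scale(int n, double a[static n]);  // VLA size: the pointer is only marked nonnull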
llvm-svn: 213444
r211898 introduced a regression where a large struct, which would
normally be passed ByVal, was causing padding to be inserted to
prevent the backend from using some GPRs, in order to follow the
AAPCS. However, the type of the argument was not being set correctly,
so the backend cannot align 8-byte aligned struct types on the stack.
The fix is to not insert the padding arguments when the argument is
being passed ByVal.
llvm-svn: 213359
1. Revert "Add default feature for CPUs on AArch64 target in Clang"
at r210625. Then, all enabled features will be passed explicitly by
-target-feature in the -cc1 option.
2. Get "-mfpu" deprecated.
3. Implement support of "-march". Usage is:
-march=armv8-a+[no]feature
For instance, "-march=armv8-a+neon+crc+nocrypto". Here "armv8-a" is
necessary, and CPU names are not acceptable. Candidate features are
fp, neon, crc and crypto. Where conflicting feature modifiers are
specified, the right-most feature is used.
4. Implement support of "-mtune". Usage is:
-mtune=CPU_NAME
For instance, "-mtune=cortex-a57". This option will ONLY enable
micro-architectural features specific to the target CPU,
like "+zcm" and "+zcz" for cyclone. Any architectural features
WON'T be modified.
5. Change usage of "-mcpu" to "-mcpu=CPU_NAME+[no]feature", which is
an alias to "-march={feature of CPU_NAME}+[no]feature" and
"-mtune=CPU_NAME" together. Where this option is used in conjunction
with -march or -mtune, those options take precedence over the
appropriate part of this option.
llvm-svn: 213353
This is used to mark the instructions emitted by Clang to implement a
variety of UBSan checks. Generally, we don't want to instrument these
instructions with other sanitizers (like ASan).
Reviewed in http://reviews.llvm.org/D4544
llvm-svn: 213291
Summary:
I'm planning on upstreaming some test cases for the inline assembly
usage in the Mozilla code base. A lot of these test cases test the
recent fixes to this code.
Reviewers: rnk
Subscribers: cfe-commits
Differential Revision: http://reviews.llvm.org/D4508
llvm-svn: 213255
Memory barrier __builtin_arm_[dmb, dsb, isb] intrinsics are required to
implement their corresponding ACLE and MSVC intrinsics.
This patch ports ARM dmb, dsb, isb intrinsic to AArch64.
Requires LLVM r213247.
Differential Revision: http://reviews.llvm.org/D4521
llvm-svn: 213250
Clang supports __assume, at least at the semantic level, when MS extensions are
enabled. Unfortunately, trying to actually compile code using __assume would
result in this error:
error: cannot compile this builtin function yet
__assume is an optimizer hint, and can be ignored at the IR level. Until LLVM
supports assumptions at the IR level, a noop lowering is valid, and that is
what is done here.
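So, under -fms-extensions, something like this (hypothetical snippet) now
compiles, with the hint simply dropped:
int div100(int x) {
  __assume(x >= 0);   // previously: "cannot compile this builtin function yet"
  return x / 100;
}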
llvm-svn: 213206
An array showing up in an inline assembly input is accepted in ICC and
GCC 4.8
This fixes PR20201.
Differential Revision: http://reviews.llvm.org/D4382
llvm-svn: 212954
This patch implements __builtin_arm_nop intrinsic for AArch32 and AArch64,
which generates hint 0x0, the alias of NOP instruction.
This intrinsic is necessary to implement ACLE __nop intrinsic.
Differential Revision: http://reviews.llvm.org/D4495
llvm-svn: 212947
Currently ASan instrumentation pass creates a string with global name
for each instrumented global (to include global names in the error report). Global
name is already mangled at this point, and we may not be able to demangle it
at runtime (e.g. there is no __cxa_demangle on Android).
Instead, create a string with fully qualified global name in Clang, and pass it
to ASan instrumentation pass in llvm.asan.globals metadata. If there is no metadata
for some global, ASan will use the original algorithm.
This fixes https://code.google.com/p/address-sanitizer/issues/detail?id=264.
llvm-svn: 212872
MSVC accepts __noop without any trailing parens and treats it like a
literal zero. We don't treat __noop as an integer literal, but now at
least we can parse a naked __noop expression.
Reviewers: rsmith
Differential Revision: http://reviews.llvm.org/D4476
llvm-svn: 212860
We now have an LLVM-level nonnull attribute that can be applied to function
parameters, and we emit it for reference types (as of r209723), but did not
emit it when an __attribute__((nonnull)) was provided. Now we will.
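For example (hypothetical declaration):
void copy4(void *dst, const void *src) __attribute__((nonnull(1, 2)));
// Both pointer parameters now carry the LLVM-level nonnull attribute, matching
// what we already emitted for C++ reference parameters.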
llvm-svn: 212835
Teach UBSan vptr checker to ignore technically invalid down-casts on
blacklisted types.
Based on http://reviews.llvm.org/D4407 by Byoungyoung Lee!
llvm-svn: 212770
This patch adds support for respecting the ABI and type alignment
of aggregates passed by value. Currently, all aggregates are aligned
at 8 bytes in the parameter save area. This is incorrect for two
reasons:
- Aggregates that need alignment of 16 bytes or more should be aligned
at 16 bytes in the parameter save area. This is implemented by
using an appropriate "byval align" attribute in the IR.
- Aggregates that need alignment beyond 16 bytes need to be dynamically
realigned by the caller. This is implemented by setting the Realign
flag of the ABIArgInfo::getIndirect call.
In addition, when expanding a va_arg call accessing a type that is
aligned at 16 bytes in the argument save area (either one of the
aggregate types as above, or a vector type which is already aligned
at 16 bytes), code needs to align the va_list pointer accordingly.
Reviewed by Hal Finkel.
llvm-svn: 212743
This patch adds support for passing arguments of non-Altivec vector type
(i.e. defined via attribute ((vector_size (...)))) on powerpc64-linux.
While such types are not mentioned in the formal ABI document, this
patch implements a calling convention compatible with GCC:
- Vectors of size < 16 bytes are passed in a GPR
- Vectors of size > 16 bytes are passed via reference
Note that vector types with a number of elements that is not a power
of 2 are not supported by GCC, so there is no pre-existing ABI to
follow. We choose to pass those (of size < 16) as if widened to the
next power of two, so they might end up in a vector register or
in a GPR. (Sizes > 16 are always passed via reference as well.)
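For instance (hypothetical types), under this convention:
typedef int v2si __attribute__((vector_size(8)));    // 8 bytes: passed in a GPR
typedef int v8si __attribute__((vector_size(32)));   // 32 bytes: passed via reference
v2si add2(v2si a, v2si b) { return a + b; }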
Reviewed by Hal Finkel.
llvm-svn: 212734
Summary:
While debugging another issue, I noticed that Mips currently specifies that the
count leading zero builtins are undefined when the input is zero. The
architecture specifications say that the clz and dclz instructions write 32 or
64 respectively when given zero.
This doesn't fix any bugs that I'm aware of but it may improve optimisation in
some cases.
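In other words (hypothetical snippet), on MIPS targets:
int leading_zeros(unsigned x) {
  return __builtin_clz(x);  // now considered well defined even for x == 0,
                            // where the clz instruction writes 32
}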
Differential Revision: http://reviews.llvm.org/D4431
llvm-svn: 212618
Having some kind of weird kernel-assisted ABI for these when the
native instructions are available appears to be (and should be) the
exception; OSs have been gradually opting in for years and the code
was getting silly.
So let LLVM decide whether it's possible/profitable to inline them by
default.
Patch by Phoebe Buckheister.
llvm-svn: 212598
Get rid of cached CodeGenModule::SanOpts, which was used to turn off
sanitizer codegen options if current LLVM Module is blacklisted, and use
plain LangOpts.Sanitize instead.
1) Some codegen decisions (turning TBAA or writable strings on/off)
shouldn't depend on the contents of blacklist.
2) llvm.asan.globals should *always* be created, even if the module
is blacklisted - soon Clang's CodeGen will be the only place where we read
sanitizer blacklist files, so we should properly report which globals are
blacklisted to the backend.
llvm-svn: 212499
This adds support for simple MSVC compatibility mode intrinsics. These
intrinsics are simple in that they are either directly passed through to the
annotated MSBuiltin intrinsic or they mirror existing GCC builtins.
llvm-svn: 212378
Summary:
Because a global created by GetOrCreateLLVMGlobal() is not finalised until later viz:
extern char a[];
char f(){ return a[5];}
char a[10];
Change MangledDeclNames to use a MapVector rather than a DenseMap so that the
Metadata is output in order of original declaration, so as to make the output
deterministic and improve human readability.
Differential Revision: http://reviews.llvm.org/D4176
llvm-svn: 212263
This corrects SVN r212196's naming change to use the proper prefix of
`__builtin_arm_` instead of `__builtin_`.
Thanks to Yi Kong for pointing out the incorrect naming!
llvm-svn: 212253
This extends the target builtin support to allow language specific annotations
(i.e. LANGBUILTIN). This is to allow MSVC compatibility whilst retaining the
ability to have EABI targets use a __builtin_ prefix. This is merely to allow
uniformity in the EABI case where the unprefixed name is provided as an alias in
the header.
llvm-svn: 212196
See https://code.google.com/p/address-sanitizer/issues/detail?id=299 for the
original feature request.
Introduce llvm.asan.globals metadata, which Clang (or any other frontend)
may use to report extra information about global variables to ASan
instrumentation pass in the backend. This metadata replaces
llvm.asan.dynamically_initialized_globals that was used to detect init-order
bugs. llvm.asan.globals contains the following data for each global:
1) source location (file/line/column info);
2) whether it is dynamically initialized;
3) whether it is blacklisted (shouldn't be instrumented).
Source location data is then emitted in the binary and can be picked up
by ASan runtime in case it needs to print error report involving some global.
For example:
0x... is located 4 bytes to the right of global variable 'C::array' defined in '/path/to/file:17:8' (0x...) of size 40
These source locations are printed even if the binary doesn't have any
debug info.
This is an ABI-breaking change. ASan initialization is renamed to
__asan_init_v4(). Pre-built libraries compiled with older Clang will not work
with the fresh runtime.
llvm-svn: 212188
ARMv8 adds (to both AArch32 and AArch64) acquiring and releasing
variants of the exclusive operations, in line with the C++11 memory
model.
This adds support for two new intrinsics to expose them to C & C++
developers directly: __builtin_arm_ldaex and __builtin_arm_stlex, in
direct analogy with the versions with no implicit barrier.
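A rough usage sketch (hypothetical spinlock; assumes the new builtins mirror the
__builtin_arm_ldrex/strex signatures):
void spin_lock(volatile int *lock) {
  int old;
  do {
    old = __builtin_arm_ldaex(lock);                  // load-acquire exclusive
  } while (old != 0 || __builtin_arm_stlex(1, lock)); // store-release exclusive;
                                                      // nonzero means the store failed
}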
rdar://problem/15885451
llvm-svn: 212175
The backend *can* cope with all of these now, so Clang should give it the
chance. On CPUs without cmpxchg16b (e.g. the original athlon64) LLVM can reform
the libcalls.
rdar://problem/13496295
llvm-svn: 212173
In 32b mode the reference count for block addresses
is not zero. This prevents inlining and constant
folding and causes the test to fail. Changing
the triple allows running the test in 64b mode.
The array in foo2 is now local instead of static until
at lower optimization levels the interprocedural constant
propagator is invoked before the global optimizer.
llvm-svn: 212092
llvm r212077 causes this test to fail. We need to reorder some passes and
possibly make other changes to reenable the optimization being tested here.
llvm-svn: 212091
This patch adds intrinsic __rdpmc to header file 'ia32intrin.h'.
Intrinsic __rdpmc can be used to read performance monitoring counters. It is
implemented as a direct call to __builtin_ia32_rdpmc.
It takes as input a value representing the index of the performance counter to
read. The value of the performance counter is then returned as an unsigned
64-bit quantity.
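Usage sketch (hypothetical):
#include <x86intrin.h>
unsigned long long read_counter0(void) {
  return __rdpmc(0);   // reads performance-monitoring counter 0 via __builtin_ia32_rdpmc
}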
llvm-svn: 212053
These don't actually require any registered backend to run.
This commit tests the water with a handful of fixes for what is a more
widespread problem.
llvm-svn: 212008
This corrects the handling for i686-windows-itanium. This environment is nearly
identical to Windows MSVC, except it uses the itanium ABI for C++.
llvm-svn: 211991
Summary: This patch introduces ACLE header file, implementing extensions that can be directly mapped to existing Clang intrinsics. It implements for both AArch32 and AArch64.
Reviewers: t.p.northover, compnerd, rengolin
Reviewed By: compnerd, rengolin
Subscribers: rnk, echristo, compnerd, aemerson, mroth, cfe-commits
Differential Revision: http://reviews.llvm.org/D4296
llvm-svn: 211962
This is a fix to the code in clang which inserts padding arguments to
ensure that the ARM backend can emit AAPCS-VFP compliant code. This code
needs to track the number of registers which have been allocated in order
to do this. When passing a very large struct (>64 bytes) by value, clang
emits IR which takes a pointer to the struct, but the backend converts this
back to passing the struct in registers and on the stack. The bug was that
this was being considered by clang to only use one register, meaning that
there were situations in which padding arguments were incorrectly emitted
by clang.
llvm-svn: 211898
The NEON intrinsics in arm_neon.h are designed to work on vectors
"as-if" loaded by (V)LDR. We load vectors "as-if" (V)LD1, so the
intrinsics are currently incorrect.
This patch adds big-endian versions of the intrinsics that does the
"obvious but dumb" thing of reversing all vector inputs and all
vector outputs. This will produce extra REVs, but we trust the
optimizer to remove them.
llvm-svn: 211893
[Clang part]
These patches rename the loop unrolling and loop vectorizer metadata
such that they have a common 'llvm.loop.' prefix. Metadata name
changes:
llvm.vectorizer.* => llvm.loop.vectorizer.*
llvm.loopunroll.* => llvm.loop.unroll.*
This was a suggestion from an earlier review
(http://reviews.llvm.org/D4090) which added the loop unrolling
metadata.
Patch by Mark Heffernan.
llvm-svn: 211712
The < 8 instead of <= 8 meant that a bunch of vreinterprets were not available on v8 AArch32. Simplify the guard to just !defined(__aarch64__) while we're at it, and enable some v8 AArch32 testing.
llvm-svn: 211686
The C++ language requires that the address of a function be the same
across all translation units. To make __declspec(dllimport) useful,
this means that a dllimported function must also obey this rule. MSVC
implements this by dynamically querying the import address table located
in the linked executable. This means that the address of such a
function in C++ is not constant (which violates other rules).
However, the C language has no notion of ODR nor does it permit dynamic
initialization whatsoever. This requires implementations to _not_
dynamically query the import address table and instead utilize a wrapper
function that will be synthesized by the linker which will eventually
query the import address table. The effect this has is, to say the
least, perplexing.
Consider the following C program:
__declspec(dllimport) void f(void);
typedef void (*fp)(void);
static const fp var = &f;
const fp fun() { return &f; }
int main() { return fun() == var; }
MSVC will statically initialize "var" with the address of the wrapper
function and "fun" returns the address of the actual imported function.
This means that "main" will return false!
Note that LLVM's optimizers are strong enough to figure out that "main"
should return true. However, this result is dependent on having
optimizations enabled!
N.B. This change also permits the usage of dllimport declarators inside
of template arguments; they are sufficiently constant for such a
purpose. Add tests to make sure we don't regress here.
llvm-svn: 211677
According to the x86-64 ABI, structures with both floating point and
integer members are split between floating-point and general purpose
registers, and consecutive 32-bit floats can be packed into a single
floating point register.
In the case of variadic functions these are stored to memory and the position
recorded in the va_list. This was already correctly implemented in
llvm.va_start.
The problem is that the code in clang for implementing va_arg was reading
floating point registers from the wrong location.
Patch by Thomas Jablin.
Fixes PR20018.
llvm-svn: 211626
When small arguments (structures < 8 bytes or "float") are passed in a
stack slot in the ppc64 SVR4 ABI, they must reside in the least
significant part of that slot. On BE, this means that an offset needs
to be added to the stack address of the parameter, but on LE, the least
significant part of the slot has the same address as the slot itself.
For the most part, this is handled in the LLVM back-end, where I just
fixed the LE case in commit r211368.
However, there is one piece of the clang front-end that is also aware of
these stack-slot offsets: PPC64_SVR4_ABIInfo::EmitVAArg. This patch
updates that routine to take endianness into account.
llvm-svn: 211370
Relax the tests to allow for differences between release and debug builds. This
should fix the buildbots.
Thanks to Benjamin Kramer and Eric Christopher for their invaluable tip that this
was a release-build-specific issue.
llvm-svn: 211227
Add support for _InterlockedCompareExchangePointer, _InterlockedExchangePointer,
_InterlockedExchange. These are available as a compiler intrinsic on ARM and x86.
These are used directly by the Windows SDK headers without use of the intrin
header.
llvm-svn: 211216
In the final phase of the merge, I managed to disable a bunch of Clang
tests accidentally. Fortunately none of them seem to have broken in
the interim.
llvm-svn: 211149
There comes a time in the life of any amateur code generator when dumb string
concatenation just won't cut it any more. For NeonEmitter.cpp, that time has
come.
There were a bunch of magic type codes which meant different things depending on
the context. There were a bunch of special cases that really had no reason to be
there but the whole thing was so creaky that removing them would cause something
weird to fall over. There was a 1000 line switch statement for code generation
involving string concatenation, which actually did lexical scoping to an extent
(!!) with a bunch of semi-repeated cases.
I tried to refactor this three times in three different ways without
success. The only way forward was to rewrite the entire thing. Luckily the
testing coverage on this stuff is absolutely massive, both with regression tests
and the "emperor" random test case generator.
The main change is that previously, in arm_neon.td a bunch of "Operation"s were
defined with special names. NeonEmitter.cpp knew about these Operations and
would emit code based on a huge switch. Actually this doesn't make much sense -
the type information was held as strings, so type checking was impossible. Also
TableGen's DAG type actually suits this sort of code generation very well
(surprising that...)
So now every operation is defined in terms of TableGen DAGs. There are a bunch
of operators to use, including "op" (a generic unary or binary operator), "call"
(to call other intrinsics) and "shuffle" (take a guess...). One of the main
advantages of this apart from making it more obvious what is going on, is that
we have proper type inference. This has two obvious advantages:
1) TableGen can error on bad intrinsic definitions easier, instead of just
generating wrong code.
2) Calls to other intrinsics are typechecked too. So
we no longer need to work out whether the thing we call needs to be the Q-lane
version or the D-lane version - TableGen knows that itself!
Here's an example: before:
case OpAbdl: {
std::string abd = MangleName("vabd", typestr, ClassS) + "(__a, __b)";
if (typestr[0] != 'U') {
// vabd results are always unsigned and must be zero-extended.
std::string utype = "U" + typestr.str();
s += "(" + TypeString(proto[0], typestr) + ")";
abd = "(" + TypeString('d', utype) + ")" + abd;
s += Extend(utype, abd) + ";";
} else {
s += Extend(typestr, abd) + ";";
}
break;
}
after:
def OP_ABDL : Op<(cast "R", (call "vmovl", (cast $p0, "U",
(call "vabd", $p0, $p1))))>;
As an example of what happens if you do something wrong now, here's what happens
if you make $p0 unsigned before the call to "vabd" - that is, $p0 -> (cast "U",
$p0):
arm_neon.td:574:1: error: No compatible intrinsic found - looking up intrinsic 'vabd(uint8x8_t, int8x8_t)'
Available overloads:
- float64x2_t vabdq_v(float64x2_t, float64x2_t)
- float64x1_t vabd_v(float64x1_t, float64x1_t)
- float64_t vabdd_f64(float64_t, float64_t)
- float32_t vabds_f32(float32_t, float32_t)
... snip ...
This makes it seriously easy to work out what you've done wrong in fairly nasty
intrinsics.
As part of this I've massively beefed up the documentation in arm_neon.td too.
Things still to do / on the radar:
- Testcase generation. This was implemented in the previous version and not in
the new one, because
- Autogenerated tests are not being run. The testcase in test/ differs from
the autogenerated version.
- There were a whole slew of special cases in the testcase generation that just
felt (and looked) like hacks.
If someone really feels strongly about this, I can try and reimplement it too.
- Big endian. That's coming soon and should be a very small diff on top of this one.
llvm-svn: 211101
Most builtins date from before the "cmpxchg weak" was a gleam in the
C++ committee's eye, so fortunately not much needs to change. But a
few of them *do* acknowledge that failure is possible.
For these, we'll emit the usual cartesian product of cmpxchg
operations if we can't statically determine weakness. CodeGen can
sort it out later if the function gets inlined.
The only other non-trivial aspect of this is (I think) that we emit
the scalar expression for "IsWeak" once, at the beginning, and
propagate its value through the successive blocks. There's not much in
it, but it's slightly more consistent with the existing handling of
FailureOrder.
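For instance (hypothetical wrapper), when the weak flag is not a compile-time
constant:
int try_set(int *p, int *expected, int desired, int is_weak) {
  // is_weak is evaluated once; both weak and strong cmpxchg are emitted and
  // selected by a branch on that value.
  return __atomic_compare_exchange_n(p, expected, desired, is_weak,
                                     __ATOMIC_SEQ_CST, __ATOMIC_RELAXED);
}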
llvm-svn: 210932
Init-order and use-after-return modes can currently be enabled
by runtime flags. use-after-scope mode is not really working at the
moment.
The only problem I see is that users won't be able to disable extra
instrumentation for init-order and use-after-scope by a top-level Clang flag.
But this instrumentation was implicitly enabled for quite a while and
we didn't hear from users hurt by it.
llvm-svn: 210924
This is a minimal fix for clang. I'll soon add support for generating
weak variants when requested, but that's not really necessary for the
LLVM change in isolation.
llvm-svn: 210907
The vec_sld and vec_vsldoi interfaces perform a left-shift on vector
arguments for both big and little endian. However, because they rely
on the vec_perm interface which is endian-dependent, the permutation
vector needs to be reversed for LE to get the proper shift direction.
I've added some extra testing for these interfaces for LE in the
builtins-ppc-altivec.c.
llvm-svn: 210657
Instructions from __nodebug__ functions don't have file:line
information even when inlined into non-nodebug functions. As a result,
intrinsics (SSE and other) from <*intrin.h> clang headers _never_
have file:line information.
With this change, an instruction without !dbg metadata gets one from
the call instruction when inlined.
Fixes PR19001.
llvm-svn: 210459
The PowerPC vsumsws instruction, accessed via vec_sums, is defined
architecturally with a big-endian bias, in that the second input vector
and the result always reference big-endian element 3 (little-endian
element 0). For ease of porting, the programmer wants element 3 in
both cases.
To provide this semantics, for little endian we generate a permute for
the second input vector prior to the vsumsws instruction, and generate
a permute for the result vector following the vsumsws instruction.
The correctness of this code is tested by the new sums.c test added in
a previous patch, as well as the modifications to
builtins-ppc-altivec.c in the present patch.
llvm-svn: 210449
This uncovered something strange. Diagnostics for InlineAsm have source locations
that don't really map to where they are within the .c source file.
llvm-svn: 210440
The PowerPC vector-unpack-high and vector-unpack-low instructions
are defined architecturally with a big-endian bias, in that the vector
element numbering is assumed to be "left to right" regardless of
whether the processor is in big-endian or little-endian mode. This
effectively reverses the meaning of "high" and "low." Such a
definition is unnatural for little-endian code generation.
To facilitate ease of porting, the vec_unpackh and vec_unpackl
interfaces are designed to use natural element ordering, so that
elements are numbered according to little-endian design principles
when code is generated for a little-endian target. The desired
semantics can be achieved by using the opposite instruction for
little-endian mode. That is, when a call to vec_unpackh appears in
the code, a vector-unpack-low is generated, and when a call to
vec_unpackl appears in the code, a vector-unpack-high is generated.
The correctness of this code is tested by the new unpack.c test
added in a previous patch, as well as the modifications to
builtins-ppc-altivec.c in the present patch.
Note that these interfaces were originally incorrectly implemented
when they take a vector pixel argument. This patch corrects this
implementation for both big- and little-endian code generation.
llvm-svn: 210391
Commit r210384 prematurely included changes to the little-endian
implementation of the vec_sum2s interface. This patch modifies
test/CodeGen/builtins-ppc-altivec.c to test those changes.
llvm-svn: 210389
The Altivec builtin test case test/CodeGen/builtins-ppc-altivec.c has
always been executed only for 32-bit PowerPC. These tests are equally
valid for 64-bit PowerPC. This patch updates the test to be run for
three targets: powerpc-unknown-unknown, powerpc64-unknown-unknown,
and powerpc64le-unknown-unknown. The expected code generation changes
for some of the Altivec builtins for little endian, so this patch adds
new CHECK-LE variants to the test for the powerpc64le target.
These tests satisfy the testing requirements for some previous patches
committed over the last couple of days for lib/Headers/altivec.h:
r210279 for vec_perm, r210337 for vec_mul[eo], and r210340 for
vec_pack.
llvm-svn: 210384
This patch adds support for pointer types in global named registers variables.
It'll be lowered as a pair of read/write_register and inttoptr/ptrtoint calls.
Also adds some early checks on types on SemaDecl to avoid the assert.
Tests changed accordingly. (PR19837)
llvm-svn: 210274
These intrinsics are special because they directly take a memory operand (AVX2
adds the register counterparts). Typically, other non-memop intrinsics take
registers and then it's left to isel to fold memory operands.
In order to LICM intrinsics directly reading memory, we require that no stores
are in the loop (LICM) or that the folded load accesses constant memory
(MachineLICM). When neither is the case we fail to hoist a loop-invariant
broadcast.
We can work around this limitation if we expose the load as a regular load and
then just implement the broadcast using the vector initializer syntax. This
exposes the load to LICM and other optimizations.
At the IR level this is translated into a series of insertelements. The
sequence is already recognized as a broadcast so there is no impact on the
quality of codegen.
_mm256_broadcast_pd and _mm256_broadcast_ps are not updated by this patch
because right now we lack the DAG-combiner smartness to recover the broadcast
instructions. This will be tackled in a follow-on.
There will be completing changes on the LLVM side to remove the LLVM
intrinsics and to auto-upgrade bitcode files.
Fixes <rdar://problem/16494520>
llvm-svn: 209846
Clang knows about the sanitizer blacklist and it makes no sense to
add global to the list of llvm.asan.dynamically_initialized_globals if it
will be blacklisted in the instrumentation pass anyway. Instead, we should
do as much blacklisting as possible (if not all) in the frontend.
llvm-svn: 209789
I opened a discussion on cfe-commits. Ideally we've got a few things
that need to happen. CompilerRT should probably have blacklists tests.
Asan should probably not depend on that specific field.
llvm-svn: 209766
That small change, although it looked harmless, made us emit the LValue
on the PHI node without the proper cast. Reverting it fixes PR19841.
llvm-svn: 209663
A few (mostly CodeGen) parts of Clang were tightly coupled to the
AArch64 backend. Now that it's gone, they will not even compile.
I've also deduplicated RUN lines in many of the AArch64 tests. This
might improve "make check-all" time noticably: some of those NEON
tests were monsters.
llvm-svn: 209578
I forgot to fix this one in r209145. We use these flags on dllimport tests
to make sure we emit code for available_externally functions and don't inline
the IR.
llvm-svn: 209564
Summary:
Previously, you could not specify the original file name when passing a preprocessed file into the compiler
Now you can use 'clang -Xclang -main-file-name -Xclang <original file name> ...'
Or 'clang -cc1 -main-file-name <original file name> ...'
llvm-svn: 209503
This is a testcase for r209227, a change in LLVM that automatically sets
visibility to default when the linkage is changed to local (rather than
asserting).
What this testcase triggers is hard to reproduce otherwise: the
`GlobalValue` is created (with non-local linkage), the visibility is set
to hidden, and then the linkage is set to local.
PR19760
llvm-svn: 209228
This is a GNU attribute that causes calls within the attributed function
to be inlined where possible. It is implemented by giving such calls the
alwaysinline attribute.
Differential Revision: http://reviews.llvm.org/D3816
llvm-svn: 209217
behavior on mismatch. The AutoUpgrader will drop incompatible debug info
anyway and also emit a warning diagnostic for it.
rdar://problem/16926122
llvm-svn: 209182
This is a GNU attribute that allows split stacks to be turned off on a
per-function basis.
Differential Revision: http://reviews.llvm.org/D3817
llvm-svn: 209167
This patch implements global named registers in Clang, lowering to the just
created intrinsics in LLVM (@llvm.read/write_register). A new type of LValue
had to be created (Register), which just adds support to carry the metadata
node containing the name of the register. Two new methods to emit loads and
stores interoperate with another to emit the named metadata node.
No guarantees are being made and only non-allocatable global variable named
registers are being supported. Local named register support is unchanged.
llvm-svn: 209149
When we were padding a struct to avoid splitting it between registers and
the stack, we were throwing away the type which the argument should be coerced
to.
llvm-svn: 209122