llvm-project

Commit Graph

Author	SHA1	Message	Date
Tom Stellard	d99fb956a3	R600: Use a Southern Islands GPU as the default for the amdgcn target llvm-svn: 227315	2015-01-28 15:38:44 +00:00
Craig Topper	335e218760	[X86] Add intrinsics for AVX512 128 and 256 bit integer comparison of word and byte vectors. llvm-svn: 227186	2015-01-27 09:16:29 +00:00
Pete Cooper	f051cbf631	Don't generate llvm.expect intrinsics with -O0. The backend won't run LowerExpect on -O0. In a debug LTO build, this results in llvm.expect intrinsics being in the LTO IR which doesn't know how to optimize them. Thanks to Chandler for the suggestion and review. Differential revision: http://reviews.llvm.org/D7183 llvm-svn: 227135	2015-01-26 20:51:58 +00:00
Craig Topper	b4789096c0	[X86] Add AVX512 integer comparison intrinsics for word and byte vectors. llvm-svn: 227079	2015-01-26 09:24:10 +00:00
Craig Topper	2f25a5a875	[X86] Add more of the AVX512 integer comparison intrinsics. This adds 128 and 256 bit vectors of dwords and qwords. llvm-svn: 227075	2015-01-26 08:11:49 +00:00
Craig Topper	4cac1c2318	[X86] Add AVX512F integer comparison intrinsics to header file. llvm-svn: 227067	2015-01-25 23:30:07 +00:00
Justin Bogner	9fa8c9984c	test: Convert some tests to FileCheck These were all doing trivial greps. It's better to use FileCheck. llvm-svn: 227007	2015-01-24 17:39:36 +00:00
Justin Bogner	2af264a44a	test: Remove two redundant lines from this test The FileCheck already checks for these lines, no need to grep as well. llvm-svn: 227006	2015-01-24 17:39:32 +00:00
Sanjay Patel	76c9e0986c	Process the -fno-signed-zeros optimization flag (PR20870) The driver currently accepts but ignores the -fno-signed-zeros flag. This patch passes the flag through and enables 'nsz' fast-math-flag generation in IR. The existing OpenCL flag for the same functionality is made into an alias here. It may be removed in a subsequent patch. This should resolve bug 20870 ( http://llvm.org/bugs/show_bug.cgi?id=20870 ); patches for the optimizer were checked in at: http://llvm.org/viewvc/llvm-project?view=revision&revision=225050 http://llvm.org/viewvc/llvm-project?view=revision&revision=224583 Differential Revision: http://reviews.llvm.org/D6873 llvm-svn: 226915	2015-01-23 16:40:50 +00:00
Reid Kleckner	2a2e156318	SEH: Emit the constant filter 1 as a catch-all Minor optimization of code like __try { ... } __except(1) { ... }. llvm-svn: 226766	2015-01-22 02:25:56 +00:00
Reid Kleckner	1d59f99f5c	Initial support for Win64 SEH IR emission The lowering looks a lot like normal EH lowering, with the exception that the exceptions are caught by executing filter expression code instead of matching typeinfo globals. The filter expressions are outlined into functions which are used in landingpad clauses where typeinfo would normally go. Major aspects that still need work: - Non-call exceptions in __try bodies won't work yet. The plan is to outline the __try block in the frontend to keep things simple. - Filter expressions cannot use local variables until capturing is implemented. - __finally blocks will not run after exceptions. Fixing this requires work in the LLVM SEH preparation pass. The IR lowering looks like this: // C code: bool safe_div(int n, int d, int *r) { __try { *r = normal_div(n, d); } __except(_exception_code() == EXCEPTION_INT_DIVIDE_BY_ZERO) { return false; } return true; } ; LLVM IR: define i32 @filter(i8* %e, i8* %fp) { %ehptrs = bitcast i8* %e to i32** %ehrec = load i32** %ehptrs %code = load i32* %ehrec %matches = icmp eq i32 %code, i32 u0xC0000094 %matches.i32 = zext i1 %matches to i32 ret i32 %matches.i32 } define i1 zeroext @safe_div(i32 %n, i32 %d, i32* %r) { %rr = invoke i32 @normal_div(i32 %n, i32 %d) to label %normal unwind to label %lpad normal: store i32 %rr, i32* %r ret i1 1 lpad: %ehvals = landingpad {i8*, i32} personality i32 (...)* @__C_specific_handler catch i8* bitcast (i32 (i8*, i8*)* @filter to i8*) %ehptr = extractvalue {i8*, i32} %ehvals, i32 0 %sel = extractvalue {i8*, i32} %ehvals, i32 1 %filter_sel = call i32 @llvm.eh.seh.typeid.for(i8* bitcast (i32 (i8*, i8*)* @filter to i8*)) %matches = icmp eq i32 %sel, %filter_sel br i1 %matches, label %eh.except, label %eh.resume eh.except: ret i1 false eh.resume: resume } Reviewers: rjmccall, rsmith, majnemer Differential Revision: http://reviews.llvm.org/D5607 llvm-svn: 226760	2015-01-22 01:36:17 +00:00
Rafael Espindola	e855c2ae0a	Revert "Try to fix -Asserts build bots." This reverts commit r226758. Looks like rnk's 226757 fixed the real issue. Sorry for the noise. llvm-svn: 226759	2015-01-22 01:33:41 +00:00
Rafael Espindola	b88c11281c	Try to fix -Asserts build bots. llvm-svn: 226758	2015-01-22 01:26:39 +00:00
Reid Kleckner	395dad8213	Give the block inlining test a triple to determinise output It fails on Windows due to another temporary being emitted first, so the LLVM internal renaming scheme gives out the name __block_descriptor_tmp1. llvm-svn: 226757	2015-01-22 01:19:19 +00:00
Rafael Espindola	e5df59ff78	Emit DeferredDeclsToEmit in a DFS order. Currently we emit DeferredDeclsToEmit in reverse order. This patch changes that. The advantages of the change are that * The output order is a bit closer to the source order. The change to test/CodeGenCXX/pod-member-memcpys.cpp is a good example. * If we decide to defer more, it will not cause as large changes in the testcases as it would without this patch. llvm-svn: 226751	2015-01-22 00:24:57 +00:00
Adam Nemet	f893edeaea	[AVX512] Add sub-vector FP extracts Analogous to AVX2, these need to be implemented as macros to properly propagate the immediate index operand. Part of <rdar://problem/17688758> llvm-svn: 226496	2015-01-19 20:12:05 +00:00
Adrian Prantl	7c6f944cdf	Migrate all uses of DIVariable's FlagIndirectVariable to use a DIExpression with a DW_OP_deref instead. llvm-svn: 226474	2015-01-19 17:51:58 +00:00
Craig Topper	f557b09f14	[x86] Mark that the AVX-512 cmpps/cmppd builtins need an ICE for the comparison immediate. This requires converting to a macro in the header file. llvm-svn: 226421	2015-01-19 01:18:19 +00:00
Eric Christopher	49b425d9d2	Remove pathname dependence. Also rewrite test to use FileCheck at the same time. Patch by David Callahan. llvm-svn: 226331	2015-01-16 22:03:52 +00:00
Rafael Espindola	d9b26d563a	Add comdat to string literal variables on COFF. llvm-svn: 226317	2015-01-16 20:32:35 +00:00
Adam Nemet	c0cff244fc	[AVX512] Add intrinsics for masked aligned FP loads and stores Part of <rdar://problem/17688758> llvm-svn: 226298	2015-01-16 18:51:50 +00:00
Adam Nemet	d47dec4690	Fix typo in r225922. llvm-svn: 226297	2015-01-16 18:51:46 +00:00
Duncan P. N. Exon Smith	8d3ef611ce	IR: Move MDLocation into place (clang testcases) Update testcases to match LLVM change in r226048. llvm-svn: 226049	2015-01-14 22:28:03 +00:00
Daniel Sanders	998c910262	[mips] Handle transparent unions correctly. Summary: This fixes MultiSource/Applications/lemon on big-endian N32 by correcting the handling of the argument to wait(). glibc defines it as a transparent union of void* and int. Such unions are passed according to the rules of the first member so the argument must be passed as if it were a void* (sign extended from i32 to i64) and not as a union (shifted to the upper bits of an i64). wait() already behaves correctly on big-endian O32 and N64 since the union is already the same size as an argument slot. Reviewers: atanasyan Reviewed By: atanasyan Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D6963 llvm-svn: 225981	2015-01-14 12:00:12 +00:00
Adam Nemet	63a951eb1c	[AVX512] Add FP unpack intrinsics These are implemented with __builtin_shufflevector just like AVX. We have some tests on the LLVM side to assert that these shufflevectors do indeed generate the corresponding unpck instruction. Part of <rdar://problem/17688758> llvm-svn: 225922	2015-01-14 01:31:17 +00:00
Daniel Sanders	cdcb580d4e	[mips] Fix va_arg() for pointer types on big-endian N32. Summary: The Mips ABIs treat pointers in the same way as integers. They are sign-extended to 32-bit for O32, and 64-bit for N32/N64. This doesn't matter for O32 and N64 where pointers are already the correct width but it does matter for big-endian N32, where pointers are 32-bit and need promoting. The caller side is already passing pointers correctly. This patch corrects the callee. Reviewers: vmedic, atanasyan Reviewed By: atanasyan Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D6812 llvm-svn: 225782	2015-01-13 10:47:00 +00:00
Alexey Samsonov	8845952b54	Reimplement -fsanitize-recover family of flags. Introduce the following -fsanitize-recover flags: - -fsanitize-recover=<list>: Enable recovery for selected checks or group of checks. It is forbidden to explicitly list unrecoverable sanitizers here (that is, "address", "unreachable", "return"). - -fno-sanitize-recover=<list>: Disable recovery for selected checks or group of checks. - -f(no-)?sanitize-recover is now a synonym for -f(no-)?sanitize-recover=undefined,integer and will soon be deprecated. These flags are parsed left to right, and mask of "recoverable" sanitizer is updated accordingly, much like what we do for -fsanitize= flags. -fsanitize= and -fsanitize-recover= flag families are independent. CodeGen change: If there is a single UBSan handler function, responsible for implementing multiple checks, which have different recoverable setting, then we emit two handler calls instead of one: the first one for the set of "unrecoverable" checks, another one - for set of "recoverable" checks. If all checks implemented by a handler have the same recoverability setting, then the generated code will be the same. llvm-svn: 225719	2015-01-12 22:39:12 +00:00
Rafael Espindola	0d4fb98504	[patch][pr19848] Produce explicit comdats in clang. The llvm IR until recently had no support for comdats. This was a problem when targeting C++ on ELF/COFF as just using weak linkage would cause quite a bit of dead bits to remain on the executable (unless -ffunction-sections, -fdata-sections and --gc-sections were used). To fix the problem, llvm's codegen will just assume that any weak or linkonce that is not in an explicit comdat should be output in one with the same name as the global. This unfortunately breaks cases like pr19848 where a weak symbol is not expected to be part of any comdat. Now that we have explicit comdats in the IR, we can finally get both cases right. This first patch just makes clang give explicit comdats to GlobalValues where it is allowed to. A followup patch to llvm will then stop implicitly producing comdats. llvm-svn: 225705	2015-01-12 22:13:53 +00:00
David Majnemer	f1fdf4a80c	CodeGen: Simplify consecutive '%' modifiers To LLVM, consecutive '%' modifiers are redundant; skip them. llvm-svn: 225602	2015-01-11 09:13:56 +00:00
David Majnemer	14d4e7bdbf	CodeGen: Simplify consecutive '&' modifiers To LLVM, consecutive '&' modifiers are redundant; skip them. llvm-svn: 225601	2015-01-11 09:09:01 +00:00
Duncan P. N. Exon Smith	4bbe428cc5	IR: Add 'distinct' MDNodes to bitcode and assembly (clang) Update testcases for LLVM change in r225474 to make `MDNode`s explicitly distinct (when they aren't uniqued). Part of PR22111. llvm-svn: 225475	2015-01-08 22:39:28 +00:00
Tom Stellard	d8e38a3206	R600: Handle amdgcn triple For now there is no difference between amdgcn and r600. llvm-svn: 225294	2015-01-06 20:34:47 +00:00
David Blaikie	b9a23c9155	DebugInfo: Provide a less subtle way to set the debug location of simple ret instructions un-XFAILing the test XFAIL'd in r225086 after it regressed in r225083. llvm-svn: 225090	2015-01-02 22:07:26 +00:00
David Blaikie	5e9e13f54a	Temporarily XFAIL fallout from r225083 while investigating. Between this behavior and that fixed by r225083/r225000, I'll take the latter over the former for now, but I'm immediately working on understanding/addressing this behavior too. (the fact that the code change in r225083 caused this change in behavior is a bit troubling anyway - given that it looks & claims to be just a performance thing) llvm-svn: 225086	2015-01-02 19:49:28 +00:00
Craig Topper	2094d8fe88	[x86] Add the (v)cmpps/pd/ss/sd builtins to match gcc. Use them in the sse intrinsic files. These still lower to the same intrinsics as before. This is preparation for bounds checking the immediate on the avx version of the builtin so we don't pass illegal immediates into the backend. Since SSE uses a smaller size immediate it's not possible to bounds check when using a shared builtin. Rather than creating a clang specific builtin for the different immediate, I decided (after consulting with Chandler) that it was better to match gcc. llvm-svn: 224879	2014-12-27 06:59:57 +00:00
David Majnemer	fd4f63ad4b	Adjust the rest of the tests due to r224849. llvm-svn: 224865	2014-12-26 18:45:57 +00:00
David Majnemer	ca7e485c3f	Update tests due to r224849 Inferring nuw caused some clang tests to change their output. llvm-svn: 224851	2014-12-26 10:29:40 +00:00
Nico Weber	4f477fbe57	Add a triple to try and get this test passing on the ARM bots. llvm-svn: 224747	2014-12-23 01:07:10 +00:00
Nico Weber	08ef80f4b8	Rename test.cc files to test.cpp. The lit.cfg files only add .cpp to suffixes, so these tests used to never run, oops. (Also tweak two of these tests in minor ways to make them actually pass.) llvm-svn: 224718	2014-12-22 18:13:07 +00:00
Alexey Bataev	7cb1789011	Fix for PR21915: assert on multidimensional VLA in function arguments. Fixed assertion on type checking for arguments and parameters on function call if arguments are pointers to VLA Differential Revision: http://reviews.llvm.org/D6655 llvm-svn: 224504	2014-12-18 06:54:53 +00:00
Eric Christopher	560cc4fb44	Make sure that arm-linux-gnu is still the apcs-gnu ABI when we use clang -cc1 matching the front end and backend. Fix up a couple of tests that were testing aapcs for arm-linux-gnu. The test that removes the aapcs abi calling convention removes them because the default triple matches what the backend uses for the calling convention there and so it doesn't need to be explicitly stated - see the code in TargetInfo.cpp. llvm-svn: 224491	2014-12-18 02:08:55 +00:00
Saleem Abdulrasool	86b881c63e	CodeGen: implement __emit intrinsic For MSVC compatibility, add the `__emit' builtin. This is used in the Windows SDK headers, and must therefore be implemented as a builtin rather than an intrinsic. The `__emit' builtin provides a mechanism to emit a 16-bit opcode instruction into the stream. The value must be a compile time constant expression. No guarantees are made about the CPU and memory states after the execution of the instruction. Due to the unchecked nature of the builtin, only support this on Windows on ARM. llvm-svn: 224438	2014-12-17 17:52:30 +00:00
Toma Tabacu	9941195a9f	[mips] Always clobber $1 for MIPS inline asm. Summary: Because GCC doesn't use $1 for code generation, inline assembly code can use $1 without having to add it to the clobbers list. LLVM, on the other hand, does not shy away from using $1, and this can cause conflicts with inline assembly which assumes GCC-like code generation. A solution to this problem is to make Clang automatically clobber $1 for all MIPS inline assembly. This is not the optimal solution, but it seems like a necessary compromise, for now. Reviewers: dsanders Reviewed By: dsanders Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D6638 llvm-svn: 224428	2014-12-17 12:02:58 +00:00
Adrian Prantl	98f1f27810	Update this testcase for the new metadata assembler syntax. llvm-svn: 224262	2014-12-15 19:25:33 +00:00
Duncan P. N. Exon Smith	b3a66691f8	IR: Make metadata typeless in assembly, clang side Match LLVM changes from r224257. llvm-svn: 224259	2014-12-15 19:10:08 +00:00
Alexey Bataev	a47ae907e8	Fixed test/CodeGen/atomic_ops.c for compatibility with hexagon target llvm-svn: 224231	2014-12-15 06:12:42 +00:00
Alexey Bataev	452d8e1133	Bugfix for Codegen of atomic load/store/other ops. Currently clang fires assertions on x86-64 on any atomic operations for long double operands. Patch fixes codegen for such operations. Differential Revision: http://reviews.llvm.org/D6499 llvm-svn: 224230	2014-12-15 05:25:25 +00:00
David Majnemer	ee8d04d8dd	CodeGen: Loads/stores to allocas for atomic ops shouldn't be volatile Don't inherit the volatile-ness of the input pointer to the volatile operation for memory allocated on the side. This fixes PR17306. llvm-svn: 224110	2014-12-12 08:16:09 +00:00
Paul Robinson	0855695159	Instead of having -Os/-Oz add OptimizeForSize/MinSize first, and later having OptimizeNone remove them again, just don't add them in the first place if the function already has OptimizeNone. Note that MinSize can still appear due to attributes on different declarations; a future patch will address that. llvm-svn: 224047	2014-12-11 20:14:04 +00:00
Paul Robinson	aae2fba540	Diagnose attributes 'optnone' and 'minsize' on the same declaration. Eventually we'll diagnose them on different declarations, but let's get this part out of the way first. llvm-svn: 223985	2014-12-10 23:34:36 +00:00
Paul Robinson	621b6d3bf7	Revert r223980 as it had wrong commit message. llvm-svn: 223984	2014-12-10 23:32:57 +00:00
Paul Robinson	2936851426	Rename a couple of preprocessor symbols to be more descriptive. NFC. Review feedback from recent changes to GetSVN.cmake. llvm-svn: 223980	2014-12-10 23:12:37 +00:00
Kostya Serebryany	597dcc7a8d	No memcpy for copy ctor with -fsanitize-address-field-padding=1 Summary: When -fsanitize-address-field-padding=1 is present don't emit memcpy for copy constructor. Thanks Nico for the extra test case. Test Plan: regression tests Reviewers: thakis, rsmith Reviewed By: rsmith Subscribers: rsmith, cfe-commits Differential Revision: http://reviews.llvm.org/D6515 llvm-svn: 223563	2014-12-06 01:23:08 +00:00
Eric Christopher	0e2618857c	Have the driver and the target code agree on what the default ABI is for each machine. Fix up darwin tests that were testing for aapcs on armv7-ios when the actual ABI is apcs. Should be no user visible change without -cc1. llvm-svn: 223429	2014-12-05 01:06:59 +00:00
Reid Kleckner	2c8d86ca05	Add test for __umulh llvm-svn: 223319	2014-12-03 23:52:26 +00:00
Anton Korobeynikov	d90dd7977e	Fix invalid calling convention used for libcalls on ARM. ARM ABI specifies that all the libcalls use soft FP ABI (even hard FP binaries). These days clang emits __mulsc3 / __muldc3 calls with default (C) calling convention which would be translated into AAPCS_VFP LLVM calling and thus the result of complex multiplication will be bogus. Introduce a way for a target to specify explicitly calling convention for libcalls. Right now this is a temporary correctness fix. Ultimately, we'll end with intrinsic for complex multiplication and all calling convention decisions for libcalls will be put into backend. llvm-svn: 223123	2014-12-02 16:04:58 +00:00
Justin Holewinski	6e9bfa344c	[NVPTX] Fix type error for some builtins in BuiltinsNVPTX.def llvm-svn: 223116	2014-12-02 12:58:24 +00:00
Tim Northover	b047bfae32	AArch64: simplify PCS mapping. Now that LLVM can count the registers needed to implement AAPCS rules, we don't need to duplicate that logic here. This means we can drop the explicit padding and also use more natural types in many cases (e.g. "struct { float arr[3]; }" used to end up as "[2 x double]" to avoid holes on the stack). The one wrinkle is that AAPCS va_arg was also using the register counting machinery. But the local replacement isn't too bad. llvm-svn: 222904	2014-11-27 21:02:49 +00:00
David Majnemer	659be55daa	CodeGen: Fix emission of __atomic_compare_exchange We (wrongly) discarded the return value of the call. llvm-svn: 222798	2014-11-25 23:44:32 +00:00
Chandler Carruth	cc75b75b9d	Update Clang tests that run the LLVM optimizer to reflect the changed canonicalization in r222748. No interesting functionality changed here. llvm-svn: 222749	2014-11-25 10:10:37 +00:00
Tim Northover	bdcc1ed66d	testing: make test use FileCheck The "grep internal \| count" was fragile when your source or remote paths could contain the word "internal". llvm-svn: 222685	2014-11-24 21:03:34 +00:00
Paul Robinson	4ece682586	Correctly remove OptimizeForSize from functions marked OptimizeNone. This allows using __attribute__((optnone)) and the -Os/-Oz options. Fixes PR21604. llvm-svn: 222683	2014-11-24 20:51:42 +00:00
Saleem Abdulrasool	aca550fdb5	CodeGen: make i686-windows-itanium more similar to msvc The itanium environment follows the system calling convention for structures. Pass small aggregates via registers. llvm-svn: 222680	2014-11-24 20:14:29 +00:00
Saleem Abdulrasool	ec5c624550	CodeGen: tweak struct ABI handling Cygwin and MinGW fail to conform to the underlying system's structure passing ABI. Make the check more precise to ensure that we correctly generate code for the itanium environment. llvm-svn: 222626	2014-11-23 02:16:24 +00:00
David Majnemer	d8cd8f7b6e	CodeGen: Make atomic operations play nice with address spaces We were being a little sloppy with our pointer/address space casts. This fixes PR21643. llvm-svn: 222615	2014-11-22 10:44:12 +00:00
Alexey Samsonov	cfb97aa620	Remove support for undocumented SpecialCaseList entries. "global-init", "global-init-src" and "global-init-type" were originally used to blacklist entities in ASan init-order checker. However, they were never documented, and later were replaced by "=init" category. Old blacklist entries should be converted as follows: * global-init:foo -> global:foo=init * global-init-src:bar -> src:bar=init * global-init-type:baz -> type:baz=init llvm-svn: 222401	2014-11-20 01:27:19 +00:00
Chad Rosier	36577d037f	Revert "[Reassociate] Update test cases due to r222142." This reverts commit r222144. Commit r222142 is being reverted due to a spec2006/gcc execution-time regression. Update mips-varargs test as well. llvm-svn: 222397	2014-11-19 23:20:35 +00:00
Daniel Sanders	59229dcb29	Allow EmitVAArg() to promote types and use this to fix some N32/N64 vararg issues for Mips. Summary: With this patch, passing a va_list to another function and reading 10 int's from it works correctly on a big-endian target. Based on a pair of patches by David Chisnall, one of which I've reworked for the current trunk. Reviewers: theraven, atanasyan Reviewed By: theraven, atanasyan Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D6248 llvm-svn: 222339	2014-11-19 10:01:35 +00:00
Eric Christopher	8be702dbaa	This test also requires an aarch64 target. llvm-svn: 222268	2014-11-18 22:36:11 +00:00
Justin Hibbits	90ca05e5e5	Add PIC-level support to Clang. Summary: This distinguishes between -fpic and -fPIC now, with the additions in LLVM for PIC level support. Test Plan: No regressions Reviewers: echristo, rafael Reviewed By: rafael Subscribers: rnk, emaste, llvm-commits Differential Revision: http://reviews.llvm.org/D5400 llvm-svn: 222227	2014-11-18 06:17:20 +00:00
Chad Rosier	0837f63fd2	[Reassociate] Update test cases due to r222142. llvm-svn: 222144	2014-11-17 16:34:47 +00:00
Reid Kleckner	b1be683074	Fix IRGen for passing transparent unions We have had a test for this for a long time with a FIXME saying what we should be doing. This just does it. Fixes PR21573. llvm-svn: 222074	2014-11-15 01:41:41 +00:00
Fariborz Jahanian	68e7938361	This patch fixes a couple of bugs for predefined expressions used inside blocks. It fixes a crash in naming code for __func__ etc. when used in a block declared globally. It also brings back the old naming convention for predefined expressions, which was broken. rdar://18961148 llvm-svn: 222065	2014-11-14 23:55:27 +00:00
Anton Korobeynikov	5f951ee8bd	Recommit r222044 with a test fix - it does not make sense to hunt for a typedef before arithmetic conversion in all rare corner cases. llvm-svn: 222049	2014-11-14 22:09:15 +00:00
Anton Korobeynikov	50fc68f2d9	Again revert r222044 to resolve darwin objc test fails. llvm-svn: 222047	2014-11-14 21:54:46 +00:00
Anton Korobeynikov	dc12b367bc	Follow-up to D6217 Summary: Ok, here is somewhat addition to D6217 aiming to preserve old darwin behavior wrt the typedefed types. The actual change to SemaChecking turned out to be pretty gross, in particular: 1. We need to extract the typedef'ed type for proper diagnostics 2. We need to walk over paren expressions as well Reviewers: chandlerc, rsmith Reviewed By: rsmith Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D6256 llvm-svn: 222044	2014-11-14 21:41:07 +00:00
Bill Schmidt	8ff672d397	[PowerPC] Enable vec_perm for long long and double vector types for VSX VSX makes the "vector long long" and "vector double" types available. This patch enables the vec_perm interface for these types. The same builtin is generated regardless of the specified type, so no additional work or testing is needed in the back end. Tests are added to ensure this builtin is generated by the front end. llvm-svn: 221988	2014-11-14 13:10:13 +00:00
Bill Schmidt	cee13a2712	[PowerPC] Add VSX builtins for vec_div This patch adds builtin support for xvdivdp and xvdivsp, along with a new test case. The builtins are accessed using vec_div in altivec.h. Builtins are listed (mostly) alphabetically there, so inserting these changed the line numbers for deprecation warnings tested in test/Headers/altivec-intrin.c. There is a companion patch for LLVM. llvm-svn: 221984	2014-11-14 12:10:51 +00:00
Anton Korobeynikov	50a3cbd7c0	Temporary revert r221818 until all the problems with objc stuff will be resolved. llvm-svn: 221829	2014-11-12 23:15:38 +00:00
Anton Korobeynikov	0140aa8756	Fix fallout from r219557 Summary: Consider the following nifty 1 liner: (0 ? csqrtl(2.0f) : sqrtl(2.0f)). One can easily obtain such code from e.g. tgmath. Right now it produces an assertion because we fail to do the promotion real => _Complex real. The case was properly handled previously (old handleOtherComplexFloatConversion routine), but was forgotten in the current version. This seems to be about fallout from r219557 Reviewers: chandlerc, rsmith Reviewed By: rsmith Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D6217 llvm-svn: 221821	2014-11-12 22:19:06 +00:00
Bill Schmidt	9ec8cea02b	[PowerPC] Add vec_vsx_ld and vec_vsx_st intrinsics This patch enables the vec_vsx_ld and vec_vsx_st intrinsics for PowerPC, which provide programmer access to the lxvd2x, lxvw4x, stxvd2x, and stxvw4x instructions. New code in altivec.h defines these in terms of new builtins, which are themselves defined in BuiltinsPPC.def. The builtins are converted to LLVM intrinsics in CGBuiltin.cpp. Additional code is added to builtins-ppc-vsx.c to verify the correct generation of the intrinsics. Note that I moved the other VSX builtins so all VSX builtins will be alphabetical in their own section in BuiltinsPPC.def. There is a companion patch for LLVM. llvm-svn: 221768	2014-11-12 04:19:56 +00:00
Kostya Serebryany	4133eabb45	[clang/asan] Do not emit memcpy for trivial operator= when -fsanitize-address-field-padding >= 1 Summary: If we've added poisoned paddings to a type do not emit memcpy for operator=. Test Plan: regression tests. Reviewers: majnemer, rsmith Reviewed By: rsmith Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D6160 llvm-svn: 221739	2014-11-11 23:38:13 +00:00
Alexey Samsonov	e396bfc064	Bundle conditions checked by UBSan with sanitizer kinds they implement. Summary: This change makes CodeGenFunction::EmitCheck() take several conditions that need to be checked (all of them need to be true), together with the sanitizer kinds these checks are for. This would allow splitting one call into the UBSan runtime into several calls in case different sanitizer kinds have different recoverability settings. Tests should be fixed accordingly, I'm working on it. Test Plan: regression test suite. Reviewers: rsmith Reviewed By: rsmith Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D6219 llvm-svn: 221716	2014-11-11 22:03:54 +00:00
Alexey Samsonov	cfc9d3fe17	Simplify the test by using multiple --check-prefix arguments llvm-svn: 221713	2014-11-11 21:50:44 +00:00
Tim Northover	5a1558ec31	ARM ABI: simplify decisions on whether args can be expanded. Homogeneous aggregates on AAPCS_VFP ARM need to be passed without being flattened (e.g. [2 x float] rather than "float, float") for various weird ABI reasons. However, this isn't the case for anything else; further, we know at the ABIArgInfo::getDirect callsites whether this flattening is allowed. So, we can get more unified ARM code, with a simpler Clang, by just using that knowledge directly. llvm-svn: 221559	2014-11-07 22:30:50 +00:00
Roman Divacky	5cd8df6d1d	Since the file has both ppc and ppc64 tests in it rename it. llvm-svn: 221285	2014-11-04 18:49:15 +00:00
Roman Divacky	c294022900	Rewrite the test to not require asserts. llvm-svn: 221284	2014-11-04 18:48:20 +00:00
NAKAMURA Takumi	06ac98299f	Remove "REQUIRES:shell" from tests. They work for me. llvm-svn: 221269	2014-11-04 13:41:33 +00:00
Reid Kleckner	06ea7d6213	Lower __builtin_fabs* to @llvm.fabs.* mingw64's headers implement fabs by calling __builtin_fabs, so using the library call results in an infinite loop. If the backend legalizes @llvm.fabs as a call to fabs later, things should work out, as the crt provides a definition. llvm-svn: 221206	2014-11-03 23:52:09 +00:00
Roman Divacky	1ae35b902b	Require asserts to unbreak the buildbots. llvm-svn: 221174	2014-11-03 19:50:48 +00:00
Roman Divacky	8a12d84264	Implement vaarg lowering for ppc32. Lowering of scalars and aggregates is supported. Complex numbers are not. llvm-svn: 221170	2014-11-03 18:32:54 +00:00
Hans Wennborg	606bd6dcc5	Don't dllimport inline functions when targeting MinGW (PR21366) It turns out that MinGW never dllimports or exports inline functions. This means that code compiled with Clang would fail to link with MinGW-compiled libraries since we might try to import functions that are not imported. To fix this, make Clang never dllimport inline functions when targeting MinGW. llvm-svn: 221154	2014-11-03 14:24:45 +00:00
Craig Topper	8c7f251e98	Add FSGSBASE intrinsics to x86 intrinsic headers. llvm-svn: 221130	2014-11-03 06:51:41 +00:00
Craig Topper	e1c664b136	Add _lzcnt_u32 and _lzcnt_u64 to lzcntintrin.h to match Intel documentation names for these intrinsics. llvm-svn: 221066	2014-11-01 22:50:57 +00:00
Reid Kleckner	80944df6f4	Implement IRGen for the x86 vectorcall convention The most complex aspect of the convention is the handling of homogeneous vector and floating point aggregates. Reuse the homogeneous aggregate classification code that we use on PPC64 and ARM for this. This convention also has a C mangling, and we apparently implement that in both Clang and LLVM. Reviewed By: majnemer Differential Revision: http://reviews.llvm.org/D6063 llvm-svn: 221006	2014-10-31 22:00:51 +00:00
Bill Schmidt	691e01d94e	[PowerPC] Initial VSX intrinsic support, with min/max for vector double Now that we have initial support for VSX, we can begin adding intrinsics for programmer access to VSX instructions. This patch performs the necessary enablement in the front end, and tests it by implementing intrinsics for minimum and maximum using the vector double data type. The main change in the front end is to no longer disallow "vector" and "double" in the same declaration (lib/Sema/DeclSpec.cpp), but "vector" and "long double" must still be disallowed. The new intrinsics are accessed via vec_max and vec_min with changes in lib/Headers/altivec.h. Note that for v4f32, we already access corresponding VMX builtins, but with VSX enabled we should use the forms that allow all 64 vector registers. The new built-ins are defined in include/clang/Basic/BuiltinsPPC.def. I've added a new test in test/CodeGen/builtins-ppc-vsx.c that is similar to, but much smaller than, builtins-ppc-altivec.c. This allows us to test VSX IR generation without duplicating CHECK lines for the existing bazillion Altivec tests. Since vector double is now legal when VSX is available, I've modified the error message, and changed where we test for it and for vector long double, since the target machine isn't visible in the old place. This serendipitously removed a not-pertinent warning about 'long' being deprecated when used with 'vector', when "vector long double" is encountered and we just want to issue an error. The existing tests test/Parser/altivec.c and test/Parser/cxx-altivec.cpp have been updated accordingly, and I've added test/Parser/vsx.c to verify that "vector double" is now legitimate with VSX enabled. There is a companion patch for LLVM. llvm-svn: 220989	2014-10-31 19:19:24 +00:00
Kostya Serebryany	5f1b4e8f58	ignore -mconstructor-aliases when adding field paddings for asan Summary: When we are adding field paddings for asan even an empty dtor has to remain in the code, so we ignore -mconstructor-aliases if the paddings are going to be added. Test Plan: added a test Reviewers: rsmith, rnk, rafael Reviewed By: rafael Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D6038 llvm-svn: 220986	2014-10-31 19:01:02 +00:00
Reid Kleckner	e9f6a717dd	Fix ARM HVA classification of classes with non-virtual bases Reuse the PPC64 HVA detection algorithm for ARM and AArch64. This is a nice code deduplication, since they are roughly identical. A few virtual method extension points are needed to understand how big an HVA can be and what element types it can have for a given architecture. Also make the record expansion code work in the presence of non-virtual bases. Reviewed By: uweigand, asl Differential Revision: http://reviews.llvm.org/D6045 llvm-svn: 220972	2014-10-31 17:10:41 +00:00
Hao Liu	6d45b8c385	[AArch64] Add 2 intrinsics vmov_n_p64/vmovq_n_p64, the alias for vdup_n_p64/vdupq_n_p64. As this change is too small, commit it directly. llvm-svn: 220946	2014-10-31 02:41:37 +00:00
Saleem Abdulrasool	a9884bfe3d	test: generalise the test matching The value names may change in different builds, use pattern based tests. llvm-svn: 220861	2014-10-29 16:53:16 +00:00
Saleem Abdulrasool	a25fbef088	CodeGen: add __readfsdword builtin The Windows NT SDK uses __readfsdword and declares it as a compiler provided builtin (#pragma intrinsic(__readfsdword)). Because intrin.h is not referenced by winnt.h, it is not possible to provide an out-of-line definition for the intrinsic. Provide a proper compiler builtin definition. llvm-svn: 220859	2014-10-29 16:35:41 +00:00
Eli Bendersky	95338a09c0	Pass aggregates on the stack without splitting in NVPTX. Following the NVVM IR specifications, arguments of aggregate type should be passed on the stack without splitting (byval). http://reviews.llvm.org/D6020 Patch by Jacques Pienaar. llvm-svn: 220854	2014-10-29 13:43:21 +00:00
Ulrich Weigand	a094f0428b	[PowerPC ABI] Bug 21398 - Consider C++ base classes in HA classification As discussed in bug 21398, PowerPC ABI code needs to consider C++ base classes when classifying a class as homogeneous aggregate (or not) for ABI purposes. llvm-svn: 220852	2014-10-29 13:23:20 +00:00
NAKAMURA Takumi	7acc8a36c7	clang/test/CodeGen/captured-statements-nested.c: Tweak for -Asserts. llvm-svn: 220851	2014-10-29 13:21:52 +00:00
Alexey Bataev	330de03083	Improved capturing variable-length array types in CapturedStmt. An updated implementation of VLA types capturing based on previously committed solution for Lambdas. This version captures the whole VLA type instead of particular variables which are part of VLA size expression and allows the previously calculated size of the VLA type to be used in captured regions. Required for OpenMP. Differential Revision: http://reviews.llvm.org/D5099 llvm-svn: 220850	2014-10-29 12:21:55 +00:00
Kostya Serebryany	68c29da4c5	Do not insert asan paddings after fields that have flexible arrays. Summary: We should avoid a tail padding not only if the last field has zero size but also if the last field is a struct with a flexible array. If/when http://reviews.llvm.org/D5478 is committed, this will also handle the case of structs with zero-sized arrays. Reviewers: majnemer, rsmith Reviewed By: rsmith Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D5924 llvm-svn: 220708	2014-10-27 19:34:10 +00:00
NAKAMURA Takumi	729be14435	Prune CRLF. llvm-svn: 220678	2014-10-27 12:37:26 +00:00
Rafael Espindola	5a1106f8fc	Make this test a bit stricter by checking clang's output too. llvm-svn: 220604	2014-10-25 01:51:19 +00:00
Reid Kleckner	d7857f05f4	Add frontend support for __vectorcall Wire it through everywhere we have support for fastcall, essentially. This allows us to parse the MSVC "14" CTP headers, but we will miscompile them because LLVM doesn't support __vectorcall yet. Reviewed By: Aaron Ballman Differential Revision: http://reviews.llvm.org/D5808 llvm-svn: 220573	2014-10-24 17:42:17 +00:00
Daniel Sanders	aa1b35590f	[mips] Mark aggregate arguments passed in registers with the inreg attribute Summary: This allows us to easily identify them in the backend which in turn allows us to handle them correctly for big-endian targets (where they must be shifted into the upper bits of the register). Depends on D5961 Reviewers: atanasyan Reviewed By: atanasyan Subscribers: cfe-commits, theraven Differential Revision: http://reviews.llvm.org/D5962 llvm-svn: 220566	2014-10-24 15:30:16 +00:00
Daniel Sanders	5b445b3844	[mips] Promote all integral/enumeration types to the GPR width Summary: Ensure all integral/enumeration types are appropriately annotated with signext/zeroext. In particular, i32 now has these attributes when using the N32/N64 ABI. This paves the way for accurately representing the way the N32/N64 ABIs promote integer arguments to i64. Reviewers: atanasyan Reviewed By: atanasyan Subscribers: cfe-commits, theraven Differential Revision: http://reviews.llvm.org/D5961 llvm-svn: 220563	2014-10-24 14:42:42 +00:00
David Blaikie	60a877b5b9	DebugInfo: Omit scopes in -gmlt to reduce metadata size (on disk and in memory) I haven't done any actual impact analysis of this change as it's a strict improvement, but I'd be curious to know how much it helps. llvm-svn: 220408	2014-10-22 19:34:33 +00:00
Alexey Samsonov	6d87ce8bd5	Fixup for r220403: Use getFileLoc() instead of getSpellingLoc() in SanitizerBlacklist. This also handles the case where function name (not its body) is obtained from macro expansion. llvm-svn: 220407	2014-10-22 19:34:25 +00:00
Alexey Samsonov	fa7a8569bb	SanitizerBlacklist: Use spelling location for blacklisting purposes. When SanitizerBlacklist decides if the SourceLocation is blacklisted, we need to first turn it into a SpellingLoc before fetching the filename and scanning "src:" entries. Otherwise we will fail to fetch the correct filename for function definitions coming from macro expansion. llvm-svn: 220403	2014-10-22 18:26:07 +00:00
Jiangning Liu	2bafc2d5ae	Remove including <complex.h> in test case, and change to use _Complex instead. llvm-svn: 220258	2014-10-21 02:19:58 +00:00
Jiangning Liu	444822bbcf	Lower compound assignment for the missing type llvm::Type::FP128TyID. llvm-svn: 220257	2014-10-21 01:34:34 +00:00
David Majnemer	8e133965c8	CodeGen: ConstStructBuilder must verify packed constraints after padding This reverts commit r220169 which reverted r220153. However, it also contains additional changes: - We may need to add padding after we've packed the struct. This occurs when the aligned next field offset is greater than the new field's offset. When this occurs, we make the struct packed. However, once packed the next field offset might be less than the new field's offset. It is in this case that we might further pad the struct. - We would pad structs which were perfectly sized! This behavior is immensely old. This behavior came from blindly subtracting NextFieldOffsetInChars from RecordSize. This doesn't take into account the fact that the struct might have a greater overall alignment than the last field. llvm-svn: 220175	2014-10-19 23:40:06 +00:00
Chandler Carruth	bf972bb2e0	Revert r220153: "CodeGen: ConstStructBuilder must verify packed constraints after padding" This commit caused two tests in LNT to regress. I'm able to reproduce on any platform and will send reproduction steps to the original commit log. This should restore the LNT bots that have been failing. llvm-svn: 220169	2014-10-19 19:41:46 +00:00
Chandler Carruth	0c4b230b32	[complex] Teach the complex math IR gen to emit direct math and a NaN-test prior to the call to the library function. This should automatically make fastmath (including just non-NaNs) able to avoid the expensive libcalls and also open the door to more advanced folding in LLVM based on the rules for complex math. Two important notes to remember: first is that this isn't yet a proper limited range mode, it's still just improving the unlimited range mode. Also, it isn't really perfect w.r.t. what an unlimited range mode should be doing because it isn't quite handling the flags produced by all the operations in the way desirable for that mode, but then neither is compiler-rt's libcall. When the compiler-rt libcall is improved to carefully manage flags, the code emitted here should be improved correspondingly. And it is still a long-term desirable thing to add a limited range mode to Clang that would be able to use direct math without library calls here. Special thanks to Steve Canon for the careful review on this patch and teaching me about these issues. =D Differential Revision: http://reviews.llvm.org/D5756 llvm-svn: 220167	2014-10-19 19:13:49 +00:00
David Majnemer	afefe97e1c	CodeGen: ConstStructBuilder must verify packed constraints after padding Before, ConstStructBuilder::AppendBytes would check packed constraints prior to padding being added before the field's offset. However, adding this padding might force our struct to be packed. Because we wouldn't check after adding padding, ConstStructBuilder would be in an inconsistent state leading to a crash. This fixes PR21300. llvm-svn: 220153	2014-10-19 00:03:10 +00:00
Alexey Samsonov	a0ac3c2bf0	[ASan] Improve blacklisting of global variables. This commit changes the way we blacklist global variables in ASan. Now the global is excluded from instrumentation (either regular bounds checking, or initialization-order checking) if: 1) Global is explicitly blacklisted by its mangled name. This part is left unchanged. 2) SourceLocation of a global is in blacklisted source file. This changes the old behavior, where instead of looking at the SourceLocation of a variable we simply considered llvm::Module identifier. This was wrong, as identifier may not correspond to the file name, and we incorrectly disabled instrumentation for globals coming from #include'd files. 3) Global is blacklisted by type. Now we build the type of a global variable using Clang machinery (QualType::getAsString()), instead of llvm::StructType::getName(). After this commit, the active users of ASan blacklist files may have to revisit them (this is a backwards-incompatible change). llvm-svn: 220097	2014-10-17 22:37:33 +00:00
Kostya Serebryany	644492139f	fix -fsanitize-address-field-padding for the cases with virtual base classes Summary: Correctly compute the non-virtual size of a class. Test Plan: Build SPEC 2006 with -fsanitize-address-field-padding Reviewers: rsmith Reviewed By: rsmith Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D5848 llvm-svn: 220089	2014-10-17 21:02:13 +00:00
Hans Wennborg	0b603cc4e9	Move test/CodeGen/sections.c to CodeGenCXX/sections.cpp The test was running with -xc++. Seems it wants to be a C++ file. llvm-svn: 220069	2014-10-17 18:13:21 +00:00
NAKAMURA Takumi	e316722f4d	Add explicit triple to clang/test/CodeGen/sanitize-address-field-padding.cpp, for now. It's incompatible to ms mangling. llvm-svn: 220037	2014-10-17 12:48:01 +00:00
Joerg Sonnenberger	aa3e9f5a0f	complex long double support for PowerPC llvm-svn: 220034	2014-10-17 11:51:19 +00:00
Renato Golin	031e817630	Use c-tor name to fix the sanitizer test llvm-svn: 220030	2014-10-17 10:09:25 +00:00
Renato Golin	de44aec0e6	Trying to fix failing Clang sanitizer test on ARM bots llvm-svn: 220029	2014-10-17 09:40:21 +00:00
Kostya Serebryany	23387754f8	trying to fix the new test again, this time for the clang-cmake-armv7-a15 bot llvm-svn: 220002	2014-10-17 00:47:30 +00:00
Alexey Samsonov	1444bb9fc8	SanitizerBlacklist: blacklist functions by their source location. This commit changes the way we blacklist functions in ASan, TSan, MSan and UBSan. We used to treat function as "blacklisted" and turned off instrumentation in it in two cases: 1) Function is explicitly blacklisted by its mangled name. This part is not changed. 2) Function is located in llvm::Module, whose identifier is contained in the list of blacklisted sources. This is completely wrong, as llvm::Module may not correspond to the actual source file function is defined in. Also, function can be defined in a header, in which case user had to blacklist the .cpp file this header was #include'd into, not the header itself. Such functions could cause other problems - for instance, if the header was included in multiple source files, compiled separately and linked into a single executable, we could end up with both instrumented and non-instrumented version of the same function participating in the same link. After this change we will make blacklisting decision based on the SourceLocation of a function definition. If a function is not explicitly defined in the source file, (for example, the function is compiler-generated and responsible for initialization/destruction of a global variable), then it will be blacklisted if the corresponding global variable is defined in blacklisted source file, and will be instrumented otherwise. After this commit, the active users of blacklist files may have to revisit them. This is a backwards-incompatible change, but I don't think it's possible or makes sense to support the old incorrect behavior. I plan to make similar change for blacklisting GlobalVariables (which is ASan-specific). llvm-svn: 219997	2014-10-17 00:20:19 +00:00
Hans Wennborg	528c926b3c	test/CodeGen/sections.c: add triple llvm-svn: 219969	2014-10-16 21:36:23 +00:00
Kostya Serebryany	330e9f6c5f	trying to fix the new test on hexagon-build llvm-svn: 219965	2014-10-16 21:22:40 +00:00
Kostya Serebryany	293dc9be6e	Insert poisoned paddings between fields in C++ classes so that AddressSanitizer can find intra-object-overflow bugs Summary: The general approach is to add extra paddings after every field in AST/RecordLayoutBuilder.cpp, then add code to CTORs/DTORs that poisons the paddings (CodeGen/CGClass.cpp). Everything is done under the flag -fsanitize-address-field-padding. The blacklist file (-fsanitize-blacklist) allows the transformation to be avoided for given classes or source files. See also https://code.google.com/p/address-sanitizer/wiki/IntraObjectOverflow Test Plan: run SPEC2006 and some of the Chromium tests with -fsanitize-address-field-padding Reviewers: samsonov, rnk, rsmith Reviewed By: rsmith Subscribers: majnemer, cfe-commits Differential Revision: http://reviews.llvm.org/D5687 llvm-svn: 219961	2014-10-16 20:54:52 +00:00
Hans Wennborg	899ded9cdf	MS Compat: mark globals emitted in read-only sections const They cannot be written to, so marking them const makes sense and may improve optimisation. As a side-effect, SectionInfos has to be moved from Sema to ASTContext. It also fixes this problem, that occurs when compiling ATL: warning LNK4254: section 'ATL' (C0000040) merged into '.rdata' (40000040) with different attributes The ATL headers are putting variables in a special section that's marked read-only. However, Clang currently can't model that read-onlyness in the IR. But, by making the variables const, the section does become read-only, and the linker warning is avoided. Differential Revision: http://reviews.llvm.org/D5812 llvm-svn: 219960	2014-10-16 20:52:46 +00:00
Rafael Espindola	c55172ecbc	Update for llvm change. llvm-svn: 219952	2014-10-16 20:00:22 +00:00
Bradley Smith	04ee8aa1fc	[AArch64] Enable A53 erratum workaround (835769) by default for Android targets llvm-svn: 219933	2014-10-16 16:35:14 +00:00
Alexander Eremin	670c62770e	specify dwarf version for Solaris llvm-svn: 219901	2014-10-16 05:55:24 +00:00
David Majnemer	bb525f7c20	CodeGen: Don't drop thread_local when emitting __thread aliases CodeGen wouldn't mark the aliasee as thread_local if the aliasee was a tentative definition. Even if the definition was already emitted, it would never mark the alias as thread_local. This fixes PR21288. llvm-svn: 219859	2014-10-15 22:38:23 +00:00
Saleem Abdulrasool	4c879bed5b	test: simplify test further Remove the use of an unnecessary function. NFC. llvm-svn: 219850	2014-10-15 21:37:52 +00:00
Tim Northover	147cd2f6e5	ARM: remove ARM/Thumb distinction for preferred alignment. Thumb1 has legitimate reasons for preferring 32-bit alignment of types i1/i8/i16, since the 16-bit encoding of "add rD, sp, #imm" requires #imm to be a multiple of 4. However, this is a trade-off between code size and RAM usage; the DataLayout string is not the best place to represent it even if desired. So this patch removes the extra Thumb requirements, hopefully making ARM and Thumb completely compatible in this respect. llvm-svn: 219735	2014-10-14 22:12:21 +00:00
Tim Northover	b98dc4b015	ARM: set preferred aggregate alignment to 32 universally. Before, ARM and Thumb mode code had different preferred alignments, which could lead to some rather unexpected results. There's justification for reducing it from the default 64-bits (wasted space), but I don't think there is for going below 32-bits. There's no actual ABI change here, just to reassure people. llvm-svn: 219720	2014-10-14 20:57:29 +00:00
Saleem Abdulrasool	64ab4de443	CodeGen: correct mangling for blocks This addresses a regression introduced with SVN r219393. A block may be contained within another block. In such a scenario, we would end up within a BlockDecl, which is not a NamedDecl (as the names are synthesised). The cast to a NamedDecl of the DeclContext would then assert as the types are unrelated. Restore the mangling behaviour to that prior to SVN r219393. If the current block is contained within a BlockDecl, walk up to the parent DeclContext, recursively, until we have a non-BlockDecl. This is expected to be a NamedDecl. Add in a couple of asserts to ensure that the assumption that we only encounter a block within a NamedDecl or a BlockDecl. llvm-svn: 219696	2014-10-14 17:20:14 +00:00
Tyler Nowicki	c724a83e20	Allow constant expressions in pragma loop hints. Previously loop hints such as #pragma loop vectorize_width(#) required a constant. This patch allows a constant expression to be used as well. Such as a non-type template parameter or an expression (2 * c + 1). Reviewed by Richard Smith llvm-svn: 219589	2014-10-12 20:46:07 +00:00
Chandler Carruth	b29a743891	[complex] Teach the other two binary operators on complex numbers (== and !=) to support mixed complex and real operand types. This requires removing an assert from SemaChecking, and adding support both to the constant evaluator and the code generator to synthesize the imaginary part when needed. This seemed somewhat cleaner than having just the comparison operators force real-to-complex conversions. I've added test cases for these operations. I'm really terrified that there were no tests in-tree which exercised this. This turned up when trying to build R after my change to the complex type lowering. llvm-svn: 219570	2014-10-11 11:03:30 +00:00
Chandler Carruth	686de24128	[complex] Use the much more powerful EmitCall routine to call libcalls for complex math. This should fix the windows build bots that started having trouble here and generally fix complex libcall emission on targets which use sret for complex data types. It also makes the code a bit simpler (despite calling into a much more complex bucket of code). llvm-svn: 219565	2014-10-11 09:24:41 +00:00
Chandler Carruth	a216cad0fc	[complex] Teach Clang to preserve different-type operands to arithmetic operators where one type is a C complex type, and to emit both the efficient and correct implementation for complex arithmetic according to C11 Annex G using this extra information. For both multiply and divide the old code was writing a long-hand reduced version of the math without any of the special handling of inf and NaN recommended by the standard here. Instead of putting more complexity here, this change does what GCC does which is to emit a libcall for the fully general case. However, the old code also failed to do the proper minimization of the set of operations when there was a mixed complex and real operation. In those cases, C provides a spec for much more minimal operations that are valid. Clang now emits the exact suggested operations. This change isn't just about performance though, without minimizing these operations, we again lose the correct handling of infinities and NaNs. It is critical that this happen in the frontend based on asymmetric type operands to complex math operations. The performance implications of this change aren't trivial either. I've run a set of benchmarks in Eigen, an open source mathematics library that makes heavy use of complex. While a few have slowed down due to the libcall being introduced, most sped up and some by a huge amount: up to 100% and 140%. In order to make all of this work, also match the algorithm in the constant evaluator to the one in the runtime library. Currently it is a broken port of the simplifications from C's Annex G to the long-hand formulation of the algorithm. Splitting this patch up is very hard because none of this works without the AST change to preserve non-complex operands. Sorry for the enormous change. Follow-up changes will include support for sinking the libcalls onto cold paths in common cases and fastmath improvements to allow more aggressive backend folding.
Differential Revision: http://reviews.llvm.org/D5698 llvm-svn: 219557	2014-10-11 00:57:18 +00:00
Reid Kleckner	79b0fd7a48	Promote null pointer constants used as arguments to variadic functions Make it possible to pass NULL through variadic functions on 64-bit Windows targets. The Visual C++ headers define NULL to 0, when they should define it to 0LL on Win64 so that NULL is a pointer-sized integer. Fixes PR20949. Reviewers: thakis, rsmith Differential Revision: http://reviews.llvm.org/D5480 llvm-svn: 219456	2014-10-10 00:05:45 +00:00
Alexey Bataev	9b280eab66	Fix compatibility issues in tests for PredefinedExpr with MSVC. llvm-svn: 219405	2014-10-09 11:58:26 +00:00
Robert Khasanov	b9f3a911c9	[AVX512] Added VPCMPEQ intrinsics to headers. Added tests. Patch by Maxim Blumenthal <maxim.blumenthal@intel.com> llvm-svn: 219319	2014-10-08 17:18:13 +00:00
Hal Finkel	64567a80d2	Emit @llvm.assume for non-parameter lvalue align_value-attribute loads We already add the align parameter attribute for function parameters that have the align_value attribute (or those with a typedef type having that attribute), which is an important special case, but does not handle pointers with value alignment assumptions that come into scope in any other way. To handle the general case, emit an @llvm.assume-based alignment assumption whenever we load the pointer-typed lvalue of an align_value-attributed variable (except for function parameters, which we already deal with at entry). I'll also note that this is more general than Intel's described support in: https://software.intel.com/en-us/articles/data-alignment-to-assist-vectorization which states that the compiler inserts __assume_aligned directives in response to align_value-attributed variables only for function parameters and for the initializers of local variables. I think that we can make the optimizer deal with this more-general scheme (which could lead to a lot of calls to @llvm.assume inside of loop bodies, for example), but if not, I'll rework this to be less aggressive. llvm-svn: 219052	2014-10-04 15:26:49 +00:00
Duncan P. N. Exon Smith	3c51fa6aae	Revert "Revert "DI: LLVM schema change: fold constants into string"" This reverts commit r218917, effectively reapplying r218913. Original commit message follows. -- Update debug info testcases for an LLVM metadata schema change to fold metadata constant operands into a single `MDString`. Part of PR17891. llvm-svn: 219011	2014-10-03 20:01:52 +00:00
Hal Finkel	189c699cad	Make test/CodeGen/atomic-ops.c free-standing This test includes stdint.h (via stdatomic.h), which might include system headers (and that might not work, depending on the system configuration). Attempting to fix llvm-clang-lld-x86_64-debian-fast. llvm-svn: 218960	2014-10-03 05:04:49 +00:00
Hal Finkel	6970ac8b0a	Add an implementation of C11's stdatomic.h Adds a Clang-specific implementation of C11's stdatomic.h header. On systems, such as FreeBSD, where a stdatomic.h header is already provided, we defer to that header instead (using our __has_include_next technology). Otherwise, we provide an implementation in terms of our __c11_atomic_* intrinsics (that were created for this purpose). C11 7.1.4p1 requires function declarations for atomic_thread_fence, atomic_signal_fence, atomic_flag_test_and_set, atomic_flag_test_and_set_explicit, and atomic_flag_clear, and requires that they have external linkage. Accordingly, we provide these declarations, but if a user elides the shadowing macros and uses them, then they must have a libc (or similar) that actually provides definitions. atomic_flag is implemented using _Bool as the underlying type. This is consistent with the implementation provided by FreeBSD and also GCC 4.9 (at least when __GCC_ATOMIC_TEST_AND_SET_TRUEVAL == 1). Patch by Richard Smith (rebased and slightly edited by me -- Richard said I should drive at this point). llvm-svn: 218957	2014-10-03 04:29:40 +00:00
Duncan P. N. Exon Smith	834c265e85	Revert "DI: LLVM schema change: fold constants into string" This reverts commit r218913 while I investigate some bots. llvm-svn: 218917	2014-10-02 22:15:09 +00:00
Duncan P. N. Exon Smith	02b418a875	DI: LLVM schema change: fold constants into string Update debug info testcases for an LLVM metadata schema change to fold metadata constant operands into a single `MDString`. Part of PR17891. llvm-svn: 218913	2014-10-02 21:56:07 +00:00
Hal Finkel	1b0d24e03a	Initial support for the align_value attribute This adds support for the align_value attribute. This attribute is supported by Intel's compiler (versions 14.0+), and several of my HPC users have requested support in Clang. It specifies an alignment assumption on the values to which a pointer points, and is used by numerical libraries to encourage efficient generation of vector code. Of course, we already have an aligned attribute that can specify enhanced alignment for a type, so why is this additional attribute important? The problem is that if you want to specify that an input array of T is, say, 64-byte aligned, you could try this: typedef double aligned_double __attribute__((aligned(64))); void foo(aligned_double *P) { double x = P[0]; // This is fine. double y = P[1]; // What alignment did those doubles have again? } the access here to P[1] causes problems. P was specified as a pointer to type aligned_double, and any object of type aligned_double must be 64-byte aligned. But if P[0] is 64-byte aligned, then P[1] cannot be, and this access causes undefined behavior. Getting round this problem requires a lot of awkward casting and hand-unrolling of loops, all of which is bad. With the align_value attribute, we can accomplish what we'd like in a well defined way: typedef double *aligned_double_ptr __attribute__((align_value(64))); void foo(aligned_double_ptr P) { double x = P[0]; // This is fine. double y = P[1]; // This is fine too. } This attribute does not create a new type (and so is not part of the type system), and so will only "propagate" through templates, auto, etc. by optimizer deduction after inlining. This seems consistent with Intel's implementation (thanks to Alexey for confirming the various Intel-compiler behaviors). As a final note, I would have chosen to call this aligned_value, not align_value, for better naming consistency with the aligned attribute, but I think it would be more useful to users to adopt Intel's name.
llvm-svn: 218910	2014-10-02 21:21:25 +00:00
Hal Finkel	d2208b59cf	Add __sync_fetch_and_nand (again) Prior to GCC 4.4, __sync_fetch_and_nand was implemented as: { tmp = *ptr; *ptr = ~tmp & value; return tmp; } but this was changed in GCC 4.4 to be: { tmp = *ptr; *ptr = ~(tmp & value); return tmp; } in response to this change, support for __sync_fetch_and_nand (and __sync_nand_and_fetch) was removed in r99522 in order to avoid miscompiling code depending on the old semantics. However, at this point: 1. Many years have passed, and the amount of code relying on the old semantics is likely smaller. 2. Through the work of many contributors, all LLVM backends have been updated such that "atomicrmw nand" provides the newer GCC 4.4+ semantics (this process was complete in July of 2014; added to the release notes in r212635). 3. The lack of this intrinsic is now a needless impediment to porting codes from GCC to Clang (I've now seen several examples of this). It is true, however, that we still set __GNUC_MINOR__ to 2 (corresponding to GCC 4.2). To compensate for this, and to address the original concern regarding code relying on the old semantics, I've added a warning that specifically details the fact that the semantics have changed and that we provide the newer semantics. Fixes PR8842. llvm-svn: 218905	2014-10-02 20:53:50 +00:00
Job Noorman	ac95cd5c22	Make sure aggregates are properly aligned on MSP430. llvm-svn: 218666	2014-09-30 11:19:13 +00:00
NAKAMURA Takumi	6ed6ef7ac2	clang/test/CodeGen/builtin-assume-aligned.c: Fix for -Asserts. llvm-svn: 218507	2014-09-26 09:37:15 +00:00
Hal Finkel	ee90a223ea	Support the assume_aligned function attribute In addition to __builtin_assume_aligned, GCC also supports an assume_aligned attribute which specifies the alignment (and optional offset) of a function's return value. Here we implement support for the assume_aligned attribute by making use of the @llvm.assume intrinsic. llvm-svn: 218500	2014-09-26 05:04:30 +00:00
Jan Vesely	b4379f9c2c	CGBuiltin: Use frem instruction rather than libcall to implement fmod AFAICT the semantics of frem match libm's fmod. Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Tom Stellard <tom@stellard.net> llvm-svn: 218488	2014-09-26 01:19:41 +00:00
Nico Weber	8f63ae1d4c	Simplify tests. This reverts bits of r218166 that are no longer necessary now that r218394 made -Wmissing-prototype-for-cc a regular warning. llvm-svn: 218400	2014-09-24 18:25:54 +00:00
Reid Kleckner	2e0717e129	Downgrade error about stdcall decls with no prototype to a warning Fixes PR21027. The MIDL compiler produces code that does this. If we wanted to improve the warning, I think we could do this: void __stdcall f(); // Don't warn without -Wstrict-prototypes. void g() { f(); // Might warn, the user probably meant for f to take no args. f(1, 2, 3); // Warn, we have no idea what args f takes. f(1); // Error, this is insane, one of these calls is broken. } Reviewers: thakis Differential Revision: http://reviews.llvm.org/D5481 llvm-svn: 218394	2014-09-24 17:49:24 +00:00
Robert Khasanov	ea13042cf2	[x86] Fixed argument types in intrinsics: _addcarryx_u64 _addcarry_u64 _subborrow_u64 Thanks Pasi Parviainen for notice. llvm-svn: 218376	2014-09-24 06:45:23 +00:00
Daniel Sanders	caf534ef96	[mips] Fix r218248's testcase to use -O1 instead of -O3. llvm-svn: 218298	2014-09-23 08:58:04 +00:00
Ehsan Akhgari	3e2db26efc	ms-inline-asm: Add a test case for the usage of labels in bracket expressions Summary: This is a test for this patch: http://reviews.llvm.org/D5445. Reviewers: rnk Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D5446 llvm-svn: 218271	2014-09-22 20:41:39 +00:00
Kaelyn Takata	a1e18cc5b9	Fix test/CodeGen/mips-varargs.c to use %clang_cc1 Only tests under test/Driver should use %clang, and test/CodeGen in particular must always use %clang_cc1. llvm-svn: 218260	2014-09-22 18:06:01 +00:00
NAKAMURA Takumi	22a0fd416c	clang/test/CodeGen/mips-varargs.c: Fixup for -Asserts. llvm-svn: 218256	2014-09-22 16:40:05 +00:00
Daniel Sanders	8d36a61f52	[mips] Correct alignment of vectors passed in varargs for the O32 ABI. Summary: Vectors are normally 16-byte aligned, however the O32 ABI enforces a maximum alignment of 8-bytes since the base of the stack is 8-byte aligned. Previously, this was enforced on the caller side, but not on the callee side. This fixes the output of OpenCL's printf when given vectors. Reviewers: atanasyan Reviewed By: atanasyan Subscribers: llvm-commits, pekka.jaaskelainen Differential Revision: http://reviews.llvm.org/D5433 llvm-svn: 218248	2014-09-22 13:27:06 +00:00
Ehsan Akhgari	31097581aa	ms-inline-asm: Scope inline asm labels to functions Summary: This fixes PR20023. In order to implement this scoping rule, we piggy back on the existing LabelDecl machinery, by creating LabelDecl's that will carry the "internal" name of the inline assembly label, which we will rewrite the asm label to. Reviewers: rnk Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D4589 llvm-svn: 218230	2014-09-22 02:21:54 +00:00
Nico Weber	d191063c6c	Follow-up to r214408: Warn on other callee-cleanup functions without prototype too. According to lore, we used to verifier-fail on: void __thiscall f(); int main() { f(1); } So that's fixed now. System headers use prototype-less __stdcall functions, so make that a warning that's DefaultError -- then it fires on regular code but is suppressed in system headers. Since it's used in system headers, we have codegen tests for this; massage them slightly so that they still compile. llvm-svn: 218166	2014-09-19 23:07:12 +00:00
Robert Khasanov	2c589bcc5e	[x86] Add _addcarry_u{32|64} and _subborrow_u{32|64}. They are added to adxintrin.h but outside the __ADX__ block. These intrinsics generate adc and sbb respectively, which were available before ADX llvm-svn: 218118	2014-09-19 10:29:22 +00:00
Robert Khasanov	83c419b349	[x86] Added _addcarryx_u32, _addcarryx_u64 intrinsics llvm-svn: 218117	2014-09-19 10:17:06 +00:00
Akira Hatanaka	e867e422e2	[X86, inlineasm] Do not allow using constraint 'x' for a variable larger than 128-bit unless the target CPU supports AVX. rdar://problem/11846140 llvm-svn: 218082	2014-09-18 21:58:54 +00:00
Hans Wennborg	3c619a43d5	[X86, inline-asm] Allow 256-bit wide operands for the 'x' constraints The 'x' constraint is for "any SSE register", and GCC seems to include the 256-bit ymm registers in that concept. llvm-svn: 218073	2014-09-18 20:24:04 +00:00
Akira Hatanaka	974131ea88	[X86, inlineasm] Check that the output size is correct for the given constraint. llvm-svn: 218064	2014-09-18 18:17:18 +00:00
Akira Hatanaka	3ab9ada59c	Fix test case. This is another follow-up patch to r217996. llvm-svn: 218003	2014-09-18 00:29:04 +00:00
Akira Hatanaka	d7e375d4b3	Fix test case. This is a follow-up to r217994. llvm-svn: 217996	2014-09-18 00:04:10 +00:00
Akira Hatanaka	31c6d3b71e	[X86, inline-asm] Check that the input size is correct for constraints R, q, Q, S, D, A, y, x, f, t, and u. This is a follow-up patch for r167717. rdar://problem/11846140 rdar://problem/17476970 llvm-svn: 217994	2014-09-17 23:35:14 +00:00
Alexey Samsonov	8e1162c71d	Implement nonnull-attribute sanitizer Summary: This patch implements a new UBSan check, which verifies that function arguments declared to be nonnull with __attribute__((nonnull)) are actually nonnull in runtime. To implement this check, we pass FunctionDecl to CodeGenFunction::EmitCallArgs (where applicable) and if function declaration has nonnull attribute specified for a certain formal parameter, we compare the corresponding RValue to null as soon as it's calculated. Test Plan: regression test suite Reviewers: rsmith Reviewed By: rsmith Subscribers: cfe-commits, rnk Differential Revision: http://reviews.llvm.org/D5082 llvm-svn: 217389	2014-09-08 17:22:45 +00:00
NAKAMURA Takumi	4b04c11d00	clang/test/CodeGen/builtin-assume*.c: Fixup for -Asserts. llvm-svn: 217352	2014-09-08 01:12:55 +00:00
Hal Finkel	bcc06085a8	Add __builtin_assume and __builtin_assume_aligned using @llvm.assume. This makes use of the recently-added @llvm.assume intrinsic to implement a __builtin_assume(bool) intrinsic (to provide additional information to the optimizer). This hooks up __assume in MS-compatibility mode to mirror __builtin_assume (the semantics have been intentionally kept compatible), and implements GCC's __builtin_assume_aligned as assume((p - o) & mask == 0). LLVM now contains special logic to deal with assumptions of this form. llvm-svn: 217349	2014-09-07 22:58:14 +00:00
Chandler Carruth	2949e548f4	[x86] Clean up the x86 builtin specs to reflect r217310 in LLVM which made the 8-bit masks actually 8-bit arguments to these intrinsics. These builtins are a mess. Many were missing the I qualifier which I added where obviously correct. Most aren't tested, but I've updated the relevant tests. I've tried to catch all the things that should become 'c' in this round. It's also frustrating because the set of these is really ad-hoc and doesn't really map that cleanly to the set supported by either GCC or LLVM. Oh well... llvm-svn: 217311	2014-09-06 10:30:51 +00:00
James Molloy	163b1ba471	[ARMv8] Add support for 32-bit MIN/MAXNM and directed rounding. This patch adds support for the 32bit numeric max/min and directed round-to-integral NEON intrinsics that were added as part of v8, along with unit tests. Patch by Graham Hunter! llvm-svn: 217242	2014-09-05 13:50:34 +00:00
Hans Wennborg	d71907dd07	Don't emit prologues or epilogues for naked functions (PR18791, PR20028) For naked functions with parameters, Clang would still emit stores in the prologue that would clobber the stack, because LLVM doesn't set up a stack frame. (This shows up in -O0 compiles, because the stores are optimized away otherwise.) For example: __attribute__((naked)) int f(int x) { asm("movl $42, %eax"); asm("retl"); } Would result in: _Z1fi: movl 12(%esp), %eax movl %eax, (%esp) <--- Oops. movl $42, %eax retl Differential Revision: http://reviews.llvm.org/D5183 llvm-svn: 217198	2014-09-04 22:16:33 +00:00
Reid Kleckner	9b3e3dfc54	MS inline asm: Allow __asm blocks to set a return value If control falls off the end of a function after an __asm block, MSVC assumes that the inline assembly filled the EAX and possibly EDX registers with an appropriate return value. This functionality is used in inline functions returning 64-bit integers in system headers, so we need some amount of compatibility. This is implemented in Clang by adding extra output constraints to every inline asm block, and storing the resulting output registers into the return value slot. If we see an asm block somewhere in the function body, we emit a normal epilogue instead of marking the end of the function with a return type unreachable. Normal returns in functions not using this functionality will overwrite the return value slot, and in most cases LLVM should be able to eliminate the dead stores. Fixes PR17201. Reviewed By: majnemer Differential Revision: http://reviews.llvm.org/D5177 llvm-svn: 217187	2014-09-04 20:04:38 +00:00
Reid Kleckner	a4ab03ec21	MS inline asm: Add a test for xgetbv clobbers llvm-svn: 217174	2014-09-04 16:58:47 +00:00
Daniel Sanders	e5018b6c00	[mips] Mark aggregates returned in registers with the 'inreg' attribute. Summary: This allows us to easily find them in the backend after the aggregates have been lowered to other types. This is important on big-endian targets using the N32/N64 ABI's since these ABI's must shift small structures into the upper bits of the register. Reviewers: atanasyan Reviewed By: atanasyan Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5005 llvm-svn: 217160	2014-09-04 15:05:39 +00:00
Daniel Sanders	ed39f58390	[mips] Zero-sized structs cannot be ignored in MipsABIInfo::classifyReturnType() for O32 Summary: They are returned indirectly which causes the other arguments to move to the next argument slot. With this, utils/ABITest does not discover any failing cases in the first 500 attempts on big/little endian for O32. Previously some of these failed. Also tested N32/N64 little endian (big endian has other known issues) with no issues. Reviewers: atanasyan Reviewed By: atanasyan Subscribers: atanasyan, cfe-commits Differential Revision: http://reviews.llvm.org/D4811 llvm-svn: 217147	2014-09-04 13:28:14 +00:00
Tom Stellard	c4e0c1075b	CGBuiltin: Use @llvm.fabs rather than fabs libcall when emitting builtins Using the intrinsic allows the SelectionDAGBuilder to turn this call into the FABS Node and also the intrinsic is something the vectorizer knows how to vectorize. This patch also sets the readnone attribute on this call, which should enable additional optimizations. llvm-svn: 217042	2014-09-03 15:24:29 +00:00
Hans Wennborg	2029991d74	Check in a test case for the problem with late-dropped dllimport (PR20803) llvm-svn: 216749	2014-08-29 17:36:11 +00:00
James Molloy	90d6101410	Use store size instead of alloc size when coercing. Previously, EnterStructPointerForCoercedAccess used Alloc size when determining how to convert. This was problematic, because there were situations where the alloc size was larger than the store size. For example, if the first element of a structure were i24 and the destination type were i32, the old code would generate a GEP and a load i24. The code should compare store sizes to ensure the whole object is loaded. I have attached a test case. This patch modifies the output of arm64-be-bitfield.c test case, but the new IR seems to be equivalent, and after -O3, the compiler generates identical ARM assembly. (asr x0, x0, #54) Patch by Thomas Jablin! llvm-svn: 216722	2014-08-29 10:17:52 +00:00
David Majnemer	0392cf892f	CodeGen: Don't completely mess-up optimized atomic libcalls Summary: We did a great job getting this wrong: - We messed up which LLVM IR types to use for arguments and return values. The optimized libcalls use integer types for values. Clang attempted to use the IR type which corresponds to the value passed in instead of using an appropriately sized integer type. This would result in violations of the ABI for, as an example, floating point types. - We didn't bother recording the result of the atomic libcall in the destination memory. Instead, call the functions with arguments matching the type of the libcall prototype's parameters. This fixes PR20780. Differential Revision: http://reviews.llvm.org/D5098 llvm-svn: 216714	2014-08-29 07:27:49 +00:00
Kostya Serebryany	4a9187a810	call __asan_load_cxx_array_cookie when loading array cookie in asan mode. Summary: The current implementation of asan cookie is incorrect: we add nosanitize metadata to the cookie load, but the metadata may be lost and we will instrument the load from poisoned memory. This change replaces the load with a call to __asan_load_cxx_array_cookie (r216692) Reviewers: rsmith Reviewed By: rsmith Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D5111 llvm-svn: 216702	2014-08-29 01:01:32 +00:00
Hans Wennborg	0a20f5417c	Better codegen support for DLL attributes being dropped after the first declaration (PR20792) For the following code: __declspec(dllimport) int f(int x); int user(int x) { return f(x); } int f(int x) { return 1; } Clang will drop the dllimport attribute in the AST, but CodeGen would have already put it on the LLVM::Function, and that would never get updated. (The same thing happens for global variables.) This makes Clang check dropped DLL attribute case each time the LLVM object is referenced. This isn't perfect, because we will still get it wrong if the function is never referenced by codegen after the attribute is dropped, but this handles the common cases and makes us not fail in the verifier. llvm-svn: 216699	2014-08-29 00:16:06 +00:00
Yi Kong	623393f31e	arm_acle: Implement data processing intrinsics Summary: ACLE 2.0 section 9.2 defines the following "miscellaneous data processing intrinsics": `__clz`, `__cls`, `__ror`, `__rev`, `__rev16`, `__revsh` and `__rbit`. `__clz` has already been implemented in the arm_acle.h header file. The rest are not supported yet. This patch completes ACLE data processing intrinsics. Reviewers: t.p.northover, rengolin Reviewed By: rengolin Subscribers: aemerson, mroth, llvm-commits Differential Revision: http://reviews.llvm.org/D4983 llvm-svn: 216658	2014-08-28 09:44:07 +00:00
Alexey Samsonov	9fc9bf83a8	Properly handle multiple nonnull attributes in CodeGen llvm-svn: 216638	2014-08-28 00:53:20 +00:00
Richard Smith	00cc1c09c3	Fix regression in r216520: don't apply nonnull to non-pointer function parameters in the IR. llvm-svn: 216574	2014-08-27 18:56:18 +00:00
Oliver Stannard	ed8ecc8429	Allow __fp16 as a function arg or return type for AArch64 ACLE 2.0 allows __fp16 to be used as a function argument or return type. This enables this for AArch64. This also fixes an existing bug that causes clang to not allow homogeneous floating-point aggregates with a base type of __fp16. This is valid for AAPCS64, but not for AAPCS-VFP. llvm-svn: 216558	2014-08-27 16:31:57 +00:00
NAKAMURA Takumi	6107a8f4db	Quick fix to test/CodeGen/2007-06-18-SextAttrAggregate.c for x86_64-mingw32, corresponding to r216507. FIXME: Explicit triplets might be given here. llvm-svn: 216557	2014-08-27 16:22:26 +00:00
Oliver Stannard	2bfdc5b517	Move some ARM-specific code from CGCall.cpp to TargetInfo.cpp This tidies up some ARM-specific code added by r208417 to move it out of the target-independent parts of clang into TargetInfo.cpp. This also has the advantage that we can now flatten struct arguments to variadic AAPCS functions. llvm-svn: 216535	2014-08-27 10:43:15 +00:00

2983 Commits