llvm-project

Commit Graph

Author	SHA1	Message	Date
Richard Trieu	dc4cb02470	Revert r295421, new ODR checker for modules, to fix build bot. llvm-svn: 295427	2017-02-17 07:19:24 +00:00
Richard Trieu	cb6b72628e	Add better ODR checking for modules. A slightly weaker form of ODR checking than previous attempts, but hopefully won't break the modules build bot. Future work will be needed to catch all cases. When objects are imported for modules, there is a chance that a name collision will cause an ODR violation. Previously, only a small number of such violations were detected. This patch provides a stronger check based on AST nodes. The information needed to uniquely identify an object is taken from the AST and put into a one-dimensional byte stream. This stream is then hashed to give a value to represent the object, which is stored with the other object data in the module. When modules are loaded, and Decl's are merged, the hash values of the two Decl's are compared. Only Decl's with matched hash values will be merged. Mismatch hashes will generate a module error, and if possible, point to the first difference between the two objects. The transform from AST to byte stream is a modified depth first algorithm. Due to references between some AST nodes, a pure depth first algorithm could generate loops. For Stmt nodes, a straight depth first processing occurs. For Type and Decl nodes, they are replaced with an index number and only on first visit will these nodes be processed. As an optimization, boolean values are saved and stored together in reverse order at the end of the byte stream to lower the ammount of data that needs to be hashed. Compile time impact was measured at 1.5-2.0% during module building, and negligible during builds without module building. Differential Revision: https://reviews.llvm.org/D21675 llvm-svn: 295421	2017-02-17 05:54:30 +00:00
Argyrios Kyrtzidis	74790484b5	[index] Improvde how we handle synthesized ObjC properties and the associated ivars. Related synthesized properties with the ivar they use with the 'accessor' relation, and make sure we mark them 'implicit' when appropriate. Patch by Nathan Hawes! https://reviews.llvm.org/D30012 llvm-svn: 295416	2017-02-17 04:49:41 +00:00
Vedant Kumar	55875b9955	Retry: [ubsan] Reduce null checking of C++ object pointers (PR27581) This patch teaches ubsan to insert exactly one null check for the 'this' pointer per method/lambda. Previously, given a load of a member variable from an instance method ('this->x'), ubsan would insert a null check for 'this', and another null check for '&this->x', before allowing the load to occur. Similarly, given a call to a method from another method bound to the same instance ('this->foo()'), ubsan would a redundant null check for 'this'. There is also a redundant null check in the case where the object pointer is a reference ('Ref.foo()'). This patch teaches ubsan to remove the redundant null checks identified above. Testing: check-clang and check-ubsan. I also compiled X86FastISel.cpp with -fsanitize=null using patched/unpatched clangs based on r293572. Here are the number of null checks emitted: ------------------------------------- \| Setup \| # of null checks \| ------------------------------------- \| unpatched, -O0 \| 21767 \| \| patched, -O0 \| 10758 \| ------------------------------------- Changes since the initial commit: don't rely on IRGen of C labels in the test. Differential Revision: https://reviews.llvm.org/D29530 llvm-svn: 295401	2017-02-17 02:03:51 +00:00
Vedant Kumar	4f94a94bea	Revert "[ubsan] Reduce null checking of C++ object pointers (PR27581)" This reverts commit r295391. It breaks this bot: http://lab.llvm.org:8011/builders/clang-with-thin-lto-ubuntu/builds/1898 I need to not rely on labels in the IR test. llvm-svn: 295396	2017-02-17 01:42:36 +00:00
Vedant Kumar	3e5a9a6be8	[ubsan] Reduce null checking of C++ object pointers (PR27581) This patch teaches ubsan to insert exactly one null check for the 'this' pointer per method/lambda. Previously, given a load of a member variable from an instance method ('this->x'), ubsan would insert a null check for 'this', and another null check for '&this->x', before allowing the load to occur. Similarly, given a call to a method from another method bound to the same instance ('this->foo()'), ubsan would a redundant null check for 'this'. There is also a redundant null check in the case where the object pointer is a reference ('Ref.foo()'). This patch teaches ubsan to remove the redundant null checks identified above. Testing: check-clang and check-ubsan. I also compiled X86FastISel.cpp with -fsanitize=null using patched/unpatched clangs based on r293572. Here are the number of null checks emitted: ------------------------------------- \| Setup \| # of null checks \| ------------------------------------- \| unpatched, -O0 \| 21767 \| \| patched, -O0 \| 10758 \| ------------------------------------- Differential Revision: https://reviews.llvm.org/D29530 llvm-svn: 295391	2017-02-17 01:05:42 +00:00
Alex Lorenz	55aaa844cb	Use correct fix-it location for -Wblock-capture-autoreleasing The '__autoreleasing' keyword should be inserted after the Objective-C pointer type. rdar://30123548 llvm-svn: 295381	2017-02-16 23:15:36 +00:00
Richard Smith	efa919ab07	Properly set up the DeclContext for parameters of implicit deduction guides; this is needed for deferred instantiation of default arguments. llvm-svn: 295379	2017-02-16 21:29:21 +00:00
Arpith Chacko Jacob	fc711b1f47	[OpenMP] Teams reduction on the NVPTX device. This patch implements codegen for the reduction clause on any teams construct for elementary data types. It builds on parallel reductions on the GPU. Subsequently, the team master writes to a unique location in a global memory scratchpad. The last team to do so loads and reduces this array to calculate the final result. This patch emits two helper functions that are used by the OpenMP runtime on the GPU to perform reductions across teams. Patch by Tian Jin in collaboration with Arpith Jacob Reviewers: ABataev Differential Revision: https://reviews.llvm.org/D29879 llvm-svn: 295335	2017-02-16 16:48:49 +00:00
Arpith Chacko Jacob	101e8fb1f3	[OpenMP] Parallel reduction on the NVPTX device. This patch implements codegen for the reduction clause on any parallel construct for elementary data types. An efficient implementation requires hierarchical reduction within a warp and a threadblock. It is complicated by the fact that variables declared in the stack of a CUDA thread cannot be shared with other threads. The patch creates a struct to hold reduction variables and a number of helper functions. The OpenMP runtime on the GPU implements reduction algorithms that uses these helper functions to perform reductions within a team. Variables are shared between CUDA threads using shuffle intrinsics. An implementation of reductions on the NVPTX device is substantially different to that of CPUs. However, this patch is written so that there are minimal changes to the rest of OpenMP codegen. The implemented design allows the compiler and runtime to be decoupled, i.e., the runtime does not need to know of the reduction operation(s), the type of the reduction variable(s), or the number of reductions. The design also allows reuse of host codegen, with appropriate specialization for the NVPTX device. While the patch does introduce a number of abstractions, the expected use case calls for inlining of the GPU OpenMP runtime. After inlining and optimizations in LLVM, these abstractions are unwound and performance of OpenMP reductions is comparable to CUDA-canonical code. Patch by Tian Jin in collaboration with Arpith Jacob Reviewers: ABataev Differential Revision: https://reviews.llvm.org/D29758 llvm-svn: 295333	2017-02-16 16:20:16 +00:00
Arpith Chacko Jacob	bd6344c0be	Revert r295319 while investigating buildbot failure. llvm-svn: 295323	2017-02-16 14:25:35 +00:00
Arpith Chacko Jacob	8e170fc857	[OpenMP] Parallel reduction on the NVPTX device. This patch implements codegen for the reduction clause on any parallel construct for elementary data types. An efficient implementation requires hierarchical reduction within a warp and a threadblock. It is complicated by the fact that variables declared in the stack of a CUDA thread cannot be shared with other threads. The patch creates a struct to hold reduction variables and a number of helper functions. The OpenMP runtime on the GPU implements reduction algorithms that uses these helper functions to perform reductions within a team. Variables are shared between CUDA threads using shuffle intrinsics. An implementation of reductions on the NVPTX device is substantially different to that of CPUs. However, this patch is written so that there are minimal changes to the rest of OpenMP codegen. The implemented design allows the compiler and runtime to be decoupled, i.e., the runtime does not need to know of the reduction operation(s), the type of the reduction variable(s), or the number of reductions. The design also allows reuse of host codegen, with appropriate specialization for the NVPTX device. While the patch does introduce a number of abstractions, the expected use case calls for inlining of the GPU OpenMP runtime. After inlining and optimizations in LLVM, these abstractions are unwound and performance of OpenMP reductions is comparable to CUDA-canonical code. Patch by Tian Jin in collaboration with Arpith Jacob Reviewers: ABataev Differential Revision: https://reviews.llvm.org/D29758 llvm-svn: 295319	2017-02-16 14:03:36 +00:00
Anastasia Stulova	58984e7087	[OpenCL] Correct ndrange_t implementation Removed ndrange_t as Clang builtin type and added as a struct type in the OpenCL header. Use type name to do the Sema checking in enqueue_kernel and modify IR generation accordingly. Review: D28058 Patch by Dmitry Borisenkov! llvm-svn: 295311	2017-02-16 12:27:47 +00:00
Anastasia Stulova	9d98a316c5	[OpenCL] Disallow blocks capture other blocks (v2.0, s6.12.5) llvm-svn: 295307	2017-02-16 11:13:30 +00:00
Richard Trieu	e55fb7f6f1	Revert r295284: Add better ODR checking for modules. Fix modules build bot. llvm-svn: 295293	2017-02-16 07:09:18 +00:00
Craig Topper	f0d1147fae	[AVX-512] Replace 512-bit masked packss/packus builtins and replace with new unmasked builtins. These new unmasked builtins will enable us to easily support optimizing these builtins in InstCombine in the backend. llvm-svn: 295291	2017-02-16 06:32:07 +00:00
Richard Trieu	f351ac8987	Add better ODR checking for modules. Recommit r293585 that was reverted in r293611 with new fixes. The previous issue was determined to be an overly aggressive AST visitor from forward declared objects. The visitor will now only deeply visit certain Decl's and only do a shallow information extraction from all other Decl's. When objects are imported for modules, there is a chance that a name collision will cause an ODR violation. Previously, only a small number of such violations were detected. This patch provides a stronger check based on AST nodes. The information needed to uniquely identify an object is taken from the AST and put into a one-dimensional byte stream. This stream is then hashed to give a value to represent the object, which is stored with the other object data in the module. When modules are loaded, and Decl's are merged, the hash values of the two Decl's are compared. Only Decl's with matched hash values will be merged. Mismatch hashes will generate a module error, and if possible, point to the first difference between the two objects. The transform from AST to byte stream is a modified depth first algorithm. Due to references between some AST nodes, a pure depth first algorithm could generate loops. For Stmt nodes, a straight depth first processing occurs. For Type and Decl nodes, they are replaced with an index number and only on first visit will these nodes be processed. As an optimization, boolean values are saved and stored together in reverse order at the end of the byte stream to lower the ammount of data that needs to be hashed. Compile time impact was measured at 1.5-2.0% during module building, and negligible during builds without module building. Differential Revision: https://reviews.llvm.org/D21675 llvm-svn: 295284	2017-02-16 04:53:40 +00:00
Richard Smith	f0393bf0a5	Revert r295277 to fix buildbot. llvm-svn: 295281	2017-02-16 04:22:56 +00:00
Faisal Vali	051e3a2b7d	[cxx1z-constexpr-lambda] Implement captures - thus completing implementation of constexpr lambdas. Enable evaluation of captures within constexpr lambdas by using a strategy similar to that used in CodeGen: - when starting evaluation of a lambda's call operator, create a map from VarDecl's to a closure's FieldDecls - every time a VarDecl (or '*this) that represents a capture is encountered while evaluating the expression via the expression evaluator (specifically the LValueEvaluator) in ExprConstant.cpp - it is replaced by the corresponding FieldDecl LValue (an Lvalue-to-Rvalue conversion on this LValue representation then determines the right rvalue when needed). Thanks to Richard Smith and Hubert Tong for their review and feedback! https://reviews.llvm.org/D29748 llvm-svn: 295279	2017-02-16 04:12:21 +00:00
Richard Smith	86a1b135f0	Add missing "deduced A == A" check for function template partial ordering. This appears to be the only template argument deduction context where we were missing this check. Surprisingly, other implementations also appear to miss the check in this case; it may turn out that important code is relying on the widespread non-conformance here, in which case we'll need to reconsider. llvm-svn: 295277	2017-02-16 03:49:44 +00:00
Evgeniy Stepanov	287b04b000	Add missing regexp quantifiers in a test. llvm-svn: 295267	2017-02-16 01:35:23 +00:00
Vedant Kumar	f4217f862a	[Sema] Add lvalue-to-rvalue cast in direct-list-initialization of enum After r264564, we allowed direct-list-initialization of an enum from an integral value in C++1z mode, so long as that value can convert to the enum's underlying type. In this kind of initialization, we need a lvalue-to-rvalue conversion for the initializer value if it is not a rvalue. This lets us accept the following code: enum class A : unsigned {}; A foo(unsigned x) { return A{x}; } Differential Revision: https://reviews.llvm.org/D29723 llvm-svn: 295266	2017-02-16 01:20:00 +00:00
Richard Smith	4e05eaa9e8	[c++1z] Diagnose non-deducible template parameters in deduction guide templates, per [temp.param]p11. llvm-svn: 295264	2017-02-16 00:36:47 +00:00
Hans Wennborg	cac8ce06dd	[dllimport] Check for dtor references in functions Destructor references are not modelled explicitly in the AST. This adds checks for destructor calls due to variable definitions and temporaries. If a dllimport function references a non-dllimport destructor, it must not be emitted available_externally, as the referenced destructor might live across the DLL boundary and isn't exported. llvm-svn: 295258	2017-02-15 23:28:10 +00:00
Hans Wennborg	6c3d625fd9	[dllimport] Look through typedefs and arrays in HasNonDllImportDtor The function is used to check whether a type is a class with non-dllimport destructor. It needs to look through typedefs and array types. llvm-svn: 295257	2017-02-15 23:28:07 +00:00
George Burgess IV	9584508d5c	[Modules] Consider enable_if attrs in isSameEntity. Two functions that differ only in their enable_if attributes are considered overloads, so we should check for those when we're trying to figure out if two functions are mergeable. We need to do the same thing for pass_object_size, as well. Looks like that'll be a bit less trivial, since we sometimes do these merging checks before we have pass_object_size attributes available (see the merge checks in ASTDeclReader::VisitFunctionDecl that happen before we read parameters, and merge checks in calls to ReadDeclAs<>()). llvm-svn: 295252	2017-02-15 22:43:27 +00:00
Richard Smith	90e043dae0	PR24440: Do not silently discard a fold-expression appearing as the operand of a cast-expression. llvm-svn: 295224	2017-02-15 19:57:10 +00:00
Argyrios Kyrtzidis	7d90ed0ac9	[index] USR generation: use getTemplateArgs() instead of getTemplateInstantiationArgs() Otherwise we may end up creating a different USR for the definition of a function, vs its declaration. llvm-svn: 295191	2017-02-15 16:16:27 +00:00
Sagar Thakur	9d0ed930ec	[XRAY][MIPS] Add -fxray-instrument for mips/mipsel/mips64/mips64el Summary: Adds xray instrument option for mips/mipsel/mips64/mips64el. Reviewed by sdardis, dberris Differential: D27698 llvm-svn: 295163	2017-02-15 10:41:38 +00:00
Eric Christopher	32ac5e41ef	Add a definition for __STRUCT_PARM_ALIGN__ for elfv2 and 64-bit darwin platforms to match what other compilers produce. llvm-svn: 295156	2017-02-15 07:50:11 +00:00
Akira Hatanaka	6697eff4b1	[Sema] Disallow returning a __block variable via a move. r274291 made changes to prefer calling a move constructor to calling a copy constructor when returning from a function. This caused programs to crash when a __block variable in the heap was moved out and used later. This commit fixes the bug by disallowing moving out of __block variables implicitly. rdar://problem/28181080 Differential Revision: https://reviews.llvm.org/D29908 llvm-svn: 295150	2017-02-15 05:15:28 +00:00
Richard Smith	171e4b545d	Fix assertion failure due to implicit special member lookup lacking a source location. llvm-svn: 295149	2017-02-15 04:18:23 +00:00
Richard Smith	650e5b28d2	Speculatively revert r295118 to see if it's what's causing the modules selfhost buildbots to fail. llvm-svn: 295146	2017-02-15 03:29:24 +00:00
Tim Shen	def72aad0b	[VLA] Fix the test failure on msvc by specifying the triple. Differential Revision: https://reviews.llvm.org/D24333 llvm-svn: 295127	2017-02-15 00:01:12 +00:00
Tim Shen	b34d0ef2ca	[VLA] Handle VLA size expression in a full-expression context. Summary: Previously the cleanups (e.g. dtor calls) are inserted into the outer scope (e.g. function body scope), instead of it's own scope. After the fix, the cleanups are inserted right after getting the size value. This fixes pr30306. Reviewers: rsmith Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D24333 llvm-svn: 295123	2017-02-14 23:46:37 +00:00
Richard Smith	222b30b9d3	Stop asserting when a meaningless -std= flag is passed for a non-compilation input kind; go back to silently ignoring the flag. llvm-svn: 295122	2017-02-14 23:41:38 +00:00
Richard Smith	1d5f95f52f	Do not implicitly instantiate the definition of a class template specialization that has been explicitly specialized! We assume in various places that we can tell the template specialization kind of a class type by looking at the declaration produced by TagType::getDecl. That was previously not quite true: for an explicit specialization, we could have first seen a template-id denoting the specialization (with a use that does not trigger an implicit instantiation of the defintiion) and then seen the first explicit specialization declaration. TagType::getDecl would previously return an arbitrary declaration when called on a not-yet-defined class; it now consistently returns the most recent declaration in that case. llvm-svn: 295118	2017-02-14 23:27:44 +00:00
Aaron Ballman	52d0aaac13	Improve diagnostic reporting when using __declspec without enabling __declspec as a keyword. Fixes PR31936. llvm-svn: 295114	2017-02-14 22:47:20 +00:00
Richard Smith	c91daf1cf9	[Driver] Report available language standards on user error In case user did not provide valid standard name for -std option, available values (with short description) will be reported. Patch by Paweł Żukowski! llvm-svn: 295113	2017-02-14 22:44:20 +00:00
Reid Kleckner	fb9f647e5f	MS inline asm: Filter MXCSR out of the inferred clobber list Since r295004, LLVM has started modelling this new register, but we don't have GCC constraint inline asm spellings for it yet. llvm-svn: 295107	2017-02-14 21:38:17 +00:00
Vedant Kumar	9503004089	[profiling] Update test cases to deal with name variable change (NFC) This is a re-try of r295085: fix up some test cases that assume that profile name variables are preserved by the instrprof pass. This catches one additional case in test/CoverageMapping/unused_names.c. llvm-svn: 295101	2017-02-14 20:03:56 +00:00
Vedant Kumar	0e052d89bd	Revert "[profiling] Update test case to deal with name variable change (NFC)" This reverts commit r295085, because the corresponding llvm change was reverted. llvm-svn: 295100	2017-02-14 20:03:53 +00:00
Vedant Kumar	4b5315f8ac	[profiling] Update test case to deal with name variable change (NFC) The 'profn' name variables shouldn't show up after we run the instrprof pass, see https://reviews.llvm.org/D29921 for more details. llvm-svn: 295085	2017-02-14 18:49:03 +00:00
Akira Hatanaka	f1b3fc7356	[CodeGen][ObjC] Use the type of the captured field of the enclosing block or lambda. This is a follow-up to r281682, which fixed a bug in computeBlockInfo where the captured VarDecl's type, rather than the captured field type of the enclosing lambda or block, was used to compute the layout of a block. This commit makes similar changes to enterBlockScope. This is necessary to correctly determine whether a block capture requires cleanup. rdar://problem/30388124 llvm-svn: 295034	2017-02-14 06:46:55 +00:00
NAKAMURA Takumi	eb7702e07d	clang/test/Parser/cxx1z-class-template-argument-deduction.cpp: Tweak to ignore thiscall. Line 38: multiple overloads of 'f' instantiate to the same signature 'void (int) __attribute__((thiscall))' llvm-svn: 295020	2017-02-14 03:18:24 +00:00
Richard Smith	c27b3d7623	Canonicalize implicit deduction guide parameter types when forming a deduction guide from a constructor. The purpose of this change is to avoid triggering instantiation of the class when substituting back into the deduction guide if it uses a typedef member. We will still instantiate the class if the constructor (explicitly or implicitly, directly or indirectly) uses the current instantiation in a way that we can't canonicalize out, but that seems unavoidable. llvm-svn: 295016	2017-02-14 01:49:59 +00:00
Richard Smith	cbe079321e	[c++1z] Add some more tests for class template argument deduction, add feature-test macro, and mark feature as done on status page. llvm-svn: 295011	2017-02-14 00:55:25 +00:00
David Majnemer	dc169759ca	[MS ABI] Correctly mangling vbase destructors They are a little bit of a special case in the mangling. They are always mangled without taking into account their virtual-ness of the destructor. They are also mangled to return void, unlike the actual destructor. This fixes PR31931. Differential Revision: https://reviews.llvm.org/D29912 llvm-svn: 295010	2017-02-14 00:54:11 +00:00
Richard Smith	3291877656	[c++1z] Synthesize implicit deduction guides from constructors on demand. Rank such guides below explicit ones, and ensure that references to the class's template parameters are not treated as forwarding references. We make a few tweaks to the wording in the current standard: 1) The constructor parameter list is copied faithfully to the deduction guide, without losing default arguments or a varargs ellipsis (which the standard wording loses by omission). 2) If the class template declares no constructors, we add a T() -> T<...> guide (which will only ever work if T has default arguments for all non-pack template parameters). 3) If the class template declares nothing that looks like a copy or move constructor, we add a T(T<...>) -> T<...> guide. #2 and #3 follow from the "pretend we had a class type with these constructors" philosophy for deduction guides. llvm-svn: 295007	2017-02-14 00:25:28 +00:00
Nick Lewycky	0752762180	When the new expr's array size is an ICE, emit it as a constant expression. This bypasses integer sanitization checks which are redundant on the expression since it's been checked by Sema. Fixes a clang codegen assertion on "void test() { new int[0+1]{0}; }" when building with -fsanitize=signed-integer-overflow. llvm-svn: 295006	2017-02-13 23:49:55 +00:00

1 2 3 4 5 ...

30560 Commits