llvm-project

Commit Graph

Author	SHA1	Message	Date
Vedant Kumar	9d2a16b9b1	[Coverage] Support for C++17 if initializers Differential Revision: https://reviews.llvm.org/D25572 llvm-svn: 284293	2016-10-14 23:38:16 +00:00
Vedant Kumar	f2a6ec5521	[Coverage] Support for C++17 switch initializers Differential Revision: https://reviews.llvm.org/D25539 llvm-svn: 284292	2016-10-14 23:38:13 +00:00
Douglas Katzman	3ed0f643fc	Implement no_sanitize_address for global vars llvm-svn: 284272	2016-10-14 19:55:09 +00:00
Manman Ren	3b5dbf23a4	Module: emit initializers in submodules when importing the parent module. When importing the parent module, module initializers in submodules should be emitted. rdar://28740482 llvm-svn: 284263	2016-10-14 18:55:44 +00:00
Albert Gutowski	1deab38717	Implement __stosb intrinsic as a volatile memset Summary: We need `__stosb` to be an intrinsic, because SecureZeroMemory function uses it without including intrin.h. Implementing it as a volatile memset is not consistent with MSDN specification, but it gives us target-independent IR while keeping the most important properties of `__stosb`. Reviewers: rnk, hans, thakis, majnemer Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D25334 llvm-svn: 284253	2016-10-14 17:33:05 +00:00
Albert Gutowski	5e08df0266	Add 64-bit MS _Interlocked functions as builtins again Summary: Previously global 64-bit versions of _Interlocked functions broke buildbots on i386, so now I'm adding them as builtins for x86-64 and ARM only (should they be also on AArch64? I had problems with testing it for AArch64, so I left it) Reviewers: hans, majnemer, mstorsjo, rnk Subscribers: cfe-commits, aemerson Differential Revision: https://reviews.llvm.org/D25576 llvm-svn: 284172	2016-10-13 22:35:07 +00:00
Justin Lebar	23d954241b	[CUDA] Emit deferred diagnostics during Sema rather than during codegen. Summary: Emitting deferred diagnostics during codegen was a hack. It did work, but usability was poor, both for us as compiler devs and for users. We don't codegen if there are any sema errors, so for users this meant that they wouldn't see deferred errors if there were any non-deferred errors. For devs, this meant that we had to carefully split up our tests so that when we tested deferred errors, we didn't emit any non-deferred errors. This change moves checking for deferred errors into Sema. See the big comment in SemaCUDA.cpp for an overview of the idea. This checking adds overhead to compilation, because we have to maintain a partial call graph. As a result, this change makes deferred errors a CUDA-only concept (whereas before they were a general concept). If anyone else wants to use this framework for something other than CUDA, we can generalize at that time. This patch makes the minimal set of test changes -- after this lands, I'll go back through and do a cleanup of the tests that we no longer have to split up. Reviewers: rnk Subscribers: cfe-commits, rsmith, tra Differential Revision: https://reviews.llvm.org/D25541 llvm-svn: 284158	2016-10-13 20:52:12 +00:00
Saleem Abdulrasool	887a82c5d6	CodeGen: ensure that the runtime calling convention matches Incorrect specification of the calling convention results in UB which can cause the code path to be eliminated. Simplify the existing code by using the RuntimeCall constructor in `CodeGenFunction`. llvm-svn: 284154	2016-10-13 19:45:08 +00:00
Arnold Schwaighofer	3d01ad116c	Swift Calling Convention: Fix out of bounds access Use iterator instead of address of element in vector It is not valid to access one after the last element. rdar://28759508 llvm-svn: 284150	2016-10-13 19:19:37 +00:00
Albert Gutowski	397d81bb9a	Implement MS _ReturnAddress and _AddressOfReturnAddress intrinsics Reviewers: rnk, thakis, majnemer, hans Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D25540 llvm-svn: 284131	2016-10-13 16:03:42 +00:00
Alexey Bataev	2f5ed34279	Fix for PR30639: CGDebugInfo Null dereference with OpenMP array access, by Erich Keane OpenMP creates a variable array type with a a null size-expr. The Debug generation failed to due to this. This patch corrects the openmp implementation, updates the tests, and adds a new one for this condition. Differential Revision: https://reviews.llvm.org/D25373 llvm-svn: 284110	2016-10-13 09:52:46 +00:00
Albert Gutowski	2a0621e58a	Implement MS _BitScan intrinsics Summary: _BitScan intrinsics (and some others, for example _Interlocked and _bittest) are supposed to work on both ARM and x86. This is an attempt to isolate them, avoiding repeating their code or writing separate function for each builtin. Reviewers: hans, thakis, rnk, majnemer Subscribers: RKSimon, cfe-commits, aemerson Differential Revision: https://reviews.llvm.org/D25264 llvm-svn: 284060	2016-10-12 22:01:05 +00:00
Arnold Schwaighofer	4fc955e669	Declare WinX86_64ABIInfo to satisfy SwiftABI info This is minimal support that allows swift's test cases on non windows platforms to pass. rdar://28738985 llvm-svn: 284032	2016-10-12 18:59:24 +00:00
Arnold Schwaighofer	5d2c510cf6	Pass the end of a component to SwiftAggLowering's enumerateComponents callback This is usefull for determining whether components overlap. llvm-svn: 283932	2016-10-11 20:34:03 +00:00
Mehdi Amini	7186a4323e	Revert "Change Builtins name to be stored as StringRef instead of raw pointers (NFC)" This reverts commit r283802. It introduces temporarily static initializers, because StringRef ctor isn't (yet) constexpr for string literals. I plan to get there this week, but apparently GCC is so terrible with these static initializer right now (10 min+ extra codegen time was reported) that I'll hold on to this patch till the constexpr one is ready, and land these at the same time. llvm-svn: 283920	2016-10-11 19:04:24 +00:00
Hal Finkel	8f96e82cb8	Add an option to save the backend-produced YAML optimization record to a file The backend now has the capability to save information from optimizations, the same information that can be used to generate optimization diagnostics but in machine-consumable form, into an output file. This can be enabled when using opt (see r282539), and this change enables it when using clang. The idea is that other tools will be able to consume these files, and perhaps in combination with the original source code, produce various kinds of optimization reports for users (and for compiler developers). We now have at-least two tools that can consume these files: * tools/llvm-opt-report * utils/opt-viewer Using the flag -fsave-optimization-record will cause the YAML file to be generated; the file name will be based on the output file name (if we're using -c or -S and have an output name), or the input file name. When we're using CUDA, or some other offloading mechanism, separate files are generated for each backend target. The output file name can be specified by the user using -foptimization-record-file=filename. Differential Revision: https://reviews.llvm.org/D25225 llvm-svn: 283834	2016-10-11 00:26:09 +00:00
Mehdi Amini	004b9c7aae	Store FileEntry::Filename as a StringRef instead of raw pointer (NFC) llvm-svn: 283815	2016-10-10 22:52:47 +00:00
Mehdi Amini	b1bdc47309	Change Builtins name to be stored as StringRef instead of raw pointers (NFC) llvm-svn: 283802	2016-10-10 21:34:29 +00:00
Nick Lewycky	6fdfaedd9d	Make the LValue created in EmitValueForIvarAtOffset have the same Qualifiers in the LValue as the QualType in the LValue. No functionality change intended. llvm-svn: 283795	2016-10-10 20:07:13 +00:00
Albert Gutowski	fcea61c563	Implement MS read/write barriers and __faststorefence intrinsic Reviewers: hans, rnk, majnemer Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D25442 llvm-svn: 283793	2016-10-10 19:40:51 +00:00
Richard Smith	b2f0f05742	Re-commit r283722, reverted in r283750, with a fix for a CUDA-specific use of past-the-end iterator. Original commit message: P0035R4: Semantic analysis and code generation for C++17 overaligned allocation. llvm-svn: 283789	2016-10-10 18:54:32 +00:00
Albert Gutowski	7216f17653	Implement __emul, __emulu, _mul128 and _umul128 MS intrinsics Reviewers: rnk, thakis, majnemer, hans Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D25353 llvm-svn: 283785	2016-10-10 18:09:27 +00:00
Justin Lebar	562914e505	Use unique_ptr for VPtrLocationsMap and VPtrInfoVector. Reviewers: timshen Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D25422 llvm-svn: 283770	2016-10-10 16:26:29 +00:00
Daniel Jasper	e9abe64816	Revert "P0035R4: Semantic analysis and code generation for C++17 overaligned allocation." This reverts commit r283722. Breaks: Clang.SemaCUDA.device-var-init.cu Clang.CodeGenCUDA.device-var-init.cu http://lab.llvm.org:8080/green/job/clang-stage1-cmake-RA-expensive/884/ llvm-svn: 283750	2016-10-10 14:13:55 +00:00
Richard Smith	189e52fcdf	P0035R4: Semantic analysis and code generation for C++17 overaligned allocation. llvm-svn: 283722	2016-10-10 06:42:31 +00:00
Justin Lebar	9fdb46e71c	[CUDA] Do a better job at detecting wrong-side calls. Summary: Move CheckCUDACall from ActOnCallExpr and BuildDeclRefExpr to DiagnoseUseOfDecl. This lets us catch some edge cases we were missing, specifically around class operators. This necessitates a few other changes: - Avoid emitting duplicate deferred diags in CheckCUDACall. Previously we'd carefully placed our call to CheckCUDACall such that it would only ever run once for a particular callsite. But now this isn't the case. - Emit deferred diagnostics from a template specialization/instantiation's primary template, in addition to from the specialization/instantiation itself. DiagnoseUseOfDecl ends up putting the deferred diagnostics on the template, rather than the specialization, so we need to check both. Reviewers: rsmith Subscribers: cfe-commits, tra Differential Revision: https://reviews.llvm.org/D24573 llvm-svn: 283637	2016-10-08 01:07:11 +00:00
Richard Smith	0511d23aeb	PR22924, PR22845, some of CWG1464: When checking the initializer for an array new expression, distinguish between the case of a constant and non-constant initializer. In the former case, if the bound is erroneous (too many initializer elements, bound is negative, or allocated size overflows), reject, and take the bound into account when determining whether we need to default-construct any elements. In the remanining cases, move the logic to check for default-constructibility of trailing elements into the initialization code rather than inventing a bogus array bound, to cope with cases where the number of initialized elements is not the same as the number of initializer list elements (this can happen due to string literal initialization or brace elision). This also fixes rejects-valid and crash-on-valid errors when initializing a new'd array of character type from a braced string literal. llvm-svn: 283406	2016-10-05 22:41:02 +00:00
Justin Lebar	3e6449b4f4	[CUDA] Mark device functions as nounwind. Summary: This prevents clang from emitting 'invoke's and catch statements. Things previously mostly worked thanks to TryToMarkNoThrow() in CodeGenFunction. But this is not a proper IPO, and it doesn't properly handle cases like mutual recursion. Fixes bug 30593. Reviewers: tra Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D25166 llvm-svn: 283272	2016-10-04 23:41:49 +00:00
Justin Lebar	49e7614efb	[CUDA] Destroy deferred diagnostics before destroying the ASTContext's PartialDiagnostic allocator. Summary: This will let us (in a separate patch) allocate deferred diagnostics in the ASTContext's PartialDiagnostic arena. Reviewers: rnk Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D25260 llvm-svn: 283271	2016-10-04 23:41:45 +00:00
Albert Gutowski	f3a0bce155	Separate builtins for x84-64 and i386; implement __mulh and __umulh Summary: We need x86-64-specific builtins if we want to implement some of the MS intrinsics - winnt.h contains definitions of some functions for i386, but not for x86-64 (for example _InterlockedOr64), which means that we cannot treat them as builtins for both i386 and x86-64, because then we have definitions of builtin functions in winnt.h on i386. Reviewers: thakis, majnemer, hans, rnk Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D24598 llvm-svn: 283264	2016-10-04 22:29:49 +00:00
Sanjay Patel	0bb72c1424	[clang] make reciprocal estimate codegen a function attribute The motivation for the change is that we can't have pseudo-global settings for codegen living in TargetOptions because that doesn't work with LTO. Ideally, these reciprocal attributes will be moved to the instruction-level via FMF, metadata, or something else. But making them function attributes is at least an improvement over the current state. I'm committing this patch ahead of the related LLVM patch to avoid bot failures, but if that patch needs to be reverted, then this should be reverted too. Differential Revision: https://reviews.llvm.org/D24815 llvm-svn: 283251	2016-10-04 20:44:05 +00:00
Vedant Kumar	e356f1a50c	[ubsan] Disable bounds-check for flexible array ivars This eliminates a class of false positives for -fsanitize=array-bounds on instrumented ObjC projects. Differential Revision: https://reviews.llvm.org/D22227 llvm-svn: 283249	2016-10-04 20:36:04 +00:00
Gor Nishanov	97e3b6d895	[coroutines] Adding builtins for coroutine intrinsics and backendutil support. Summary: With this commit simple coroutines can be created in plain C using coroutine builtins. Reviewers: rnk, EricWF, rsmith Subscribers: modocache, mgorny, mehdi_amini, beanz, cfe-commits Differential Revision: https://reviews.llvm.org/D24373 llvm-svn: 283155	2016-10-03 22:44:48 +00:00
Vedant Kumar	30914f3d1c	[ARC] Ignore qualifiers in copy-restore expressions When ARC is enabled, an ObjCIndirectCopyRestoreExpr models the passing of a function argument s.t: * The argument is copied into a temporary, * The temporary is passed into the function, and * After the function call completes, the temporary is move-assigned back to the original location of the argument. The argument type and the parameter type must agree "except possibly in qualification". This commit weakens an assertion in EmitCallArg() to actually reflect that. llvm-svn: 283116	2016-10-03 15:29:22 +00:00
Yaxun Liu	ea6b796e0e	[OpenCL] Fix bug in __builtin_astype causing invalid LLVM cast instructions __builtin_astype is used to cast OpenCL opaque types to other types, as such, it needs to be able to handle casting from and to pointer types correctly. Current it cannot handle 1) casting between pointers of different addr spaces 2) casting between pointer type and non-pointer types. This patch fixes that. Differential Revision: https://reviews.llvm.org/D25123 llvm-svn: 283114	2016-10-03 14:41:50 +00:00
Aditya Kumar	e84372b039	Alias must point to a definition Reapplying the patch after modifying the test case. Inlining the destructor caused the compiler to generate bad IR which failed the Verifier in the backend. https://llvm.org/bugs/show_bug.cgi?id=30341 This patch disables alias to available_externally definitions. Reviewers: eugenis, rsmith Differential Revision: https://reviews.llvm.org/D24682 llvm-svn: 283063	2016-10-02 03:06:36 +00:00
Hal Finkel	415c2a38f2	[PowerPC] Enable soft-float for PPC64, and +soft-float -> -hard-float Enable soft-float support on PPC64, as the backend now supports it. Also, the backend now uses -hard-float instead of +soft-float, so set the target features accordingly. Fixes PR26970. llvm-svn: 283061	2016-10-02 02:10:45 +00:00
Mehdi Amini	99d1b29503	Use StringRef for MemoryBuffer identifier API (NFC) llvm-svn: 283043	2016-10-01 16:38:28 +00:00
Mehdi Amini	117296c0a0	Use StringRef in Pass/PassManager APIs (NFC) llvm-svn: 283004	2016-10-01 02:56:57 +00:00
Mehdi Amini	b7fb124512	Use StringRef in Triple API (NFC) llvm-svn: 282996	2016-10-01 01:16:22 +00:00
Saleem Abdulrasool	8dbaf5cb4d	CodeGen: inherit DLLExport attribute in Windows Itanium When emitting the fundamental type information constants, inherit the DLLExportAttr from `__fundamental_type_info`. We would previously not honor the `__declspec(dllexport)` on the type information. llvm-svn: 282980	2016-09-30 23:11:05 +00:00
Martin Storsjo	ed95a08ea4	[MS] Implement __iso_volatile loads/stores as builtins These are supposed to produce the same as normal volatile pointer loads/stores. When -volatile:ms is specified, normal volatile pointers are forced to have atomic semantics (as is the default on x86 in MSVC mode). In that case, these builtins should still produce non-atomic volatile loads/stores without acquire/release semantics, which the new test verifies. These are only available on ARM (and on AArch64, although clang doesn't support AArch64/Windows yet). This implements what is missing for PR30394, making it possible to compile C++ for ARM in MSVC mode with MSVC headers. Differential Revision: https://reviews.llvm.org/D24986 llvm-svn: 282900	2016-09-30 19:13:46 +00:00
Victor Leschuk	b3e7d68d5c	Cosmetic fix: deleted unnecessary line break in comment. llvm-svn: 282846	2016-09-30 06:39:48 +00:00
Justin Lebar	9091055efa	Move UTF functions into namespace llvm. Summary: This lets people link against LLVM and their own version of the UTF library. I determined this only affects llvm, clang, lld, and lldb by running $ git grep -wl 'UTF[0-9]\+\\|\bConvertUTF\bisLegalUTF\\|getNumBytesFor' \| cut -f 1 -d '/' \| sort \| uniq clang lld lldb llvm Tested with ninja lldb ninja check-clang check-llvm check-lld (ninja check-lldb doesn't complete for me with or without this patch.) Reviewers: rnk Subscribers: klimek, beanz, mgorny, llvm-commits Differential Revision: https://reviews.llvm.org/D24996 llvm-svn: 282822	2016-09-30 00:38:45 +00:00
Richard Smith	a560ccf2af	Switch to a different workaround for unimplementability of P0145R3 in MS ABIs. Instead of ignoring the evaluation order rule, ignore the "destroy parameters in reverse construction order" rule for the small number of problematic cases. This only causes incorrect behavior in the rare case where both parameters to an overloaded operator <<, >>, ->*, &&, \|\|, or comma are of class type with non-trivial destructor, and the program is depending on those parameters being destroyed in reverse construction order. We could do a little better here by reversing the order of parameter destruction for those functions (and reversing the argument evaluation order for all direct calls, not just those with operator syntax), but that is not a complete solution to the problem, as the same situation can be reached by an indirect function call. Approach reviewed off-line by rnk. llvm-svn: 282777	2016-09-29 21:30:12 +00:00
Aditya Kumar	09a8c7d489	Revert "[PR30341] Alias must point to a definition" This reverts commit r282679. Ninja check fails, reverting to debug the issue. llvm-svn: 282710	2016-09-29 11:37:23 +00:00
Aditya Kumar	13a18fecdd	[PR30341] Alias must point to a definition Inlining the destructor caused the compiler to generate bad IR which failed the Verifier in the backend. https://llvm.org/bugs/show_bug.cgi?id=30341 This patch disables alias to available_externally definitions. Reviewers: eugenis, rsmith Differential Revision: https://reviews.llvm.org/D24682 llvm-svn: 282679	2016-09-29 03:32:04 +00:00
Richard Smith	762672a73a	Re-commit r282556, reverted in r282564, with a fix to CallArgList::addFrom to function correctly when targeting MS ABIs (this appears to have never mattered prior to this change). Update test case to always cover both 32-bit and 64-bit Windows ABIs, since they behave somewhat differently from each other here. Update test case to also cover operators , && and \|\|, which it appears are also affected by P0145R3 (they're not explicitly called out by the design document, but this is the emergent behavior of the existing wording). Original commit message: P0145R3 (C++17 evaluation order tweaks): evaluate the right-hand side of assignment and compound-assignment operators before the left-hand side. (Even if it's an overloaded operator.) This completes the implementation of P0145R3 + P0400R0 for all targets except Windows, where the evaluation order guarantees for <<, >>, and ->* are unimplementable as the ABI requires the function arguments are evaluated from right to left (because parameter destructors are run from left to right in the callee). llvm-svn: 282619	2016-09-28 19:09:10 +00:00
Artem Belevich	fda9905062	[CUDA] added __nvvm_atom_{sys\|cta}_* builtins. These builtins are available on sm_60+ GPU only. Differential Revision: https://reviews.llvm.org/D24944 llvm-svn: 282609	2016-09-28 17:47:35 +00:00
Richard Smith	4499145a5f	Revert r282556. This change made several bots unhappy. llvm-svn: 282564	2016-09-28 02:20:06 +00:00
Richard Smith	97a616d624	P0145R3 (C++17 evaluation order tweaks): evaluate the right-hand side of assignment and compound-assignment operators before the left-hand side. (Even if it's an overloaded operator.) This completes the implementation of P0145R3 + P0400R0 for all targets except Windows, where the evaluation order guarantees for <<, >>, and ->* are unimplementable as the ABI requires the function arguments are evaluated from right to left (because parameter destructors are run from left to right in the callee). llvm-svn: 282556	2016-09-27 23:44:22 +00:00
Alex Lorenz	08780529b3	[Coverage] The coverage region for switch covers the code after the switch. This patch fixes a regression introduced in r262697 that changed the way the coverage regions for switches are constructed. The PGO instrumentation counter for a switch statement refers to the counter at the exit of the switch. Therefore, the coverage region for the switch statement should cover the code that comes after the switch, and not the switch statement itself. rdar://28480997 Differential Revision: https://reviews.llvm.org/D24981 llvm-svn: 282554	2016-09-27 23:30:36 +00:00
Adam Nemet	b4e64a77d3	Shorten DiagnosticInfoOptimizationRemark* to OptimizationRemark*. NFC With the new streaming interface in LLVM, these class names need to be typed a lot and it's way too looong. llvm-svn: 282545	2016-09-27 22:19:29 +00:00
Adam Nemet	699fc5b191	Adapt to LLVM optimization remark interface change. NFC llvm-svn: 282540	2016-09-27 20:55:12 +00:00
Adam Nemet	95d0c628cf	Revert "Adapt to LLVM optimization remark interface change. NFC" This reverts commit r282500. llvm-svn: 282504	2016-09-27 16:39:27 +00:00
Adam Nemet	8f1e871088	Adapt to LLVM optimization remark interface change. NFC llvm-svn: 282500	2016-09-27 16:15:21 +00:00
Nemanja Ivanovic	10e2b5dcaa	[Power9] Builtins for ELF v.2 ABI conformance - front end portion This patch corresponds to review: https://reviews.llvm.org/D24397 It adds the __POWER9_VECTOR__ macro and the -mpower9-vector option along with a number of altivec.h functions (refer to the code review for a list). llvm-svn: 282481	2016-09-27 10:45:22 +00:00
Richard Smith	4088571c51	Remove default argument from lambda to appease old MSVC. llvm-svn: 282464	2016-09-27 00:53:24 +00:00
Richard Smith	bde62d78e9	P0145R3 (C++17 evaluation order tweaks): evaluate the base expression before the pointer-to-member expression in calls through .* and ->* expressions. llvm-svn: 282457	2016-09-26 23:56:57 +00:00
Richard Smith	9e67b9922b	P0145R3 (C++17 evaluation order tweaks): consistently emit the LHS of array subscripting before the RHS, regardless of which is the base and which is the index. llvm-svn: 282453	2016-09-26 23:49:47 +00:00
Konstantin Zhuravlyov	5b48d725a0	[AMDGPU] Expose flat work group size, register and wave control attributes __attribute__((amdgpu_flat_work_group_size(<min>, <max>))) - request minimum and maximum flat work group size __attribute__((amdgpu_waves_per_eu(<min>[, <max>]))) - request minimum and/or maximum waves per execution unit Differential Revision: https://reviews.llvm.org/D24513 llvm-svn: 282371	2016-09-26 01:02:57 +00:00
Peter Collingbourne	2d3a26ffb9	Update clang for r282299. llvm-svn: 282301	2016-09-23 21:43:51 +00:00
Sjoerd Meijer	e9eb0913a9	Revert of r282255 because of "Fell off the end of a string-switch" buildbot failures. llvm-svn: 282257	2016-09-23 15:37:17 +00:00
Sjoerd Meijer	0bfdab7a38	Fix for r280064 that added options for fp denormals and exceptions. These options were forgotten to be copied in setCommandLineOpts. llvm-svn: 282255	2016-09-23 15:21:33 +00:00
Alexey Bader	465c18973d	[OpenCL] Augment pipe built-ins with pipe packet size and alignment. Reviewers: Anastasia, vpykhtin Subscribers: dmitry, cfe-commits Differential Revision: https://reviews.llvm.org/D23992 llvm-svn: 282252	2016-09-23 14:20:00 +00:00
Saleem Abdulrasool	82f6added3	CodeGen: further merge cstring literal construction Use the new CreateCStringLiteral in an additional site. Now all the C string literals are created in one function. Furthermore, mark the additional literal as an `unnamed_addr constant`. llvm-svn: 281997	2016-09-20 18:38:54 +00:00
Nick Lewycky	d9bce5062e	Replace 'isProvablyNonNull' with existing utility llvm::IsKnownNonNull which handles more cases. Noticed by inspection. Because of how the IR generation works, this isn't expected to cause an observable difference. llvm-svn: 281979	2016-09-20 15:49:58 +00:00
Dehao Chen	dd6f8cab08	Remove InstructionCombining and its related pass from sample pgo passes as we can handle "invoke" correctly. Summary: We previously relies on InstructionCombining pass to remove invoke instructions. Now that we can inline invoke instructions correctly, we do not need these passes any more. Reviewers: dnovillo Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D24730 llvm-svn: 281910	2016-09-19 16:02:52 +00:00
Saleem Abdulrasool	3f307518f8	CodeGen: mark ObjC cstring literals as unnamed_addr These are all emitted into a section with a cstring_literal attribute. The attribute permits the linker to coalesce the string contents. The address of the strings are not important. llvm-svn: 281855	2016-09-18 16:12:14 +00:00
Saleem Abdulrasool	0c54dc862e	CodeGen: mark ObjC cstring literals as constant These strings are constants, mark them as such. This doesn't matter too much in practice on MachO since the constants are placed into a special section and not referred to directly. llvm-svn: 281854	2016-09-18 16:12:04 +00:00
Saleem Abdulrasool	271106cbb9	CodeGen: refactor the ObjC cstring literal creation This refactors the cstring literal creation as mentioned in the couple of FIXMEs littered in the various invocations to CreateMetadataVar. This centralises the definition of the literals, and will enable changing the literal creation to a single site. NFC. llvm-svn: 281798	2016-09-16 23:41:13 +00:00
Richard Smith	d8e3ac3185	Fix a couple of wrong-code bugs in switch-on-constant optimization: * recurse through intermediate LabelStmts and AttributedStmts when checking whether a statement inside a switch declares a variable * if the end of a compound statement is reachable from the chosen case label, and the compound statement contains a variable declaration, it's not valid to just emit the contents of the compound statement -- we must emit the statement itself or we lose the scope (and thus end lifetimes at the wrong point) llvm-svn: 281797	2016-09-16 23:30:39 +00:00
Saleem Abdulrasool	39217d4d05	CodeGen: use pointer rather than reference in range loop Address post-commit comments from Justin Bogner. Explicitly indicate that the dereferenced iterator provides a pointer rather than a reference. NFC. llvm-svn: 281730	2016-09-16 14:24:26 +00:00
John McCall	d23b27e0d8	Alter the iOS/tvOS ARM64 C++ ABI to ignore the upper half of the virtual table offset in a member function pointer. We are reserving this space for future ABI use relating to alternative v-table configurations. In the meantime, continue to zero-initialize this space when actually emitting a member pointer literal. This will successfully interoperate with existing compilers. Future versions of the compiler may place additional data in this location, and at that point, code emitted by compilers prior to this patch will fail if exposed to such a member pointer. This is therefore a somewhat hard ABI break. However, because it is limited to an uncommon case of an uncommon language feature, and especially because interoperation with the standard library does not depend on member pointers, we believe that with a sufficiently advance compiler change the impact of this break will be minimal in practice. llvm-svn: 281693	2016-09-16 02:40:45 +00:00
Akira Hatanaka	d542ccfc97	[CodeGen][ObjC] Block captures should inherit the type of the captured field in the enclosing lambda or block. This patch fixes a bug in code-gen where it uses the type of the declared variable rather than the type of the capture of the enclosing lambda or block for the block capture. For example, in the following function, code-gen currently uses i32* for the block capture "a" because "a" is passed to foo1 as a reference, but it should use i32 since the enclosing lambda captures "a" by value. void foo1(int &a) { auto lambda = [a]{ auto block1 = ^{ i = a; }; block1(); }; lambda(); } rdar://problem/18586386 Differential Revision: https://reviews.llvm.org/D21104 llvm-svn: 281682	2016-09-16 00:02:06 +00:00
Albert Gutowski	727ab8a803	Add some MS aliases for existing intrinsics Reviewers: thakis, compnerd, majnemer, rsmith, rnk Subscribers: alexshap, cfe-commits Differential Revision: https://reviews.llvm.org/D24330 llvm-svn: 281540	2016-09-14 21:19:43 +00:00
Dehao Chen	5d4f0be5b8	Convert finite to builtin Summary: This patch converts finite/__finite to builtin functions so that it will be inlined by compiler. Reviewers: hfinkel, davidxl, efriedma Subscribers: efriedma, llvm-commits Differential Revision: https://reviews.llvm.org/D24483 llvm-svn: 281509	2016-09-14 17:34:14 +00:00
Saleem Abdulrasool	7246dcc880	CodeGen: simplify the logic a slight bit Move the definition of `getTriple()` into the header. It would just call `getTarget().getTriple()`. Inline the definition to allow the compiler to see the same amount of the layout as previously. Remove the more verbose `getTarget().getTriple()` in favour of `getTriple()`. llvm-svn: 281487	2016-09-14 15:17:46 +00:00
Kostya Serebryany	60cdd6113f	[sanitizer-coverage] add yet another flavour of coverage instrumentation: trace-pc-guard. The intent is to eventually replace all of {bool coverage, 8bit-counters, trace-pc} with just this one. Clang part llvm-svn: 281432	2016-09-14 01:39:49 +00:00
Hans Wennborg	1b3aee7ff9	Also don't inline dllimport functions referring to non-dllimport constructors. The AST walker wasn't visiting CXXConstructExprs before. This is a follow-up to r281395. llvm-svn: 281413	2016-09-13 22:51:42 +00:00
Akira Hatanaka	255abad9b1	[CodeGen] Fix an assert in EmitNullConstant. r235815 changed CGRecordLowering::accumulateBases to ignore non-virtual bases of size 0, which prevented adding those non-virtual bases to CGRecordLayout's NonVirtualBases. This caused clang to assert when CGRecordLayout::getNonVirtualBaseLLVMFieldNo was called in EmitNullConstant. This commit fixes the bug by ignoring zero-sized non-virtual bases in EmitNullConstant. rdar://problem/28100139 Differential Revision: https://reviews.llvm.org/D24312 llvm-svn: 281405	2016-09-13 22:13:02 +00:00
Albert Gutowski	fc19fa3721	Temporary fix for MS _Interlocked intrinsics llvm-svn: 281401	2016-09-13 21:51:37 +00:00
Albert Gutowski	9918cb6573	Reverse commit 281375 (breaks building Chromium) llvm-svn: 281399	2016-09-13 21:24:51 +00:00
Hans Wennborg	93f7547260	Try harder to not inline dllimport functions referencing non-dllimport functions In r246338, code was added to check for this, but it failed to take into account implicit destructor invocations because those are not reflected in the AST. This adds a separate check for them. llvm-svn: 281395	2016-09-13 21:08:20 +00:00
Albert Gutowski	ce7a9a47b2	Add bunch of _Interlocked builtins Reviewers: compnerd, thakis, Prazek, majnemer, rnk Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D24153 llvm-svn: 281378	2016-09-13 19:43:33 +00:00
Albert Gutowski	ae3fb3113f	Add some MS aliases for existing intrinsics Reviewers: thakis, compnerd, majnemer, rsmith, rnk Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D24330 llvm-svn: 281375	2016-09-13 19:26:42 +00:00
Manman Ren	e6be26c8d4	ObjectiveC generics: Add ObjCTypeParamType in the type system. We also need to add ObjCTypeParamTypeLoc. ObjCTypeParamType supports the representation of "T <protocol>" where T is a type parameter. Before this, we use TypedefType to represent the type parameter for ObjC. ObjCTypeParamType has "ObjCTypeParamDecl *OTPDecl" and it extends from ObjCProtocolQualifiers. It is a non-canonical type and is canonicalized to the underlying type with the protocol qualifiers. rdar://24619481 rdar://25060179 Differential Revision: http://reviews.llvm.org/D23079 llvm-svn: 281355	2016-09-13 17:25:08 +00:00
Adam Nemet	1eea3e577d	Reapply r281276 with passing -emit-llvm in one of the tests Original commit message: Add -fdiagnostics-show-hotness Summary: I've recently added the ability for optimization remarks to include the hotness of the corresponding code region. This uses PGO and allows filtering of the optimization remarks by relevance. The idea was first discussed here: http://thread.gmane.org/gmane.comp.compilers.llvm.devel/98334 The general goal is to produce a YAML file with the remarks. Then, an external tool could dynamically filter these by hotness and perhaps by other things. That said it makes sense to also expose this at the more basic level where we just include the hotness info with each optimization remark. For example, in D22694, the clang flag was pretty useful to measure the overhead of the additional analyses required to include hotness. (Without the flag we don't even run the analyses.) For the record, Hal has already expressed support for the idea of this patch on IRC. Differential Revision: https://reviews.llvm.org/D23284 llvm-svn: 281293	2016-09-13 04:32:40 +00:00
Peter Collingbourne	eeb56abe64	Update Clang for D20147 ("DebugInfo: New metadata representation for global variables.") Differential Revision: http://reviews.llvm.org/D20415 llvm-svn: 281285	2016-09-13 01:13:19 +00:00
Adam Nemet	f2b6883ac8	Revert "Add -fdiagnostics-show-hotness" This reverts commit r281276. Many bots are failing. llvm-svn: 281279	2016-09-13 00:16:49 +00:00
Reid Kleckner	6c7b1c6212	[DebugInfo] Deduplicate debug info limiting logic We should be doing the same checks when a type is completed as we do when a complete type is used during emission. Previously, we duplicated the logic, and it got out of sync. This could be observed with dllimported classes. Also reduce a test case for this slightly. Implementing review feedback from David Blaikie on r281057. llvm-svn: 281278	2016-09-13 00:01:23 +00:00
Adam Nemet	a340eff335	Add -fdiagnostics-show-hotness Summary: I've recently added the ability for optimization remarks to include the hotness of the corresponding code region. This uses PGO and allows filtering of the optimization remarks by relevance. The idea was first discussed here: http://thread.gmane.org/gmane.comp.compilers.llvm.devel/98334 The general goal is to produce a YAML file with the remarks. Then, an external tool could dynamically filter these by hotness and perhaps by other things. That said it makes sense to also expose this at the more basic level where we just include the hotness info with each optimization remark. For example, in D22694, the clang flag was pretty useful to measure the overhead of the additional analyses required to include hotness. (Without the flag we don't even run the analyses.) For the record, Hal has already expressed support for the idea of this patch on IRC. Differential Revision: https://reviews.llvm.org/D23284 llvm-svn: 281276	2016-09-12 23:48:16 +00:00
Saleem Abdulrasool	62c07eb2fa	CodeGen: use some range-based for loops Use range-based for loops to simplify the logic. Add an explicit check for MachO as the inline asm uses MachO specific directives. llvm-svn: 281261	2016-09-12 21:15:23 +00:00
David Majnemer	cb60a4305b	[MS ABI] Add /include directives for dynamic TLS MSVC emits /include directives in the .drective section for the __dyn_tls_init function (decorated as ___dyn_tls_init@12 for 32-bit). This fixes PR30347. llvm-svn: 281189	2016-09-12 02:51:43 +00:00
Saleem Abdulrasool	4fab7454c5	CodeGen: remove unnecessary else case Refactor the assignment so that its much more clear that the if-clause contains the lookup, and once cached is directly used. NFC. llvm-svn: 281150	2016-09-11 01:25:15 +00:00
Reid Kleckner	22466a92e1	[DebugInfo] Ensure complete type is emitted with -fstandalone-debug The logic for upgrading a class from a forward decl to a complete type was not checking the debug info emission level before applying the vtable optimization. This meant we ended up without debug info for a class which was required to be complete. I noticed it because it triggered an assertion during CodeView emission, but that's a separate issue. llvm-svn: 281057	2016-09-09 17:03:53 +00:00
Reid Kleckner	c9404e1039	[codeview] Extend the heuristic for detecting classes imported from DLLs If a dynamic class contains a dllimport method, then assume the class may not be constructed in this DLL, and therefore the vtable will live in a different PDB. This heuristic is still incomplete, and will miss things like abstract base classes that are only constructed on one side of the DLL interface. That said, this heuristic does detect some cases that are currently problematic, and may be useful to other projects that don't use many DLLs. llvm-svn: 281053	2016-09-09 16:27:04 +00:00
Amaury Sechet	21f51b3a32	Update clang for D21514. NFC Summary: As per title. Reviewers: ahatanak, bkramer, whitequark, mehdi_amini, void Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D21515 llvm-svn: 281018	2016-09-09 04:42:49 +00:00
Richard Smith	8df390f9eb	C++ Modules TS: Add parsing and some semantic analysis support for export-declarations. These don't yet have an effect on name visibility; we still export everything by default. llvm-svn: 280999	2016-09-08 23:14:54 +00:00
Albert Gutowski	b6a11acb53	Implement MS _rot intrinsics Reviewers: thakis, Prazek, compnerd, rnk Subscribers: majnemer, cfe-commits Differential Revision: https://reviews.llvm.org/D24311 llvm-svn: 280997	2016-09-08 22:32:19 +00:00
Simon Pilgrim	4acc49e58d	Moved unreachable to appease msvc, gcc and clang llvm-svn: 280921	2016-09-08 11:03:41 +00:00
Simon Pilgrim	48c32b1504	Fixed a 'not all control paths return a value' warning on MSVC builds llvm-svn: 280917	2016-09-08 09:59:58 +00:00
Peter Collingbourne	e53683f97b	CodeGen: Clean up implementation of vtable initializer builder. NFC. - Simplify signature of CreateVTableInitializer function. - Move vtable component builder to a separate function. - Remove unnecessary accessors from VTableLayout class. This is in preparation for a future change that will alter the type of the vtable initializer. Differential Revision: https://reviews.llvm.org/D22642 llvm-svn: 280897	2016-09-08 01:14:39 +00:00
Reid Kleckner	e5a321b5e8	[MS] Fix prologue this adjustment when 'this' is passed indirectly Move the logic for doing this from the ABI argument lowering into EmitParmDecl, which runs for all parameters. Our codegen is slightly suboptimal in this case, as we may leave behind a dead store after optimization, but it's 32-bit inalloca, and this fixes the bug in a robust way. Fixes PR30293 llvm-svn: 280836	2016-09-07 18:21:30 +00:00
Reid Kleckner	034e727001	[MS] Fix 'this' type when calling virtual methods with inalloca If the virtual method comes from a secondary vtable, then the type of the 'this' parameter should be i8, and not a pointer to the complete class. In the MS ABI, the 'this' parameter on entry points to the vptr containing the virtual method that was called, so we use i8 instead of the normal type. We had a mismatch where the CGFunctionInfo of the call didn't match the CGFunctionInfo of the declaration, and this resulted in some assertions, but now both sides agree the type of 'this' is i8*. Fixes one issue raised in PR30293 llvm-svn: 280815	2016-09-07 15:15:51 +00:00
Matt Arsenault	8afb5cd894	Fix whitespace issues ^M and extra space llvm-svn: 280786	2016-09-07 07:07:59 +00:00
Leny Kholodov	df050fd585	Formatting with clang-format patch r280701 llvm-svn: 280718	2016-09-06 17:06:14 +00:00
Leny Kholodov	80c047d2c4	DebugInfo: use llvm::DINode::DIFlags type for debug info flags Use llvm::DINode::DIFlags type (strongly typed enum) for debug flags instead of unsigned int to avoid problems on platforms with sizeof(int) < 4: we already have flags with values > (1 << 16). Patch by: Victor Leschuk <vleschuk@gmail.com> Differential Revision: https://reviews.llvm.org/D23767 llvm-svn: 280701	2016-09-06 10:48:04 +00:00
Alexey Bader	3e0b817b91	[OpenCL] Remove access qualifiers on images in arg info metadata. Summary: Remove access qualifiers on images in arg info metadata: * kernel_arg_type * kernel_arg_base_type Image access qualifiers are inseparable from type in clang implementation, but OpenCL spec provides a special query to get access qualifier via clGetKernelArgInfo with CL_KERNEL_ARG_ACCESS_QUALIFIER. Besides that OpenCL conformance test_api get_kernel_arg_info expects image types without access qualifier. Patch by Evgeniy Tyurin. Reviewers: bader, yaxunl, Anastasia Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D23915 llvm-svn: 280699	2016-09-06 10:10:28 +00:00
Honggyu Kim	2b0e424b2f	[Frontend] Fix mcount inlining bug Since some profiling tools, such as gprof, ftrace, and uftrace, use -pg option to generate a mcount function call at the entry of each function. Function invocation can be detected by this hook function. But mcount insertion is done before function inlining phase in clang, sometime a function that already has a mcount call can be inlined in the middle of another function. This patch adds an attribute "counting-function" to each function rather than emitting the mcount call directly in frontend so that this attribute can be processed in backend. Then the mcount calls can be properly inserted in backend after all the other optimizations are completed. Link: https://llvm.org/bugs/show_bug.cgi?id=28660 Reviewers: hans, rjmccall, hfinkel, rengolin, compnerd Subscribers: shenhan, cfe-commits Differential Revision: https://reviews.llvm.org/D22666 llvm-svn: 280355	2016-09-01 11:29:21 +00:00
Honggyu Kim	2bbdeacf31	Remove whitespace to test commit access llvm-svn: 280337	2016-09-01 06:14:45 +00:00
Nick Lewycky	97e49ac59e	Add -fprofile-dir= to clang. -fprofile-dir=path allows the user to specify where .gcda files should be emitted when the program is run. In particular, this is the first flag that causes the .gcno and .o files to have different paths, LLVM is extended to support this. -fprofile-dir= does not change the file name in the .gcno (and thus where lcov looks for the source) but it does change the name in the .gcda (and thus where the runtime library writes the .gcda file). It's different from a GCOV_PREFIX because a user can observe that the GCOV_PREFIX_STRIP will strip paths off of -fprofile-dir= but not off of a supplied GCOV_PREFIX. To implement this we split -coverage-file into -coverage-data-file and -coverage-notes-file to specify the two different names. The !llvm.gcov metadata node grows from a 2-element form {string coverage-file, node dbg.cu} to 3-elements, {string coverage-notes-file, string coverage-data-file, node dbg.cu}. In the 3-element form, the file name is already "mangled" with .gcno/.gcda suffixes, while the 2-element form left that to the middle end pass. llvm-svn: 280306	2016-08-31 23:04:32 +00:00
Reid Kleckner	598124296b	[codeview] Don't emit vshape info for classes without vfptrs Classes with no virtual methods or whose virtual methods were all inherited from virtual bases don't have a vfptr at offset zero. We were crashing attempting to get the layout of that non-existent vftable. We don't need any vshape info in this case because the debugger can infer it from the base class information. The current class may not introduce any virtual methods if we are in this situation. llvm-svn: 280287	2016-08-31 20:35:01 +00:00
Reid Kleckner	dc124996d2	[codeview] Pass through vftable shape information The shape is really just the number of methods in the vftable, since we don't support 16 bit far calls. All calls are near. Encode this number in the size of the artificial __vtbl_ptr_type DIDerivedType that we generate. For DWARF, this will be a normal pointer, but for codeview this will be a wide pointer that gets pattern matched into a VFTableShape record. Insert this type into the element list of all dynamic classes when emitting CodeView, so that the backend can emit the shape even if the vptr lives in a primary base class. Fixes PR28150 llvm-svn: 280255	2016-08-31 16:11:43 +00:00
Igor Kudrin	fc05ee344c	[Coverage] Suppress creating a code region if the same area is covered by an expansion region. In most cases these code regions are just redundant, but sometimes they could be assigned to the counter of the parent code region instead of the counter of the nested block. Differential Revision: https://reviews.llvm.org/D23987 llvm-svn: 280199	2016-08-31 07:04:16 +00:00
Sjoerd Meijer	0a8d4216ad	This adds new options -fdenormal-fp-math and passes through option -ffast-math to CC1, which are translated to function attributes and can e.g. be mapped on build attributes FP_exceptions and FP_denormal. Setting these build attributes allows better selection of floating point libraries. Differential Revision: https://reviews.llvm.org/D23840 llvm-svn: 280064	2016-08-30 08:09:45 +00:00
Hal Finkel	84832a7a79	[PowerPC] Update the DWARF register-size table The PPC64 DWARF register-size table did not match the ABI specification (or GCC, for that matter). Fix that, and add a regression test. Fixes PR27931. llvm-svn: 280053	2016-08-30 02:38:34 +00:00
Kostya Serebryany	3b41971763	[sanitizer-coverage] add two more modes of instrumentation: trace-div and trace-gep, mostly usaful for value-profile-based fuzzing; clang part llvm-svn: 280044	2016-08-30 01:27:03 +00:00
Igor Kudrin	8545dae226	[Coverage] Prevent creating a redundant counter if a nested body ends with a macro. If there were several nested statements arranged in a way that all of them end up with the same macro, then the expansion of this macro was assigned with all the corresponding counters of these statements. As a result, the wrong counter value was shown for the macro in llvm-cov. This patch fixes the issue by preventing adding a counter for an expanded source range if it already has an assigned counter, which is expected to come from the most specific statement. Differential Revision: https://reviews.llvm.org/D23160 llvm-svn: 279962	2016-08-29 11:48:50 +00:00
Reid Kleckner	d8b0466e19	Widen type of __offset_flags in RTTI on Mingw64 Otherwise we can't handle secondary base classes at offsets greater than 2**24. This agrees with libstdc++abi. We could extend this change to other LLP64 platforms, but then we would want to update libc++abi and it would require additional review. Fixes PR29116 llvm-svn: 279786	2016-08-25 22:16:30 +00:00
Reid Kleckner	b04449d97a	[MS] Win64 va_arg should expect large arguments to be passed indirectly Fixes PR20569 llvm-svn: 279774	2016-08-25 20:42:26 +00:00
Reid Kleckner	44051e63de	[MS] Pass non-trivially-copyable objects indirectly on Windows ARM This isn't exactly what MSVC does, unfortunately. MSVC does not pass objects with destructors but no copy constructors by address. More ARM expertise is required to really understand what should be done here. Fixes PR29136. llvm-svn: 279764	2016-08-25 18:23:28 +00:00
David Blaikie	a45c31a5b4	DebugInfo: Add flag to CU to disable emission of inline debug info into the skeleton CU In cases where .dwo/.dwp files are guaranteed to be available, skipping the extra online (in the .o file) inline info can save a substantial amount of space - see the original r221306 for more details there. llvm-svn: 279651	2016-08-24 18:29:58 +00:00
Adam Nemet	9c84859075	[Pragma] Clear loop distribution attribute between loops llvm-svn: 279608	2016-08-24 04:31:56 +00:00
Adrian Prantl	09906a6e87	Add comments. NFC llvm-svn: 279490	2016-08-22 22:38:16 +00:00
Adrian Prantl	a72972b985	Module debug info: Don't assert when encountering an incomplete definition in isDefinedInClangModule() and assume that the incomplete definition is not defined in the module. This broke the -gmodules self host recently. rdar://problem/27894367 llvm-svn: 279485	2016-08-22 22:23:58 +00:00
Matt Arsenault	88d7da01ca	AMDGPU: Handle structs directly in AMDGPUABIInfo Structs are currently handled as pointer + byval, which makes AMDGPU LLVM backend generate incorrect code when structs are used. This patch changes struct argument to be handled directly and without flattening, which Clover (Mesa 3D Gallium OpenCL state tracker) will be able to handle. Flattening would expand the struct to individual elements and pass each as a separate argument, which Clover can not handle. Furthermore, such expansion does not fit the OpenCL programming model which requires to explicitely specify each argument index, size and memory location. Patch by Vedran Miletić llvm-svn: 279463	2016-08-22 19:25:59 +00:00
David Blaikie	87173f108a	PR29086: DebugInfo: Improve support for fixed array dimensions in variable length arrays llvm-svn: 279445	2016-08-22 17:49:56 +00:00
Yaxun Liu	26f7566ff8	Re-commit [OpenCL] AMDGCN: Fix size_t type There was a premature cast to pointer type in emitPointerArithmetic which caused assertion in tests with assertion enabled. llvm-svn: 279206	2016-08-19 05:17:25 +00:00
Changpeng Fang	03bdd8f797	AMDGPU: Add clang builtin for ds_swizzle. Summary: int __builtin_amdgcn_ds_swizzle (int a, int imm); while imm is a constant. Differential Revision: http://reviews.llvm.org/D23682 llvm-svn: 279165	2016-08-18 22:04:54 +00:00
Justin Bogner	882f861cc7	CodeGen: Rename a variable to better fit LLVM style. NFC llvm-svn: 279159	2016-08-18 21:46:54 +00:00
Saleem Abdulrasool	be25c486dc	CodeGen: use range based for loop, NFC llvm-svn: 279154	2016-08-18 21:40:06 +00:00
Yaxun Liu	dea5ccb04b	Revert [OpenCL] AMDGCN: Fix size_t type due to regressions in test/CodeGen/exprs.c on certain platforms. llvm-svn: 279127	2016-08-18 20:01:06 +00:00
Yaxun Liu	6305f8a351	[OpenCL] AMDGCN: Fix size_t type Pointers of certain GPUs in AMDGCN target in private address space is 32 bit but pointers in other address spaces are 64 bit. size_t type should be defined as 64 bit for these GPUs so that it could hold pointers in all address spaces. Also fixed issues in pointer arithmetic codegen by using pointer specific intptr type. Differential Revision: https://reviews.llvm.org/D23361 llvm-svn: 279121	2016-08-18 19:34:04 +00:00
Diana Picus	8b44bbc077	Revert "[OpenMP] Sema and parsing for 'teams distribute simd’ pragma" This reverts commit r279003 as it breaks some of our buildbots (e.g. clang-cmake-aarch64-quick, clang-x86_64-linux-selfhost-modules). The error is in OpenMP/teams_distribute_simd_ast_print.cpp: clang: /home/buildslave/buildslave/clang-cmake-aarch64-quick/llvm/include/llvm/ADT/DenseMap.h:527: bool llvm::DenseMapBase<DerivedT, KeyT, ValueT, KeyInfoT, BucketT>::LookupBucketFor(const LookupKeyT&, const BucketT&) const [with LookupKeyT = clang::Stmt; DerivedT = llvm::DenseMap<clang::Stmt, long unsigned int>; KeyT = clang::Stmt; ValueT = long unsigned int; KeyInfoT = llvm::DenseMapInfo<clang::Stmt>; BucketT = llvm::detail::DenseMapPair<clang::Stmt, long unsigned int>]: Assertion `!KeyInfoT::isEqual(Val, EmptyKey) && !KeyInfoT::isEqual(Val, TombstoneKey) && "Empty/Tombstone value shouldn't be inserted into map!"' failed. llvm-svn: 279045	2016-08-18 09:25:07 +00:00
Adrian Prantl	576b2dbec5	Support object-file-wrapped modules in clang -module-file-info. rdar://problem/24504815 llvm-svn: 279004	2016-08-17 23:13:53 +00:00
Kelvin Li	0e3bde8216	[OpenMP] Sema and parsing for 'teams distribute simd’ pragma This patch is to implement sema and parsing for 'teams distribute simd’ pragma. This patch is originated by Carlo Bertolli. Differential Revision: https://reviews.llvm.org/D23528 llvm-svn: 279003	2016-08-17 23:13:03 +00:00
Adrian Prantl	26cb1d2660	Module debug info: Fix a bug in handling record decls without fields. The previous condition would erroneously mark all CXXRecordDecls that didn't have any fields as being defined in a clang module. This patch fixes the condition to only apply to explicit template instantiations. <rdar://problem/27771823> llvm-svn: 278952	2016-08-17 18:27:24 +00:00
Adrian Prantl	fd5ac8a0ea	Debug info: Mark noreturn functions with DIFlagNoReturn. This affects functions with the C++11 [[ noreturn ]] and C11 _Noreturn specifiers. Patch by Victor Leschuk! https://reviews.llvm.org/D23168 llvm-svn: 278942	2016-08-17 16:20:32 +00:00
Mehdi Amini	406aa22c6f	[ThinLTO] Adapt backend invocation to llvm API changes. Reviewers: tejohnson Subscribers: mehdi_amini, llvm-commits Differential Revision: https://reviews.llvm.org/D23579 llvm-svn: 278906	2016-08-17 06:23:08 +00:00
Duncan P. N. Exon Smith	01f574cdd5	CodeGen: Avoid dereferencing end() in ScalarExprEmitter::EmitOverflowCheckedBinOp Use BB.getNextNode(), which returns nullptr on end(), instead of &*BB.getIterator(), which is UB on end(). CodeGenFunction::createBasicBlock expects nullptr in this case already. llvm-svn: 278898	2016-08-17 03:15:29 +00:00
Chandler Carruth	b72c19f1a6	[PM] Update Clang for LLVM's r278896 which re-organized a header. (sorry this didn't get landed closer in time...) llvm-svn: 278897	2016-08-17 03:09:11 +00:00
Adrian McCarthy	992429843b	Emit debug info for dynamic classes if they are imported from a DLL. With -debug-info-kind=limited, we omit debug info for dynamic classes that live in other TUs. This reduces duplicate type information. When statically linked, the type information comes together. But if your binary has a class derived from a base in a DLL, the base class info is not available to the debugger. The decision is made in shouldOmitDefinition (CGDebugInfo.cpp). Per a suggestion from rnk, I've tweaked the decision so that we do include definitions for classes marked as DLL imports. This should be a relatively small number of classes, so we don't pay a large price for duplication of the type info, yet it should cover most cases on Windows. Essentially this makes debug info for DLLs independent, but we still assume that all TUs within the same DLL will be consistently built with (or without) debug info and the debugger will be able to search across the debug info within that scope to resolve any declarations into definitions, etc. llvm-svn: 278861	2016-08-16 22:11:18 +00:00
Reid Kleckner	66e7717b46	Revert "[X86] Add xgetbv/x[X86] Add xgetbv xsetbv intrinsics to non-windows platforms" This reverts commit r278783. It breaks usage of _xgetbv on Windows. llvm-svn: 278814	2016-08-16 16:04:14 +00:00
James Molloy	5980232178	Left shifts of negative values are defined if -fwrapv is set This means we shouldn't emit ubsan detection code or warn. Fixes PR25552. llvm-svn: 278786	2016-08-16 09:45:36 +00:00
Marina Yatsina	197b65f833	[X86] Add xgetbv/x[X86] Add xgetbv xsetbv intrinsics to non-windows platforms commit on behalf of guyblank Differential Revision: https://reviews.llvm.org/D21959 llvm-svn: 278783	2016-08-16 08:13:36 +00:00
Justin Lebar	60dcc1344a	Add the notion of deferred diagnostics. Summary: This patch lets you create diagnostics that are emitted if and only if a particular FunctionDecl is codegen'ed. This is necessary for CUDA, where some constructs -- e.g. calls from host+device functions to host functions when compiling for device -- are allowed to appear in semantically-correct programs, but only if they're never codegen'ed. Reviewers: rnk Subscribers: cfe-commits, tra Differential Revision: https://reviews.llvm.org/D23241 llvm-svn: 278735	2016-08-15 20:38:56 +00:00
David Majnemer	b439dfe6ba	[CodeGen] Ignore unnamed bitfields before handling vector fields We processed unnamed bitfields after our logic for non-vector field elements in records larger than 128 bits. The vector logic would determine that the bit-field disqualifies the record from occupying a register despite the unnamed bit-field not participating in the record size nor its alignment. N.B. This behavior matches GCC and ICC. llvm-svn: 278656	2016-08-15 07:20:40 +00:00
David Majnemer	b229cb0a43	[CodeGen] Correctly implement the AVX512 psABI rules An __m512 vector type wrapped in a structure should be passed in a vector register. Our prior implementation was based on a draft version of the psABI. This fixes PR28975. N.B. The update to the ABI was made here: https://github.com/hjl-tools/x86-psABI/commit/30f9c9 llvm-svn: 278655	2016-08-15 06:39:18 +00:00
Richard Smith	da38363784	P0217R3: code generation support for decomposition declarations. llvm-svn: 278642	2016-08-15 01:33:41 +00:00
Artem Belevich	4c09318be2	[CUDA] Place GPU binary into .nv_fatbin section and align it by 8. This matches the way nvcc encapsulates GPU binaries into host object file. Now cuobjdump can deal with clang-compiled object files. Differential Revision: https://reviews.llvm.org/D23429 llvm-svn: 278549	2016-08-12 18:44:01 +00:00
Teresa Johnson	9e3f4746d5	CodeGen: Replace ThinLTO backend implementation with a client of LTO/Resolution. Summary: This changes clang to use the llvm::lto::thinBackend function instead of its own less comprehensive ThinLTO backend implementation. Patch by Peter Collingbourne Reviewers: tejohnson, mehdi_amini Subscribers: cfe-commits, mehdi_amini Differential Revision: https://reviews.llvm.org/D21545 llvm-svn: 278541	2016-08-12 18:12:08 +00:00
Joey Gouly	ddbda40245	[OpenCL] Change block descriptor address space to constant. The block descriptor is a GlobalVariable in the LLVM IR, so it shouldn't be in the private address space. llvm-svn: 278234	2016-08-10 15:57:02 +00:00
Chandler Carruth	4c5e8ccf74	[x86] Fix a really nasty bug introduced in r276417 where alignment constraints were added to _mm256_broadcast_{pd,ps} intel intrinsics. The spec for these intrinics is ... pretty much silent on alignment. This is especially frustrating considering the amount of discussion of alignment in the load and store instrinsics. So I was forced to rely on the specification for the VBROADCASTF128 instruction. That instruction's spec is also completely silent on alignment. Fortunately, when it comes to the instruction's spec, silence is enough. There is no #GP fault option for an underaligned address so this instruction, and by inference the intrinsic, can read any alignment. As it happens, the old code worked exactly this way and in fact we have plenty of code that hands pointers with less than 16-byte alignment to these intrinsics. This code broke pretty spectacularly with this commit. Fortunately, the fix is super simple! Change a 16 to a 1, and ta da! Anyways, a lot of debugging for a really boring fix. =] llvm-svn: 278202	2016-08-10 07:32:47 +00:00
Yaxun Liu	ffb60901fe	[OpenCL] Handle -cl-fp32-correctly-rounded-divide-sqrt Let the driver pass the option to frontend. Do not set precision metadata for division instructions when this option is set. Set function attribute "correctly-rounded-divide-sqrt-fp-math" based on this option. Differential Revision: https://reviews.llvm.org/D22940 llvm-svn: 278155	2016-08-09 20:10:18 +00:00
Charles Davis	0e37911334	Revert "[Attr] Add support for the `ms_hook_prologue` attribute." This reverts commit r278050. It depends on r278048, which will be reverted. llvm-svn: 278052	2016-08-08 21:19:08 +00:00
Charles Davis	3e43970d71	[Attr] Add support for the `ms_hook_prologue` attribute. Summary: Based on a patch by Michael Mueller. This attribute specifies that a function can be hooked or patched. This mechanism was originally devised by Microsoft for hotpatching their binaries (which they're constantly updating to stay ahead of crackers, script kiddies, and other ne'er-do-wells on the Internet), but it's now commonly abused by Windows programs that want to hook API functions. It is for this reason that this attribute was added to GCC--hence the name, `ms_hook_prologue`. Depends on D19908. Reviewers: rnk, aaron.ballman Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D19909 llvm-svn: 278050	2016-08-08 21:03:39 +00:00
Oliver Stannard	218c4cbd3d	[ARM] Command-line options for embedded position-independent code This patch (with the corresponding ARM backend patch) adds support for some new relocation models: * Read-only position independence (ROPI): Code and read-only data is accessed PC-relative. The offsets between all code and RO data sections are known at static link time. * Read-write position independence (RWPI): Read-write data is accessed relative to a static base register. The offsets between all writeable data sections are known at static link time. These two modes are independent (they specify how different objects should be addressed), so they can be used individually or together. These modes are intended for bare-metal systems or systems with small real-time operating systems. They are designed to avoid the need for a dynamic linker, the only initialisation required is setting the static base register to an appropriate value for RWPI code. There is one C construct not currently supported by these modes: global variables initialised to the address of another global variable or function, where that address is not known at static-link time. There are a few possible ways to solve this: * Disallow this, and require the user to write their own initialisation function if they need variables like this. * Emit dynamic initialisers for these variables in the compiler, called from the .init_array section (as is currently done for C++ dynamic initialisers). We have a patch to do this, described in my original RFC email (http://lists.llvm.org/pipermail/llvm-dev/2015-December/093022.html), but the feedback from that RFC thread was that this is not something that belongs in clang. * Use a small dynamic loader to fix up these variables, by adding the difference between the load and execution address of the relevant section. This would require linker co-operation to generate a table of addresses that need fixing up. Differential Revision: https://reviews.llvm.org/D23196 llvm-svn: 278016	2016-08-08 15:28:40 +00:00
David Blaikie	2a58a18d67	PR26423: Assert on valid use of using declaration of a function with an undeduced auto return type For now just disregard the using declaration in this case. Suboptimal, but wiring up the ability to have declarations of functions that are separate from their definition (we currently only do that for member functions) and have differing return types (we don't have any support for that) is more work than seems reasonable to at least fix this crash. llvm-svn: 277852	2016-08-05 19:03:01 +00:00
Wei Ding	91c8450967	AMDGPU : Add Clang builtin intrinsics for compare with the full wavefront result. Differential Revision: http://reviews.llvm.org/D22934 llvm-svn: 277824	2016-08-05 15:38:46 +00:00
Kelvin Li	0253287633	[OpenMP] Sema and parsing for 'teams distribute' pragma This patch is to implement sema and parsing for 'teams distribute' pragma. Differential Revision: https://reviews.llvm.org/D23189 llvm-svn: 277818	2016-08-05 14:37:37 +00:00
Alexey Bader	d81623261a	[OpenCL] Added underscores to the names of 'to_addr' OpenCL built-ins. Summary: In order to re-define OpenCL built-in functions 'to_{private,local,global}' in OpenCL run-time library LLVM names must be different from the clang built-in function names. Reviewers: yaxunl, Anastasia Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D23120 llvm-svn: 277743	2016-08-04 18:06:27 +00:00
Yaxun Liu	99444cb860	[OpenCL] Fix size of image type The size of image type is reported incorrectly as size of a pointer to address space 0, which causes error when casting image type to pointers by __builtin_astype. The fix is to get image address space from TargetInfo then report the size accordingly. Differential Revision: https://reviews.llvm.org/D22927 llvm-svn: 277647	2016-08-03 20:38:06 +00:00
Paul Robinson	78fb132af0	Add FIXMEs for MSVC 2013 hacks in r277211. NFC. llvm-svn: 277396	2016-08-01 22:12:46 +00:00
Saleem Abdulrasool	4a7130a8fb	CodeGen: simplify the CC handling for TLS wrappers Use the calling convention of the wrapper directly to set the calling convention to ensure that the calling convention matches. Incorrectly setting the calling convention results in the code path being entirely nullified as InstCombine + SimplifyCFG will prune the mismatched CC calls. llvm-svn: 277390	2016-08-01 21:31:24 +00:00
Reid Kleckner	755220bcef	[codeview] Skip injected class names in nested record emission We were already trying to do this, but our check wasn't quite right. Fixes PR28790 llvm-svn: 277367	2016-08-01 18:56:13 +00:00
Hans Wennborg	bc1b58d086	Fix VS2013 build of CGOpenMPRuntime.cpp It seems the compiler was getting confused by the in-class initializers in local struct MapInfo, so moving those to a default constructor instead. llvm-svn: 277256	2016-07-30 00:41:37 +00:00
Paul Robinson	15c840052e	Fix CGOpenMPRuntime.cpp for VS2013. NFC. I don't know why these changes work but they do. llvm-svn: 277211	2016-07-29 20:46:16 +00:00
Saleem Abdulrasool	369f4d64a2	CodeGen: try harder to make the CFString structure RW The previous change was insufficient to mark the content as read-write as the structure itself was marked constant. Adjust this and add tests to ensure that the section is marked appropriately as being read-write. llvm-svn: 277200	2016-07-29 19:15:51 +00:00
Matt Masten	6731dead22	Initial vectorization support for svml calls (short vector math library). Differential Revision: https://reviews.llvm.org/D19544 llvm-svn: 277167	2016-07-29 16:44:24 +00:00
Yaxun Liu	0bc4b2d337	[OpenCL] Generate opaque type for sampler_t and function call for the initializer Currently Clang use int32 to represent sampler_t, which have been a source of issue for some backends, because in some backends sampler_t cannot be represented by int32. They have to depend on kernel argument metadata and use IPA to find the sampler arguments and global variables and transform them to target specific sampler type. This patch uses opaque pointer type opencl.sampler_t* for sampler_t. For each use of file-scope sampler variable, it generates a function call of __translate_sampler_initializer. For each initialization of function-scope sampler variable, it generates a function call of __translate_sampler_initializer. Each builtin library can implement its own __translate_sampler_initializer(). Since the real sampler type tends to be architecture dependent, allowing it to be initialized by a library function simplifies backend design. A typical implementation of __translate_sampler_initializer could be a table lookup of real sampler literal values. Since its argument is always a literal, the returned pointer is known at compile time and easily optimized to finally become some literal values directly put into image read instructions. This patch is partially based on Alexey Sotkin's work in Khronos Clang (`3d4eec6162`). Differential Revision: https://reviews.llvm.org/D21567 llvm-svn: 277024	2016-07-28 19:26:30 +00:00
Samuel Antao	44bcdb3731	[OpenMP] Change name of variable in mappble expression. This attempts to fix a failure in Windows bots pottentially related with a reserved keyword. llvm-svn: 276988	2016-07-28 15:31:29 +00:00
Samuel Antao	cf3f83e46b	[OpenMP] Do not use default argument in lambda from mappable expressions handlers. Windows bots were complaining about that. llvm-svn: 276981	2016-07-28 14:47:35 +00:00
Samuel Antao	6890b09634	[OpenMP] Code generation for the is_device_ptr clause Summary: This patch adds support for the is_device_ptr clause. It expands SEMA to use the mappable expression logic that can only be tested with code generation in place and check conflicts with other data sharing related clauses using the mappable expressions infrastructure. Reviewers: hfinkel, carlo.bertolli, arpith-jacob, kkwli0, ABataev Subscribers: caomhin, cfe-commits Differential Revision: https://reviews.llvm.org/D22788 llvm-svn: 276978	2016-07-28 14:25:09 +00:00
Samuel Antao	cc10b85789	[OpenMP] Codegen for use_device_ptr clause. Summary: This patch adds support for the use_device_ptr clause. It includes changes in SEMA that could not be tested without codegen, namely, the use of the first private logic and mappable expressions support. Reviewers: hfinkel, carlo.bertolli, arpith-jacob, kkwli0, ABataev Subscribers: caomhin, cfe-commits Differential Revision: https://reviews.llvm.org/D22691 llvm-svn: 276977	2016-07-28 14:23:26 +00:00
Samuel Antao	03a3cec480	[OpenMP] Add support to map member expressions with references to pointers. Summary: This patch add support to map pointers through references in class members. Although a reference does not have storage that a user can access, it still has to be mapped in order to get the deep copy right and the dereferencing code work properly. Reviewers: hfinkel, carlo.bertolli, arpith-jacob, kkwli0, ABataev Subscribers: caomhin, cfe-commits Differential Revision: https://reviews.llvm.org/D22787 llvm-svn: 276934	2016-07-27 22:52:16 +00:00
Samuel Antao	403ffd409f	[OpenMP] Add support for mapping array sections through pointer references. Summary: This patch fixes a bug in the map of array sections whose base is a reference to a pointer. The existing mapping support was not prepared to deal with it, causing the compiler to crash. Mapping a reference to a pointer enjoys the same characteristics of a regular pointer, i.e., it is passed by value. Therefore, the reference has to be materialized in the target region. Reviewers: hfinkel, carlo.bertolli, kkwli0, ABataev Subscribers: caomhin, cfe-commits Differential Revision: https://reviews.llvm.org/D22690 llvm-svn: 276933	2016-07-27 22:49:49 +00:00
Justin Lebar	e56360a2cd	[CUDA] Align kernel launch args correctly when the LLVM type's alignment is different from the clang type's alignment. Summary: Before this patch, we computed the offsets in memory of args passed to GPU kernel functions by throwing all of the args into an LLVM struct. clang emits packed llvm structs basically whenever it feels like it, and packed structs have alignment 1. So we cannot rely on the llvm type's alignment matching the C++ type's alignment. This patch fixes our codegen so we always respect the clang types' alignments. Reviewers: rnk Subscribers: cfe-commits, tra Differential Revision: https://reviews.llvm.org/D22879 llvm-svn: 276927	2016-07-27 22:36:21 +00:00
Justin Lebar	ed4f172c00	Don't crash when generating code for __attribute__((naked)) member functions. Summary: Previously this crashed inside EmitThisParam(). There should be no prelude for naked functions, so just skip the whole thing. Reviewers: majnemer Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D22715 llvm-svn: 276925	2016-07-27 22:04:24 +00:00
Nirav Dave	993a139847	Add flags to toggle preservation of assembly comments Summary: Add -fpreserve-as-comments and -fno-preserve-as-comments. Reviewers: echristo, rnk Subscribers: mehdi_amini, llvm-commits Differential Revision: https://reviews.llvm.org/D22883 llvm-svn: 276907	2016-07-27 19:57:40 +00:00
Pirama Arumuga Nainar	bb846a32e4	Adjust coercion of aggregates on RenderScript Summary: In RenderScript, the size of the argument or return value emitted in the IR is expected to be the same as the size of corresponding qualified type. For ARM and AArch64, the coercion performed by Clang can change the parameter or return value to a type whose size is different (usually larger) than the original aggregate type. Specifically, this can happen in the following cases: - Aggregate parameters of size <= 64 bytes and return values smaller than 4 bytes on ARM - Aggregate parameters and return values smaller than bytes on AArch64 This patch coerces the cases above to an integer array that is the same size and alignment as the original aggregate. A new field is added to TargetInfo to detect a RenderScript target and limit this coercion just to that case. Tests added to test/CodeGen/renderscript.c Reviewers: rsmith Subscribers: aemerson, srhines, llvm-commits Differential Revision: https://reviews.llvm.org/D22822 llvm-svn: 276904	2016-07-27 19:01:51 +00:00
Vedant Kumar	efd319a2ad	[Coverage] Do not write out coverage mappings with zero entries After r275121, we stopped mapping regions from system headers. Lambdas declared in regions belonging to system headers started producing empty coverage mappings, since the files corresponding to their spelling locs were being ignored. The coverage reader doesn't know what to do with these empty mappings. This commit makes sure that we don't produce them and adds a test. I'll make the reader stricter in a follow-up commit. llvm-svn: 276716	2016-07-26 00:24:59 +00:00
Xinliang David Li	b65f8ae9e8	[Profile] Use a flag to enable PGO rather than the profraw filename Patch by Jake VanAdrighem Differential Revision: http://reviews.llvm.org/D22608 llvm-svn: 276517	2016-07-23 04:28:59 +00:00
Richard Smith	bdb84f374c	P0217R3: Parsing support and framework for AST representation of C++1z decomposition declarations. There are a couple of things in the wording that seem strange here: decomposition declarations are permitted at namespace scope (which we partially support here) and they are permitted as the declaration in a template (which we reject). llvm-svn: 276492	2016-07-22 23:36:59 +00:00
Xinliang David Li	b7b335a2ce	[Profile] Enable profile merging with -fprofile-generat[=<dir>] This patch enables raw profile merging for this option which is the new intended behavior. llvm-svn: 276484	2016-07-22 22:25:01 +00:00
Anna Thomas	142ea99832	Clang changes for overloading invariant.start and end intrinsics This change depends on the corresponding LLVM change at: https://reviews.llvm.org/D22519 The llvm.invariant.start and llvm.invariant.end intrinsics currently support specifying invariant memory objects only in the default address space. With this LLVM change, these intrinsics are overloaded for any adddress space for memory objects and we can use these llvm invariant intrinsics in non-default address spaces. Example: llvm.invariant.start.p1i8(i64 4, i8 addrspace(1)* %ptr) This overloaded intrinsic is needed for representing final or invariant memory in managed languages. llvm-svn: 276448	2016-07-22 17:50:08 +00:00
Anna Thomas	b772151a17	test commit. update comment grammatically. NFC llvm-svn: 276425	2016-07-22 15:37:56 +00:00
Simon Pilgrim	2d8517303c	[X86][AVX] Added support for lowering to VBROADCASTF128/VBROADCASTI128 with generic IR As discussed on D22460, I've updated the vbroadcastf128 pd256/ps256 builtins to map directly to generic IR - load+splat a 128-bit vector to both lanes of a 256-bit vector. Fix for PR28657. llvm-svn: 276417	2016-07-22 13:58:56 +00:00
Wolfgang Pieb	24e03341af	Reverting r275115 which caused PR28634. When empty (forwarding) basic blocks that are referenced by user labels are removed, incorrect code may be generated. llvm-svn: 276361	2016-07-21 23:28:18 +00:00
Erik Pilkington	1ac8adfcab	[CodeGen] Fix a crash when constant folding switch statement Differential revision: https://reviews.llvm.org/D22542 llvm-svn: 276350	2016-07-21 22:31:40 +00:00
Adrian McCarthy	ab1e786503	Reroll "Include unreferenced nested types in member list only for CodeView" Another attempt at r276271, hopefully without breaking ModuleDebugInfo test. llvm-svn: 276317	2016-07-21 18:43:20 +00:00
Adrian McCarthy	a9a89ae77f	Revert "Include unreferenced nested types in member list only for CodeView" Patch broke ModuleDebugInfo test on the build bots (but not locally). Again. svn revision: r276271 This reverts commit 9da8a1b05362bc96f2855fb32b5588b89407685d. llvm-svn: 276279	2016-07-21 13:41:25 +00:00
Adrian McCarthy	e89c62a102	Include unreferenced nested types in member list only for CodeView Unreferenced nested structs and classes were omitted from the debug info. In DWARF, this was intentional, to avoid bloat. But for CodeView, we want this information to be consistent with what Microsoft tools would produce and expect. llvm-svn: 276271	2016-07-21 13:16:14 +00:00
Davide Italiano	b99fabd4ec	[CodeGen] Handle recursion in LLVMIRGeneration Timer. This can happen when emitting a local decl, which triggers loading a decl imported from an AST file, which we then hand to the AST consumer. Timer is not allowed to recurse so an assertion fire. Keep a reference counter to avoid this problem. LGTM'd by Richard Smith on IRC. Differential Revision: https://reviews.llvm.org/D20748 llvm-svn: 276242	2016-07-21 06:28:48 +00:00
Kelvin Li	986330c190	[OpenMP] Sema and parsing for 'target simd' pragma This patch is to implement sema and parsing for 'target simd' pragma. Differential Revision: https://reviews.llvm.org/D22479 llvm-svn: 276203	2016-07-20 22:57:10 +00:00
John McCall	4c7718d51b	When copying an array into a lambda, destroy temporaries from the copy-constructor immediately and enter a partial array cleanup for previously-copied elements. Fixes PR28595. llvm-svn: 276180	2016-07-20 21:02:43 +00:00
Yaxun Liu	37ceedeabd	[OpenCL] AMDGCN target will generate images in constant address space Allows AMDGCN target to generate images (such as %opencl.image2d_t) in constant address space. Images will still be generated in global address space by default. Added tests to existing opencl-types.cl in test\CodeGenOpenCL. Patch by Aaron En Ye Shi. Differential Revision: https://reviews.llvm.org/D22523 llvm-svn: 276161	2016-07-20 19:21:11 +00:00
Richard Smith	dc1f042171	[modules] Don't emit initializers for VarDecls within a module eagerly whenever we first touch any part of that module. Instead, defer them until the first time that module is (transitively) imported. The initializer step for a module then recursively initializes modules that its own headers imported. For example, this avoids running the <iostream> global initializer in programs that don't actually use iostreams, but do use other parts of the standard library. llvm-svn: 276159	2016-07-20 19:10:16 +00:00
Reid Kleckner	8ad06d6546	[MS] Improve VPtrInfo field names and doc comments 'ReusingBase' was a terrible name. It might actually refer to the most derived class, which is not a base. 'BaseWithVPtr' was also bad, since again, it could refer to the most derived class. It was actually the first base to introduce the vptr, so now it is 'IntroducingObject'. llvm-svn: 276120	2016-07-20 14:40:25 +00:00
Yaxun Liu	f2e8ab2566	[OpenCL] Fixes bug of missing OCL version metadata on the AMDGCN target Added the opencl.ocl.version metadata to be emitted with amdgcn. Created a static function emitOCLVerMD which is shared between triple spir and target amdgcn. Also added new testcases to existing test file, spir_version.cl inside test/CodeGenOpenCL. Patch by Aaron En Ye Shi. Differential Revision: https://reviews.llvm.org/D22424 llvm-svn: 276010	2016-07-19 19:39:45 +00:00
Alexey Bataev	5140e748b5	[OPENMP] Improved processing of 'priority' clause, NFC. Removed some old comments + improved handling of 'priority' clause value during codegen after comments from Richard Smith. llvm-svn: 275945	2016-07-19 04:21:09 +00:00
Vedant Kumar	d04929d875	[Coverage] Remove '..' from filenames after getting an absolute path Failure to do this breaks relative paths which begin with '..'. This issue was caught by the (still nascent) coverage bot. llvm-svn: 275924	2016-07-18 22:32:02 +00:00
Vedant Kumar	14f8fb6864	[Coverage] Normalize '..' out of filename strings This fixes the issue of having duplicate entries for the same file in a coverage report s.t none of the entries actually displayed the correct coverage information. llvm-svn: 275913	2016-07-18 21:01:27 +00:00
Mehdi Amini	9670f847b8	[NFC] Header cleanup Summary: Removed unused headers, replaced some headers with forward class declarations Patch by: Eugene <claprix@yandex.ru> Differential Revision: https://reviews.llvm.org/D20100 llvm-svn: 275882	2016-07-18 19:02:11 +00:00
Saleem Abdulrasool	7093e21ea5	CodeGen: honour dllstorage on ObjC types Add support for ObjC types to respect the DLLImport/DLLExport storage annotations. This only effects COFF output. This would allow usage with clang/C2, but not with clang/LLVM due to hard coded section names. llvm-svn: 275737	2016-07-17 22:27:44 +00:00
Saleem Abdulrasool	a088ad9721	CodeGen: whitespace, formatting cleanups, NFC Format some code which was oddly formatted. Use a bit of auto to make the code more legible. NFC. llvm-svn: 275736	2016-07-17 22:27:41 +00:00
Saleem Abdulrasool	e5f3eae854	CodeGen: whitespace cleanup, StringRef usage in ObjC EH type construction Clean up some formatting issues and use a bit more StringRef based operations instead of SmallStrings. NFC. llvm-svn: 275735	2016-07-17 22:27:38 +00:00
David Majnemer	58ed0f3a5c	[CodeGen] Some assorted cleanups No functional change, just some cleanups: - Use auto when it is appropriate. - There were some strange static_casts which were superfluous. - Use range-based for loops when appropriate. - The dyn_cast_or_null construct was used when null was impossible. llvm-svn: 275699	2016-07-17 00:39:12 +00:00
Saleem Abdulrasool	10fd1ff56a	CodeGen: use StringRefs more in ObjC class generation, NFC Rather than building up a number of SmallString-s in order to construct a std::string, use more StringRefs and construct the string once before use. This avoids unnecessary string constructions. NFC. llvm-svn: 275697	2016-07-16 22:42:06 +00:00
Saleem Abdulrasool	bc2d9998ea	CodeGen: simplify using a local variable, NFC Add a couple of local variables for the class interface and the super class interface. This allows for the repeated access of the information to be cached and makes the code simpler to understand. NFC. llvm-svn: 275696	2016-07-16 22:42:04 +00:00
Matt Arsenault	c7536a5d60	AMDGPU: Remove legacy ldexp builtin llvm-svn: 275623	2016-07-15 21:33:06 +00:00
Matt Arsenault	c86671da09	AMDGPU: Update for rsq intrinsic changes llvm-svn: 275622	2016-07-15 21:33:02 +00:00
Wei Ding	ea41f356bb	AMDGPU: Add Clang Builtin for v_lerp_u8 Differential Revision: http://reviews.llvm.org/D22380 llvm-svn: 275577	2016-07-15 16:43:03 +00:00
Peter Collingbourne	03f8907f65	Frontend: Simplify ownership model for clang's output streams. This changes the CompilerInstance::createOutputFile function to return a std::unique_ptr<llvm::raw_ostream>, rather than an llvm::raw_ostream implicitly owned by the CompilerInstance. This in most cases required that I move ownership of the output stream to the relevant ASTConsumer. The motivation for this change is to allow BackendConsumer to be a client of interfaces such as D20268 which take ownership of the output stream. Differential Revision: http://reviews.llvm.org/D21537 llvm-svn: 275507	2016-07-15 00:55:40 +00:00
Kelvin Li	a579b9196c	[OpenMP] Sema and parsing for 'target parallel for simd' pragma This patch is to implement sema and parsing for 'target parallel for simd' pragma. Differential Revision: http://reviews.llvm.org/D22096 llvm-svn: 275365	2016-07-14 02:54:56 +00:00
Richard Smith	a547eb27fa	P0305R0: Semantic analysis and code generation for C++17 init-statement for 'if' and 'switch': if (stmt; condition) { ... } Patch by Anton Bikineev! Some minor formatting and comment tweets by me. llvm-svn: 275350	2016-07-14 00:11:03 +00:00
Aaron Ballman	7d2aecbc76	Add XRay flags to Clang. We implement two flags to control the XRay behaviour: -fxray-instrument: enables XRay annotation of IR -fxray-instruction-threshold: configures the threshold for function size (looking at IR instructions), and allow LLVM to decide whether to add the nop sleds later on in the process. Also implements the related xray_always_instrument and xray_never_instrument function attributes. Patch by Dean Michael Berris. llvm-svn: 275330	2016-07-13 22:32:15 +00:00
Carlo Bertolli	70594e9282	[OpenMP] Initial implementation of parse+sema for OpenMP clause 'is_device_ptr' of target http://reviews.llvm.org/D22070 llvm-svn: 275282	2016-07-13 17:16:49 +00:00
Carlo Bertolli	2404b17192	[OpenMP] Initial implementation of parse+sema for clause use_device_ptr of 'target data' http://reviews.llvm.org/D21904 This patch is similar to the implementation of 'private' clause: it adds a list of private pointers to be used within the target data region to store the device pointers returned by the runtime. Please refer to the following document for a full description of what the runtime witll return in this case (page 10 and 11): https://github.com/clang-omp/OffloadingDesign I am happy to answer any question related to the runtime interface to help reviewing this patch. llvm-svn: 275271	2016-07-13 15:37:16 +00:00
Alexey Bader	10e9e59898	[OpenCL] Fix code generation of kernel pipe parameters. Improved test with user define structure pipe type case. Reviewers: Anastasia, pxli168 Subscribers: yaxunl, cfe-commits Differential revision: http://reviews.llvm.org/D21744 llvm-svn: 275259	2016-07-13 10:28:13 +00:00
Saleem Abdulrasool	4f515a6e80	CodeGen: minor cleanup, NFC Initialise more members in initializer lists. Invert the condition that had grown to be pretty confusing. The `_objc_empty_vtable` is only used on macOS <10.9. This simplifies the code. NFC. llvm-svn: 275241	2016-07-13 02:58:44 +00:00
David Majnemer	526793d14c	[MS ABI] Support throwing/catching __unaligned types We need to mark the appropriate bits in ThrowInfo and HandlerType so that the personality routine can correctly handle qualification conversions. llvm-svn: 275154	2016-07-12 04:42:50 +00:00
Vedant Kumar	93205af066	[Coverage] Do not map regions from system headers Do not assign source regions located within system headers file ID's, and do not construct counter mapping regions out of them. This makes coverage reports less cluttered and less mysterious. E.g using the "assert" macro doesn't cause assert.h to appear in reports, and it no longer shows the "assertion failed" branch as an uncovered region. It also makes coverage mapping sections a bit smaller (e.g a 1% reduction in a stage2 build of bin/llvm-as). llvm-svn: 275121	2016-07-11 22:57:46 +00:00
Vedant Kumar	c468bb8b29	[Coverage] Move logic to skip decl's into a helper (NFC) llvm-svn: 275120	2016-07-11 22:57:44 +00:00
Wolfgang Pieb	5675c96987	Prevent the creation of empty (forwarding) blocks resulting from nested ifs. Summary: Nested if statements can generate empty BBs whose terminator branches unconditionally to its successor. These branches are not eliminated to help generate better line number information in some cases, but there is no reason to keep the empty blocks that result from nested ifs. Reviewers: mehdi_amini, dblaikie, echristo Subscribers: mehdi_amini, cfe-commits Differential review: http://reviews.llvm.org/D11360 llvm-svn: 275115	2016-07-11 22:22:23 +00:00
David Majnemer	60e5bdc470	[CodeGen] Treat imported static local variables as declarations Imported variables cannot really be definitions for the purposes of IR generation. llvm-svn: 275040	2016-07-11 04:28:21 +00:00
Jan Vesely	d7e03a5bd9	AMDGPU: Export workitem builtins Reviewers: tstellardAMD Differential Revision: http://reviews.llvm.org/D20299 llvm-svn: 275030	2016-07-10 22:38:04 +00:00
Sean Silva	9ac6ae2a99	Delete dead code. We were just setting DisableUnitAtATime to its default value. llvm-svn: 275005	2016-07-10 00:57:52 +00:00
David Majnemer	177553511d	[MS ABI] Some code cleanups Don't create unnecessary truncations if the result will not be used. Also prefer preforming math before the truncation, it makes it a little easier to reason about. llvm-svn: 274984	2016-07-09 19:26:25 +00:00
Saleem Abdulrasool	0295f8ce39	CodeGen: tweak CFString section for COFF, ELF Place the structure data into `cfstring`. This both isolates the structures to permit coalescing in the future (by the linker) as well as ensures that it doesnt get marked as read-only data. The structures themselves are not read-only, only the string contents. llvm-svn: 274956	2016-07-09 01:59:51 +00:00
Yaxun Liu	79c99fb7eb	[OpenCL] Add missing -cl-no-signed-zeros option into driver Add OCL option -cl-no-signed-zeros to driver options. Also added to opencl.cl testcases. Patch by Aaron En Ye Shi. Differential Revision: http://reviews.llvm.org/D22067 llvm-svn: 274923	2016-07-08 20:28:29 +00:00
Craig Topper	f2f1a099a7	[CodeGen] Use llvm::Type::getVectorNumElements instead of casting to llvm::VectorType and calling getNumElements. This is equivalent and shorter. llvm-svn: 274823	2016-07-08 02:17:35 +00:00
Craig Topper	0160063aeb	[X86] Reuse existing lambda and remove unnecessary argument from vector cmp builtin handling. NFC llvm-svn: 274821	2016-07-08 01:57:24 +00:00
Craig Topper	925ef0a135	[X86] Remove a couple calls to create V2F64 and V4F32 types for builtin handling. Just get the type from the operand of the builtin instead. NFC llvm-svn: 274820	2016-07-08 01:48:44 +00:00
David Majnemer	6fbeee307e	[AST] Use ArrayRef in more interfaces ArrayRef is a little better than passing around a pointer/length pair. No functional change is intended. llvm-svn: 274732	2016-07-07 04:43:07 +00:00
Adrian McCarthy	20128d94e5	Revert "Retry "Include debug info for nested structs and classes"" Reverting because it causes a test failure on build bots (Modules/ModuleDebugInfo.cpp). Failure does not reproduce locally. svn revision: rL274698 This reverts commit 3c5ed6599b086720aab5b8bd6941149d066806a6. llvm-svn: 274706	2016-07-06 23:28:34 +00:00
Adrian McCarthy	0a8cb648c9	Retry "Include debug info for nested structs and classes" This should work now that the LLVM-side of the change has landed successfully. Original Differential Revision: http://reviews.llvm.org/D21705 This reverts commit a30322e861c387e1088f47065d0438c6bb019879. llvm-svn: 274698	2016-07-06 22:39:15 +00:00
David Majnemer	36a6e00d6e	[CodeGen, DebugInfo] Use hasLocalLinkage instead of hasInternalLinkage For the purpose of emitting debug info, entities with private linkage should be treated the same as internal linkage. While this doesn't change anything in practice, it makes the code a little less confusing. llvm-svn: 274677	2016-07-06 21:07:53 +00:00
Adrian McCarthy	743f7f1aff	Revert "Include debug info for nested structs and classes" This reverts commit 0af5ee9631c7c167dc40498b415876553e314c95. llvm-svn: 274633	2016-07-06 15:15:38 +00:00
Adrian McCarthy	73d726a6cc	Include debug info for nested structs and classes This includes nested types in the member list, even if there are no members of that type. Note that structs and classes have themselves as an "implicit struct" as the first member, so we skip implicit ones. Differential Revision: http://reviews.llvm.org/D21705 llvm-svn: 274628	2016-07-06 14:46:42 +00:00
Craig Topper	425d02d33e	[X86] Use native IR for immediate values 0-7 of packed fp cmp builtins. This makes them the same as what is done when using the SSE builtins for these same encodings. llvm-svn: 274608	2016-07-06 06:27:31 +00:00
Kelvin Li	787f3fcc6b	[OpenMP] Sema and parsing for 'distribute simd' pragma Summary: This patch is an implementation of sema and parsing for the OpenMP composite pragma 'distribute simd'. Differential Revision: http://reviews.llvm.org/D22007 llvm-svn: 274604	2016-07-06 04:45:38 +00:00
Craig Topper	46e7555d4b	[AVX512] Use the generic ctlz intrinsic to implement the vplzcntd/q builtins. llvm-svn: 274603	2016-07-06 04:24:29 +00:00
Vedant Kumar	1d137f54a3	Delete some dead code, NFC Found using clang's code coverage tool. llvm-svn: 274599	2016-07-06 03:08:47 +00:00
Anastasia Stulova	db7a31cce7	[OpenCL] An implementation of device side enqueue (DSE) from OpenCL v2.0 s6.13.17. - Added new Builtins: enqueue_kernel, get_kernel_work_group_size and get_kernel_preferred_work_group_size_multiple. These Builtins use custom check to diagnose parameters of the passed Blocks i. e. variable number of 'local void*' type params, and check different overloads specified in Table 6.31 of OpenCL v2.0. - IR is generated as an internal library call for each OpenCL Builtin, reusing ObjC Block implementation. Review: http://reviews.llvm.org/D20249 llvm-svn: 274540	2016-07-05 11:31:24 +00:00
Kelvin Li	4a39add05e	[OpenMP] Sema and parse for 'distribute parallel for simd' Summary: This patch is an implementation of sema and parsing for the OpenMP composite pragma 'distribute parallel for simd'. Differential Revision: http://reviews.llvm.org/D21977 llvm-svn: 274530	2016-07-05 05:00:15 +00:00
Anastasia Stulova	7f8d6dc0ef	[OpenCL] Make OpenCL Builtins added according to the right version. Currently we only have OpenCL 2.0 Builtins i.e. pipes or address space conversions. They have to be added only in the version 2.0 compilation mode to make the identifiers available for use in the other versions. Review: http://reviews.llvm.org/D20249 llvm-svn: 274509	2016-07-04 16:07:18 +00:00
Craig Topper	ac1823f6e9	[AVX512] Modify what indices we emit for the zero vector we use for zero extension of the result of a v2i1 or v4i1 masked compare. This way we emit something that the backend easily interprets as a concatenation rather than a true shuffle. This delivers slightly better codegen with the current backend capabilities. llvm-svn: 274484	2016-07-04 07:09:46 +00:00
Benjamin Kramer	6d1c10bb8e	[CUDA] Move argument type lists to the stack. NFC. llvm-svn: 274433	2016-07-02 12:03:57 +00:00
Benjamin Kramer	309347385e	Use arrays or initializer lists to feed ArrayRefs instead of SmallVector where possible. No functionality change intended llvm-svn: 274432	2016-07-02 11:41:41 +00:00

... 3 4 5 6 7 ...

10404 Commits