llvm-project

Commit Graph

Author	SHA1	Message	Date
Adrian Prantl	0dabc2adba	Module debugging: Also correctly handle typedef'd foward-declared members. Thanks again to Richard Smith for pointing this out. llvm-svn: 267630	2016-04-26 23:37:38 +00:00
Adrian Prantl	88d7917970	Module debugging: Use the definition to determine module-defined types. Follow-up to r267464. Thanks to Richard Smith for pointing this out! llvm-svn: 267611	2016-04-26 21:58:18 +00:00
Anastasia Stulova	0937a7d814	[SPIR] Remove an assert mandating SPIR for OpenCL sources only. SPIR target can be used for C/C++ inputs too (i.e. in OpenCL compatible mode for the libs creation). Patch by Neil Henning! Review: http://reviews.llvm.org/D19478 llvm-svn: 267561	2016-04-26 15:14:01 +00:00
Kostya Serebryany	0da4442f14	trying to fix the windows build broken by r267496 llvm-svn: 267513	2016-04-26 01:53:49 +00:00
NAKAMURA Takumi	90d96bf41e	CGOpenMPRuntime.h: Prune '\param IfCond' in r267395. [-Wdocumentation] llvm-svn: 267503	2016-04-26 00:45:00 +00:00
Jacques Pienaar	e74d91314a	[lanai] Update handling of structs in arguments to be passed in registers. Previously aggregate types were passed byval, change the ABI to pass these in registers instead. llvm-svn: 267496	2016-04-26 00:09:29 +00:00
Adrian Prantl	05fefa4a85	Module Debugging: Fix the condition for determining whether a template instantiation is in a module. This patch fixes the condition for determining whether the debug info for a template instantiation will exist in an imported clang module by: - checking whether the ClassTemplateSpecializationDecl is complete and - checking that the instantiation was in a module by looking at the first field. I also added a negative check to make sure that a typedef to a forward-declared template (with the definition outside of the module) is handled correctly. http://reviews.llvm.org/D19443 rdar://problem/25553724 llvm-svn: 267464	2016-04-25 20:52:40 +00:00
Alexey Bataev	7292c29bb5	[OPENMP 4.5] Codegen for 'taskloop' directive. The taskloop construct specifies that the iterations of one or more associated loops will be executed in parallel using OpenMP tasks. The iterations are distributed across tasks created by the construct and scheduled to be executed. The next code will be generated for the taskloop directive: #pragma omp taskloop num_tasks(N) lastprivate(j) for( i=0; i<NGRAINSTRIDE-1; i+=STRIDE ) { int th = omp_get_thread_num(); #pragma omp atomic counter++; #pragma omp atomic th_counter[th]++; j = i; } Generated code: task = __kmpc_omp_task_alloc(NULL,gtid,1,sizeof(struct task),sizeof(struct shar),&task_entry); psh = task->shareds; psh->pth_counter = &th_counter; psh->pcounter = &counter; psh->pj = &j; task->lb = 0; task->ub = NGRAINSTRIDE-2; task->st = STRIDE; __kmpc_taskloop( NULL, // location gtid, // gtid task, // task structure 1, // if clause value &task->lb, // lower bound &task->ub, // upper bound STRIDE, // loop increment 0, // 1 if nogroup specified 2, // schedule type: 0-none, 1-grainsize, 2-num_tasks N, // schedule value (ignored for type 0) (void*)&__task_dup_entry // tasks duplication routine ); llvm-svn: 267395	2016-04-25 12:22:29 +00:00
Adrian Prantl	1858c664de	Debug info: Apply an empty debug location for global OpenMP destructors. LLVM really wants a debug location on every inlinable call in a function with debug info, because it otherwise cannot set up inlining debug info. This change applies an artificial line 0 debug location (which is how DWARF marks automatically generated code that has no corresponding source code) to the .__kmpc_global_dtor_. functions to avoid the LLVM Verifier complaining. llvm-svn: 267369	2016-04-24 22:22:29 +00:00
Duncan P. N. Exon Smith	383f8413cf	DebugInfo: Adapt to loss of DITypeRef in LLVM r267296 LLVM stopped using MDString-based type references, and DIBuilder no longer fills 'retainedTypes:' with every DICompositeType that has an 'identifier:' field. There are just minor changes to keep the same behaviour in CFE. Leaving 'retainedTypes:' unfilled has a dramatic impact on the output order of the IR though. There are a huge number of testcase changes, which were unfortunately not really scriptable. llvm-svn: 267297	2016-04-23 21:08:27 +00:00
Rong Xu	f932f54254	PGO] PGOFuncName meta data if PGOFuncName is different from function's raw name Write out the PGOFuncName meta data if PGOFuncName is different from function's raw name. This should only apply to internal linkage functions. This is to be consumed by indirect-call promotion when called in LTO optimization pass. Differential Revision: http://reviews.llvm.org/D18624 llvm-svn: 267224	2016-04-22 21:19:05 +00:00
Reid Kleckner	ea53dba78b	Fix a bug involving deferred decl emission and PCH For various reasons, involving dllexport and class linkage compuations, we have to wait until after the semicolon after a class declaration to emit inline methods. These are "deferred" decls. Before this change, finishing the tag decl would trigger us to deserialize some PCH so that we could make a "pretty" IR-level type. Deserializing the PCH triggered calls to HandleTopLevelDecl, which, when done, checked the deferred decl list, and emitted some dllexported decls that weren't ready. Avoid this re-entrancy. Deferred decls should not get emitted when a tag is finished, they should only be emitted after a real top level decl in the main file. llvm-svn: 267186	2016-04-22 18:46:33 +00:00
Alexey Bataev	feddd64bff	[OPENMP] Fix for PR27463: Privatizing struct fields with array type causes code generation failure. The codegen part of firstprivate clause for member decls used type of original variable without skipping reference type from OMPCapturedExprDecl. Patch fixes this problem. llvm-svn: 267125	2016-04-22 09:05:03 +00:00
Alexey Bataev	5dff95c04d	[OPENMP] Fix for LCV in simd directives in explicit clauses. If loop control variable for simd-based directives is explicitly marked as linear/lastprivate in clauses, codegen for such construct would crash. Patch fixes this problem. llvm-svn: 267101	2016-04-22 03:56:56 +00:00
Derek Bruening	256c2e14c7	[esan] EfficiencySanitizer driver flags Summary: Adds a framework to enable the instrumentation pass for the new EfficiencySanitizer ("esan") family of tools. Adds a flag for esan's cache fragmentation tool via -fsanitize=efficiency-cache-frag. Adds appropriate tests for the new flag. Reviewers: eugenis, vitalybuka, aizatsky, filcab Subscribers: filcab, kubabrecka, llvm-commits, zhaoqin, kcc Differential Revision: http://reviews.llvm.org/D19169 llvm-svn: 267059	2016-04-21 21:32:04 +00:00
Adrian Prantl	50fd1a87c4	Module Debugging: Emit the canonical debug info for Objective-C classes in the compile unit that contains their implementation even if their interface is declared in a module. The private @implementation of an @interface may have additional hidden ivars so we should not defer to the public version of the type that is found in the module. <rdar://problem/25541798> llvm-svn: 266937	2016-04-20 23:59:32 +00:00
Alexey Bataev	48591dd98c	[OPENMP] Codegen for untied tasks. If the untied clause is present on a task construct, any thread in the team can resume the task region after a suspension. Patch adds proper codegen for untied tasks. llvm-svn: 266853	2016-04-20 04:01:36 +00:00
Teresa Johnson	b10474903c	Enable ODR uniquing of DITypes for ThinLTO backends Summary: This is a follow-on to apply Duncan's new DIType ODR uniquing from r266549 and r266713 in more places. When invoking ThinLTO backend compiles via clang (for a distributed build), invoke enableDebugTypeODRUniquing() before parsing the module. Reviewers: dexonsmith, joker.eph Subscribers: llvm-commits, joker.eph Differential Revision: http://reviews.llvm.org/D19264 llvm-svn: 266852	2016-04-20 02:23:52 +00:00
Marcin Koscielnicki	4005070e1b	[AArch64] Fix D19098 fallout. The intrinsic is now called llvm.thread.pointer, not llvm.aarch64.thread.pointer. Also, the code handling it in CGBuiltin.cpp is dead - it's already covered by GCCBuiltin. Remove it. Differential Revision: http://reviews.llvm.org/D19099 llvm-svn: 266817	2016-04-19 20:51:00 +00:00
Ahmed Bougacha	1d9de10130	[ARM NEON] Define vfms_f32 on ARM, and all vfms using vfma. r259537 added vfma/vfms to armv7, but the builtin was only lowered on the AArch64 side. Instead of supporting it on ARM, get rid of it. The vfms builtin lowered to: %nb = fsub float -0.0, %b %r = @llvm.fma.f32(%a, %nb, %c) Instead, define the operation in terms of vfma, and swap the multiplicands. It now lowers to: %na = fsub float -0.0, %a %r = @llvm.fma.f32(%na, %b, %c) This matches the instruction more closely, and lets current LLVM generate the "natural" operand ordering: fmls.2s v0, v1, v2 instead of the crooked (but equivalent): fmls.2s v0, v2, v1 Except for theses changes, assembly is identical. LLVM accepts both commutations, and the LLVM tests in: test/CodeGen/AArch64/arm64-fmadd.ll test/CodeGen/AArch64/fp-dp3.ll test/CodeGen/AArch64/neon-fma.ll test/CodeGen/ARM/fusedMAC.ll already check either the new one only, or both. Also verified against the test-suite unittests. llvm-svn: 266807	2016-04-19 19:44:45 +00:00
Manman Ren	01b705e9c0	ObjC Class Property: don't emit class properties on old deployment targets. For old deployment targets, emit nil for all class property lists. rdar://25616128 llvm-svn: 266800	2016-04-19 19:05:03 +00:00
Sanjay Patel	3fd323fa36	reduce indentation; NFCI llvm-svn: 266787	2016-04-19 18:06:33 +00:00
Ahmed Bougacha	40a34c2e2a	[CodeGen] Widen non-power-of-2 vector HFA base types. Currently, for the ppc64--gnu and aarch64 ABIs, we recognize: typedef __attribute__((__ext_vector_type__(3))) float v3f32; typedef __attribute__((__ext_vector_type__(16))) char v16i8; struct HFA { v3f32 a; v16i8 b; }; as an HFA. Since the first type encountered is used as the base type, we pass the HFA as: [2 x <3 x float>] Which leads to incorrect IR (relying on padding values) when the second field is used. Instead, explicitly widen the vector (after size rounding) in isHomogeneousAggregate. Differential Revision: http://reviews.llvm.org/D18998 llvm-svn: 266784	2016-04-19 17:54:29 +00:00
Ahmed Bougacha	8862cae75b	[CodeGen] Fix whitespace. NFC. llvm-svn: 266783	2016-04-19 17:54:24 +00:00
Sanjay Patel	341890ad70	reduce indentation; NFCI llvm-svn: 266765	2016-04-19 17:13:14 +00:00
Alexey Bataev	995e861ba6	Revert "[OPENMP] Codegen for untied tasks." This reverts commit r266754. llvm-svn: 266755	2016-04-19 16:36:01 +00:00
Alexey Bataev	823acfacdf	[OPENMP] Codegen for untied tasks. If the untied clause is present on a task construct, any thread in the team can resume the task region after a suspension. Patch adds proper codegen for untied tasks. llvm-svn: 266754	2016-04-19 16:27:55 +00:00
Alexey Bataev	bec9572213	Revert "[OPENMP] Codegen for untied tasks." This reverts commit 266722. llvm-svn: 266724	2016-04-19 09:27:38 +00:00
Alexey Bataev	26b2577f6b	[OPENMP] Codegen for untied tasks. If the untied clause is present on a task construct, any thread in the team can resume the task region after a suspension. Patch adds proper codegen for untied tasks. llvm-svn: 266722	2016-04-19 09:10:27 +00:00
Adrian Prantl	2526fca8d8	[ObjC++] Fix crash when emitting debug info for a block member capturing this. rdar://problem/23871824 llvm-svn: 266698	2016-04-18 23:48:16 +00:00
JF Bastien	dda2cb17a3	NFC: unify clang / LLVM atomic ordering This makes the C11 / C++11 ABI atomic ordering accessible from LLVM, as discussed in http://reviews.llvm.org/D18200#inline-151433 This re-applies r266574 which I had reverted in r266575. Depends on http://reviews.llvm.org/D18875 Original review: http://reviews.llvm.org/D18876 llvm-svn: 266641	2016-04-18 18:01:49 +00:00
Xinliang David Li	90364ca1a7	Update InstrProf pass creator API reference llvm-svn: 266638	2016-04-18 17:48:12 +00:00
JF Bastien	a76c91fbf6	Revert "NFC: unify clang / LLVM atomic ordering" This reverts commit b0495df9eae2824bee830cc4c94f5441f0d4cbc9. Same as for the corresponding LLVM revert, an assert seems to fire. llvm-svn: 266575	2016-04-17 21:28:50 +00:00
JF Bastien	0601a77cf0	NFC: unify clang / LLVM atomic ordering Summary: Depends on http://reviews.llvm.org/D18875 This makes the C11 / C++11 ABI atomic ordering accessible from LLVM, as discussed in http://reviews.llvm.org/D18200#inline-151433 Reviewers: jyknight, reames Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D18876 llvm-svn: 266574	2016-04-17 21:01:09 +00:00
Duncan P. N. Exon Smith	f9521b0bb7	DebugInfo: Make DICompositeTypes distinct most of the time Since elements of most kinds of DICompositeType have back references, most are involved in uniquing cycles. Except via the ODR 'identifier:' field, which doesn't care about the storage type (see r266549), they have no hope of being uniqued. Distinct nodes are far more efficient, so use them for most kinds of DICompositeType definitions (i.e., when DIType::isForwardDecl is false). The exceptions: - DW_TAG_array_type, since their elements never have back-references and they never have ODR 'identifier:' fields; - DW_TAG_enumeration_type when there is no ODR 'identifier:' field, since their elements usually don't have back-references. This breaks the last major uniquing cycle I'm aware of in the debug info graph. The impact won't be enormous for C++ because references to ODR-uniqued nodes still use string-based DITypeRefs; but this should prevent a regression in C++ when we drop the string-based references. This wouldn't have been reasonable until r266549, when composite types stopped relying on being uniqued by structural equivalence to prevent blow-ups at LTO time. llvm-svn: 266556	2016-04-17 07:45:08 +00:00
Nemanja Ivanovic	d7d45bf8ce	Revert 266186 as it breaks anything that includes type_traits on some platforms Since this patch provided support for the __float128 type but disabled it on all platforms by default, some platforms can't compile type_traits with -std=gnu++11 since there is a specialization with __float128. This reverts the patch until D19125 is approved (i.e. we know which platforms need this support enabled). llvm-svn: 266460	2016-04-15 18:04:13 +00:00
Adrian Prantl	e76bda544b	Update to match LLVM changes for PR27284. (Reverse the ownership between DICompileUnit and DISubprogram.) http://reviews.llvm.org/D19034 <rdar://problem/25256815> llvm-svn: 266445	2016-04-15 15:55:45 +00:00
Reid Kleckner	9305fd1f86	[CodeGen] Avoid ctor/dtor boilerplate with some C++11 Non-owning pointers that cache LLVM types and constants can use 'nullptr' default member initializers so that we don't need to mention them in the constructor initializer list. Owning pointers should use std::unique_ptr so that we don't need to manually delete them in the destructor. They also don't need to be mentioned in the constructor at that point. NFC llvm-svn: 266263	2016-04-13 23:37:17 +00:00
Richard Smith	aa165cf759	[modules] Remove CXX_CTOR_INITIALIZERS_OFFSETS table. Instead of storing an ID of a table entry in the corresponding decl, store an offset from the current record to the relevant CXX_CTOR_INITIALIZERS record. This results in fewer indirections and a minor .pcm file size reduction. llvm-svn: 266254	2016-04-13 21:57:08 +00:00
Nemanja Ivanovic	50f29e06a1	Enable support for __float128 in Clang This patch corresponds to review: http://reviews.llvm.org/D15120 It adds support for the __float128 keyword, literals and a target feature to enable it. This support is disabled by default on all targets and any target that has support for this type is free to add it. Based on feedback that I've received from target maintainers, this appears to be the right thing for most targets. I have not heard from the maintainers of X86 which I believe supports this type. I will subsequently investigate the impact of enabling this on X86. llvm-svn: 266186	2016-04-13 09:49:45 +00:00
Alexey Bader	b62f14400f	[OpenCL] Move OpenCLImageTypes.def from clangAST to clangBasic library. Putting OpenCLImageTypes.def to clangAST library violates layering requirement: "It's not OK for a Basic/ header to include an AST/ header". This fixes the modules build. Differential revision: http://reviews.llvm.org/D18954 Reviewers: Richard Smith, Vassil Vassilev. llvm-svn: 266180	2016-04-13 08:33:41 +00:00
Akira Hatanaka	2d3690bc98	[ObjC] Pop all cleanups created in EmitObjCForCollectionStmt before exiting the for-in loop. This commit fixes a bug where EmitObjCForCollectionStmt didn't pop cleanups for captures. For example, in the following for-in loop, a block which captures self is passed to foo1: for (id x in [self foo1:^{ use(self); }]) { use(x); break; } Previously, the code in EmitObjCForCollectionStmt wouldn't pop the cleanup for the captured self before exiting the loop, which caused code-gen to generate an IR in which objc_release was called twice on the captured self. This commit fixes the bug by entering a RunCleanupsScope before the loop condition is evaluated and forcing its cleanup before exiting the loop. rdar://problem/16865751 Differential Revision: http://reviews.llvm.org/D18618 llvm-svn: 266147	2016-04-12 23:10:58 +00:00
David Blaikie	7164767de2	Add a fixme for an old patch I had lying around that I'm not going to finish any time so n llvm-svn: 266127	2016-04-12 21:22:48 +00:00
Yaxun Liu	b5e80c3117	Pass -backend-option to LLVM when there is no target machine. Clang should pass -backend-option to LLVM even though there is no target machine, since LLVM passes are used when emitting LLVM IR. Differential Revision: http://reviews.llvm.org/D17552 llvm-svn: 266117	2016-04-12 20:22:32 +00:00
Yaxun Liu	b7b6d0fc66	[OpenCL] Handle AddressSpaceConversion when target address space does not change. In codegen different address spaces may be mapped to the same address space for a target, e.g. in x86/x86-64 all address spaces are mapped to 0. Therefore AddressSpaceConversion should be translated by CreatePointerBitCastOrAddrSpaceCast instead of CreateAddrSpaceCast. Differential Revision: http://reviews.llvm.org/D18713 llvm-svn: 266107	2016-04-12 19:03:49 +00:00
Alexey Bataev	e48a5fc56d	[OPENMP 4.0] Support for 'uniform' clause in 'declare simd' directive. OpenMP 4.0 defines clause 'uniform' in 'declare simd' directive: 'uniform' '(' <argument-list> ')' The uniform clause declares one or more arguments to have an invariant value for all concurrent invocations of the function in the execution of a single SIMD loop. The special this pointer can be used as if was one of the arguments to the function in any of the linear, aligned, or uniform clauses. llvm-svn: 266041	2016-04-12 05:28:34 +00:00
Mehdi Amini	a0ef3ed8c3	Emit the module hash by default with -flto=thin. Reviewers: tejohnson Subscribers: joker.eph, cfe-commits Differential Revision: http://reviews.llvm.org/D18947 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265977	2016-04-11 18:45:20 +00:00
Benjamin Kramer	5ff6747e04	Remove redundant conditions of the form (A \|\| (!A && B)) -> (A \|\| B) Found by cppcheck! PR27286 PR27287 PR27288 PR27289 llvm-svn: 265918	2016-04-11 08:26:13 +00:00
Dmitry Polukhin	85eda12d09	[GCC] Attribute ifunc support in clang This patch add support for GCC attribute((ifunc("resolver"))) for targets that use ELF as object file format. In general ifunc is a special kind of function alias with type @gnu_indirect_function. LLVM patch http://reviews.llvm.org/D15525 Differential Revision: http://reviews.llvm.org/D15524 llvm-svn: 265917	2016-04-11 07:48:59 +00:00
Adrian Prantl	826824ea3d	Use NoDebug compile units to mark debug metadata used only for sample-based profiling and optimization remarks and indicate that no debug info shall be emitted for these compile units. http://reviews.llvm.org/D18808 <rdar://problem/25427165> llvm-svn: 265862	2016-04-08 22:43:06 +00:00
Saleem Abdulrasool	f7b3d6c55b	Move EABIVersion from CodeGenOptions to TargetOptions It is possible to argue that the EABIVersion field is similar in spirit to the ABI field in TargetOptions. It represents the embedded ABI that the target follows. This will allow us to thread this information into the target information construction. llvm-svn: 265807	2016-04-08 16:52:05 +00:00
Saleem Abdulrasool	10a4972a8d	revert SVN r265702, r265640 Revert the two changes to thread CodeGenOptions into the TargetInfo allocation and to fix the layering violation by moving CodeGenOptions into Basic. Code Generation is arguably not particularly "basic". This addresses Richard's post-commit review comments. This change purely does the mechanical revert and will be followed up with an alternate approach to thread the desired information into TargetInfo. llvm-svn: 265806	2016-04-08 16:52:00 +00:00
Alexey Bader	954ba21f85	[OpenCL] Complete image types support. I. Current implementation of images is not conformant to spec in the following points: 1. It makes no distinction with respect to access qualifiers and therefore allows to use images with different access type interchangeably. The following code would compile just fine: void write_image(write_only image2d_t img); kernel void foo(read_only image2d_t img) { write_image(img); } // Accepted code which is disallowed according to s6.13.14. 2. It discards access qualifier on generated code, which leads to generated code for the above example: call void @write_image(%opencl.image2d_t* %img); In OpenCL2.0 however we can have different calls into write_image with read_only and wite_only images. Also generally following compiler steps have no easy way to take different path depending on the image access: linking to the right implementation of image types, performing IR opts and backend codegen differently. 3. Image types are language keywords and can't be redeclared s6.1.9, which can happen currently as they are just typedef names. 4. Default access qualifier read_only is to be added if not provided explicitly. II. This patch corrects the above points as follows: 1. All images are encapsulated into a separate .def file that is inserted in different points where image handling is required. This avoid a lot of code repetition as all images are handled the same way in the code with no distinction of their exact type. 2. The Cartesian product of image types and image access qualifiers is added to the builtin types. This simplifies a lot handling of access type mismatch as no operations are allowed by default on distinct Builtin types. Also spec intended access qualifier as special type qualifier that are combined with an image type to form a distinct type (see statement above - images can't be created w/o access qualifiers). 3. Improves testing of images in Clang. Author: Anastasia Stulova Reviewers: bader, mgrang. Subscribers: pxli168, pekka.jaaskelainen, yaxunl. Differential Revision: http://reviews.llvm.org/D17821 llvm-svn: 265783	2016-04-08 13:40:33 +00:00
Aaron Ballman	829b5d42af	Silencing a 32-bit shift implicit conversion warning from MSVC; NFC. llvm-svn: 265782	2016-04-08 12:21:58 +00:00
Sanjoy Das	f60a0d7452	Adapt to LLVM API change Replace mayBeOverridden with isInterposable llvm-svn: 265767	2016-04-08 01:31:02 +00:00
Richard Smith	351241c83e	Replace Sema-level implementation of -fassume-sane-operator-new with a CodeGen-level implementation. Instead of adding an attribute to clang's FunctionDecl, add the IR attribute directly. This means a module built with this flag is now compatible with code built without it and vice versa. This change also results in the 'noalias' attribute no longer being added to calls to operator new in the IR; it's now only added to the declaration. It also fixes a bug where we failed to add the attribute to the 'nothrow' versions (because we didn't implicitly declare them, there was no good time to inject a fake attribute). llvm-svn: 265728	2016-04-07 21:46:12 +00:00
Saleem Abdulrasool	94cfc603d1	Basic: move CodeGenOptions from Frontend This is a mechanical move of CodeGenOptions from libFrontend to libBasic. This fixes the layering violation introduced earlier by threading CodeGenOptions into TargetInfo. It should also fix the modules based self-hosting builds. NFC. llvm-svn: 265702	2016-04-07 17:49:44 +00:00
Sanjay Patel	ae7a9df7bf	make __builtin_isfinite more efficient (PR27145) isinf (is infinite) and isfinite should be implemented with the same function except we change the comparison operator. See PR27145 for more details: https://llvm.org/bugs/show_bug.cgi?id=27145 Ref: forked off of the discussion in D18513. Differential Revision: http://reviews.llvm.org/D18648 llvm-svn: 265675	2016-04-07 14:29:05 +00:00
Benjamin Kramer	5d28c7f9a2	Move class into an anonymous namespace. NFC. llvm-svn: 265654	2016-04-07 10:14:54 +00:00
JF Bastien	dd11ee7452	NFC: use AtomicOrdering isStrongerThan Summary: As discussed in D18775. Reviewers: jyknight Differential Revision: http://reviews.llvm.org/D18840 llvm-svn: 265617	2016-04-06 23:37:36 +00:00
Tim Northover	1390b4479e	Restore slightly less dodgy diagnostic handler for inline asm Turns out it was there mostly to prevent Clang asking people to report a bug. This time we report something to Clang's real diagnostics handler so that it exits with something approximating a real error and tidies up after itself. llvm-svn: 265592	2016-04-06 19:58:07 +00:00
JF Bastien	92f4ef1017	NFC: make AtomicOrdering an enum class Summary: See LLVM change D18775 for details, this change depends on it. Reviewers: jyknight, reames Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D18776 llvm-svn: 265569	2016-04-06 17:26:42 +00:00
Justin Lebar	d3a44f6885	[CUDA] Add -fcuda-flush-denormals-to-zero. Summary: Setting this flag causes all functions are annotated with the "nvvm-f32ftz" = "true" attribute. In addition, we annotate the module with "nvvm-reflect-ftz" set to 0 or 1, depending on whether -cuda-flush-denormals-to-zero is set. This is read by the NVVMReflect pass. Reviewers: tra, rnk Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D18671 llvm-svn: 265435	2016-04-05 18:26:20 +00:00
Nirav Dave	d2f44d8de0	Add -fno-jump-tables and-fjump-tables flags Add no-jump-tables flag to disable use of jump tables when lowering switch statements Reviewers: echristo, hans Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D18407 llvm-svn: 265425	2016-04-05 17:50:43 +00:00
James Y Knight	e635215009	Fix "suggest parentheses" warning. llvm-svn: 265355	2016-04-04 22:35:56 +00:00
John McCall	8cde42c400	Fix an unused-variable warning by using the variable in the place it was supposed to have been used. llvm-svn: 265344	2016-04-04 20:39:50 +00:00
John McCall	12f2352152	IRGen-level lowering for the Swift calling convention. llvm-svn: 265324	2016-04-04 18:33:08 +00:00
Carlo Bertolli	c687225b43	[OPENMP] Codegen for teams directive for NVPTX This patch implements the teams directive for the NVPTX backend. It is different from the host code generation path as it: Does not call kmpc_fork_teams. All necessary teams and threads are started upon touching the target region, when launching a CUDA kernel, and their execution is coordinated through sequential and parallel regions within the target region. Does not call kmpc_push_num_teams even if a num_teams of thread_limit clause is present. Setting the number of teams and the thread limit is implemented by the nvptx-related runtime. Please note that I am now passing a Clang Expr * to emitPushNumTeams instead of the originally chosen llvm::Value * type. The reason for that is that I want to avoid emitting expressions for num_teams and thread_limit if they are not needed in the target region. http://reviews.llvm.org/D17963 llvm-svn: 265304	2016-04-04 15:55:02 +00:00
Douglas Gregor	24ae22c047	[Objective-C] Introduce objc_runtime_visible attribute. The objc_runtime_visible attribute deals with an odd corner case where a particular Objective-C class is known to the Objective-C runtime (and, therefore, accessible by name) but its symbol has been hidden for some reason. For such classes, teach CodeGen to use objc_lookUpClass to retrieve the Class object, rather than referencing the class symbol directly. Classes annotated with objc_runtime_visible have two major limitations that fall out from places where Objective-C metadata needs to refer to the class (or metaclass) symbol directly: * One cannot implement a subclass of an objc_runtime_visible class. * One cannot implement a category on an objc_runtime_visible class. Implements rdar://problem/25494092. llvm-svn: 265201	2016-04-01 23:23:52 +00:00
Akira Hatanaka	8af7bb28aa	[CodeGen] Emit lifetime.end intrinsic after objects are destructed in landing pads. Previously, lifetime.end intrinsics were inserted only on normal control flows. This prevented StackColoring from merging stack slots for objects that were destroyed on the exception handling control flow since it couldn't tell their lifetime ranges were disjoint. This patch fixes code-gen to emit the intrinsic on both control flows. rdar://problem/22181976 Differential Revision: http://reviews.llvm.org/D18196 llvm-svn: 265197	2016-04-01 22:58:55 +00:00
Adrian Prantl	3563c55aa0	Adapt to LLVM API change in r265077. EmissionKind moved from DIBuilder to DICompileUnit. <rdar://problem/25427165> llvm-svn: 265078	2016-03-31 23:57:45 +00:00
Tim Northover	8c824a07ae	Diagnostics: remove dodgy handler for bitcode inlineasm diagnostics. Whatever crash it was there to present appears to have been fixed in the backend now, and it had the nasty side-effect of causing clang to exit(0) and leave a .o containing goodness knows what even when an error hit. llvm-svn: 265038	2016-03-31 19:19:24 +00:00
Betul Buyukkurt	3da993c419	[PGO] Avoid instrumenting constants at value sites Value profiling should not profile constants and/or constant expressions when they appear as callees in call instructions. Constant expressions form when a direct callee has bitcasts or inttoptr(ptrtint (callee)) nests surrounding it. Value profiling should avoid instrumenting such cases. Mostly NFC. llvm-svn: 265037	2016-03-31 18:41:34 +00:00
Akira Hatanaka	68ab7fe1c8	[CodeGenCXX] Fix ItaniumCXXABI::getAlignmentOfExnObject to return 8-byte alignment on Darwin. Itanium C++ ABI specifies that _Unwind_Exception should be double-word aligned (16B). To conform to the ABI, libraries implementing exception handling declare the struct with __attribute__((aligned)), which aligns the unwindHeader field (and the end of __cxa_exception) to the default target alignment (which is typically 16-bytes). struct __cxa_exception { ... // struct is declared with __attribute__((aligned)). _Unwind_Exception unwindHeader; }; Based on the assumption that _Unwind_Exception is declared with __attribute__((aligned)), ItaniumCXXABI::getAlignmentOfExnObject returns the target default alignment for __attribute__((aligned)). It turns out that libc++abi, which is used on Darwin, doesn't declare the struct with the attribute and therefore doesn't guarantee that unwindHeader is aligned to the alignment specified by the ABI, which in some cases causes the program to crash because of unaligned memory accesses. This commit avoids crashes due to unaligned memory accesses by having getAlignmentOfExnObject return an 8-byte alignment on Darwin. I've only fixed the problem for Darwin, but we should also figure out whether other platforms using libc++abi need similar fixes. rdar://problem/25314277 Differential revision: http://reviews.llvm.org/D18479 llvm-svn: 264998	2016-03-31 06:36:07 +00:00
Matt Arsenault	3fb963389e	AMDGPU: Add frexp_mant + frexp_exp builtins llvm-svn: 264960	2016-03-30 22:57:40 +00:00
Aaron Ballman	abd466ed04	Silencing warnings from MSVC 2015 Update 2. Both of these changes silence "C4334 '<<': result of 32-bit shift implicitly converted to 64 bits (was 64-bit shift intended?)". NFC. llvm-svn: 264932	2016-03-30 21:33:34 +00:00
Stephan Bergmann	17d7d14571	For MS ABI, emit dllexport friend functions defined inline in class ...as that is apparently what MSVC does. This is an updated version of r263738, which had to be reverted in r263740 due to test failures. The original version had erroneously emitted functions that are defined in class templates, too (see the updated "Handle friend functions" code in EmitDeferredDecls, lib/CodeGen/ModuleBuilder.cpp). (The updated tests needed to be split out into their own dllexport-ms-friend.cpp because of the CHECK-NOTs which would have interfered with subsequent CHECK-DAGs in dllexport.cpp.) Differential Revision: http://reviews.llvm.org/D18430 llvm-svn: 264841	2016-03-30 06:27:31 +00:00
Betul Buyukkurt	cb6f5f16e6	[PGO] Move the instrumentation point closer to the value site. For terminator instructions, the value profiling instrumentation happens in a basic block other than where the value site resides. This CR moves the instrumentation point prior to the value site. Mostly NFC. llvm-svn: 264783	2016-03-29 20:44:09 +00:00
Alexey Bataev	5a3af13d93	[OPENMP] Remove extra code transformation. For better support of some specific GNU extensions some extra transformation of AST nodes were introduced. These transformations are very hard to handle. The code is improved in handling of these extensions by using captured expressions construct. llvm-svn: 264709	2016-03-29 08:58:54 +00:00
Alexey Bataev	14fa1c6b60	[OPENMP] Allow runtime insert its own code inside OpenMP regions. Solution unifies interface of RegionCodeGenTy type to allow insert runtime-specific code before/after main codegen action defined in CGStmtOpenMP.cpp file. Runtime should not define its own RegionCodeGenTy for general OpenMP directives, but must be allowed to insert its own (required) code to support target specific codegen. llvm-svn: 264700	2016-03-29 05:34:15 +00:00
Adam Nemet	1e217bc25f	[PGO] More comments how function pointers for indirect calls are mapped to function names Summary: Hopefully this will make it easier for the next person to figure all this out... Reviewers: bogner, davidxl Subscribers: davidxl, cfe-commits Differential Revision: http://reviews.llvm.org/D18489 llvm-svn: 264681	2016-03-28 22:18:53 +00:00
Jacques Pienaar	d964cc22d1	[lanai] Add Lanai backend to clang driver. Changes to clang to add Lanai backend. Adds a new target, ABI and toolchain. General Lanai backend discussion on llvm-dev thread "[RFC] Lanai backend" (http://lists.llvm.org/pipermail/llvm-dev/2016-February/095118.html). Differential Revision: http://reviews.llvm.org/D17002 llvm-svn: 264655	2016-03-28 21:02:54 +00:00
Alexey Bataev	f539faa733	Revert "[OPENMP] Allow runtime insert its own code inside OpenMP regions." Reverting because of failed tests. llvm-svn: 264577	2016-03-28 12:58:34 +00:00
Alexey Bataev	424be92831	[OPENMP] Allow runtime insert its own code inside OpenMP regions. Solution unifies interface of RegionCodeGenTy type to allow insert runtime-specific code before/after main codegen action defined in CGStmtOpenMP.cpp file. Runtime should not define its own RegionCodeGenTy for general OpenMP directives, but must be allowed to insert its own (required) code to support target specific codegen. llvm-svn: 264576	2016-03-28 12:52:58 +00:00
Alexey Bataev	f662b5943c	Revert "[OPENMP] Allow runtime insert its own code inside OpenMP regions." This reverts commit 3ee791165100607178073f14531a0dc90c622b36. llvm-svn: 264570	2016-03-28 10:12:03 +00:00
Alexey Bataev	b8c425c4f7	[OPENMP] Allow runtime insert its own code inside OpenMP regions. Solution unifies interface of RegionCodeGenTy type to allow insert runtime-specific code before/after main codegen action defined in CGStmtOpenMP.cpp file. Runtime should not define its own RegionCodeGenTy for general OpenMP directives, but must be allowed to insert its own (required) code to support target specific codegen. llvm-svn: 264569	2016-03-28 09:53:43 +00:00
David Majnemer	2041b46b76	Fix serialization/deserialization for __uuidof I broke this back in r264529 because I forgot to serialize the UuidAttr member. Fix this by replacing the UuidAttr with a StringRef which is properly serialized and deserialized. llvm-svn: 264562	2016-03-28 03:19:50 +00:00
David Majnemer	6568760090	Use the correct alignment for uuid descriptors The _GUID_ descriptors emitted by MSVC have alignment 8 for 64-bit builds: we should do the same if the linker picks the "wrong" COMDAT. llvm-svn: 264530	2016-03-27 04:46:14 +00:00
David Majnemer	1dbc7a7a5a	Improve the representation of CXXUuidofExpr Keep a pointer to the UuidAttr that the CXXUuidofExpr corresponds to. This makes translating from __uuidof to the underlying constant a lot more straightforward. llvm-svn: 264529	2016-03-27 04:46:07 +00:00
Duncan P. N. Exon Smith	f72d5b608d	CGLoopInfo: Use the MD_loop metadata kind from r264371, NFC Besides a small compile-time speedup, there should be no real functionality change here. llvm-svn: 264372	2016-03-25 00:38:14 +00:00
Easwaran Raman	8160812e26	Attach profile summary information to Module. Differential Revision: http://reviews.llvm.org/D18289 llvm-svn: 264342	2016-03-24 21:32:25 +00:00
Reid Kleckner	00381aa142	Revert "Recommitted r263425 "Supporting all entities declared in lexical scope in LLVM debug info." After fixing PR26942 (the fix is included in this commit)." This reverts commit r264281. This change breaks building Chromium for iOS. We'll upload a reproducer to the PR soon. llvm-svn: 264333	2016-03-24 20:38:43 +00:00
Amjad Aboud	abb04956b6	Recommitted r263425 "Supporting all entities declared in lexical scope in LLVM debug info." After fixing PR26942 (the fix is included in this commit). Differential Revision: http://reviews.llvm.org/D18350 llvm-svn: 264281	2016-03-24 13:30:41 +00:00
Xiuli Pan	972bea8a2e	[OpenCL] Add ocl and spir version for spir target Summary: Add opencl.spir.version and opencl.ocl.version metadata for CodeGen to identify OpenCL version. Reviewers: yaxunl, Anastasia Subscribers: cfe-commits, pekka.jaaskelainen Differential Revision: http://reviews.llvm.org/D17596 llvm-svn: 264241	2016-03-24 03:57:17 +00:00
Matt Arsenault	08087c52eb	Add missing __builtin_bitreverse8 Also add documentation for bitreverse builtins llvm-svn: 264203	2016-03-23 22:14:43 +00:00
Vasileios Kalintiris	e5c095923d	Fix warning about extra semicolon. NFC. llvm-svn: 264035	2016-03-22 10:41:20 +00:00
Arpith Chacko Jacob	5c309e475d	[OpenMP] Base support for target directive codegen on NVPTX device. Summary: This patch adds base support for codegen of the target directive on the NVPTX device. Reviewers: ABataev Differential Revision: http://reviews.llvm.org/D17877 Reworked test case after buildbot failure on windows. Updated patch to integrate r263837 and test case nvptx_target_firstprivate_codegen.cpp. llvm-svn: 264018	2016-03-22 01:48:56 +00:00
Justin Lebar	717d2b0a0d	[CUDA] Implement atomicInc and atomicDec builtins These functions cannot be implemented as atomicrmw or cmpxchg instructions, so they are implemented as a call to the NVVM intrinsics @llvm.nvvm.atomic.load.inc.32.p0i32 and @llvm.nvvm.atomic.load.dec.32.p0i32. Patch by Jason Henline. Reviewers: jlebar Differential Revision: http://reviews.llvm.org/D18322 llvm-svn: 264009	2016-03-22 00:09:28 +00:00
Pete Cooper	948677131f	Revert "Convert some ObjC msgSends to runtime calls." This reverts commit r263607. This change caused more objc_retain/objc_release calls in the IR but those are then incorrectly optimized by the ARC optimizer. Work is going to have to be done to ensure the ARC optimizer doesn't optimize user written RR, but that should land before this change. This change will also need to be updated to take account for any changes required to ensure that user written calls to RR are distinct from those inserted by ARC. llvm-svn: 263984	2016-03-21 20:50:03 +00:00
Faisal Vali	dc6b596ebb	[Cxx1z] Implement Lambda Capture of this by Value as [=,this] (P0018R3) Implement lambda capture of this by copy. For e.g.: struct A { int d = 10; auto foo() { return [this] (auto a) mutable { d+=a; return d; }; } }; auto L = A{}.foo(); // A{}'s lifetime is gone. // Below is still ok, because this was captured by value. assert(L(10) == 20); assert(L(100) == 120); If the capture was implicit, or [this] (i.e. this was captured by reference), this code would be otherwise undefined. Implementation Strategy: - amend the parser to accept this in the lambda introducer - add a new king of capture LCK_StarThis - teach Sema::CheckCXXThisCapture to handle by copy captures of the enclosing object (i.e. this) - when CheckCXXThisCapture does capture by copy, the corresponding initializer expression for the closure's data member direct-initializes it thus making a copy of 'this'. - in codegen, when assigning to CXXThisValue, if this was captured by copy, make sure it points to the corresponding field member, and not, unlike when captured by reference, what the field member points to. - mark feature as implemented in svn Much gratitude to Richard Smith for his carefully illuminating reviews! llvm-svn: 263921	2016-03-21 09:25:37 +00:00
Richard Smith	01694c340d	P0184R0: Allow types of 'begin' and 'end' expressions in range-based for loops to differ. llvm-svn: 263895	2016-03-20 10:33:40 +00:00
Manman Ren	5e5d046a4f	[TLS on Darwin] use CXX_FAST_TLS calling convention for tls_init. This makes sure we don't generate a lot of code to spill/reload CSRs when calling tls_init from the access functions. This helps performance when tls_init is not inlined into the access functions. llvm-svn: 263854	2016-03-18 23:35:21 +00:00
Carlo Bertolli	b74bfc80a4	[OPENMP] Implementation of codegen for firstprivate clause of target directive This patch implements the following aspects: It extends sema to check that a variable is not reference in both a map clause and firstprivate or private. This is needed to ensure correct functioning at codegen level, apart from being useful for the user. It implements firstprivate for target in codegen. The implementation applies to both host and nvptx devices. It adds regression tests for codegen of firstprivate, host and device side when using the host as device, and nvptx side. Please note that the regression test for nvptx codegen is missing VLAs. This is because VLAs currently require saving and restoring the stack which appears not to be a supported operation by nvptx backend. It adds a check in sema regression tests for target map, firstprivate, and private clauses. http://reviews.llvm.org/D18203 llvm-svn: 263837	2016-03-18 21:43:32 +00:00
Pirama Arumuga Nainar	8e2e9d6f4c	Add -fnative-half-arguments-and-returns Summary: r246764 handled __fp16 arguments and returns for AAPCS, but skipped this handling for OpenCL. Simlar to OpenCL, RenderScript also handles __fp16 type natively. This patch adds the -fnative-half-arguments-and-returns command line flag to allow such languages to skip this coercion of __fp16. Reviewers: srhines, olista01 Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D18138 llvm-svn: 263795	2016-03-18 16:58:36 +00:00
Arpith Chacko Jacob	129fa9a048	Revert r263783 as buildbot failure is being investigated. llvm-svn: 263784	2016-03-18 12:39:40 +00:00
Arpith Chacko Jacob	ac563708ab	[OpenMP] Base support for target directive codegen on NVPTX device. Summary: Reworked test case after buildbot failure on windows. This patch adds base support for codegen of the target directive on the NVPTX device. Reviewers: ABataev Differential Revision: http://reviews.llvm.org/D17877 llvm-svn: 263783	2016-03-18 11:47:43 +00:00
Reid Kleckner	4084504caa	Revert "For MS ABI, emit dllexport friend functions defined inline in class" This reverts commit r263738. This appears to cause a failure in CXX/temp/temp.decls/temp.friend/p1.cpp llvm-svn: 263740	2016-03-17 20:06:58 +00:00
Reid Kleckner	0f6caf66e9	For MS ABI, emit dllexport friend functions defined inline in class Summary: ...as that is apparently what MSVC does Reviewers: rnk Patch by Stephan Bergmann Differential Revision: http://reviews.llvm.org/D15267 llvm-svn: 263738	2016-03-17 19:52:20 +00:00
Alexey Bataev	a839dddf92	[OPENMP 4.0] Use 'declare reduction' constructs in 'reduction' clauses. OpenMP 4.0 allows to define custom reduction operations using '#pragma omp declare reduction' construct. Patch allows to use this custom defined reduction operations in 'reduction' clauses. llvm-svn: 263701	2016-03-17 10:19:46 +00:00
Carlo Bertolli	a03acfa359	[OPENMP] Support for codegen of private clause of target, host side This patch adds support for codegen of private clause of target and a regression test for host code generation, when the host is used as target device. I believe that code generation for nvptx backend would not require anything additional or different to what is done for the host. http://reviews.llvm.org/D18105 llvm-svn: 263654	2016-03-16 19:04:22 +00:00
Roman Levenstein	35aa5cecf2	Add attributes for preserve_mostcc/preserve_allcc calling conventions to the C/C++ front-end Till now, preserve_mostcc/preserve_allcc calling convention attributes were only available at the LLVM IR level. This patch adds attributes for preserve_mostcc/preserve_allcc calling conventions to the C/C++ front-end. The code was mostly written by Juergen Ributzka. I just added support for the AArch64 target and tests. Differential Revision: http://reviews.llvm.org/D18025 llvm-svn: 263647	2016-03-16 18:00:46 +00:00
Yaron Keren	ebd142692d	Directly get the canonical Type instead of going around through a CanQualType temporary, NFC. llvm-svn: 263635	2016-03-16 12:14:43 +00:00
Pete Cooper	be6c750a8e	Convert some ObjC msgSends to runtime calls. It is faster to directly call the ObjC runtime for methods such as retain/release instead of sending a message to those functions. This patch adds support for converting messages to retain/release/alloc/autorelease to their equivalent runtime calls. Tests included for the positive case of applying this transformation, negative tests that we ensure we only convert "alloc" to objc_alloc, not "alloc2", and also a driver test to ensure we enable this only for supported runtime versions. Reviewed by John McCall. Differential Revision: http://reviews.llvm.org/D14737 llvm-svn: 263607	2016-03-16 00:33:21 +00:00
Arpith Chacko Jacob	9cb61faa61	Revert commit http://reviews.llvm.org/D17877 to fix tests on x86. llvm-svn: 263589	2016-03-15 21:26:34 +00:00
Arpith Chacko Jacob	5e1493b560	[OpenMP] Base support for target directive codegen on NVPTX device. Summary: This patch adds base support for codegen of the target directive on the NVPTX device. Reviewers: ABataev Differential Revision: http://reviews.llvm.org/D17877 llvm-svn: 263587	2016-03-15 21:04:57 +00:00
Evgeniy Stepanov	02279ed12d	[cfi] Don't emit checks for disabled CFI kinds. In the cross-DSO CFI mode clang emits __cfi_check_fail that handles errors triggered from other modules with targets in the current module. With this change, __cfi_check_fail will handle errors for CFI kinds that are not enabled in the current module as if they have the trapping behaviour (-fsanitize-trap=...). This fixes a bug where some combinations of -fsanitize* flags may result in a link failure due to a missing sanitizer runtime library for the diagnostic calls in __cfi_check_fail. llvm-svn: 263578	2016-03-15 20:19:29 +00:00
Arpith Chacko Jacob	fc46c25d74	Reverted http://reviews.llvm.org/D17877 to fix tests. llvm-svn: 263555	2016-03-15 16:19:13 +00:00
Arpith Chacko Jacob	c61744c26b	[OpenMP] Base support for target directive codegen on NVPTX device. Summary: This patch adds base support for codegen of the target directive on the NVPTX device. Reviewers: ABataev Differential Revision: http://reviews.llvm.org/D17877 llvm-svn: 263552	2016-03-15 15:24:52 +00:00
Teresa Johnson	a0d29406cb	[ThinLTO] Clang side of renaming of function index (NFC) This is the companion to an LLVM patch that renamed the function index data structures and files to use the more general module summary index. (Recommit after fixing LLVM side to add back missed file) llvm-svn: 263514	2016-03-15 00:04:44 +00:00
Teresa Johnson	376b46128f	Revert "[ThinLTO] Clang side of renaming of function index (NFC)" This reverts commit r263491. Missed a file on the LLVM side. llvm-svn: 263494	2016-03-14 21:18:17 +00:00
Teresa Johnson	9c6cd5df8c	[ThinLTO] Clang side of renaming of function index (NFC) This is the companion to an LLVM patch that renamed the function index data structures and files to use the more general module summary index. llvm-svn: 263491	2016-03-14 21:06:06 +00:00
Peter Collingbourne	4e6a540024	CodeGen: Use 32-bit gep offsets to address vtable address points. The relative vtable ABI will use a struct rather than an array as the type of a vtable. LLVM only allows 32-bit integers as struct indices, so we need to use 32-bit integers to get addresses of address points. In order to keep the code simple, we might as well do that unconditionally. It's probably a reasonable implementation limit to support no more than 2 billion virtual functions per class. This change causes quite a bit of churn in the test suite, so I'm making it separately. Differential Revision: http://reviews.llvm.org/D18113 llvm-svn: 263469	2016-03-14 19:07:10 +00:00
Peter Collingbourne	0446e7cfae	CodeGen: Mark functions used in vtables as unnamed_addr. This marks virtual function declarations, as well as runtime library functions __cxa_pure_virtual, __cxa_deleted_virtual and _purecall, as unnamed_addr. This will allow us to correctly form relative references to them from vtables in the relative vtable ABI. Differential Revision: http://reviews.llvm.org/D18071 llvm-svn: 263464	2016-03-14 18:41:59 +00:00
Benjamin Kramer	35bc38af20	Revert "Recommitted r261634 "Supporting all entities declared in lexical scope in LLVM debug info." After fixing PR26715 at r263379." This reverts commit r263425. Breaks self-host. llvm-svn: 263436	2016-03-14 14:58:28 +00:00
Amjad Aboud	22c997deb6	Recommitted r261634 "Supporting all entities declared in lexical scope in LLVM debug info." After fixing PR26715 at r263379. llvm-svn: 263425	2016-03-14 12:03:55 +00:00
Mehdi Amini	557c20a886	Remove compile time PreserveName in favor of a runtime cc1 -discard-value-names option Summary: This flag is enabled by default in the driver when NDEBUG is set. It is forwarded on the LLVMContext to discard all value names (but GlobalValue) for performance purpose. This an improved version of D18024 Reviewers: echristo, chandlerc Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D18127 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 263394	2016-03-13 21:05:23 +00:00
Eric Christopher	02e3dd4b2e	Temporarily revert these patches: commit 60d9845f6a037122d9be9a6d92d4de617ef45b04 Author: Mehdi Amini <mehdi.amini@apple.com> Date: Fri Mar 11 18:48:02 2016 +0000 Fix clang crash: when CodeGenAction is initialized without a context, use the member and not the parameter From: Mehdi Amini <mehdi.amini@apple.com> git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@263273 91177308-0d34-0410-b5e6-96231b3b80d8 commit af7ce3bf04a75ad5124b457b805df26006bd215b Author: Mehdi Amini <mehdi.amini@apple.com> Date: Fri Mar 11 17:32:58 2016 +0000 Fix build: use -> with pointers and not . Silly typo. From: Mehdi Amini <mehdi.amini@apple.com> git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@263267 91177308-0d34-0410-b5e6-96231b3b80d8 commit d0eea119192814954e7368c77d0dc5a9eeec1fbb Author: Mehdi Amini <mehdi.amini@apple.com> Date: Fri Mar 11 17:15:44 2016 +0000 Remove compile time PreserveName switch based on NDEBUG Summary: Following r263086, we are now relying on a flag on the Context to discard Value names in release builds. Reviewers: chandlerc Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D18024 From: Mehdi Amini <mehdi.amini@apple.com> git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@263257 91177308-0d34-0410-b5e6-96231b3b80d8 until we can fix the Release builds. This reverts commits 263257, 263267, 263273 llvm-svn: 263320	2016-03-12 01:47:11 +00:00
Mehdi Amini	4661b39e33	Fix clang crash: when CodeGenAction is initialized without a context, use the member and not the parameter From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 263273	2016-03-11 18:48:02 +00:00
Reid Kleckner	e10b601537	[SEH] Remove nounwind/noinline from outlined finally funclets With the new EH representation this is no longer necessary. llvm-svn: 263269	2016-03-11 17:36:16 +00:00
Mehdi Amini	7850f596e2	Fix build: use -> with pointers and not . Silly typo. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 263267	2016-03-11 17:32:58 +00:00
Mehdi Amini	e803fc3276	Remove compile time PreserveName switch based on NDEBUG Summary: Following r263086, we are now relying on a flag on the Context to discard Value names in release builds. Reviewers: chandlerc Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D18024 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 263257	2016-03-11 17:15:44 +00:00
Chandler Carruth	4ddaadca76	Update to include the new header file providing createGVNPass. llvm-svn: 263210	2016-03-11 09:02:43 +00:00
John McCall	f26e73df75	Add a coerce-and-expand ABIArgInfo as a generalization of some of the things we do with Expand / Direct. NFC for now, but this will be used by swiftcall expansion. llvm-svn: 263192	2016-03-11 04:30:43 +00:00
John McCall	c56a8b3284	Preserve ExtParameterInfos into CGFunctionInfo. As part of this, make the function-arrangement interfaces a little simpler and more semantic. NFC. llvm-svn: 263191	2016-03-11 04:30:31 +00:00
Alexey Samsonov	ae81bbb496	EmitCXXStructorCall -> EmitCXXDestructorCall. NFC. This function is only used in Microsoft ABI and only to emit destructors. Rename/simplify it accordingly. llvm-svn: 263081	2016-03-10 00:20:37 +00:00
Alexey Samsonov	efa956cea0	Remove unused function arguments. NFC. llvm-svn: 263080	2016-03-10 00:20:33 +00:00
Kit Barton	fbab158767	[PPC] FE support for generating VSX [negated] absolute value instructions Includes new built-in, conversion of built-in to target-independent intrinsic and update in the header file. Tests are also updated. There is a second part in the backend for which I will post a separate code-review. BACKEND PART SHOULD BE COMMITTED FIRST. Phabricator: http://reviews.llvm.org/D17816 llvm-svn: 263051	2016-03-09 19:28:31 +00:00
Alexey Bataev	ef549a8955	[OPENMP 4.5] Codegen for data members in 'linear' clause OpenMP 4.5 allows privatization of non-static data members in OpenMP constructs. Patch adds proper codegen support for data members in 'linear' clause llvm-svn: 263003	2016-03-09 09:49:09 +00:00
Alexey Bataev	78849fb464	[OPENMP 4.5] Codegen for data members in 'linear' clause. OpenMP 4.5 allows to use data members in private clauses. Patch adds codegen support for 'linear' clause. llvm-svn: 263002	2016-03-09 09:49:00 +00:00
Richard Smith	6365e46459	Fix -Werror build. llvm-svn: 262965	2016-03-08 23:16:16 +00:00
Richard Smith	872307e2ac	P0017R1: In C++1z, an aggregate class can have (public non-virtual) base classes; these are initialized as if they were data members. llvm-svn: 262963	2016-03-08 22:17:41 +00:00
Adrian Prantl	5a9a42778a	Module Debugging: Fix a crash when emitting debug info for nested tag types whose DeclContext is not yet complete by deferring their emission. rdar://problem/24918680 llvm-svn: 262851	2016-03-07 20:58:52 +00:00
Carlo Bertolli	0ff587d4a4	[OPENMP] Codegen for distribute directive: fix bug in ordering of parameters. llvm-svn: 262833	2016-03-07 16:19:13 +00:00
Carlo Bertolli	fc35ad2bbc	Reapply r262741 [OPENMP] Codegen for distribute directive This patch provide basic implementation of codegen for teams directive, excluding all clauses except dist_schedule. It also fixes parts of AST reader/writer to enable correct pre-compiled header handling. http://reviews.llvm.org/D17170 llvm-svn: 262832	2016-03-07 16:04:49 +00:00
Amjad Aboud	faea560286	Resolved Bug 26414. https://llvm.org/bugs/show_bug.cgi?id=26414 Since interrupt handler must be returned with iret, tail call can't be used. Differential Revision: http://reviews.llvm.org/D17853 llvm-svn: 262830	2016-03-07 14:22:46 +00:00
Simon Pilgrim	1feae0d0bb	Fixed -Wdocumentation warning - typo in a parameter name llvm-svn: 262783	2016-03-05 22:35:55 +00:00
Samuel Antao	bf4d18d3d2	Revert r262741 - [OPENMP] Codegen for distribute directive Was causing a failure in one of the buildbot slaves. llvm-svn: 262744	2016-03-04 21:02:14 +00:00
Carlo Bertolli	4a56e3831d	[OPENMP] Codegen for distribute directive This patch provide basic implementation of codegen for teams directive, excluding all clauses except dist_schedule. It also fixes parts of AST reader/writer to enable correct pre-compiled header handling. http://reviews.llvm.org/D17170 llvm-svn: 262741	2016-03-04 20:24:58 +00:00
James Y Knight	b214cbc785	Make TargetInfo store an actual DataLayout instead of a string. Use it to calculate UserLabelPrefix, instead of specifying it (often incorrectly). Note that the actual user label prefix has always come from the DataLayout, and is handled within LLVM. The main thing clang's TargetInfo::UserLabelPrefix did was to set the #define value. Having these be different from each-other is just silly. Differential Revision: http://reviews.llvm.org/D17183 llvm-svn: 262737	2016-03-04 19:00:41 +00:00
Alexey Bataev	c5b1d320b8	[OPENMP 4.0] Codegen for 'declare reduction' construct. Emit function for 'combiner' part of 'declare reduction' construct and 'initialilzer' part, if any. llvm-svn: 262699	2016-03-04 09:22:22 +00:00
Vedant Kumar	22bd871ea6	[Coverage] Fix the start/end locations of switch statements While pushing switch statements onto the region stack we neglected to specify their start/end locations. This results in a crash (PR26825) if we end up in nested macro expansions without enough information to handle the relevant file exits. I added a test in switchmacro.c and fixed up a bunch of incorrect CHECK lines that specify strange end locations for switches. llvm-svn: 262697	2016-03-04 08:07:15 +00:00
David Majnemer	e2ae228c76	[X86] Pass __m64 types via SSE registers for GCC compatibility For compatibility with GCC, classify __m64 as SSE. However, clang is a platform compiler for certain targets; retain our old behavior on those targets: classify __m64 as integer. This fixes PR26832. llvm-svn: 262688	2016-03-04 05:26:16 +00:00
Carlo Bertolli	6ad7b5aff2	[OPENMP] firstprivate and private clauses of teams, host codegeneration Add code generation support for firstprivate and private clauses of teams on the host. Add extensive regression tests including lambda functions and vla testing. http://reviews.llvm.org/D17582 llvm-svn: 262663	2016-03-03 22:09:40 +00:00
Carlo Bertolli	430d8ecc55	Add code generation for teams directive inside target region llvm-svn: 262652	2016-03-03 20:34:23 +00:00
Samuel Antao	b68e2db8f9	[OpenMP] Code generation for teams - kernel launching Summary: This patch implements the launching of a target region in the presence of a nested teams region, i.e calls tgt_target_teams with the required arguments gathered from the enclosed teams directive. The actual codegen of the region enclosed by the teams construct will be contributed in a separate patch. Reviewers: hfinkel, arpith-jacob, kkwli0, carlo.bertolli, ABataev Subscribers: cfe-commits, caomhin, fraggamuffin Differential Revision: http://reviews.llvm.org/D17019 llvm-svn: 262625	2016-03-03 16:20:23 +00:00
Alexey Bataev	94a4f0cb5f	[OPENMP 4.0] Initial support for 'omp declare reduction' construct. Add parsing, sema analysis and serialization/deserialization for 'declare reduction' construct. User-defined reductions are defined as #pragma omp declare reduction( reduction-identifier : typename-list : combiner ) [initializer ( initializer-expr )] These custom reductions may be used in 'reduction' clauses of OpenMP constructs. The combiner specifies how partial results can be combined into a single value. The combiner can use the special variable identifiers omp_in and omp_out that are of the type of the variables being reduced with this reduction-identifier. Each of them will denote one of the values to be combined before executing the combiner. It is assumed that the special omp_out identifier will refer to the storage that holds the resulting combined value after executing the combiner. As the initializer-expr value of a user-defined reduction is not known a priori the initializer-clause can be used to specify one. Then the contents of the initializer-clause will be used as the initializer for private copies of reduction list items where the omp_priv identifier will refer to the storage to be initialized. The special identifier omp_orig can also appear in the initializer-clause and it will refer to the storage of the original variable to be reduced. Differential Revision: http://reviews.llvm.org/D11182 llvm-svn: 262582	2016-03-03 05:21:39 +00:00
Alexey Bataev	2bbf7217ea	[OPENMP 4.5] Initial support for data members in 'linear' clause. OpenMP 4.5 allows to privatize data members of current class in member functions. Patch adds initial support for privatization of data members in 'linear' clause, no codegen support. llvm-svn: 262578	2016-03-03 03:52:24 +00:00
Rong Xu	9c6f1538cc	[PGO] Change profile use cc1 option to handle IR level profiles This patch changes cc1 option for PGO profile use from -fprofile-instr-use=<path> to -fprofile-instrument-use-path=<path>. -fprofile-instr-use=<path> is now a driver only option. In addition to decouple the cc1 option from the driver level option, this patch also enables IR level profile use. cc1 option handling now reads the profile header and sets CodeGenOpt ProfileUse (valid values are {None, Clang, LLVM} -- this is a common enum for -fprofile-instrument={}, for the profile instrumentation), and invoke the pipeline to enable the respective PGO use pass. Reviewers: silvas, davidxl Differential Revision: http://reviews.llvm.org/D17737 llvm-svn: 262515	2016-03-02 20:59:36 +00:00
Nico Weber	cbbaeb1307	Serialize `#pragma detect_mismatch`. This is like r262493, but for pragma detect_mismatch instead of pragma comment. The two pragmas have similar behavior, so use the same approach for both. llvm-svn: 262506	2016-03-02 19:28:54 +00:00
Artem Belevich	8c1ec1ef38	[CUDA] Do not generate unnecessary runtime init code. Differential Revision: http://reviews.llvm.org/D17780 llvm-svn: 262499	2016-03-02 18:28:53 +00:00
Artem Belevich	42e1949b46	[CUDA] Emit host-side 'shadows' for device-side global variables ... and register them with CUDA runtime. This is needed for commonly used cudaMemcpy*() APIs that use address of host-side shadow to access their counterparts on device side. Fixes PR26340 Differential Revision: http://reviews.llvm.org/D17779 llvm-svn: 262498	2016-03-02 18:28:50 +00:00
Nico Weber	6622029d5e	Serialize `#pragma comment`. `#pragma comment` was handled by Sema calling a function on ASTConsumer, and CodeGen then implementing this function and writing things to its output. Instead, introduce a PragmaCommentDecl AST node and hang one off the TranslationUnitDecl for every `#pragma comment` line, and then use the regular serialization machinery. (Since PragmaCommentDecl has codegen relevance, it's eagerly deserialized.) http://reviews.llvm.org/D17799 llvm-svn: 262493	2016-03-02 17:28:48 +00:00
David Majnemer	ec4b7341cc	[Sema] PR26444 fix crash when alignment value is >= 216 Sema allows max values up to 228, use unsigned instead of unsiged short to hold values that large. Differential Revision: http://reviews.llvm.org/D17248 Patch by Don Hinton! llvm-svn: 262466	2016-03-02 06:48:47 +00:00
Alexey Bataev	61205070c4	[OPENMP 4.5] Codegen for data members in 'reduction' clause. OpenMP 4.5 allows to privatize non-static data members of current class in non-static member functions. Patch supports codegen for non-static data members in 'reduction' clauses. llvm-svn: 262460	2016-03-02 04:57:40 +00:00
Reid Kleckner	8f1b1f5921	Reword a misleading comment discussing landingpads and SEH SEH doesn't use landingpads anymore. llvm-svn: 262382	2016-03-01 19:51:48 +00:00
David Majnemer	25eb165f18	[MSVC Compat] Correctly handle finallys nested within finallys We'd lose track of the parent CodeGenFunction, leading us to get confused with regard to which function a nested finally belonged to. Differential Revision: http://reviews.llvm.org/D17752 llvm-svn: 262379	2016-03-01 19:42:53 +00:00
Rong Xu	522b5cb375	[PGO] clang cc1 option change to enable IR level instrumentation This patch expands cc1 option -fprofile-instrument= with a new value: -fprofile-instrument=llvm which enables IR level PGO instrumentation. Reviewers: davidxl, silvas Differential Revision: http://reviews.llvm.org/D17622 llvm-svn: 262239	2016-02-29 18:54:59 +00:00
Matt Arsenault	2d9339890f	Add __builtin_canonicalize llvm-svn: 262122	2016-02-27 09:06:18 +00:00
Xiuli Pan	11e13f60ea	[OpenCL] Refine OpenCLImageAccessAttr to OpenCLAccessAttr Summary: OpenCL access qualifiers are now not only used for image types, refine it to avoid misleading, Add semacheck for OpenCL access qualifier as well as test caees. Reviewers: pekka.jaaskelainen, Anastasia, aaron.ballman Subscribers: aaron.ballman, cfe-commits Differential Revision: http://reviews.llvm.org/D16040 llvm-svn: 261961	2016-02-26 03:13:03 +00:00
Alexey Bataev	005248ac8a	[OPENMP 4.5] Codegen for member decls in 'lastprivate' clause. OpenMP 4.5 allows to privatize non-static member decls in non-static member functions. Patch captures such decls by reference in general (for bitfields, by value) and then operates with this capture. For bitfields, at the end of codegen for lastprivates original bitfield is updated with the value of captured copy. llvm-svn: 261824	2016-02-25 05:25:57 +00:00
Justin Lebar	ddd97faeec	[CUDA] Mark all CUDA device-side function defs, decls, and calls as convergent. Summary: This is important for e.g. the following case: void sync() { __syncthreads(); } void foo() { do_something(); sync(); do_something_else(): } Without this change, if the optimizer does not inline sync() (which it won't because __syncthreads is also marked as noduplicate, for now anyway), it is free to perform optimizations on sync() that it would not be able to perform on __syncthreads(), because sync() is not marked as convergent. Similarly, we need a notion of convergent calls, since in the case when we can't statically determine a call's target(s), we need to know whether it's safe to perform optimizations around the call. This change is conservative; the optimizer will remove these attrs where it can, see r260318, r260319. Reviewers: majnemer Subscribers: cfe-commits, jhen, echristo, tra Differential Revision: http://reviews.llvm.org/D17056 llvm-svn: 261779	2016-02-24 21:55:11 +00:00
Peter Collingbourne	fb532b9a34	Add whole-program vtable optimization feature to Clang. This patch introduces the -fwhole-program-vtables flag, which enables the whole-program vtable optimization feature (D16795) in Clang. Differential Revision: http://reviews.llvm.org/D16821 llvm-svn: 261767	2016-02-24 20:46:36 +00:00
Manman Ren	42ff39051d	Objective-C: Add a size field to non-fragile category metadata. This is mainly for extensibility. Note that fragile category metadata, metadata for classes and protocols all have a size field. Initial patch was provided by Greg Parker. rdar://problem/24804226 llvm-svn: 261756	2016-02-24 17:49:50 +00:00
David Majnemer	971d31be6f	[WinEH] Make sure terminate handlers have funclet operands Calls to the terminate handler must be annotated within the exception region they are within. llvm-svn: 261751	2016-02-24 17:02:45 +00:00
James Y Knight	29b5f086ca	Default vaarg lowering should support indirect struct types. Fixes PR11517 for SPARC. On most targets, clang lowers va_arg itself, eschewing the use of the llvm vaarg instruction. This is necessary (at least for now) as the type argument to the vaarg instruction cannot represent all the ABI information that is needed to support complex calling conventions. However, on targets with a simpler varrags ABIs, the LLVM instruction can work just fine, and clang can simply lower to it. Unfortunately, even on such targets, vaarg with a struct argument would fail, because the default lowering to vaarg was naive: it didn't take into account the ABI attribute computed by classifyArgumentType. In particular, for the DefaultABIInfo, structs are supposed to be passed indirectly and so llvm's vaarg instruction should be emitted with a pointer argument. Now, vaarg instruction emission is able to use computed ABIArgInfo for the provided argument type, which allows the default ABI support to work for structs too. I haven't touched the EmitVAArg implementation for PPC32_SVR4 or XCore, although I believe both are now redundant, and could be switched over to use the default implementation as well. Differential Revision: http://reviews.llvm.org/D16154 llvm-svn: 261717	2016-02-24 02:59:33 +00:00
Adrian Prantl	4594606c00	Reapply r261657. Remove an unnecessary workaround introduced in r259975. (NFC) Now that LLVM r259973 allows replacing a temporary type with another temporary we can rely on the original implementation. It is possible for enums to be created as part of their own declcontext. In this case a FwdDecl will be created twice. This doesn't cause a problem because both FwdDecls are entered into the ReplaceMap: finalize() will replace the first FwdDecl with the second and then replace the second with complete type. Thanks to echristo for pointing this out. # Conflicts: # lib/CodeGen/CGDebugInfo.cpp llvm-svn: 261673	2016-02-23 19:30:08 +00:00
Hans Wennborg	8a118b63d8	Revert r261634 "Supporting all entities declared in lexical scope in LLVM debug info." and r261657 r261634 and r261633 seems to have caused PR26715. r261657 depends on the former two. llvm-svn: 261670	2016-02-23 19:10:16 +00:00
Adrian Prantl	cc43f581b4	Remove an unnecessary workaround introduced in r259975. (NFC) Now that LLVM r259973 allows replacing a temporary type with another temporary we can rely on the original implementation. It is possible for enums to be created as part of their own declcontext. In this case a FwdDecl will be created twice. This doesn't cause a problem because both FwdDecls are entered into the ReplaceMap: finalize() will replace the first FwdDecl with the second and then replace the second with complete type. Thanks to echristo for pointing this out. llvm-svn: 261657	2016-02-23 17:13:47 +00:00
Amjad Aboud	30e7a8f694	Supporting all entities declared in lexical scope in LLVM debug info. Differential Revision: http://reviews.llvm.org/D15977 llvm-svn: 261634	2016-02-23 13:37:18 +00:00
Dan Gohman	1fcd10ca4e	[WebAssembly] Lower va_arg in clang. This uses the general emitVoidPtrVAArg lowering logic for everything, since this supports all types, and we don't have any special requirements. llvm-svn: 261557	2016-02-22 19:17:40 +00:00
David Majnemer	46e39cc6b0	[MS ABI] Correctly handle dllimport'd explicit instantiation declaration w/ vbases We gave a VBTable dllimport storage class and external linkage while also providing an initializer. An initializer is only valid if the VBTable has available_externally linkage. Fix this by setting the linkage to available_externally in situ while generating the initializer. This fixes PR26686. llvm-svn: 261535	2016-02-22 17:22:08 +00:00
Nirav Dave	9a8f97e967	Add support for Android Vector calling convention for AArch64 This modification applies the following Android commit when we have an Android environment. This is the sole non-renderscript in the Android repo commit 9212d4fb30a3ca2f4ee966dd2748c35573d9682c Author: Tim Murray <timmurray@google.com> Date: Fri Aug 15 16:00:15 2014 -0700 Update vector calling convention for AArch64. bug 16846318 Change-Id: I3cfd167758b4bd634d8480ee6ba6bb55d61f82a7 Reviewers: srhines, jyknight Subscribers: mcrosier, aemerson, rengolin, tberghammer, danalbert, srhines Differential Revision: http://reviews.llvm.org/D17448 llvm-svn: 261533	2016-02-22 16:48:42 +00:00
Manman Ren	92e0a71589	Class Property: Fix a crash with old ABI when generating metadata in classes. rdar://23891898 llvm-svn: 261466	2016-02-21 05:31:05 +00:00
Roman Divacky	039b970c97	Fix handling of vaargs on PPC32 when going from regsave to overflow. It can happen that when we only have 1 more register left in the regsave area we need to store a value bigger than 1 register and therefore we go to the overflow area. In this case we have to leave the last slot in the regsave area unused and keep using overflow area. Do this by storing a limit value to the used register counter in the overflow block. Issue diagnosed by and solution tested by Mark Millard! llvm-svn: 261422	2016-02-20 08:31:24 +00:00
Anastasia Stulova	6bdbcbb3d9	[OpenCL] Generate metadata for opencl_unroll_hint attribute Add support for opencl_unroll_hint attribute from OpenCL v2.0 s6.11.5. Reusing most of metadata generation from CGLoopInfo helper class. The code is based on Khronos OpenCL compiler: https://github.com/KhronosGroup/SPIR/tree/spirv-1.0 Patch by Liu Yaxun (Sam)! Differential Revision: http://reviews.llvm.org/D16686 llvm-svn: 261350	2016-02-19 18:30:11 +00:00
Alexey Bataev	50b3c95992	[OPENMP] Improved layout of CGOpenMPRuntime class, NFC. llvm-svn: 261315	2016-02-19 10:38:26 +00:00
Richard Trieu	cc3949d99a	Remove use of builtin comma operator. Cleanup for upcoming Clang warning -Wcomma. No functionality change intended. llvm-svn: 261271	2016-02-18 22:34:54 +00:00
Serge Pavlov	41c1f79dfe	Avoid double deletion in Clang driver. Llvm module object is shared between CodeGenerator and BackendConsumer, in both classes it is stored as std::unique_ptr, which is not a good design solution and can cause double deletion error. Usually it does not occur because in BackendConsumer::HandleTranslationUnit the ownership of CodeGenerator over the module is taken away. If however this method is not called, the module is deleted twice and compiler crashes. As the module owned by BackendConsumer is always the same as CodeGenerator has, pointer to llvm module can be removed from BackendGenerator. Differential Revision: http://reviews.llvm.org/D15450 llvm-svn: 261222	2016-02-18 16:42:09 +00:00
Alexey Bataev	8ffcc949b1	[OPENMP] Fix codegen for lastprivate loop counters. Patch fixes bug with codegen for lastprivate loop counters. Also it may improve performance for lastprivates calculations in some cases. llvm-svn: 261209	2016-02-18 13:48:15 +00:00
Kostya Serebryany	d4590c7304	[sanitizer-coverage] implement -fsanitize-coverage=trace-pc. This is similar to trace-bb, but has a different API. We already use the equivalent flag in GCC for Linux kernel fuzzing. We may be able to use this flag with AFL too llvm-svn: 261159	2016-02-17 21:34:43 +00:00
Akira Hatanaka	9d8ac61fec	[CodeGen] Fix an assert in CodeGenFunction::EmitFunctionEpilog The assert is triggered because isObjCRetainableType() is called on the canonicalized return type that has been stripped of the typedefs and attributes attached to it. To fix this assert, this commit gets the original return type from CurCodeDecl or BlockInfo and uses it instead of the canoicalized type. rdar://problem/24470031 Differential Revision: http://reviews.llvm.org/D16914 llvm-svn: 261151	2016-02-17 21:09:50 +00:00
Alexey Bataev	417089fc7e	[OPENMP 4.5] Codegen support for data members in 'firstprivate' clause. Added codegen for captured data members in non-static member functions. llvm-svn: 261089	2016-02-17 13:19:37 +00:00
Mehdi Amini	a7c0940d72	Teach clang to use the ThinLTO pipeline Summary: Use the new pipeline implemented in D17115 Reviewers: tejohnson Subscribers: joker.eph, cfe-commits Differential Revision: http://reviews.llvm.org/D17272 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 261045	2016-02-17 00:42:20 +00:00
Alexey Bataev	3392d76081	[OPENMP] Improved handling of pseudo-captured expressions in OpenMP. Expressions inside 'schedule'\|'dist_schedule' clause must be captured in combined directives to avoid possible crash during codegen. Patch improves handling of such constructs llvm-svn: 260954	2016-02-16 11:18:12 +00:00
Alexey Bataev	cd8b6a2cf1	[OPENMP] Remove extra sync barriers for 'firstprivate' clause. Sync barrier will be emitted after generation of firstprivate variables only if one of the firstprivate vars is used in lastprivate clause. llvm-svn: 260877	2016-02-15 08:07:17 +00:00
Samuel Antao	2de62b0c89	[OpenMP] Rename the offload entry points. Summary: Unlike other outlined regions in OpenMP, offloading entry points have to have be visible (external linkage) for the device side. Using dots in the names of the entries can be therefore problematic for some toolchains, e.g. NVPTX. Also the patch drops the column information in the unique name of the entry points. The parsing of directives ignore unknown tokens, preventing several target regions to be implemented in the same line. Therefore, the line information is sufficient for the name to be unique. Also, the preprocessor printer does not preserve the column information, causing offloading-entry detection issues if the host uses an integrated preprocessor and the target doesn't (or vice versa). Reviewers: hfinkel, arpith-jacob, carlo.bertolli, kkwli0, ABataev Subscribers: cfe-commits, fraggamuffin, caomhin Differential Revision: http://reviews.llvm.org/D17179 llvm-svn: 260837	2016-02-13 23:35:10 +00:00
Benjamin Kramer	0bb97746a8	RValue refs do not work that way. llvm-svn: 260823	2016-02-13 16:00:13 +00:00
Benjamin Kramer	0772c42385	Reduce the number of implicit StringRef->std::string conversions by threading StringRef through more APIs. No functionality change intended. llvm-svn: 260815	2016-02-13 13:42:54 +00:00
Matt Arsenault	9b277b4ad4	AMDGPU: Add sin/cos builtins llvm-svn: 260783	2016-02-13 01:21:09 +00:00
Matt Arsenault	f5c1f47181	AMDGPU: Update builtin for intrinsic change llvm-svn: 260781	2016-02-13 01:03:09 +00:00

... 2 3 4 5 6 ...

9904 Commits