llvm-project

Commit Graph

Author	SHA1	Message	Date
Samuel Antao	3f465c095b	[OpenMP] Reorganize code to allow specialized code generation for different devices. Summary: Different devices may in some cases require different code generation schemes in order to implement OpenMP. This is required not only for performance reasons, but also because it may not be possible to have the current (default) implementation working for these devices. E.g. GPU's cannot implement the same scheme a target such as powerpc or x86b would use, in the sense that it does not have the ability to fork threads, instead all the threads are always executing and need to be managed by the implementation. This patch proposes a reorganization of the code in the OpenMP code generation to pave the way to have specialized implementation of OpenMP support. More than a "real" patch this is more a request for comments in order to understand if what is proposed is acceptable or if there are better/easier ways to do it. In this patch part of the common OpenMP codegen infrastructure is moved to a new file under a new namespace (CGOpenMPCommon) so it can be shared between the default implementation and the specialized one. When CGOpenMPRuntime is created, an attempt to select a specialized implementation is done. In the patch a specialization for nvptx targets is done which currently checks if the target is an OpenMP device and trap if it is not. Let me know comments suggestions you may have. Reviewers: hfinkel, carlo.bertolli, arpith-jacob, kkwli0, ABataev Subscribers: Hahnfeld, cfe-commits, fraggamuffin, caomhin, jholewinski Differential Revision: http://reviews.llvm.org/D16784 llvm-svn: 259977	2016-02-06 02:12:34 +00:00
Evgeniy Stepanov	f31ea30694	[cfi] Safe handling of unaddressable vtable pointers (clang). Avoid crashing when printing diagnostics for vtable-related CFI errors. In diagnostic mode, the frontend does an additional check of the vtable pointer against the set of all known vtable addresses and lets the runtime handler know if it is safe to inspect the vtable. http://reviews.llvm.org/D16823 llvm-svn: 259716	2016-02-03 22:18:55 +00:00
Artem Belevich	97c01c35f8	[CUDA] Do not allow dynamic initialization of global device side variables. In general CUDA does not allow dynamic initialization of global device-side variables. One exception is that CUDA allows records with empty constructors as described in section E2.2.1 of CUDA 7.5 Programming guide. This patch applies initializer checks for all device-side variables. Empty constructors are accepted, but no code is generated for them. Differential Revision: http://reviews.llvm.org/D15305 llvm-svn: 259592	2016-02-02 22:29:48 +00:00
Benjamin Kramer	8c30592e18	Move DebugInfoKind into its own header to cut the cyclic dependency edge from Driver to Frontend. llvm-svn: 259489	2016-02-02 11:06:51 +00:00
Eric Christopher	d160c50f6b	Use a consistent spelling for vtables. llvm-svn: 259137	2016-01-29 01:35:53 +00:00
David Majnemer	929025d1a6	[MS ABI] Allow a member pointers' converted type to change Member pointers in the MS ABI are tricky for a variety of reasons. The size of a member pointer is indeterminate until the program reaches a point where the representation is required to be known. However, pointers to member pointers may exist without knowing the pointee type's representation. In these cases, we synthesize an opaque LLVM type for the pointee type. However, we can be in a situation where the underlying member pointer's representation became known mid-way through the program. To account for this, we attempted to manicure CodeGen's type-cache so that we can replace the opaque member pointer type with the real deal while leaving the pointer types unperturbed. This, unfortunately, is a problematic approach to take as we will violate CodeGen's invariants. These violations are mostly harmless but let's do the right thing instead: invalidate the type-cache if a member pointer's LLVM representation changes. This fixes PR26313. llvm-svn: 258839	2016-01-26 19:30:26 +00:00
Evgeniy Stepanov	3fd61df186	[cfi] Cross-DSO CFI diagnostic mode (clang part) * Runtime diagnostic data for cfi-icall changed to match the rest of cfi checks * Layout of all CFI diagnostic data changed to put Kind at the beginning. There is no ABI stability promise yet. * Call cfi_slowpath_diag instead of cfi_slowpath when needed. * Emit __cfi_check_fail function, which dispatches a CFI check faliure according to trap/recover settings of the current module. * A tiny driver change to match the way the new handlers are done in compiler-rt. llvm-svn: 258745	2016-01-25 23:34:52 +00:00
Justin Lebar	cd2f6bbd5c	[CUDA] Don't generate aliases for static extern "C" functions. Summary: These aliases are done to support inline asm, but there's nothing we can do: NVPTX doesn't support aliases. Reviewers: tra Subscribers: cfe-commits, jhen, echristo Differential Revision: http://reviews.llvm.org/D16501 llvm-svn: 258734	2016-01-25 22:36:37 +00:00
Peter Collingbourne	dc13453128	Introduce -fsanitize-stats flag. This is part of a new statistics gathering feature for the sanitizers. See clang/docs/SanitizerStats.rst for further info and docs. Differential Revision: http://reviews.llvm.org/D16175 llvm-svn: 257971	2016-01-16 00:31:22 +00:00
Andrey Bokhanko	7aa88ce70c	PR25910: clang allows two var definitions with the same mangled name Proper diagnostic and resolution of mangled names' conflicts in variables. When there is a declaration and a definition using the same name but different types, we emit what is in the definition. When there are two conflicting definitions, we issue an error. Differential Revision: http://reviews.llvm.org/D15686 llvm-svn: 257754	2016-01-14 10:41:16 +00:00
David Majnemer	d2385c08cf	[MS ABI] Complete and base constructor GlobalDecls must have the same name Clang got itself into the situation where we mangled the same constructor twice with two different constructor types. After one of the constructors were utilized, the tag used for one of the types changed from class to struct because a class template became complete. This resulted in one of the constructor types varying from the other constructor. Instead, force "base" constructor types to "complete" if the ABI doesn't have constructor variants. This will ensure that GlobalDecls for both variants will get the same mangled name. This fixes PR26029. llvm-svn: 257205	2016-01-08 20:48:26 +00:00
Chad Rosier	7dbc9cf876	[Driver] Add support for -fno-builtin-foo options. Addresses PR4941 and rdar://6756912. http://reviews.llvm.org/D15195 llvm-svn: 256937	2016-01-06 14:35:46 +00:00
Samuel Antao	ee8fb302f5	[OpenMP] Reapply rL256842: [OpenMP] Offloading descriptor registration and device codegen. This patch attempts to fix the regressions identified when the patch was committed initially. Thanks to Michael Liao for identifying the fix in the offloading metadata generation related with side effects in evaluation of function arguments. llvm-svn: 256933	2016-01-06 13:42:12 +00:00
Samuel Antao	7d5de9a1ee	[OpenMP] Revert rL256842: [OpenMP] Offloading descriptor registration and device codegen. It was causing two regression, so I'm reverting until the cause is found. llvm-svn: 256858	2016-01-05 19:16:13 +00:00
Samuel Antao	4d5f0bbea1	[OpenMP] Offloading descriptor registration and device codegen. Summary: In order to offloading work properly two things need to be in place: - a descriptor with all the offloading information (device entry functions, and global variable) has to be created by the host and registered in the OpenMP offloading runtime library. - all the device functions need to be emitted for the device and a convention has to be in place so that the runtime library can easily map the host ID of an entry point with the actual function in the device. This patch adds support for these two things. However, only entry functions are being registered given that 'declare target' directive is not yet implemented. About offloading descriptor: The details of the descriptor are explained with more detail in http://goo.gl/L1rnKJ. Basically the descriptor will have fields that specify the number of devices, the pointers to where the device images begin and end (that will be defined by the linker), and also pointers to a the begin and end of table whose entries contain information about a specific entry point. Each entry has the type: ``` struct __tgt_offload_entry{ void addr; char name; int64_t size; }; ``` and will be implemented in a pre determined (ELF) section `.omp_offloading.entries` with 1-byte alignment, so that when all the objects are linked, the table is in that section with no padding in between entries (will be like a C array). The code generation ensures that all `__tgt_offload_entry` entries are emitted in the same order for both host and device so that the runtime can have the corresponding entries in both host and device in same index of the table, and efficiently implement the mapping. The resulting descriptor is registered/unregistered with the runtime library using the calls `__tgt_register_lib` and `__tgt_unregister_lib`. The registration is implemented in a high priority global initializer so that the registration happens always before any initializer (that can potentially include target regions) is run. The driver flag -omptargets= was created to specify a comma separated list of devices the user wants to support so that the new functionality can be exercised. Each device is specified with its triple. About target codegen: The target codegen is pretty much straightforward as it reuses completely the logic of the host version for the same target region. The tricky part is to identify the meaningful target regions in the device side. Unlike other programming models, like CUDA, there are no already outlined functions with attributes that mark what should be emitted or not. So, the information on what to emit is passed in the form of metadata in host bc file. This requires a new option to pass the host bc to the device frontend. Then everything is similar to what happens in CUDA: the global declarations emission is intercepted to check to see if it is an "interesting" declaration. The difference is that instead of checking an attribute, the metadata information in checked. Right now, there is only a form of metadata to pass information about the device entry points (target regions). A class `OffloadEntriesInfoManagerTy` was created to manage all the information and queries related with the metadata. The metadata looks like this: ``` !omp_offload.info = !{!0, !1, !2, !3, !4, !5, !6} !0 = !{i32 0, i32 52, i32 77426347, !"_ZN2S12r1Ei", i32 479, i32 13, i32 4} !1 = !{i32 0, i32 52, i32 77426347, !"_ZL7fstatici", i32 461, i32 11, i32 5} !2 = !{i32 0, i32 52, i32 77426347, !"_Z9ftemplateIiET_i", i32 444, i32 11, i32 6} !3 = !{i32 0, i32 52, i32 77426347, !"_Z3fooi", i32 99, i32 11, i32 0} !4 = !{i32 0, i32 52, i32 77426347, !"_Z3fooi", i32 272, i32 11, i32 3} !5 = !{i32 0, i32 52, i32 77426347, !"_Z3fooi", i32 127, i32 11, i32 1} !6 = !{i32 0, i32 52, i32 77426347, !"_Z3fooi", i32 159, i32 11, i32 2} ``` The fields in each metadata entry are (in sequence): Entry 1) an ID of the type of metadata - right now only zero is used meaning "OpenMP target region". Entry 2) a unique ID of the device where the input source file that contain the target region lives. Entry 3) a unique ID of the file where the input source file that contain the target region lives. Entry 4) a mangled name of the function that encloses the target region. Entries 5) and 6) line and column number where the target region was found. Entry 7) is the order the entry was emitted. Entry 2) and 3) are required to distinguish files that have the same function name. Entry 4) is required to distinguish different instances of the same declaration (usually templated ones) Entries 5) and 6) are required to distinguish the particular target region in body of the function (it is possible that a given target region is not an entry point - if clause can evaluate always to zero - and therefore we need to identify the "interesting" target regions. ) This patch replaces http://reviews.llvm.org/D12306. Reviewers: ABataev, hfinkel, tra, rjmccall, sfantao Subscribers: FBrygidyn, piotr.rak, Hahnfeld, cfe-commits Differential Revision: http://reviews.llvm.org/D12614 llvm-svn: 256842	2016-01-05 16:23:04 +00:00
Easwaran Raman	695890c971	Attach maximum function count to Module when using PGO mode. This sets the maximum entry count among all functions in the program to the module using module flags. This allows the optimizer to use this information. Differential Revision: http://reviews.llvm.org/D15163 llvm-svn: 255918	2015-12-17 19:14:27 +00:00
Evgeniy Stepanov	fd6f92d5cb	Cross-DSO control flow integrity (Clang part). Clang-side cross-DSO CFI. * Adds a command line flag -f[no-]sanitize-cfi-cross-dso. * Links a runtime library when enabled. * Emits __cfi_slowpath calls is bitset test fails. * Emits extra hash-based bitsets for external CFI checks. * Sets a module flag to enable __cfi_check generation during LTO. This mode does not yet support diagnostics. llvm-svn: 255694	2015-12-15 23:00:20 +00:00
David Majnemer	0b17d44faf	[WinEH] Update clang to use operand bundles on call sites This updates clang to use bundle operands to associate an invoke with the funclet which it is contained within. Depends on D15517. Differential Revision: http://reviews.llvm.org/D15518 llvm-svn: 255675	2015-12-15 21:27:59 +00:00
Easwaran Raman	dd4c71ca6e	Revert r254647. Reason: The testcase fails in many architectures. Differential Revision: http://reviews.llvm.org/D15163 llvm-svn: 255416	2015-12-12 01:11:09 +00:00
Easwaran Raman	d547e5e168	Attach maximum function count to Module when using PGO mode This sets the maximum entry count among all functions in the program to the module using module flags. This allows the optimizer to use this information. Differential Revision: http://reviews.llvm.org/D15163 llvm-svn: 255397	2015-12-12 00:31:02 +00:00
Reid Kleckner	953fe0367b	Revert "[x86] Exclusion of incorrect include headers paths for MCU target" This reverts commit r254195. From the description, I suspect that the wrong patch was committed here, and this is causing assertion failures in EmitDeferred() when the global value ends up being a bitcast of a global. llvm-svn: 254823	2015-12-05 01:52:14 +00:00
George Burgess IV	3e3bb95b69	Add the `pass_object_size` attribute to clang. `pass_object_size` is our way of enabling `__builtin_object_size` to produce high quality results without requiring inlining to happen everywhere. A link to the design doc for this attribute is available at the Differential review link below. Differential Revision: http://reviews.llvm.org/D13263 llvm-svn: 254554	2015-12-02 21:58:08 +00:00
Richard Smith	5a99c49d2e	Fix use-after-free when a C++ thread_local variable gets replaced (because its type changes when the initializer is attached). Don't hold onto the GlobalVariable; recompute it from the VarDecl instead. llvm-svn: 254359	2015-12-01 01:10:48 +00:00
Andrey Bokhanko	2a4db90157	[x86] Exclusion of incorrect include headers paths for MCU target Exclusion of /usr/include and /usr/local/include headers paths for MCU target. Differential Revision: http://reviews.llvm.org/D14954 llvm-svn: 254195	2015-11-27 12:18:22 +00:00
Manman Ren	f93fff27f0	[TLS on Darwin] treat all Darwin platforms in the same way. rdar://problem/9001553 llvm-svn: 252820	2015-11-11 23:08:18 +00:00
Eric Christopher	2b90a64e31	Extract out a function onto CodeGenModule for getting the map of features for a particular function, then use it to clean up some code. llvm-svn: 252819	2015-11-11 23:05:08 +00:00
Manman Ren	6815026991	[TLS on Darwin] change how we handle globals with linkonce or weak linkage. This is about how we handle static member of a template. Before this commit, we use internal linkage for the IR thread-local variable, which is inefficient. With this commit, we will start to follow Itanium C++ ABI. rdar://problem/23415206 Reviewed by John McCall. llvm-svn: 252814	2015-11-11 22:42:31 +00:00
Duncan P. N. Exon Smith	9f5260ab13	CodeGen: Remove implicit ilist iterator conversions, NFC Make ilist iterator conversions explicit in clangCodeGen. Eventually I'll remove them everywhere. llvm-svn: 252358	2015-11-06 23:00:41 +00:00
Keno Fischer	0e2d422254	Fix crash in EmitDeclMetadata mode Summary: This fixes a bug that's easily encountered in LLDB (https://llvm.org/bugs/show_bug.cgi?id=22875). The problem here is that we mangle a name during debug info emission, but never actually emit the actual Decl, so we run into problems in EmitDeclMetadata (which assumes such a Decl exists). Fix that by just skipping metadata emissions for mangled names that don't have associated Decls. Reviewers: rjmccall Subscribers: labath, cfe-commits Differential Revision: http://reviews.llvm.org/D13959 llvm-svn: 252229	2015-11-05 23:18:44 +00:00
Tim Northover	756447a67c	Watch and TV OS: wire up basic ABI choices This sets the mostly expected Darwin default ABI options for these two platforms. Active changes from these defaults for watchOS are in a later patch. llvm-svn: 251708	2015-10-30 16:30:36 +00:00
John McCall	b04ecb753a	Unify the ObjC entrypoint caches. llvm-svn: 250918	2015-10-21 18:06:43 +00:00
Benjamin Kramer	c2d2b4259c	[CodeGen] Remove dead code. NFC. llvm-svn: 250418	2015-10-15 15:29:40 +00:00
Akira Hatanaka	aec6b2c20e	[CodeGen] [CodeGen] Attach function attributes to functions created in CGBlocks.cpp. This commit fixes a bug in clang's code-gen where it creates the following functions but doesn't attach function attributes to them: __copy_helper_block_ __destroy_helper_block_ __Block_byref_object_copy_ __Block_byref_object_dispose_ rdar://problem/20828324 Differential Revision: http://reviews.llvm.org/D13525 llvm-svn: 249735	2015-10-08 20:26:34 +00:00
Akira Hatanaka	200500d6d3	[CodeGen] Check if the Decl pointer passed is null, and if so, return early. This is needed in a patch I plan to commit later, in which a null Decl pointer is passed to SetLLVMFunctionAttributesForDefinition. Relevant discussion is in http://reviews.llvm.org/D13525. llvm-svn: 249722	2015-10-08 19:30:57 +00:00
David Majnemer	3f02150d31	[MSVC Compat] Enable ABI impacting non-conforming behavior independently of -fms-compatibility No ABI for C++ currently makes it possible to implement the standard 100% perfectly. We wrongly hid some of our compatible behavior behind -fms-compatibility instead of tying it to the compiler ABI. llvm-svn: 249656	2015-10-08 04:53:31 +00:00
Yaron Keren	ed1fe5d093	Replace double-negated !SourceLocation.isInvalid() with SourceLocation.isValid(). llvm-svn: 249228	2015-10-03 05:15:57 +00:00
Craig Topper	c005cc06cd	Use llvm::makeArrayRef. NFC. llvm-svn: 248678	2015-09-27 03:44:08 +00:00
Akira Hatanaka	510d7c71e2	Remove attributes minsize and optsize, which conflict with optnone. This commit fixes an assert that is triggered when optnone is being added to an IR function that is already marked with minsize and optsize. rdar://problem/22723716 Differential Revision: http://reviews.llvm.org/D13004 llvm-svn: 248191	2015-09-21 18:52:24 +00:00
Piotr Padlewski	b83278687d	Using MD_invariant_group http://reviews.llvm.org/D12927 llvm-svn: 247933	2015-09-17 20:25:46 +00:00
Reid Kleckner	10aa77032d	[WinEH] Pass the catch adjectives to catchpad directly This avoids building a fake LLVM IR global variable just to ferry an i32 down into LLVM codegen. It also puts a nail in the coffin of using MS ABI C++ EH with landingpads, since now we'll assert in the lpad code when flags are present. llvm-svn: 247843	2015-09-16 20:15:55 +00:00
Piotr Padlewski	4b1ac72cd4	Decorating vptr load & stores with !invariant.group Adding !invariant.group to vptr load/stores for devirtualization purposes. For more goto: http://lists.llvm.org/pipermail/cfe-dev/2015-July/044227.html http://reviews.llvm.org/D12026 llvm-svn: 247725	2015-09-15 21:46:55 +00:00
Piotr Padlewski	9d0ecf27a7	Added llvm.module flag for strict vtable pointers It is dangerous to do LTO on code with strict-vtable-pointers, because one module has invariant.group.barriers, and the other one not. In the future I want to just strip all invariant.group metadata from vptrs loads/stores and get rid of invariant.group.barrier calls. http://reviews.llvm.org/D12580 llvm-svn: 247724	2015-09-15 21:46:50 +00:00
Evgeniy Stepanov	6b2a61d3a5	Revert "Always_inline codegen rewrite" and 2 follow-ups. Revert "Update cxx-irgen.cpp test to allow signext in alwaysinline functions." Revert "[CodeGen] Remove wrapper-free always_inline functions from COMDATs" Revert "Always_inline codegen rewrite." Reason for revert: PR24793. llvm-svn: 247620	2015-09-14 21:35:16 +00:00
David Blaikie	2a791d7d21	[opaque pointer type] Fix a few uses of PointerType::getElementType in favor of uses of types already available elsewhere These are a few cleanups I happened to have from trying to go in a different direction recently, so just flushing them out while I have them. llvm-svn: 247593	2015-09-14 18:38:22 +00:00
David Blaikie	aff29d3031	Revert "[opaque pointer type] update for LLVM API change" This was the wrong direction to take anyway (because ultimately the GlobalValue needed the pointee type again and /it/ used PointerType::getElementType eventually anyway)... let's go a different way. This reverts commit r236161. llvm-svn: 247586	2015-09-14 18:02:04 +00:00
David Majnemer	67cb9740f6	[CodeGen] Remove wrapper-free always_inline functions from COMDATs always_inline functions without a wrapper don't need to be in a COMDAT. llvm-svn: 247500	2015-09-12 06:37:42 +00:00
Evgeniy Stepanov	93db40a147	Always_inline codegen rewrite. Current implementation may end up emitting an undefined reference for an "inline __attribute__((always_inline))" function by generating an "available_externally alwaysinline" IR function for it and then failing to inline all the calls. This happens when a call to such function is in dead code. As the inliner is an SCC pass, it does not process dead code. Libc++ relies on the compiler never emitting such undefined reference. With this patch, we emit a pair of 1. internal alwaysinline definition (called F.alwaysinline) 2a. A stub F() { musttail call F.alwaysinline } -- or, depending on the linkage -- 2b. A declaration of F. The frontend ensures that F.inlinefunction is only used for direct calls, and the stub is used for everything else (taking the address of the function, really). Declaration (2b) is emitted in the case when "inline" is meant for inlining only (like __gnu_inline__ and some other cases). This approach, among other nice properties, ensures that alwaysinline functions are always internal, making it impossible for a direct call to such function to produce an undefined symbol reference. This patch is based on ideas by Chandler Carruth and Richard Smith. llvm-svn: 247494	2015-09-12 01:07:37 +00:00
Evgeniy Stepanov	67037ee21e	Revert "Specify target triple in alwaysinline tests." Revert "Always_inline codegen rewrite." Breaks gdb & lldb tests. Breaks on Fedora 22 x86_64. llvm-svn: 247491	2015-09-11 23:48:37 +00:00
Evgeniy Stepanov	072e83500e	Always_inline codegen rewrite. Current implementation may end up emitting an undefined reference for an "inline __attribute__((always_inline))" function by generating an "available_externally alwaysinline" IR function for it and then failing to inline all the calls. This happens when a call to such function is in dead code. As the inliner is an SCC pass, it does not process dead code. Libc++ relies on the compiler never emitting such undefined reference. With this patch, we emit a pair of 1. internal alwaysinline definition (called F.alwaysinline) 2a. A stub F() { musttail call F.alwaysinline } -- or, depending on the linkage -- 2b. A declaration of F. The frontend ensures that F.inlinefunction is only used for direct calls, and the stub is used for everything else (taking the address of the function, really). Declaration (2b) is emitted in the case when "inline" is meant for inlining only (like __gnu_inline__ and some other cases). This approach, among other nice properties, ensures that alwaysinline functions are always internal, making it impossible for a direct call to such function to produce an undefined symbol reference. This patch is based on ideas by Chandler Carruth and Richard Smith. llvm-svn: 247465	2015-09-11 20:29:07 +00:00
Artem Belevich	da1851ca58	[CUDA] Allow trivial constructors as initializer for __shared__ variables. Differential Revision: http://reviews.llvm.org/D12739 llvm-svn: 247307	2015-09-10 17:26:58 +00:00
Peter Collingbourne	2c7f7e31c4	CFI: Introduce -fsanitize=cfi-icall flag. This flag causes the compiler to emit bit set entries for functions as well as runtime bitset checks at indirect call sites. Depends on the new function bitset mechanism. Differential Revision: http://reviews.llvm.org/D11857 llvm-svn: 247238	2015-09-10 02:17:40 +00:00
Peter Collingbourne	25a80bf9b6	CodeGen: Introduce CodeGenModule::CreateMetadataIdentifierForType. This function can be used to create a metadata identifier for a specific type. No functionality change, but this will be used by D11857 and D12026. Differential Revision: http://reviews.llvm.org/D12038 llvm-svn: 247098	2015-09-08 23:01:30 +00:00
John McCall	7f416cc426	Compute and preserve alignment more faithfully in IR-generation. Introduce an Address type to bundle a pointer value with an alignment. Introduce APIs on CGBuilderTy to work with Address values. Change core APIs on CGF/CGM to traffic in Address where appropriate. Require alignments to be non-zero. Update a ton of code to compute and propagate alignment information. As part of this, I've promoted CGBuiltin's EmitPointerWithAlignment helper function to CGF and made use of it in a number of places in the expression emitter. The end result is that we should now be significantly more correct when performing operations on objects that are locally known to be under-aligned. Since alignment is not reliably tracked in the type system, there are inherent limits to this, but at least we are no longer confused by standard operations like derived-to-base conversions and array-to-pointer decay. I've also fixed a large number of bugs where we were applying the complete-object alignment to a pointer instead of the non-virtual alignment, although most of these were hidden by the very conservative approach we took with member alignment. Also, because IRGen now reliably asserts on zero alignments, we should no longer be subject to an absurd but frustrating recurring bug where an incomplete type would report a zero alignment and then we'd naively do a alignmentAtOffset on it and emit code using an alignment equal to the largest power-of-two factor of the offset. We should also now be emitting much more aggressive alignment attributes in the presence of over-alignment. In particular, field access now uses alignmentAtOffset instead of min. Several times in this patch, I had to change the existing code-generation pattern in order to more effectively use the Address APIs. For the most part, this seems to be a strict improvement, like doing pointer arithmetic with GEPs instead of ptrtoint. That said, I've tried very hard to not change semantics, but it is likely that I've failed in a few places, for which I apologize. ABIArgInfo now always carries the assumed alignment of indirect and indirect byval arguments. In order to cut down on what was already a dauntingly large patch, I changed the code to never set align attributes in the IR on non-byval indirect arguments. That is, we still generate code which assumes that indirect arguments have the given alignment, but we don't express this information to the backend except where it's semantically required (i.e. on byvals). This is likely a minor regression for those targets that did provide this information, but it'll be trivial to add it back in a later patch. I partially punted on applying this work to CGBuiltin. Please do not add more uses of the CreateDefaultAligned{Load,Store} APIs; they will be going away eventually. llvm-svn: 246985	2015-09-08 08:05:57 +00:00
Hal Finkel	0e2b975eb6	Don't crash on a self-alias declaration We were crashing in CodeGen given input like this: int self_alias(void) __attribute__((weak, alias("self_alias"))); such a self-alias is invalid, but instead of diagnosing the situation, we'd proceed to produce IR for both the function declaration and the alias. Because we already had a function named 'self_alias', the alias could not be named the same thing, and so LLVM would pick a different name ('self_alias1' for example) for that value. When we later called CodeGenModule::checkAliases, we'd look up the IR value corresponding to the alias name, find the function declaration instead, and then assert in a cast to llvm::GlobalAlias. The easiest way to prevent this is simply to avoid creating the wrongly-named alias value in the first place and issue the diagnostic there (instead of in checkAliases). We detect a related cycle case in CodeGenModule::EmitAliasDefinition already, so this just adds a second such check. Even though the other test cases for this 'alias definition is part of a cycle' diagnostic are in test/Sema/attr-alias-elf.c, I've added a separate regression test for this case. This is because I can't add this check to test/Sema/attr-alias-elf.c without disturbing the other test cases in that file. In order to avoid construction of the bad IR values, this diagnostic is emitted from within CodeGenModule::EmitAliasDefinition (and the relevant declaration is not added to the Aliases vector). The other cycle checks are done within the CodeGenModule::checkAliases function based on the Aliases vector, called from CodeGenModule::Release. However, if there have been errors earlier, HandleTranslationUnit does not call Release, and so checkAliases is never called, and so none of the other diagnostics would be produced. Fixes PR23509. llvm-svn: 246882	2015-09-04 21:49:21 +00:00
Dan Gohman	c285307e14	[WebAssembly] Initial WebAssembly support in clang This implements basic support for compiling (though not yet assembling or linking) for a WebAssembly target. Note that ABI details are not yet finalized, and may change. Differential Revision: http://reviews.llvm.org/D12002 llvm-svn: 246814	2015-09-03 22:51:53 +00:00
Andrey Bokhanko	cab5858e1b	PR17829: Proper diagnostic of mangled names conflicts Proper diagnostic and resolution of mangled names conflicts between C++ methods and C functions. This patch implements support for functions/methods only; support for variables is coming separately. Differential Revision: http://reviews.llvm.org/D11297 llvm-svn: 246438	2015-08-31 13:20:44 +00:00
Hans Wennborg	6eaa8323a8	Allow TLS vars in dllimport/export functions; only inline dllimport functions when safe (PR24593) This patch does two things: 1) Don't error about dllimport/export on thread-local static local variables. We put those attributes on static locals in dllimport/export functions implicitly in case the function gets inlined. Now, for TLS variables this is a problem because we can't import such variables, but it's a benign problem becase: 2) Make sure we never inline a dllimport function TLS static locals. In fact, never inline a dllimport function that references a non-imported function or variable (because these are not defined in the importing library). This seems to match MSVC's behaviour. Differential Revision: http://reviews.llvm.org/D12422 llvm-svn: 246338	2015-08-28 21:47:01 +00:00
Jingyue Wu	284ebe237f	[CUDA] Change initializer for CUDA device code based on CUDA documentation. Summary: According to CUDA documentation, global variables declared with __device__, __constant__ can be initialized from host code, so mark them as externally initialized. Because __shared__ variables cannot have an initialization as part of their declaration and since the value maybe kept across different kernel invocation, the value of __shared__ is effectively undefined instead of zero initialized. Wrongly using zero initializer may cause illegitimate optimization, e.g. removing unused __constant__ variable because it's not updated in the device code and the value is initialized with zero. Test Plan: test/CodeGenCUDA/address-spaces.cu Patch by Xuetian Weng Reviewers: jholewinski, eliben, tra, jingyue Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D12241 llvm-svn: 245786	2015-08-22 05:49:28 +00:00
Richard Smith	c52efa7d40	[modules] Don't eagerly deserialize so many ImportDecls. CodeGen basically ignores ImportDecls imported from modules, so only eagerly deserialize the ones from a PCH / preamble. llvm-svn: 245406	2015-08-19 02:30:28 +00:00
David Majnemer	2dcef9e0a4	Avoid iteration invalidation issues around MaterializedTemporaryExpr We risk iterator invalidation issues if we use a DenseMap to hold the backing storage for an APValue. Instead, BumpPtrAllocate them and use APValue * as our DenseMap value. Also, don't assume that MaterializedGlobalTemporaryMap won't regrow between when we initially perform a lookup and later on when we actually try to insert into it. This fixes PR24289. Differential Revision: http://reviews.llvm.org/D11629 llvm-svn: 244989	2015-08-13 23:50:15 +00:00
Yaron Keren	556b21aa10	Remove and forbid raw_svector_ostream::flush() calls. After r244870 flush() will only compare two null pointers and return, doing nothing but wasting run time. The call is not required any more as the stream and its SmallString are always in sync. Thanks to David Blaikie for reviewing. llvm-svn: 244928	2015-08-13 18:12:56 +00:00
Eric Christopher	02d5d86b4e	Rename the non-coding style conformant functions in namespace Builtins to match the rest of their brethren and reformat the bits that need it. llvm-svn: 244186	2015-08-06 01:01:12 +00:00
Reid Kleckner	124955aade	Add -gcodeview and -gdwarf to control which type Clang emits Summary: By default, 'clang' emits dwarf and 'clang-cl' emits codeview. You can force emission of one or both by passing -gcodeview and -gdwarf to either driver. Reviewers: dblaikie, hans Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D11742 llvm-svn: 244097	2015-08-05 18:51:13 +00:00
Justin Bogner	203f09223b	InstrProf: Fix a misuse of the FunctionDecl API when generating coverage This was calling FD->hasBody(), meaning "Does the function that this decl refers to have a body?", rather than FD->doesThisDeclarationHaveABody(), meaning "Is this decl a non-deleted definition?". We might want to consider renaming these APIs :/ llvm-svn: 243360	2015-07-28 00:41:51 +00:00
Mehdi Amini	ca3cf9e6ff	LLVM API Change: the Module always owns the DataLayout From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 243115	2015-07-24 16:04:29 +00:00
Samuel Antao	f8b5012dfb	[OpenMP] Add TLS-based implementation for threadprivate directive. llvm-svn: 242080	2015-07-13 22:54:53 +00:00
Adrian Prantl	c4bb47e301	Debug Info: Emit debug info for @import declarations. This allows a module-aware debugger such as LLDB to import the currently visible modules before dropping into the expression evaluator. rdar://problem/20965932 llvm-svn: 241084	2015-06-30 17:39:51 +00:00
David Majnemer	64b0bdf88a	[CodeGen] Tweak isTriviallyRecursive further isTriviallyRecursive is a hack used to bridge a gap between the expectations that source code assumes and the semantics that LLVM IR can provide. Specifically, asm labels on functions are treated as an explicit name for a GlobalObject in Clang but treated like an output-processing step in GCC. Tweak this hack a little further to emit calls to library functions instead of emitting an incorrect definition. The definition in question would have available_externally linkage (this is OK) but result in a call to itself which will either result in an infinite loop or stack overflow. This fixes PR23964. llvm-svn: 241043	2015-06-30 04:41:18 +00:00
Adrian Prantl	e74f525bc9	Pass HeaderSearchOptions and PreprocessorOptions into CodeGenModule. In order to produce debug info for clang modules CGDebugInfo it needs access to macros passed on the command line and the isysroot. llvm-svn: 241035	2015-06-30 02:26:03 +00:00
David Majnemer	41011f6706	[CodeGen] Restrict isTriviallyRecursive to predefined lib functions forwarding to lib functions isTriviallyRecursive is only supposed to guard functions part of the implementation. This fixes PR23953. llvm-svn: 240735	2015-06-25 23:50:40 +00:00
Diego Novillo	5aecacb275	Display profile file name when emitting a file not found diagnostic. When a profile file cannot be opened, we used to display just the error message but not the name of the profile the compiler was trying to open. This will become useful in the next set of patches that introduce GCC-compatible flags to specify profiles. llvm-svn: 240715	2015-06-25 22:56:00 +00:00
Alexander Kornienko	ab9db51042	Revert r240270 ("Fixed/added namespace ending comments using clang-tidy"). llvm-svn: 240353	2015-06-22 23:07:51 +00:00
Alexander Kornienko	3d9d929e42	Fixed/added namespace ending comments using clang-tidy. NFC The patch is generated using this command: $ tools/extra/clang-tidy/tool/run-clang-tidy.py -fix \ -checks=-,llvm-namespace-comment -header-filter='llvm/.\|clang/.*' \ work/llvm/tools/clang To reduce churn, not touching namespaces spanning less than 10 lines. llvm-svn: 240270	2015-06-22 09:47:44 +00:00
Alexander Potapenko	b9b73ef906	[ASan] Initial support for Kernel AddressSanitizer This patch adds initial support for the -fsanitize=kernel-address flag to Clang. Right now it's quite restricted: only out-of-line instrumentation is supported, globals are not instrumented, some GCC kasan flags are not supported. Using this patch I am able to build and boot the KASan tree with LLVMLinux patches from github.com/ramosian-glider/kasan/tree/kasan_llvmlinux. To disable KASan instrumentation for a certain function attribute((no_sanitize("kernel-address"))) can be used. llvm-svn: 240131	2015-06-19 12:19:07 +00:00
Peter Collingbourne	86d34a72b3	CodeGen: Factor out some of the bitset entry creation code. NFC. llvm-svn: 239927	2015-06-17 19:08:05 +00:00
Yaron Keren	b54db52a7b	C++11 rangify several loops. llvm-svn: 239528	2015-06-11 12:33:25 +00:00
Eric Christopher	162c91ccc4	Rename the single non-style conformant function in TargetCodeGenInfo and update all callers. llvm-svn: 239193	2015-06-05 22:03:00 +00:00
Benjamin Kramer	3204b152b5	Replace push_back(Constructor(foo)) with emplace_back(foo) for non-trivial types If the type isn't trivially moveable emplace can skip a potentially expensive move. It also saves a couple of characters. Call sites were found with the ASTMatcher + some semi-automated cleanup. memberCallExpr( argumentCountIs(1), callee(methodDecl(hasName("push_back"))), on(hasType(recordDecl(has(namedDecl(hasName("emplace_back")))))), hasArgument(0, bindTemporaryExpr( hasType(recordDecl(hasNonTrivialDestructor())), has(constructExpr()))), unless(isInTemplateInstantiation())) No functional change intended. llvm-svn: 238601	2015-05-29 19:42:19 +00:00
Hans Wennborg	bb4f962ad6	Get the dll storage class right for structors of classes exported/imported via explicit instantiation (PR23667) This is a follow-up to r238266. It turned out structors are codegened through a different path, and didn't get the storage class set in EmitGlobalFunctionDefinition. llvm-svn: 238443	2015-05-28 17:44:56 +00:00
Pete Cooper	2f1637ad85	Use Intrinsic::ID instead of unsigned. NFC. This is after LLVM r237810 which made Function::getIntrinsicID() return an Intrinsic::ID. llvm-svn: 237811	2015-05-20 17:17:45 +00:00
Richard Smith	4241314164	[modules] Add local submodule visibility support for declarations. With this change, enabling -fmodules-local-submodule-visibility results in name visibility rules being applied to submodules of the current module in addition to imported modules (that is, names no longer "leak" between submodules of the same top-level module). This also makes it much safer to textually include a non-modular library into a module: each submodule that textually includes that library will get its own "copy" of that library, and so the library becomes visible no matter which including submodule you import. llvm-svn: 237473	2015-05-15 20:05:43 +00:00
Peter Collingbourne	915df9968b	Implement no_sanitize attribute. Differential Revision: http://reviews.llvm.org/D9631 llvm-svn: 237463	2015-05-15 18:33:32 +00:00
NAKAMURA Takumi	c7da6da58e	Revert r236879, "Do not emit thunks with available_externally linkage in comdats" It broke pecoff, at least i686-cygwin. llvm-svn: 236937	2015-05-09 21:10:07 +00:00
Derek Schuff	2312bd3811	Do not emit thunks with available_externally linkage in comdats Functions with available_externally linkage will not be emitted to object files (they will just be undefined symbols), so it does not make sense to put them in comdats. Creates a second overload of maybeSetTrivialComdat that uses the GlobalObject instead of the Decl, and uses that in several places that had the faulty logic. Differential Revision: http://reviews.llvm.org/D9580 llvm-svn: 236879	2015-05-08 16:47:21 +00:00
Artem Belevich	52cc487ba8	[cuda] Include GPU binary into host object file and generate init/deinit code. - added -fcuda-include-gpubinary option to incorporate results of device-side compilation into host-side one. - generate code to register GPU binaries and associated kernels with CUDA runtime and clean-up on exit. - added test case for init/deinit code generation. Differential Revision: http://reviews.llvm.org/D9507 llvm-svn: 236765	2015-05-07 19:34:16 +00:00
David Blaikie	d3c127e232	[opaque pointer type] Correctly pass the pointee type when creating a GEP constant expression llvm-svn: 236751	2015-05-07 17:27:56 +00:00
David Blaikie	881b23402e	[opaque pointer type] update for LLVM API change llvm-svn: 236161	2015-04-29 21:22:47 +00:00
Bradley Smith	dfddebcfb9	Revert code changes made under r235976. This issue was fixed elsewhere in r235396 in a more general way, hence these changes no longer do anything. Keep the testcase however, to ensure that we don't regress this for ARM. llvm-svn: 236104	2015-04-29 14:32:06 +00:00
Bradley Smith	ba945626b0	[ARM/AArch64] Enforce alignment for bitfielded structs When creating a global variable with a type of a struct with bitfields, we must forcibly set the alignment of the global from the RecordDecl. We must do this so that the proper bitfield alignment makes its way down to LLVM, since clang will mangle the bitfields into one large type. llvm-svn: 235976	2015-04-28 11:24:54 +00:00
Artem Belevich	0488d1e4ba	[cuda] treat file scope __asm as __host__ and ignore it during device-side compilation. Currently clang emits file-scope asm during both host and device compilation modes which is usually a wrong thing to do. There's no way to attach any attribute to an __asm statement, so there's no way to differentiate between host-side and device-side file-scope asm. This patch makes clang to match nvcc behavior and emit file-scope-asm only during host-side compilation. Differential Revision: http://reviews.llvm.org/D9270 llvm-svn: 235905	2015-04-27 18:52:00 +00:00
Ulrich Weigand	4608438386	Provide alignment info on LLVM external symbols Code in CodeGenModule::GetOrCreateLLVMGlobal that sets up GlobalValue object for LLVM external symbols has this comment: // FIXME: This code is overly simple and should be merged with other global // handling. One part does seems to be "overly simple" currently is that this code never sets any alignment info on the GlobalValue, so that the emitted IR does not have any align attribute on external globals. This can lead to unnecessarily inefficient code generation. This patch adds a GV->setAlignment call to set alignment info. llvm-svn: 235396	2015-04-21 17:27:59 +00:00
Nico Weber	608e768d8d	Don't crash when a selectany symbol would get common linkage Things can't both be in comdats and have common linkage, so never give things in comdats common linkage. Common linkage is only used in .c files, and the only thing that can trigger a comdat in c is selectany from what I can tell. Fixes PR23243. Also address an over-the-shoulder review comment from rnk by moving the hasAttr<SelectAnyAttr>() in Decl.cpp around a bit. It only makes a minor difference for selectany on global variables, so it goes well with the rest of this patch. http://reviews.llvm.org/D9042 llvm-svn: 235053	2015-04-15 23:04:24 +00:00
Renato Golin	c640ff63e2	Revert "[CodeGen] Fix crash with duplicated mangled name." This reverts commit r234767, as it was breaking all ARM buildbots for two days and the assert is not in the code, making it difficult to spot the error, which would keep the bots red for a few more days. New errors were silently introduced because of this bug, and we don't want this to escalate. llvm-svn: 234983	2015-04-15 08:44:40 +00:00
Richard Trieu	b58413323d	Change range-based for-loops to be -Wrange-loop-analysis clean. No functionality change. llvm-svn: 234964	2015-04-15 01:21:42 +00:00
Argyrios Kyrtzidis	6f1fadf989	[CodeGen] Fix crash with duplicated mangled name. Patch by Yunzhong Gao! llvm-svn: 234767	2015-04-13 17:40:46 +00:00
Nico Weber	176efac95b	Don't crash when passing a non-existent file to -fprofile-instr-use=. Fixes a regression from r229434. llvm-svn: 234141	2015-04-06 04:16:48 +00:00
David Blaikie	2e80428dc5	clang-format my last commit (sorry, keep forgetting that) llvm-svn: 234129	2015-04-05 22:47:07 +00:00
David Blaikie	1ed728c499	[opaque pointer type] More GEP API migrations Looks like the VTable code in particular will need some work to pass around the pointee type explicitly. llvm-svn: 234128	2015-04-05 22:45:47 +00:00
David Blaikie	e3b172afc3	[opaque pointer type] Update for GEP API changes in LLVM Now the GEP constant utility functions require the type to be explicitly passed (since eventually the pointer type will be opaque and not convey the required type information). For now callers can still pass nullptr (though none were needed here in Clang, which is nice) if convenienc/necessary, but eventually that will be disallowed as well. llvm-svn: 233937	2015-04-02 18:55:21 +00:00
Duncan P. N. Exon Smith	2809cc7493	DebugInfo: Use new LLVM API for DebugLoc Use the new API for `DebugLoc` added in r233573 before the old one disappears. llvm-svn: 233589	2015-03-30 20:01:41 +00:00
David Majnemer	37b417f1ed	[MS ABI] Rework .xdata HandlerType emission Utilizing IMAGEREL relocations for synthetic IR constructs isn't valuable, just clutter. While we are here, simplify HandlerType names by making the numeric value for the 'adjective' part of the mangled name instead of appending '.const', etc. The old scheme made for very long global names and leads to wordy things like '.std_bad_alloc' llvm-svn: 233503	2015-03-29 21:55:10 +00:00
Reid Kleckner	7ffc3fbb2f	C++14: Disable sized deallocation by default due to ABI breakage There are no widely deployed standard libraries providing sized deallocation functions, so we have to punt and ask the user if they want us to use sized deallocation. In the future, when such libraries are deployed, we can teach the driver to detect them and enable this feature. N3536 claimed that a weak thunk from sized to unsized deallocation could be emitted to avoid breaking backwards compatibility with standard libraries not providing sized deallocation. However, this approach and other variations don't work in practice. With the weak function approach, the thunk has to have default visibility in order to ensure that it is overridden by other DSOs providing sized deallocation. Weak, default visibility symbols are particularly expensive on MachO, so John McCall was considering disabling this feature by default on Darwin. It also changes behavior ELF linking behavior, causing certain otherwise unreferenced object files from an archive to be pulled into the link. Our second approach was to use an extern_weak function declaration and do an inline conditional branch at the deletion call site. This doesn't work because extern_weak only works on MachO if you have some archive providing the default value of the extern_weak symbol. Arranging to provide such an archive has the same challenges as providing the symbol in the standard library. Not to mention that extern_weak doesn't really work on COFF. Reviewers: rsmith, rjmccall Differential Revision: http://reviews.llvm.org/D8467 llvm-svn: 232788	2015-03-20 00:31:07 +00:00
Artem Belevich	f3d3db65de	Remove .CUDAIsDevice flags from CodeGenOpts as it's already available in LangOpts. Differential Revision: http://reviews.llvm.org/D8385 llvm-svn: 232749	2015-03-19 18:58:18 +00:00
David Majnemer	a03849b2e7	MS ABI: Don't try to emit VF/VB-Tables for extern class templates There will be an explicit template instantiation in another translation unit which will provide the definition of the VF/VB-Tables. This fixes PR22932. llvm-svn: 232680	2015-03-18 22:04:43 +00:00
Alexey Bataev	91797559ac	[OPENMP] Fix crash on code emitting if errors are found. Codegen for threadprivate variables (and in some other cases) may cause crash of the compiler if some diagnostic is produced later. This happens because some of the autogenerated globals are not removed from InternalVars StringMap when llvm::Module is reset. Differential Revision: http://reviews.llvm.org/D8360 llvm-svn: 232610	2015-03-18 04:13:55 +00:00
David Majnemer	5f0dd6162c	MS ABI: Emit HandlerMap entries for C++ catch The HandlerMap describes, to the runtime, what sort of catches surround the try. In principle, this structure has to be emitted by the backend because only it knows the layout of the stack (the runtime needs to know where on the stack the destination of a copy lives, etc.) but there is some C++ specific information that the backend can't reason about. Stick this information in special LLVM globals with the relevant "const", "volatile", "reference" info mangled into the name. llvm-svn: 232538	2015-03-17 20:35:05 +00:00
David Majnemer	443250f08d	WIP llvm-svn: 232537	2015-03-17 20:35:00 +00:00
David Majnemer	ad803d4b76	MS ABI: Don't use qualified pointee types for 'catch' EH TypeDescriptors Qualifiers are located next to the TypeDescriptor in order to properly ensure that a pointer type can only be caught by a more qualified catch handler. This means that a catch handler of type 'const int ' requires an RTTI object for 'int '. We got this correct for 'throw' but not for 'catch'. N.B. We don't currently have the means to store the qualifiers because LLVM's EH strategy is tailored to the Itanium scheme. The Itanium ABI stores qualifiers inside the type descriptor in such a way that the manner of qualification is stored in addition to the pointee type's descriptor. Perhaps the best way of modeling this for the MS ABI is using an aggregate type to bundle the qualifiers with the descriptor? This is tricky because we want to make it clear to the optimization passes which catch handlers invalidate other handlers. My current thoughts on a design for this is along the lines of: { { TypeDescriptor* TD, i32 QualifierFlags }, i32 MiscFlags } The idea is that the inner most aggregate is all that is needed to communicate that one catch handler might supercede another. The 'MiscFlags' field would be used to hold the bitpattern for the notion that the 'catch' handler does not need to invoke a copy-constructor because we are catching by reference. llvm-svn: 232318	2015-03-15 07:10:01 +00:00
Joerg Sonnenberger	dfd511e9bc	Simplify. llvm-svn: 232130	2015-03-13 00:54:30 +00:00
Alexey Bataev	3eff5f46d7	[OPENMP] Rename methods of OpenMPRuntime class. NFC. llvm-svn: 230470	2015-02-25 08:32:46 +00:00
David Majnemer	f63bcaa2c5	CodeGen: Weak reference temporaries belong in a COMDAT llvm-svn: 229902	2015-02-19 19:35:18 +00:00
David Majnemer	c9a9c7a673	CodeGen: static constexpr data members should have a linkonce_odr init Classes can be defined in multiple translation units. This means that the static constexpr data members should have identical initializers in all translation units. Implement this by giving the reference temporary linkonce_odr linkage. llvm-svn: 229900	2015-02-19 19:25:17 +00:00
Zoran Jovanovic	26a1216a74	Change representation of member function pointers for MIPS targets Differential Revision: http://reviews.llvm.org/D7148 llvm-svn: 229680	2015-02-18 15:21:35 +00:00
Larisse Voufo	e990a3f60c	Rename flags and options to match current naming: from -fdef-sized-delete to -fdefine-sized-deallocation, and from DefaultSizedDelete to DefineSizedDeallocation. llvm-svn: 229597	2015-02-18 01:04:10 +00:00
Justin Bogner	d29a4a3810	InstrProf: Update for LLVM API change Update for the API change in r229433 llvm-svn: 229434	2015-02-16 21:29:05 +00:00
Larisse Voufo	fbe56a0529	Fix a typo in r229291 causing buildbot failure. llvm-svn: 229295	2015-02-15 08:47:33 +00:00
Larisse Voufo	0c691d029a	More on fixing sized deallocation implementation logic: Fix PR21754. llvm-svn: 229291	2015-02-15 08:16:37 +00:00
Larisse Voufo	5526f4f094	Revise the implementation logic of sized deallocation: Do not automatically generate weak definitions of the sized operator delete (in terms of unsized operator delete). Instead, provide the funcitonality via a new compiler flag, -fdef-sized-delete. The current implementation causes link-time ODR violations when the delete symbols are exported into the dynamic table. llvm-svn: 229241	2015-02-14 05:42:57 +00:00
David Majnemer	e1a0b2e2af	MS ABI: Records with fields with required aligmnet shouldn't be common llvm-svn: 227954	2015-02-03 08:49:32 +00:00
Justin Holewinski	f37f3d35eb	When generating llvm.used, we may need an addrspacecast instead of a bitcast. Summary: This is especially important for targets that use multiple address spaces, and commonly place global variables in address spaces other than zero. Fixes PR22383 Test Plan: New test case added: llvm-used.cu Reviewers: jingyue Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7345 llvm-svn: 227861	2015-02-02 21:05:49 +00:00
Rafael Espindola	e5df59ff78	Emit DeferredDeclsToEmit in a DFS order. Currently we emit DeferredDeclsToEmit in reverse order. This patch changes that. The advantages of the change are that * The output order is a bit closer to the source order. The change to test/CodeGenCXX/pod-member-memcpys.cpp is a good example. * If we decide to deffer more, it will not cause as large changes in the estcases as it would without this patch. llvm-svn: 226751	2015-01-22 00:24:57 +00:00
Rafael Espindola	6d1178ca40	clang-format function. NFC. llvm-svn: 226662	2015-01-21 14:55:00 +00:00
David Majnemer	740d59ec49	CodeGen: Compiler generated __declspec(uuid) objects should be COMDAT'd llvm-svn: 226628	2015-01-21 01:04:28 +00:00
Rafael Espindola	d9b26d563a	Add comdat to string literal variables on COFF. llvm-svn: 226317	2015-01-16 20:32:35 +00:00
Rafael Espindola	cb92c19fc4	Use a trivial comdat for C++ tables. This produces comdats for vtables, typeinfo, typeinfo names, and vtts. When combined with llvm not producing implicit comdats, not doing this would cause code bloat on ELF and link errors on COFF. llvm-svn: 226227	2015-01-15 23:18:01 +00:00
Rafael Espindola	dbee8a7a7a	Use a trivial comdat for inline ctor/dtor when not using C5/D5. When combined with llvm not producing implicit comdats, not doing this would cause code bloat on ELF and link errors on COFF. llvm-svn: 226211	2015-01-15 21:36:08 +00:00
Chandler Carruth	0d9593ddec	[cleanup] Re-sort all #include lines with llvm/utils/sort_includes.py Sorry for the noise, I managed to miss a bunch of recent regressions of include orderings here. This should actually sort all the includes for Clang. Again, no functionality changed, this is just a mechanical cleanup that I try to run periodically to keep the #include lines as regular as possible across the project. llvm-svn: 225979	2015-01-14 11:29:14 +00:00
Rafael Espindola	0d4fb98504	[patch][pr19848] Produce explicit comdats in clang. The llvm IR until recently had no support for comdats. This was a problem when targeting C++ on ELF/COFF as just using weak linkage would cause quite a bit of dead bits to remain on the executable (unless -ffunction-sections, -fdata-sections and --gc-sections were used). To fix the problem, llvm's codegen will just assume that any weak or linkonce that is not in an explicit comdat should be output in one with the same name as the global. This unfortunately breaks cases like pr19848 where a weak symbol is not xpected to be part of any comdat. Now that we have explicit comdats in the IR, we can finally get both cases right. This first patch just makes clang give explicit comdats to GlobalValues where t is allowed to. A followup patch to llvm will then stop implicitly producing comdats. llvm-svn: 225705	2015-01-12 22:13:53 +00:00
Hans Wennborg	43a0f99b10	Don't emit implicit template instantiations eagerly (PR21718) Their linkage can change if they are later explicitly instantiated. We would previously emit such functions eagerly (as opposed to lazily on first use) if they have a 'dllexport' or 'used' attribute, and fail an assert when hitting the explicit instantiation. This is achieved by replacing the old CodeGenModule::MayDeferGeneration() method with two new ones: MustBeEmitted() and MayBeEmittedEagerly(). Differential Revision: http://reviews.llvm.org/D6674 llvm-svn: 225570	2015-01-10 01:19:48 +00:00
David Blaikie	0317bc9e55	PR21909: Don't try (and crash) to generate debug info for explicit instantiations of explicit specializations. llvm-svn: 224394	2014-12-16 23:49:18 +00:00
Paul Robinson	30e41fb4da	Warn when attribute 'optnone' conflicts with attributes on a different declaration of the same function. llvm-svn: 224256	2014-12-15 18:57:28 +00:00
Paul Robinson	0855695159	Instead of having -Os/-Oz add OptimizeForSize/MinSize first, and later having OptimizeNone remove them again, just don't add them in the first place if the function already has OptimizeNone. Note that MinSize can still appear due to attributes on different declarations; a future patch will address that. llvm-svn: 224047	2014-12-11 20:14:04 +00:00
Duncan P. N. Exon Smith	fb49491477	IR: Update clang for Metadata/Value split in r223802 Match LLVM API changes from r223802. llvm-svn: 223803	2014-12-09 18:39:32 +00:00
Justin Bogner	970ac60573	InstrProf: Use LLVM's -instrprof pass for profiling The logic for lowering profiling counters has been moved to an LLVM pass. Emit the intrinsics rather than duplicating the whole pass in clang. llvm-svn: 223683	2014-12-08 19:04:51 +00:00
Anton Korobeynikov	d90dd7977e	Fix invalid calling convention used for libcalls on ARM. ARM ABI specifies that all the libcalls use soft FP ABI (even hard FP binaries). These days clang emits _mulsc3 / _muldc3 calls with default (C) calling convention which would be translated into AAPCS_VFP LLVM calling and thus the result of complex multiplication will be bogus. Introduce a way for a target to specify explicitly calling convention for libcalls. Right now this is temporary correctness fix. Ultimately, we'll end with intrinsic for complex multiplication and all calling convention decisions for libcalls will be put into backend. llvm-svn: 223123	2014-12-02 16:04:58 +00:00
Reid Kleckner	ee7cf84c8f	Use nullptr to silence -Wsentinel when self-hosting on Windows Richard rejected my Sema change to interpret an integer literal zero in a varargs context as a null pointer, so -Wsentinel sees an integer literal zero and fires off a warning. Only CodeGen currently knows that it promotes integer literal zeroes in this context to pointer size on Windows. I didn't want to teach -Wsentinel about that compatibility hack. Therefore, I'm migrating to C++11 nullptr. llvm-svn: 223079	2014-12-01 22:02:27 +00:00
Richard Smith	a634ff2194	Add missing 'break's, found by inspection. No functionality change; the fallthrough happened to do the right thing in both cases. llvm-svn: 223064	2014-12-01 18:59:10 +00:00
Paul Robinson	4ece682586	Correctly remove OptimizeForSize from functions marked OptimizeNone. This allows using __attribute__((optnone)) and the -Os/-Oz options. Fixes PR21604. llvm-svn: 222683	2014-11-24 20:51:42 +00:00
David Blaikie	82e95a3c79	Update for LLVM API change to make Small(Ptr)Set::insert return pair<iterator, bool> as per the C++ standard's associative container concept. llvm-svn: 222335	2014-11-19 07:49:47 +00:00
David Blaikie	3c8c46efd7	clang-format a recent commit I made llvm-svn: 222317	2014-11-19 05:48:40 +00:00
David Blaikie	13156b689e	Standardize on StringMap::insert, removing uses of StringMap::GetOrCreateValue. llvm-svn: 222306	2014-11-19 03:06:06 +00:00
Justin Hibbits	90ca05e5e5	Add PIC-level support to Clang. Summary: This distinguishes between -fpic and -fPIC now, with the additions in LLVM for PIC level support. Test Plan: No regressions Reviewers: echristo, rafael Reviewed By: rafael Subscribers: rnk, emaste, llvm-commits Differential Revision: http://reviews.llvm.org/D5400 llvm-svn: 222227	2014-11-18 06:17:20 +00:00
Fariborz Jahanian	800821a3b2	[Objective-C++ IRGen] do not generate .cxx_construct for class that contains trivially-constructible struct ivar. rdar://18950072 llvm-svn: 221823	2014-11-12 22:37:43 +00:00
Duncan P. N. Exon Smith	2b1f278e77	Revert "IR: MDNode => Value: Update for LLVM API change in r221375" This reverts commit r221376. The API change was reverted in r221711. llvm-svn: 221712	2014-11-11 21:31:03 +00:00
Alexey Bataev	9772000a22	[OPENMP] Codegen for threadprivate variables For all threadprivate variables which have constructor/destructor emit call to void __kmpc_threadprivate_register(ident_t * <Current Location>, void <Original Global Addr>, kmpc_ctor <Constructor>, kmpc_cctor NULL, kmpc_dtor <Destructor>); In expressions all references to such variables are replaced by calls to void __kmpc_threadprivate_cached(ident_t <Current Location>, kmp_int32 <Current Thread Id>, void <Original Global Addr>, size_t <Size of Data>, void ***<Pointer to autogenerated cache – array of private copies of threadprivate variable>); Test test/OpenMP/threadprivate_codegen.cpp checks that codegen is correct. Also it checks that codegen is correct after serialization/deserialization and one of passes verifies debug info. Differential Revision: http://reviews.llvm.org/D4002 llvm-svn: 221663	2014-11-11 04:05:39 +00:00
Alexey Samsonov	edf99a92c0	Introduce a SanitizerKind enum to LangOptions. Use the bitmask to store the set of enabled sanitizers instead of a bitfield. On the negative side, it makes syntax for querying the set of enabled sanitizers a bit more clunky. On the positive side, we will be able to use SanitizerKind to eventually implement the new semantics for -fsanitize-recover= flag, that would allow us to make some sanitizers recoverable, and some non-recoverable. No functionality change. llvm-svn: 221558	2014-11-07 22:29:38 +00:00
Duncan P. N. Exon Smith	2b049767b3	IR: MDNode => Value: Update for LLVM API change in r221375 llvm-svn: 221376	2014-11-05 18:16:35 +00:00
Hans Wennborg	606bd6dcc5	Don't dllimport inline functions when targeting MinGW (PR21366) It turns out that MinGW never dllimports of exports inline functions. This means that code compiled with Clang would fail to link with MinGW-compiled libraries since we might try to import functions that are not imported. To fix this, make Clang never dllimport inline functions when targeting MinGW. llvm-svn: 221154	2014-11-03 14:24:45 +00:00
David Majnemer	b9bd6fb397	CodeGen: Virtual dtor thunks shouldn't have this marked as 'returned' The ARM ABI virtual destructor thunks cannot be marked as 'returned' because they return undef. llvm-svn: 221042	2014-11-01 05:42:23 +00:00
Alexey Samsonov	a0ac3c2bf0	[ASan] Improve blacklisting of global variables. This commit changes the way we blacklist global variables in ASan. Now the global is excluded from instrumentation (either regular bounds checking, or initialization-order checking) if: 1) Global is explicitly blacklisted by its mangled name. This part is left unchanged. 2) SourceLocation of a global is in blacklisted source file. This changes the old behavior, where instead of looking at the SourceLocation of a variable we simply considered llvm::Module identifier. This was wrong, as identifier may not correspond to the file name, and we incorrectly disabled instrumentation for globals coming from #include'd files. 3) Global is blacklisted by type. Now we build the type of a global variable using Clang machinery (QualType::getAsString()), instead of llvm::StructType::getName(). After this commit, the active users of ASan blacklist files may have to revisit them (this is a backwards-incompatible change). llvm-svn: 220097	2014-10-17 22:37:33 +00:00

1 2 3 4 5 ...

1245 Commits