llvm-project

Commit Graph

Author	SHA1	Message	Date
Benjamin Kramer	c2d2b4259c	[CodeGen] Remove dead code. NFC. llvm-svn: 250418	2015-10-15 15:29:40 +00:00
Samuel Antao	bed3c46632	[OpenMP] Target directive host codegen. This patch implements the outlining for offloading functions for code annotated with the OpenMP target directive. It uses a temporary naming of the outlined functions that will have to be updated later on once target side codegen and registration of offloading libraries is implemented - the naming needs to be made unique in the produced library. llvm-svn: 249148	2015-10-02 16:14:20 +00:00
Charles Davis	c7d5c94f78	Support __builtin_ms_va_list. Summary: This change adds support for `__builtin_ms_va_list`, a GCC extension for variadic `ms_abi` functions. The existing `__builtin_va_list` support is inadequate for this because `va_list` is defined differently in the Win64 ABI vs. the System V/AMD64 ABI. Depends on D1622. Reviewers: rsmith, rnk, rjmccall CC: cfe-commits Differential Revision: http://reviews.llvm.org/D1623 llvm-svn: 247941	2015-09-17 20:55:33 +00:00
Piotr Padlewski	4b1ac72cd4	Decorating vptr load & stores with !invariant.group Adding !invariant.group to vptr load/stores for devirtualization purposes. For more goto: http://lists.llvm.org/pipermail/cfe-dev/2015-July/044227.html http://reviews.llvm.org/D12026 llvm-svn: 247725	2015-09-15 21:46:55 +00:00
Piotr Padlewski	d679d7e924	Generating assumption loads of vptr after ctor call (fixed) Generating call assume(icmp %vtable, %global_vtable) after constructor call for devirtualization purposes. For more info go to: http://lists.llvm.org/pipermail/cfe-dev/2015-July/044227.html Edit: Fixed version because of PR24479 and other bug caused in chrome. After this patch got reverted because of ScalarEvolution bug (D12719) Merged after John McCall big patch (Added Address). http://reviews.llvm.org/D11859 http://reviews.llvm.org/D12865 llvm-svn: 247646	2015-09-15 00:37:06 +00:00
Piotr Padlewski	4bed31b9bf	Revert "Generating assumption loads of vptr after ctor call (fixed)" It seems that there is small bug, and we can't generate assume loads when some virtual functions have internal visibiliy This reverts commit 982bb7d966947812d216489b3c519c9825cacbf2. llvm-svn: 247332	2015-09-10 20:18:30 +00:00
Alexey Bataev	2377fe95c6	[OPENMP] Outlined function for parallel and other regions with list of captured variables. Currently all variables used in OpenMP regions are captured into a record and passed to outlined functions in this record. It may result in some poor performance because of too complex analysis later in optimization passes. Patch makes to emit outlined functions for parallel-based regions with a list of captured variables. It reduces code for 2*n GEPs, stores and loads at least. Codegen for task-based regions remains unchanged because runtime requires that all captured variables are passed in captured record. llvm-svn: 247251	2015-09-10 08:12:02 +00:00
Piotr Padlewski	255652e828	Generating assumption loads of vptr after ctor call (fixed) Generating call assume(icmp %vtable, %global_vtable) after constructor call for devirtualization purposes. For more info go to: http://lists.llvm.org/pipermail/cfe-dev/2015-July/044227.html Edit: Fixed version because of PR24479. After this patch got reverted because of ScalarEvolution bug (D12719) Merged after John McCall big patch (Added Address). http://reviews.llvm.org/D11859 llvm-svn: 247199	2015-09-09 22:20:28 +00:00
Michael Zolotukhin	84df12375c	Introduce __builtin_nontemporal_store and __builtin_nontemporal_load. Summary: Currently clang provides no general way to generate nontemporal loads/stores. There are some architecture specific builtins for doing so (e.g. in x86), but there is no way to generate non-temporal store on, e.g. AArch64. This patch adds generic builtins which are expanded to a simple store with '!nontemporal' attribute in IR. Differential Revision: http://reviews.llvm.org/D12313 llvm-svn: 247104	2015-09-08 23:52:33 +00:00
John McCall	7f416cc426	Compute and preserve alignment more faithfully in IR-generation. Introduce an Address type to bundle a pointer value with an alignment. Introduce APIs on CGBuilderTy to work with Address values. Change core APIs on CGF/CGM to traffic in Address where appropriate. Require alignments to be non-zero. Update a ton of code to compute and propagate alignment information. As part of this, I've promoted CGBuiltin's EmitPointerWithAlignment helper function to CGF and made use of it in a number of places in the expression emitter. The end result is that we should now be significantly more correct when performing operations on objects that are locally known to be under-aligned. Since alignment is not reliably tracked in the type system, there are inherent limits to this, but at least we are no longer confused by standard operations like derived-to-base conversions and array-to-pointer decay. I've also fixed a large number of bugs where we were applying the complete-object alignment to a pointer instead of the non-virtual alignment, although most of these were hidden by the very conservative approach we took with member alignment. Also, because IRGen now reliably asserts on zero alignments, we should no longer be subject to an absurd but frustrating recurring bug where an incomplete type would report a zero alignment and then we'd naively do a alignmentAtOffset on it and emit code using an alignment equal to the largest power-of-two factor of the offset. We should also now be emitting much more aggressive alignment attributes in the presence of over-alignment. In particular, field access now uses alignmentAtOffset instead of min. Several times in this patch, I had to change the existing code-generation pattern in order to more effectively use the Address APIs. For the most part, this seems to be a strict improvement, like doing pointer arithmetic with GEPs instead of ptrtoint. That said, I've tried very hard to not change semantics, but it is likely that I've failed in a few places, for which I apologize. ABIArgInfo now always carries the assumed alignment of indirect and indirect byval arguments. In order to cut down on what was already a dauntingly large patch, I changed the code to never set align attributes in the IR on non-byval indirect arguments. That is, we still generate code which assumes that indirect arguments have the given alignment, but we don't express this information to the backend except where it's semantically required (i.e. on byvals). This is likely a minor regression for those targets that did provide this information, but it'll be trivial to add it back in a later patch. I partially punted on applying this work to CGBuiltin. Please do not add more uses of the CreateDefaultAligned{Load,Store} APIs; they will be going away eventually. llvm-svn: 246985	2015-09-08 08:05:57 +00:00
Alexey Bataev	caacd53dde	[OPENMP] Fix for http://llvm.org/PR24674 : assertion failed and and abort trap Fix processing of shared variables with reference types in OpenMP constructs. Previously, if the variable was not marked in one of the private clauses, the reference to this variable was emitted incorrectly and caused an assertion later. llvm-svn: 246846	2015-09-04 11:26:21 +00:00
Dan Gohman	c285307e14	[WebAssembly] Initial WebAssembly support in clang This implements basic support for compiling (though not yet assembling or linking) for a WebAssembly target. Note that ABI details are not yet finalized, and may change. Differential Revision: http://reviews.llvm.org/D12002 llvm-svn: 246814	2015-09-03 22:51:53 +00:00
Alexey Bataev	d6fdc8b685	[OPENMP 4.0] Codegen for array sections. Added codegen for array section in 'depend' clause of 'task' directive. It emits to pointers, one for the begin of array section and another for the end of array section. Size of the section is calculated as (end + 1 - start) * sizeof(basic_element_type). llvm-svn: 246422	2015-08-31 07:32:19 +00:00
Daniel Jasper	ad5b7962c9	Revert "[OPENMP 4.0] Codegen for array sections." The test is currently failing on bots: http://lab.llvm.org:8080/green/job/clang-stage1-cmake-RA-incremental_check/12747/ llvm-svn: 246288	2015-08-28 08:42:22 +00:00
Steven Wu	5528da76ef	Revert r246214 and r246213 These two commits causes llvm LTO bootstrap to hang in ScalarEvolution. llvm-svn: 246282	2015-08-28 07:14:10 +00:00
Alexey Bataev	117fb35cf7	[OPENMP 4.0] Codegen for array sections. Added codegen for array section in 'depend' clause of 'task' directive. It emits to pointers, one for the begin of array section and another for the end of array section. Size of the section is calculated as (end + 1 - start) * sizeof(basic_element_type). llvm-svn: 246278	2015-08-28 06:09:05 +00:00
Piotr Padlewski	525f746710	Generating assumption loads of vptr after ctor call (fixed) Generating call assume(icmp %vtable, %global_vtable) after constructor call for devirtualization purposes. For more info go to: http://lists.llvm.org/pipermail/cfe-dev/2015-July/044227.html Edit: Fixed version because of PR24479. http://reviews.llvm.org/D11859 llvm-svn: 246213	2015-08-27 21:35:37 +00:00
Piotr Padlewski	fa0e11efdd	Revert "Generating assumption loads of vptr after ctor call (fixed)" Reverting because of 245721 This reverts commit 552658e2b60543c928030b09cc9b5dfcb40c3f28. llvm-svn: 245727	2015-08-21 19:49:41 +00:00
Piotr Padlewski	910a059e42	Generating assumption loads of vptr after ctor call (fixed) Generating call assume(icmp %vtable, %global_vtable) after constructor call for devirtualization purposes. For more info go to: http://lists.llvm.org/pipermail/cfe-dev/2015-July/044227.html Edit: Fixed version because of PR24479. http://reviews.llvm.org/D11859 llvm-svn: 245721	2015-08-21 18:28:00 +00:00
Justin Bogner	3c32c83daa	Revert "Generating assumption loads of vptr after ctor call (fixed)" Bootstrap bots were failing: http://lab.llvm.org:8080/green/job/clang-stage2-configure-Rlto_build/6382/ http://bb.pgr.jp/builders/clang-3stage-i686-linux/builds/2969 This reverts r245264. llvm-svn: 245267	2015-08-18 05:40:20 +00:00
Piotr Padlewski	bc7497abbb	Generating assumption loads of vptr after ctor call (fixed) Generating call assume(icmp %vtable, %global_vtable) after constructor call for devirtualization purposes. For more info go to: http://lists.llvm.org/pipermail/cfe-dev/2015-July/044227.html Edit: Fixed version because of PR24479. http://reviews.llvm.org/D11859 llvm-svn: 245264	2015-08-18 03:52:00 +00:00
Hans Wennborg	386e442d1d	Revert r245257 "Generating assumption loads of vptr after ctor call" It caused PR24479 llvm-svn: 245260	2015-08-18 00:17:58 +00:00
Piotr Padlewski	a3f6f9477b	Generating assumption loads of vptr after ctor call Generating call assume(icmp %vtable, %global_vtable) after constructor call for devirtualization purposes. For more info go to: http://lists.llvm.org/pipermail/cfe-dev/2015-July/044227.html http://reviews.llvm.org/D11859 llvm-svn: 245257	2015-08-17 23:33:49 +00:00
Filipe Cabecinhas	7af183d841	Propagate SourceLocations through to get a Loc on float_cast_overflow Summary: float_cast_overflow is the only UBSan check without a source location attached. This patch propagates SourceLocations where necessary to get them to the EmitCheck() call. Reviewers: rsmith, ABataev, rjmccall Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D11757 llvm-svn: 244568	2015-08-11 04:19:28 +00:00
Filipe Cabecinhas	650d7f7dd5	Don't repeat function names in comments. NFC. llvm-svn: 244018	2015-08-05 06:19:26 +00:00
David Majnemer	dbf1045ad7	[MS ABI] Hook clang up to the new EH instructions The new EH instructions make it possible for LLVM to generate .xdata tables that the MSVC personality routines will be happy about. Because this is experimental, hide it behind a -cc1 flag (-fnew-ms-eh). Differential Revision: http://reviews.llvm.org/D11405 llvm-svn: 243767	2015-07-31 17:58:45 +00:00
Tyler Nowicki	54c020d372	Use CGLoopInfo to emit metadata for loop hint pragmas. When ‘#pragma clang loop vectorize(assume_safety)’ was specified on a loop other loop hints were lost. The problem is that CGLoopInfo attaches metadata differently than EmitCondBrHints in CGStmt. For do-loops CGLoopInfo attaches metadata to the br in the body block and for while and for loops, the inc block. EmitCondBrHints on the other hand always attaches data to the br in the cond block. When specifying assume_safety CGLoopInfo emits an empty llvm.loop metadata shadowing the metadata in the cond block. Loop transformations like rotate and unswitch would then eliminate the cond block and its non-empty metadata. This patch unifies both approaches for adding metadata and modifies the existing safety tests to include non-assume_safety loop hints. llvm-svn: 243315	2015-07-27 20:10:20 +00:00
David Blaikie	fd7c2198e4	Fix GCC build due to shadowing llvm-svn: 242826	2015-07-21 18:59:10 +00:00
David Blaikie	f05779e21c	Pass an iterator range to EmitCallArgs llvm-svn: 242824	2015-07-21 18:37:18 +00:00
Michael Wong	65f367fcbb	Commit for http://reviews.llvm.org/D10765 for OpenMP 4 target data directive parsing and sema. This commit is on behalf of Kelvin Li. llvm-svn: 242785	2015-07-21 13:44:28 +00:00
Benjamin Kramer	f48ee4482a	[AST] Cleanup ExprIterator. - Make it a proper random access iterator with a little help from iterator_adaptor_base - Clean up users of magic dereferencing. The iterator should behave like an Expr **. - Make it an implementation detail of Stmt. This allows inlining of the assertions. llvm-svn: 242608	2015-07-18 14:35:53 +00:00
Rafael Espindola	d6e669458c	Set the linkage before setting the visibility. Otherwise the visibility setting code would not know that a given function was available_externally. Fixes PR24097. llvm-svn: 242012	2015-07-13 06:07:58 +00:00
Reid Kleckner	98cb8ba64c	Update clang for intrinsic rename of framerecover to localrecover llvm-svn: 241634	2015-07-07 22:26:07 +00:00
Aaron Ballman	7c04eae204	Silence -Wparentheses warnings (and ran it through clang-format); NFC. llvm-svn: 241582	2015-07-07 13:25:57 +00:00
Douglas Gregor	e83b95641f	Substitute type arguments into uses of Objective-C interface members. When messaging a method that was defined in an Objective-C class (or category or extension thereof) that has type parameters, substitute the type arguments for those type parameters. Similarly, substitute into property accesses, instance variables, and other references. This includes general infrastructure for substituting the type arguments associated with an ObjCObject(Pointer)Type into a type referenced within a particular context, handling all of the substitutions required to deal with (e.g.) inheritance involving parameterized classes. In cases where no type arguments are available (e.g., because we're messaging via some unspecialized type, id, etc.), we substitute in the type bounds for the type parameters instead. Example: @interface NSSet<T : id<NSCopying>> : NSObject <NSCopying> - (T)firstObject; @end void f(NSSet<NSString > stringSet, NSSet anySet) { [stringSet firstObject]; // produces NSString [anySet firstObject]; // produces id<NSCopying> (the bound) } When substituting for the type parameters given an unspecialized context (i.e., no specific type arguments were given), substituting the type bounds unconditionally produces type signatures that are too strong compared to the pre-generics signatures. Instead, use the following rule: - In covariant positions, such as method return types, replace type parameters with “id” or “Class” (the latter only when the type parameter bound is “Class” or qualified class, e.g, “Class<NSCopying>”) - In other positions (e.g., parameter types), replace type parameters with their type bounds. - When a specialized Objective-C object or object pointer type contains a type parameter in its type arguments (e.g., NSArray<T>, but not NSArray<NSString > ), replace the entire object/object pointer type with its unspecialized version (e.g., NSArray ). llvm-svn: 241543	2015-07-07 03:57:53 +00:00
Reid Kleckner	9fe7f2396b	Revert "Revert 241171, 241187, 241199 (32-bit SEH)." This reverts commit r241244, but restricts SEH support to Win64. This way, Chromium builds will still fall back on TUs with SEH, and Clang developers can work on this incrementally upstream while patching this small predicate locally. It'll also make it easier to review small fixes. llvm-svn: 241533	2015-07-07 00:36:30 +00:00
Alexey Bataev	81c7ea0ec3	[OPENMP 4.0] Fixed codegen for 'cancellation point' construct. Generate the next code for 'cancellation point': if (__kmpc_cancellationpoint()) { __kmpc_cancel_barrier(); <exit construct>; } llvm-svn: 241336	2015-07-03 09:56:58 +00:00
Akira Hatanaka	85365cd72a	Attach attribute "trap-func-name" to call sites of llvm.trap and llvm.debugtrap. This is needed to use clang's command line option "-ftrap-function" for LTO and enable changing the trap function name on a per-call-site basis. rdar://problem/21225723 Differential Revision: http://reviews.llvm.org/D10831 llvm-svn: 241306	2015-07-02 22:15:41 +00:00
Alexey Bataev	80909878ad	[OPENMP 4.0] Initial support for 'omp cancel' construct. Implemented parsing/sema analysis + (de)serialization. llvm-svn: 241253	2015-07-02 11:25:17 +00:00
Nico Weber	e4f974c6fb	Revert 241171, 241187, 241199 (32-bit SEH). It still doesn't produce quite the right code, test binaries built with this enabled fail some tests. llvm-svn: 241244	2015-07-02 06:10:53 +00:00
Alexey Bataev	0f34da12e4	[OPENMP 4.0] Codegen for 'cancellation point' directive. The next code is generated for this construct: ``` if (__kmpc_cancellationpoint(ident_t *loc, kmp_int32 global_tid, kmp_int32 cncl_kind) != 0) <exit from outer innermost construct>; ``` llvm-svn: 241239	2015-07-02 04:17:07 +00:00
Reid Kleckner	eb11c41900	[SEH] Delete the 32-bit IR lowering for __finally blocks and use x64 32-bit finally funclets are intended to be called both directly from the parent function and indirectly from the EH runtime. Because we aren't contorting LLVM's X86 prologue to match MSVC's, calling the finally block directly passes in a different value of EBP than the one that the runtime provides. We need an adapter thunk to adjust EBP to the expected value. However, WinEHPrepare already has to solve this problem when cleanups are not pre-outlined, so we can go ahead and rely on it rather than duplicating work. Now we only do the llvm.x86.seh.recoverfp dance for 32-bit SEH filter functions. llvm-svn: 241187	2015-07-01 21:00:00 +00:00
Reid Kleckner	d0d9a1f63f	[SEH] Add 32-bit lowering for SEH __try This re-lands r236052 and adds support for __exception_code(). In 32-bit SEH, the exception code is not available in eax. It is only available in the filter function, and now we arrange to load it and store it into an escaped variable in the parent frame. As a consequence, we have to disable the "catch i8* null" optimization on 32-bit and always generate a filter function. We can re-enable the optimization if we detect an __except block that doesn't use the exception code, but this probably isn't worth optimizing. Reviewers: majnemer Differential Revision: http://reviews.llvm.org/D10852 llvm-svn: 241171	2015-07-01 17:10:10 +00:00
Alexey Bataev	6d4ed05830	[OPENMP 4.0] Initial support for 'omp cancellation point' construct. Add parsing and sema analysis for 'omp cancellation point' directive. llvm-svn: 241145	2015-07-01 06:57:41 +00:00
Justin Bogner	bdff219439	CodeGen: Resize LifetimeExtendedCleanupHeader to avoid alignment issues The LifetimeExtendedCleanupHeader is carefully fit into 32 bytes, meaning that cleanups on the LifetimeExtendedCleanupStack are always allocated at a misaligned address and cause undefined behaviour. There are two ways to solve this - add padding after the header when we allocated our cleanups, or just simplify the header and let it use 64 bits in the first place. I've opted for the latter, and added a static assert to avoid the issue in the future. llvm-svn: 241133	2015-07-01 00:59:27 +00:00
Peter Collingbourne	e286b0e1f2	Fix use-after-free. llvm-svn: 241121	2015-06-30 22:08:44 +00:00
Artem Belevich	d21e5c6684	[CUDA] Implemented __nvvm_atom__gen_ builtins. Integer variants are implemented as atomicrmw or cmpxchg instructions. Atomic add for floating point (__nvvm_atom_add_gen_f()) is implemented as a call to an overloaded @llvm.nvvm.atomic.load.add.f32.* LVVM intrinsic. Differential Revision: http://reviews.llvm.org/D10666 llvm-svn: 240669	2015-06-25 18:29:42 +00:00
Alexey Bataev	d157d47062	Proper changing/restoring for CapturedStmtInfo, NFC. Added special RAII class for proper values changing/restoring in CodeGenFunction::CapturedStmtInfo. llvm-svn: 240517	2015-06-24 03:35:38 +00:00
Matt Arsenault	3ea39f9e78	AMDGPU: Fix places missed in rename llvm-svn: 240148	2015-06-19 17:54:10 +00:00
Peter Collingbourne	6708c4a176	Implement diagnostic mode for -fsanitize=cfi*, -fsanitize=cfi-diag. This causes programs compiled with this flag to print a diagnostic when a control flow integrity check fails instead of aborting. Diagnostics are printed using UBSan's runtime library. The main motivation of this feature over -fsanitize=vptr is fidelity with the -fsanitize=cfi implementation: the diagnostics are printed under exactly the same conditions as those which would cause -fsanitize=cfi to abort the program. This means that the same restrictions apply regarding compiling all translation units with -fsanitize=cfi, cross-DSO virtual calls are forbidden, etc. Differential Revision: http://reviews.llvm.org/D10268 llvm-svn: 240109	2015-06-19 01:51:54 +00:00
Alexey Bataev	c30dd2daf9	[OPENMP] Support for '#pragma omp taskgroup' directive. Added parsing, sema analysis and codegen for '#pragma omp taskgroup' directive (OpenMP 4.0). The code for directive is generated the following way: #pragma omp taskgroup <body> void __kmpc_taskgroup(<loc>, thread_id); <body> void __kmpc_end_taskgroup(<loc>, thread_id); llvm-svn: 240011	2015-06-18 12:14:09 +00:00
Alexey Bataev	3b5b5c492e	[OPENMP] Add support for 'omp parallel for' directive. Codegen for this directive is a combined codegen for 'omp parallel' region with 'omp for simd' region inside. Clauses are supported. llvm-svn: 240006	2015-06-18 10:10:12 +00:00
Alexey Bataev	58e5bdb091	[OPENMP] Add support for 'omp for simd' directive. Added codegen for combined 'omp for simd' directives, that is a combination of 'omp for' directive followed by 'omp simd' directive. Includes support for all clauses. llvm-svn: 239990	2015-06-18 04:45:29 +00:00
Alexey Bataev	cbdcbb7690	[OPENMP] Code reformatting for omp simd codegen, NFC. llvm-svn: 239889	2015-06-17 07:45:51 +00:00
Alexey Bataev	fc087ecc05	[OPENMP] Support lastprivate clause in omp simd directive. Added codegen for lastprivate clauses within simd loop-based directives. llvm-svn: 239813	2015-06-16 13:14:42 +00:00
Alexey Bataev	ae05c29ab5	[OPENMP] Remove last iteration separation for loop-based constructs. Previously the last iteration for simd loop-based OpenMP constructs were generated as a separate code. This feature is not required and codegen is simplified. llvm-svn: 239810	2015-06-16 11:59:36 +00:00
Reid Kleckner	0b9bbbfc13	Revert "Re-land r236052, "[SEH] Add 32-bit lowering code for __try"" This reverts commit r239415. This was committed accidentally, LLVM isn't ready for this. llvm-svn: 239417	2015-06-09 17:49:42 +00:00
Reid Kleckner	65870442b3	Re-land r236052, "[SEH] Add 32-bit lowering code for __try" This reverts r236167. LLVM should be ready for this now. llvm-svn: 239415	2015-06-09 17:47:50 +00:00
Nuno Lopes	1ba2d78b9a	ubsan: Check for null pointers given to certain builtins, such as memcpy, memset, memmove, and bzero. Reviewed by: Richard Smith Differential Revision: http://reviews.llvm.org/D9673 llvm-svn: 238657	2015-05-30 16:11:40 +00:00
Justin Bogner	20eb9d486c	wip: Remove some unused functions llvm-svn: 238538	2015-05-29 02:42:14 +00:00
Alexey Bataev	d7589ffe1d	[OPENMP] Fix codegen for ordered loop directives. loops with ordered clause must be generated the same way as dynamic loops, but with static scheduleing. llvm-svn: 237788	2015-05-20 13:12:48 +00:00
Alexey Bataev	f0ab553fea	[OPENMP] Fixed bug in atomic update/capture/write constructs. Fixed a bug with codegen for destination atomic l-value with padding and junk in this padding bytes. llvm-svn: 237422	2015-05-15 08:36:34 +00:00
Peter Collingbourne	3eea677f3a	Unify sanitizer kind representation between the driver and the rest of the compiler. No functional change. Differential Revision: http://reviews.llvm.org/D9618 llvm-svn: 237055	2015-05-11 21:39:14 +00:00
Justin Bogner	65512647cc	InstrProf: Cede ownership of createProfileWeights to CGF The fact that PGO has a say in how these branch weights are determined isn't interesting to most of CodeGen, so it makes more sense for this API to be accessible via CodeGenFunction rather than CodeGenPGO. llvm-svn: 236380	2015-05-02 05:00:55 +00:00
Reid Kleckner	cb7a0a0562	Revert most of r236271, leaving only the datalayout change in lib/Basic/Targets.cpp llvm-svn: 236274	2015-04-30 22:29:25 +00:00
Reid Kleckner	af67602e14	Use 4 byte preferred aggregate alignment in datalayout on x86 Win32 llvm-svn: 236271	2015-04-30 22:13:05 +00:00
Reid Kleckner	be9843ce54	Revert r236128, LLVM isn't falling back in the right way llvm-svn: 236167	2015-04-29 21:55:21 +00:00
Reid Kleckner	0bb12a8981	Re-land r236052, the linker errors were fixed by LLVM r236123 Basic __finally blocks don't cause linker errors anymore (although they are miscompiled). llvm-svn: 236128	2015-04-29 17:17:17 +00:00
Nico Weber	ea721b64df	Revert r236052, it caused linker errors when building 32-bit applications. llvm-svn: 236082	2015-04-29 03:08:32 +00:00
Reid Kleckner	ddd40964f0	[SEH] Add 32-bit lowering code for __try This is just the clang-side of 32-bit SEH. LLVM still needs work, and it will determinstically fail to compile until it's feature complete. On x86, all outlined handlers have no parameters, but they do implicitly take the EBP value passed in and use it to address locals of the parent frame. We model this with llvm.frameaddress(1). This works (mostly), but __finally block inlining can break it. For now, we apply the 'noinline' attribute. If we really want to inline __finally blocks on 32-bit x86, we should teach the inliner how to untangle frameescape and framerecover. Promote the error diagnostic from codegen to sema. It now rejects SEH on non-Windows platforms. LLVM doesn't implement SEH on non-x86 Windows platforms, but there's nothing preventing it. llvm-svn: 236052	2015-04-28 22:19:32 +00:00
Justin Bogner	66242d6c5e	InstrProf: Stop using RegionCounter outside of CodeGenPGO (NFC) The RegionCounter type does a lot of legwork, but most of it is only meaningful within the implementation of CodeGenPGO. The uses elsewhere in CodeGen generally just want to increment or read counters, so do that directly. llvm-svn: 235664	2015-04-23 23:06:47 +00:00
Alexey Bataev	5e018f9e29	[OPENMP] Codegen for 'atomic capture'. Adds codegen for 'atomic capture' constructs with the following forms of expressions/statements: v = x binop= expr; v = x++; v = ++x; v = x--; v = --x; v = x = x binop expr; v = x = expr binop x; {v = x; x = binop= expr;} {v = x; x++;} {v = x; ++x;} {v = x; x--;} {v = x; --x;} {x = x binop expr; v = x;} {x binop= expr; v = x;} {x++; v = x;} {++x; v = x;} {x--; v = x;} {--x; v = x;} {x = x binop expr; v = x;} {x = expr binop x; v = x;} {v = x; x = expr;} If x and expr are integer and binop is associative or x is a LHS in a RHS of the assignment expression, and atomics are allowed for type of x on the target platform atomicrmw instruction is emitted. Otherwise compare-and-swap sequence is emitted. Update of 'v' is not required to be be atomic with respect to the read or write of the 'x'. bb: ... atomic load <x> cont: <expected> = phi [ <x>, label %bb ], [ <new_failed>, %cont ] <desired> = <expected> binop <expr> <res> = cmpxchg atomic &<x>, desired, expected <new_failed> = <res>.field1; br <res>field2, label %exit, label %cont exit: atomic store <old/new x>, <v> ... Differential Revision: http://reviews.llvm.org/D9049 llvm-svn: 235573	2015-04-23 06:35:10 +00:00
David Majnemer	dc012fa266	Revert "Revert r234581, it might have caused a few miscompiles in Chromium." This reverts commit r234700. It turns out that the lifetime markers were not the cause of Chromium failing but a bug which was uncovered by optimizations exposed by the markers. llvm-svn: 235553	2015-04-22 21:38:15 +00:00
Alexey Bataev	98eb6e3d41	[OPENMP] Codegen for 'ordered' directive. Add codegen for 'ordered' directive: __kmpc_ordered(ident_t , gtid); <associated statement>; __kmpc_end_ordered(ident_t , gtid); Also for 'for' directives with the dynamic scheduling and an 'ordered' clause added a call to '__kmpc_dispatch_fini_(4\|8)[u]()' function after increment expression for loop control variable: while(__kmpc_dispatch_next(&LB, &UB)) { idx = LB; while (idx <= UB) { BODY; ++idx; __kmpc_dispatch_fini_(4\|8)[u](); // For ordered loops only. } // inner loop } Differential Revision: http://reviews.llvm.org/D9070 llvm-svn: 235496	2015-04-22 11:15:40 +00:00
Alexey Bataev	f56f98c925	[OPENMP] Codegen for 'copyin' clause in 'parallel' directive. Emits the following code for the clause at the beginning of the outlined function for implicit threads: if (<not a master thread>) { ... <thread local copy of var> = <master thread local copy of var>; ... } <sync point>; Checking for a non-master thread is performed by comparing of the address of the thread local variable with the address of the master's variable. Master thread always uses original variables, so you always know the address of the variable in the master thread. Differential Revision: http://reviews.llvm.org/D9026 llvm-svn: 235075	2015-04-16 05:39:01 +00:00
Alexey Bataev	38e8953352	[OPENMP] Codegen for 'lastprivate' clause in 'for' directive. #pragma omp for lastprivate(<var>) for (i = a; i < b; ++b) <BODY>; This construct is translated into something like: <last_iter> = alloca i32 <lastprivate_var> = alloca <type> <last_iter> = 0 ; No initializer for simple variables or a default constructor is called for objects. ; For arrays perform element by element initialization by the call of the default constructor. ... OMP_FOR_START(...,<last_iter>, ..); sets <last_iter> to 1 if this is the last iteration. <BODY> ... OMP_FOR_END if (<last_iter> != 0) { <var> = <lastprivate_var> ; Update original variable with the lastprivate value. } call __kmpc_cancel_barrier() ; an implicit barrier to avoid possible data race. Differential Revision: http://reviews.llvm.org/D8658 llvm-svn: 235074	2015-04-16 04:54:05 +00:00
Alexey Bataev	69c62a9bdb	[OPENMP] Codegen for 'firstprivate' clause in 'for' directive. Adds proper codegen for 'firstprivate' clause in for directive. Initially codegen for 'firstprivate' clause was implemented for 'parallel' directive only. Also this patch emits sync point only after initialization of firstprivate variables, not all private variables. This sync point is not required for privates, lastprivates etc., only for initialization of firstprivate variables. Differential Revision: http://reviews.llvm.org/D8660 llvm-svn: 234978	2015-04-15 04:52:20 +00:00
Reid Kleckner	ebaf28d13d	Reland r234613 (and follow-ups 234614, 234616, 234618) The frameescape intrinsic cannot be inlined, so I fixed the inliner in r234937. This should address PR23216. llvm-svn: 234942	2015-04-14 20:59:00 +00:00
Alexey Bataev	420d45b2dd	[OPENMP] Fixed codegen for arrays in 'copyprivate' clause. Fixed a bug with codegen of variables with array types specified in 'copyprivate' clause of 'single' directive. Differential Revision: http://reviews.llvm.org/D8914 llvm-svn: 234856	2015-04-14 05:11:24 +00:00
Alexey Bataev	68adb7da1a	[OPENMP] Initial codegen for 'parallel sections' directive. Emits code for outlined 'parallel' directive with the implicitly inlined 'sections' directive: ... call __kmpc_fork_call(..., outlined_function, ...); ... define internal void outlined_function(...) { <code for implicit sections directive>; } Differential Revision: http://reviews.llvm.org/D8997 llvm-svn: 234849	2015-04-14 03:29:22 +00:00
Nico Weber	ad108337cf	Revert r234613 (and follow-ups 234614, 234616, 234618), it caused PR23216. llvm-svn: 234789	2015-04-13 20:04:22 +00:00
Nico Weber	f2a39a7b4e	Revert r234786, it contained a bunch of stuff I did not mean to commit. llvm-svn: 234787	2015-04-13 20:03:03 +00:00
Nico Weber	b31abb05fb	Revert r234613 (and follow-ups 234614, 234616, 234618), it caused PR23216. llvm-svn: 234786	2015-04-13 20:01:20 +00:00
Nico Weber	1c565c31b1	Revert r234581, it might have caused a few miscompiles in Chromium. If the revert helps, I'll get a repro this Monday. Else I'll put the change back in. llvm-svn: 234700	2015-04-11 23:51:38 +00:00
Reid Kleckner	11859afd5f	[SEH] Re-land r234532, but use internal linkage for all SEH helpers Even though these symbols are in a comdat group, the Microsoft linker really wants them to have internal linkage. I'm planning to tweak the mangling in a follow-up change. This is a straight revert with a 1-line fix. llvm-svn: 234613	2015-04-10 17:34:52 +00:00
Alexey Bataev	794ba0dcb7	[OPENMP] Codegen for 'reduction' clause in 'parallel' directive. Emit a code for reduction clause. Next code should be emitted for reductions: static kmp_critical_name lock = { 0 }; void reduce_func(void lhs[<n>], void rhs[<n>]) { ... (Type<i> )lhs[i] = RedOp<i>((Type<i> )lhs[i], (Type<i> )rhs[i]); ... } ... void RedList[<n>] = {&<RHSExprs>[0], ..., &<RHSExprs>[<n> - 1]}; switch (__kmpc_reduce{_nowait}(<loc>, <gtid>, <n>, sizeof(RedList), RedList, reduce_func, &<lock>)) { case 1: ... <LHSExprs>[i] = RedOp<i>(<LHSExprs>[i], <RHSExprs>[i]); ... __kmpc_end_reduce{_nowait}(<loc>, <gtid>, &<lock>); break; case 2: ... Atomic(<LHSExprs>[i] = RedOp<i>(<LHSExprs>[i], *<RHSExprs>[i])); ... break; default: ; } Reduction variables are a kind of a private variables, they have private copies, but initial values are chosen in accordance with the reduction operation. Differential Revision: http://reviews.llvm.org/D8915 llvm-svn: 234583	2015-04-10 10:43:45 +00:00
Arnaud A. de Grandmaison	047a686d53	Remove threshold for inserting lifetime markers for named temporaries Now that TailRecursionElimination has been fixed with r222354, the threshold on size for lifetime marker insertion can be removed. This only affects named temporary though, as the patch for unnamed temporaries is still in progress. My previous commit (r222993) was not handling debuginfo correctly, but this could only be seen with some asan tests. Basically, lifetime markers are just instrumentation for the compiler's usage and should not affect debug information; however, the cleanup infrastructure was assuming it contained only destructors, i.e. actual code to be executed, and was setting the breakpoint for the end of the function to the closing '}', and not the return statement, in order to show some destructors have been called when leaving the function. This is wrong when the cleanups are only lifetime markers, and this is now fixed. llvm-svn: 234581	2015-04-10 10:13:52 +00:00
Alexey Bataev	6f1ffc069b	[OPENMP] Refactoring of codegen for OpenMP directives. Refactored API of OpenMPRuntime for compatibility with combined directives. Differential Revision: http://reviews.llvm.org/D8859 llvm-svn: 234564	2015-04-10 04:50:10 +00:00
Nico Weber	bd51a6a99f	Revert r234532 for a bit, it very likely caused http://crbug.com/475768 llvm-svn: 234563	2015-04-10 04:33:03 +00:00
Reid Kleckner	0dbecf2b78	[SEH] Outline finally blocks using the new variable capture support WinEHPrepare was going to have to pattern match the control flow merge and split that the old lowering used, and that wasn't really feasible. Now we can teach WinEHPrepare to pattern match this, which is much simpler: %fp = call i8* @llvm.frameaddress(i32 0) call void @func(iN [01], i8* %fp) This prototype happens to match the prototype used by the Win64 SEH personality function, so this is really simple. llvm-svn: 234532	2015-04-09 20:37:24 +00:00
Reid Kleckner	31a1bb0c14	Reland "[SEH] Implement filter capturing in CodeGen" The test should be fixed. It was failing in NDEBUG builds due to a missing '*' character in a regex. In asserts builds, the pattern matched a single digit value, which became a double digit value in NDEBUG builds. Go figure. This reverts commit r234261. llvm-svn: 234447	2015-04-08 22:23:48 +00:00
Daniel Jasper	303c3ac925	Revert "[SEH] Implement filter capturing in CodeGen" Test fails: http://lab.llvm.org:8080/green/job/clang-stage2-configure-Rlto_check/3182/ llvm-svn: 234306	2015-04-07 10:07:47 +00:00
Reid Kleckner	0ada50f17f	[SEH] Implement filter capturing in CodeGen While capturing filters aren't very common, we'd like to outline __finally blocks in the frontend to simplify -O0 EH preparation and reduce code size. Finally blocks are usually have captures, and this is the first step towards that. Currently we don't support capturing 'this' or VLAs. Reviewers: majnemer Differential Revision: http://reviews.llvm.org/D8825 llvm-svn: 234261	2015-04-06 23:51:44 +00:00
David Blaikie	1ed728c499	[opaque pointer type] More GEP API migrations Looks like the VTable code in particular will need some work to pass around the pointee type explicitly. llvm-svn: 234128	2015-04-05 22:45:47 +00:00
David Blaikie	fb901c7abf	[opaque pointer type] more GEP API migrations llvm-svn: 234097	2015-04-04 15:12:29 +00:00
Ulrich Weigand	3a610ebf1e	[SystemZ] Support transactional execution on zEC12 The zEC12 provides the transactional-execution facility. This is exposed to users via a set of builtin routines on other compilers. This patch adds clang support to enable those builtins. In partciular, the patch: - enables the transactional-execution feature by default on zEC12 - allows to override presence of that feature via the -mhtm/-mno-htm options - adds a predefined macro __HTM__ if the feature is enabled - adds support for the transactional-execution GCC builtins - adds Sema checking to verify the __builtin_tabort abort code - adds the s390intrin.h header file (for GCC compatibility) - adds s390 sections to the htmintrin.h and htmxlintrin.h header files Since this is first use of target-specific intrinsics on the platform, the patch creates the include/clang/Basic/BuiltinsSystemZ.def file and hooks it up in TargetBuiltins.h and lib/Basic/Targets.cpp. An associated LLVM patch adds the required LLVM IR intrinsics. For reference, the transactional-execution instructions are documented in the z/Architecture Principles of Operation for the zEC12: http://publibfp.boulder.ibm.com/cgi-bin/bookmgr/download/DZ9ZR009.pdf The associated builtins are documented in the GCC manual: http://gcc.gnu.org/onlinedocs/gcc/S_002f390-System-z-Built-in-Functions.html The htmxlintrin.h intrinsics provided for compatibility with the IBM XL compiler are documented in the "z/OS XL C/C++ Programming Guide". llvm-svn: 233804	2015-04-01 12:54:25 +00:00
Alexey Bataev	b4505a7229	[OPENMP] Codegen for 'atomic update' construct. Adds atomic update codegen for the following forms of expressions: x binop= expr; x++; ++x; x--; --x; x = x binop expr; x = expr binop x; If x and expr are integer and binop is associative or x is a LHS in a RHS of the assignment expression, and atomics are allowed for type of x on the target platform atomicrmw instruction is emitted. Otherwise compare-and-swap sequence is emitted: bb: ... atomic load <x> cont: <expected> = phi [ <x>, label %bb ], [ <new_failed>, %cont ] <desired> = <expected> binop <expr> <res> = cmpxchg atomic &<x>, desired, expected <new_failed> = <res>.field1; br <res>field2, label %exit, label %cont exit: ... Differential Revision: http://reviews.llvm.org/D8536 llvm-svn: 233513	2015-03-30 05:20:59 +00:00
Alexey Bataev	a63048e4fd	[OPENMP] Codegen for 'copyprivate' clause ('single' directive). If there is at least one 'copyprivate' clause is associated with the single directive, the following code is generated: ``` i32 did_it = 0; \\ for 'copyprivate' clause if(__kmpc_single(ident_t , gtid)) { SingleOpGen(); __kmpc_end_single(ident_t , gtid); did_it = 1; \\ for 'copyprivate' clause } <copyprivate_list>[0] = &var0; ... <copyprivate_list>[n] = &varn; call __kmpc_copyprivate(ident_t , gtid, <copyprivate_list_size>, <copyprivate_list>, <copy_func>, did_it); ... void<copy_func>(void LHSArg, void RHSArg) { Dst = (void [n])(LHSArg); Src = (void * [n])(RHSArg); Dst[0] = Src[0]; ... Dst[n] = Src[n]; } ``` All list items from all 'copyprivate' clauses are gathered into single <copyprivate list> (<copyprivate_list_size> is a size in bytes of this list) and <copy_func> is used to propagate values of private or threadprivate variables from the 'single' region to other implicit threads from outer 'parallel' region. Differential Revision: http://reviews.llvm.org/D8410 llvm-svn: 232932	2015-03-23 06:18:07 +00:00
Peter Collingbourne	d2926c91d5	Implement bad cast checks using control flow integrity information. This scheme checks that pointer and lvalue casts are made to an object of the correct dynamic type; that is, the dynamic type of the object must be a derived class of the pointee type of the cast. The checks are currently only introduced where the class being casted to is a polymorphic class. Differential Revision: http://reviews.llvm.org/D8312 llvm-svn: 232241	2015-03-14 02:42:25 +00:00
Benjamin Kramer	7f1f6b5370	Disambiguate call for GCC. llvm-svn: 232122	2015-03-12 23:46:55 +00:00
Benjamin Kramer	51680bccda	CodeGen: Base the conditional cleanup machinery on variadic templates This is complicated by the fact that we can't simply use side-effecting calls in an argument list without losing all guarantees about the order they're emitted. To keep things deterministic we use tuples and brace initialization, which thankfully guarantees evaluation order. No functionality change intended. llvm-svn: 232121	2015-03-12 23:41:40 +00:00
Alexey Bataev	2df54a07bf	[OPENMP] Initial codegen for 'omp sections' and 'omp section' directives. If only one section is found in the sections region, it is emitted just like single region. Otherwise it is emitted as a static non-chunked loop. #pragma omp sections { #pragma omp section {1} ... #pragma omp section {n} } is translated to something like i32 <iter_var> i32 <last_iter> = 0 i32 <lower_bound> = 0 i32 <upper_bound> = n-1 i32 <stride> = 1 call void @__kmpc_for_static_init_4(<loc>, i32 <gtid>, i32 34/static non-chunked/, i32* <last_iter>, i32* <lower_bound>, i32* <upper_bound>, i32* <stride>, i32 1/increment always 1/, i32 1/chunk always 1/) <upper_bound> = min(<upper_bound>, n-1) <iter_var> = <lb> check: br <iter_var> <= <upper_bound>, label cont, label exit continue: switch (IV) { case 0: {1}; break; ... case <NumSection> - 1: {n}; break; } ++<iter_var> br label check exit: call void @__kmpc_for_static_fini(<loc>, i32 <gtid>) Differential Revision: http://reviews.llvm.org/D8244 llvm-svn: 232021	2015-03-12 08:53:29 +00:00
David Majnemer	7c23707174	MS ABI: Implement support for throwing a C++ exception Throwing a C++ exception, under the MS ABI, is implemented using three components: - ThrowInfo structure which contains information like CV qualifiers, what destructor to call and a pointer to the CatchableTypeArray. - In a significant departure from the Itanium ABI, copying by-value occurs in the runtime and not at the catch site. This means we need to enumerate all possible types that this exception could be caught as and encode the necessary information to convert from the exception object's type to the catch handler's type. This includes complicated derived to base conversions and the execution of copy-constructors. N.B. This implementation doesn't support the execution of a copy-constructor from within the runtime for now. Adding support for that functionality is quite difficult due to things like default argument expressions which may evaluate arbitrary code hiding in the copy-constructor's parameters. Differential Revision: http://reviews.llvm.org/D8066 llvm-svn: 231328	2015-03-05 00:46:22 +00:00
Alexey Bataev	8cbe0a6b62	[OPENMP] Fixed codegen for directives without function outlining. Fixed crash on codegen for directives like 'omp for', 'omp single' etc. inside of the 'omp parallel', 'omp task' etc. regions. llvm-svn: 230621	2015-02-26 10:27:34 +00:00
Peter Collingbourne	a4ccff3281	Implement Control Flow Integrity for virtual calls. This patch introduces the -fsanitize=cfi-vptr flag, which enables a control flow integrity scheme that checks that virtual calls take place using a vptr of the correct dynamic type. More details in the new docs/ControlFlowIntegrity.rst file. It also introduces the -fsanitize=cfi flag, which is currently a synonym for -fsanitize=cfi-vptr, but will eventually cover all CFI checks implemented in Clang. Differential Revision: http://reviews.llvm.org/D7424 llvm-svn: 230055	2015-02-20 20:30:56 +00:00
Aaron Ballman	abc1892057	Removing LLVM_DELETED_FUNCTION, as MSVC 2012 was the last reason for requiring the macro. NFC; Clang edition. llvm-svn: 229339	2015-02-15 22:54:08 +00:00
Aaron Ballman	673476684e	Removing LLVM_EXPLICIT, as MSVC 2012 was the last reason for requiring the macro. NFC; Clang edition. llvm-svn: 229336	2015-02-15 22:00:28 +00:00
David Majnemer	a5b195a1dc	Revert "Revert r229082 for a bit, it caused PR22577." This reverts commit r229123. It was a red herring, the bug was present without r229082. llvm-svn: 229205	2015-02-14 01:35:12 +00:00
Nico Weber	7ce96b853d	Revert r229082 for a bit, it caused PR22577. llvm-svn: 229123	2015-02-13 16:27:00 +00:00
David Majnemer	abc482effc	MS ABI: Implement /volatile:ms The /volatile:ms semantics turn volatile loads and stores into atomic acquire and release operations. This distinction is important because volatile memory operations do not form a happens-before relationship with non-atomic memory. This means that a volatile store is not sufficient for implementing a mutex unlock routine. Differential Revision: http://reviews.llvm.org/D7580 llvm-svn: 229082	2015-02-13 07:55:47 +00:00
Reid Kleckner	11c033e8aa	SEH: Use the SEHTryEpilogueStack instead of a separate bool We don't need a bool to track this now that we have a stack for it. llvm-svn: 228982	2015-02-12 23:40:45 +00:00
Nico Weber	5779f84000	[ms] Implement codegen for __leave. Reviewed at http://reviews.llvm.org/D7575 llvm-svn: 228977	2015-02-12 23:16:11 +00:00
Richard Smith	527473df0d	Fix typoo. llvm-svn: 228963	2015-02-12 21:23:20 +00:00
Nico Weber	1bebad1b86	Wrap to 80 columns. No behavior change. llvm-svn: 228880	2015-02-11 22:33:32 +00:00
Reid Kleckner	a593000f01	Add the 'noinline' attribute to call sites within __try bodies LLVM doesn't support non-call exceptions, so inlining makes it harder to catch such asynchronous exceptions. llvm-svn: 228876	2015-02-11 21:40:48 +00:00
Reid Kleckner	e7b3f7c70d	Emit landing pads for SEH even if nounwind is present Disabling exceptions applies nounwind to lots of functions. SEH catches asynch exceptions, so emit the landing pad anyway. llvm-svn: 228769	2015-02-11 00:00:21 +00:00
Reid Kleckner	aca01db706	Implement IRGen for SEH __finally and AbnormalTermination Previously we would simply double-emit the body of the __finally block, but that doesn't work when it contains any kind of Decl, which we can't double emit. This fixes that by emitting the block once and branching into a shared code region and then branching back out. llvm-svn: 228222	2015-02-04 22:37:07 +00:00
David Blaikie	4d52443c0e	DebugInfo: Attribute cleanup code to the end of the scope, not the end of the function. Now if you break on a dtor and go 'up' in your debugger (or you get an asan failure in a dtor) during an exception unwind, you'll have more context. Instead of all dtors appearing to be called from the '}' of the function, they'll be attributed to the end of the scope of the variable, the same as the non-exceptional dtor call. This doesn't /quite/ remove all uses of CurEHLocation (which might be nice to remove, for a few reasons) - it's still used to choose the location for some other work in the landing pad. It'd be nice to attribute that code to the same location as the exception calls within the block and to remove CurEHLocation. llvm-svn: 228181	2015-02-04 19:47:54 +00:00
David Majnemer	fd1e739a44	CodeGen: Copy-ctorm must obey the destination's alignment requirement We would synthesize memcpy intrinsics when emitting calls to trivial C++ constructors but we wouldn't take into account the alignment of the destination. llvm-svn: 228061	2015-02-03 23:04:06 +00:00
Alexander Musman	df7a8e2bc8	Support ‘omp for’ with static chunked schedule kind. Differential Revision: http://reviews.llvm.org/D7006 llvm-svn: 226795	2015-01-22 08:49:35 +00:00
Reid Kleckner	1d59f99f5c	Initial support for Win64 SEH IR emission The lowering looks a lot like normal EH lowering, with the exception that the exceptions are caught by executing filter expression code instead of matching typeinfo globals. The filter expressions are outlined into functions which are used in landingpad clauses where typeinfo would normally go. Major aspects that still need work: - Non-call exceptions in __try bodies won't work yet. The plan is to outline the __try block in the frontend to keep things simple. - Filter expressions cannot use local variables until capturing is implemented. - __finally blocks will not run after exceptions. Fixing this requires work in the LLVM SEH preparation pass. The IR lowering looks like this: // C code: bool safe_div(int n, int d, int r) { __try { r = normal_div(n, d); } __except(_exception_code() == EXCEPTION_INT_DIVIDE_BY_ZERO) { return false; } return true; } ; LLVM IR: define i32 @filter(i8* %e, i8* %fp) { %ehptrs = bitcast i8* %e to i32 %ehrec = load i32 %ehptrs %code = load i32* %ehrec %matches = icmp eq i32 %code, i32 u0xC0000094 %matches.i32 = zext i1 %matches to i32 ret i32 %matches.i32 } define i1 zeroext @safe_div(i32 %n, i32 %d, i32* %r) { %rr = invoke i32 @normal_div(i32 %n, i32 %d) to label %normal unwind to label %lpad normal: store i32 %rr, i32* %r ret i1 1 lpad: %ehvals = landingpad {i8, i32} personality i32 (...) @__C_specific_handler catch i8* bitcast (i32 (i8, i8)* @filter to i8) %ehptr = extractvalue {i8, i32} %ehvals, i32 0 %sel = extractvalue {i8, i32} %ehvals, i32 1 %filter_sel = call i32 @llvm.eh.seh.typeid.for(i8 bitcast (i32 (i8, i8)* @filter to i8*)) %matches = icmp eq i32 %sel, %filter_sel br i1 %matches, label %eh.except, label %eh.resume eh.except: ret i1 false eh.resume: resume } Reviewers: rjmccall, rsmith, majnemer Differential Revision: http://reviews.llvm.org/D5607 llvm-svn: 226760	2015-01-22 01:36:17 +00:00
David Blaikie	835afb205f	DebugInfo: Remove forced column-info workaround for inlined calls This workaround was to provide unique call sites to ensure LLVM's inline debug info handling would properly unique two calls to the same function on the same line. Instead, this has now been fixed in LLVM (r226736) and the workaround here can be removed. Originally committed in r176895, but this isn't a straight revert due to all the changes since then. I just searched for anything ForcedColumn* related and removed them. We could test this - but it didn't strike me as terribly valuable once we're no longer adding this workaround everything just works as expected & it's no longer a special case to test for. llvm-svn: 226738	2015-01-21 23:08:17 +00:00
David Blaikie	a0a1a8726f	Add comment after API changes in r225090 Code review suggestion by Eric Christopher. llvm-svn: 226395	2015-01-18 02:48:07 +00:00
David Blaikie	66e4197f07	Reapply r225000 (reverted in r225555): DebugInfo: Generalize debug info location handling (and follow-up commits). Several pieces of code were relying on implicit debug location setting which usually lead to incorrect line information anyway. So I've fixed those (in r225955 and r225845) separately which should pave the way for this commit to be cleanly reapplied. The reason these implicit dependencies resulted in crashes with this patch is that the debug location would no longer implicitly leak from one place to another, but be set back to invalid. Once a call with no/invalid location was emitted, if that call was ever inlined it could produce invalid debugloc chains and assert during LLVM's codegen. There may be further cases of such bugs in this patch - they're hard to flush out with regression testing, so I'll keep an eye out for reports and investigate/fix them ASAP if they come up. Original commit message: Reapply "DebugInfo: Generalize debug info location handling" Originally committed in r224385 and reverted in r224441 due to concerns this change might've introduced a crash. Turns out this change fixes the crash introduced by one of my earlier more specific location handling changes (those specific fixes are reverted by this patch, in favor of the more general solution). Recommitted in r224941 and reverted in r224970 after it caused a crash when building compiler-rt. Looks to be due to this change zeroing out the debug location when emitting default arguments (which were meant to inherit their outer expression's location) thus creating call instructions without locations - these create problems for inlining and must not be created. That is fixed and tested in this version of the change. Original commit message: This is a more scalable (fixed in mostly one place, rather than many places that will need constant improvement/maintenance) solution to several commits I've made recently to increase source fidelity for subexpressions. This resetting had to be done at the DebugLoc level (not the SourceLocation level) to preserve scoping information (if the resetting was done with CGDebugInfo::EmitLocation, it would've caused the tail end of an expression's codegen to end up in a potentially different scope than the start, even though it was at the same source location). The drawback to this is that it might leave CGDebugInfo out of sync. Ideally CGDebugInfo shouldn't have a duplicate sense of the current SourceLocation, but for now it seems it does... - I don't think I'm going to tackle removing that just now. I expect this'll probably cause some more buildbot fallout & I'll investigate that as it comes up. Also these sort of improvements might be starting to show a weakness/bug in LLVM's line table handling: we don't correctly emit is_stmt for statements, we just put it on every line table entry. This means one statement split over multiple lines appears as multiple 'statements' and two statements on one line (without column info) are treated as one statement. I don't think we have any IR representation of statements that would help us distinguish these cases and identify the beginning of each statement - so that might be something we need to add (possibly to the lexical scope chain - a scope for each statement). This does cause some problems for GDB and possibly other DWARF consumers. llvm-svn: 225956	2015-01-14 07:38:27 +00:00
David Blaikie	f142580dea	Sink a parameter into the callee since it's always the same expression in terms of another parameter llvm-svn: 225856	2015-01-14 00:04:42 +00:00
David Blaikie	f353d3ecd0	Revert "DebugInfo: Generalize debug info location handling" and related commits This reverts commit r225000, r225021, r225083, r225086, r225090. The root change (r225000) still has several issues where it's caused calls to be emitted without debug locations. This causes assertion failures if/when those calls are inlined. I'll work up some test cases and fixes before recommitting this. llvm-svn: 225555	2015-01-09 23:00:28 +00:00
David Blaikie	b9a23c9155	DebugInfo: Provide a less subtle way to set the debug location of simple ret instructions un-XFAILing the test XFAIL'd in r225086 after it regressed in r225083. llvm-svn: 225090	2015-01-02 22:07:26 +00:00
David Blaikie	84fe79cfc3	Reapply "DebugInfo: Generalize debug info location handling" Originally committed in r224385 and reverted in r224441 due to concerns this change might've introduced a crash. Turns out this change fixes the crash introduced by one of my earlier more specific location handling changes (those specific fixes are reverted by this patch, in favor of the more general solution). Recommitted in r224941 and reverted in r224970 after it caused a crash when building compiler-rt. Looks to be due to this change zeroing out the debug location when emitting default arguments (which were meant to inherit their outer expression's location) thus creating call instructions without locations - these create problems for inlining and must not be created. That is fixed and tested in this version of the change. Original commit message: This is a more scalable (fixed in mostly one place, rather than many places that will need constant improvement/maintenance) solution to several commits I've made recently to increase source fidelity for subexpressions. This resetting had to be done at the DebugLoc level (not the SourceLocation level) to preserve scoping information (if the resetting was done with CGDebugInfo::EmitLocation, it would've caused the tail end of an expression's codegen to end up in a potentially different scope than the start, even though it was at the same source location). The drawback to this is that it might leave CGDebugInfo out of sync. Ideally CGDebugInfo shouldn't have a duplicate sense of the current SourceLocation, but for now it seems it does... - I don't think I'm going to tackle removing that just now. I expect this'll probably cause some more buildbot fallout & I'll investigate that as it comes up. Also these sort of improvements might be starting to show a weakness/bug in LLVM's line table handling: we don't correctly emit is_stmt for statements, we just put it on every line table entry. This means one statement split over multiple lines appears as multiple 'statements' and two statements on one line (without column info) are treated as one statement. I don't think we have any IR representation of statements that would help us distinguish these cases and identify the beginning of each statement - so that might be something we need to add (possibly to the lexical scope chain - a scope for each statement). This does cause some problems for GDB and possibly other DWARF consumers. llvm-svn: 225000	2014-12-30 19:39:33 +00:00
David Blaikie	608a24501c	Revert "DebugInfo: Generalize debug info location handling" Asserting when building compiler-rt when using a GCC host compiler. Reverting while I investigate. This reverts commit r224941. llvm-svn: 224970	2014-12-29 23:49:00 +00:00
David Blaikie	3945d1bd99	Reapply "DebugInfo: Generalize debug info location handling" Originally committed in r224385 and reverted in r224441 due to concerns this change might've introduced a crash. Turns out this change fixes the crash introduced by one of my earlier more specific location handling changes (those specific fixes are reverted by this patch, in favor of the more general solution). Original commit message: This is a more scalable (fixed in mostly one place, rather than many places that will need constant improvement/maintenance) solution to several commits I've made recently to increase source fidelity for subexpressions. This resetting had to be done at the DebugLoc level (not the SourceLocation level) to preserve scoping information (if the resetting was done with CGDebugInfo::EmitLocation, it would've caused the tail end of an expression's codegen to end up in a potentially different scope than the start, even though it was at the same source location). The drawback to this is that it might leave CGDebugInfo out of sync. Ideally CGDebugInfo shouldn't have a duplicate sense of the current SourceLocation, but for now it seems it does... - I don't think I'm going to tackle removing that just now. I expect this'll probably cause some more buildbot fallout & I'll investigate that as it comes up. Also these sort of improvements might be starting to show a weakness/bug in LLVM's line table handling: we don't correctly emit is_stmt for statements, we just put it on every line table entry. This means one statement split over multiple lines appears as multiple 'statements' and two statements on one line (without column info) are treated as one statement. I don't think we have any IR representation of statements that would help us distinguish these cases and identify the beginning of each statement - so that might be something we need to add (possibly to the lexical scope chain - a scope for each statement). This does cause some problems for GDB and possibly other DWARF consumers. llvm-svn: 224941	2014-12-29 18:18:45 +00:00
Alexey Bataev	7cb1789011	Fix for PR21915: assert on multidimensional VLA in function arguments. Fixed assertion on type checking for arguments and parameters on function call if arguments are pointers to VLA Differential Revision: http://reviews.llvm.org/D6655 llvm-svn: 224504	2014-12-18 06:54:53 +00:00
David Blaikie	06b2c54db9	Revert "DebugInfo: Generalize debug info location handling" Fails an ASan bootstrap - I'll try to reproduce locally & sort that out before recommitting. This reverts commit r224385. llvm-svn: 224441	2014-12-17 18:02:04 +00:00
David Blaikie	bf22a4eaee	DebugInfo: Generalize debug info location handling This is a more scalable (fixed in mostly one place, rather than many places that will need constant improvement/maintenance) solution to several commits I've made recently to increase source fidelity for subexpressions. This resetting had to be done at the DebugLoc level (not the SourceLocation level) to preserve scoping information (if the resetting was done with CGDebugInfo::EmitLocation, it would've caused the tail end of an expression's codegen to end up in a potentially different scope than the start, even though it was at the same source location). The drawback to this is that it might leave CGDebugInfo out of sync. Ideally CGDebugInfo shouldn't have a duplicate sense of the current SourceLocation, but for now it seems it does... - I don't think I'm going to tackle removing that just now. I expect this'll probably cause some more buildbot fallout & I'll investigate that as it comes up. Also these sort of improvements might be starting to show a weakness/bug in LLVM's line table handling: we don't correctly emit is_stmt for statements, we just put it on every line table entry. This means one statement split over multiple lines appears as multiple 'statements' and two statements on one line (without column info) are treated as one statement. I don't think we have any IR representation of statements that would help us distinguish these cases and identify the beginning of each statement - so that might be something we need to add (possibly to the lexical scope chain - a scope for each statement). This does cause some problems for GDB and possibly other DWARF consumers. llvm-svn: 224385	2014-12-16 22:49:17 +00:00
Alexey Bataev	f841bd9fcd	[OPENMP] Bugfix for processing of global variables in OpenMP regions. Currently, if global variable is marked as a private OpenMP variable, the compiler crashes in debug version or generates incorrect code in release version. It happens because in the OpenMP region the original global variable is used instead of the generated private copy. It happens because currently globals variables are not captured in the OpenMP region. This patch adds capturing of global variables iff private copy of the global variable must be used in the OpenMP region. Differential Revision: http://reviews.llvm.org/D6259 llvm-svn: 224323	2014-12-16 07:00:22 +00:00
Alexander Musman	c638868bdf	First patch with codegen of the 'omp for' directive. It implements the simplest case, which is used when no chunk_size is specified in the schedule(static) or no 'schedule' clause is specified - the iteration space is divided by the library into chunks that are approximately equal in size, and at most one chunk is distributed to each thread. In this case, we do not need an outer loop in each thread - each thread requests once which iterations range it should handle (using __kmpc_for_static_init runtime call) and then runs the inner loop on this range. Differential Revision: http://reviews.llvm.org/D5865 llvm-svn: 224233	2014-12-15 07:07:06 +00:00
Alexey Bataev	452d8e1133	Bugfix for Codegen of atomic load/store/other ops. Currently clang fires assertions on x86-64 on any atomic operations for long double operands. Patch fixes codegen for such operations. Differential Revision: http://reviews.llvm.org/D6499 llvm-svn: 224230	2014-12-15 05:25:25 +00:00
Peter Collingbourne	f770683f14	Implement the __builtin_call_with_static_chain GNU extension. The extension has the following syntax: __builtin_call_with_static_chain(Call, Chain) where Call must be a function call expression and Chain must be of pointer type This extension performs a function call Call with a static chain pointer Chain passed to the callee in a designated register. This is useful for calling foreign language functions whose ABI uses static chain pointers (e.g. to implement closures). Differential Revision: http://reviews.llvm.org/D6332 llvm-svn: 224167	2014-12-12 23:41:25 +00:00
David Blaikie	7f138811cd	DebugInfo: Correct the location of initializations of auto. llvm-svn: 223839	2014-12-09 22:04:13 +00:00
David Blaikie	538deffd2d	DebugInfo: Emit the correct location for initialization of a complex variable Especially useful for sanitizer reports. llvm-svn: 223825	2014-12-09 20:52:24 +00:00
David Blaikie	73ca56942d	DebugInfo: Correctly identify the location of C++ member initializer list elements This particularly helps the fidelity of ASan reports (which can occur even in these examples - if, for example, one uses placement new over a buffer of insufficient size - now ASan will correctly identify which member's initialization went over the end of the buffer). This doesn't cover all types of members - more coming. llvm-svn: 223726	2014-12-09 00:32:22 +00:00
Saleem Abdulrasool	a14ac3f437	CodeGen: refactor ARM builtin handling Create a helper function to construct a value for the ARM hint intrinsic rather than inling the construction. In order to avoid the use of the sentinel value, inline the use of intrinsic instruction retrieval. NFC. llvm-svn: 223338	2014-12-04 04:52:37 +00:00
Nico Weber	aad4af6d50	Fix incorrect codegen for devirtualized calls to virtual overloaded operators. Consider this program: struct A { virtual void operator-() { printf("base\n"); } }; struct B final : public A { virtual void operator-() override { printf("derived\n"); } }; int main() { B* b = new B; -static_cast<A&>(*b); } Before this patch, clang saw the virtual call to A::operator-(), figured out that it can be devirtualized, and then just called A::operator-() directly, without going through the vtable. Instead, it should've looked up which operator-() the call devirtualizes to and should've called that. For regular virtual member calls, clang gets all this right already. So instead of giving EmitCXXOperatorMemberCallee() all the logic that EmitCXXMemberCallExpr() already has, cut the latter function into two pieces, call the second piece EmitCXXMemberOrOperatorMemberCallExpr(), and use it also to generate code for calls to virtual member operators. This way, virtual overloaded operators automatically don't get devirtualized if they have covariant returns (like it was done for regular calls in r218602), etc. This also happens to fix (or at least improve) codegen for explicit constructor calls (`A a; a.A::A()`) in MS mode with -fsanitize-address-field-padding=1. (This adjustment for virtual operator calls seems still wrong with the MS ABI.) llvm-svn: 223185	2014-12-03 01:21:41 +00:00
Arnaud A. de Grandmaison	f3470cc979	Revert "Remove threshold for lifetime marker insertion of named temporaries" Revert r222993 while I investigate some MemorySanitizer failures. llvm-svn: 222995	2014-12-01 09:30:16 +00:00
Arnaud A. de Grandmaison	f2730e2d22	Remove threshold for lifetime marker insertion of named temporaries Now that TailRecursionElimination has been fixed with r222354, the threshold on size for lifetime marker insertion can be removed. This only affects named temporary though, as the patch for unnamed temporaries is still in progress. llvm-svn: 222993	2014-12-01 09:13:54 +00:00
Alexey Samsonov	e396bfc064	Bundle conditions checked by UBSan with sanitizer kinds they implement. Summary: This change makes CodeGenFunction::EmitCheck() take several conditions that needs to be checked (all of them need to be true), together with sanitizer kinds these checks are for. This would allow to split one call into UBSan runtime into several calls in case different sanitizer kinds would have different recoverability settings. Tests should be fixed accordingly, I'm working on it. Test Plan: regression test suite. Reviewers: rsmith Reviewed By: rsmith Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D6219 llvm-svn: 221716	2014-11-11 22:03:54 +00:00
Alexey Samsonov	a041610f11	[Sanitizer] Refactor sanitizer options in LangOptions. Get rid of ugly SanitizerOptions class thrust into LangOptions: * Make SanitizeAddressFieldPadding a regular language option, and rely on default behavior to initialize/reset it. * Make SanitizerBlacklistFile a regular member LangOptions. * Introduce the helper class "SanitizerSet" to represent the set of enabled sanitizers and make it a member of LangOptions. It is exactly the entity we want to cache and modify in CodeGenFunction, for instance. We'd also be able to reuse SanitizerSet in CodeGenOptions for storing the set of recoverable sanitizers, and in the Driver to represent the set of sanitizers turned on/off by the commandline flags. No functionality change. llvm-svn: 221653	2014-11-11 01:26:14 +00:00
Alexey Samsonov	4c1a96f519	Propagate SanitizerKind into CodeGenFunction::EmitCheck() call. Make sure CodeGenFunction::EmitCheck() knows which sanitizer it emits check for. Make CheckRecoverableKind enum an implementation detail and move it away from header. Currently CheckRecoverableKind is determined by the type of sanitizer ("unreachable" and "return" are unrecoverable, "vptr" is always-recoverable, all the rest are recoverable). This will change in future if we allow to specify which sanitizers are recoverable, and which are not by -fsanitize-recover= flag. No functionality change. llvm-svn: 221635	2014-11-10 22:27:30 +00:00
Reid Kleckner	c311aba247	Silence a warning from MSVC "14" by making an enum unsigned It says there is a narrowing conversion when we assign it to an unsigned 3 bit bitfield. Also, use unsigned instead of size_t for the Size field of the struct in question. Otherwise they won't run together in MSVC or clang-cl. llvm-svn: 221019	2014-10-31 23:33:56 +00:00
David Majnemer	0c0b6d9ac6	MS ABI: Properly call global delete when invoking virtual destructors Summary: The Itanium ABI approach of using offset-to-top isn't possible with the MS ABI, it doesn't have that kind of information lying around. Instead, we do the following: - Call the virtual deleting destructor with the "don't delete the object flag" set. The virtual deleting destructor will return a pointer to 'this' adjusted to the most derived class. - Call the global delete using the adjusted 'this' pointer. Reviewers: rnk Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D5996 llvm-svn: 220993	2014-10-31 20:09:12 +00:00
Alexey Samsonov	035462c1cf	Get rid of SanitizerOptions::Disabled global. NFC. SanitizerOptions is not even a POD now, so having global variable of this type, is not nice. Instead, provide a regular constructor and clear() method, and let each CodeGenFunction has its own copy of SanitizerOptions it uses. llvm-svn: 220920	2014-10-30 19:33:44 +00:00
Alexey Bataev	330de03083	Improved capturing variable-length array types in CapturedStmt. An updated implemnentation of VLA types capturing based on previously committed solution for Lambdas. This version captures the whole VLA type instead of particular variables which are part of VLA size expression and allows to use previusly calculated size of VLA type in captured regions. Required for OpenMP. Differential Revision: http://reviews.llvm.org/D5099 llvm-svn: 220850	2014-10-29 12:21:55 +00:00
Fariborz Jahanian	9ad94aa280	Objective-C. revert patch for rdar://17554063. llvm-svn: 220812	2014-10-28 18:28:16 +00:00
Aaron Ballman	560aa94ede	Fixing the MSVC build by removing friendship with CodeGenFunction; NFC. llvm-svn: 220293	2014-10-21 13:39:56 +00:00
Alexey Bataev	03b340a3a5	[OPENMP] Codegen for 'private' clause in 'parallel' directive. This patch generates some helper variables which used as a private copies of the corresponding original variables inside an OpenMP 'parallel' directive. These generated variables are initialized by default (with the default constructor, if any). In outlined function references to original variables are replaced by the references to these private helper variables. At the end of the initialization of the private variables and implicit barier is set by calling __kmpc_barrier(...) runtime function to be sure that all threads were initialized using original values of the variables. Differential Revision: http://reviews.llvm.org/D4752 llvm-svn: 220262	2014-10-21 03:16:40 +00:00
Kostya Serebryany	293dc9be6e	Insert poisoned paddings between fields in C++ classes so that AddressSanitizer can find intra-object-overflow bugs Summary: The general approach is to add extra paddings after every field in AST/RecordLayoutBuilder.cpp, then add code to CTORs/DTORs that poisons the paddings (CodeGen/CGClass.cpp). Everything is done under the flag -fsanitize-address-field-padding. The blacklist file (-fsanitize-blacklist) allows to avoid the transformation for given classes or source files. See also https://code.google.com/p/address-sanitizer/wiki/IntraObjectOverflow Test Plan: run SPEC2006 and some of the Chromium tests with -fsanitize-address-field-padding Reviewers: samsonov, rnk, rsmith Reviewed By: rsmith Subscribers: majnemer, cfe-commits Differential Revision: http://reviews.llvm.org/D5687 llvm-svn: 219961	2014-10-16 20:54:52 +00:00
Hal Finkel	6fae849597	Moving CGF::EmitAlignmentAssumption to IRBuilder The functionality contained in CodeGenFunction::EmitAlignmentAssumption has been moved to IRBuilder (so that it can also be used by LLVM-level code). Remove this now-duplicate implementation in favor of the IRBuilder code. llvm-svn: 219877	2014-10-15 23:45:08 +00:00
Alexey Samsonov	eb47d8a2c8	Sanitize upcasts and conversion to virtual base. This change adds UBSan check to upcasts. Namely, when we perform derived-to-base conversion, we: 1) check that the pointer-to-derived has suitable alignment and underlying storage, if this pointer is non-null. 2) if vptr-sanitizer is enabled, and we perform conversion to virtual base, we check that pointer-to-derived has a matching vptr. llvm-svn: 219642	2014-10-13 23:59:00 +00:00
Benjamin Kramer	c52193f4c7	Unfriend CGOpenMPRegionInfo so it can go into an anonymous namespace. Also remove some unnecessary virtual keywords. NFC. llvm-svn: 219497	2014-10-10 13:57:57 +00:00
Alexey Bataev	1809571c76	Code reformatting and improvement for OpenMP. Moved CGOpenMPRegionInfo from CGOpenMPRuntime.h to CGOpenMPRuntime.cpp file and reworked the code for this change. Also added processing of ThreadID variable passed as an argument in outlined functions in parallel and task directives. llvm-svn: 219490	2014-10-10 12:19:54 +00:00
Alexey Bataev	435ad7ba5e	Code improvements in OpenMP CodeGen. This patch makes class OMPPrivateScope a common class for all private variables. Reworked processing of firstprivate variables (now it is based on OMPPrivateScope too). llvm-svn: 219486	2014-10-10 09:48:26 +00:00
Nick Lewycky	5d1159ebe9	Revert r218865 because it introduced PR21236, a crash in codegen emitting the try block. llvm-svn: 219470	2014-10-10 04:05:00 +00:00
Reid Kleckner	79b0fd7a48	Promote null pointer constants used as arguments to variadic functions Make it possible to pass NULL through variadic functions on 64-bit Windows targets. The Visual C++ headers define NULL to 0, when they should define it to 0LL on Win64 so that NULL is a pointer-sized integer. Fixes PR20949. Reviewers: thakis, rsmith Differential Revision: http://reviews.llvm.org/D5480 llvm-svn: 219456	2014-10-10 00:05:45 +00:00
Alexey Bataev	13314bf526	[OPENMP] 'omp teams' directive basic support. Includes parsing and semantic analysis for 'omp teams' directive support from OpenMP 4.0. Adds additional analysis to 'omp target' directive with 'omp teams' directive. llvm-svn: 219385	2014-10-09 04:18:56 +00:00
Alexey Bataev	4a5bb772c3	[OPENMP] Codegen for 'firstprivate' clause. This patch generates some helper variables that used as private copies of the corresponding original variables inside an OpenMP 'parallel' directive. These generated variables are initialized by copy using values of the original variables (with the copy constructor, if any). For arrays, initializator is generated for single element and in the codegen procedure this initial value is automatically propagated between all elements of the private copy. In outlined function, references to original variables are replaced by the references to these private helper variables. At the end of the initialization of the private variables an implicit barier is generated by calling __kmpc_barrier(...) runtime function to be sure that all threads were initialized using original values of the variables. Differential Revision: http://reviews.llvm.org/D5140 llvm-svn: 219306	2014-10-08 14:01:46 +00:00
Alexey Bataev	8068b643c4	Revert commit r219297. Still troubles with OpenMP/parallel_firstprivate_codegen.cpp (now in ARM buildbots). llvm-svn: 219298	2014-10-08 12:00:22 +00:00
Alexey Bataev	3854f63aaf	[OPENMP] Codegen for 'firstprivate' clause. This patch generates some helper variables that used as private copies of the corresponding original variables inside an OpenMP 'parallel' directive. These generated variables are initialized by copy using values of the original variables (with the copy constructor, if any). For arrays, initializator is generated for single element and in the codegen procedure this initial value is automatically propagated between all elements of the private copy. In outlined function, references to original variables are replaced by the references to these private helper variables. At the end of the initialization of the private variables an implicit barier is generated by calling __kmpc_barrier(...) runtime function to be sure that all threads were initialized using original values of the variables. Differential Revision: http://reviews.llvm.org/D5140 llvm-svn: 219297	2014-10-08 11:35:04 +00:00
Alexey Bataev	bdef50e1ad	Revert back r219295. To fix issues with test OpenMP/parallel_firstprivate_codegen.cpp llvm-svn: 219296	2014-10-08 11:12:35 +00:00
Alexey Bataev	e7a5517a58	[OPENMP] Codegen for 'firstprivate' clause. This patch generates some helper variables that used as private copies of the corresponding original variables inside an OpenMP 'parallel' directive. These generated variables are initialized by copy using values of the original variables (with the copy constructor, if any). For arrays, initializator is generated for single element and in the codegen procedure this initial value is automatically propagated between all elements of the private copy. In outlined function, references to original variables are replaced by the references to these private helper variables. At the end of the initialization of the private variables an implicit barier is generated by calling __kmpc_barrier(...) runtime function to be sure that all threads were initialized using original values of the variables. Differential Revision: http://reviews.llvm.org/D5140 llvm-svn: 219295	2014-10-08 10:42:55 +00:00
Renato Golin	9804fa5d48	Revert "[OPENMP] 'omp teams' directive basic support. Includes parsing and semantic analysis for 'omp teams' directive support from OpenMP 4.0. Adds additional analysis to 'omp target' directive with 'omp teams' directive." This reverts commit r219197 because it broke ARM self-hosting buildbots with segmentation fault errors in many tests. llvm-svn: 219289	2014-10-08 09:06:45 +00:00
Reid Kleckner	453e056467	Fix IRGen for referencing a static local before emitting its decl Summary: Previously CodeGen assumed that static locals were emitted before they could be accessed, which is true for automatic storage duration locals. However, it is possible to have CodeGen emit a nested function that uses a static local before emitting the function that defines the static local, breaking that assumption. Fix it by creating the static local upon access and ensuring that the deferred function body gets emitted. We may not be able to emit the initializer properly from outside the function body, so don't try. Fixes PR18020. See also previous attempts to fix static locals in PR6769 and PR7101. Reviewers: rsmith Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D4787 llvm-svn: 219265	2014-10-08 01:07:54 +00:00
Alexey Bataev	941bbec6f4	[OPENMP] 'omp teams' directive basic support. Includes parsing and semantic analysis for 'omp teams' directive support from OpenMP 4.0. Adds additional analysis to 'omp target' directive with 'omp teams' directive. llvm-svn: 219197	2014-10-07 10:13:33 +00:00
Alexander Musman	d196ef2124	[OPENMP] Small refactoring of EmitOMPSimdLoop helper routine. No functional changes intended. Renamed EmitOMPSimdLoop to EmitOMPInnerLoop, I plan to re-use it to emit inner loop in the future patches for CodeGen of the worksharing loop directives (omp for, omp for simd). llvm-svn: 219195	2014-10-07 08:57:09 +00:00
David Majnemer	b3341ea453	MS ABI: Implement thread_local for global variables Summary: This add support for the C++11 feature, thread_local global variables. The ABI Clang implements is an improvement of the MSVC ABI. Sadly, further improvements could be made but not without sacrificing ABI compatibility. The feature is implemented as follows: - All thread_local initialization routines are pointed to from the .CRT$XDU section. - All non-weak thread_local variables have their initialization routines call from a single function instead of getting their own .CRT$XDU section entry. This is done to open up optimization opportunities to the compiler. - All weak thread_local variables have their own .CRT$XDU section entry. This entry is in a COMDAT with the global variable it is initializing; this ensures that we will initialize the global exactly once. - Destructors are registered in the initialization function using __tlregdtor. Differential Revision: http://reviews.llvm.org/D5597 llvm-svn: 219074	2014-10-05 05:05:40 +00:00
Arnaud A. de Grandmaison	42d314d1ba	Emit lifetime.start / lifetime.end markers for unnamed temporary objects. This will give more information to the optimizers so that they can reuse stack slots and reduce stack usage. llvm-svn: 218865	2014-10-02 12:19:51 +00:00
Alexander Musman	a5f070aec0	[OPENMP] Loop collapsing and codegen for 'omp simd' directive. This patch implements collapsing of the loops (in particular, in presense of clause 'collapse'). It calculates number of iterations N and expressions nesessary to calculate the nested loops counters values based on new iteration variable (that goes from 0 to N-1) in Sema. It also adds Codegen for 'omp simd', which uses (and tests) this feature. Differential Revision: http://reviews.llvm.org/D5184 llvm-svn: 218743	2014-10-01 06:03:56 +00:00
Alexander Musman	e4e893bb36	[OPENMP] Parsing/Sema of directive omp parallel for simd llvm-svn: 218299	2014-09-23 09:33:00 +00:00
Alexey Bataev	0bd520b767	[OPENMP] Initial parsing/sema analysis of 'target' directive. llvm-svn: 218110	2014-09-19 08:19:49 +00:00
David Majnemer	9928106536	MS ABI: Don't ICE for pointers to pointers to members of incomplete classes CodeGen would try to come up with an LLVM IR type for a pointer to member type on the way to forming an LLVM IR type for a pointer to pointer to member type. However, if the pointer to member representation has not been locked in yet, we would not be able to come up with a pointer to member IR type. In these cases, make the pointer to member type an incomplete type. This will make the pointer to pointer to member type a pointer to an incomplete type. If the class eventually obtains an inheritance model, we will make the pointer to member type represent the actual inheritance model. Differential Revision: http://reviews.llvm.org/D5373 llvm-svn: 218084	2014-09-18 22:05:54 +00:00
Alexander Musman	f82886e502	Parsing/Sema of directive omp for simd llvm-svn: 218029	2014-09-18 05:12:34 +00:00
Alexey Samsonov	8e1162c71d	Implement nonnull-attribute sanitizer Summary: This patch implements a new UBSan check, which verifies that function arguments declared to be nonnull with __attribute__((nonnull)) are actually nonnull in runtime. To implement this check, we pass FunctionDecl to CodeGenFunction::EmitCallArgs (where applicable) and if function declaration has nonnull attribute specified for a certain formal parameter, we compare the corresponding RValue to null as soon as it's calculated. Test Plan: regression test suite Reviewers: rsmith Reviewed By: rsmith Subscribers: cfe-commits, rnk Differential Revision: http://reviews.llvm.org/D5082 llvm-svn: 217389	2014-09-08 17:22:45 +00:00
Hal Finkel	bcc06085a8	Add __builtin_assume and __builtin_assume_aligned using @llvm.assume. This makes use of the recently-added @llvm.assume intrinsic to implement a __builtin_assume(bool) intrinsic (to provide additional information to the optimizer). This hooks up __assume in MS-compatibility mode to mirror __builtin_assume (the semantics have been intentionally kept compatible), and implements GCC's __builtin_assume_aligned as assume((p - o) & mask == 0). LLVM now contains special logic to deal with assumptions of this form. llvm-svn: 217349	2014-09-07 22:58:14 +00:00
Reid Kleckner	9b3e3dfc54	MS inline asm: Allow __asm blocks to set a return value If control falls off the end of a function after an __asm block, MSVC assumes that the inline assembly filled the EAX and possibly EDX registers with an appropriate return value. This functionality is used in inline functions returning 64-bit integers in system headers, so we need some amount of compatibility. This is implemented in Clang by adding extra output constraints to every inline asm block, and storing the resulting output registers into the return value slot. If we see an asm block somewhere in the function body, we emit a normal epilogue instead of marking the end of the function with a return type unreachable. Normal returns in functions not using this functionality will overwrite the return value slot, and in most cases LLVM should be able to eliminate the dead stores. Fixes PR17201. Reviewed By: majnemer Differential Revision: http://reviews.llvm.org/D5177 llvm-svn: 217187	2014-09-04 20:04:38 +00:00
Alexey Samsonov	cbe875a507	Kill one of EmitCallArgs overloads. NFC. llvm-svn: 216635	2014-08-28 00:22:11 +00:00
Craig Topper	3cb91b2ad1	Fix some cases were ArrayRefs were being passed by reference. llvm-svn: 216527	2014-08-27 06:28:16 +00:00
Alexey Samsonov	525bf650cc	Pass actual CXXConstructExpr instead of argument iterators into EmitSynthesizedCXXCopyCtorCall. No functionality change. llvm-svn: 216410	2014-08-25 21:58:56 +00:00
Alexey Samsonov	a5bf76bdf3	Pass actual CallExpr instead of CallExpr-specific iterators into EmitCXXMemberOrOperatorCall methods. In the end we want to make declaration visible in EmitCallArgs() method, that would allow us to alter CodeGen depending on function/parameter attributes. No functionality change. llvm-svn: 216404	2014-08-25 20:17:35 +00:00
David Blaikie	93be0b24b8	DebugInfo: Scope for condition variables more narrowly than the loop variable. for loops introduce two scopes - one for the outer loop variable and its initialization, and another for the body of the loop, including any variable declared inside the loop condition. llvm-svn: 216288	2014-08-22 21:37:04 +00:00
Alexey Samsonov	91cf455af1	CGCall: Factor out the logic mapping call arguments to LLVM IR arguments. Summary: This refactoring introduces ClangToLLVMArgMapping class, which encapsulates the information about the order in which function arguments listed in CGFunctionInfo should be passed to actual LLVM IR function, such as: 1) positions of sret, if there is any 2) position of inalloca argument, if there is any 3) position of helper padding argument for each call argument 4) positions of regular argument (there can be many if it's expanded). Simplify several related methods (ConstructAttributeList, EmitFunctionProlog and EmitCall): now they don't have to maintain iterators over the list of LLVM IR function arguments, dealing with all the sret/inalloca/this complexities, and just use expected positions of LLVM IR arguments stored in ClangToLLVMArgMapping. This may increase the running time of EmitFunctionProlog, as we have to traverse expandable arguments twice, but in further refactoring we will be able to speed up EmitCall by passing already calculated CallArgsToIRArgsMapping to ConstructAttributeList, thus avoiding traversing expandable argument there. No functionality change. Test Plan: regression test suite Reviewers: majnemer, rnk Reviewed By: rnk Subscribers: cfe-commits, rjmccall, timurrrr Differential Revision: http://reviews.llvm.org/D4938 llvm-svn: 216251	2014-08-22 01:06:06 +00:00
Alexey Samsonov	70b9c01bd4	Pass expressions instead of argument ranges to EmitCall/EmitCXXConstructorCall. Summary: This is a first small step towards passing generic "Expr" instead of ArgBeg/ArgEnd pair into EmitCallArgs() family of methods. Having "Expr" will allow us to get the corresponding FunctionDecl and its ParmVarDecls, thus allowing us to alter CodeGen depending on the function/parameter attributes. No functionality change. Test Plan: regression test suite Reviewers: rnk Reviewed By: rnk Subscribers: aemerson, cfe-commits Differential Revision: http://reviews.llvm.org/D4915 llvm-svn: 216214	2014-08-21 20:26:47 +00:00
Fariborz Jahanian	91b2fa2a9a	ext_vector IRGen. Patch to allow indexing into ext_vector_type's 'hi/lo' components when used as lvalue. rdar://18031917 pr20697 llvm-svn: 215991	2014-08-19 17:17:40 +00:00
Benjamin Kramer	2f5db8b3db	Header guard canonicalization, clang part. Modifications made by clang-tidy with minor tweaks. llvm-svn: 215557	2014-08-13 16:25:19 +00:00
Fariborz Jahanian	413297c53d	Objective-C ARC. First patch toward generating new APIs for Objective-C's array and dictionary literals. rdar://17554063. This is wip. llvm-svn: 214983	2014-08-06 18:13:46 +00:00
Fariborz Jahanian	bcd82afad6	Introduce f[no-]max-unknown-pointer-align=[number] option to instruct the code generator to not enforce a higher alignment than the given number (of bytes) when accessing memory via an opaque pointer or reference. Patch reviewed by John McCall (with post-commit review pending). rdar://16254558 llvm-svn: 214911	2014-08-05 18:37:48 +00:00
Reid Kleckner	fe5b4ed822	Remove separator parameter from static local naming code It was always set to ".", which was duplicated in a few places. llvm-svn: 214792	2014-08-04 22:35:30 +00:00
Reid Kleckner	ab2090d107	MS ABI: Use musttail for vtable thunks that pass arguments by value This moves some memptr specific code into the generic thunk emission codepath. Fixes PR20053. Reviewers: majnemer Differential Revision: http://reviews.llvm.org/D4613 llvm-svn: 214004	2014-07-26 01:34:32 +00:00
Reid Kleckner	3f76ac7daa	Remove an extra parameter and C++11 for loop-ify this code llvm-svn: 214003	2014-07-26 01:30:05 +00:00
Reid Kleckner	19819446eb	MS ABI: Don't push destructor cleanups for aggregate parameters in thunks The target method of the thunk will perform the cleanup. This can't be tested in 32-bit x86 yet because passing something by value would create an inalloca, and we refuse to generate broken code for that. llvm-svn: 213976	2014-07-25 21:39:46 +00:00
Alexey Bataev	0162e459ef	[OPENMP] Initial parsing and sema analysis for 'atomic' directive. llvm-svn: 213639	2014-07-22 10:10:35 +00:00
Alexey Bataev	9fb6e647e7	[OPENMP] Initial parsing and sema analysis for 'ordered' directive. llvm-svn: 213616	2014-07-22 06:45:04 +00:00
Arnaud A. de Grandmaison	6e24a46572	Revert "Emit lifetime.start / lifetime.end markers for unnamed temporary objects." This commit did break the sanitizer-x86 bot. Revert it while investigating. llvm-svn: 213579	2014-07-21 19:47:02 +00:00
Arnaud A. de Grandmaison	17a83cf4b6	Emit lifetime.start / lifetime.end markers for unnamed temporary objects. This will give more information to the optimizers so that they can reuse stack slots. llvm-svn: 213576	2014-07-21 18:54:21 +00:00
Alexey Bataev	6125da9258	[OPENMP] Initial parsing and sema analysis for 'flush' directive. llvm-svn: 213512	2014-07-21 11:26:11 +00:00
Alexander Musman	d9ed09f7a5	[OPENMP] Parsing/Sema of the OpenMP directive 'critical'. llvm-svn: 213510	2014-07-21 09:42:05 +00:00
Arnaud A. de Grandmaison	18bc4fff48	Revert "Emit lifetime.start / lifetime.end markers for unnamed temporary objects." This reverts commit dbf785a6432f78a8ec229665876647c4cc610d3d, while I qm investigating a buildbot failure. llvm-svn: 213380	2014-07-18 14:23:58 +00:00
Arnaud A. de Grandmaison	1be89f4977	Emit lifetime.start / lifetime.end markers for unnamed temporary objects. This will give more information to the optimizers so that they can reuse stack slots. llvm-svn: 213379	2014-07-18 13:36:33 +00:00
Alexey Bataev	2df347ad96	[OPENMP] Initial parsing and sema analysis for 'taskwait' directive. llvm-svn: 213363	2014-07-18 10:17:07 +00:00
Alexey Bataev	4d1dfeabc9	[OPENMP] Initial parsing and sema analysis for 'barrier' directive. llvm-svn: 213360	2014-07-18 09:11:51 +00:00
Alexey Bataev	68446b7253	[OPENMP] Initial parsing and sema analysis of 'taskyield' directive. llvm-svn: 213355	2014-07-18 07:47:19 +00:00
Alexey Samsonov	24cad99307	[UBSan] Add !nosanitize metadata to the code generated by UBSan. This is used to mark the instructions emitted by Clang to implement variety of UBSan checks. Generally, we don't want to instrument these instructions with another sanitizers (like ASan). Reviewed in http://reviews.llvm.org/D4544 llvm-svn: 213291	2014-07-17 18:46:27 +00:00
Alexander Musman	80c2289a03	[OPENMP] Parsing/Sema analysis of directive 'master' llvm-svn: 213237	2014-07-17 08:54:58 +00:00
Alexey Bataev	9c2e8ee72f	[OPENMP] Parsing and sema analysis for 'omp task' directive. llvm-svn: 212804	2014-07-11 11:25:16 +00:00
David Blaikie	1b5adb82d9	Fix the dtor location issues in PR20038 harder. Originally committed in r211722, this fixed one case of dtor calls being emitted without locations (this causes problems for debug info if the call is then inlined), this caught only some of the cases. Instead of trying to re-enable the location before the cleanup, simply re-enable the location immediately after the unconditional branches in question using a scoped device to ensure the no-location state doesn't leak out arbitrarily. llvm-svn: 212761	2014-07-10 20:42:59 +00:00
Alexey Bataev	84d0b3efee	[OPENMP] Parsing and sema analysis for 'omp parallel sections' directive. llvm-svn: 212516	2014-07-08 08:12:03 +00:00
Alexey Samsonov	ac4afe49e7	[Sanitizer] Remove brittle cache variable and slightly simplify blacklisting code. Now CodeGenFunction is responsible for looking at sanitizer blacklist (in CodeGenFunction::StartFunction) and turning off instrumentation, if necessary. No functionality change. llvm-svn: 212501	2014-07-07 23:59:57 +00:00
Alexey Bataev	4acb859fbd	[OPENMP] Added initial support for 'omp parallel for'. llvm-svn: 212453	2014-07-07 13:01:15 +00:00
Nico Weber	9b982078e9	Add an AST node for __leave statements, hook it up. Codegen is still missing (and I won't work on that), but __leave is now as implemented as __try and friends. llvm-svn: 212425	2014-07-07 00:12:30 +00:00
Logan Chien	e9c8ccbf8f	Remove CleanupHackLevel from CGException. This patch removes the dead code, and refines the getEHResumeBlock() slightly. The CleanupHackLevel was a hack to the old exception handling intrinsics, which have several issues with function inliner. Since LLVM 3.0, the new landingpad and resume instructions are added to LLVM IR. With the new exception handling mechanism, most of the issues are fixed now. We should always use these instructions to implement the exception handling code nowadays, and we don't need the hack any more. Besides, the `CleanupHackLevel` is a compile-time constant, thus other cases have been considered as dead code for a while. llvm-svn: 212097	2014-07-01 11:47:10 +00:00
Alexey Bataev	aca7fcf276	Using of variable length arrays in captured statements and OpenMP constructs. Differential Revision: http://reviews.llvm.org/D4067 llvm-svn: 212010	2014-06-30 02:55:54 +00:00
Craig Topper	00bbdcf9b3	Remove llvm:: from uses of ArrayRef. llvm-svn: 211987	2014-06-28 23:22:23 +00:00
Alexey Bataev	d1e40fbfe1	[OPENMP] Initial parsing and sema analysis for 'single' directive. llvm-svn: 211774	2014-06-26 12:05:45 +00:00
Alexey Bataev	1e0498a92d	[OPENMP] Initial parsing and sema analysis for 'section' directive. llvm-svn: 211767	2014-06-26 08:21:58 +00:00
Alexey Bataev	d3f8dd2d15	[OPENMP] Initial support for 'sections' directive. llvm-svn: 211685	2014-06-25 11:44:49 +00:00
Matt Arsenault	56f008d538	Add R600 builtin codegen. llvm-svn: 211631	2014-06-24 20:45:01 +00:00
Tim Northover	6ea28bdef5	ARM: remove dead CodeGen functions. These two are no longer being used by NEON codegen. llvm-svn: 211586	2014-06-24 12:07:44 +00:00
Alexey Bataev	f29276edb7	[OPENMP] Initial support for '#pragma omp for' (fixed incompatibility with MSVC). llvm-svn: 211140	2014-06-18 04:14:57 +00:00
Rafael Espindola	a566efbec9	Revert "[OPENMP] Initial support for '#pragma omp for'." This reverts commit r211096. Looks like it broke the msvc build: SemaOpenMP.cpp(140) : error C4519: default template arguments are only allowed on a class template llvm-svn: 211113	2014-06-17 17:20:53 +00:00
Alexey Bataev	c77dd5257a	[OPENMP] Initial support for '#pragma omp for'. llvm-svn: 211096	2014-06-17 11:49:22 +00:00
Aaron Ballman	b06b15aa28	Adding a new #pragma for the vectorize and interleave optimization hints. Patch thanks to Tyler Nowicki! llvm-svn: 210330	2014-06-06 12:40:24 +00:00
Richard Smith	760520bcb7	Add __builtin_operator_new and __builtin_operator_delete, which act like calls to the normal non-placement ::operator new and ::operator delete, but allow optimizations like new-expressions and delete-expressions do. llvm-svn: 210137	2014-06-03 23:27:44 +00:00
Richard Smith	06a67e2c6f	When emitting a multidimensional array new, emit the initializers for the trailing elements as a single loop, rather than sometimes emitting a nest of several loops. This fixes a bug where CodeGen would sometimes try to emit an expression with the wrong type for the element being initialized. Plus various other minor cleanups to the IR produced for array new initialization. llvm-svn: 210079	2014-06-03 06:58:52 +00:00
Tim Northover	573cbee543	AArch64/ARM64: rename ARM64 components to AArch64 This keeps Clang consistent with backend naming conventions. llvm-svn: 209579	2014-05-24 12:52:07 +00:00
Tim Northover	25e8a6754e	AArch64/ARM64: update Clang after AArch64 removal. A few (mostly CodeGen) parts of Clang were tightly coupled to the AArch64 backend. Now that it's gone, they will not even compile. I've also deduplicated RUN lines in many of the AArch64 tests. This might improve "make check-all" time noticably: some of those NEON tests were monsters. llvm-svn: 209578	2014-05-24 12:51:25 +00:00
Alexander Musman	515ad8c490	This patch adds a helper class (CGLoopInfo) for marking memory instructions with llvm.mem.parallel_loop_access metadata. It also adds a simple initial version of codegen for pragma omp simd (it will change in the future to support all the clauses). Differential revision: http://reviews.llvm.org/D3644 llvm-svn: 209411	2014-05-22 08:54:05 +00:00
Craig Topper	8a13c4180e	[C++11] Use 'nullptr'. CodeGen edition. llvm-svn: 209272	2014-05-21 05:09:00 +00:00
Renato Golin	230c5eb4bd	Non-allocatable Global Named Register This patch implements global named registers in Clang, lowering to the just created intrinsics in LLVM (@llvm.read/write_register). A new type of LValue had to be created (Register), which just adds support to carry the metadata node containing the name of the register. Two new methods to emit loads and stores interoperate with another to emit the named metadata node. No guarantees are being made and only non-allocatable global variable named registers are being supported. Local named register support is unchanged. llvm-svn: 209149	2014-05-19 18:15:42 +00:00
Rafael Espindola	42ae74531c	Don't indent in namespaces. llvm-svn: 208384	2014-05-09 00:57:59 +00:00
Alexey Bataev	9959db5fa9	[OPENMP] Initial codegen for '#pragma omp parallel' llvm-svn: 208077	2014-05-06 10:08:46 +00:00
Justin Bogner	81ab90f7ed	CodeGen: Handle CapturedStmt in instrumentation based profiling CapturedStmt was being ignored by instrumentation based profiling, and its counters attributed to the containing function. Instead, we need to treat this as a top level entity, like we do with blocks. llvm-svn: 206231	2014-04-15 00:50:54 +00:00
Adrian Prantl	22e66b434a	Cleanup: Add default arguments to CodeGenFunction::StartFunction. Thanks dblaikie for the suggestion! llvm-svn: 206012	2014-04-11 01:13:04 +00:00
Adrian Prantl	42d71b9906	Debug info: (Bugfix) Make sure artificial functions like _GLOBAL__I_a are not associated with any source lines. Previously, if the Location of a Decl was empty, EmitFunctionStart would just keep using CurLoc, which would sometimes be correct (e.g., thunks) but in other cases would just point to a hilariously random location. This patch fixes this by completely eliminating all uses of CurLoc from EmitFunctionStart and rather have clients explicitly pass in a SourceLocation for the function header and the function body. rdar://problem/14985269 llvm-svn: 205999	2014-04-10 23:21:53 +00:00
Tim Northover	a2ee433c8d	ARM64: initial clang support commit. This adds Clang support for the ARM64 backend. There are definitely still some rough edges, so please bring up any issues you see with this patch. As with the LLVM commit though, we think it'll be more useful for merging with AArch64 from within the tree. llvm-svn: 205100	2014-03-29 15:09:45 +00:00
Eli Bendersky	cb39943f6f	Proper handling of static local variables with address space qualifiers. Similar to the implementation for globals in r157167. Patch by Jingyue Wu. llvm-svn: 204677	2014-03-24 22:05:38 +00:00
Chandler Carruth	61743af166	[Modules] Update to reflect ValueHandle moving to the IR library in LLVM r202821. llvm-svn: 202822	2014-03-04 11:18:19 +00:00
Tim Northover	8fe03d6111	ARM & AArch64: use table for EmitCommonNeonBuiltinExpr This extends the intrinsic lookup table format slightly, and adds entries for use the shared ARM/AArch64 definitions. The benefit is currently smaller than for the SISD intrinsics (there's more custom code implementing this set), but a few lines are saved and there's scope for future expansion. llvm-svn: 201848	2014-02-21 11:57:24 +00:00
Tim Northover	2d83796860	AArch64: refactor table-driven NEON lookup. This extracts the table-driven intrinsic lookup phase into a separate function, to be used by EmitCommonNeonBuiltinExpr soon. It also simplifies the logic used in that lookup, since VectorCastArgN and ScalarArgN were actually identical. llvm-svn: 201847	2014-02-21 11:57:20 +00:00
Bob Wilson	bf854f0f53	Change PGO instrumentation to compute counts in a separate AST traversal. Previously, we made one traversal of the AST prior to codegen to assign counters to the ASTs and then propagated the count values during codegen. This patch now adds a separate AST traversal prior to codegen for the -fprofile-instr-use option to propagate the count values. The counts are then saved in a map from which they can be retrieved during codegen. This new approach has several advantages: 1. It gets rid of a lot of extra PGO-related code that had previously been added to codegen. 2. It fixes a serious bug. My original implementation (which was mailed to the list but never committed) used 3 counters for every loop. Justin improved it to move 2 of those counters into the less-frequently executed breaks and continues, but that turned out to produce wrong count values in some cases. The solution requires visiting a loop body before the condition so that the count for the condition properly includes the break and continue counts. Changing codegen to visit a loop body first would be a fairly invasive change, but with a separate AST traversal, it is easy to control the order of traversal. I've added a testcase (provided by Justin) to make sure this works correctly. 3. It improves the instrumentation overhead, reducing the number of counters for a loop from 3 to 1. We no longer need dedicated counters for breaks and continues, since we can just use the propagated count values when visiting breaks and continues. To make this work, I needed to make a change to the way we count case statements, going back to my original approach of not including the fall-through in the counter values. This was necessary because there isn't always an AST node that can be used to record the fall-through count. Now case statements are handled the same as default statements, with the fall-through paths branching over the counter increments. While I was at it, I also went back to using this approach for do-loops -- omitting the fall-through count into the loop body simplifies some of the calculations and make them behave the same as other loops. Whenever we start using this instrumentation for coverage, we'll need to add the fall-through counts into the counter values. llvm-svn: 201528	2014-02-17 19:21:09 +00:00
Fariborz Jahanian	7741101dce	[IRGen]. Fixes a crash in using Objective-C array properties by fixing shouldBindAsLValue to accept arrays (like record types) because we always manipulate them in memory. Patch suggested by John MaCall. // rdar://15610943 llvm-svn: 201428	2014-02-14 19:37:25 +00:00
Reid Kleckner	314ef7bafd	[ms-cxxabi] Use inalloca on win32 when passing non-trivial C++ objects When a non-trivial parameter is present, clang now gathers up all the parameters that lack inreg and puts them into a packed struct. MSVC always aligns each parameter to 4 bytes and no more, so this is a pretty simple struct to lay out. On win64, non-trivial records are passed indirectly. Prior to this change, clang was incorrectly using byval on win64. I'm able to self-host a working clang with this change and additional LLVM patches. Reviewers: rsmith Differential Revision: http://llvm-reviews.chandlerc.com/D2636 llvm-svn: 200597	2014-02-01 00:04:45 +00:00
Tim Northover	027b4ee607	ARM & AArch64: move shared vld/vst intrinsics to common implementation. llvm-svn: 200526	2014-01-31 10:46:45 +00:00
Tim Northover	58c4474dea	ARM & AArch64: extend shared NEON implementation to first block. This extends the refactoring to the whole of the first block of trivial correspondences (as a fairly arbitrary boundary). llvm-svn: 200472	2014-01-30 14:48:01 +00:00
Tim Northover	ac85c341ae	ARM & AArch64: fully share NEON implementation of permutation intrinsics As a starting point, this moves the CodeGen for NEON permutation instructions (vtrn, vzip, vuzp) into a new shared function. llvm-svn: 200471	2014-01-30 14:47:57 +00:00
Justin Bogner	e25ffdf8a1	Revert "CodeGen: Simplify CodeGenFunction::EmitCaseStmt" I misunderstood the discussion on this. The complexity here is justified by the malloc overhead it saves. This reverts commit r199302. llvm-svn: 199700	2014-01-21 00:35:11 +00:00
Alp Toker	9cacbabd33	Rename FunctionProtoType accessors from 'arguments' to 'parameters' Fix a perennial source of confusion in the clang type system: Declarations and function prototypes have parameters to which arguments are supplied, so calling these 'arguments' was a stretch even in C mode, let alone C++ where default arguments, templates and overloading make the distinction important to get right. Readability win across the board, especially in the casting, ADL and overloading implementations which make a lot more sense at a glance now. Will keep an eye on the builders and update dependent projects shortly. No functional change. llvm-svn: 199686	2014-01-20 20:26:09 +00:00
Justin Bogner	4c5c99f91a	CodeGen: Simplify CodeGenFunction::EmitCaseStmt Way back in r129652 we tried to avoid emitting an empty block at -O0 for switch cases that did nothing but break. This led to a poor debugging experience as reported in PR9796, so we disabled the optimization for -O0 but left it in for higher optimization levels in r154420. Since the whole point of this was to improve -O0, it's silly to keep the complexity at all. llvm-svn: 199302	2014-01-15 07:30:30 +00:00
Adrian Prantl	e83b130def	Revert "Debug info: Ensure that the last stop point in a function is still within" This reverts commit r198461. llvm-svn: 198714	2014-01-07 22:05:52 +00:00
Adrian Prantl	c6758879b3	Revert "Debug info: Implement a cleaner version of r198461. For symmetry with" This reverts commit 198699 so we can get a cleaner patch. llvm-svn: 198713	2014-01-07 22:05:45 +00:00
Adrian Prantl	f5ff0dc29b	Debug info: Implement a cleaner version of r198461. For symmetry with C and C++ don't emit an extra lexical scope for the compound statement that is the body of an Objective-C method. rdar://problem/15010825 llvm-svn: 198699	2014-01-07 19:24:24 +00:00
Chandler Carruth	5553d0d4ca	Sort all the #include lines with LLVM's utils/sort_includes.py which encodes the canonical rules for LLVM's style. I noticed this had drifted quite a bit when cleaning up LLVM, so wanted to clean up Clang as well. llvm-svn: 198686	2014-01-07 11:51:46 +00:00
Justin Bogner	ef512b9929	CodeGen: Initial instrumentation based PGO implementation llvm-svn: 198640	2014-01-06 22:27:43 +00:00
Adrian Prantl	96e70d9148	Debug info: Ensure that the last stop point in a function is still within the lexical block formed by the compound statement that is the function body. rdar://problem/15010825 llvm-svn: 198461	2014-01-03 23:34:30 +00:00
Reid Kleckner	89077a1b00	[ms-cxxabi] The 'most derived' ctor parameter usually comes last Unlike Itanium's VTTs, the 'most derived' boolean or bitfield is the last parameter for non-variadic constructors, rather than the second. For variadic constructors, the 'most derived' parameter comes after the 'this' parameter. This affects constructor calls and constructor decls in a variety of places. Reviewers: timurrrr Differential Revision: http://llvm-reviews.chandlerc.com/D2405 llvm-svn: 197518	2013-12-17 19:46:40 +00:00
Reid Kleckner	739756c0f9	[ms-cxxabi] Construct and destroy call arguments in the correct order Summary: MSVC destroys arguments in the callee from left to right. Because C++ objects have to be destroyed in the reverse order of construction, Clang has to construct arguments from right to left and destroy arguments from left to right. This patch fixes the ordering by reversing the order of evaluation of all call arguments under the MS C++ ABI. Fixes PR18035. Reviewers: rsmith Differential Revision: http://llvm-reviews.chandlerc.com/D2275 llvm-svn: 196402	2013-12-04 19:23:12 +00:00
Hans Wennborg	88497d6157	[-cxx-abi microsoft] Emit thunks for pointers to virtual member functions Instead of storing the vtable offset directly in the function pointer and doing a branch to check for virtualness at each call site, the MS ABI generates a thunk for calling the function at a specific vtable offset, and puts that in the function pointer. This patch adds support for emitting such thunks. However, it doesn't support pointers to virtual member functions that are variadic, have an incomplete aggregate return type or parameter, or are overriding a function in a virtual base class. Differential Revision: http://llvm-reviews.chandlerc.com/D2104 llvm-svn: 194827	2013-11-15 17:24:45 +00:00
Kevin Qin	1718af6f0a	Implement aarch64 neon instruction class misc. llvm-svn: 194657	2013-11-14 02:45:18 +00:00
Richard Smith	b47c36f8e1	C++1y sized deallocation: if we have a use, but not a definition, of a sized deallocation function (and the corresponding unsized deallocation function has been declared), emit a weak discardable definition of the function that forwards to the corresponding unsized deallocation. This allows a C++ standard library implementation to provide both a sized and an unsized deallocation function, where the unsized one does not just call the sized one, for instance by putting both in the same object file within an archive. llvm-svn: 194055	2013-11-05 09:12:18 +00:00
Peter Collingbourne	b453cd64a7	Implement function type checker for the undefined behavior sanitizer. This uses function prefix data to store function type information at the function pointer. Differential Revision: http://llvm-reviews.chandlerc.com/D1338 llvm-svn: 193058	2013-10-20 21:29:19 +00:00
Amaury de la Vieuville	21bf6ed730	Do not emit undefined lsrh/ashr for NEON shifts These IR instructions are undefined when the amount is equal to operand size, but NEON right shifts support such shifts. Work around that by emitting a different IR in these cases. llvm-svn: 191953	2013-10-04 13:13:15 +00:00
Nick Lewycky	2d84e84236	Thread a SourceLocation into the EmitCheck for "load_invalid_value". This occurs when scalars are loaded / undergo lvalue-to-rvalue conversion. llvm-svn: 191808	2013-10-02 02:29:49 +00:00
Faisal Vali	571df12581	Implement conversion to function pointer for generic lambdas without captures. The general strategy is to create template versions of the conversion function and static invoker and then during template argument deduction of the conversion function, create the corresponding call-operator and static invoker specializations, and when the conversion function is marked referenced generate the body of the conversion function using the corresponding static-invoker specialization. Similarly, Codegen does something similar - when asked to emit the IR for a specialized static invoker of a generic lambda, it forwards emission to the corresponding call operator. This patch has been reviewed in person both by Doug and Richard. Richard gave me the LGTM. A few minor changes: - per Richard's request i added a simple check to gracefully inform that captures (init, explicit or default) have not been added to generic lambdas just yet (instead of the assertion violation). - I removed a few lines of code that added the call operators instantiated parameters to the currentinstantiationscope. Not only did it not handle parameter packs, but it is more relevant in the patch for nested lambdas which will follow this one, and fix that problem more comprehensively. - Doug had commented that the original implementation strategy of using the TypeSourceInfo of the call operator to create the static-invoker was flawed and allowed const as a member qualifier to creep into the type of the static-invoker. I currently kludge around it - but after my initial discussion with Doug, with a follow up session with Richard, I have added a FIXME so that a more elegant solution that involves the use of TrivialTypeSourceInfo call followed by the correct wiring of the template parameters to the functionprototypeloc is forthcoming. Thanks! llvm-svn: 191634	2013-09-29 08:45:24 +00:00
Reid Kleckner	543a16c06b	Emit an error when attempting to generate IR for SEH __try Currently we silently omit the code in the try and finally bodies, which is pretty bad. This way we fail loudly. llvm-svn: 190809	2013-09-16 21:46:30 +00:00
Yunzhong Gao	0ebf1bb150	Revert r189649 because it was breaking sanitizer bots. llvm-svn: 189660	2013-08-30 08:53:09 +00:00
Yunzhong Gao	be8d7ba93a	Fixing a bug where debug info for a local variable gets emitted at file scope. The patch was discussed in Phabricator. See: http://llvm-reviews.chandlerc.com/D1281 llvm-svn: 189649	2013-08-30 05:37:02 +00:00
David Blaikie	ebe87e1cfa	Revert "PR14569: Omit debug info for thunks" This reverts commit r189320. Alexey Samsonov and Dmitry Vyukov presented some arguments for keeping these around - though it still seems like those tasks could be solved by a tool just using the symbol table. In a very small number of cases, thunks may be inlined & debug info might be able to save profilers & similar tools from misclassifying those cases as part of the caller. The extra changes here plumb through the VarDecl for various cases to CodeGenFunction - this provides better fidelity through a few APIs but generally just causes the CGF::StartFunction to fallback to using the name of the IR function as the name in the debug info. The changes to debug-info-global-ctor-dtor.cpp seem like goodness. The two names that go missing (in favor of only emitting those names as linkage names) are names that can be demangled - emitting them only as the linkage name should encourage tools to do just that. Again, thanks to Dinesh Dwivedi for investigation/work on this issue. llvm-svn: 189421	2013-08-27 23:57:18 +00:00
David Blaikie	92848dee31	Simplify/clean up debug info suppression in CodeGenFunction CodeGenFunction is run on only one function - a new object is made for each new function. I would add an assertion/flag to this effect, but there's an exception: ObjC properties involve emitting helper functions that are all emitted by the same CodeGenFunction object, so such a check is not possible/correct. llvm-svn: 189277	2013-08-26 20:33:21 +00:00
Benjamin Kramer	7463ed7c89	CodeGen: Unify two implementations of canDevirtualizeMemberFunctionCall. They were mostly copy&paste of each other, move it to CodeGenFunction. Of course the two implementations have diverged over time; the one in CGExprCXX seems to be the more modern one so I picked that one and moved it to CGClass which feels like a better home for it. No intended functionality change. llvm-svn: 189203	2013-08-25 22:46:27 +00:00
Timur Iskhodzhanov	d8fa10db12	[CGF] Get rid of passing redundant VTable pointer around in CodeGenFunction::InitializeVTablePointer[s] llvm-svn: 188909	2013-08-21 17:33:16 +00:00
Timur Iskhodzhanov	88fd439a24	Abstract out virtual calls and virtual function prologue code generation; implement them for -cxx-abi microsoft llvm-svn: 188870	2013-08-21 06:25:03 +00:00
David Blaikie	4a9ec7b59d	PR16933: Don't try to codegen things after we've seen errors. Refactor the underlying code a bit to remove unnecessary calls to "hasErrorOccurred" & make them consistently at all the entry points to the IRGen ASTConsumer. llvm-svn: 188707	2013-08-19 21:02:26 +00:00
Adrian Prantl	ca64c3e136	Debug Info / EmitCallArgs: arguments may modify the debug location. Restore it after each argument is emitted. This fixes the scope info for inlined subroutines inside of function argument expressions. (E.g., anything STL). rdar://problem/12592135 llvm-svn: 187240	2013-07-26 20:42:57 +00:00
Timur Iskhodzhanov	03e8746f90	Simplify the CodeGenFunction::BuildVirtualCall family of functions llvm-svn: 186657	2013-07-19 08:14:45 +00:00
Craig Topper	5603df45df	Use SmallVectorImpl& for function arguments instead of SmallVector. llvm-svn: 185715	2013-07-05 19:34:19 +00:00
Stephen Lin	9dc6eef755	Restore r184205 and associated commits (after commit of r185290) This allows clang to use the backend parameter attribute 'returned' when generating 'this'-returning constructors and destructors in ARM and MSVC C++ ABIs. llvm-svn: 185291	2013-06-30 20:40:16 +00:00
Eli Friedman	c7ad5c4e29	Delete dead code. llvm-svn: 185119	2013-06-28 00:23:34 +00:00
Stephen Lin	19cee1871e	Revert r184205 and associated patches while investigating issue with broken buildbot (possible interaction with LTO) <rdar://problem/14209661> llvm-svn: 184384	2013-06-19 23:23:19 +00:00
Reid Kleckner	d29f1342c2	[CodeGen] Move EHScopeStack into its own header CGCleanup.h isn't meant to be included by all of CodeGen according to John. llvm-svn: 184321	2013-06-19 17:07:50 +00:00
Stephen Lin	a637fb8ccd	CodeGen: Have 'this'-returning constructors and destructors to take advantage of the new backend 'returned' attribute. The backend will now use the generic 'returned' attribute to form tail calls where possible, as well as avoid save-restores of 'this' in some cases (specifically the cases that matter for the ARM C++ ABI). This patch also reverts a prior front-end only partial implementation of these optimizations, since it's no longer required. llvm-svn: 184205	2013-06-18 17:00:49 +00:00
Richard Smith	a1c9d4d932	Simplify: we don't need any special-case lifetime extension when initializing declarations of reference type; they're handled by the general case handling of MaterializeTemporaryExpr. llvm-svn: 183875	2013-06-12 23:38:09 +00:00
Richard Smith	cc1b96d356	PR12086, PR15117 Introduce CXXStdInitializerListExpr node, representing the implicit construction of a std::initializer_list<T> object from its underlying array. The AST representation of such an expression goes from an InitListExpr with a flag set, to a CXXStdInitializerListExpr containing a MaterializeTemporaryExpr containing an InitListExpr (possibly wrapped in a CXXBindTemporaryExpr). This more detailed representation has several advantages, the most important of which is that the new MaterializeTemporaryExpr allows us to directly model lifetime extension of the underlying temporary array. Using that, this patch drastically simplifies the IR generation of this construct, provides IR generation support for nested global initializer_list objects, fixes several bugs where the destructors for the underlying array would accidentally not get invoked, and provides constant expression evaluation support for std::initializer_list objects. llvm-svn: 183872	2013-06-12 22:31:48 +00:00
Richard Smith	736a947bdc	Reapply r183721, reverted in r183776, with a fix for a bug in the former (we were lacking ExprWithCleanups nodes in some cases where the new approach to lifetime extension needed them). Original commit message: Rework IR emission for lifetime-extended temporaries. Instead of trying to walk into the expression and dig out a single lifetime-extended entity and manually pull its cleanup outside the expression, instead keep a list of the cleanups which we'll need to emit when we get to the end of the full-expression. Also emit those cleanups early, as EH-only cleanups, to cover the case that the full-expression does not terminate normally. This allows IR generation to properly model temporary lifetime when multiple temporaries are extended by the same declaration. We have a pre-existing bug where an exception thrown from a temporary's destructor does not clean up lifetime-extended temporaries created in the same expression and extended to automatic storage duration; that is not fixed by this patch. llvm-svn: 183859	2013-06-12 20:42:33 +00:00
Eli Friedman	f045007f11	Add support for complex compound assignments where the LHS is a scalar. Fixes <rdar://problem/11224126> and PR12790. llvm-svn: 183821	2013-06-12 01:40:06 +00:00
Richard Smith	4a28f534e1	Revert r183721. It caused cleanups to be delayed too long in some cases. Testcase to follow. llvm-svn: 183776	2013-06-11 19:14:25 +00:00
Richard Smith	7c5d4dce49	Rework IR emission for lifetime-extended temporaries. Instead of trying to walk into the expression and dig out a single lifetime-extended entity and manually pull its cleanup outside the expression, instead keep a list of the cleanups which we'll need to emit when we get to the end of the full-expression. Also emit those cleanups early, as EH-only cleanups, to cover the case that the full-expression does not terminate normally. This allows IR generation to properly model temporary lifetime when multiple temporaries are extended by the same declaration. We have a pre-existing bug where an exception thrown from a temporary's destructor does not clean up lifetime-extended temporaries created in the same expression and extended to automatic storage duration; that is not fixed by this patch. llvm-svn: 183721	2013-06-11 02:41:00 +00:00
Eli Friedman	4871a46cc3	Make sure we don't emit invalid IR for StmtExprs with complex cleanups. Fixes <rdar://problem/14074868>. llvm-svn: 183699	2013-06-10 22:04:49 +00:00
Reid Kleckner	200fe22a13	[CodeGen] Move EHScopeStack to CGCleanup.h from CodeGenFunction.h No functionality change. CGCleanup.cpp provides the implementation for EHScopeStack, so it seems more consistent to place the class definition in CGCleanup.h. This should also help solve a header ordering problem that I have. llvm-svn: 183631	2013-06-09 16:45:02 +00:00
Reid Kleckner	d8cbeec178	[ms-cxxabi] Implement MSVC virtual base adjustment While we can't yet emit vbtables, this allows us to find virtual bases of objects constructed in other TUs. This make iostream hello world work, since basic_ostream virtually inherits from basic_ios. Differential Revision: http://llvm-reviews.chandlerc.com/D795 llvm-svn: 182870	2013-05-29 18:02:47 +00:00
Adrian Prantl	dc237b52bc	Cleanup: Use a member variable to store the SourceLocation for EH code. rdar://problem/13888152 llvm-svn: 181957	2013-05-16 00:41:26 +00:00
David Blaikie	7d17010db5	Use only explicit bool conversion operator The most common (non-buggy) case are where such objects are used as return expressions in bool-returning functions or as boolean function arguments. In those cases I've used (& added if necessary) a named function to provide the equivalent (or sometimes negative, depending on convenient wording) test. DiagnosticBuilder kept its implicit conversion operator owing to the prevalent use of it in return statements. One bug was found in ExprConstant.cpp involving a comparison of two PointerUnions (PointerUnion did not previously have an operator==, so instead both operands were converted to bool & then compared). A test is included in test/SemaCXX/constant-expression-cxx1y.cpp for the fix (adding operator== to PointerUnion in LLVM). llvm-svn: 181869	2013-05-15 07:37:26 +00:00
Ben Langmuir	3b4c30b7e7	CodeGen for CapturedStmts EmitCapturedStmt creates a captured struct containing all of the captured variables, and then emits a call to the outlined function. This is similar in principle to EmitBlockLiteral. GenerateCapturedFunction actually produces the outlined function. It is based on GenerateBlockFunction, but is much simpler. The function type is determined by the parameters that are in the CapturedDecl. Some changes have been added to this patch that were reviewed as part of the serialization patch and moving the parameters to the captured decl. Differential Revision: http://llvm-reviews.chandlerc.com/D640 llvm-svn: 181536	2013-05-09 19:17:11 +00:00
Richard Smith	ea85232c40	Don't crash in IRGen if a conditional with 'throw' in one of its branches is used as a branch condition. llvm-svn: 181368	2013-05-07 21:53:22 +00:00
Tim Northover	8ec8c4bf89	AArch64: teach Clang about __clear_cache intrinsic libgcc provides a __clear_cache intrinsic on AArch64, much like it does on 32-bit ARM. llvm-svn: 181111	2013-05-04 07:15:13 +00:00

... 4 5 6 7 8 ...

1277 Commits