llvm-project

Commit Graph

Author	SHA1	Message	Date
Alexey Bataev	6125da9258	[OPENMP] Initial parsing and sema analysis for 'flush' directive. llvm-svn: 213512	2014-07-21 11:26:11 +00:00
Alexander Musman	d9ed09f7a5	[OPENMP] Parsing/Sema of the OpenMP directive 'critical'. llvm-svn: 213510	2014-07-21 09:42:05 +00:00
Arnaud A. de Grandmaison	18bc4fff48	Revert "Emit lifetime.start / lifetime.end markers for unnamed temporary objects." This reverts commit dbf785a6432f78a8ec229665876647c4cc610d3d, while I qm investigating a buildbot failure. llvm-svn: 213380	2014-07-18 14:23:58 +00:00
Arnaud A. de Grandmaison	1be89f4977	Emit lifetime.start / lifetime.end markers for unnamed temporary objects. This will give more information to the optimizers so that they can reuse stack slots. llvm-svn: 213379	2014-07-18 13:36:33 +00:00
Alexey Bataev	2df347ad96	[OPENMP] Initial parsing and sema analysis for 'taskwait' directive. llvm-svn: 213363	2014-07-18 10:17:07 +00:00
Alexey Bataev	4d1dfeabc9	[OPENMP] Initial parsing and sema analysis for 'barrier' directive. llvm-svn: 213360	2014-07-18 09:11:51 +00:00
Alexey Bataev	68446b7253	[OPENMP] Initial parsing and sema analysis of 'taskyield' directive. llvm-svn: 213355	2014-07-18 07:47:19 +00:00
Alexey Samsonov	24cad99307	[UBSan] Add !nosanitize metadata to the code generated by UBSan. This is used to mark the instructions emitted by Clang to implement variety of UBSan checks. Generally, we don't want to instrument these instructions with another sanitizers (like ASan). Reviewed in http://reviews.llvm.org/D4544 llvm-svn: 213291	2014-07-17 18:46:27 +00:00
Alexander Musman	80c2289a03	[OPENMP] Parsing/Sema analysis of directive 'master' llvm-svn: 213237	2014-07-17 08:54:58 +00:00
Alexey Bataev	9c2e8ee72f	[OPENMP] Parsing and sema analysis for 'omp task' directive. llvm-svn: 212804	2014-07-11 11:25:16 +00:00
David Blaikie	1b5adb82d9	Fix the dtor location issues in PR20038 harder. Originally committed in r211722, this fixed one case of dtor calls being emitted without locations (this causes problems for debug info if the call is then inlined), this caught only some of the cases. Instead of trying to re-enable the location before the cleanup, simply re-enable the location immediately after the unconditional branches in question using a scoped device to ensure the no-location state doesn't leak out arbitrarily. llvm-svn: 212761	2014-07-10 20:42:59 +00:00
Alexey Bataev	84d0b3efee	[OPENMP] Parsing and sema analysis for 'omp parallel sections' directive. llvm-svn: 212516	2014-07-08 08:12:03 +00:00
Alexey Samsonov	ac4afe49e7	[Sanitizer] Remove brittle cache variable and slightly simplify blacklisting code. Now CodeGenFunction is responsible for looking at sanitizer blacklist (in CodeGenFunction::StartFunction) and turning off instrumentation, if necessary. No functionality change. llvm-svn: 212501	2014-07-07 23:59:57 +00:00
Alexey Bataev	4acb859fbd	[OPENMP] Added initial support for 'omp parallel for'. llvm-svn: 212453	2014-07-07 13:01:15 +00:00
Nico Weber	9b982078e9	Add an AST node for __leave statements, hook it up. Codegen is still missing (and I won't work on that), but __leave is now as implemented as __try and friends. llvm-svn: 212425	2014-07-07 00:12:30 +00:00
Logan Chien	e9c8ccbf8f	Remove CleanupHackLevel from CGException. This patch removes the dead code, and refines the getEHResumeBlock() slightly. The CleanupHackLevel was a hack to the old exception handling intrinsics, which have several issues with function inliner. Since LLVM 3.0, the new landingpad and resume instructions are added to LLVM IR. With the new exception handling mechanism, most of the issues are fixed now. We should always use these instructions to implement the exception handling code nowadays, and we don't need the hack any more. Besides, the `CleanupHackLevel` is a compile-time constant, thus other cases have been considered as dead code for a while. llvm-svn: 212097	2014-07-01 11:47:10 +00:00
Alexey Bataev	aca7fcf276	Using of variable length arrays in captured statements and OpenMP constructs. Differential Revision: http://reviews.llvm.org/D4067 llvm-svn: 212010	2014-06-30 02:55:54 +00:00
Craig Topper	00bbdcf9b3	Remove llvm:: from uses of ArrayRef. llvm-svn: 211987	2014-06-28 23:22:23 +00:00
Alexey Bataev	d1e40fbfe1	[OPENMP] Initial parsing and sema analysis for 'single' directive. llvm-svn: 211774	2014-06-26 12:05:45 +00:00
Alexey Bataev	1e0498a92d	[OPENMP] Initial parsing and sema analysis for 'section' directive. llvm-svn: 211767	2014-06-26 08:21:58 +00:00
Alexey Bataev	d3f8dd2d15	[OPENMP] Initial support for 'sections' directive. llvm-svn: 211685	2014-06-25 11:44:49 +00:00
Matt Arsenault	56f008d538	Add R600 builtin codegen. llvm-svn: 211631	2014-06-24 20:45:01 +00:00
Tim Northover	6ea28bdef5	ARM: remove dead CodeGen functions. These two are no longer being used by NEON codegen. llvm-svn: 211586	2014-06-24 12:07:44 +00:00
Alexey Bataev	f29276edb7	[OPENMP] Initial support for '#pragma omp for' (fixed incompatibility with MSVC). llvm-svn: 211140	2014-06-18 04:14:57 +00:00
Rafael Espindola	a566efbec9	Revert "[OPENMP] Initial support for '#pragma omp for'." This reverts commit r211096. Looks like it broke the msvc build: SemaOpenMP.cpp(140) : error C4519: default template arguments are only allowed on a class template llvm-svn: 211113	2014-06-17 17:20:53 +00:00
Alexey Bataev	c77dd5257a	[OPENMP] Initial support for '#pragma omp for'. llvm-svn: 211096	2014-06-17 11:49:22 +00:00
Aaron Ballman	b06b15aa28	Adding a new #pragma for the vectorize and interleave optimization hints. Patch thanks to Tyler Nowicki! llvm-svn: 210330	2014-06-06 12:40:24 +00:00
Richard Smith	760520bcb7	Add __builtin_operator_new and __builtin_operator_delete, which act like calls to the normal non-placement ::operator new and ::operator delete, but allow optimizations like new-expressions and delete-expressions do. llvm-svn: 210137	2014-06-03 23:27:44 +00:00
Richard Smith	06a67e2c6f	When emitting a multidimensional array new, emit the initializers for the trailing elements as a single loop, rather than sometimes emitting a nest of several loops. This fixes a bug where CodeGen would sometimes try to emit an expression with the wrong type for the element being initialized. Plus various other minor cleanups to the IR produced for array new initialization. llvm-svn: 210079	2014-06-03 06:58:52 +00:00
Tim Northover	573cbee543	AArch64/ARM64: rename ARM64 components to AArch64 This keeps Clang consistent with backend naming conventions. llvm-svn: 209579	2014-05-24 12:52:07 +00:00
Tim Northover	25e8a6754e	AArch64/ARM64: update Clang after AArch64 removal. A few (mostly CodeGen) parts of Clang were tightly coupled to the AArch64 backend. Now that it's gone, they will not even compile. I've also deduplicated RUN lines in many of the AArch64 tests. This might improve "make check-all" time noticably: some of those NEON tests were monsters. llvm-svn: 209578	2014-05-24 12:51:25 +00:00
Alexander Musman	515ad8c490	This patch adds a helper class (CGLoopInfo) for marking memory instructions with llvm.mem.parallel_loop_access metadata. It also adds a simple initial version of codegen for pragma omp simd (it will change in the future to support all the clauses). Differential revision: http://reviews.llvm.org/D3644 llvm-svn: 209411	2014-05-22 08:54:05 +00:00
Craig Topper	8a13c4180e	[C++11] Use 'nullptr'. CodeGen edition. llvm-svn: 209272	2014-05-21 05:09:00 +00:00
Renato Golin	230c5eb4bd	Non-allocatable Global Named Register This patch implements global named registers in Clang, lowering to the just created intrinsics in LLVM (@llvm.read/write_register). A new type of LValue had to be created (Register), which just adds support to carry the metadata node containing the name of the register. Two new methods to emit loads and stores interoperate with another to emit the named metadata node. No guarantees are being made and only non-allocatable global variable named registers are being supported. Local named register support is unchanged. llvm-svn: 209149	2014-05-19 18:15:42 +00:00
Rafael Espindola	42ae74531c	Don't indent in namespaces. llvm-svn: 208384	2014-05-09 00:57:59 +00:00
Alexey Bataev	9959db5fa9	[OPENMP] Initial codegen for '#pragma omp parallel' llvm-svn: 208077	2014-05-06 10:08:46 +00:00
Justin Bogner	81ab90f7ed	CodeGen: Handle CapturedStmt in instrumentation based profiling CapturedStmt was being ignored by instrumentation based profiling, and its counters attributed to the containing function. Instead, we need to treat this as a top level entity, like we do with blocks. llvm-svn: 206231	2014-04-15 00:50:54 +00:00
Adrian Prantl	22e66b434a	Cleanup: Add default arguments to CodeGenFunction::StartFunction. Thanks dblaikie for the suggestion! llvm-svn: 206012	2014-04-11 01:13:04 +00:00
Adrian Prantl	42d71b9906	Debug info: (Bugfix) Make sure artificial functions like _GLOBAL__I_a are not associated with any source lines. Previously, if the Location of a Decl was empty, EmitFunctionStart would just keep using CurLoc, which would sometimes be correct (e.g., thunks) but in other cases would just point to a hilariously random location. This patch fixes this by completely eliminating all uses of CurLoc from EmitFunctionStart and rather have clients explicitly pass in a SourceLocation for the function header and the function body. rdar://problem/14985269 llvm-svn: 205999	2014-04-10 23:21:53 +00:00
Tim Northover	a2ee433c8d	ARM64: initial clang support commit. This adds Clang support for the ARM64 backend. There are definitely still some rough edges, so please bring up any issues you see with this patch. As with the LLVM commit though, we think it'll be more useful for merging with AArch64 from within the tree. llvm-svn: 205100	2014-03-29 15:09:45 +00:00
Eli Bendersky	cb39943f6f	Proper handling of static local variables with address space qualifiers. Similar to the implementation for globals in r157167. Patch by Jingyue Wu. llvm-svn: 204677	2014-03-24 22:05:38 +00:00
Chandler Carruth	61743af166	[Modules] Update to reflect ValueHandle moving to the IR library in LLVM r202821. llvm-svn: 202822	2014-03-04 11:18:19 +00:00
Tim Northover	8fe03d6111	ARM & AArch64: use table for EmitCommonNeonBuiltinExpr This extends the intrinsic lookup table format slightly, and adds entries for use the shared ARM/AArch64 definitions. The benefit is currently smaller than for the SISD intrinsics (there's more custom code implementing this set), but a few lines are saved and there's scope for future expansion. llvm-svn: 201848	2014-02-21 11:57:24 +00:00
Tim Northover	2d83796860	AArch64: refactor table-driven NEON lookup. This extracts the table-driven intrinsic lookup phase into a separate function, to be used by EmitCommonNeonBuiltinExpr soon. It also simplifies the logic used in that lookup, since VectorCastArgN and ScalarArgN were actually identical. llvm-svn: 201847	2014-02-21 11:57:20 +00:00
Bob Wilson	bf854f0f53	Change PGO instrumentation to compute counts in a separate AST traversal. Previously, we made one traversal of the AST prior to codegen to assign counters to the ASTs and then propagated the count values during codegen. This patch now adds a separate AST traversal prior to codegen for the -fprofile-instr-use option to propagate the count values. The counts are then saved in a map from which they can be retrieved during codegen. This new approach has several advantages: 1. It gets rid of a lot of extra PGO-related code that had previously been added to codegen. 2. It fixes a serious bug. My original implementation (which was mailed to the list but never committed) used 3 counters for every loop. Justin improved it to move 2 of those counters into the less-frequently executed breaks and continues, but that turned out to produce wrong count values in some cases. The solution requires visiting a loop body before the condition so that the count for the condition properly includes the break and continue counts. Changing codegen to visit a loop body first would be a fairly invasive change, but with a separate AST traversal, it is easy to control the order of traversal. I've added a testcase (provided by Justin) to make sure this works correctly. 3. It improves the instrumentation overhead, reducing the number of counters for a loop from 3 to 1. We no longer need dedicated counters for breaks and continues, since we can just use the propagated count values when visiting breaks and continues. To make this work, I needed to make a change to the way we count case statements, going back to my original approach of not including the fall-through in the counter values. This was necessary because there isn't always an AST node that can be used to record the fall-through count. Now case statements are handled the same as default statements, with the fall-through paths branching over the counter increments. While I was at it, I also went back to using this approach for do-loops -- omitting the fall-through count into the loop body simplifies some of the calculations and make them behave the same as other loops. Whenever we start using this instrumentation for coverage, we'll need to add the fall-through counts into the counter values. llvm-svn: 201528	2014-02-17 19:21:09 +00:00
Fariborz Jahanian	7741101dce	[IRGen]. Fixes a crash in using Objective-C array properties by fixing shouldBindAsLValue to accept arrays (like record types) because we always manipulate them in memory. Patch suggested by John MaCall. // rdar://15610943 llvm-svn: 201428	2014-02-14 19:37:25 +00:00
Reid Kleckner	314ef7bafd	[ms-cxxabi] Use inalloca on win32 when passing non-trivial C++ objects When a non-trivial parameter is present, clang now gathers up all the parameters that lack inreg and puts them into a packed struct. MSVC always aligns each parameter to 4 bytes and no more, so this is a pretty simple struct to lay out. On win64, non-trivial records are passed indirectly. Prior to this change, clang was incorrectly using byval on win64. I'm able to self-host a working clang with this change and additional LLVM patches. Reviewers: rsmith Differential Revision: http://llvm-reviews.chandlerc.com/D2636 llvm-svn: 200597	2014-02-01 00:04:45 +00:00
Tim Northover	027b4ee607	ARM & AArch64: move shared vld/vst intrinsics to common implementation. llvm-svn: 200526	2014-01-31 10:46:45 +00:00
Tim Northover	58c4474dea	ARM & AArch64: extend shared NEON implementation to first block. This extends the refactoring to the whole of the first block of trivial correspondences (as a fairly arbitrary boundary). llvm-svn: 200472	2014-01-30 14:48:01 +00:00
Tim Northover	ac85c341ae	ARM & AArch64: fully share NEON implementation of permutation intrinsics As a starting point, this moves the CodeGen for NEON permutation instructions (vtrn, vzip, vuzp) into a new shared function. llvm-svn: 200471	2014-01-30 14:47:57 +00:00
Justin Bogner	e25ffdf8a1	Revert "CodeGen: Simplify CodeGenFunction::EmitCaseStmt" I misunderstood the discussion on this. The complexity here is justified by the malloc overhead it saves. This reverts commit r199302. llvm-svn: 199700	2014-01-21 00:35:11 +00:00
Alp Toker	9cacbabd33	Rename FunctionProtoType accessors from 'arguments' to 'parameters' Fix a perennial source of confusion in the clang type system: Declarations and function prototypes have parameters to which arguments are supplied, so calling these 'arguments' was a stretch even in C mode, let alone C++ where default arguments, templates and overloading make the distinction important to get right. Readability win across the board, especially in the casting, ADL and overloading implementations which make a lot more sense at a glance now. Will keep an eye on the builders and update dependent projects shortly. No functional change. llvm-svn: 199686	2014-01-20 20:26:09 +00:00
Justin Bogner	4c5c99f91a	CodeGen: Simplify CodeGenFunction::EmitCaseStmt Way back in r129652 we tried to avoid emitting an empty block at -O0 for switch cases that did nothing but break. This led to a poor debugging experience as reported in PR9796, so we disabled the optimization for -O0 but left it in for higher optimization levels in r154420. Since the whole point of this was to improve -O0, it's silly to keep the complexity at all. llvm-svn: 199302	2014-01-15 07:30:30 +00:00
Adrian Prantl	e83b130def	Revert "Debug info: Ensure that the last stop point in a function is still within" This reverts commit r198461. llvm-svn: 198714	2014-01-07 22:05:52 +00:00
Adrian Prantl	c6758879b3	Revert "Debug info: Implement a cleaner version of r198461. For symmetry with" This reverts commit 198699 so we can get a cleaner patch. llvm-svn: 198713	2014-01-07 22:05:45 +00:00
Adrian Prantl	f5ff0dc29b	Debug info: Implement a cleaner version of r198461. For symmetry with C and C++ don't emit an extra lexical scope for the compound statement that is the body of an Objective-C method. rdar://problem/15010825 llvm-svn: 198699	2014-01-07 19:24:24 +00:00
Chandler Carruth	5553d0d4ca	Sort all the #include lines with LLVM's utils/sort_includes.py which encodes the canonical rules for LLVM's style. I noticed this had drifted quite a bit when cleaning up LLVM, so wanted to clean up Clang as well. llvm-svn: 198686	2014-01-07 11:51:46 +00:00
Justin Bogner	ef512b9929	CodeGen: Initial instrumentation based PGO implementation llvm-svn: 198640	2014-01-06 22:27:43 +00:00
Adrian Prantl	96e70d9148	Debug info: Ensure that the last stop point in a function is still within the lexical block formed by the compound statement that is the function body. rdar://problem/15010825 llvm-svn: 198461	2014-01-03 23:34:30 +00:00
Reid Kleckner	89077a1b00	[ms-cxxabi] The 'most derived' ctor parameter usually comes last Unlike Itanium's VTTs, the 'most derived' boolean or bitfield is the last parameter for non-variadic constructors, rather than the second. For variadic constructors, the 'most derived' parameter comes after the 'this' parameter. This affects constructor calls and constructor decls in a variety of places. Reviewers: timurrrr Differential Revision: http://llvm-reviews.chandlerc.com/D2405 llvm-svn: 197518	2013-12-17 19:46:40 +00:00
Reid Kleckner	739756c0f9	[ms-cxxabi] Construct and destroy call arguments in the correct order Summary: MSVC destroys arguments in the callee from left to right. Because C++ objects have to be destroyed in the reverse order of construction, Clang has to construct arguments from right to left and destroy arguments from left to right. This patch fixes the ordering by reversing the order of evaluation of all call arguments under the MS C++ ABI. Fixes PR18035. Reviewers: rsmith Differential Revision: http://llvm-reviews.chandlerc.com/D2275 llvm-svn: 196402	2013-12-04 19:23:12 +00:00
Hans Wennborg	88497d6157	[-cxx-abi microsoft] Emit thunks for pointers to virtual member functions Instead of storing the vtable offset directly in the function pointer and doing a branch to check for virtualness at each call site, the MS ABI generates a thunk for calling the function at a specific vtable offset, and puts that in the function pointer. This patch adds support for emitting such thunks. However, it doesn't support pointers to virtual member functions that are variadic, have an incomplete aggregate return type or parameter, or are overriding a function in a virtual base class. Differential Revision: http://llvm-reviews.chandlerc.com/D2104 llvm-svn: 194827	2013-11-15 17:24:45 +00:00
Kevin Qin	1718af6f0a	Implement aarch64 neon instruction class misc. llvm-svn: 194657	2013-11-14 02:45:18 +00:00
Richard Smith	b47c36f8e1	C++1y sized deallocation: if we have a use, but not a definition, of a sized deallocation function (and the corresponding unsized deallocation function has been declared), emit a weak discardable definition of the function that forwards to the corresponding unsized deallocation. This allows a C++ standard library implementation to provide both a sized and an unsized deallocation function, where the unsized one does not just call the sized one, for instance by putting both in the same object file within an archive. llvm-svn: 194055	2013-11-05 09:12:18 +00:00
Peter Collingbourne	b453cd64a7	Implement function type checker for the undefined behavior sanitizer. This uses function prefix data to store function type information at the function pointer. Differential Revision: http://llvm-reviews.chandlerc.com/D1338 llvm-svn: 193058	2013-10-20 21:29:19 +00:00
Amaury de la Vieuville	21bf6ed730	Do not emit undefined lsrh/ashr for NEON shifts These IR instructions are undefined when the amount is equal to operand size, but NEON right shifts support such shifts. Work around that by emitting a different IR in these cases. llvm-svn: 191953	2013-10-04 13:13:15 +00:00
Nick Lewycky	2d84e84236	Thread a SourceLocation into the EmitCheck for "load_invalid_value". This occurs when scalars are loaded / undergo lvalue-to-rvalue conversion. llvm-svn: 191808	2013-10-02 02:29:49 +00:00
Faisal Vali	571df12581	Implement conversion to function pointer for generic lambdas without captures. The general strategy is to create template versions of the conversion function and static invoker and then during template argument deduction of the conversion function, create the corresponding call-operator and static invoker specializations, and when the conversion function is marked referenced generate the body of the conversion function using the corresponding static-invoker specialization. Similarly, Codegen does something similar - when asked to emit the IR for a specialized static invoker of a generic lambda, it forwards emission to the corresponding call operator. This patch has been reviewed in person both by Doug and Richard. Richard gave me the LGTM. A few minor changes: - per Richard's request i added a simple check to gracefully inform that captures (init, explicit or default) have not been added to generic lambdas just yet (instead of the assertion violation). - I removed a few lines of code that added the call operators instantiated parameters to the currentinstantiationscope. Not only did it not handle parameter packs, but it is more relevant in the patch for nested lambdas which will follow this one, and fix that problem more comprehensively. - Doug had commented that the original implementation strategy of using the TypeSourceInfo of the call operator to create the static-invoker was flawed and allowed const as a member qualifier to creep into the type of the static-invoker. I currently kludge around it - but after my initial discussion with Doug, with a follow up session with Richard, I have added a FIXME so that a more elegant solution that involves the use of TrivialTypeSourceInfo call followed by the correct wiring of the template parameters to the functionprototypeloc is forthcoming. Thanks! llvm-svn: 191634	2013-09-29 08:45:24 +00:00
Reid Kleckner	543a16c06b	Emit an error when attempting to generate IR for SEH __try Currently we silently omit the code in the try and finally bodies, which is pretty bad. This way we fail loudly. llvm-svn: 190809	2013-09-16 21:46:30 +00:00
Yunzhong Gao	0ebf1bb150	Revert r189649 because it was breaking sanitizer bots. llvm-svn: 189660	2013-08-30 08:53:09 +00:00
Yunzhong Gao	be8d7ba93a	Fixing a bug where debug info for a local variable gets emitted at file scope. The patch was discussed in Phabricator. See: http://llvm-reviews.chandlerc.com/D1281 llvm-svn: 189649	2013-08-30 05:37:02 +00:00
David Blaikie	ebe87e1cfa	Revert "PR14569: Omit debug info for thunks" This reverts commit r189320. Alexey Samsonov and Dmitry Vyukov presented some arguments for keeping these around - though it still seems like those tasks could be solved by a tool just using the symbol table. In a very small number of cases, thunks may be inlined & debug info might be able to save profilers & similar tools from misclassifying those cases as part of the caller. The extra changes here plumb through the VarDecl for various cases to CodeGenFunction - this provides better fidelity through a few APIs but generally just causes the CGF::StartFunction to fallback to using the name of the IR function as the name in the debug info. The changes to debug-info-global-ctor-dtor.cpp seem like goodness. The two names that go missing (in favor of only emitting those names as linkage names) are names that can be demangled - emitting them only as the linkage name should encourage tools to do just that. Again, thanks to Dinesh Dwivedi for investigation/work on this issue. llvm-svn: 189421	2013-08-27 23:57:18 +00:00
David Blaikie	92848dee31	Simplify/clean up debug info suppression in CodeGenFunction CodeGenFunction is run on only one function - a new object is made for each new function. I would add an assertion/flag to this effect, but there's an exception: ObjC properties involve emitting helper functions that are all emitted by the same CodeGenFunction object, so such a check is not possible/correct. llvm-svn: 189277	2013-08-26 20:33:21 +00:00
Benjamin Kramer	7463ed7c89	CodeGen: Unify two implementations of canDevirtualizeMemberFunctionCall. They were mostly copy&paste of each other, move it to CodeGenFunction. Of course the two implementations have diverged over time; the one in CGExprCXX seems to be the more modern one so I picked that one and moved it to CGClass which feels like a better home for it. No intended functionality change. llvm-svn: 189203	2013-08-25 22:46:27 +00:00
Timur Iskhodzhanov	d8fa10db12	[CGF] Get rid of passing redundant VTable pointer around in CodeGenFunction::InitializeVTablePointer[s] llvm-svn: 188909	2013-08-21 17:33:16 +00:00
Timur Iskhodzhanov	88fd439a24	Abstract out virtual calls and virtual function prologue code generation; implement them for -cxx-abi microsoft llvm-svn: 188870	2013-08-21 06:25:03 +00:00
David Blaikie	4a9ec7b59d	PR16933: Don't try to codegen things after we've seen errors. Refactor the underlying code a bit to remove unnecessary calls to "hasErrorOccurred" & make them consistently at all the entry points to the IRGen ASTConsumer. llvm-svn: 188707	2013-08-19 21:02:26 +00:00
Adrian Prantl	ca64c3e136	Debug Info / EmitCallArgs: arguments may modify the debug location. Restore it after each argument is emitted. This fixes the scope info for inlined subroutines inside of function argument expressions. (E.g., anything STL). rdar://problem/12592135 llvm-svn: 187240	2013-07-26 20:42:57 +00:00
Timur Iskhodzhanov	03e8746f90	Simplify the CodeGenFunction::BuildVirtualCall family of functions llvm-svn: 186657	2013-07-19 08:14:45 +00:00
Craig Topper	5603df45df	Use SmallVectorImpl& for function arguments instead of SmallVector. llvm-svn: 185715	2013-07-05 19:34:19 +00:00
Stephen Lin	9dc6eef755	Restore r184205 and associated commits (after commit of r185290) This allows clang to use the backend parameter attribute 'returned' when generating 'this'-returning constructors and destructors in ARM and MSVC C++ ABIs. llvm-svn: 185291	2013-06-30 20:40:16 +00:00
Eli Friedman	c7ad5c4e29	Delete dead code. llvm-svn: 185119	2013-06-28 00:23:34 +00:00
Stephen Lin	19cee1871e	Revert r184205 and associated patches while investigating issue with broken buildbot (possible interaction with LTO) <rdar://problem/14209661> llvm-svn: 184384	2013-06-19 23:23:19 +00:00
Reid Kleckner	d29f1342c2	[CodeGen] Move EHScopeStack into its own header CGCleanup.h isn't meant to be included by all of CodeGen according to John. llvm-svn: 184321	2013-06-19 17:07:50 +00:00
Stephen Lin	a637fb8ccd	CodeGen: Have 'this'-returning constructors and destructors to take advantage of the new backend 'returned' attribute. The backend will now use the generic 'returned' attribute to form tail calls where possible, as well as avoid save-restores of 'this' in some cases (specifically the cases that matter for the ARM C++ ABI). This patch also reverts a prior front-end only partial implementation of these optimizations, since it's no longer required. llvm-svn: 184205	2013-06-18 17:00:49 +00:00
Richard Smith	a1c9d4d932	Simplify: we don't need any special-case lifetime extension when initializing declarations of reference type; they're handled by the general case handling of MaterializeTemporaryExpr. llvm-svn: 183875	2013-06-12 23:38:09 +00:00
Richard Smith	cc1b96d356	PR12086, PR15117 Introduce CXXStdInitializerListExpr node, representing the implicit construction of a std::initializer_list<T> object from its underlying array. The AST representation of such an expression goes from an InitListExpr with a flag set, to a CXXStdInitializerListExpr containing a MaterializeTemporaryExpr containing an InitListExpr (possibly wrapped in a CXXBindTemporaryExpr). This more detailed representation has several advantages, the most important of which is that the new MaterializeTemporaryExpr allows us to directly model lifetime extension of the underlying temporary array. Using that, this patch drastically simplifies the IR generation of this construct, provides IR generation support for nested global initializer_list objects, fixes several bugs where the destructors for the underlying array would accidentally not get invoked, and provides constant expression evaluation support for std::initializer_list objects. llvm-svn: 183872	2013-06-12 22:31:48 +00:00
Richard Smith	736a947bdc	Reapply r183721, reverted in r183776, with a fix for a bug in the former (we were lacking ExprWithCleanups nodes in some cases where the new approach to lifetime extension needed them). Original commit message: Rework IR emission for lifetime-extended temporaries. Instead of trying to walk into the expression and dig out a single lifetime-extended entity and manually pull its cleanup outside the expression, instead keep a list of the cleanups which we'll need to emit when we get to the end of the full-expression. Also emit those cleanups early, as EH-only cleanups, to cover the case that the full-expression does not terminate normally. This allows IR generation to properly model temporary lifetime when multiple temporaries are extended by the same declaration. We have a pre-existing bug where an exception thrown from a temporary's destructor does not clean up lifetime-extended temporaries created in the same expression and extended to automatic storage duration; that is not fixed by this patch. llvm-svn: 183859	2013-06-12 20:42:33 +00:00
Eli Friedman	f045007f11	Add support for complex compound assignments where the LHS is a scalar. Fixes <rdar://problem/11224126> and PR12790. llvm-svn: 183821	2013-06-12 01:40:06 +00:00
Richard Smith	4a28f534e1	Revert r183721. It caused cleanups to be delayed too long in some cases. Testcase to follow. llvm-svn: 183776	2013-06-11 19:14:25 +00:00
Richard Smith	7c5d4dce49	Rework IR emission for lifetime-extended temporaries. Instead of trying to walk into the expression and dig out a single lifetime-extended entity and manually pull its cleanup outside the expression, instead keep a list of the cleanups which we'll need to emit when we get to the end of the full-expression. Also emit those cleanups early, as EH-only cleanups, to cover the case that the full-expression does not terminate normally. This allows IR generation to properly model temporary lifetime when multiple temporaries are extended by the same declaration. We have a pre-existing bug where an exception thrown from a temporary's destructor does not clean up lifetime-extended temporaries created in the same expression and extended to automatic storage duration; that is not fixed by this patch. llvm-svn: 183721	2013-06-11 02:41:00 +00:00
Eli Friedman	4871a46cc3	Make sure we don't emit invalid IR for StmtExprs with complex cleanups. Fixes <rdar://problem/14074868>. llvm-svn: 183699	2013-06-10 22:04:49 +00:00
Reid Kleckner	200fe22a13	[CodeGen] Move EHScopeStack to CGCleanup.h from CodeGenFunction.h No functionality change. CGCleanup.cpp provides the implementation for EHScopeStack, so it seems more consistent to place the class definition in CGCleanup.h. This should also help solve a header ordering problem that I have. llvm-svn: 183631	2013-06-09 16:45:02 +00:00
Reid Kleckner	d8cbeec178	[ms-cxxabi] Implement MSVC virtual base adjustment While we can't yet emit vbtables, this allows us to find virtual bases of objects constructed in other TUs. This make iostream hello world work, since basic_ostream virtually inherits from basic_ios. Differential Revision: http://llvm-reviews.chandlerc.com/D795 llvm-svn: 182870	2013-05-29 18:02:47 +00:00
Adrian Prantl	dc237b52bc	Cleanup: Use a member variable to store the SourceLocation for EH code. rdar://problem/13888152 llvm-svn: 181957	2013-05-16 00:41:26 +00:00
David Blaikie	7d17010db5	Use only explicit bool conversion operator The most common (non-buggy) case are where such objects are used as return expressions in bool-returning functions or as boolean function arguments. In those cases I've used (& added if necessary) a named function to provide the equivalent (or sometimes negative, depending on convenient wording) test. DiagnosticBuilder kept its implicit conversion operator owing to the prevalent use of it in return statements. One bug was found in ExprConstant.cpp involving a comparison of two PointerUnions (PointerUnion did not previously have an operator==, so instead both operands were converted to bool & then compared). A test is included in test/SemaCXX/constant-expression-cxx1y.cpp for the fix (adding operator== to PointerUnion in LLVM). llvm-svn: 181869	2013-05-15 07:37:26 +00:00
Ben Langmuir	3b4c30b7e7	CodeGen for CapturedStmts EmitCapturedStmt creates a captured struct containing all of the captured variables, and then emits a call to the outlined function. This is similar in principle to EmitBlockLiteral. GenerateCapturedFunction actually produces the outlined function. It is based on GenerateBlockFunction, but is much simpler. The function type is determined by the parameters that are in the CapturedDecl. Some changes have been added to this patch that were reviewed as part of the serialization patch and moving the parameters to the captured decl. Differential Revision: http://llvm-reviews.chandlerc.com/D640 llvm-svn: 181536	2013-05-09 19:17:11 +00:00
Richard Smith	ea85232c40	Don't crash in IRGen if a conditional with 'throw' in one of its branches is used as a branch condition. llvm-svn: 181368	2013-05-07 21:53:22 +00:00
Tim Northover	8ec8c4bf89	AArch64: teach Clang about __clear_cache intrinsic libgcc provides a __clear_cache intrinsic on AArch64, much like it does on 32-bit ARM. llvm-svn: 181111	2013-05-04 07:15:13 +00:00
Adrian Prantl	52bf3c4c3f	Reapply r180982 with repaired logic and an additional testcase. Un-break the gdb buildbot. - Use the debug location of the return expression for the cleanup code if the return expression is trivially evaluatable, regardless of the number of stop points in the function. - Ensure that any EH code in the cleanup still gets the line number of the closing } of the lexical scope. - Added a testcase with EH in the cleanup. rdar://problem/13442648 llvm-svn: 181056	2013-05-03 20:11:48 +00:00
John McCall	dec348f7db	Correctly emit certain implicit references to 'self' even within a lambda. Bug #1 is that CGF's CurFuncDecl was "stuck" at lambda invocation functions. Fix that by generally improving getNonClosureContext to look through lambdas and captured statements but only report code contexts, which is generally what's wanted. Audit uses of CurFuncDecl and getNonClosureAncestor for correctness. Bug #2 is that lambdas weren't specially mapping 'self' when inside an ObjC method. Fix that by removing the requirement for that and using the normal EmitDeclRefLValue path in LoadObjCSelf. rdar://13800041 llvm-svn: 181000	2013-05-03 07:33:41 +00:00
Adrian Prantl	857f92371a	Revert "Attempt to un-break the gdb buildbot." This reverts commit 180982. llvm-svn: 180990	2013-05-03 01:42:35 +00:00
Adrian Prantl	44f38013e2	Attempt to un-break the gdb buildbot. - Use the debug location of the return expression for the cleanup code if the return expression is trivially evaluatable, regardless of the number of stop points in the function. - Ensure that any EH code in the cleanup still gets the line number of the closing } of the lexical scope. - Added a testcase with EH in the cleanup. rdar://problem/13442648 llvm-svn: 180982	2013-05-03 00:44:13 +00:00
Adrian Prantl	3be10542af	Ensure that the line table for functions with cleanups is sequential. If there is cleanup code, the cleanup code gets the debug location of the closing '}'. The subsequent ret IR-instruction does not get a debug location. The return _expression_ will get the debug location of the return statement. If the function contains only a single, simple return statement, the cleanup code may become the first breakpoint in the function. In this case we set the debug location for the cleanup code to the location of the return statement. rdar://problem/13442648 llvm-svn: 180932	2013-05-02 17:30:20 +00:00
Benjamin Kramer	139cfc2e63	ArrayRefize code. No functionality change. llvm-svn: 180632	2013-04-26 21:32:52 +00:00
Richard Smith	852c9db72b	C++1y: Allow aggregates to have default initializers. Add a CXXDefaultInitExpr, analogous to CXXDefaultArgExpr, and use it both in CXXCtorInitializers and in InitListExprs to represent a default initializer. There's an additional complication here: because the default initializer can refer to the initialized object via its 'this' pointer, we need to make sure that 'this' points to the right thing within the evaluation. llvm-svn: 179958	2013-04-20 22:23:05 +00:00
Richard Smith	2fd1d7aee3	Implement CodeGen for C++11 thread_local, following the Itanium ABI specification as discussed on cxx-abi-dev. llvm-svn: 179858	2013-04-19 16:42:07 +00:00
John McCall	c8e0170578	Standardize accesses to the TargetInfo in IR-gen. Patch by Stephen Lin! llvm-svn: 179638	2013-04-16 22:48:15 +00:00
Tareq A. Siraj	24110cc733	Implement CapturedStmt AST CapturedStmt can be used to implement generic function outlining as described in http://lists.cs.uiuc.edu/pipermail/cfe-dev/2013-January/027540.html. CapturedStmt is not exposed to the C api. Serialization and template support are pending. Author: Wei Pan <wei.pan@intel.com> Differential Revision: http://llvm-reviews.chandlerc.com/D370 llvm-svn: 179615	2013-04-16 18:53:08 +00:00
Manman Ren	c451e5766e	Initial support for struct-path aware TBAA. Added TBAABaseType and TBAAOffset in LValue. These two fields are initialized to the actual type and 0, and are updated in EmitLValueForField. Path-aware TBAA tags are enabled for EmitLoadOfScalar and EmitStoreOfScalar. Added command line option -struct-path-tbaa. llvm-svn: 178797	2013-04-04 21:53:22 +00:00
Manman Ren	092d9e8f3b	revert r178784 since it does not have a commit message llvm-svn: 178796	2013-04-04 21:51:07 +00:00
Manman Ren	037d2b252d	Index: include/clang/Driver/CC1Options.td =================================================================== --- include/clang/Driver/CC1Options.td (revision 178718) +++ include/clang/Driver/CC1Options.td (working copy) @@ -161,6 +161,8 @@ HelpText<"Use register sized accesses to bit-fields, when possible.">; def relaxed_aliasing : Flag<["-"], "relaxed-aliasing">, HelpText<"Turn off Type Based Alias Analysis">; +def struct_path_tbaa : Flag<["-"], "struct-path-tbaa">, + HelpText<"Turn on struct-path aware Type Based Alias Analysis">; def masm_verbose : Flag<["-"], "masm-verbose">, HelpText<"Generate verbose assembly output">; def mcode_model : Separate<["-"], "mcode-model">, Index: include/clang/Driver/Options.td =================================================================== --- include/clang/Driver/Options.td (revision 178718) +++ include/clang/Driver/Options.td (working copy) @@ -587,6 +587,7 @@ Flags<[CC1Option]>, HelpText<"Disable spell-checking">; def fno_stack_protector : Flag<["-"], "fno-stack-protector">, Group<f_Group>; def fno_strict_aliasing : Flag<["-"], "fno-strict-aliasing">, Group<f_Group>; +def fstruct_path_tbaa : Flag<["-"], "fstruct-path-tbaa">, Group<f_Group>; def fno_strict_enums : Flag<["-"], "fno-strict-enums">, Group<f_Group>; def fno_strict_overflow : Flag<["-"], "fno-strict-overflow">, Group<f_Group>; def fno_threadsafe_statics : Flag<["-"], "fno-threadsafe-statics">, Group<f_Group>, Index: include/clang/Frontend/CodeGenOptions.def =================================================================== --- include/clang/Frontend/CodeGenOptions.def (revision 178718) +++ include/clang/Frontend/CodeGenOptions.def (working copy) @@ -85,6 +85,7 @@ VALUE_CODEGENOPT(OptimizeSize, 2, 0) ///< If -Os (==1) or -Oz (==2) is specified. CODEGENOPT(RelaxAll , 1, 0) ///< Relax all machine code instructions. CODEGENOPT(RelaxedAliasing , 1, 0) ///< Set when -fno-strict-aliasing is enabled. +CODEGENOPT(StructPathTBAA , 1, 0) ///< Whether or not to use struct-path TBAA. CODEGENOPT(SaveTempLabels , 1, 0) ///< Save temporary labels. CODEGENOPT(SanitizeAddressZeroBaseShadow , 1, 0) ///< Map shadow memory at zero ///< offset in AddressSanitizer. Index: lib/CodeGen/CGExpr.cpp =================================================================== --- lib/CodeGen/CGExpr.cpp (revision 178718) +++ lib/CodeGen/CGExpr.cpp (working copy) @@ -1044,7 +1044,8 @@ llvm::Value CodeGenFunction::EmitLoadOfScalar(LValue lvalue) { return EmitLoadOfScalar(lvalue.getAddress(), lvalue.isVolatile(), lvalue.getAlignment().getQuantity(), - lvalue.getType(), lvalue.getTBAAInfo()); + lvalue.getType(), lvalue.getTBAAInfo(), + lvalue.getTBAABaseType(), lvalue.getTBAAOffset()); } static bool hasBooleanRepresentation(QualType Ty) { @@ -1106,7 +1107,9 @@ llvm::Value CodeGenFunction::EmitLoadOfScalar(llvm::Value Addr, bool Volatile, unsigned Alignment, QualType Ty, - llvm::MDNode TBAAInfo) { + llvm::MDNode TBAAInfo, + QualType TBAABaseType, + uint64_t TBAAOffset) { // For better performance, handle vector loads differently. if (Ty->isVectorType()) { llvm::Value V; @@ -1158,8 +1161,11 @@ Load->setVolatile(true); if (Alignment) Load->setAlignment(Alignment); - if (TBAAInfo) - CGM.DecorateInstruction(Load, TBAAInfo); + if (TBAAInfo) { + llvm::MDNode TBAAPath = CGM.getTBAAStructTagInfo(TBAABaseType, TBAAInfo, + TBAAOffset); + CGM.DecorateInstruction(Load, TBAAPath); + } if ((SanOpts->Bool && hasBooleanRepresentation(Ty)) \|\| (SanOpts->Enum && Ty->getAs<EnumType>())) { @@ -1217,7 +1223,8 @@ bool Volatile, unsigned Alignment, QualType Ty, llvm::MDNode TBAAInfo, - bool isInit) { + bool isInit, QualType TBAABaseType, + uint64_t TBAAOffset) { // Handle vectors differently to get better performance. if (Ty->isVectorType()) { @@ -1268,15 +1275,19 @@ llvm::StoreInst Store = Builder.CreateStore(Value, Addr, Volatile); if (Alignment) Store->setAlignment(Alignment); - if (TBAAInfo) - CGM.DecorateInstruction(Store, TBAAInfo); + if (TBAAInfo) { + llvm::MDNode TBAAPath = CGM.getTBAAStructTagInfo(TBAABaseType, TBAAInfo, + TBAAOffset); + CGM.DecorateInstruction(Store, TBAAPath); + } } void CodeGenFunction::EmitStoreOfScalar(llvm::Value value, LValue lvalue, bool isInit) { EmitStoreOfScalar(value, lvalue.getAddress(), lvalue.isVolatile(), lvalue.getAlignment().getQuantity(), lvalue.getType(), - lvalue.getTBAAInfo(), isInit); + lvalue.getTBAAInfo(), isInit, lvalue.getTBAABaseType(), + lvalue.getTBAAOffset()); } /// EmitLoadOfLValue - Given an expression that represents a value lvalue, this @@ -2494,9 +2505,12 @@ llvm::Value addr = base.getAddress(); unsigned cvr = base.getVRQualifiers(); + bool TBAAPath = CGM.getCodeGenOpts().StructPathTBAA; if (rec->isUnion()) { // For unions, there is no pointer adjustment. assert(!type->isReferenceType() && "union has reference member"); + // TODO: handle path-aware TBAA for union. + TBAAPath = false; } else { // For structs, we GEP to the field that the record layout suggests. unsigned idx = CGM.getTypes().getCGRecordLayout(rec).getLLVMFieldNo(field); @@ -2508,6 +2522,8 @@ if (cvr & Qualifiers::Volatile) load->setVolatile(true); load->setAlignment(alignment.getQuantity()); + // Loading the reference will disable path-aware TBAA. + TBAAPath = false; if (CGM.shouldUseTBAA()) { llvm::MDNode tbaa; if (mayAlias) @@ -2541,6 +2557,16 @@ LValue LV = MakeAddrLValue(addr, type, alignment); LV.getQuals().addCVRQualifiers(cvr); + if (TBAAPath) { + const ASTRecordLayout &Layout = + getContext().getASTRecordLayout(field->getParent()); + // Set the base type to be the base type of the base LValue and + // update offset to be relative to the base type. + LV.setTBAABaseType(base.getTBAABaseType()); + LV.setTBAAOffset(base.getTBAAOffset() + + Layout.getFieldOffset(field->getFieldIndex()) / + getContext().getCharWidth()); + } // __weak attribute on a field is ignored. if (LV.getQuals().getObjCGCAttr() == Qualifiers::Weak) Index: lib/CodeGen/CGValue.h =================================================================== --- lib/CodeGen/CGValue.h (revision 178718) +++ lib/CodeGen/CGValue.h (working copy) @@ -157,6 +157,11 @@ Expr BaseIvarExp; + /// Used by struct-path-aware TBAA. + QualType TBAABaseType; + /// Offset relative to the base type. + uint64_t TBAAOffset; + /// TBAAInfo - TBAA information to attach to dereferences of this LValue. llvm::MDNode TBAAInfo; @@ -175,6 +180,10 @@ this->ImpreciseLifetime = false; this->ThreadLocalRef = false; this->BaseIvarExp = 0; + + // Initialize fields for TBAA. + this->TBAABaseType = Type; + this->TBAAOffset = 0; this->TBAAInfo = TBAAInfo; } @@ -232,6 +241,12 @@ Expr getBaseIvarExp() const { return BaseIvarExp; } void setBaseIvarExp(Expr V) { BaseIvarExp = V; } + QualType getTBAABaseType() const { return TBAABaseType; } + void setTBAABaseType(QualType T) { TBAABaseType = T; } + + uint64_t getTBAAOffset() const { return TBAAOffset; } + void setTBAAOffset(uint64_t O) { TBAAOffset = O; } + llvm::MDNode getTBAAInfo() const { return TBAAInfo; } void setTBAAInfo(llvm::MDNode N) { TBAAInfo = N; } Index: lib/CodeGen/CodeGenFunction.h =================================================================== --- lib/CodeGen/CodeGenFunction.h (revision 178718) +++ lib/CodeGen/CodeGenFunction.h (working copy) @@ -2211,7 +2211,9 @@ /// the LLVM value representation. llvm::Value EmitLoadOfScalar(llvm::Value Addr, bool Volatile, unsigned Alignment, QualType Ty, - llvm::MDNode TBAAInfo = 0); + llvm::MDNode TBAAInfo = 0, + QualType TBAABaseTy = QualType(), + uint64_t TBAAOffset = 0); /// EmitLoadOfScalar - Load a scalar value from an address, taking /// care to appropriately convert from the memory representation to @@ -2224,7 +2226,9 @@ /// the LLVM value representation. void EmitStoreOfScalar(llvm::Value Value, llvm::Value Addr, bool Volatile, unsigned Alignment, QualType Ty, - llvm::MDNode TBAAInfo = 0, bool isInit=false); + llvm::MDNode TBAAInfo = 0, bool isInit = false, + QualType TBAABaseTy = QualType(), + uint64_t TBAAOffset = 0); /// EmitStoreOfScalar - Store a scalar value to an address, taking /// care to appropriately convert from the memory representation to Index: lib/CodeGen/CodeGenModule.cpp =================================================================== --- lib/CodeGen/CodeGenModule.cpp (revision 178718) +++ lib/CodeGen/CodeGenModule.cpp (working copy) @@ -227,6 +227,20 @@ return TBAA->getTBAAStructInfo(QTy); } +llvm::MDNode CodeGenModule::getTBAAStructTypeInfo(QualType QTy) { + if (!TBAA) + return 0; + return TBAA->getTBAAStructTypeInfo(QTy); +} + +llvm::MDNode CodeGenModule::getTBAAStructTagInfo(QualType BaseTy, + llvm::MDNode AccessN, + uint64_t O) { + if (!TBAA) + return 0; + return TBAA->getTBAAStructTagInfo(BaseTy, AccessN, O); +} + void CodeGenModule::DecorateInstruction(llvm::Instruction Inst, llvm::MDNode TBAAInfo) { Inst->setMetadata(llvm::LLVMContext::MD_tbaa, TBAAInfo); Index: lib/CodeGen/CodeGenModule.h =================================================================== --- lib/CodeGen/CodeGenModule.h (revision 178718) +++ lib/CodeGen/CodeGenModule.h (working copy) @@ -501,6 +501,11 @@ llvm::MDNode getTBAAInfo(QualType QTy); llvm::MDNode getTBAAInfoForVTablePtr(); llvm::MDNode getTBAAStructInfo(QualType QTy); + /// Return the MDNode in the type DAG for the given struct type. + llvm::MDNode getTBAAStructTypeInfo(QualType QTy); + /// Return the path-aware tag for given base type, access node and offset. + llvm::MDNode getTBAAStructTagInfo(QualType BaseTy, llvm::MDNode AccessN, + uint64_t O); bool isTypeConstant(QualType QTy, bool ExcludeCtorDtor); Index: lib/CodeGen/CodeGenTBAA.cpp =================================================================== --- lib/CodeGen/CodeGenTBAA.cpp (revision 178718) +++ lib/CodeGen/CodeGenTBAA.cpp (working copy) @@ -21,6 +21,7 @@ #include "clang/AST/Mangle.h" #include "clang/AST/RecordLayout.h" #include "clang/Frontend/CodeGenOptions.h" +#include "llvm/ADT/SmallSet.h" #include "llvm/IR/Constants.h" #include "llvm/IR/LLVMContext.h" #include "llvm/IR/Metadata.h" @@ -225,3 +226,87 @@ // For now, handle any other kind of type conservatively. return StructMetadataCache[Ty] = NULL; } + +/// Check if the given type can be handled by path-aware TBAA. +static bool isTBAAPathStruct(QualType QTy) { + if (const RecordType TTy = QTy->getAs<RecordType>()) { + const RecordDecl RD = TTy->getDecl()->getDefinition(); + // RD can be struct, union, class, interface or enum. + // For now, we only handle struct. + if (RD->isStruct() && !RD->hasFlexibleArrayMember()) + return true; + } + return false; +} + +llvm::MDNode * +CodeGenTBAA::getTBAAStructTypeInfo(QualType QTy) { + const Type Ty = Context.getCanonicalType(QTy).getTypePtr(); + assert(isTBAAPathStruct(QTy)); + + if (llvm::MDNode N = StructTypeMetadataCache[Ty]) + return N; + + if (const RecordType TTy = QTy->getAs<RecordType>()) { + const RecordDecl RD = TTy->getDecl()->getDefinition(); + + const ASTRecordLayout &Layout = Context.getASTRecordLayout(RD); + SmallVector <std::pair<uint64_t, llvm::MDNode>, 4> Fields; + // To reduce the size of MDNode for a given struct type, we only output + // once for all the fields with the same scalar types. + // Offsets for scalar fields in the type DAG are not used. + llvm::SmallSet <llvm::MDNode, 4> ScalarFieldTypes; + unsigned idx = 0; + for (RecordDecl::field_iterator i = RD->field_begin(), + e = RD->field_end(); i != e; ++i, ++idx) { + QualType FieldQTy = i->getType(); + llvm::MDNode FieldNode; + if (isTBAAPathStruct(FieldQTy)) + FieldNode = getTBAAStructTypeInfo(FieldQTy); + else { + FieldNode = getTBAAInfo(FieldQTy); + // Ignore this field if the type already exists. + if (ScalarFieldTypes.count(FieldNode)) + continue; + ScalarFieldTypes.insert(FieldNode); + } + if (!FieldNode) + return StructTypeMetadataCache[Ty] = NULL; + Fields.push_back(std::make_pair( + Layout.getFieldOffset(idx) / Context.getCharWidth(), FieldNode)); + } + + // TODO: This is using the RTTI name. Is there a better way to get + // a unique string for a type? + SmallString<256> OutName; + llvm::raw_svector_ostream Out(OutName); + MContext.mangleCXXRTTIName(QualType(Ty, 0), Out); + Out.flush(); + // Create the struct type node with a vector of pairs (offset, type). + return StructTypeMetadataCache[Ty] = + MDHelper.createTBAAStructTypeNode(OutName, Fields); + } + + return StructMetadataCache[Ty] = NULL; +} + +llvm::MDNode +CodeGenTBAA::getTBAAStructTagInfo(QualType BaseQTy, llvm::MDNode AccessNode, + uint64_t Offset) { + if (!CodeGenOpts.StructPathTBAA) + return AccessNode; + + const Type BTy = Context.getCanonicalType(BaseQTy).getTypePtr(); + TBAAPathTag PathTag = TBAAPathTag(BTy, AccessNode, Offset); + if (llvm::MDNode N = StructTagMetadataCache[PathTag]) + return N; + + llvm::MDNode BNode = 0; + if (isTBAAPathStruct(BaseQTy)) + BNode = getTBAAStructTypeInfo(BaseQTy); + if (!BNode) + return StructTagMetadataCache[PathTag] = AccessNode; + + return StructTagMetadataCache[PathTag] = + MDHelper.createTBAAStructTagNode(BNode, AccessNode, Offset); +} Index: lib/CodeGen/CodeGenTBAA.h =================================================================== --- lib/CodeGen/CodeGenTBAA.h (revision 178718) +++ lib/CodeGen/CodeGenTBAA.h (working copy) @@ -35,6 +35,14 @@ namespace CodeGen { class CGRecordLayout; + struct TBAAPathTag { + TBAAPathTag(const Type B, const llvm::MDNode A, uint64_t O) + : BaseT(B), AccessN(A), Offset(O) {} + const Type BaseT; + const llvm::MDNode AccessN; + uint64_t Offset; + }; + /// CodeGenTBAA - This class organizes the cross-module state that is used /// while lowering AST types to LLVM types. class CodeGenTBAA { @@ -46,8 +54,13 @@ // MDHelper - Helper for creating metadata. llvm::MDBuilder MDHelper; - /// MetadataCache - This maps clang::Types to llvm::MDNodes describing them. + /// MetadataCache - This maps clang::Types to scalar llvm::MDNodes describing + /// them. llvm::DenseMap<const Type , llvm::MDNode > MetadataCache; + /// This maps clang::Types to a struct node in the type DAG. + llvm::DenseMap<const Type , llvm::MDNode > StructTypeMetadataCache; + /// This maps TBAAPathTags to a tag node. + llvm::DenseMap<TBAAPathTag, llvm::MDNode > StructTagMetadataCache; /// StructMetadataCache - This maps clang::Types to llvm::MDNodes describing /// them for struct assignments. @@ -89,9 +102,49 @@ /// getTBAAStructInfo - Get the TBAAStruct MDNode to be used for a memcpy of /// the given type. llvm::MDNode getTBAAStructInfo(QualType QTy); + + /// Get the MDNode in the type DAG for given struct type QType. + llvm::MDNode getTBAAStructTypeInfo(QualType QType); + /// Get the tag MDNode for a given base type, the actual sclar access MDNode + /// and offset into the base type. + llvm::MDNode getTBAAStructTagInfo(QualType BaseQType, + llvm::MDNode AccessNode, uint64_t Offset); }; } // end namespace CodeGen } // end namespace clang +namespace llvm { + +template<> struct DenseMapInfo<clang::CodeGen::TBAAPathTag> { + static clang::CodeGen::TBAAPathTag getEmptyKey() { + return clang::CodeGen::TBAAPathTag( + DenseMapInfo<const clang::Type >::getEmptyKey(), + DenseMapInfo<const MDNode >::getEmptyKey(), + DenseMapInfo<uint64_t>::getEmptyKey()); + } + + static clang::CodeGen::TBAAPathTag getTombstoneKey() { + return clang::CodeGen::TBAAPathTag( + DenseMapInfo<const clang::Type >::getTombstoneKey(), + DenseMapInfo<const MDNode >::getTombstoneKey(), + DenseMapInfo<uint64_t>::getTombstoneKey()); + } + + static unsigned getHashValue(const clang::CodeGen::TBAAPathTag &Val) { + return DenseMapInfo<const clang::Type >::getHashValue(Val.BaseT) ^ + DenseMapInfo<const MDNode >::getHashValue(Val.AccessN) ^ + DenseMapInfo<uint64_t>::getHashValue(Val.Offset); + } + + static bool isEqual(const clang::CodeGen::TBAAPathTag &LHS, + const clang::CodeGen::TBAAPathTag &RHS) { + return LHS.BaseT == RHS.BaseT && + LHS.AccessN == RHS.AccessN && + LHS.Offset == RHS.Offset; + } +}; + +} // end namespace llvm + #endif Index: lib/Driver/Tools.cpp =================================================================== --- lib/Driver/Tools.cpp (revision 178718) +++ lib/Driver/Tools.cpp (working copy) @@ -2105,6 +2105,8 @@ options::OPT_fno_strict_aliasing, getToolChain().IsStrictAliasingDefault())) CmdArgs.push_back("-relaxed-aliasing"); + if (Args.hasArg(options::OPT_fstruct_path_tbaa)) + CmdArgs.push_back("-struct-path-tbaa"); if (Args.hasFlag(options::OPT_fstrict_enums, options::OPT_fno_strict_enums, false)) CmdArgs.push_back("-fstrict-enums"); Index: lib/Frontend/CompilerInvocation.cpp =================================================================== --- lib/Frontend/CompilerInvocation.cpp (revision 178718) +++ lib/Frontend/CompilerInvocation.cpp (working copy) @@ -324,6 +324,7 @@ Opts.UseRegisterSizedBitfieldAccess = Args.hasArg( OPT_fuse_register_sized_bitfield_access); Opts.RelaxedAliasing = Args.hasArg(OPT_relaxed_aliasing); + Opts.StructPathTBAA = Args.hasArg(OPT_struct_path_tbaa); Opts.DwarfDebugFlags = Args.getLastArgValue(OPT_dwarf_debug_flags); Opts.MergeAllConstants = !Args.hasArg(OPT_fno_merge_all_constants); Opts.NoCommon = Args.hasArg(OPT_fno_common); Index: test/CodeGen/tbaa.cpp =================================================================== --- test/CodeGen/tbaa.cpp (revision 0) +++ test/CodeGen/tbaa.cpp (working copy) @@ -0,0 +1,217 @@ +// RUN: %clang_cc1 -O1 -disable-llvm-optzns %s -emit-llvm -o - \| FileCheck %s +// RUN: %clang_cc1 -O1 -struct-path-tbaa -disable-llvm-optzns %s -emit-llvm -o - \| FileCheck %s -check-prefix=PATH +// Test TBAA metadata generated by front-end. + +#include <stdint.h> +typedef struct +{ + uint16_t f16; + uint32_t f32; + uint16_t f16_2; + uint32_t f32_2; +} StructA; +typedef struct +{ + uint16_t f16; + StructA a; + uint32_t f32; +} StructB; +typedef struct +{ + uint16_t f16; + StructB b; + uint32_t f32; +} StructC; +typedef struct +{ + uint16_t f16; + StructB b; + uint32_t f32; + uint8_t f8; +} StructD; + +typedef struct +{ + uint16_t f16; + uint32_t f32; +} StructS; +typedef struct +{ + uint16_t f16; + uint32_t f32; +} StructS2; + +uint32_t g(uint32_t s, StructA A, uint64_t count) { +// CHECK: define i32 @{{.}}( +// CHECK: store i32 1, i32* %{{.}}, align 4, !tbaa !4 +// CHECK: store i32 4, i32 %{{.}}, align 4, !tbaa !4 +// PATH: define i32 @{{.}}( +// PATH: store i32 1, i32* %{{.}}, align 4, !tbaa !4 +// PATH: store i32 4, i32 %{{.}}, align 4, !tbaa !5 + s = 1; + A->f32 = 4; + return s; +} + +uint32_t g2(uint32_t s, StructA A, uint64_t count) { +// CHECK: define i32 @{{.}}( +// CHECK: store i32 1, i32* %{{.}}, align 4, !tbaa !4 +// CHECK: store i16 4, i16 %{{.}}, align 2, !tbaa !5 +// PATH: define i32 @{{.}}( +// PATH: store i32 1, i32* %{{.}}, align 4, !tbaa !4 +// PATH: store i16 4, i16 %{{.}}, align 2, !tbaa !8 + s = 1; + A->f16 = 4; + return s; +} + +uint32_t g3(StructA A, StructB B, uint64_t count) { +// CHECK: define i32 @{{.}}( +// CHECK: store i32 1, i32* %{{.}}, align 4, !tbaa !4 +// CHECK: store i32 4, i32 %{{.}}, align 4, !tbaa !4 +// PATH: define i32 @{{.}}( +// PATH: store i32 1, i32* %{{.}}, align 4, !tbaa !5 +// PATH: store i32 4, i32 %{{.}}, align 4, !tbaa !9 + A->f32 = 1; + B->a.f32 = 4; + return A->f32; +} + +uint32_t g4(StructA A, StructB B, uint64_t count) { +// CHECK: define i32 @{{.}}( +// CHECK: store i32 1, i32* %{{.}}, align 4, !tbaa !4 +// CHECK: store i16 4, i16 %{{.}}, align 2, !tbaa !5 +// PATH: define i32 @{{.}}( +// PATH: store i32 1, i32* %{{.}}, align 4, !tbaa !5 +// PATH: store i16 4, i16 %{{.}}, align 2, !tbaa !11 + A->f32 = 1; + B->a.f16 = 4; + return A->f32; +} + +uint32_t g5(StructA A, StructB B, uint64_t count) { +// CHECK: define i32 @{{.}}( +// CHECK: store i32 1, i32* %{{.}}, align 4, !tbaa !4 +// CHECK: store i32 4, i32 %{{.}}, align 4, !tbaa !4 +// PATH: define i32 @{{.}}( +// PATH: store i32 1, i32* %{{.}}, align 4, !tbaa !5 +// PATH: store i32 4, i32 %{{.}}, align 4, !tbaa !12 + A->f32 = 1; + B->f32 = 4; + return A->f32; +} + +uint32_t g6(StructA A, StructB B, uint64_t count) { +// CHECK: define i32 @{{.}}( +// CHECK: store i32 1, i32* %{{.}}, align 4, !tbaa !4 +// CHECK: store i32 4, i32 %{{.}}, align 4, !tbaa !4 +// PATH: define i32 @{{.}}( +// PATH: store i32 1, i32* %{{.}}, align 4, !tbaa !5 +// PATH: store i32 4, i32 %{{.}}, align 4, !tbaa !13 + A->f32 = 1; + B->a.f32_2 = 4; + return A->f32; +} + +uint32_t g7(StructA A, StructS S, uint64_t count) { +// CHECK: define i32 @{{.}}( +// CHECK: store i32 1, i32* %{{.}}, align 4, !tbaa !4 +// CHECK: store i32 4, i32 %{{.}}, align 4, !tbaa !4 +// PATH: define i32 @{{.}}( +// PATH: store i32 1, i32* %{{.}}, align 4, !tbaa !5 +// PATH: store i32 4, i32 %{{.}}, align 4, !tbaa !14 + A->f32 = 1; + S->f32 = 4; + return A->f32; +} + +uint32_t g8(StructA A, StructS S, uint64_t count) { +// CHECK: define i32 @{{.}}( +// CHECK: store i32 1, i32* %{{.}}, align 4, !tbaa !4 +// CHECK: store i16 4, i16 %{{.}}, align 2, !tbaa !5 +// PATH: define i32 @{{.}}( +// PATH: store i32 1, i32* %{{.}}, align 4, !tbaa !5 +// PATH: store i16 4, i16 %{{.}}, align 2, !tbaa !16 + A->f32 = 1; + S->f16 = 4; + return A->f32; +} + +uint32_t g9(StructS S, StructS2 S2, uint64_t count) { +// CHECK: define i32 @{{.}}( +// CHECK: store i32 1, i32* %{{.}}, align 4, !tbaa !4 +// CHECK: store i32 4, i32 %{{.}}, align 4, !tbaa !4 +// PATH: define i32 @{{.}}( +// PATH: store i32 1, i32* %{{.}}, align 4, !tbaa !14 +// PATH: store i32 4, i32 %{{.}}, align 4, !tbaa !17 + S->f32 = 1; + S2->f32 = 4; + return S->f32; +} + +uint32_t g10(StructS S, StructS2 S2, uint64_t count) { +// CHECK: define i32 @{{.}}( +// CHECK: store i32 1, i32* %{{.}}, align 4, !tbaa !4 +// CHECK: store i16 4, i16 %{{.}}, align 2, !tbaa !5 +// PATH: define i32 @{{.}}( +// PATH: store i32 1, i32* %{{.}}, align 4, !tbaa !14 +// PATH: store i16 4, i16 %{{.}}, align 2, !tbaa !19 + S->f32 = 1; + S2->f16 = 4; + return S->f32; +} + +uint32_t g11(StructC C, StructD D, uint64_t count) { +// CHECK: define i32 @{{.}}( +// CHECK: store i32 1, i32* %{{.}}, align 4, !tbaa !4 +// CHECK: store i32 4, i32 %{{.}}, align 4, !tbaa !4 +// PATH: define i32 @{{.}}( +// PATH: store i32 1, i32* %{{.}}, align 4, !tbaa !20 +// PATH: store i32 4, i32 %{{.}}, align 4, !tbaa !22 + C->b.a.f32 = 1; + D->b.a.f32 = 4; + return C->b.a.f32; +} + +uint32_t g12(StructC C, StructD D, uint64_t count) { +// CHECK: define i32 @{{.}}( +// CHECK: store i32 1, i32* %{{.}}, align 4, !tbaa !4 +// CHECK: store i32 4, i32 %{{.}}, align 4, !tbaa !4 +// TODO: differentiate the two accesses. +// PATH: define i32 @{{.}}( +// PATH: store i32 1, i32* %{{.}}, align 4, !tbaa !9 +// PATH: store i32 4, i32 %{{.}}, align 4, !tbaa !9 + StructB b1 = &(C->b); + StructB *b2 = &(D->b); + // b1, b2 have different context. + b1->a.f32 = 1; + b2->a.f32 = 4; + return b1->a.f32; +} + +// CHECK: !1 = metadata !{metadata !"omnipotent char", metadata !2} +// CHECK: !2 = metadata !{metadata !"Simple C/C++ TBAA"} +// CHECK: !4 = metadata !{metadata !"int", metadata !1} +// CHECK: !5 = metadata !{metadata !"short", metadata !1} + +// PATH: !1 = metadata !{metadata !"omnipotent char", metadata !2} +// PATH: !4 = metadata !{metadata !"int", metadata !1} +// PATH: !5 = metadata !{metadata !6, metadata !4, i64 4} +// PATH: !6 = metadata !{metadata !"_ZTS7StructA", i64 0, metadata !7, i64 4, metadata !4} +// PATH: !7 = metadata !{metadata !"short", metadata !1} +// PATH: !8 = metadata !{metadata !6, metadata !7, i64 0} +// PATH: !9 = metadata !{metadata !10, metadata !4, i64 8} +// PATH: !10 = metadata !{metadata !"_ZTS7StructB", i64 0, metadata !7, i64 4, metadata !6, i64 20, metadata !4} +// PATH: !11 = metadata !{metadata !10, metadata !7, i64 4} +// PATH: !12 = metadata !{metadata !10, metadata !4, i64 20} +// PATH: !13 = metadata !{metadata !10, metadata !4, i64 16} +// PATH: !14 = metadata !{metadata !15, metadata !4, i64 4} +// PATH: !15 = metadata !{metadata !"_ZTS7StructS", i64 0, metadata !7, i64 4, metadata !4} +// PATH: !16 = metadata !{metadata !15, metadata !7, i64 0} +// PATH: !17 = metadata !{metadata !18, metadata !4, i64 4} +// PATH: !18 = metadata !{metadata !"_ZTS8StructS2", i64 0, metadata !7, i64 4, metadata !4} +// PATH: !19 = metadata !{metadata !18, metadata !7, i64 0} +// PATH: !20 = metadata !{metadata !21, metadata !4, i64 12} +// PATH: !21 = metadata !{metadata !"_ZTS7StructC", i64 0, metadata !7, i64 4, metadata !10, i64 28, metadata !4} +// PATH: !22 = metadata !{metadata !23, metadata !4, i64 12} +// PATH: !23 = metadata !{metadata !"_ZTS7StructD", i64 0, metadata !7, i64 4, metadata !10, i64 28, metadata !4, i64 32, metadata !1} llvm-svn: 178784	2013-04-04 20:14:17 +00:00
Adrian Prantl	5d5b67c52c	* Attempt to un-break gdb buildbot by emitting a lexical block end only when we actually end a lexical block. * Added new test for line table / block cleanup. * Follow-up to r177819 / rdar://problem/13115369 llvm-svn: 178490	2013-04-01 19:02:06 +00:00
Nadav Rotem	1da30944a6	Make clang to mark static stack allocations with lifetime markers to enable a more aggressive stack coloring. Patch by John McCall with help by Shuxin Yang. rdar://13115369 llvm-svn: 177819	2013-03-23 06:43:35 +00:00
John McCall	eff1884274	Under ARC, when we're passing the address of a strong variable to an out-parameter using the indirect-writeback conversion, and we copied the current value of the variable to the temporary, make sure that we register an intrinsic use of that value with the optimizer so that the value won't get released until we have a chance to retain it. rdar://13195034 llvm-svn: 177813	2013-03-23 02:35:54 +00:00
Manman Ren	0175461296	Exploit this-return of a callsite in a this-return function. For constructors/desctructors that return 'this', if there exists a callsite that returns 'this' and is immediately before the return instruction, make sure we are using the return value from the callsite. We don't need to keep 'this' alive through the callsite. It also enables optimizations in the backend, such as tail call optimization. Updated from r177211. rdar://12818789 llvm-svn: 177541	2013-03-20 16:59:38 +00:00
Manman Ren	c089074aa5	revert r177211 due to its potential issues llvm-svn: 177222	2013-03-16 04:47:38 +00:00
Manman Ren	58dd990c11	Exploit this-return of a callsite in a this-return function. For constructors/desctructors that return 'this', if there exists a callsite that returns 'this' and is immediately before the return instruction, make sure we are using the return value from the callsite. We don't need to keep 'this' alive through the callsite. It also enables optimizations in the backend, such as tail call optimization. rdar://12818789 llvm-svn: 177211	2013-03-16 00:11:09 +00:00
John McCall	cdda29c968	Tighten up the rules for precise lifetime and document the requirements on the ARC optimizer. rdar://13407451 llvm-svn: 176924	2013-03-13 03:10:54 +00:00
Joey Gouly	aba589cceb	Add support for the OpenCL attribute 'vec_type_hint'. Patch by Murat Bolat! llvm-svn: 176686	2013-03-08 09:42:32 +00:00
John McCall	a8ec7eb9cf	Promote atomic type sizes up to a power of two, capped by MaxAtomicPromoteWidth. Fix a ton of terrible bugs with _Atomic types and (non-intrinsic-mediated) loads and stores thereto. llvm-svn: 176658	2013-03-07 21:37:17 +00:00
John McCall	47fb950871	Change hasAggregateLLVMType, which conflates complex and aggregate types in a profoundly wrong way that has to be worked around in every call site, to getEvaluationKind, which classifies and distinguishes between all of these cases. Also, normalize the API for loading and storing complexes. I'm working on a larger patch and wanted to pull these changes out, but it would have be annoying to detangle them from each other. llvm-svn: 176656	2013-03-07 21:37:08 +00:00
John McCall	e739a49325	Restore order to placate test. I had no real reason to switch them. llvm-svn: 176328	2013-03-01 01:38:54 +00:00
John McCall	07e60263dd	Re-use bit from superclass and extract stuff into a local function. Serves a patch we're kicking around out-of-tree. llvm-svn: 176327	2013-03-01 01:24:35 +00:00
John McCall	882987f30c	Use the actual ABI-determined C calling convention for runtime calls and declarations. LLVM has a default CC determined by the target triple. This is not always the actual default CC for the ABI we've been asked to target, and so we sometimes find ourselves annotating all user functions with an explicit calling convention. Since these calling conventions usually agree for the simple set of argument types passed to most runtime functions, using the LLVM-default CC in principle has no effect. However, the LLVM optimizer goes into histrionics if it sees this kind of formal CC mismatch, since it has no concept of CC compatibility. Therefore, if this module happens to define the "runtime" function, or got LTO'ed with such a definition, we can miscompile; so it's quite important to get this right. Defining runtime functions locally is quite common in embedded applications. llvm-svn: 176286	2013-02-28 19:01:20 +00:00
Timur Iskhodzhanov	57cbe5c790	Better support for constructors with -cxx-abi microsoft, partly fixes PR12784 llvm-svn: 176186	2013-02-27 13:46:31 +00:00
Richard Smith	539e4a77bb	ubsan: Emit bounds checks for array indexing, vector indexing, and (in really simple cases) pointer arithmetic. This augments the existing bounds checking with language-level array bounds information. llvm-svn: 175949	2013-02-23 02:53:19 +00:00
Lang Hames	bf122744e5	Re-apply r174919 - smarter copy/move assignment/construction, with fixes for bitfield related issues. The original commit broke Takumi's builder. The bug was caused by bitfield sizes being determined by their underlying type, rather than the field info. A similar issue with bitfield alignments showed up on closer testing. Both have been fixed in this patch. llvm-svn: 175389	2013-02-17 07:22:09 +00:00
Richard Smith	2c5868c334	ubsan: Add checking for invalid downcasts. Per [expr.static.cast]p2 and p11, base-to-derived casts have undefined behavior if the object is not actually an instance of the derived type. llvm-svn: 175078	2013-02-13 21:18:23 +00:00
Timur Iskhodzhanov	ee6bc53365	Emit virtual/deleting destructors properly with -cxx-abi microsoft, PR15058 llvm-svn: 175045	2013-02-13 08:37:51 +00:00
Lang Hames	697b004219	Backing out r174919 while I investigate a self-host bug on Takumi's builder. llvm-svn: 174925	2013-02-12 00:44:43 +00:00
Lang Hames	5824a4f1b0	When generating IR for default copy-constructors, copy-assignment operators, move-constructors and move-assignment operators, use memcpy to copy adjacent POD members. Previously, classes with one or more Non-POD members would fall back on element-wise copies for all members, including POD members. This often generated a lot of IR. Without padding metadata, it wasn't often possible for the LLVM optimizers to turn the element-wise copies into a memcpy. This code hasn't yet received any serious tuning. I didn't see any serious regressions on a self-hosted clang build, or any of the nightly tests, but I think it's important to get this out in the wild to get more testing. Insights, feedback and comments welcome. Many thanks to David Blaikie, Richard Smith, and especially John McCall for their help and feedback on this work. llvm-svn: 174919	2013-02-11 23:44:11 +00:00
Arnaud A. de Grandmaison	49c04467ea	Fix typo in comment llvm-svn: 174359	2013-02-05 09:06:17 +00:00
David Blaikie	357aafb566	Fix exception handling line table problems introduced by r173593 r173593 made us a little too eager to associate all code at the end of a function with the user-written 'return' line. This caused problems with breakpoints as they'd be set in exception handling code preceeding the actual non-exception return handling code, leading to the breakpoint never being hit in non-exceptional execution. This change restores the pre-r173593 exception handling line information where the cleanup code is associated with the '}' not the return line. llvm-svn: 174206	2013-02-01 19:09:49 +00:00
John McCall	12cc42aa1b	Destroy arrays and ARC fields when throwing out of ctors. Previously we were only handling non-array fields of class type. Testcases derived from a patch by WenHan Gu. llvm-svn: 174146	2013-02-01 05:11:40 +00:00
Douglas Gregor	6153500517	When we're emitting a constructor or destructor call from a delegating constructor, retrieve our VTT parameter directly. Fixes PR14588 / <rdar://problem/12867962>. llvm-svn: 174042	2013-01-31 05:50:40 +00:00
Chad Rosier	ae229d599b	[ubsan] Implement the -fcatch-undefined-behavior flag using a trapping implementation; this is much more inline with the original implementation (i.e., pre-ubsan) and does not require run-time library support. The trapping implementation can be invoked using either '-fcatch-undefined-behavior' or '-fsanitize=undefined-trap -fsanitize-undefined-trap-on-error', with the latter being preferred. Eventually, the -fcatch-undefined-behavior' flag will be removed. llvm-svn: 173848	2013-01-29 23:31:22 +00:00
David Blaikie	0a21d0da17	PR14566: Debug Info: avoid top level lexical blocks in functions One of the gotchas (see changes to CodeGenFunction) was due to the fix in r139416 (for PR10829). This only worked previously because the top level lexical block would set the location to the end of the function, the debug location would be updated (as per r139416), the location would be set to the end of the function again (but that would no-op, since it was the same as the previous location), then the return instruction would be emitted using the debug location. Once the top level lexical block was no longer emitted, the end-of-function location change was causing the debug loc to be updated, regressing that bug. llvm-svn: 173593	2013-01-26 22:16:26 +00:00
Fariborz Jahanian	7865220da4	patch for PR9027 and // rdar://11861085 Title: [PR9027] volatile struct bug: member is not loaded at -O; This is caused by last flag passed to @llvm.memcpy being false, not honoring that aggregate has at least one 'volatile' data member (even though aggregate itself has not been qualified as 'volatile'. As a result, optimization optimizes away the memcpy altogether. Patch review by John MaCall (I still need to fix up a test though). llvm-svn: 173535	2013-01-25 23:57:05 +00:00
Will Dietz	f54319c891	[ubsan] Add support for -fsanitize-blacklist llvm-svn: 172808	2013-01-18 11:30:38 +00:00
Dmitri Gribenko	f857950d39	Remove useless 'llvm::' qualifier from names like StringRef and others that are brought into 'clang' namespace by clang/Basic/LLVM.h llvm-svn: 172323	2013-01-12 19:30:44 +00:00
Eli Friedman	33accdf602	Don't assert/crash on reference variables in lambdas bound to a static local variable from the parent scope. PR14773. llvm-svn: 171433	2013-01-03 00:39:26 +00:00
Chandler Carruth	3a02247dc9	Sort all of Clang's files under 'lib', and fix up the broken headers uncovered. This required manually correcting all of the incorrect main-module headers I could find, and running the new llvm/utils/sort_includes.py script over the files. I also manually added quite a few missing headers that were uncovered by shuffling the order or moving headers up to be main-module-headers. llvm-svn: 169237	2012-12-04 09:13:33 +00:00
Will Dietz	88e0233ff4	[ubsan] Add flag to enable recovery from checks when possible. llvm-svn: 169114	2012-12-02 19:50:33 +00:00
David Chisnall	9a837be2b9	Fix the Objective-C exception rethrow from cleanups (GNU runtimes). Note that a bug in the inliner still causes the wrong thing to happen at -O2 and above (PR14116). llvm-svn: 167534	2012-11-07 16:50:40 +00:00
Richard Smith	b1b0ab41e7	Use the individual -fsanitize=<...> arguments to control which of the UBSan checks to enable. Remove frontend support for -fcatch-undefined-behavior, -faddress-sanitizer and -fthread-sanitizer now that they don't do anything. llvm-svn: 167413	2012-11-05 22:21:05 +00:00
Richard Smith	de67068fc1	Split emission of -ftrapv checks and -fcatch-undefined-behavior checks into separate functions, since they share essentially no code. llvm-svn: 167259	2012-11-01 22:15:34 +00:00
Richard Smith	4d3110af06	-fcatch-undefined-behavior checking for appropriate vptr value: Clang CodeGen side. llvm-svn: 166661	2012-10-25 02:14:12 +00:00
John McCall	e68b8f4dcc	At -O0, prefer objc_storeStrong with a null new value to the combination of a load+objc_release; this is generally better for tools that try to track why values are retained and released. Also use objc_storeStrong when copying a block (again, only at -O0), which requires us to do a preliminary store of null in order to compensate for objc_storeStrong's assign semantics. llvm-svn: 166085	2012-10-17 02:28:37 +00:00
Alexey Samsonov	38e2496497	Transform pattern: if (CGM.getModuleDebugInfo()) DebugInfo = CGM.getModuleDebugInfo() into a call: maybeInitializeDebugInfo(); This is a simplification for a possible future fix of PR13942. llvm-svn: 166019	2012-10-16 07:22:28 +00:00
Nico Weber	cf4ff586e8	Add codegen support for __uuidof(). llvm-svn: 165710	2012-10-11 10:13:44 +00:00
Richard Smith	e30752c93b	-fcatch-undefined-behavior: emit calls to the runtime library whenever one of the checks fails. llvm-svn: 165536	2012-10-09 19:52:38 +00:00
Benjamin Kramer	1ca66919a5	CodeGen: Copy tail padding when we're not dealing with a trivial copy assign or move assign operator. This fixes a regression from r162254, the optimizer has problems reasoning about the smaller memcpy as it's often not safe to widen a store but making it smaller is. llvm-svn: 164917	2012-09-30 12:43:37 +00:00
Sylvestre Ledru	33b5baf189	Revert 'Fix a typo 'iff' => 'if''. iff is an abreviation of if and only if. See: http://en.wikipedia.org/wiki/If_and_only_if Commit 164766 llvm-svn: 164769	2012-09-27 10:16:10 +00:00
Sylvestre Ledru	a876013dc9	Fix a typo 'iff' => 'if' llvm-svn: 164766	2012-09-27 09:57:10 +00:00
Dmitri Gribenko	a664e5b88f	Use LLVM_DELETED_FUNCTION in place of 'DO NOT IMPLEMENT' comments. llvm-svn: 163983	2012-09-15 20:20:27 +00:00
Richard Smith	4d1458ed38	-fcatch-undefined-behavior: Factor emission of the creation of, and branch to, the trap BB out of the individual checks and into a common function, to prepare for making this code call into a runtime library. Rename the existing EmitCheck to EmitTypeCheck to clarify it and to move it out of the way of the new EmitCheck. llvm-svn: 163451	2012-09-08 02:08:36 +00:00
Chad Rosier	649dfc317d	[ms-inline asm] Have MSAsmStmts use the generic EmitAsmStmt codegen function. llvm-svn: 162796	2012-08-28 21:11:24 +00:00
Chad Rosier	6051bb94c0	[ms-inline asm] Rename EmitGCCAsmStmt to EmitAsmStmt and have it accept AsmStmts. This function is only used by GCCAsmStmts, however. Constraints need to be properly computed before MSAsmStmts can use EmitAsmStmt. No functional change intended. llvm-svn: 162776	2012-08-28 18:54:39 +00:00
Chad Rosier	de70e0ef45	[ms-inline asm] As part of a larger refactoring, rename AsmStmt to GCCAsmStmt. No functional change intended. llvm-svn: 162632	2012-08-25 00:11:56 +00:00
Richard Smith	69d0d2626a	New -fcatch-undefined-behavior features: * when checking that a pointer or reference refers to appropriate storage for a type, also check the alignment and perform a null check * check that references are bound to appropriate storage * check that 'this' has appropriate storage in member accesses and member function calls llvm-svn: 162523	2012-08-24 00:54:33 +00:00
Chad Rosier	59df25b659	[ms-inline asm] Remove an unused argument. This logic can now be reused by the ms-style inline asms. llvm-svn: 162463	2012-08-23 20:00:18 +00:00
Dmitri Gribenko	adba9be7c5	Fix a bunch of -Wdocumentation warnings. llvm-svn: 162452	2012-08-23 17:58:28 +00:00
Eli Friedman	a5dd5684dc	Use the alignment from lvalue emission to more accurately compute the alignment of a pointer for builtin emission, instead of just depending on the type of the pointee. <rdar://problem/11314941>. llvm-svn: 162425	2012-08-23 03:10:17 +00:00
Eli Friedman	f6d2184c83	Fix an assertion failure with a C++ constructor initializing a member of reference type in an anonymous struct. PR13154. llvm-svn: 161473	2012-08-08 03:51:37 +00:00
Richard Trieu	c320c745cc	Change APInt to APSInt in one instance. Also change a call to operator==() to APSInt::isSameValue() when comparing different sized APSInt's. llvm-svn: 160641	2012-07-23 20:21:35 +00:00
Simon Atanasyan	94a6d863a9	Revert commit r160308. We decide to move builtins selection to the backend. llvm-svn: 160353	2012-07-17 08:15:06 +00:00
Simon Atanasyan	a06d06b660	MIPS: Implement __builtin_mips_shll_qb builtin function overloading. This function has two versions. The first one is used for a register operand. The second one is used for an immediate number. llvm-svn: 160308	2012-07-16 18:52:02 +00:00
Eric Christopher	f8b9809fab	Temporarily revert this to see if it brings the gdb bot back. llvm-svn: 160049	2012-07-11 15:32:13 +00:00
Eric Christopher	2977378974	The end of a block doesn't necessarily need a line table entry unless there's something going on there. Remove the unconditional line entry and only add one if we're emitting cleanups (any other statements would be handled normally). Fixes rdar://9199234 llvm-svn: 160033	2012-07-11 01:49:26 +00:00
Tanya Lattner	bcffcdfd18	Patch by Anton Lokhmotov to add OpenCL work group size attributes. llvm-svn: 159965	2012-07-09 22:06:01 +00:00
John McCall	4e8ca4fa14	Significantly simplify CGExprAgg's logic about ignored results: if we want to ignore a result, the Dest will be null. Otherwise, we must copy into it. This means we need to ensure a slot when loading from a volatile l-value. With all that in place, fix a bug with chained assignments into __block variables of aggregate type where we were losing insight into the actual source of the value during the second assignment. llvm-svn: 159630	2012-07-02 23:58:38 +00:00
Benjamin Kramer	46a72fb741	Dead code eliminate the massive hexagon builtin intrinsic supporting code. The tablegen'd code does the same thing without this egregious duplication. In my limited testing everything seems to work, however there can be differences if the clang and llvm builtin definitions don't match. llvm-svn: 159371	2012-06-28 20:08:55 +00:00
Simon Atanasyan	07ce7d8fb5	Support MIPS DSP Rev1 intrinsics. This patch was reviewed in the llvm-commits list by Jim Grosbach. llvm-svn: 159366	2012-06-28 18:23:16 +00:00
Eli Friedman	c24e2fb1fb	Propagate lvalue alignment into bitfields. Per report on cfe-dev. llvm-svn: 159295	2012-06-27 21:19:48 +00:00
Fariborz Jahanian	6362803cfe	block literal irgen: several improvements on naming block literal helper functions. All helper functions (global and locals) use block_invoke as their prefix. Local literal helper names are prefixed by their enclosing mangled function names. Blocks in non-local initializers (e.g. a global variable or a C++11 field) are prefixed by their mangled variable name. The descriminator number added to end of the name starts off with blank (for first block) and _<N> (for the N+2-th block). llvm-svn: 159206	2012-06-26 16:06:38 +00:00
Chad Rosier	32503020a4	Etch out the code path for MS-style inline assembly. llvm-svn: 158325	2012-06-11 20:47:18 +00:00
Fariborz Jahanian	b5dd2cb13c	objective-c: fix a sema and IRGen crash when property getter result type is safe but does not match with property type resulting in spurious warning followed by crash in IRGen. // rdar://11515196 llvm-svn: 157641	2012-05-29 19:56:01 +00:00
Richard Smith	bb653bd5f9	Implement IRGen for C++11's "T{1, 2, 3}", where T is an aggregate and the expression is treated as an lvalue. llvm-svn: 156781	2012-05-14 21:57:21 +00:00
Nuno Lopes	3d6311d5f7	add -fbounds-checking option. When enabled, clang generates bounds checks for array and pointers dereferences. Work to follow in LLVM's backend. OK'ed by Chad; thanks for the review. llvm-svn: 156431	2012-05-08 22:10:46 +00:00
John McCall	c84ed6a336	Abstract the emission of global destructors into ABI-specific code and only consider using __cxa_atexit in the Itanium logic. The default logic is to use atexit(). Emit "guarded" initializers in Microsoft mode unconditionally. This is definitely not correct, but it's closer to correct than just not emitting the initializer. Based on a patch by Timur Iskhodzhanov! llvm-svn: 155894	2012-05-01 06:13:13 +00:00
Patrick Beard	0caa39474b	Implements boxed expressions for Objective-C. <rdar://problem/10194391> llvm-svn: 155082	2012-04-19 00:25:12 +00:00
Eli Friedman	7f1ff60021	Propagate alignment on lvalues through EmitLValueForField. PR12395. llvm-svn: 154789	2012-04-16 03:54:45 +00:00
Richard Smith	c202b2809a	Add an AttributedStmt type to represent a statement with C++11 attributes attached. Since we do not support any attributes which appertain to a statement (yet), testing of this is necessarily quite minimal. Patch by Alexander Kornienko! llvm-svn: 154723	2012-04-14 00:33:13 +00:00
Anton Korobeynikov	4215ca7564	Step forward with supporting of ARM homogenous aggregates: - Handle unions - Handle C++ classes llvm-svn: 154664	2012-04-13 11:22:00 +00:00
Duncan Sands	e81111ca71	Express the number of ULPs in fpaccuracy metadata as a real rather than a rational number, eg as 2.5 rather than 5, 2. OK'd by Peter Collingbourne. llvm-svn: 154388	2012-04-10 08:23:07 +00:00
John McCall	ee08c53478	Rename GenerateCXXGlobalDtorFunc to GenerateCXXGlobalDtorsFunc. llvm-svn: 154190	2012-04-06 18:21:03 +00:00
Chandler Carruth	8453795255	Revert r153723, and its follow-ups r153728 and r153733. These patches cause us to miscompile and/or reject code with static function-local variables in an extern-C context. Previously, we were papering over this as long as the variables are within the same translation unit, and had not seen any failures in the wild. We still need a proper fix, which involves mangling static locals inside of an extern-C block (as GCC already does), but this patch causes pretty widespread regressions. Firefox, and many other applications no longer build. Lots of test cases have been posted to the list in response to this commit, so there should be no problem reproducing the issues. llvm-svn: 153768	2012-03-30 19:44:53 +00:00
John McCall	87590e60c0	Do the static-locals thing properly in the face of unions and other things which might mess with the variable's type. llvm-svn: 153733	2012-03-30 07:09:50 +00:00
Chad Rosier	615ed1a3a6	Revert r153613 as it's causing large compile-time regressions on the nightly testers. llvm-svn: 153660	2012-03-29 17:37:10 +00:00
John McCall	1a0877f99d	When we can't prove that the target of an aggregate copy is a complete object, the memcpy needs to use the data size of the structure instead of its sizeof() value. Fixes PR12204. llvm-svn: 153613	2012-03-28 23:30:44 +00:00
Rafael Espindola	5c0034a7c6	Add back r153360 with a fix for enums that cover all the 32 bit values. Thanks to NAKAMURA Takumi for finding it! llvm-svn: 153383	2012-03-24 16:50:34 +00:00
NAKAMURA Takumi	2681efcc95	Revert r153360 (and r153380), "Second part of PR12251. Produce the range metadata in clang for booleans and". For i686 targets (eg. cygwin), I saw "Range must not be empty!" in verifier. It produces (i32)[0x80000000:0x80000000) from (uint64_t)[0xFFFFFFFF80000000ULL:0x0000000080000000ULL), for signed i32 on MDNode::Range. llvm-svn: 153382	2012-03-24 14:43:42 +00:00
Rafael Espindola	54355820e8	Second part of PR12251. Produce the range metadata in clang for booleans and c++ enums. llvm-svn: 153360	2012-03-24 00:28:06 +00:00
David Blaikie	bbafb8a745	Unify naming of LangOptions variable/get function across the Clang stack (Lex to AST). The member variable is always "LangOpts" and the member function is always "getLangOpts". Reviewed by Chris Lattner llvm-svn: 152536	2012-03-11 07:00:24 +00:00
John McCall	113bee0536	Remove BlockDeclRefExpr and introduce a bit on DeclRefExpr to track whether the referenced declaration comes from an enclosing local context. I'm amenable to suggestions about the exact meaning of this bit. llvm-svn: 152491	2012-03-10 09:33:50 +00:00
John McCall	7133505936	Unify the BlockDeclRefExpr and DeclRefExpr paths so that we correctly emit loads of BlockDeclRefExprs even when they don't qualify as ODR-uses. I think I'm adequately convinced that BlockDeclRefExpr can die. llvm-svn: 152479	2012-03-10 03:05:10 +00:00
Ted Kremenek	e65b086e07	Add clang support for new Objective-C literal syntax for NSDictionary, NSArray, NSNumber, and boolean literals. This includes both Sema and Codegen support. Included is also support for new Objective-C container subscripting. My apologies for the large patch. It was very difficult to break apart. The patch introduces changes to the driver as well to cause clang to link in additional runtime support when needed to support the new language features. Docs are forthcoming to document the implementation and behavior of these features. llvm-svn: 152137	2012-03-06 20:05:56 +00:00
Jay Foad	b0f3344b10	PR12094: Set the alignment of memory intrinsic instructions based on the types of the pointer arguments. llvm-svn: 151927	2012-03-02 18:34:30 +00:00
Eli Friedman	98b01edc8c	Implement "optimization" for lambda-to-block conversion which inlines the generated block literal for lambdas which are immediately converted to block pointer type. This simplifies the AST, avoids an unnecessary copy of the lambda and makes it much easier to avoid copying the result onto the heap. Note that this transformation has a substantial semantic effect outside of ARC: it gives the converted lambda lifetime semantics similar to a block literal. With ARC, the effect is much less obvious because the lifetime of blocks is already managed. llvm-svn: 151797	2012-03-01 04:01:32 +00:00

... 2 3 4 5 6 ...

976 Commits