llvm-project

Commit Graph

Author	SHA1	Message	Date
Anna Zaks	9747febba9	[analyzer] Address Jordan's nitpicks as per code review of r170625. llvm-svn: 170832	2012-12-21 01:50:14 +00:00
Anna Zaks	dc15415da4	[analyzer] Add the pointer escaped callback. Instead of using several callbacks to identify the pointer escape event, checkers now can register for the checkPointerEscape. Converted the Malloc checker to use the new callback. SimpleStreamChecker will be converted next. llvm-svn: 170625	2012-12-20 00:38:25 +00:00
Jordan Rose	047208027a	[analyzer] Remove bindExprAndLocation, which does extra work for no gain. This feature was probably intended to improve diagnostics, but was currently only used when dumping the Environment. It shows what location a given value was loaded from, e.g. when evaluating an LValueToRValue cast. llvm-svn: 169522	2012-12-06 18:58:01 +00:00
Chandler Carruth	3a02247dc9	Sort all of Clang's files under 'lib', and fix up the broken headers uncovered. This required manually correcting all of the incorrect main-module headers I could find, and running the new llvm/utils/sort_includes.py script over the files. I also manually added quite a few missing headers that were uncovered by shuffling the order or moving headers up to be main-module-headers. llvm-svn: 169237	2012-12-04 09:13:33 +00:00
Jordan Rose	520a30fd05	[analyzer] Move convenience REGISTER_*_WITH_PROGRAMSTATE to CheckerContext.h As Anna pointed out, ProgramStateTrait.h is a relatively obscure header, and checker writers may not know to look there to add their own custom state. The base macro that specializes the template remains in ProgramStateTrait.h (REGISTER_TRAIT_WITH_PROGRAMSTATE), which allows the analyzer core to keep using it. llvm-svn: 167385	2012-11-05 16:58:00 +00:00
Jordan Rose	0c153cb277	[analyzer] Use nice macros for the common ProgramStateTraits (map, set, list). Also, move the REGISTER_*_WITH_PROGRAMSTATE macros to ProgramStateTrait.h. This doesn't get rid of /all/ explicit uses of ProgramStatePartialTrait, but it does get a lot of them. llvm-svn: 167276	2012-11-02 01:54:06 +00:00
Jordan Rose	d63f04d8a7	[analyzer] Make ProgramStateManager's SubEngine parameter optional. It is possible and valid to have a state manager and associated objects without having a SubEngine or checkers. Patch by Olaf Krzikalla! llvm-svn: 164947	2012-10-01 16:53:40 +00:00
Ted Kremenek	a808e165b2	Remove unnecessary ASTContext& parameter from SymExpr::getType(). llvm-svn: 164661	2012-09-26 06:00:14 +00:00
Jordan Rose	db72e2fc37	Reapply "[analyzer] Remove constraints on dead symbols as part of removeDeadBindings." Previously, we'd just keep constraints around forever, which means we'd never be able to merge paths that differed only in constraints on dead symbols. Because we now allow constraints on symbolic expressions, not just single symbols, this requires changing SymExpr::symbol_iterator to include intermediate symbol nodes in its traversal, not just the SymbolData leaf nodes. This depends on the previous commit to be correct. Originally applied in r163444, reverted in r164275, now being re-applied. llvm-svn: 164622	2012-09-25 19:03:06 +00:00
Jordan Rose	ae134c6449	Revert "[analyzer] Remove constraints on dead symbols as part of removeDeadBindings." While we definitely want this optimization in the future, we're not currently handling constraints on symbolic /expressions/ correctly. These should stay live even if the SymExpr itself is no longer referenced because we could recreate an identical SymExpr later. Only once the SymExpr can no longer be recreated -- i.e. a component symbol is dead -- can we safely remove the constraints on it. This liveness issue is tracked by <rdar://problem/12333297>. This reverts r163444 / 24c7f98828e039005cff3bd847e7ab404a6a09f8. llvm-svn: 164275	2012-09-20 01:54:56 +00:00
Ted Kremenek	e9764d8f91	Remove dead method ProgramState::MarshalState(). llvm-svn: 163479	2012-09-09 14:55:59 +00:00
Jordan Rose	5860e329a4	[analyzer] Remove constraints on dead symbols as part of removeDeadBindings. Previously, we'd just keep constraints around forever, which means we'd never be able to merge paths that differed only in constraints on dead symbols. Because we now allow constraints on symbolic expressions, not just single symbols, this requires changing SymExpr::symbol_iterator to include intermediate symbol nodes in its traversal, not just the SymbolData leaf nodes. llvm-svn: 163444	2012-09-08 01:24:53 +00:00
Ted Kremenek	244e1d7d0f	Remove ProgramState::getSymVal(). It was being misused by Checkers, with at least one subtle bug in MacOSXKeyChainAPIChecker where calling the method was a substitute for assuming a symbolic value was null (which is not the case). We still keep ConstraintManager::getSymVal(), but we use that as an optimization in SValBuilder and ProgramState::getSVal() to constant-fold SVals. This is only if the ConstraintManager can provide us with that information, which is no longer a requirement. As part of this, introduce a default implementation of ConstraintManager::getSymVal() which returns null. For Checkers, introduce ConstraintManager::isNull(), which queries the state to see if the symbolic value is constrained to be a null value. It does this without assuming it has been implicitly constant folded. llvm-svn: 163428	2012-09-07 22:31:01 +00:00
Ted Kremenek	6269888166	Rename 'unbindLoc()' (in ProgramState) and 'Remove()' to 'killBinding()'. The name is more specific, and one just forwarded to the other. Add some doxygen comments along the way. llvm-svn: 162350	2012-08-22 06:37:46 +00:00
Ted Kremenek	1afcb7442f	Remove Store::bindDecl() and Store::bindDeclWithNoInit(), and all forwarding methods. This functionality is already covered by bindLoc(). llvm-svn: 162346	2012-08-22 06:00:18 +00:00
Ted Kremenek	2cd56c4c6e	Rename 'BindCompoundLiteral' to 'bindCompoundLiteral' and add doxygen comments. llvm-svn: 162345	2012-08-22 06:00:12 +00:00
Jordan Rose	0f6d63be06	[analyzer] Correctly devirtualize virtual method calls in destructors. C++11 [class.cdtor]p4: When a virtual function is called directly or indirectly from a constructor or from a destructor, including during the construction or destruction of the class’s non-static data members, and the object to which the call applies is the object under construction or destruction, the function called is the final overrider in the constructor's or destructor's class and not one overriding it in a more-derived class. llvm-svn: 161915	2012-08-15 00:51:56 +00:00
Jordan Rose	e521f93225	[analyzer] Look up DynamicTypeInfo by region instead of symbol. This allows us to store type info for non-symbolic regions. No functionality change. llvm-svn: 161811	2012-08-13 23:59:07 +00:00
Anna Zaks	a0105b2320	[analyzer] Rename the function to better reflect what it actually does. llvm-svn: 161617	2012-08-09 21:02:45 +00:00
Jordan Rose	356279ca2d	[analyzer] Track malloc'd regions stored in structs. The main blocker on this (besides the previous commit) was that ScanReachableSymbols was not looking through LazyCompoundVals. Once that was fixed, it's easy enough to clear out malloc data on return, just like we do when we bind to a global region. <rdar://problem/10872635> llvm-svn: 161511	2012-08-08 18:23:31 +00:00
Jordan Rose	3a80cec5e9	[analyzer] Revamp RegionStore to distinguish regions with symbolic offsets. RegionStore currently uses a (Region, Offset) pair to describe the locations of memory bindings. However, this representation breaks down when we have regions like 'array[index]', where 'index' is unknown. We used to store this as (SubRegion, 0); now we mark them specially as (SubRegion, SYMBOLIC). Furthermore, ProgramState::scanReachableSymbols depended on the existence of a sub-region map, but RegionStore's implementation doesn't provide for such a thing. Moving the store-traversing logic of scanReachableSymbols into the StoreManager allows us to eliminate the notion of SubRegionMap altogether. This fixes some particularly awkward broken test cases, now in array-struct-region.c. llvm-svn: 161510	2012-08-08 18:23:27 +00:00
Anna Zaks	472dbcf156	[analyzer] Add a checker to manage dynamic type propagation. Instead of sprinkling dynamic type info propagation throughout ExprEngine, the added checker would add the more precise type information on known APIs (Ex: ObjC alloc, new) and propagate the type info in other cases (ex: ObjC init method, casts (the second is not implemented yet)). Add handling of ObjC alloc, new and init to the checker. llvm-svn: 161357	2012-08-06 23:25:39 +00:00
Anna Zaks	afc13b9ec5	[analyzer] Fixup: remove the extra whitespace llvm-svn: 161265	2012-08-03 21:49:42 +00:00
Anna Zaks	150843b87e	[analyzer] ObjC Inlining: Start tracking dynamic type info in the GDM. In the following code, find the type of the symbolic receiver by following it and updating the dynamic type info in the state when we cast the symbol from id to MyClass *. MyClass *a = [[self alloc] init]; return 5/[a testSelf]; llvm-svn: 161264	2012-08-03 21:43:37 +00:00
Anna Zaks	63282aefb9	[analyzer] Very simple ObjC instance method inlining - Retrieves the type of the object/receiver from the state. - Binds self during stack setup. - Only explores the path on which the method is inlined (no bifurcation to explore the path on which the method is not inlined). llvm-svn: 160991	2012-07-30 20:31:29 +00:00
Jordan Rose	d457ca92ce	[analyzer] Introduce a CallEventManager to keep a pool of CallEvents. This allows us to get around the C++ "virtual constructor" problem when we'd like to create a CallEvent from an ExplodedNode, an inlined StackFrameContext, or another CallEvent. The solution has three parts: - CallEventManager uses a BumpPtrAllocator to allocate CallEvent-sized memory blocks. It also keeps a cache of freed CallEvents for reuse. - CallEvents all have protected copy constructors, along with cloneTo() methods that use placement new to copy into CallEventManager-managed memory, vtables intact. - CallEvents owned by CallEventManager are now wrapped in an IntrusiveRefCntPtr. Going forwards, it's probably a good idea to create ALL CallEvents through the CallEventManager, so that we don't accidentally try to reclaim a stack-allocated CallEvent. All of this machinery is currently unused but will be put into use shortly. llvm-svn: 160983	2012-07-30 20:21:55 +00:00
Jordan Rose	d1d54aa131	[analyzer] Use CallEvent for building inlined stack frames. In order to accomplish this, we now build the callee's stack frame as part of the CallEnter node, rather than the subsequent BlockEdge node. This should not have any effect on perceived behavior or diagnostics. This makes it safe to re-enable inlining of member overloaded operators. llvm-svn: 160022	2012-07-10 22:07:57 +00:00
Jordan Rose	742920c8e7	[analyzer] Add a new abstraction over all types of calls: CallEvent This is intended to replace CallOrObjCMessage, and is eventually intended to be used for anything that cares more about /what/ is being called than /how/ it's being called. For example, inlining destructors should be the same as inlining blocks, and checking __attribute__((nonnull)) should apply to the allocator calls generated by operator new. llvm-svn: 159554	2012-07-02 19:27:35 +00:00
Ted Kremenek	c3da376fbc	static analyzer: add inlining support for directly called blocks. llvm-svn: 157833	2012-06-01 20:04:04 +00:00
Ted Kremenek	b14b42d477	Have ScanReachableSymbols report reachable regions. Fixes a false positive with nested array literals. <rdar://problem/10686586> llvm-svn: 151012	2012-02-21 00:46:29 +00:00
Ted Kremenek	d519cae8aa	Have conjured symbols depend on LocationContext, to add context sensitivity for functions called more than once. llvm-svn: 150849	2012-02-17 23:13:45 +00:00
Anna Zaks	3d34834bb0	[analyzer] Make Malloc Checker optimistic in presence of inlining. (In response to Ted's review of r150112.) This moves the logic which checked if a symbol escapes through a parameter to invalidateRegionCallback (instead of post CallExpr visit.) To accommodate the change, added a CallOrObjCMessage parameter to checkRegionChanges callback. llvm-svn: 150513	2012-02-14 21:55:24 +00:00
Argyrios Kyrtzidis	2753ca84f0	Reapply r149311 which I reverted by mistake. Original log: Convert ProgramStateRef to a smart pointer for managing the reference counts of ProgramStates. This leads to a slight memory improvement, and a simplification of the logic for managing ProgramState objects. llvm-svn: 149339	2012-01-31 02:23:28 +00:00
Argyrios Kyrtzidis	0dc0c5411f	Revert r149311 which failed to compile. Original log: Convert ProgramStateRef to a smart pointer for managing the reference counts of ProgramStates. This leads to a slight memory improvement, and a simplification of the logic for managing ProgramState objects. llvm-svn: 149336	2012-01-31 02:14:24 +00:00
Ted Kremenek	b1ca33fde5	Convert ProgramStateRef to a smart pointer for managing the reference counts of ProgramStates. This leads to a slight memory improvement, and a simplification of the logic for managing ProgramState objects. llvm-svn: 149311	2012-01-31 00:57:20 +00:00
Anna Zaks	4f870e652a	[analyzer] Add index out of bounds check for CFArrayGetArrayAtIndex. llvm-svn: 149228	2012-01-30 06:42:48 +00:00
Ted Kremenek	49b1e38e4b	Change references to 'const ProgramState *' to typedef 'ProgramStateRef'. At this point this is largely cosmetic, but it opens the door to replace ProgramStateRef with a smart pointer that more eagerly acts in the role of reclaiming unused ProgramState objects. llvm-svn: 149081	2012-01-26 21:29:00 +00:00
Anna Zaks	282dc1437f	[analyzer] Skip casts when determining taint dependencies + pretty printing. llvm-svn: 148517	2012-01-20 00:11:16 +00:00
Ted Kremenek	3d3aea9374	[analyzer] fix inlining's handling of mapping actual to formal arguments and limit the call stack depth. The analyzer can now accurately simulate factorial for limited depths. llvm-svn: 148036	2012-01-12 19:25:46 +00:00
Anna Zaks	126a2ef920	[analyzer] Add basic format string vulnerability checking. We already have a more conservative check in the compiler (if the format string is not a literal, we warn). Still adding it here for completeness and since this check is stronger - only triggered if the format string is tainted. llvm-svn: 147714	2012-01-07 02:33:10 +00:00
Ted Kremenek	632e3b7ee2	[analyzer] Make the entries in 'Environment' context-sensitive by making entries map from (Stmt,LocationContext) pairs to SVals instead of Stmt* to SVals. This is needed to support basic IPA via inlining. Without this, we cannot tell if a Stmt* binding is part of the current analysis scope (StackFrameContext) or part of a parent context. This change introduces an uglification of the use of getSVal(), and thus takes two steps forward and one step back. There are also potential performance implications of enlarging the Environment. Both can be addressed going forward by refactoring the APIs and optimizing the internal representation of Environment. This patch mainly introduces the functionality that we want to build upon (and clean up). llvm-svn: 147688	2012-01-06 22:09:28 +00:00
Anna Zaks	8158ef0dec	[analyzer] Be less pessimistic about invalidation of global variables as a result of a call. Problem: Global variables, which come in from system libraries should not be invalidated by all calls. Also, non-system globals should not be invalidated by system calls. Solution: The following solution to invalidation of globals seems flexible enough for taint (does not invalidate stdin) and should not lead to too many false positives. We split globals into 3 classes: * immutable - values are preserved by calls (unless the specific global is passed in as a parameter): A : Most system globals and const scalars * invalidated by functions defined in system headers: B: errno * invalidated by all other functions (note, these functions may in turn contain system calls): B: errno C: all other globals (which are not in A nor B) llvm-svn: 147569	2012-01-04 23:54:01 +00:00
David Blaikie	68e081d606	Unweaken vtables as per http://llvm.org/docs/CodingStandards.html#ll_virtual_anch llvm-svn: 146959	2011-12-20 02:48:34 +00:00
Anna Zaks	9de45554e1	[analyzer] Minor: Simplify & assert. llvm-svn: 146792	2011-12-17 00:26:29 +00:00
Anna Zaks	e48ee50324	[analyzer] Better stdin support. llvm-svn: 146748	2011-12-16 18:28:50 +00:00
Anna Zaks	04b57c25bc	[analyzer] Minor refactor to addTaint. llvm-svn: 146535	2011-12-14 00:56:15 +00:00
Anna Zaks	d6bb3227de	[analyzer] Mark getenv output as tainted. Also, allow adding taint to a region (not only a symbolic value). llvm-svn: 146532	2011-12-14 00:55:58 +00:00
Anna Zaks	ecd730085d	[analyzer] Introduce IntSymExpr, where the integer is on the lhs. Fix a bug in SimpleSValBuilder, where we should swap lhs and rhs when calling generateUnknownVal(), - the function which creates symbolic expressions when data is tainted. The issue is not visible when we only create the expressions for taint since all expressions are commutative from taint perspective. Refactor SymExpr::symbol_iterator::expand() to use a switch instead of a chain of ifs. llvm-svn: 146336	2011-12-10 23:36:51 +00:00
Anna Zaks	394256cc0d	[analyzer] If memory region is tainted mark data as tainted. + random comments llvm-svn: 146199	2011-12-08 22:38:43 +00:00
Anna Zaks	9da86ce834	[analyzer] Cleanup: use the variable. llvm-svn: 146056	2011-12-07 19:56:13 +00:00

63 Commits