llvm-project

Author	SHA1	Message	Date
Artem Dergachev	eed7a3102c	[analyzer] Support partially tainted records. The analyzer's taint analysis can now reason about structures or arrays originating from taint sources in which only certain sections are tainted. In particular, it also benefits modeling functions like read(), which may read tainted data into a section of a structure, but RegionStore is incapable of expressing the fact that the rest of the structure remains intact, even if we try to model read() directly. Patch by Vlad Tsyrklevich! Differential revision: https://reviews.llvm.org/D28445 llvm-svn: 304162	2017-05-29 15:42:56 +00:00
Anna Zaks	12d0c8d662	[analyzer] Extend taint propagation and checking to support LazyCompoundVal A patch by Vlad Tsyrklevich! Differential Revision: https://reviews.llvm.org/D28445 llvm-svn: 297326	2017-03-09 00:01:16 +00:00
Anna Zaks	d4e43ae22a	[analyzer] Add bug visitor for taint checker. Add a bug visitor to the taint checker to make it easy to distinguish where the tainted value originated. This is especially useful when the original taint source is obscured by complex data flow. A patch by Vlad Tsyrklevich! Differential Revision: https://reviews.llvm.org/D30289 llvm-svn: 297324	2017-03-09 00:01:07 +00:00
Alexander Kornienko	9c10490efe	Refactor: Simplify boolean conditional return statements in lib/StaticAnalyzer/Checkers Summary: Use clang-tidy to simplify boolean conditional return values Reviewers: dcoughlin, krememek Subscribers: krememek, cfe-commits Patch by Richard Thomson! Differential Revision: http://reviews.llvm.org/D10021 llvm-svn: 256491	2015-12-28 13:06:58 +00:00
Devin Coughlin	e39bd407ba	[analyzer] Add generateErrorNode() APIs to CheckerContext. The analyzer trims unnecessary nodes from the exploded graph before reporting path diagnostics. However, in some cases it can trim all nodes (including the error node), leading to an assertion failure (see https://llvm.org/bugs/show_bug.cgi?id=24184). This commit addresses the issue by adding two new APIs to CheckerContext to explicitly create error nodes. Unless the client provides a custom tag, these APIs tag the node with the checker's tag -- preventing it from being trimmed. The generateErrorNode() method creates a sink error node, while generateNonFatalErrorNode() creates an error node for a path that should continue being explored. The intent is that one of these two methods should be used whenever a checker creates an error node. This commit updates the checkers to use these APIs. These APIs (unlike addTransition() and generateSink()) do not take an explicit Pred node. This is because there are not any error nodes in the checkers that were created with an explicit Pred node different from the default (the CheckerContext's Pred node). It also changes generateSink() to require state and pred nodes (previously these were optional) to reduce confusion. Additionally, there were several cases where checkers did not check whether a generated node could be null; we now explicitly check for null in these places. This commit also includes a test case written by Ying Yi as part of http://reviews.llvm.org/D12163 (that patch originally addressed this issue but was reverted because it introduced false positive regressions). Differential Revision: http://reviews.llvm.org/D12780 llvm-svn: 247859	2015-09-16 22:03:05 +00:00
Ted Kremenek	3a0678e33c	[analyzer] Apply whitespace cleanups by Honggyu Kim. llvm-svn: 246978	2015-09-08 03:50:52 +00:00
Aaron Ballman	8d3a7a56a9	Clarify pointer ownership semantics by hoisting the std::unique_ptr creation to the caller instead of hiding it in emitReport. NFC. llvm-svn: 240400	2015-06-23 13:15:32 +00:00
Enrico Pertoso	4432d87578	Fixes a typo in a comment. llvm-svn: 238910	2015-06-03 09:10:58 +00:00
Craig Topper	0dbb783c7b	[C++11] Use 'nullptr'. StaticAnalyzer edition. llvm-svn: 209642	2014-05-27 02:45:47 +00:00
Nuno Lopes	fb744589bc	remove a bunch of unused private methods found with a smarter version of -Wunused-member-function that I'm playing with. Apologies in advance if I removed someone's WIP code. ARCMigrate/TransProperties.cpp \| 8 ----- AST/MicrosoftMangle.cpp \| 1 Analysis/AnalysisDeclContext.cpp \| 5 --- Analysis/LiveVariables.cpp \| 14 ---------- Index/USRGeneration.cpp \| 10 ------- Sema/Sema.cpp \| 33 +++++++++++++++++++++--- Sema/SemaChecking.cpp \| 3 -- Sema/SemaDecl.cpp \| 20 ++------------ StaticAnalyzer/Checkers/GenericTaintChecker.cpp \| 1 9 files changed, 34 insertions(+), 61 deletions(-) llvm-svn: 204561	2014-03-23 17:12:37 +00:00
Aaron Ballman	be22bcb180	[C++11] Replacing DeclBase iterators specific_attr_begin() and specific_attr_end() with iterator_range specific_attrs(). Updating all of the usages of the iterators with range-based for loops. llvm-svn: 203474	2014-03-10 17:08:28 +00:00
Ahmed Charles	b89843299a	Replace OwningPtr with std::unique_ptr. This compiles cleanly with lldb/lld/clang-tools-extra/llvm. llvm-svn: 203279	2014-03-07 20:03:18 +00:00
Alexander Kornienko	4aca9b1cd8	Expose the name of the checker producing each diagnostic message. Summary: In clang-tidy we'd like to know the name of the checker producing each diagnostic message. PathDiagnostic has BugType and Category fields, which are both arbitrary human-readable strings, but we need to know the exact name of the checker in the form that can be used in the CheckersControlList option to enable/disable the specific checker. This patch adds the CheckName field to the CheckerBase class, and sets it in the CheckerManager::registerChecker() method, which gets them from the CheckerRegistry. Checkers that implement multiple checks have to store the names of each check in the respective registerXXXChecker method. Reviewers: jordan_rose, krememek Reviewed By: jordan_rose CC: cfe-commits Differential Revision: http://llvm-reviews.chandlerc.com/D2557 llvm-svn: 201186	2014-02-11 21:49:21 +00:00
Aaron Ballman	f58070baed	Switched FormatAttr to using an IdentifierArgument instead of a StringArgument since that is a more accurate modeling. llvm-svn: 189851	2013-09-03 21:02:22 +00:00
David Blaikie	05785d1622	Include llvm::Optional in clang/Basic/LLVM.h Post-commit CR feedback from Jordan Rose regarding r175594. llvm-svn: 175679	2013-02-20 22:23:23 +00:00
David Blaikie	2fdacbc5b0	Replace SVal llvm::cast support to be well-defined. See r175462 for another example/more details. llvm-svn: 175594	2013-02-20 05:52:05 +00:00
Dmitri Gribenko	f857950d39	Remove useless 'llvm::' qualifier from names like StringRef and others that are brought into 'clang' namespace by clang/Basic/LLVM.h llvm-svn: 172323	2013-01-12 19:30:44 +00:00
Chandler Carruth	3a02247dc9	Sort all of Clang's files under 'lib', and fix up the broken headers uncovered. This required manually correcting all of the incorrect main-module headers I could find, and running the new llvm/utils/sort_includes.py script over the files. I also manually added quite a few missing headers that were uncovered by shuffling the order or moving headers up to be main-module-headers. llvm-svn: 169237	2012-12-04 09:13:33 +00:00
Benjamin Kramer	ea70eb30a0	Pull the Attr iteration parts out of Attr.h, so including DeclBase.h doesn't pull in all the generated Attr code. Required to pull some functions out of line, but this shouldn't have a perf impact. No functionality change. llvm-svn: 169092	2012-12-01 15:09:41 +00:00
Jordan Rose	0c153cb277	[analyzer] Use nice macros for the common ProgramStateTraits (map, set, list). Also, move the REGISTER_*_WITH_PROGRAMSTATE macros to ProgramStateTrait.h. This doesn't get rid of /all/ explicit uses of ProgramStatePartialTrait, but it does get a lot of them. llvm-svn: 167276	2012-11-02 01:54:06 +00:00
Jordan Rose	e10d5a7659	[analyzer] Rename 'EmitReport' to 'emitReport'. No functionality change. llvm-svn: 167275	2012-11-02 01:53:40 +00:00
Benjamin Kramer	e6f7008534	Remove trivial destructor from SVal. This enables the faster SmallVector in clang and also allows clang's unused variable warnings to be more effective. Fix the two instances that popped up. The RetainCountChecker change actually changes functionality, it would be nice if someone from the StaticAnalyzer folks could look at it. llvm-svn: 160444	2012-07-18 19:08:44 +00:00
Jordan Rose	6cd16c5152	[analyzer] Guard against C++ member functions that look like system functions. C++ method calls and C function calls both appear as CallExprs in the AST. This was causing crashes for an object that had a 'free' method. <rdar://problem/11822244> llvm-svn: 160029	2012-07-10 23:13:01 +00:00
Benjamin Kramer	474261af7b	Fix typos found by http://github.com/lyda/misspell-check llvm-svn: 157886	2012-06-02 10:20:41 +00:00
Anna Zaks	b508d29b78	[analyzer] Don't crash even when the system functions are redefined. (Applied changes to CStringAPI, Malloc, and Taint.) This might almost never happen, but we should not crash even if it does. This fixes a crash on the internal analyzer buildbot, where postgresql's configure was redefining memmove (radar://11219852). llvm-svn: 154451	2012-04-10 23:41:11 +00:00
Anna Zaks	3705a1ee10	[analyzer] Change naming in bug reports "tainted" -> "untrusted" llvm-svn: 151120	2012-02-22 02:35:58 +00:00
Dylan Noblesmith	e27789991d	Basic: import OwningPtr<> into clang namespace llvm-svn: 149798	2012-02-05 02:12:40 +00:00
Ted Kremenek	49b1e38e4b	Change references to 'const ProgramState *' to typedef 'ProgramStateRef'. At this point this is largely cosmetic, but it opens the door to replace ProgramStateRef with a smart pointer that more eagerly acts in the role of reclaiming unused ProgramState objects. llvm-svn: 149081	2012-01-26 21:29:00 +00:00
Anna Zaks	bf740512ec	[analyzer] Add more C taint sources/sinks. llvm-svn: 148844	2012-01-24 19:32:25 +00:00
Anna Zaks	97bef5642e	[analyzer] It's possible to have a non PointerType expression evaluate to a Loc value. When this happens, use the default type. llvm-svn: 148631	2012-01-21 06:59:01 +00:00
David Blaikie	e4d798f078	More dead code removal (using -Wunreachable-code) llvm-svn: 148577	2012-01-20 21:50:17 +00:00
Anna Zaks	3b754b25bd	[analyzer] Add socket API as a source of taint. llvm-svn: 148518	2012-01-20 00:11:19 +00:00
Anna Zaks	7f6a6b7507	[analyzer] Refactor: prePropagateTaint -> TaintPropagationRule::process(). Also remove the "should be a pointer argument" warning - should be handled elsewhere. llvm-svn: 148372	2012-01-18 02:45:13 +00:00
Anna Zaks	560dbe9ac9	[analyzer] Taint: warn when tainted data is used to specify a buffer size (Ex: in malloc, memcpy, strncpy..) (Maybe some of this could migrate to the CString checker. One issue with that is that we might want to separate security issues from regular API misuse.) llvm-svn: 148371	2012-01-18 02:45:11 +00:00
Anna Zaks	5d324e509c	[analyzer] Taint: add taint propagation rules for string and memory copy functions. llvm-svn: 148370	2012-01-18 02:45:07 +00:00
Anna Zaks	3666d2c160	[analyzer] Taint: generalize taint propagation to simplify adding more taint propagation functions. llvm-svn: 148266	2012-01-17 00:37:02 +00:00
Anna Zaks	0244cd7450	[analyzer] Taint: add system and popen as undesirable sinks for taint data. llvm-svn: 148176	2012-01-14 02:48:40 +00:00
Anna Zaks	a31f6b9559	[analyzer] Taint: when looking up a binding, provide the type. llvm-svn: 148080	2012-01-13 00:56:51 +00:00
Anna Zaks	b3fa8d7dd1	[analyzer] Add taint transfer by strcpy & others (part 1). To simplify the process: Refactor taint generation checker to simplify passing the information on which arguments need to be tainted from pre to post visit. Todo: We need to factor out the code that sema is using to identify the string and memcpy functions and use it here and in the CString checker. llvm-svn: 148010	2012-01-12 02:22:34 +00:00
Rafael Espindola	47dbcd1d39	Remove unused variable. llvm-svn: 147744	2012-01-07 22:52:07 +00:00
Anna Zaks	126a2ef920	[analyzer] Add basic format string vulnerability checking. We already have a more conservative check in the compiler (if the format string is not a literal, we warn). Still adding it here for completeness and since this check is stronger - only triggered if the format string is tainted. llvm-svn: 147714	2012-01-07 02:33:10 +00:00
Ted Kremenek	632e3b7ee2	[analyzer] Make the entries in 'Environment' context-sensitive by making entries map from (Stmt,LocationContext) pairs to SVals instead of Stmt* to SVals. This is needed to support basic IPA via inlining. Without this, we cannot tell if a Stmt* binding is part of the current analysis scope (StackFrameContext) or part of a parent context. This change introduces an uglification of the use of getSVal(), and thus takes two steps forward and one step back. There are also potential performance implications of enlarging the Environment. Both can be addressed going forward by refactoring the APIs and optimizing the internal representation of Environment. This patch mainly introduces the functionality that we want to build upon (and clean up). llvm-svn: 147688	2012-01-06 22:09:28 +00:00
Anna Zaks	3b0ab206d2	[analyzer] Add support for taint flowing through a function (atoi). Check if the input parameters are tainted (or point to tainted data) on a checkPreStmt<CallExpr>. If the output should be tainted, record it in the state. On post visit (checkPostStmt<CallExpr>), use the state to make decisions (in addition to the existing logic). Use this logic for atoi and fscanf. llvm-svn: 146793	2011-12-17 00:26:34 +00:00
Anna Zaks	e48ee50324	[analyzer] Better stdin support. llvm-svn: 146748	2011-12-16 18:28:50 +00:00
Anna Zaks	099fe3fb28	[analyzer] Treat stdin as a source of taint. Some of the test cases do not currently work because the analyzer core does not seem to call checkers for pre/post DeclRefExpr visits. (Opened radar://10573500. To be fixed later on.) llvm-svn: 146536	2011-12-14 00:56:18 +00:00
Anna Zaks	eefc0e9342	[analyzer] Mark output of fscanf and fopen as tainted. llvm-svn: 146533	2011-12-14 00:56:02 +00:00
Anna Zaks	d6bb3227de	[analyzer] Mark getenv output as tainted. Also, allow adding taint to a region (not only a symbolic value). llvm-svn: 146532	2011-12-14 00:55:58 +00:00
Anna Zaks	7c96b7db96	[analyzer] CStringChecker should not rely on the analyzer generating UndefOrUnknown value when it cannot reason about the expression. We are now often generating expressions even if the solver is not known to be able to simplify it. This is another cleanup of the existing code, where the rest of the analyzer and checkers should not base their logic on knowing ahead of time what the solver can reason about. In this case, CStringChecker is performing a check for overflow of 'left+right' operation. The overflow can be checked with either 'maxVal-left' or 'maxVal-right'. Previously, the decision was based on whether the expression evaluated to undef or not. With this patch, we check if one of the arguments is a constant, in which case we know that 'maxVal-const' is easily simplified. (Another option is to use canReasonAbout() method of the solver here, however, it is currently protected.) This patch also contains 2 small bug fixes: - swap the order of operators inside SValBuilder::makeGenericVal. - handle a case when AddeVal is unknown in GenericTaintChecker::getPointedToSymbol. llvm-svn: 146343	2011-12-11 18:43:40 +00:00
Anna Zaks	457c68726c	[analyzer] Warn when non pointer arguments are passed to scanf (only when running taint checker). There is an open radar to implement better scanf checking as a Sema warning. However, a bit of redundancy is fine in this case. llvm-svn: 144964	2011-11-18 02:26:36 +00:00
Anna Zaks	5c5bf9b634	[analyzer] Adding generic taint checker. The checker is responsible for defining attack surface and adding taint to symbols. llvm-svn: 144825	2011-11-16 19:58:13 +00:00

50 Commits