forked from OSchip/llvm-project
367 lines
17 KiB
HTML
Executable File
367 lines
17 KiB
HTML
Executable File
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN"
|
|
"http://www.w3.org/TR/html4/strict.dtd">
|
|
<html>
|
|
<head>
|
|
<meta http-equiv="content-type" content="text/html; charset=iso-8859-1">
|
|
<title>Clang - Expressive Diagnostics</title>
|
|
<link type="text/css" rel="stylesheet" href="menu.css">
|
|
<link type="text/css" rel="stylesheet" href="content.css">
|
|
<style type="text/css">
|
|
.loc { font-weight: bold; }
|
|
.err { color:red; font-weight: bold; }
|
|
.warn { color:magenta; font-weight: bold; }
|
|
.note { color:gray; font-weight: bold; }
|
|
.msg { font-weight: bold; }
|
|
.cmd { font-style: italic; }
|
|
.snip { }
|
|
.point { color:green; font-weight: bold; }
|
|
</style>
|
|
</head>
|
|
<body>
|
|
|
|
<!--#include virtual="menu.html.incl"-->
|
|
|
|
<div id="content">
|
|
|
|
|
|
<!--=======================================================================-->
|
|
<h1>Expressive Diagnostics</h1>
|
|
<!--=======================================================================-->
|
|
|
|
<p>In addition to being fast and functional, we aim to make Clang extremely user
|
|
friendly. As far as a command-line compiler goes, this basically boils down to
|
|
making the diagnostics (error and warning messages) generated by the compiler
|
|
be as useful as possible. There are several ways that we do this. This section
|
|
talks about the experience provided by the command line compiler, contrasting
|
|
Clang output to GCC 4.9's output in some cases.
|
|
</p>
|
|
|
|
<h2>Column Numbers and Caret Diagnostics</h2>
|
|
|
|
<p>First, all diagnostics produced by clang include full column number
|
|
information. The clang command-line compiler driver uses this information
|
|
to print "point diagnostics".
|
|
(IDEs can use the information to display in-line error markup.)
|
|
This is nice because it makes it very easy to understand exactly
|
|
what is wrong in a particular piece of code.</p>
|
|
|
|
<p>The point (the green "^" character) exactly shows where the problem is, even
|
|
inside of a string. This makes it really easy to jump to the problem and
|
|
helps when multiple instances of the same character occur on a line. (We'll
|
|
revisit this more in following examples.)</p>
|
|
|
|
<pre>
|
|
$ <span class="cmd">clang -fsyntax-only format-strings.c</span>
|
|
<span class="loc">format-strings.c:91:13:</span> <span class="warn">warning:</span> <span class="msg">'.*' specified field precision is missing a matching 'int' argument</span>
|
|
<span class="snip" > printf("%.*d");</span>
|
|
<span class="point"> ^</span>
|
|
</pre>
|
|
|
|
<p>Note that modern versions of GCC have followed Clang's lead, and are
|
|
now able to give a column for a diagnostic, and include a snippet of source
|
|
text in the result. However, Clang's column number is much more accurate,
|
|
pointing at the problematic format specifier, rather than the <tt>)</tt>
|
|
character the parser had reached when the problem was detected.
|
|
Also, Clang's diagnostic is colored by default, making it easier to
|
|
distinguish from nearby text.</p>
|
|
|
|
<h2>Range Highlighting for Related Text</h2>
|
|
|
|
<p>Clang captures and accurately tracks range information for expressions,
|
|
statements, and other constructs in your program and uses this to make
|
|
diagnostics highlight related information. In the following somewhat
|
|
nonsensical example you can see that you don't even need to see the original source code to
|
|
understand what is wrong based on the Clang error. Because clang prints a
|
|
point, you know exactly <em>which</em> plus it is complaining about. The range
|
|
information highlights the left and right side of the plus which makes it
|
|
immediately obvious what the compiler is talking about.
|
|
Range information is very useful for
|
|
cases involving precedence issues and many other cases.</p>
|
|
|
|
<pre>
|
|
$ <span class="cmd">gcc-4.9 -fsyntax-only t.c</span>
|
|
t.c: In function 'int f(int, int)':
|
|
t.c:7:39: error: invalid operands to binary + (have 'int' and 'struct A')
|
|
return y + func(y ? ((SomeA.X + 40) + SomeA) / 42 + SomeA.X : SomeA.X);
|
|
^
|
|
$ <span class="cmd">clang -fsyntax-only t.c</span>
|
|
<span class="loc">t.c:7:39:</span> <span class="err">error:</span> <span class="msg">invalid operands to binary expression ('int' and 'struct A')</span>
|
|
<span class="snip" > return y + func(y ? ((SomeA.X + 40) + SomeA) / 42 + SomeA.X : SomeA.X);</span>
|
|
<span class="point"> ~~~~~~~~~~~~~~ ^ ~~~~~</span>
|
|
</pre>
|
|
|
|
<h2>Precision in Wording</h2>
|
|
|
|
<p>A detail is that we have tried really hard to make the diagnostics that come
|
|
out of clang contain exactly the pertinent information about what is wrong and
|
|
why. In the example above, we tell you what the inferred types are for
|
|
the left and right hand sides, and we don't repeat what is obvious from the
|
|
point (e.g., that this is a "binary +").</p>
|
|
|
|
<p>Many other examples abound. In the following example, not only do we tell you
|
|
that there is a problem with the <tt>*</tt>
|
|
and point to it, we say exactly why and tell you what the type is (in case it is
|
|
a complicated subexpression, such as a call to an overloaded function). This
|
|
sort of attention to detail makes it much easier to understand and fix problems
|
|
quickly.</p>
|
|
|
|
<pre>
|
|
$ <span class="cmd">gcc-4.9 -fsyntax-only t.c</span>
|
|
t.c:5:11: error: invalid type argument of unary '*' (have 'int')
|
|
return *SomeA.X;
|
|
^
|
|
$ <span class="cmd">clang -fsyntax-only t.c</span>
|
|
<span class="loc">t.c:5:11:</span> <span class="err">error:</span> <span class="msg">indirection requires pointer operand ('int' invalid)</span>
|
|
<span class="snip" > int y = *SomeA.X;</span>
|
|
<span class="point"> ^~~~~~~~</span>
|
|
</pre>
|
|
|
|
<h2>Typedef Preservation and Selective Unwrapping</h2>
|
|
|
|
<p>Many programmers use high-level user defined types, typedefs, and other
|
|
syntactic sugar to refer to types in their program. This is useful because they
|
|
can abbreviate otherwise very long types and it is useful to preserve the
|
|
typename in diagnostics. However, sometimes very simple typedefs can wrap
|
|
trivial types and it is important to strip off the typedef to understand what
|
|
is going on. Clang aims to handle both cases well.<p>
|
|
|
|
<p>The following example shows where it is important to preserve
|
|
a typedef in C.</p>
|
|
|
|
<pre>
|
|
$ <span class="cmd">clang -fsyntax-only t.c</span>
|
|
<span class="loc">t.c:15:11:</span> <span class="err">error:</span> <span class="msg">can't convert between vector values of different size ('__m128' and 'int const *')</span>
|
|
<span class="snip"> myvec[1]/P;</span>
|
|
<span class="point"> ~~~~~~~~^~</span>
|
|
</pre>
|
|
|
|
<p>The following example shows where it is useful for the compiler to expose
|
|
underlying details of a typedef. If the user was somehow confused about how the
|
|
system "pid_t" typedef is defined, Clang helpfully displays it with "aka".</p>
|
|
|
|
<pre>
|
|
$ <span class="cmd">clang -fsyntax-only t.c</span>
|
|
<span class="loc">t.c:13:9:</span> <span class="err">error:</span> <span class="msg">member reference base type 'pid_t' (aka 'int') is not a structure or union</span>
|
|
<span class="snip"> myvar = myvar.x;</span>
|
|
<span class="point"> ~~~~~ ^</span>
|
|
</pre>
|
|
|
|
<p>In C++, type preservation includes retaining any qualification written into type names. For example, if we take a small snippet of code such as:
|
|
|
|
<blockquote>
|
|
<pre>
|
|
namespace services {
|
|
struct WebService { };
|
|
}
|
|
namespace myapp {
|
|
namespace servers {
|
|
struct Server { };
|
|
}
|
|
}
|
|
|
|
using namespace myapp;
|
|
void addHTTPService(servers::Server const &server, ::services::WebService const *http) {
|
|
server += http;
|
|
}
|
|
</pre>
|
|
</blockquote>
|
|
|
|
<p>and then compile it, we see that Clang is both providing accurate information and is retaining the types as written by the user (e.g., "servers::Server", "::services::WebService"):
|
|
|
|
<pre>
|
|
$ <span class="cmd">clang -fsyntax-only t.cpp</span>
|
|
<span class="loc">t.cpp:9:10:</span> <span class="err">error:</span> <span class="msg">invalid operands to binary expression ('servers::Server const' and '::services::WebService const *')</span>
|
|
<span class="snip">server += http;</span>
|
|
<span class="point">~~~~~~ ^ ~~~~</span>
|
|
</pre>
|
|
|
|
<p>Naturally, type preservation extends to uses of templates, and Clang retains information about how a particular template specialization (like <code>std::vector<Real></code>) was spelled within the source code. For example:</p>
|
|
|
|
<pre>
|
|
$ <span class="cmd">clang -fsyntax-only t.cpp</span>
|
|
<span class="loc">t.cpp:12:7:</span> <span class="err">error:</span> <span class="msg">incompatible type assigning 'vector<Real>', expected 'std::string' (aka 'class std::basic_string<char>')</span>
|
|
<span class="snip">str = vec</span>;
|
|
<span class="point">^ ~~~</span>
|
|
</pre>
|
|
|
|
<h2>Fix-it Hints</h2>
|
|
|
|
<p>"Fix-it" hints provide advice for fixing small, localized problems
|
|
in source code. When Clang produces a diagnostic about a particular
|
|
problem that it can work around (e.g., non-standard or redundant
|
|
syntax, missing keywords, common mistakes, etc.), it may also provide
|
|
specific guidance in the form of a code transformation to correct the
|
|
problem. In the following example, Clang warns about the use of a GCC
|
|
extension that has been considered obsolete since 1993. The underlined
|
|
code should be removed, then replaced with the code below the
|
|
point line (".x =" or ".y =", respectively).</p>
|
|
|
|
<pre>
|
|
$ <span class="cmd">clang t.c</span>
|
|
<span class="loc">t.c:5:28:</span> <span class="warn">warning:</span> <span class="msg">use of GNU old-style field designator extension</span>
|
|
<span class="snip">struct point origin = { x: 0.0, y: 0.0 };</span>
|
|
<span class="err">~~</span> <span class="msg"><span class="point">^</span></span>
|
|
<span class="snip">.x = </span>
|
|
<span class="loc">t.c:5:36:</span> <span class="warn">warning:</span> <span class="msg">use of GNU old-style field designator extension</span>
|
|
<span class="snip">struct point origin = { x: 0.0, y: 0.0 };</span>
|
|
<span class="err">~~</span> <span class="msg"><span class="point">^</span></span>
|
|
<span class="snip">.y = </span>
|
|
</pre>
|
|
|
|
<p>"Fix-it" hints are most useful for
|
|
working around common user errors and misconceptions. For example, C++ users
|
|
commonly forget the syntax for explicit specialization of class templates,
|
|
as in the error in the following example. Again, after describing the problem,
|
|
Clang provides the fix--add <code>template<></code>--as part of the
|
|
diagnostic.<p>
|
|
|
|
<pre>
|
|
$ <span class="cmd">clang t.cpp</span>
|
|
<span class="loc">t.cpp:9:3:</span> <span class="err">error:</span> <span class="msg">template specialization requires 'template<>'</span>
|
|
struct iterator_traits<file_iterator> {
|
|
<span class="point">^</span>
|
|
<span class="snip">template<> </span>
|
|
</pre>
|
|
|
|
<h2>Template Type Diffing</h2>
|
|
|
|
<p>Templates types can be long and difficult to read. More so when part of an
|
|
error message. Instead of just printing out the type name, Clang has enough
|
|
information to remove the common elements and highlight the differences. To
|
|
show the template structure more clearly, the templated type can also be
|
|
printed as an indented text tree.</p>
|
|
|
|
Default: template diff with type elision
|
|
<pre>
|
|
<span class="loc">t.cc:4:5:</span> <span class="note">note:</span> candidate function not viable: no known conversion from 'vector<map<[...], <span class="template-highlight">float</span>>>' to 'vector<map<[...], <span class="template-highlight">double</span>>>' for 1st argument;
|
|
</pre>
|
|
-fno-elide-type: template diff without elision
|
|
<pre>
|
|
<span class="loc">t.cc:4:5:</span> <span class="note">note:</span> candidate function not viable: no known conversion from 'vector<map<int, <span class="template-highlight">float</span>>>' to 'vector<map<int, <span class="template-highlight">double</span>>>' for 1st argument;
|
|
</pre>
|
|
-fdiagnostics-show-template-tree: template tree printing with elision
|
|
<pre>
|
|
<span class="loc">t.cc:4:5:</span> <span class="note">note:</span> candidate function not viable: no known conversion for 1st argument;
|
|
vector<
|
|
map<
|
|
[...],
|
|
[<span class="template-highlight">float</span> != <span class="template-highlight">double</span>]>>
|
|
</pre>
|
|
-fdiagnostics-show-template-tree -fno-elide-type: template tree printing with no elision
|
|
<pre>
|
|
<span class="loc">t.cc:4:5:</span> <span class="note">note:</span> candidate function not viable: no known conversion for 1st argument;
|
|
vector<
|
|
map<
|
|
int,
|
|
[<span class="template-highlight">float</span> != <span class="template-highlight">double</span>]>>
|
|
</pre>
|
|
|
|
<h2>Automatic Macro Expansion</h2>
|
|
|
|
<p>Many errors happen in macros that are sometimes deeply nested. With
|
|
traditional compilers, you need to dig deep into the definition of the macro to
|
|
understand how you got into trouble. The following simple example shows how
|
|
Clang helps you out by automatically printing instantiation information and
|
|
nested range information for diagnostics as they are instantiated through macros
|
|
and also shows how some of the other pieces work in a bigger example.</p>
|
|
|
|
<pre>
|
|
$ <span class="cmd">clang -fsyntax-only t.c</span>
|
|
<span class="loc">t.c:80:3:</span> <span class="err">error:</span> <span class="msg">invalid operands to binary expression ('typeof(P)' (aka 'struct mystruct') and 'typeof(F)' (aka 'float'))</span>
|
|
<span class="snip"> X = MYMAX(P, F);</span>
|
|
<span class="point"> ^~~~~~~~~~~</span>
|
|
<span class="loc">t.c:76:94:</span> <span class="note">note:</span> expanded from:
|
|
<span class="snip">#define MYMAX(A,B) __extension__ ({ __typeof__(A) __a = (A); __typeof__(B) __b = (B); __a < __b ? __b : __a; })</span>
|
|
<span class="point"> ~~~ ^ ~~~</span>
|
|
</pre>
|
|
|
|
<p>Here's another real world warning that occurs in the "window" Unix package (which
|
|
implements the "wwopen" class of APIs):</p>
|
|
|
|
<pre>
|
|
$ <span class="cmd">clang -fsyntax-only t.c</span>
|
|
<span class="loc">t.c:22:2:</span> <span class="warn">warning:</span> <span class="msg">type specifier missing, defaults to 'int'</span>
|
|
<span class="snip"> ILPAD();</span>
|
|
<span class="point"> ^</span>
|
|
<span class="loc">t.c:17:17:</span> <span class="note">note:</span> expanded from:
|
|
<span class="snip">#define ILPAD() PAD((NROW - tt.tt_row) * 10) /* 1 ms per char */</span>
|
|
<span class="point"> ^</span>
|
|
<span class="loc">t.c:14:2:</span> <span class="note">note:</span> expanded from:
|
|
<span class="snip"> register i; \</span>
|
|
<span class="point"> ^</span>
|
|
</pre>
|
|
|
|
<p>In practice, we've found that Clang's treatment of macros is actually more useful in multiply nested
|
|
macros than in simple ones.</p>
|
|
|
|
<h2>Quality of Implementation and Attention to Detail</h2>
|
|
|
|
<p>Finally, we have put a lot of work polishing the little things, because
|
|
little things add up over time and contribute to a great user experience.</p>
|
|
|
|
<p>The following example shows that we recover from the simple case of
|
|
forgetting a ; after a struct definition much better than GCC.</p>
|
|
|
|
<pre>
|
|
$ <span class="cmd">cat t.cc</span>
|
|
template<class T>
|
|
class a {};
|
|
struct b {}
|
|
a<int> c;
|
|
$ <span class="cmd">gcc-4.9 t.cc</span>
|
|
t.cc:4:8: error: invalid declarator before 'c'
|
|
a<int> c;
|
|
^
|
|
$ <span class="cmd">clang t.cc</span>
|
|
<span class="loc">t.cc:3:12:</span> <span class="err">error:</span> <span class="msg">expected ';' after struct</span>
|
|
<span class="snip" >struct b {}</span>
|
|
<span class="point"> ^</span>
|
|
<span class="point"> ;</span>
|
|
</pre>
|
|
|
|
<p>The following example shows that we diagnose and recover from a missing
|
|
<tt>typename</tt> keyword well, even in complex circumstances where GCC
|
|
cannot cope.</p>
|
|
|
|
<pre>
|
|
$ <span class="cmd">cat t.cc</span>
|
|
template<class T> void f(T::type) { }
|
|
struct A { };
|
|
void g()
|
|
{
|
|
A a;
|
|
f<A>(a);
|
|
}
|
|
$ <span class="cmd">gcc-4.9 t.cc</span>
|
|
t.cc:1:33: error: variable or field 'f' declared void
|
|
template<class T> void f(T::type) { }
|
|
^
|
|
t.cc: In function 'void g()':
|
|
t.cc:6:5: error: 'f' was not declared in this scope
|
|
f<A>(a);
|
|
^
|
|
t.cc:6:8: error: expected primary-expression before '>' token
|
|
f<A>(a);
|
|
^
|
|
$ <span class="cmd">clang t.cc</span>
|
|
<span class="loc">t.cc:1:26:</span> <span class="err">error:</span> <span class="msg">missing 'typename' prior to dependent type name 'T::type'</span>
|
|
<span class="snip" >template<class T> void f(T::type) { }</span>
|
|
<span class="point"> ^~~~~~~</span>
|
|
<span class="point"> typename </span>
|
|
<span class="loc">t.cc:6:5:</span> <span class="err">error:</span> <span class="msg">no matching function for call to 'f'</span>
|
|
<span class="snip" > f<A>(a);</span>
|
|
<span class="point"> ^~~~</span>
|
|
<span class="loc">t.cc:1:24:</span> <span class="note">note:</span> <span class="msg">candidate template ignored: substitution failure [with T = A]: no type named 'type' in 'A'</span>
|
|
<span class="snip" >template<class T> void f(T::type) { }</span>
|
|
<span class="point"> ^ ~~~~</span>
|
|
</pre>
|
|
|
|
|
|
|
|
<p>While each of these details is minor, we feel that they all add up to provide
|
|
a much more polished experience.</p>
|
|
|
|
</div>
|
|
</body>
|
|
</html>
|