llvm-project

Commit Graph

Author	SHA1	Message	Date
Argyrios Kyrtzidis	86f1a935dc	Pull the bulk of Lexer::MeasureTokenLength() out into a new function, Lexer::getRawToken(). No functionality change. llvm-svn: 171771	2013-01-07 19:16:18 +00:00
Richard Smith	2bf7fdb723	s/CPlusPlus0x/CPlusPlus11/g llvm-svn: 171367	2013-01-02 11:42:31 +00:00
Chandler Carruth	3a02247dc9	Sort all of Clang's files under 'lib', and fix up the broken headers uncovered. This required manually correcting all of the incorrect main-module headers I could find, and running the new llvm/utils/sort_includes.py script over the files. I also manually added quite a few missing headers that were uncovered by shuffling the order or moving headers up to be main-module-headers. llvm-svn: 169237	2012-12-04 09:13:33 +00:00
Richard Smith	9a67f47882	Teach Lexer::getSpelling about raw string literals. Specifically, if a raw string literal needs cleaning (because it contains line-splicing in the encoding prefix or in the ud-suffix), do not clean the section between the double-quotes -- that's the "raw" bit! llvm-svn: 168776	2012-11-28 07:29:00 +00:00
Nico Weber	4e270380c1	Fix crash on end-of-file after \ in a char literal, fixes PR14369. This makes LexCharConstant() look more like LexStringLiteral(), which doesn't have this bug. Add tests for eof after \ for several other cases. llvm-svn: 168269	2012-11-17 20:25:54 +00:00
Eli Friedman	b699e619fe	Fix an assertion failure printing the unused-label fixit in files using CRLF line endings. <rdar://problem/12639047>. llvm-svn: 167900	2012-11-14 01:28:38 +00:00
Daniel Dunbar	cf3f2c49ea	Revert r167801, "[preprocessor] When #including something that contributes no tokens at all,". This change broke External/Nurbs in LLVM test-suite. llvm-svn: 167858	2012-11-13 19:12:37 +00:00
Nico Weber	7cc28804e2	UCNs in char literals are done (in LiteralSupport), remove FIXME. Expand UCN FIXME in LexNumericConstant. llvm-svn: 167818	2012-11-13 06:25:15 +00:00
Argyrios Kyrtzidis	4f10a3e9f0	[preprocessor] When #including something that contributes no tokens at all, don't recursively continue lexing. This avoids a stack overflow with a sequence of many empty #includes. rdar://11988695 llvm-svn: 167801	2012-11-13 01:03:15 +00:00
Argyrios Kyrtzidis	36675b75fb	In Lexer::LexTokenInternal, avoid code duplication; no functionality change. llvm-svn: 167800	2012-11-13 01:02:40 +00:00
Nico Weber	158a31abe2	s/BCPLComment/LineComment/ llvm-svn: 167690	2012-11-11 07:02:14 +00:00
Argyrios Kyrtzidis	d53d0daab9	Take into account that there may be a BOM at the beginning of the file, when computing the size of the precompiled preamble. llvm-svn: 166659	2012-10-25 01:51:45 +00:00
Dmitri Gribenko	b8e9e7507e	StringRef'ize Preprocessor::CreateString(). llvm-svn: 164555	2012-09-24 21:07:17 +00:00
Roman Divacky	e637711ae0	Dont cast away const needlessly. Found by gcc48 -Wcast-qual. llvm-svn: 163325	2012-09-06 15:59:27 +00:00
Eli Friedman	324adad966	Make a bunch of methods on Lexer private. llvm-svn: 162970	2012-08-31 02:29:37 +00:00
Dmitri Gribenko	4aa05c571e	Lexer: remove dead stores. Found by Clang static analyzer! llvm-svn: 160973	2012-07-30 17:59:40 +00:00
Richard Smith	608c0b65d7	Add warning flag -Winvalid-pp-token for preprocessing-tokens which have undefined behaviour, and move the diagnostic for '' from an Error into an ExtWarn in this group. This is important for some users of the preprocessor, and is necessary for gcc compatibility. llvm-svn: 159335	2012-06-28 07:51:56 +00:00
James Dennett	f442d2455b	Documentation cleanup: * Removed docs for Lexer::makeFileCharRange from Lexer.cpp, as they're in the header file; * Reworked the documentation for SkipBlockComment so that it doesn't confuse Doxygen's comment parsing; * Added another summary with \brief markup. llvm-svn: 158618	2012-06-17 03:40:43 +00:00
Jordan Rose	127f6eef7e	[-E] Emit a rewritten _Pragma on its own line. 1. Teach Lexer that pragma lexers are like macro expansions at EOF. 2. Treat pragmas like #define/#undef when printing. 3. If we just printed a directive, add a newline before any more tokens. (4. Miscellaneous cleanup in PrintPreprocessedOutput.cpp) PR10594 and <rdar://problem/11562490> (two separate related problems) llvm-svn: 158571	2012-06-15 23:33:51 +00:00
James Dennett	ff3c995624	Documentation cleanup: escape backslashes in Doxygen comments. llvm-svn: 158552	2012-06-15 21:36:54 +00:00
Richard Smith	e6799ddae8	PR12717: Clang supports hexadecimal floating-point literals in all language modes. For languages other than C99/C11, this isn't quite a conforming extension, and for C++11, it breaks some reasonable code containing user-defined literals. In languages which don't officially have hexfloats, pare back this extension to only apply in cases where the token starts 0x and does not contain an underscore. The extension is still not quite conforming, but it's a lot closer now. llvm-svn: 158487	2012-06-15 05:07:49 +00:00
David Blaikie	2af2b3071d	Fix PR13065. This condition (added in r158093) was overly conservative. llvm-svn: 158483	2012-06-15 00:47:13 +00:00
Dmitri Gribenko	702b732d6f	Correct method name in comment: from LexRawToken to LexFromRawLexer, according to a change done long ago in r57393. llvm-svn: 158243	2012-06-08 23:19:37 +00:00
Jordan Rose	288c421b3d	Insert a space if necessary when suggesting CFBridgingRetain/Release. This was a problem for people who write 'return(result);' Also fix ARCMT's corresponding code, though there's no test case for this because implicit casts like this are rejected by the migrator for being ambiguous, and explicit casts have no problem. <rdar://problem/11577346> llvm-svn: 158130	2012-06-07 01:10:31 +00:00
David Blaikie	d5321247c4	Add a -rewrite-includes option, which is similar to -rewrite-macros, but only expands #include directives. Patch contributed by Lubos Lunak (l.lunax@suse.cz). Review by Matt Beaumont-Gay (matthewbg@google.com). llvm-svn: 158093	2012-06-06 18:52:13 +00:00
David Blaikie	987bcf9462	Escape \n and \r in doxycomment. llvm-svn: 158091	2012-06-06 18:43:20 +00:00
Benjamin Kramer	e5fbc6c85d	Lexer::ReadToEndOfLine: Only build the string if it's actually used and do so in a less malloc-intensive way. llvm-svn: 157064	2012-05-18 19:32:16 +00:00
Seth Cantrell	e83c731cad	Support -Wc++98-compat-pedantic as requested: http://lists.cs.uiuc.edu/pipermail/cfe-commits/Week-of-Mon-20120409/056126.html llvm-svn: 154655	2012-04-13 03:43:23 +00:00
Seth Cantrell	10ac7205ce	C++11 no longer requires files to end with a newline llvm-svn: 154643	2012-04-13 01:00:34 +00:00
Francois Pichet	7ebc4c1910	ext_reserved_user_defined_literal must not default to Error in MicrosoftMode. Hence create ext_ms_reserved_user_defined_literal that doesn't default to Error; otherwise MSVC headers won't parse. Fixes PR12383. llvm-svn: 154273	2012-04-07 23:09:23 +00:00
David Blaikie	bbafb8a745	Unify naming of LangOptions variable/get function across the Clang stack (Lex to AST). The member variable is always "LangOpts" and the member function is always "getLangOpts". Reviewed by Chris Lattner llvm-svn: 152536	2012-03-11 07:00:24 +00:00
Richard Smith	0df56f4a90	Implement C++11 [lex.ext]p10 for string and character literals: a ud-suffix not starting with an underscore is ill-formed. Since this rule rejects programs that were using <inttypes.h>'s macros, recover from this error by treating the ud-suffix as a separate preprocessing-token, with a DefaultError ExtWarn. The approach of treating such cases as two tokens is under discussion for standardization, but is in any case a conforming extension and allows existing codebases to keep building while the committee makes up its mind. Reword the warning on the definition of literal operators not starting with underscores (which are, strangely, legal) to more explicitly state that such operators can't be called by literals. Remove the special-case diagnostic for hexfloats, since it was both triggering in the wrong cases and incorrect. llvm-svn: 152287	2012-03-08 02:39:21 +00:00
Richard Smith	3e4a60a2cd	Add -Wc++11-compat warning for string and character literals followed by identifiers, in cases where those identifiers would be treated as user-defined literal suffixes in C++11. llvm-svn: 152198	2012-03-07 03:13:00 +00:00
Richard Smith	d67aea28f6	User-defined literals: reject string and character UDLs in all places where the grammar requires a string-literal and not a user-defined-string-literal. The two constructs are still represented by the same TokenKind, in order to prevent a combinatorial explosion of different kinds of token. A flag on Token tracks whether a ud-suffix is present, in order to prevent clients from needing to look at the token's spelling. llvm-svn: 152098	2012-03-06 03:21:47 +00:00
Richard Smith	e18f0faff2	Lexing support for user-defined literals. Currently these lex as the same token kinds as the underlying string literals, and we silently drop the ud-suffix; those issues will be fixed by subsequent patches. llvm-svn: 152012	2012-03-05 04:02:15 +00:00
Argyrios Kyrtzidis	0d9e24b1db	Change Lexer::makeFileCharRange() to have it accept a CharSourceRange instead of a SourceRange, and handle the case where the range is a char (not token) range. llvm-svn: 149677	2012-02-03 05:58:29 +00:00
Argyrios Kyrtzidis	abff5f1271	Improve Lexer::getImmediateMacroName to take into account inner macros of macro arguments. For "MAC1( MAC2(foo) )" and location of 'foo' token it would return "MAC1" instead of "MAC2". llvm-svn: 148704	2012-01-23 16:58:33 +00:00
Argyrios Kyrtzidis	85e7671b71	Enhance Lexer::makeFileCharRange to check for ranges inside a macro argument expansion, in which case it returns a file range in the location where the argument was spelled. llvm-svn: 148551	2012-01-20 16:52:43 +00:00
Argyrios Kyrtzidis	7838a2bffb	Introduce Lexer::getSourceText() that returns a string for the source that the given source range encompasses. llvm-svn: 148481	2012-01-19 15:59:19 +00:00
Argyrios Kyrtzidis	a99e02d019	Introduce Lexer::makeFileCharRange() that accepts a token source range and returns a character range with file locations. llvm-svn: 148480	2012-01-19 15:59:14 +00:00
Argyrios Kyrtzidis	1b07c344b4	For Lexer's isAt[Start/End]OfMacroExpansion add an out parameter for the macro start/end location. It is commonly needed after calling the function; with this way we avoid recalculating it. llvm-svn: 148479	2012-01-19 15:59:08 +00:00
Anna Zaks	1bea4bf590	Refactor: Pull getImmediateMacroName() out of DiagnosticRenderer and into Lexer and Preprocessor; making it widely available. llvm-svn: 148410	2012-01-18 20:17:16 +00:00
Chandler Carruth	5b15a9be6a	Two variables had been added for an assert, but their values were re-computed rather than the variables be re-used just after the assert. Just use the variables since we have them already. Fixes an unused variable warning. Also fix an 80-column violation. llvm-svn: 148212	2012-01-15 09:03:45 +00:00
Argyrios Kyrtzidis	8a26c4de64	In Lexer::getCharAndSizeSlow[NoWarn] if we come up against \<newline><newline> don't consume the second newline. Thanks to David Blaikie for pointing out the crash! llvm-svn: 147138	2011-12-22 04:38:07 +00:00
Argyrios Kyrtzidis	e5cdd080ba	In Lexer::getCharAndSizeSlow[NoWarn] make sure we don't go over the end of the buffer when the end of the buffer is immediately after an escaped newline. Fixes http://llvm.org/PR10153. llvm-svn: 147091	2011-12-21 20:19:55 +00:00
David Blaikie	68e081d606	Unweaken vtables as per http://llvm.org/docs/CodingStandards.html#ll_virtual_anch llvm-svn: 146959	2011-12-20 02:48:34 +00:00
Benjamin Kramer	900f1defdd	Remove assert from hot code path and add a clarifying comment. The assert wasn't adding much value but slowed down Release+Asserts builds. llvm-svn: 145082	2011-11-22 20:39:31 +00:00
Benjamin Kramer	3885737a1b	Lexer: Don't throw away the hard work SSE did to find a slash. We can reuse the information and avoid looping over all the bytes again. llvm-svn: 145070	2011-11-22 18:56:46 +00:00
Ted Kremenek	a08713ce86	Move about 20 random diagnostics under -W flags. Patch by Ahmed Charles! llvm-svn: 142284	2011-10-17 21:47:53 +00:00
Richard Smith	acd4d3d52a	-Wc++98-compat warnings for the lexer. This also adds a -Wc++98-compat-pedantic for warning on constructs which would be diagnosed by -std=c++98 -pedantic (that is, it warns even on C++11 features which we enable by default, with no warning, in C++98 mode). llvm-svn: 142034	2011-10-15 01:18:56 +00:00
Douglas Gregor	227c352bae	We do parse hexfloats in C++11; make it actually work. llvm-svn: 141798	2011-10-12 18:51:02 +00:00
Richard Smith	a9e33d44a6	Handle Perforce-style conflict markers like normal conflict markers. Perforce swaps over the <<<< and >>>> markers, and uses shorter markers than traditional tools. llvm-svn: 141751	2011-10-12 00:37:51 +00:00
Abramo Bagnara	e398e60611	Fixed exapnsion range for # and ##. llvm-svn: 141012	2011-10-03 18:39:03 +00:00
Argyrios Kyrtzidis	e6e67deeed	Rename SourceLocation::getFileLocWithOffset -> getLocWithOffset. It already works (and is useful with) macro locs as well. llvm-svn: 140057	2011-09-19 20:40:19 +00:00
Francois Pichet	0706d203cf	Rename LangOptions::Microsoft to LangOptions::MicrosoftExt to make it clear that this flag must be used only for Microsoft extensions and not emulation; to avoid confusion with the new LangOptions::MicrosoftMode flag. Many of the code now under LangOptions::MicrosoftExt will eventually be moved under the LangOptions::MicrosoftMode flag. llvm-svn: 139987	2011-09-17 17:15:52 +00:00
Benjamin Kramer	17ff23c708	Speed up BCPL comment lexing by looking aggressively for newlines and then scannig backwards to see if the newline is escaped. 3% speedup in preprocessing all of clang with -Eonly. Also includes a small testcase for coverage. llvm-svn: 139116	2011-09-05 07:19:39 +00:00
Benjamin Kramer	dbfb18a0a9	Use the Lexer's definition of whitespace here. llvm-svn: 139115	2011-09-05 07:19:35 +00:00
Argyrios Kyrtzidis	5cec2aea3f	Support code-completion for C++ inline methods and ObjC buffering methods. Previously we would cut off the source file buffer at the code-completion point; this impeded code-completion inside C++ inline methods and, recently, with buffering ObjC methods. Have the code-completion inserted into the source buffer so that it can be buffered along with a method body. When we actually hit the code-completion point the cut-off lexing or parsing. Fixes rdar://10056932&8319466 llvm-svn: 139086	2011-09-04 03:32:15 +00:00
Argyrios Kyrtzidis	a3deaeeb52	Fix Lexer::ComputePreamble when MaxLines parameter is non-zero. The function was only counting lines that included tokens and not empty lines, but MaxLines (mainly initiated to the line where the code-completion point resides) is a count of overall lines (even empty ones). llvm-svn: 139085	2011-09-04 03:32:04 +00:00
Douglas Gregor	081425343b	Introduce support for a simple module import declaration, which loads the named module. The syntax itself is intentionally hideous and will be replaced at some later point with something more palatable. For now, we're focusing on the semantics: - Module imports are handled first by the preprocessor (to get macro definitions) and then the same tokens are also handled by the parser (to get declarations). If both happen (as in normal compilation), the second one is redundant, because we currently have no way to hide macros or declarations when loading a module. Chris gets credit for this mad-but-workable scheme. - The Preprocessor now holds on to a reference to a module loader, which is responsible for loading named modules. CompilerInstance is the only important module loader: it now knows how to create and wire up an AST reader on demand to actually perform the module load. - We search for modules in the include path, using the module name with the suffix ".pcm" (precompiled module) for the file name. This is a temporary hack; we hope to improve the situation in the future. llvm-svn: 138679	2011-08-26 23:56:07 +00:00
Argyrios Kyrtzidis	7aecbc7661	Make Lexer::ComputePreamble accept a LangOptions parameter, otherwise it may be out-of-sync how a file is compiled. Patch by Matthias Kleine! llvm-svn: 138580	2011-08-25 20:39:19 +00:00
Argyrios Kyrtzidis	f6a3b0ca4b	In Lexer::isAtEndOfMacroExpansion use SourceManager::isInFileID and avoid the extra SourceManager::getFileID call. llvm-svn: 138376	2011-08-23 21:02:30 +00:00
Argyrios Kyrtzidis	161868db4c	Make Lexer::GetBeginningOfToken able to handle macro arg expansion locations. llvm-svn: 137795	2011-08-17 00:31:23 +00:00
Craig Topper	54edccafc5	Add support for C++0x raw string literals. llvm-svn: 137298	2011-08-11 04:06:15 +00:00
Anna Zaks	59a3c80717	Add a utility function to the Lexer, which makes it easier to find a token after the given location. (It is a generalized version of trans::findLocationAfterSemi from ArcMigrate, which will be changed to use the Lexer utility). llvm-svn: 136268	2011-07-27 21:43:43 +00:00
Douglas Gregor	fb65e592e0	Add support for C++0x unicode string and character literals, from Craig Topper! llvm-svn: 136210	2011-07-27 05:40:30 +00:00
Chandler Carruth	ee4c1d1298	Migrate 'Instantiation' data and API bits of SLocEntry to 'Expansion' etc. With this I think essentially all of the SourceManager APIs are converted. Comments and random other bits of cleanup should be all thats left. llvm-svn: 136057	2011-07-26 04:56:51 +00:00
Chandler Carruth	73ee5d7fae	Convert InstantiationInfo and much of the related code to ExpansionInfo and various other 'expansion' based terms. I've tried to reformat where appropriate and catch as many references in comments but I'm going to do several more passes. Also I've tried to expand parameter names to be more clear where appropriate. llvm-svn: 136056	2011-07-26 04:41:47 +00:00
Chandler Carruth	115b077f30	Rename create(MacroArg)InstantiationLoc to create(MacroArg)ExpansionLoc. llvm-svn: 136054	2011-07-26 03:03:05 +00:00
Chandler Carruth	ca757587a3	Rename SourceManager::getImmediateInstantiationRange to getImmediateExpansionRange. llvm-svn: 135960	2011-07-25 20:52:21 +00:00
Chandler Carruth	6d28d7f2a3	Rename SourceManager::getInstantiationRange to getExpansionRange. llvm-svn: 135915	2011-07-25 16:56:02 +00:00
Chandler Carruth	35f5320d8e	Mechanically rename SourceManager::getInstantiationLoc and FullSourceLoc::getInstantiationLoc to ...::getExpansionLoc. This is part of the API and documentation update from 'instantiation' as the term for macros to 'expansion'. llvm-svn: 135914	2011-07-25 16:49:02 +00:00
Chris Lattner	0e62c1cc0b	remove unneeded llvm:: namespace qualifiers on some core types now that LLVM.h imports them into the clang namespace. llvm-svn: 135852	2011-07-23 10:55:15 +00:00
Joerg Sonnenberger	da5d2b761a	Spelling llvm-svn: 135545	2011-07-20 00:14:37 +00:00
Douglas Gregor	925296b4c2	Revamp the SourceManager to separate the representation of parsed source locations from source locations loaded from an AST/PCH file. Previously, loading an AST/PCH file involved carefully pre-allocating space at the beginning of the source manager for the source locations and FileIDs that correspond to the prefix, and then appending the source locations/FileIDs used for parsing the remaining translation unit. This design forced us into loading PCH files early, as a prefix, whic has become a rather significant limitation. This patch splits the SourceManager space into two parts: for source location "addresses", the lower values (growing upward) are used to describe parsed code, while upper values (growing downward) are used for source locations loaded from AST/PCH files. Similarly, positive FileIDs are used to describe parsed code while negative FileIDs are used to file/macro locations loaded from AST/PCH files. As a result, we can load PCH/AST files even during parsing, making various improvemnts in the future possible, e.g., teaching #include <foo.h> to look for and load <foo.h.gch> if it happens to be already available. This patch was originally written by Sebastian Redl, then brought forward to the modern age by Jonathan Turner, and finally polished/finished by me to be committed. llvm-svn: 135484	2011-07-19 16:10:42 +00:00
Chandler Carruth	e2c09ebcaa	Convert terminology in the Lexer from 'instantiate' and variants to 'expand'. Also update the public API it provides to the new term, and propagate that update to the various clients. No functionality changed. llvm-svn: 135138	2011-07-14 08:20:40 +00:00
Argyrios Kyrtzidis	61c58f7f43	Move SourceManager::isAt[Start/End]OfMacroInstantiation functions to the Lexer, since they depend on it now. llvm-svn: 134644	2011-07-07 21:54:45 +00:00
Argyrios Kyrtzidis	41fb2d95a3	Make the Preprocessor more memory efficient and improve macro instantiation diagnostics. When a macro instantiation occurs, reserve a SLocEntry chunk with length the full length of the macro definition source. Set the spelling location of this chunk to point to the start of the macro definition and any tokens that are lexed directly from the macro definition will get a location from this chunk with the appropriate offset. For any tokens that come from argument expansion, '##' paste operator, etc. have their instantiation location point at the appropriate place in the instantiated macro definition (the argument identifier and the '##' token respectively). This improves macro instantiation diagnostics: Before: t.c:5:9: error: invalid operands to binary expression ('struct S' and 'int') int y = M(/); ^~~~ t.c:5:11: note: instantiated from: int y = M(/); ^ After: t.c:5:9: error: invalid operands to binary expression ('struct S' and 'int') int y = M(/); ^~~~ t.c:3:20: note: instantiated from: \#define M(op) (foo op 3); ~~~ ^ ~ t.c:5:11: note: instantiated from: int y = M(/); ^ The memory savings for a candidate boost library that abuses the preprocessor are: - 32% less SLocEntries (37M -> 25M) - 30% reduction in PCH file size (900M -> 635M) - 50% reduction in memory usage for the SLocEntry table (1.6G -> 800M) llvm-svn: 134587	2011-07-07 03:40:34 +00:00
Argyrios Kyrtzidis	2cfce18645	Allow Lexer::getLocForEndOfToken to return the location just passed the macro instantiation if the location given points at the last token of the macro instantiation. Fixes rdar://9045701. llvm-svn: 133804	2011-06-24 17:58:59 +00:00
Eli Friedman	86a5101c27	Don't strlen() every file before parsing it. llvm-svn: 131132	2011-05-10 17:11:21 +00:00
Chris Lattner	57540c5be0	fix a bunch of comment typos found by codespell. Patch by Luis Felipe Strano Moraes! llvm-svn: 129559	2011-04-15 05:22:18 +00:00
Richard Smith	f7b6202e6c	Implement C++0x [lex.pptoken]p3's handling of <::. llvm-svn: 129525	2011-04-14 18:36:27 +00:00
Eric Christopher	7f36a79ee9	Eat the UTF-8 BOM at the beginning of a file since it's ignored anyhow. Nom Nom Nom. Patch by Anton Korobeynikov! llvm-svn: 129174	2011-04-09 00:01:04 +00:00
John McCall	75ca6d72c2	Fix getLocForEndOfToken to not double-count spurious internal characters within a token, like trigraphs and escaped newlines. Patch by Marcin Kowalczyk! llvm-svn: 128978	2011-04-06 01:50:22 +00:00
Daniel Dunbar	1057f86d0e	Lexer: Add extremely limited support for -traditional-cpp, ignoring BCPL comments. llvm-svn: 127910	2011-03-18 21:23:38 +00:00
John McCall	462c055d85	Fix my earlier commit to work with escaped newlines and leave breadcrumbs in case we want to make a world where we can check intermediate instantiations for this kind of breadcrumb. llvm-svn: 127221	2011-03-08 07:59:04 +00:00
Peter Collingbourne	2f1e36bfd0	Rename tok::eom to tok::eod. The previous name was inaccurate as this token in fact appears at the end of every preprocessing directive, not just macro definitions. No functionality change, except for a diagnostic tweak. llvm-svn: 126631	2011-02-28 02:37:51 +00:00
Argyrios Kyrtzidis	c541ade850	Warn for missing terminating " or ' instead of error for gcc compatibility. Fixed rdar://8914293. llvm-svn: 125616	2011-02-15 23:45:31 +00:00
Peter Collingbourne	c1270f51fa	Lexer: add CUDA kernel call tokens llvm-svn: 125218	2011-02-09 21:08:21 +00:00
Douglas Gregor	86af98444f	Harden Lexer::GetBeginningOfToken() against bogus source locations and the disappearance/alteration of files. llvm-svn: 124616	2011-01-31 22:42:36 +00:00
Abramo Bagnara	ea4f7c7761	Introduced raw_identifier token kind. llvm-svn: 122394	2010-12-22 08:23:18 +00:00
Chris Lattner	39720111e0	move getSpelling from Preprocessor to Lexer, which it is more conceptually related to. llvm-svn: 119479	2010-11-17 07:26:20 +00:00
Chris Lattner	2a6ee91619	move AdvanceToTokenCharacter and getLocForEndOfToken from Preprocessor to Lexer where they make more sense. llvm-svn: 119474	2010-11-17 07:05:50 +00:00
Chandler Carruth	c3ce5840af	Update remaining attribute macros to new style. llvm-svn: 117204	2010-10-23 08:44:57 +00:00
Sebastian Redl	517523014d	In MeasureTokenLength, the FileLoc supplied to the lexer must point to the start of the buffer, or we risk overflow. llvm-svn: 115117	2010-09-30 01:03:03 +00:00
Chris Lattner	0f0492e69c	improve isHexaLiteral to work with escaped newlines and trigraphs, patch by Francois Pichet! llvm-svn: 112602	2010-08-31 16:42:00 +00:00
Chris Lattner	dec7334218	silence a warning llvm-svn: 112549	2010-08-30 23:11:03 +00:00
Alexis Hunt	3b7918625c	Revert my user-defined literal commits - r1124{58,60,67} pending some issues being sorted out. llvm-svn: 112493	2010-08-30 17:47:05 +00:00
Chris Lattner	5f183aa592	add a fixme. llvm-svn: 112491	2010-08-30 17:11:14 +00:00
Chris Lattner	7a9e9e7d76	use 'features' instead of 'PP->getLangOptions'. llvm-svn: 112490	2010-08-30 17:09:08 +00:00
Douglas Gregor	759ef23bb8	In Microsoft compatibility mode, don't parse the exponent as part of the pp-number in a hexadecimal floating point literal, from Francois Pichet! Fixes PR7968. llvm-svn: 112481	2010-08-30 14:50:47 +00:00
Alexis Hunt	79eb5469e0	Implement C++0x user-defined string literals. The extra data stored on user-defined literal Tokens is stored in extra allocated memory, which is managed by the PreprocessorLexer because there isn't a better place to put it that makes sure it gets deallocated, but only after it's used up. My testing has shown no significant slowdown as a result, but independent testing would be appreciated. llvm-svn: 112458	2010-08-29 21:26:48 +00:00
Douglas Gregor	115837041e	Introduce a preprocessor code-completion hook for contexts where we expect "natural" language and should not provide any completions, e.g., comments, string literals, #error. llvm-svn: 112054	2010-08-25 17:04:25 +00:00
Douglas Gregor	3a7ad25eb6	Introduce basic code-completion support for preprocessor directives, e.g., after a "#" we'll suggest #if, #ifdef, etc. llvm-svn: 111943	2010-08-24 19:08:16 +00:00
Douglas Gregor	02690ba643	Don't emit end-of-file diagnostics like "unterminated conditional" or "unterminated string" when we're performing code completion. llvm-svn: 110933	2010-08-12 17:04:55 +00:00
Benjamin Kramer	e8394df11b	Random temporary string cleanup. llvm-svn: 110807	2010-08-11 14:47:12 +00:00
Douglas Gregor	028d3e4d0f	Use precompiled preambles for in-process code completion. llvm-svn: 110596	2010-08-09 20:45:32 +00:00
Douglas Gregor	3f4bea0646	Introduce basic support for loading a precompiled preamble while reparsing an ASTUnit. When saving a preamble, create a buffer larger than the actual file we're working with but fill everything from the end of the preamble to the end of the file with spaces (so the lexer will quickly skip them). When we load the file, create a buffer of the same size, filling it with the file and then spaces. Then, instruct the lexer to start lexing after the preamble, therefore continuing the parse from the spot where the preamble left off. It's now possible to perform a simple preamble build + parse (+ reparse) with ASTUnit. However, one has to disable a bunch of checking in the PCH reader to do so. That part isn't committed; it will likely be handled with some other kind of flag (e.g., -fno-validate-pch). As part of this, fix some issues with null termination of the memory buffers created for the preamble; we were trying to explicitly NULL-terminate them, even though they were also getting implicitly NULL terminated, leading to excess warnings about NULL characters in source files. llvm-svn: 109445	2010-07-26 21:36:20 +00:00
Douglas Gregor	cd8bdd025f	Improve performance during cursor traversal when a region of interest is present. Rather than using clang_getCursorExtent(), which requires us to lex the token at the ending position to determine its length. Then, we'd be comparing [a, b) source ranges that cover the characters in the range rather than the normal behavior for Clang's source ranges, which covers the tokens in the range. However, relexing causes us to read the source file (which may come from a precompiled header), which is rather unfortunate and affects performance. In the new scheme, we only use Clang-style source ranges that cover the tokens in the range. At the entry points where this matters (clang_annotateTokens, clang_getCursor), we make sure to move source locations to the start of the token. Addresses most of <rdar://problem/8049381>. llvm-svn: 109134	2010-07-22 20:22:31 +00:00
Douglas Gregor	af82e3510b	Introduce a new lexer function to compute the "preamble" of a file, which is the part of the file that contains all of the initial comments, includes, and preprocessor directives that occur before any of the actual code. Added a new -print-preamble cc1 action that is only used for testing. llvm-svn: 108913	2010-07-20 20:18:03 +00:00
Chris Lattner	86851b8a7a	fix PR4499, patch by Kyle Dean! llvm-svn: 107836	2010-07-07 23:24:27 +00:00
Chris Lattner	52d96ac930	simpler fix for rdar://8044135 - escaped newlines have already been processed, so they don't have to be tip-toed around. llvm-svn: 105182	2010-05-30 23:27:38 +00:00
Douglas Gregor	fe4a4107d8	Improve our handling of NULL after an escaping '\' in a string literal. Fixes <rdar://problem/8044135>. llvm-svn: 105181	2010-05-30 22:59:50 +00:00
Douglas Gregor	6da3db4af3	Improve code completion in failure cases in two ways: 1) Suppress diagnostics as soon as we form the code-completion token, so we don't get any error/warning spew from the early end-of-file. 2) If we consume a code-completion token when we weren't expecting one, go into a code-completion recovery path that produces the best results it can based on the context that the parser is in. llvm-svn: 104585	2010-05-25 05:58:43 +00:00
Chris Lattner	467f6bcfe5	robustify the conflict marker stuff. Don't add 7 twice, which would make it miss (invalid) things like: <<<<<<< >>>>>>> and crash if <<<<<<< was at the end of the line. When we find a >>>>>>> that is not at the end of the line, make sure to reset Pos so we don't crash on something like: <<<<<<< >>>>>>> This isn't worth making testcases for, since each would require a new file. rdar://7987078 - signal 11 compiling "<<<<<<<<<<" llvm-svn: 103968	2010-05-17 20:27:25 +00:00
Chris Lattner	561aabd943	when code completing inside a C-style block comment, don't emit errors about a missing */ since we truncated the file. This fixes rdar://7948776 llvm-svn: 103913	2010-05-16 19:54:05 +00:00
Chris Lattner	1a9e873bf9	fix a minor bug I noticed while work with Jordy's patch for PR6101, in an input file like this: # 42 int x; we were emitting: # <something> int x; (with a space before the int) because we weren't clearing the leading whitespace flag properly after the \n from the directive was handled. llvm-svn: 101084	2010-04-12 23:04:41 +00:00
Douglas Gregor	a771f46c82	Reinstate my CodeModificationHint -> FixItHint renaming patch, without the C-only "optimization". llvm-svn: 100022	2010-03-31 17:46:05 +00:00
Douglas Gregor	30e631862f	Revert r100008, which inexplicably breaks the clang-i686-darwin10 builder llvm-svn: 100018	2010-03-31 17:25:35 +00:00
Douglas Gregor	3baad0d4f7	Rename CodeModificationHint to FixItHint, since we've been using the term "fix-it" everywhere and even I get tired of long names sometimes. No functionality change. llvm-svn: 100008	2010-03-31 15:31:50 +00:00
Douglas Gregor	1668355e06	Remove unused variable llvm-svn: 98691	2010-03-16 22:54:32 +00:00
Douglas Gregor	dc970f0866	Audit all Preprocessor::getSpelling() callers, improving failure recovery for those that need it. llvm-svn: 98689	2010-03-16 22:30:13 +00:00
Douglas Gregor	42fe858cd6	Audit all callers of SourceManager::getCharacterData(); update some of them to recover more gracefully on failure. llvm-svn: 98672	2010-03-16 20:46:42 +00:00
Benjamin Kramer	eb92dc0b09	Let SourceManager::getBufferData return StringRef instead of a pair of two const char*. llvm-svn: 98630	2010-03-16 14:14:31 +00:00
Douglas Gregor	e0fbb83b8b	Give SourceManager a Diagnostic object with which to report errors, and start simplifying the interfaces in SourceManager that can fail. llvm-svn: 98594	2010-03-16 00:06:06 +00:00
Douglas Gregor	802b77601e	Introduce a new BufferResult class to act as the return type of SourceManager's getBuffer() (and similar) operations. This abstract can be used to force callers to cope with errors in getBuffer(), such as missing files and changed files. Fix a bunch of callers to use the new interface. Add some very basic checks for file consistency (file size, modification time) into ContentCache::getBuffer(), although these checks don't help much until we've updated the main callers (e.g., SourceManager::getSpelling()). llvm-svn: 98585	2010-03-15 22:54:52 +00:00
Chris Lattner	93ddf80eb7	don't inform comment handlers about comments in #if 0 blocks, doing so invalidates the file guard optimization and is not in the spirit of "#if 0" because it is supposed to completely skip everything, even if it isn't lexically valid. Patch by Abramo Bagnara! llvm-svn: 95253	2010-02-03 21:06:21 +00:00
Douglas Gregor	562c1f9365	Teach CIndex's cursor visitor to restrict its traversal to a specific region of interest (if provided). Implement clang_getCursor() in terms of this traversal rather than using the Index library; the unified cursor visitor is more complete, and will be The Way Forward. Minor other tweaks needed to make this work: - Extend Preprocessor::getLocForEndOfToken() to accept an offset from the end, making it easy to move to the last character in the token (rather than just past the end of the token). - In Lexer::MeasureTokenLength(), the length of whitespace is zero. llvm-svn: 94200	2010-01-22 19:49:59 +00:00
Chris Lattner	87d0208c41	allow the HandlerComment callback to push tokens into the preprocessor. This could be used by an OpenMP implementation or something. Patch by Abramo Bagnara! llvm-svn: 93795	2010-01-18 22:35:47 +00:00
Chris Lattner	21d9b9a948	add a TODO for a perf improvement in LexIdentifier. llvm-svn: 93141	2010-01-11 02:38:50 +00:00
Alexis Hunt	91b78382b5	Do not parse hexadecimal floating point literals in C++0x mode because they are incompatible with user-defined literals, specifically with the following form: 0x1p+1 The preprocessing-number token extends only as far as the 'p'; the '+' is not included. Previously we could get away with this extension as p was an invalid suffix, but now with user-defined literals, 'p' might well be a valid suffix and we are forced to consider it as such. This patch also adds a warning in non-0x C++ modes telling the user that this extension is incompatible with C++0x that is enabled by default (previously and with other languages, we warn only with a compliance option such as -pedantic). llvm-svn: 93135	2010-01-10 23:37:56 +00:00
Chris Lattner	3dfff974ec	reimplement r90860, fixing a couple of problems: 1. Don't make a copy of LangOptions every time a lexer is created. 2. Don't make CharInfo global mutable state. 3. Fix the implementation to properly treat ^Z as EOF instead of as horizontal whitespace, which matches the semantic implemented by VC++. llvm-svn: 91586	2009-12-17 05:29:40 +00:00
Chris Lattner	7c027ee4c2	teach clang to recover gracefully from conflict markers left in source files: PR5238. llvm-svn: 91270	2009-12-14 06:16:57 +00:00
Steve Naroff	04bc01833e	Integrate the following from the 'objective-rewrite' branch: http://llvm.org/viewvc/llvm-project?view=rev&revision=80043 llvm-svn: 90860	2009-12-08 16:38:12 +00:00
Douglas Gregor	53ad6b94b0	Extend the source manager with the ability to override the contents of files with the contents of an arbitrary memory buffer. Use this new functionality to drastically clean up the way in which we handle file truncation for code-completion: all of the truncation/completion logic is now encapsulated in the preprocessor where it belongs (<rdar://problem/7434737>). llvm-svn: 90300	2009-12-02 06:49:09 +00:00
Chris Lattner	710bb87147	Fix PR5633 by making the preprocessor handle the case where we can stat a file but where mmaping it fails. In this case, we emit an error like: t.c:1:10: fatal error: error opening file '../../foo.h' instead of "cannot find file". llvm-svn: 90110	2009-11-30 04:18:44 +00:00
Benjamin Kramer	5e738284d7	Move DISABLE_INLINE to the front of the decl so MSVC can parse it. Patch by Amine Khaldi! llvm-svn: 88797	2009-11-14 16:36:57 +00:00
Chris Lattner	a3d4f16b12	Teach Lexer::MeasureTokenLength to be able to measure the length of comment tokens. Patch by Abramo Bagnara! llvm-svn: 84100	2009-10-14 15:04:18 +00:00
Douglas Gregor	ea9b03e6e2	Replace the -code-completion-dump option with -code-completion-at=filename:line:column which performs code completion at the specified location by truncating the file at that position and enabling code completion. This approach makes it possible to run multiple tests from a single test file, and gives a more natural command-line interface. llvm-svn: 82571	2009-09-22 21:11:38 +00:00
Douglas Gregor	3545ff43f4	Refactor and simplify the CodeCompleteConsumer, so that all of the real work is performed within Sema. Addresses Chris's comments, but still retains the heavyweight list-of-multimaps data structure. llvm-svn: 82459	2009-09-21 16:56:56 +00:00
Douglas Gregor	2436e7116b	Initial implementation of a code-completion interface in Clang. In essence, code completion is triggered by a magic "code completion" token produced by the lexer [], which the parser recognizes at certain points in the grammar. The parser then calls into the Action object with the appropriate CodeCompletionXXX action. Sema implements the CodeCompletionXXX callbacks by performing minimal translation, then forwarding them to a CodeCompletionConsumer subclass, which uses the results of semantic analysis to provide code-completion results. At present, only a single, "printing" code completion consumer is available, for regression testing and debugging. However, the design is meant to permit other code-completion consumers. This initial commit contains two code-completion actions: one for member access, e.g., "x." or "p->", and one for nested-name-specifiers, e.g., "std::". More code-completion actions will follow, along with improved gathering of code-completion results for the various contexts. [] In the current -code-completion-dump testing/debugging mode, the file is truncated at the completion point and EOF is translated into "code completion". llvm-svn: 82166	2009-09-17 21:32:03 +00:00
Mike Stump	11289f4280	Remove tabs, and whitespace cleanups. llvm-svn: 81346	2009-09-09 15:08:12 +00:00
Chris Lattner	de50a0c251	Convert the CharInfo table to be statically initialized, instead of dynamically initialized. Patch by Ryan Flynn! llvm-svn: 74919	2009-07-07 17:09:54 +00:00
Chris Lattner	5c34938aa4	fix an out-of-date comment. llvm-svn: 74894	2009-07-07 05:05:42 +00:00
Douglas Gregor	c6d5edd2ed	Add support for retrieving the Doxygen comment associated with a given declaration in the AST. The new ASTContext::getCommentForDecl function searches for a comment that is attached to the given declaration, and returns that comment, which may be composed of several comment blocks. Comments are always available in an AST. However, to avoid harming performance, we don't actually parse the comments. Rather, we keep the source ranges of all of the comments within a large, sorted vector, then lazily extract comments via a binary search in that vector only when needed (which never occurs in a "normal" compile). Comments are written to a precompiled header/AST file as a blob of source ranges. That blob is only lazily loaded when one requests a comment for a declaration (this never occurs in a "normal" compile). The indexer testbed now supports comment extraction. When the -point-at location points to a declaration with a Doxygen-style comment, the indexer testbed prints the associated comment block(s). See test/Index/comments.c for an example. Some notes: - We don't actually attempt to parse the comment blocks themselves, beyond identifying them as Doxygen comment blocks to associate them with a declaration. - We won't find comment blocks that aren't adjacent to the declaration, because we start our search based on the location of the declaration. - We don't go through the necessary hops to find, for example, whether some redeclaration of a declaration has comments when our current declaration does not. Similarly, we don't attempt to associate a \param Foo marker in a function body comment with the parameter named Foo (although that is certainly possible). - Verification of my "no performance impact" claims is still "to be done". llvm-svn: 74704	2009-07-02 17:08:52 +00:00
Chris Lattner	c183595534	Fix our check for "random whitespace between a \ and newline" to work with dos style newlines. I have a trivial test for this: // RUN: clang-cc %s -verify #define test(x, y) \ x ## y but I don't know how to get svn to not change newlines and testrunner doesn't work with dos style newlines either, so "not worth it". :) rdar://6994000 llvm-svn: 73945	2009-06-23 05:15:06 +00:00
Chris Lattner	ff96dd0301	Fix rdar://6880630 - # in _Pragma does not start a preprocessor directive. llvm-svn: 71643	2009-05-13 06:10:29 +00:00
Eli Friedman	5d72d41189	Get rid of some useless uses of NoExtensions. The philosophy here is that if we're going to print an extension warning anyway, there's no point to changing behavior based on NoExtensions: it will only make error recovery worse. Note that this doesn't cause any behavior change because NoExtensions isn't used by the current front-end. I'm still considering what to do about the remaining use of NoExtensions in IdentifierTable.cpp. llvm-svn: 70273	2009-04-28 00:51:18 +00:00
Chris Lattner	40493eb6eb	fix rdar://6816766 - Crash with function-like macro test at end of directive. llvm-svn: 69964	2009-04-24 07:15:46 +00:00
Chris Lattner	38b2cde4c4	add a new Lexer::SkipEscapedNewLines method. llvm-svn: 69483	2009-04-18 22:27:02 +00:00

1 2 3 4 5 ...

315 Commits