llvm-project

Commit Graph

Author	SHA1	Message	Date
Ted Kremenek	62224c1d7f	Add more PTH diagnostics for invalid PTH files, etc. llvm-svn: 63232	2009-01-28 21:02:43 +00:00
Ted Kremenek	3b0589e4b4	Enhance PTHManager::Create() to take an optional Diagnostic* argument that can be used to report issues such as a missing PTH file. llvm-svn: 63231	2009-01-28 20:49:33 +00:00
Ted Kremenek	8d178f4357	PTH: Use Token::setLiteralData() to directly store a pointer to cached spelling data in the PTH file. This removes a ton of code for looking up spellings using sourcelocations in the PTH file. This simplifies both PTH-generation and reading. Performance impact for -fsyntax-only on Cocoa.h (with Cocoa.h in the PTH file): - PTH generation time improves by 5% - PTH reading improves by 0.3%. llvm-svn: 63072	2009-01-27 00:01:05 +00:00
Ted Kremenek	327d00cd45	Silence warning. llvm-svn: 63054	2009-01-26 22:16:12 +00:00
Ted Kremenek	978b5becea	Add version number checking to PTH files. llvm-svn: 63047	2009-01-26 21:50:21 +00:00
Ted Kremenek	eb8c8fbd63	Embed the offset of the PTH table inside the prologue of the PTH file. This will help improve gradual versioning of PTH files instead of relying that the PTH table is at a fixed offset. llvm-svn: 63045	2009-01-26 21:43:14 +00:00
Chris Lattner	4fa23625ab	Check in the long promised SourceLocation rewrite. This lays the ground work for implementing #line, and fixes the "out of macro ID's" problem. There is nothing particularly tricky about the code, other than the very performance sensitive SourceManager::getFileID() method. llvm-svn: 62978	2009-01-26 00:43:02 +00:00
Chris Lattner	1f6c7fe6a8	This is a follow-up to r62675: Refactor how the preprocessor changes a token from being an tok::identifier to a keyword (e.g. tok::kw_for). Instead of doing this in HandleIdentifier, hoist this common case out into the caller, so that every keyword doesn't have to go through HandleIdentifier. This drops time in HandleIdentifier from 1.25ms to .62ms, and speeds up clang -Eonly with PTH by about 1%. llvm-svn: 62855	2009-01-23 18:35:48 +00:00
Chris Lattner	f8ccb4f9e3	Update comment. llvm-svn: 62819	2009-01-23 00:13:28 +00:00
Chris Lattner	34eab390b9	remove my gross #ifdef's, using portable abstractions now that the 32-bit load is always aligned. I verified that the bswap doesn't occur in the assembly code on x86. llvm-svn: 62815	2009-01-22 23:50:07 +00:00
Chris Lattner	fec5470f03	remove Read8/Read24, which are dead. Rename Read16/Read32 to be more descriptive. llvm-svn: 62775	2009-01-22 19:48:26 +00:00
Ted Kremenek	ae54f2f590	Fix <rdar://problem/6512717> by correctly reading the right offset in the token data in PTHLexer::getSourceLocation(). llvm-svn: 62725	2009-01-21 22:41:38 +00:00
Chris Lattner	3029b35faa	merge two checks for identifiers in the pth loop into one. llvm-svn: 62677	2009-01-21 07:50:06 +00:00
Chris Lattner	ad89ec013f	Add a bit to IdentifierInfo that acts as a simple predicate which tells us whether Preprocessor::HandleIdentifier needs to be called. Because this method is only rarely needed, this saves a call and a bunch of random checks. This drops the time in HandleIdentifier from 3.52ms to .98ms on cocoa.h on my machine. llvm-svn: 62675	2009-01-21 07:43:11 +00:00
Ted Kremenek	8d6c828728	Don't crash on empty PTH files. This fixes <rdar://problem/6512714>. llvm-svn: 62673	2009-01-21 07:34:28 +00:00
Chris Lattner	c950296006	really we only need on Read24! llvm-svn: 62672	2009-01-21 07:28:57 +00:00
Chris Lattner	47def9787e	revert my previous patch, it assumed endianness. llvm-svn: 62671	2009-01-21 07:21:56 +00:00
Chris Lattner	a74f7cbb9d	minor cleanups: now that tokens are 4-byte aligned in a PTH file, just load them directly as ints. llvm-svn: 62668	2009-01-21 07:06:08 +00:00
Ted Kremenek	52f73cad4a	Fix: <rdar://problem/6510344> [pth] PTH slows down regular lexer considerably (when it has substantial work) Changes to IdentifierTable: - High-level summary: StringMap never owns IdentifierInfos. It just references them. - The string map now has StringMapEntry<IdentifierInfo*> instead of StringMapEntry<IdentifierInfo>. The IdentifierInfo object is allocated using the same bump pointer allocator as used by the StringMap. Changes to IdentifierInfo: - Added an extra pointer to point to the StringMapEntry<IdentifierInfo*> in the string map. This pointer will be null if the IdentifierInfo* is only used by the PTHLexer (that is it isn't in the StringMap). Algorithmic changes: - Non-PTH case: IdentifierInfo::get() will always consult the StringMap first to see if we have an IdentifierInfo object. If that StringMapEntry references a null pointer, we allocate a new one from the BumpPtrAllocator and update the reference in the StringMapEntry. - PTH case: We do the same lookup as with the non-PTH case, but if we don't get a hit in the StringMap we do a secondary lookup in the PTHManager for the IdentifierInfo. If we don't find an IdentifierInfo we create a new one as in the non-PTH case. If we do find an IdentifierInfo in the PTHManager, we update the StringMapEntry to refer to it so that the IdentifierInfo will be found on the next StringMap lookup. This way we only do a binary search in the PTH file at most once for a given IdentifierInfo. This greatly speeds things up for source files containing a non-trivial amount of code. Performance impact: While these changes do add some extra indirection in IdentifierTable to access an IdentifierInfo, I saw speedups even in the non-PTH case as well. Non-PTH: For -fsyntax-only on Cocoa.h, we see a 6% speedup. PTH (with Cocoa.h in token cache): 11% speedup. I also did an experiment where we did -fsyntax-only on a source file including a large header and Cocoa.h, but the token cache did not contain the larger header. For this file, we were seeing a performance regression when using PTH of 3% over non-PTH. Now we are seeing a performance improvement of 9%! Tests: The serialization tests are now failing. I looked at this extensively, and my belief is that this change is unmasking a bug rather than introducing a new one. I have disabled the serialization tests for now. llvm-svn: 62636	2009-01-20 23:28:34 +00:00
Ted Kremenek	8433f0b400	PTH: Emitted tokens now consist of 12 bytes that are loaded using 3 32-bit loads. This reduces user time but increases system time because of the slightly larger PTH file. Although there is no performance win on Cocoa.h and -Eonly, overall this seems like a good step. llvm-svn: 62542	2009-01-19 23:13:15 +00:00
Chris Lattner	144aacd19e	rearrange GetIdentifierInfo so that the fast path can be partially inlined into PTHLexer::Lex. This speeds up the user time of PTH -Eonly by another 2ms (4.4%) llvm-svn: 62454	2009-01-18 02:57:21 +00:00
Chris Lattner	18fc6ceb56	rename some variables, only set a tokens identifierinfo if non-null. llvm-svn: 62450	2009-01-18 02:34:01 +00:00
Chris Lattner	9cdd877436	On i386 and x86-64, just do unaligned loads instead of assembling from bytes. This speeds up -Eonly PTH reading of cocoa.h by about 2ms, which is 4.2%. llvm-svn: 62447	2009-01-18 02:19:16 +00:00
Chris Lattner	137d6492a8	switch PTHLexer to use Read32 and friends instead of lots of inlined copies. I verified that this causes no performance change in PTH. llvm-svn: 62445	2009-01-18 02:10:31 +00:00
Chris Lattner	eb09754a9d	switch PTH lexer from using "const char"s to "const unsigned char"s internally. This is just a cleanup that reduces the need to cast to unsigned char before assembling a larger integer. llvm-svn: 62442	2009-01-18 01:57:14 +00:00
Chris Lattner	ab1d4b8abd	simplify PTHManager::CreateLexer llvm-svn: 62424	2009-01-17 08:06:50 +00:00
Chris Lattner	3793bba26f	suck the call to "getSpellingLoc" that all clients do into the implementation of PTHManager::getSpelling. llvm-svn: 62408	2009-01-17 06:29:33 +00:00
Chris Lattner	d32480d3db	this massive patch introduces a simple new abstraction: it makes "FileID" a concept that is now enforced by the compiler's type checker instead of yet-another-random-unsigned floating around. This is an important distinction from the "FileID" currently tracked by SourceLocation. That FileID may refer to the start of a file or to a chunk within it. The new FileID only refers to the file (and its #include stack and eventually #line data), it cannot refer to a chunk. FileID is a completely opaque datatype to all clients, only SourceManager is allowed to poke and prod it. llvm-svn: 62407	2009-01-17 06:22:33 +00:00
Chris Lattner	53e384f633	Change some terminology in SourceLocation: instead of referring to the "physical" location of tokens, refer to the "spelling" location. This is more concrete and useful, tokens aren't really physical objects! llvm-svn: 62309	2009-01-16 07:00:02 +00:00
Ted Kremenek	4bbb79a642	PTH: Fix termination condition in binary search. llvm-svn: 62277	2009-01-15 19:28:38 +00:00
Ted Kremenek	a705b04d7f	IdentifierInfo: - IdentifierInfo can now (optionally) have its string data not be co-located with itself. This is for use with PTH. This aspect is a little gross, as getName() and getLength() now make assumptions about a possible alternate representation of IdentifierInfo. Perhaps we should make IdentifierInfo have virtual methods? IdentifierTable: - Added class "IdentifierInfoLookup" that can be used by IdentifierTable to perform "string -> IdentifierInfo" lookups using an auxiliary data structure. This is used by PTH. - Performance tests show that IdentifierTable::get() does not slow down because of the extra check for the IdentifierInfoLookup object (the regular StringMap lookup does enough work to mitigate the impact of an extra null pointer check). - The upshot is that now some IdentifierInfo objects might be owned by the IdentifierInfoLookup object. This should be reviewed. PTH: - Modified PTHManager::GetIdentifierInfo to not insert entries in IdentifierTable's string map, and instead create IdentifierInfo objects on the fly when mapping from persistent IDs to IdentifierInfos. This saves a ton of work with string copies, hashing, and StringMap lookup and resizing. This change was motivated because when processing source files in the PTH cache we don't need to do any string -> IdentifierInfo lookups. - PTHManager now subclasses IdentifierInfoLookup, allowing clients of IdentifierTable to transparently use IdentifierInfo objects managed by the PTH file. PTHManager resolves "string -> IdentifierInfo" queries by doing a binary search over a sorted table of identifier strings in the PTH file (the exact algorithm we use can be changed as needed). These changes lead to the following performance changes when using PTH on Cocoa.h: - fsyntax-only: 10% performance improvement - Eonly: 30% performance improvement llvm-svn: 62273	2009-01-15 18:47:46 +00:00
Ted Kremenek	bef9fc2240	PTH: Embed a persistentID side-table in the PTH file that is sorted in the lexical order of the corresponding identifier strings. This will be used for a forthcoming optimization. This slows down PTH generation time by 7%. We can revert this change if the optimization proves to not be valuable. llvm-svn: 62248	2009-01-15 01:26:25 +00:00
Ted Kremenek	e9814186ac	PTH: - Use canonical FileID when using getSpelling() caching. This addresses some cache misses we were seeing with -fsyntax-only on Cocoa.h - Added Preprocessor::getPhysicalCharacterAt() utility method for clients to grab the first character at a specified sourcelocation. This uses the PTH spelling cache. - Modified Sema::ActOnNumericConstant() to use Preprocessor::getPhysicalCharacterAt() instead of SourceManager::getCharacterData() (to get PTH hits). These changes cause -fsyntax-only to not page in any sources from Cocoa.h. We see a speedup of 27%. llvm-svn: 62193	2009-01-13 23:19:12 +00:00
Ted Kremenek	7cbdcc25d4	Fix corner cases in PTH getSpelling() binary search. llvm-svn: 62187	2009-01-13 22:16:45 +00:00
Ted Kremenek	b0b4f74b6b	PTH: Fix remaining cases where the spelling cache in the PTH file was being missed when it shouldn't. This shaves another 7% off PTH time for -Eonly on Cocoa.h llvm-svn: 62186	2009-01-13 22:05:50 +00:00
Ted Kremenek	47b8cf6deb	Enhance PTH 'getSpelling' caching: - Refactor caching logic into a helper class PTHSpellingSearch - Allow "random accesses" in the spelling cache, thus catching the remaining cases where 'getSpelling' wasn't hitting the PTH cache For -Eonly, PTH, Cocoa.h: - This reduces wall time by 3% (user time unchanged, sys time reduced) - This reduces the amount of paged source by 1112K. The remaining 1112K still being paged in is from somewhere else (investigating). llvm-svn: 62009	2009-01-09 22:05:30 +00:00
Ted Kremenek	8ae06625b5	Invert assertion condition. llvm-svn: 61961	2009-01-09 00:36:11 +00:00
Ted Kremenek	d5e6e16d0d	PTH: Hook up getSpelling() caching in PTHLexer. This results in a nice performance gain. Here's what we see for -Eonly on Cocoa.h (using PTH): - wall time decreases by 21% (26% speedup overall) - system time decreases by 35% - user time decreases by 6% These reductions are due to not paging source files just to get spellings for literals. The solution in place doesn't appear to be 100% yet, as we still see some of the pages for source files getting mapped in. Using -print-stats, we see that SourceManager maps in 7179K less bytes of source text (reduction of 75%). Will investigate why the remaining 25% are getting paged in. With these changes, here's how PTH compares to non-PTH on Cocoa.h: -Eonly: PTH takes 64% of the time as non-PTH (54% speedup) -fsyntax-only: PTH takes 89% of the time as non-PTH (11% speedup) llvm-svn: 61913	2009-01-08 04:30:32 +00:00
Ted Kremenek	884a558441	PTH: - Added stub PTHLexer::getSpelling() that will be used for fetching cached spellings from the PTH file. This doesn't do anything yet. - Added a hook in Preprocessor::getSpelling() to call PTHLexer::getSpelling() when using a PTHLexer. - Updated PTHLexer to read the offsets of spelling tables in the PTH file. llvm-svn: 61911	2009-01-08 02:47:16 +00:00
Ted Kremenek	78cc24730e	PTH: Remove some methods and simplify some conditions in PTHLexer::Lex(). No big functionality change. llvm-svn: 61381	2008-12-23 19:24:24 +00:00
Ted Kremenek	a754c40390	PTH: Use 3 bytes instead of 4 bytes to encode the persistent ID for a token. - This reduces the PTH size for Cocoa.h by 7%. - This increases PTH -Eonly speed for Cocoa.h by 0.8%. llvm-svn: 61377	2008-12-23 18:41:34 +00:00
Ted Kremenek	3f94706e57	Cosmetics: rename a variable and tighten spacing. No functionality change. llvm-svn: 61375	2008-12-23 18:27:26 +00:00
Ted Kremenek	1bd0a550d0	PTH: - Encode the token length with 2 bytes instead of 4. - This reduces the size of the .pth file for Cocoa.h by 12%. - This speeds up PTH time (-Eonly) on Cocoa.h by 1.6%. llvm-svn: 61364	2008-12-23 02:52:12 +00:00
Ted Kremenek	66076a964b	PTH: - In PTHLexer::Lex read all of the token data from PTH file before constructing the token. The idea is to enhance locality. - Do not use Read8/Read32 in PTHLexer::Lex. Inline these operations manually. - Change PTHManager::ReadIdentifierInfo() to PTHManager::GetIdentifierInfo(). They are functionally the same except that PTHLexer::Lex() reads the persistent id. These changes result in a 3.3% speedup for PTH on Cocoa.h (-Eonly). llvm-svn: 61363	2008-12-23 02:30:15 +00:00
Ted Kremenek	1b18ad240c	PTH: - Embed 'eom' tokens in PTH file. - Use embedded 'eom' tokens to not lazily generate them in the PTHLexer. This means that PTHLexer can always advance to the next token after reading a token (instead of buffering tokens using a copy). - Moved logic of 'ReadToken' into Lex. GetToken & ReadToken no longer exist. - These changes result in a 3.3% speedup (-Eonly) on Cocoa.h. - The code is a little gross. Many cleanups are possible and should be done. llvm-svn: 61360	2008-12-23 01:30:52 +00:00
Ted Kremenek	9443f0ea5e	Use '&' to test StartOfLine flag. llvm-svn: 61205	2008-12-18 18:15:29 +00:00
Ted Kremenek	aceeb25660	Rewrite PTHLexer::DiscardToEndOfLine() to not use GetToken and instead only read the bytes needed to determine if a token is not at the start of the line. llvm-svn: 61172	2008-12-17 23:52:11 +00:00
Ted Kremenek	63ff81c4e1	Change PTHLexer::getSourceLocation() to not call GetToken() and instead just read the file offset in the token data buffer directly. llvm-svn: 61170	2008-12-17 23:36:32 +00:00
Chris Lattner	d88c933970	add a dropped word back llvm-svn: 61152	2008-12-17 21:38:44 +00:00
Ted Kremenek	a7d73b1fd4	Shadow CurPtr with a local variable in ReadToken. llvm-svn: 61145	2008-12-17 18:38:19 +00:00
Ted Kremenek	877556f4b9	PTH: Added minor 'sibling jumping' optimization for iterating over the side table used for fast preprocessor block skipping. This has a minor performance improvement when preprocessing Cocoa.h, but can have some wins in pathologic cases. llvm-svn: 60966	2008-12-12 22:05:38 +00:00
Ted Kremenek	56572ab9e9	Added PTH optimization to not process entire blocks of tokens that appear in skipped preprocessor blocks. This improves PTH speed by 6%. The code for this optimization itself is not very optimized, and will get cleaned up. llvm-svn: 60956	2008-12-12 18:34:08 +00:00
Ted Kremenek	864eb39233	PTH: - Added a side-table per each token-cached file with the preprocessor conditional stack. This tracks what #if's are matched with what #endifs and where their respective tokens are in the PTH file. This will allow for quick skipping of excluded conditional branches in the Preprocessor. - Performance testing shows the addition of this information (without actually utilizing it) leads to no performance regressions. llvm-svn: 60911	2008-12-11 23:36:38 +00:00
Ted Kremenek	ca153f7349	PTHLexer: Keep track of the location of the last '#' token and provide the means to jump ahead in the token stream. llvm-svn: 60905	2008-12-11 22:41:47 +00:00
Ted Kremenek	67ab296d5c	Remove unused ivar CurTokenIdx. llvm-svn: 60896	2008-12-11 20:39:48 +00:00
Ted Kremenek	d40ab7b72a	Declare PerIDCache as IdentifierInfo** instead of void*. This is just cleaner. No performance change. llvm-svn: 60843	2008-12-10 19:40:23 +00:00
Ted Kremenek	1aed3ddffa	Remove unneeded assertion. llvm-svn: 60559	2008-12-04 22:47:11 +00:00
Ted Kremenek	baedbf47f6	Use 'free' to release PerIDCache since it was allocated using calloc(). llvm-svn: 60556	2008-12-04 22:09:37 +00:00
Ted Kremenek	73a4d28758	PTH: Use an array instead of a DenseMap to cache persistent IDs -> IdentifierInfo*. This leads to a 4% speedup at -fsyntax-only using PTH. llvm-svn: 60452	2008-12-03 01:16:39 +00:00
Ted Kremenek	33eeabda61	- Remove PTHManager.cpp. Move all of its functions to PTHLexer.cpp since some of the internal methods are used by PTHLexer (their implementations are intertwined.) This enables some important inlining opportunities at -O3. - Don't construct an std::vector<Token> prior to feeding PTH tokens to the Preprocessor. Stream them off the PTH file directly. llvm-svn: 60447	2008-12-03 00:38:03 +00:00
Ted Kremenek	1f50dc899f	PTHLexer now owns the Token vector. llvm-svn: 60136	2008-11-27 00:38:24 +00:00
Ted Kremenek	6b3ced2b15	In PTHLexer::DiscardToEndOfLine() use Lex() instead of AdvanceToken(). This handles transitions in the preprocessor state. llvm-svn: 59845	2008-11-21 23:28:56 +00:00
Ted Kremenek	53ab374d9f	PTHLexer: - Move out logic for handling the end-of-file to LexEndOfFile (to match the Lexer) class. The logic now mirrors the Lexer class more, which allows us to pass most of the Preprocessor test cases. llvm-svn: 59768	2008-11-21 00:58:35 +00:00
Ted Kremenek	111caaac58	PTHLexer: - Move PTHLexer::GetToken() to be inside PTHLexer.cpp. - When lexing in raw mode, null out identifiers. llvm-svn: 59744	2008-11-20 19:49:00 +00:00
Ted Kremenek	94981e1f23	PTHLexer: - Rename 'CurToken' and 'LastToken' to 'CurTokenIdx' and 'LastTokenIdx' respectively. - Add helper methods GetToken(), AdvanceToken(), AtLastToken() to abstract away details of the token stream. This also allows us to easily replace their implementation later. llvm-svn: 59733	2008-11-20 16:32:22 +00:00
Ted Kremenek	c490c8877c	Rewrote PTHLexer::Lex by digging through the sources of Lexer again. Now we can do basic macro expansion using the PTHLexer. llvm-svn: 59724	2008-11-20 07:58:05 +00:00
Ted Kremenek	b0262c1e64	- Default initialize ParsingPreprocessorDirective, ParsingFilename, and LexingRawMode in the ctor of PreprocessorLexer. - PTHLexer: Use "LastToken" instead of "NumToken" llvm-svn: 59690	2008-11-20 01:29:45 +00:00
Ted Kremenek	61915f5d4a	Add (untested) implementation of PTHLexer::isNextPPTokenLParen() and PTHLexer::DiscardToEndOfLine(). llvm-svn: 59687	2008-11-20 01:16:50 +00:00
Ted Kremenek	11cfbb473e	Add stub for PTHLexer::isNextPPTokenLParen(). llvm-svn: 59670	2008-11-19 22:42:26 +00:00
Ted Kremenek	76c3441a4e	When using a PTHLexer, use DiscardToEndOfLine() instead of ReadToEndOfLine(). llvm-svn: 59668	2008-11-19 22:21:33 +00:00
Ted Kremenek	45245217bc	- Move static function IsNonPragmaNonMacroLexer into Preprocessor.h. - Add variants of IsNonPragmaNonMacroLexer to accept an IncludeMacroStack entry (simplifies some uses). - Use IsNonPragmaNonMacroLexer in Preprocessor::LookupFile. - Add 'FileID' to PreprocessorLexer, and have Preprocessor query this fileid when looking up the FileEntry for a file Performance testing of -Eonly on Cocoa.h shows no performance regression because of this patch. llvm-svn: 59666	2008-11-19 21:57:25 +00:00
Chris Lattner	1b03f76113	Trivial tidying llvm-svn: 59424	2008-11-16 20:22:05 +00:00
Ted Kremenek	66312a3ff4	Move some diagnostic handling to PreprocessorLexer. llvm-svn: 59191	2008-11-12 23:13:54 +00:00
Ted Kremenek	7cd62457c4	Add skeleton for PTH lexer. llvm-svn: 59169	2008-11-12 21:37:15 +00:00
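
The two-tier identifier lookup described in r62273 and r62636 above (consult the string map first, fall back to a binary search over a sorted identifier table, then remember the hit so the search runs at most once per name) can be sketched roughly as follows. This is only an illustration under stated assumptions, not clang's code: `IdentInfo`, `SortedIdentCache`, and `IdentTable` are hypothetical stand-ins for `IdentifierInfo`, `PTHManager`, and `IdentifierTable`, and `std::unordered_map` with `std::unique_ptr` replaces the `StringMap` and bump pointer allocator the commits mention.

```cpp
#include <algorithm>
#include <cassert>
#include <memory>
#include <string>
#include <unordered_map>
#include <utility>
#include <vector>

// Hypothetical stand-in for clang's IdentifierInfo.
struct IdentInfo {
  std::string name;
  bool fromCache;  // true if the object lives in the sorted cache below
};

// Hypothetical stand-in for the sorted identifier table of a PTH file.
class SortedIdentCache {
public:
  explicit SortedIdentCache(std::vector<std::string> names) {
    std::sort(names.begin(), names.end());
    for (auto &n : names)
      entries_.push_back(IdentInfo{std::move(n), true});
  }

  // Binary search over the sorted names; returns nullptr on a miss.
  IdentInfo *find(const std::string &name) {
    ++searches;  // counted only so the caching effect is observable
    auto it = std::lower_bound(
        entries_.begin(), entries_.end(), name,
        [](const IdentInfo &e, const std::string &key) { return e.name < key; });
    return (it != entries_.end() && it->name == name) ? &*it : nullptr;
  }

  int searches = 0;

private:
  std::vector<IdentInfo> entries_;  // not modified after construction
};

// Hypothetical stand-in for IdentifierTable: the map only references
// IdentInfo objects, it never stores them by value.
class IdentTable {
public:
  explicit IdentTable(SortedIdentCache *cache = nullptr) : cache_(cache) {}

  IdentInfo &get(const std::string &name) {
    IdentInfo *&slot = map_[name];  // creates a null slot on first sight
    if (slot)
      return *slot;                 // fast path: name already resolved
    if (cache_)
      slot = cache_->find(name);    // secondary lookup, once per name
    if (!slot) {                    // miss everywhere: allocate a new one
      owned_.push_back(std::make_unique<IdentInfo>(IdentInfo{name, false}));
      slot = owned_.back().get();
    }
    return *slot;
  }

private:
  std::unordered_map<std::string, IdentInfo *> map_;
  std::vector<std::unique_ptr<IdentInfo>> owned_;
  SortedIdentCache *cache_;
};
```

Calling `get()` twice with the same name returns the same object and leaves `searches` unchanged on the second call, which is the "at most once" property the commit message describes. Passing no cache models the non-PTH case, where every miss simply allocates.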


174 Commits