llvm-project

Commit Graph

Author	SHA1	Message	Date
Argyrios Kyrtzidis	b4c83a13f6	[Tooling/DependencyScanning & Preprocessor] Refactor dependency scanning to produce pre-lexed preprocessor directive tokens, instead of minimized sources This is a commit with the following changes: * Remove `ExcludedPreprocessorDirectiveSkipMapping` and related functionality Removes `ExcludedPreprocessorDirectiveSkipMapping`; its intended benefit for fast skipping of excluded directived blocks will be superseded by a follow-up patch in the series that will use dependency scanning lexing for the same purpose. * Refactor dependency scanning to produce pre-lexed preprocessor directive tokens, instead of minimized sources Replaces the "source minimization" mechanism with a mechanism that produces lexed dependency directives tokens. * Make the special lexing for dependency scanning a first-class feature of the `Preprocessor` and `Lexer` This is bringing the following benefits: * Full access to the preprocessor state during dependency scanning. E.g. a component can see what includes were taken and where they were located in the actual sources. * Improved performance for dependency scanning. Measurements with a release+thin-LTO build shows ~ -11% reduction in wall time. * Opportunity to use dependency scanning lexing to speed-up skipping of excluded conditional blocks during normal preprocessing (as follow-up, not part of this patch). For normal preprocessing measurements show differences are below the noise level. Since, after this change, we don't minimize sources and pass them in place of the real sources, `DependencyScanningFilesystem` is not technically necessary, but it has valuable performance benefits for caching file `stat`s along with the results of scanning the sources. So the setup of using the `DependencyScanningFilesystem` during a dependency scan remains. Differential Revision: https://reviews.llvm.org/D125486 Differential Revision: https://reviews.llvm.org/D125487 Differential Revision: https://reviews.llvm.org/D125488	2022-05-26 12:50:06 -07:00
Argyrios Kyrtzidis	b58a420ff4	[Tooling/DependencyScanning] Rename refactorings towards transitioning dependency scanning to use pre-lexed preprocessor directive tokens This is first of a series of patches for making the special lexing for dependency scanning a first-class feature of the `Preprocessor` and `Lexer`. This patch only includes NFC renaming changes to make reviewing of the functionality changing parts easier. Differential Revision: https://reviews.llvm.org/D125484	2022-05-26 12:49:51 -07:00
Argyrios Kyrtzidis	42823beb1d	[Tooling/DependencyScanning] Make skipping excluded PP ranges during dependency scanning the default This is to improve maintenance a bit and remove need to maintain the additional option and related code-paths. Differential Revision: https://reviews.llvm.org/D124558	2022-04-28 15:23:03 -07:00
Krasimir Georgiev	e8cc7490d2	Revert "[clang-format] SortIncludes should support "@import" lines in Objective-C" This reverts commit `d46fa023ca`. Regressed include order in some cases with trailing comments, see the comments on https://reviews.llvm.org/D121370. Will add a regression test in a follow-up commit.	2022-04-28 11:00:32 +02:00
Ishaan Gandhi	87468e85fc	compile commands header to source heuristic lower-cases filenames before inferring file types This leads to ".C" files being rewritten as ".c" files and being inferred to be "c" files as opposed to "c++" files. Fixes https://github.com/clangd/clangd/issues/1108 Reviewed By: sammccall Differential Revision: https://reviews.llvm.org/D124262	2022-04-25 20:40:56 +02:00
Konrad Kleine	d46fa023ca	[clang-format] SortIncludes should support "@import" lines in Objective-C Fixes [[ https://github.com/llvm/llvm-project/issues/38995 \| #38995 ]] This is an attempt to modify the regular expression to identify `@import` and `import` alongside the regular `#include`. The challenging part was not to support `@` in addition to `#` but how to handle everything that comes after the `include\|import` keywords. Previously everything that wasn't `"` or `<` was consumed. But as you can see in this example from the issue #38995, there is no `"` or `<` following the keyword: ``` @import Foundation; ``` I experimented with a lot of fancy and useful expressions in [this online regex tool](https://regex101.com) only to find out that some things are simply not supported by the regex implementation in LLVM. * For example the beginning `[\t\ ]` should be replacable by the horizontal whitespace character `\h` but this will break the `SortIncludesTest.LeadingWhitespace` test. That's why I've chosen to come back to the basic building blocks. The essential change in this patch is the change from this regular expression: ``` ^[\t\ ]#[\t\ ](import\|include)[^"<](["<][^">][">]) ~ ~~~~~~~~~~~~~~ ^ ^ \| \| only support # prefix not @ \| only support "" and <> as delimiters no support for C++ modules and ; ending. Also this allows for "> or <" or "" or <> which all seems either off or wrong. ``` to this: ``` ^[\t\ ][@#][\t\ ](import\|include)([^"]("[^"]+")\|[^<](<[^>]+>)\|[\t\ ]([^;]+;)) ~~~~ ~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~ ^ ^ ^ ^ ^ \| \| \| \| \| Now support @ and #. Clearly support "" and <> as well as an include name without enclosing characters. Allows for no mixture of "> or <" or empty include names. ``` Here is how I've tested this patch: ``` ninja clang-Format ninja FormatTests ./tools/clang/unittests/Format/FormatTests --gtest_filter=SortIncludesTest ``` And if that worked I doubled checked that nothing else broke by running all format checks: ``` ./tools/clang/unittests/Format/FormatTests ``` One side effect of this change is it should partially support [C++20 Module](https://en.cppreference.com/w/cpp/language/modules) `import` lines without the optional `export` in front. Adding this can be a change on its own that shouldn't be too hard. I say partially because the `@` or `#` are currently NOT optional in the regular expression. I see an opportunity to optimized the matching to exclude `@include` for example. But eventually these should be caught by the compiler, so... With my change, the matching group is not at a fixed position any longer. I decided to choose the last match (group) that is not empty. Reviewed By: HazardyKnusperkeks Differential Revision: https://reviews.llvm.org/D121370	2022-04-20 07:03:35 +00:00
Jan Svoboda	7ed01ba88d	[clang][deps] NFC: Inline function with single caller	2022-04-15 16:24:40 +02:00
Jan Svoboda	d79ad2f1db	[clang][lex] NFCI: Use FileEntryRef in PPCallbacks::InclusionDirective() This patch changes type of the `File` parameter in `PPCallbacks::InclusionDirective()` from `const FileEntry *` to `Optional<FileEntryRef>`. With the API change in place, this patch then removes some uses of the deprecated `FileEntry::getName()` (e.g. in `DependencyGraph.cpp` and `ModuleDependencyCollector.cpp`). Reviewed By: dexonsmith, bnbarham Differential Revision: https://reviews.llvm.org/D123574	2022-04-14 10:46:12 +02:00
Eric Li	e334f044cd	[libTooling] Support TransformerResult<void> in consumer callbacks Support `TransformerResult<void>` in the consumer callback, which allows generic code to more naturally use the `Transformer` interface (instead of needing to specialize on `void`). This also delete the specialization that existed within `Transformer` itself, instead replacing it with an `std::function` adapter. Reviewed By: ymandel Differential Revision: https://reviews.llvm.org/D122499	2022-03-28 15:39:46 +00:00
Eric Li	9edeceaece	[libTooling] Generalize string explanation as templated metadata Change RewriteRule from holding an `Explanation` to being able to generate arbitrary metadata. Where TransformerClangTidyCheck was interested in a string description for the diagnostic, other tools may be interested in richer metadata at a higher level of abstraction than at the edit level (which is currently available as ASTEdit::Metadata). Reviewed By: ymandel Differential Revision: https://reviews.llvm.org/D120360	2022-03-21 20:39:35 +00:00
Yitzhak Mandelbaum	8351726e6d	Revert "[libTooling] Generalize string explanation as templated metadata" This reverts commit `18440547d3`. Causing failures in some build modes. e.g. https://lab.llvm.org/buildbot/#/builders/217/builds/1886	2022-03-21 19:06:59 +00:00
Eric Li	18440547d3	[libTooling] Generalize string explanation as templated metadata Change RewriteRule from holding an `Explanation` to being able to generate arbitrary metadata. Where TransformerClangTidyCheck was interested in a string description for the diagnostic, other tools may be interested in richer metadata at a higher level of abstraction than at the edit level (which is currently available as ASTEdit::Metadata). Reviewed By: ymandel Differential Revision: https://reviews.llvm.org/D120360	2022-03-21 18:45:39 +00:00
Jan Svoboda	1e25ff84d8	[clang][deps] Fix traversal of precompiled dependencies The code for traversing precompiled dependencies is somewhat complicated and contains a dangling iterator bug. This patch simplifies the code and fixes the bug. Reviewed By: dexonsmith Differential Revision: https://reviews.llvm.org/D121533	2022-03-16 12:17:53 +01:00
Jan Svoboda	d73daa9135	[clang][deps] Don't prune search paths used by dependencies When pruning header search paths (to reduce the number of modules we need to build explicitly), we can't prune the search paths used in (transitive) dependencies of a module. Otherwise, we could end up with either of the following dependency graphs: ``` X:<hash1> -> Y:<hash2> X:<hash1> -> Y:<hash3> ``` depending on the search paths of the translation unit we discovered `X` and `Y` from. This patch fixes that. Depends on D121295. Reviewed By: dexonsmith Differential Revision: https://reviews.llvm.org/D121303	2022-03-16 12:17:53 +01:00
Sam McCall	89cd86bbc5	Reapply [pseudo] Move pseudoparser from clang to clang-tools-extra" This reverts commit `049f4e4eab`. The problem was a stray dependency in CLANG_TEST_DEPS which caused cmake to fail if clang-pseudo wasn't built. This is now removed.	2022-03-16 01:10:55 +01:00
Sam McCall	049f4e4eab	Revert "[pseudo] Move pseudoparser from clang to clang-tools-extra" This reverts commit `b97856c4cf`. Breaks a bunch of bots: https://lab.llvm.org/buildbot/#/builders/193/builds/8513	2022-03-16 01:06:24 +01:00
Sam McCall	b97856c4cf	[pseudo] Move pseudoparser from clang to clang-tools-extra This should make clearer that: - it's not part of clang proper - there's no expectation to update it along with clang (beyond green tests) - clang should not depend on it This is intended to be expose a library, so unlike other tools has a split between include/ and lib/. The main renames are: clang/lib/Tooling/Syntax/Pseudo/* => clang-tools-extra/pseudo/lib/* clang/include/clang/Tooling/Syntax/Pseudo/* => clang-tools-extra/pseudo/include/clang-pseudo/* clang/tools/clang/pseudo/* => clang-tools-extra/pseudo/tool/* clang/test/Syntax/* => clang-tools-extra/pseudo/test/* clang/unittests/Tooling/Syntax/Pseudo/* => clang-tools-extra/pseudo/unittests/* #include "clang/Tooling/Syntax/Pseudo/" => #include "clang-pseudo/" namespace clang::syntax::pseudo => namespace clang::pseudo check-clang => check-clang-pseudo clangToolingSyntaxPseudo => clangPseudo The clang-pseudo and ClangPseudoTests binaries are not renamed. See discussion around: https://discourse.llvm.org/t/rfc-a-c-pseudo-parser-for-tooling/59217/50 Differential Revision: https://reviews.llvm.org/D121233	2022-03-16 00:14:11 +01:00
Jan Svoboda	cf4a31fc0f	[clang][deps] Remove '-fmodules-cache-path=' arguments With explicit modules build, the '-fmodules-cache-path=' argument is unused. This patch removes the argument to avoid warnings or errors (with '-Werror') stemming from that. Depends on D118915. Reviewed By: dexonsmith Differential Revision: https://reviews.llvm.org/D120474	2022-03-12 11:42:07 +01:00
Jan Svoboda	7f6af60746	[clang][deps] Generate '-fmodule-file=' only for direct dependencies The `clang-scan-deps` tool currently generates `-fmodule-file=` command-line arguments for the whole transitive closure of modular dependencies. This is not necessary, we only need to provide the direct dependencies on the command line. Information about transitive dependencies is stored within the `.pcm` files of direct dependencies. This makes the command lines shorter, but should be a NFC otherwise (unless there are bugs in the loading mechanism for explicit modules). Depends on D120465. Reviewed By: Bigcheese Differential Revision: https://reviews.llvm.org/D118915	2022-03-12 11:32:51 +01:00
Jan Svoboda	a6ef363546	[clang][deps] Disable implicit module maps Since D113473, we don't report any module map files via `-fmodule-map-file=` in explicit builds. The ultimate goal here is to make sure Clang doesn't open/read/parse/evaluate unnecessary module maps. However, implicit module maps still end up reading all reachable module maps. This patch disables implicit module maps in explicit builds. Unfortunately, we still need to report some module map files that aren't encoded in PCM files of dependencies: module maps that are necessary to correctly evaluate includes in modules marked as `[no_undeclared_includes]`. Depends on D120464. Reviewed By: dexonsmith Differential Revision: https://reviews.llvm.org/D120465	2022-03-12 11:07:21 +01:00
Haojian Wu	2d01ac18df	[pseudo] Strip comments for TokenStream. Add a utility function to strip comments from a "raw" tokenstream. The derived stream will be fed to the GLR parser (for early testing). Differential Revision: https://reviews.llvm.org/D121092	2022-03-07 20:24:37 +01:00
Haojian Wu	d5b8ecbd33	[pseudo] empty parameter-declaration should be allowed in lambda declarator. This was an oversight, as we did a avoild-nullable modication to parameter-declaration-clause. Differential Revision: https://reviews.llvm.org/D121089	2022-03-07 20:05:35 +01:00
Sam McCall	54d6b5b67f	[pseudo] Rename {Preprocess,PPStructure} -> DirectiveMap. NFC More precisely describes what this file does. Per comments on https://reviews.llvm.org/D121092	2022-03-07 17:41:35 +01:00
Sam McCall	68b4e2d703	[pseudo] Add readme Differential Revision: https://reviews.llvm.org/D121108	2022-03-07 15:54:00 +01:00
Haojian Wu	28ccf32672	[pseudo] Fix an out-of-bound access for LRTable::Actions. Without this patch, when End == Start, we access Actions[Actions.end()] though we return an empty result. This fixes an assertion failure in MSVC STL debug build.	2022-03-03 14:27:44 +01:00
Haojian Wu	05d7e9f68e	[pseudo] fix some comment nits, NFC.	2022-03-02 10:19:17 +01:00
Haojian Wu	28efb1ccf5	[pseudo] Fix an out-of-bound error in LRTable::find. The linear scan should not escape the TargetedStates range. Differential Revision: https://reviews.llvm.org/D120723	2022-03-02 09:53:52 +01:00
Dawid Jurczak	b3e2dac27c	[NFC] Don't pass temporary LangOptions to Lexer Since https://reviews.llvm.org/D120334 we shouldn't pass temporary LangOptions to Lexer. This change fixes stack-use-after-scope UB in LocalizationChecker found by sanitizer-x86_64-linux-fast buildbot and resolve similar issue in HeaderIncludes.	2022-02-28 20:43:28 +01:00
Haojian Wu	302ca279cb	[pseudo] fix an out-of-bound error in LRTable. Fix window debug build.	2022-02-23 21:34:54 +01:00
Sam McCall	7c1ee5e95f	[Pseudo] Token/TokenStream, PP directive parser. The TokenStream class is the representation of the source code that will be fed into the GLR parser. This patch allows a "raw" TokenStream to be built by reading source code. It also supports scanning a TokenStream to find the directive structure. Next steps (with placeholders in the code): heuristically choosing a path through #ifs, preprocessing the code by stripping directives and comments. These will produce a suitable stream to feed into the parser proper. Differential Revision: https://reviews.llvm.org/D119162	2022-02-23 17:52:02 +01:00
Jan Svoboda	19017c2435	[clang][deps] Return the whole TU command line The dependency scanner already generates canonical -cc1 command lines that can be used to compile discovered modular dependencies. For translation unit command lines, the scanner only generates additional driver arguments the build system is expected to append to the original command line. While this works most of the time, there are situations where that's not the case. For example with `-Wunused-command-line-argument`, Clang will complain about the `-fmodules-cache-path=` argument that's not being used in explicit modular builds. Combine that with `-Werror` and the build outright fails. To prevent such failures, this patch changes the dependency scanner to return the full driver command line to compile the original translation unit. This gives us more opportunities to massage the arguments into something reasonable. Reviewed By: Bigcheese Differential Revision: https://reviews.llvm.org/D118986	2022-02-23 15:46:20 +01:00
Jan Svoboda	80a696898c	[clang][deps] NFC: Update documentation In D113473, the dependency scanner stopped emitting "-fmodule-map-file=" arguments. Potential build systems are expected to not add any such arguments on their own. This commit removes mentions of such arguments to avoid confusion.	2022-02-23 15:46:20 +01:00
Aaron Ballman	b1a8dcf8c1	Silence some "not all control paths return a value" warnings; NFC	2022-02-23 09:18:56 -05:00
Haojian Wu	a2fab82f33	[pseudo] Implement LRTable. This patch introduces a dense implementation of the LR parsing table, which is used by LR parsers. We build a SLR(1) parsing table from the LR(0) graph. Statistics of the LR parsing table on the C++ spec grammar: - number of states: 1449 - number of actions: 83069 - size of the table (bytes): 334928 Differential Revision: https://reviews.llvm.org/D118196	2022-02-23 09:21:34 +01:00
Eric Li	d1e3235f60	[libTooling] Change Tranformer's consumer to take multiple changes Previously, Transformer would invoke the consumer once per file modified per match, in addition to any errors encountered. The consumer is not aware of which AtomicChanges come from any particular match. It is unclear which sets of edits may be related or whether an error invalidates any previously emitted changes. Modify the signature of the consumer to accept a set of changes. This keeps related changes (i.e. all edits from a single match) together, and clarifies that errors don't produce partial changes. Reviewed By: ymandel Differential Revision: https://reviews.llvm.org/D119745	2022-02-15 16:34:36 +00:00
Jan Svoboda	c6f8704053	[clang][deps] Disable global module index While scanning dependencies of a TU that depends on a PCH, the scanner basically performs mixed implicit/explicit modular compilation. (Explicit modules come from the PCH.) This seems to trip up the global module index. This patch disables global module index in the dependency scanner. Reviewed By: Bigcheese Differential Revision: https://reviews.llvm.org/D118890	2022-02-15 09:51:23 +01:00
Erich Keane	8073da0bee	[NFC] Fix sign-compare warning in GrammarBNF thanks to int promotion	2022-02-09 11:25:58 -08:00
Kirill Bobyrev	e3ba831937	[clang] Fix the tooling build after D119130 New StandardLibrary.cpp depends on Clang AST, add the dependency to CMakeLists.txt Broken builbot: https://lab.llvm.org/buildbot/#/builders/57/builds/14892	2022-02-09 11:52:03 +01:00
Haojian Wu	f1984b1433	[pseudo] Implement LRGraph LRGraph is the key component of the clang pseudo parser, it is a deterministic handle-finding finite-state machine, which is used to generated the LR parsing table. Separate from https://reviews.llvm.org/D118196. Differential Revision: https://reviews.llvm.org/D119172	2022-02-09 11:20:07 +01:00
Kirill Bobyrev	46a6f5ae14	[clangd] NFC: Move stdlib headers handling to Clang This will allow moving the IncludeCleaner library essentials to Clang and decoupling them from the majority of clangd. The patch itself just moves the code, it doesn't change existing functionality. Reviewed By: sammccall Differential Revision: https://reviews.llvm.org/D119130	2022-02-09 11:05:39 +01:00
Haojian Wu	fe932a88e9	[pseudo] Add first and follow set computation in Grammar. These will be used when building parsing table for LR parsers. Separate from https://reviews.llvm.org/D118196. Differential Revision: https://reviews.llvm.org/D118990	2022-02-09 09:16:27 +01:00
Haojian Wu	e1db505b42	[syntax][pseudo] Introduce the C++ spec grammar. Add a dummy clang-pseudo tool (right now it accepts and parses the grammar file). Differential Revision: https://reviews.llvm.org/D115856	2022-02-04 11:58:50 +01:00
Haojian Wu	b94f09524e	[pseudo] NFC, clangSyntaxPsuedo => clangToolingSyntaxPseudo To be consistent with existing name pattern.	2022-02-04 09:57:20 +01:00
Haojian Wu	20e05b9f0e	[syntax][pseudo] Add Grammar for the clang pseudo-parser This patch introduces the Grammar class, which is a critial piece for constructing a tabled-based parser. As the first patch, the scope is limited to: - define base types (symbol, rules) of modeling the grammar - construct Grammar by parsing the BNF file (annotations are excluded for now) Differential Revision: https://reviews.llvm.org/D114790	2022-02-03 11:28:27 +01:00
Simon Pilgrim	04754af925	Fix MSVC 'not all control paths return a value' warning. NFC.	2022-01-26 11:33:37 +00:00
Jan Svoboda	600c6714ac	[clang][syntax] Replace `std::vector<bool>` use LLVM Programmer’s Manual strongly discourages the use of `std::vector<bool>` and suggests `llvm::BitVector` as a possible replacement. This patch replaces `std::vector<bool>` with `llvm::BitVector` in the Syntax library and replaces range-based for loop with regular for loop. This is necessary due to `llvm::BitVector` not having `begin()` and `end()` (D117116). Reviewed By: dexonsmith, dblaikie Differential Revision: https://reviews.llvm.org/D118109	2022-01-26 11:20:18 +01:00
Yitzhak Mandelbaum	0944c196c5	[libTooling] Adds more support for constructing object access expressions. This patch adds a `buildAccess` function, which constructs a string with the proper operator to use based on the expression's form and type. It also adds two predicates related to smart pointers, which are needed by `buildAccess` but are also of general value. We deprecate `buildDot` and `buildArrow` in favor of the more general `buildAccess`. These will be removed in a future patch. Differential Revision: https://reviews.llvm.org/D116377	2022-01-25 19:43:36 +00:00
Jan Svoboda	8cc2a13727	[clang][deps] Handle symlinks in minimizing FS The minimizing and caching filesystem used by the dependency scanner can be configured to not minimize some files. That's necessary when scanning a TU with prebuilt inputs (i.e. PCH) that refer to the original (non-minimized) files. Minimizing such files in the dependency scanner would cause discrepancy between the current perceived state of the filesystem and the file sizes stored in the AST file. By not minimizing such files, we avoid creating the discrepancy. The problem with the current approach is that files that should not be minimized are identified by their path. This breaks down when the prebuilt input (PCH) and the current TU refer to the same file via different paths (i.e. symlinks). This patch switches from paths to `llvm::sys::fs::UniqueID` when identifying ignored files. This is consistent with how the rest of Clang treats files. Depends on D114966. Reviewed By: dexonsmith, arphaman Differential Revision: https://reviews.llvm.org/D114971	2022-01-21 13:04:25 +01:00
Jan Svoboda	5daeada330	[clang][deps] Ensure filesystem cache consistency The minimizing filesystem used by the dependency scanner isn't great when it comes to the consistency of its caches. There are two problems that can be exposed by a filesystem that changes during dependency scan: 1. In-memory cache entries for original and minimized files are distinct, populated at different times using separate stat/open syscalls. This means that when a file is read with minimization disabled, its contents might be inconsistent when the same file is read with minimization enabled at later point (and vice versa). 2. In-memory cache entries are indexed by filename. This is problematic for symlinks, where the contents of the symlink might be inconsistent with contents of the original file (for the same reason as in problem 1). This patch ensures consistency by always stating/reading a file exactly once. The original contents are always cached and minimized contents are derived from that on demand. The cache entries are now indexed by their `UniqueID` ensuring consistency for symlinks too. Moreover, the stat/read syscalls are now issued outside of critical section. Depends on D115935. Reviewed By: dexonsmith Differential Revision: https://reviews.llvm.org/D114966	2022-01-21 13:04:25 +01:00
Jan Svoboda	ced077e1ba	[clang][deps] NFC: Simplify handling of cached FS errors The return types of some `CachedFileSystemEntry` member function are needlessly complex. This patch attempts to simplify the code by unwrapping cached entries that represent errors early, and then asserting `!isError()`. Reviewed By: dexonsmith Differential Revision: https://reviews.llvm.org/D115935	2022-01-21 13:04:25 +01:00

1 2 3 4 5 ...

985 Commits