llvm-project

Commit Graph

Author	SHA1	Message	Date
Johannes Doerfert	7dfad948f1	[OpenMP][FIX] Repair ExclusiveAccess move semantic snafu	2022-03-25 16:00:53 -05:00
Johannes Doerfert	b9fd8f34ae	[OpenMP][NFC] Add missing virtual destructor to silence warning	2022-03-25 16:00:53 -05:00
William S. Moses	89525cbf28	[Clang] Add helper method to determine if a nonvirtual base has an entry in the LLVM struct This patch adds a helper method to determine if a nonvirtual base has an entry in the LLVM struct. Such a base may not have an entry if the base does not have any fields/bases itself that would change the size of the struct. This utility method is useful for other frontends (Polygeist) that use Clang as an API to generate code. Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D122502	2022-03-25 16:32:12 -04:00
Paul Robinson	6aa0397758	Remove dead code in driver parsing -gsimple-template-names= options While -g[no-]simple-template-names is a driver option, the fancier -gsimple-template-names={simple,mangled} option is cc1-only, so code to handle it in the driver is dead. Differential Revision: https://reviews.llvm.org/D122503	2022-03-25 13:23:24 -07:00
Peter Klausler	2ab9990c9e	[flang] Add & use a better visit() Adds flang/include/flang/Common/visit.h, which defines a Fortran::common::visit() template function that is a drop-in replacement for std::visit(). Modifies most use sites in the front-end and runtime to use common::visit(). The C++ standard mandates that std::visit() have O(1) execution time, which forces implementations to build dispatch tables. This new common::visit() is O(log2 N) in the number of alternatives in a variant<>, but that N tends to be small and so this change produces a fairly significant improvement in compiler build memory requirements, a 5-10% improvement in compiler build time, and a small improvement in compiler execution time. Building with -DFLANG_USE_STD_VISIT causes common::visit() to be an alias for std::visit(). Calls to common::visit() with multiple variant arguments are referred to std::visit(), pending further work. Differential Revision: https://reviews.llvm.org/D122441	2022-03-25 13:15:20 -07:00
Hongtao Yu	e25f4e4c4a	[PseudoProbe] Do not emit pseudo probes when module is not probed. There is a case when a function has pseudo probe intrinsics but the module it resides does not have the probe desc. This could happen when the current module is not built with `-fpseudo-probe-for-profiling` while a function in it calls some other function from a probed module. In thinLTO mode, the callee function could be imported and inlined into the current function. While this is undefined behavior, I'm fixing the asm printer to not ICE and warn user about this. Reviewed By: wenlei Differential Revision: https://reviews.llvm.org/D121737	2022-03-25 12:59:53 -07:00
Emilio Cota	0e1e3ccc98	[bazel] add missing targets since `3be7c28917`	2022-03-25 15:41:29 -04:00
Adrian Prantl	1f98e09bf8	Add missing include diagnosed in modules build. (NFC)	2022-03-25 12:40:08 -07:00
Martin Storsjö	9a3eeae321	[clang-tidy] Fix the condition for building CTTestTidyModule This is the correct intended condition; the problematic case where we don't want to try to build the plugin is "WIN32 AND LLVM_LINK_LLVM_DYLIB" and thus the negation is "NOT WIN32 OR NOT LLVM_LINK_LLVM_DYLIB". Differential Revision: https://reviews.llvm.org/D121687	2022-03-25 21:22:46 +02:00
Sam McCall	57ee624d79	[cmake] Provide CURRENT_TOOLS_DIR centrally, replacing CLANG_TOOLS_DIR CLANG_TOOLS_DIR holds the current bin/ directory, maybe with a %(build_mode) placeholder. It is used to add the just-built binaries to $PATH for lit tests. In most cases it equals LLVM_TOOLS_DIR, which is used for the same purpose. But for a standalone build of clang, CLANG_TOOLS_DIR points at the build tree and LLVM_TOOLS_DIR points at the provided LLVM binaries. Currently CLANG_TOOLS_DIR is set in clang/test/, clang-tools-extra/test/, and other things always built with clang. This is a few cryptic lines of CMake in each place. Meanwhile LLVM_TOOLS_DIR is provided by configure_site_lit_cfg(). This patch moves CLANG_TOOLS_DIR to configure_site_lit_cfg() and renames it: - there's nothing clang-specific about the value - it will also replace LLD_TOOLS_DIR, LLDB_TOOLS_DIR etc (not in this patch) It also defines CURRENT_LIBS_DIR. While I removed the last usage of CLANG_LIBS_DIR in `e4cab4e24d`, there are LLD_LIBS_DIR usages etc that may be live, and I'd like to mechanically update them in a followup patch. Differential Revision: https://reviews.llvm.org/D121763	2022-03-25 20:22:01 +01:00
Chia-hung Duan	dda003afa6	[mlir] Add InferTensorType without supporting reifyReturnTypeShapes This is useful for the case that we don't need to implement reifyReturnTypeShapes. Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D121403	2022-03-25 19:04:34 +00:00
Philip Reames	f80aaa675f	[SLP] Simplify eraseInstruction [NFC] This simplifies the implementation of eraseInstruction by moving the odd-replace-users-with-undef handling back to the only caller which uses it. This handling was not obviously correct, so add the asserts which make it clear why this is safe to do at all. The result is simpler code and stronger assertions.	2022-03-25 12:01:52 -07:00
LLVM GN Syncbot	a78bd83264	[gn build] Port `cef52105bd`	2022-03-25 18:54:35 +00:00
Douglas Yung	cef52105bd	Revert "[clang-tidy] Add modernize-macro-to-enum check" This reverts commit `39b80c8380`. This change was causing build failures on several build bots: - https://lab.llvm.org/buildbot/#/builders/139/builds/19210 - https://lab.llvm.org/buildbot/#/builders/93/builds/7956	2022-03-25 11:53:42 -07:00
Corentin Jabot	26e201b796	[Clang] Fix error in Documentation introduced by `3784e8cc` [nfc]. The documentation contained extra space. Also remove https://github.com/llvm/llvm-project/issues/54296 from the list of issues by `3784e8cc` as this commit did not fix it (nor was it supposed to).	2022-03-25 19:38:36 +01:00
Peter Klausler	0363a164b6	[flang] Fix bogus error from assignment to CLASS(*) Assignment semantics was coughing up bad errors and crashes for intrinsic assignments to unlimited polymorphic entities while looking for any (impossible) user defined ASSIGNMENT(=) generic or intrinsic type conversion. Differential Revision: https://reviews.llvm.org/D122440	2022-03-25 11:17:01 -07:00
Corentin Jabot	3784e8ccfb	[Clang] Fix Unevaluated Lambdas Unlike other types, when lambdas are instantiated, they are recreated from scratch. When an unevaluated lambda appears in the type of a function parameter, it is instantiated in the wrong declaration context, as parameters are transformed before the function. To support lambdas in function parameters, we try to compute whether they are dependent without looking at the declaration context. This is a short term stopgap solution to avoid clang ICEing. A better fix might be to inject some kind of transparent declaration with correctly computed dependency for function parameters, variable templates, etc. Fixes https://github.com/llvm/llvm-project/issues/50376 Fixes https://github.com/llvm/llvm-project/issues/51414 Fixes https://github.com/llvm/llvm-project/issues/51416 Fixes https://github.com/llvm/llvm-project/issues/51641 Fixes https://github.com/llvm/llvm-project/issues/54296 Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D121532	2022-03-25 19:16:45 +01:00
Florian Hahn	171cdba867	[Clang,TBAA] Use pattern for metadata reference in test. Update the single check line that still had a hard-coded metadata reference. This makes it more robust to slight changes in the metadata numbering.	2022-03-25 18:12:39 +00:00
Florian Hahn	8c3281db49	[ConstraintElimination] Use AddOverflow for offset summation. Fixes an incorrect transformation due to values overflowing https://alive2.llvm.org/ce/z/uizoea	2022-03-25 18:08:24 +00:00
Florian Hahn	8530259985	[ConstraintElimination] Add test where offset additions overflow.	2022-03-25 18:08:18 +00:00
Emil Kieri	35b89bc24c	[clang][driver] Disable non-functional --version option for clang -cc1 This patch removes --version as a clang -cc1 option. clang --version and clang --cc1 -version remain valid. This behaviour is consistent with clang -cc1as. Previously, clang -cc1 accepted both --version and -version, but only -version was acted upon. The call clang -cc1 --version stalled without any message: --version was an accepted option but triggered no action, and the driver waited for standard input. Reviewed By: thakis Differential Revision: https://reviews.llvm.org/D122344	2022-03-25 19:04:27 +01:00
Nathan James	b97f26083b	Reland "[ASTMatchers] Output currently processing match and nodes on crash" This reverts commit `cff34ccb60`. This relands commit `d89f9e963e`	2022-03-25 17:53:58 +00:00
Philip Reames	48cc9287f5	Reapply "[SLP] Schedule only sub-graph of vectorizable instructions" (try 3) The original commit exposed several missing dependencies (e.g. latent bugs in SLP scheduling). Most of these were fixed over the weekend and have had several days to bake. The last was fixed this morning after being noticed in manual review of test changes yesterday. See the review thread for links to each change. Original commit message follows: SLP currently schedules all instructions within a scheduling window which stretches from the first instruction potentially vectorized to the last. This window can include a very large number of unrelated instructions which are not being considered for vectorization. This change switches the code to only schedule the sub-graph consisting of the instructions being vectorized and their transitive users. This has the effect of greatly reducing the amount of work performed in large basic blocks, and thus greatly improves compile time on degenerate examples. To understand the effects, I added some statistics (not planned for upstream contribution). Here's an illustration from my motivating example: Before this patch: 704357 SLP - Number of calcDeps actions 699021 SLP - Number of schedule calls 5598 SLP - Number of ReSchedule actions 59 SLP - Number of ReScheduleOnFail actions 10084 SLP - Number of schedule resets 8523 SLP - Number of vector instructions generated After this patch: 102895 SLP - Number of calcDeps actions 161916 SLP - Number of schedule calls 5637 SLP - Number of ReSchedule actions 55 SLP - Number of ReScheduleOnFail actions 10083 SLP - Number of schedule resets 8403 SLP - Number of vector instructions generated I do want to highlight that there is a small difference in number of generated vector instructions. This example is hitting the bailout due to maximum window size, and the change in scheduling is slightly perturbing when and how we hit it. This can be seen in the RescheduleOnFail counter change. Given that, I think we can safely ignore. The downside of this change can be seen in the large test diff. We group all vectorizable instructions together at the bottom of the scheduling region. This means that vector instructions can move quite far from their original point in code. While maybe undesirable, I don't see this as being a major problem as this pass is not intended to be a general scheduling pass. For context, it's worth noting that the pre-scheduling that SLP does while building the vector tree is exactly the sub-graph scheduling implemented by this patch. Differential Revision: https://reviews.llvm.org/D118538	2022-03-25 10:39:23 -07:00
Christopher Bate	3be7c28917	[mlir][NVVM] Add support for nvvm mma.sync ops This patch adds MLIR NVVM support for the various NVPTX `mma.sync` operations. There are a number of possible data type, shape, and other attribute combinations supported by the operation, so a custom assembly format is added and attributes are inferred where possible. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D122410	2022-03-25 17:28:05 +00:00
Jean Perier	5bc9ee1b78	[flang][lowering] Handle zero extent case in LBOUND Follow up of https://reviews.llvm.org/D121488. Ensure lower bounds are `1` when the related dimension extent is zero. Note that lower bounds from descriptors are now guaranteed to fulfill this property after the runtime/codegen patches. Also fixes explicit shape array extent lowering when instantiating variables to deal with negative extent cases (issue found while testing LBOUND edge case). This notably caused allocation crashes when dealing with automatic arrays with reversed bounds or negative size specification expression. The standard specifies that the extent of such arrays is zero. This change has some ripple effect in the current lit tests. Add and move two helpers as part of this change: - Add a helper to tell if a fir::ExtendedValue describes an assumed size array (last dimension extent is unknown to the compiler, both at compile time and runtime). - Move and share getIntIfConstant from Character.cpp so that it can be used elsewhere (NFC). Differential Revision: https://reviews.llvm.org/D122467	2022-03-25 18:05:54 +01:00
Philip Reames	ec858f0201	[SLP] Optimize stacksave dependence handling [NFC] After writing the commit message for 4b1bace28, realized that the mentioned optimization was rather straight forward. We already have the code for scanning a block during region initialization, we can simply keep track if we've seen a stacksave or stackrestore. If we haven't, none of these dependencies are relevant and we can avoid the relatively expensive scans entirely.	2022-03-25 10:04:10 -07:00
Philip Reames	a16308c282	[SLP] Explicit track required stacksave/alloca dependency (try 3) This is an extension of commit b7806c to handle one last case noticed in test changes for D118538. Again, this is thought to be a latent bug in the existing code, though this time I have not managed to reduce tests for the original algorithm. The prior attempt had failed to account for this case: %a = alloca i8 stacksave stackrestore store i8 0, i8* %a If we allow '%a' to reorder into the stacksave/restore region, then the alloca will be deallocated before the use. We will have taken a well defined program, and introduced a use-after-free bug. There's also an inverse case where the alloca originally follows the stackrestore, and we need to prevent the reordering it above the restore. Compile time wise, we potentially do an extra scan of the block for each alloca seen in a bundle. This is significantly more expensive than the stacksave rooted version and is why I'd tried to avoid this in the initial patch. There is room to optimize this (by essentially caching a "has stacksave" bit per block), but I'm leaving that to future work if it actually shows up in practice. Since allocas in bundles should be rare in practice, I suspect we can defer the complexity for a long while.	2022-03-25 10:04:10 -07:00
Gulfem Savrun Yeniceri	c7f91e227a	[InstrProfiling] No runtime hook for unused funcs CoverageMappingModuleGen generates a coverage mapping record even for unused functions with internal linkage, e.g. static int foo() { return 100; } Clang frontend eliminates such functions, but InstrProfiling pass still pulls in profile runtime since there is a coverage record. Fuchsia uses runtime counter relocation, and pulling in profile runtime for unused functions causes a linker error: undefined hidden symbol: __llvm_profile_counter_bias. Since `389dc94d4b`, we do not hook profile runtime for the binaries that none of its translation units have been instrumented in Fuchsia. This patch extends that for the instrumented binaries that consist of only unused functions. Differential Revision: https://reviews.llvm.org/D122336	2022-03-25 17:03:03 +00:00
Argyrios Kyrtzidis	b867f5c7b8	[Support/BLAKE3] Do manual instrumentation of `llvm_blake3_hasher_finalize` for memory sanitizer This is to avoid false positives when using the uninstrumented assembly code implementation.	2022-03-25 09:59:27 -07:00
Florian Hahn	e47d220230	[LV] Use getVectorLoopRegion to retrieve header. (NFC) Update all places that currently assume the entry block to the plan is also the vector loop header to use getVectorLoopRegion instead. getVectorLoopRegion will keep doing the right thing when the pre-header is modeled explicitly (and becomes the new entry block in the plan).	2022-03-25 16:57:12 +00:00
Jonas Devlieghere	8f7db763ef	[lldb] Conditionalize target_link_libraries on the target Fixes "Cannot specify link libraries for target "lldb-target-fuzzer" which is not built by this project." Normally that's taken care of by add_llvm_fuzzer but we need target_link_libraries for liblldb and our utility library.	2022-03-25 09:50:34 -07:00
lipracer	5161835d5a	[mlir][tosa] : adding folder and canonicalizer for select define canonicalizer and folder for tosa::select Reviewed By: mehdi_amini, Mogball Differential Revision: https://reviews.llvm.org/D121513	2022-03-25 16:50:29 +00:00
Peter Klausler	174cabeda5	[flang] Fix cycle-catcher in procedure characterization The "seenProcs" sets passed as arguments to the procedure and dummy procedure characterization routines need to be passed by value so that local updates to those sets do not become permanent. They are presently passed by reference and that has led to bogus errors about recursively defined procedures in testing. (It might be faster to pass the sets by reference and undo those local updates in these functions, but that's error-prone, and the performance difference is not expected to be detectable in practice.) Differential Revision: https://reviews.llvm.org/D122439	2022-03-25 09:49:55 -07:00
Yitzhak Mandelbaum	a184a0d8aa	[clang][dataflow] Add support for disabling warnings on smart pointers. This patch provides the user with the ability to disable all checking of accesses to optionals that are the pointees of smart pointers. Since smart pointers are not modeled (yet), the system cannot distinguish safe from unsafe accesses to optionals through smart pointers. This results in false positives whenever optionals are used through smart pointers. The patch gives the user the choice of ignoring all positives in these cases. Differential Revision: https://reviews.llvm.org/D122143	2022-03-25 16:44:34 +00:00
Johannes Doerfert	4e34f061d6	[OpenMP][FIX] Ensure exclusive access to the HDTT map This patch solves two problems with the `HostDataToTargetMap` (HDTT map) which caused races and crashes before: 1) Any access to the HDTT map needs to be exclusive access. This was not the case for the "dump table" traversals that could collide with updates by other threads. The new `Accessor` and `ProtectedObject` wrappers will ensure we have a hard time introducing similar races in the future. Note that we could allow multiple concurrent read-accesses but that feature can be added to the `Accessor` API later. 2) The elements of the HDTT map were `HostDataToTargetTy` objects which meant that they could be copied/moved/deleted as the map was changed. However, we sometimes kept pointers to these elements around after we gave up the map lock which caused potential races again. The new indirection through `HostDataToTargetMapKeyTy` will allow us to modify the map while keeping the (interesting part of the) entries valid. To offset potential cost we duplicate the ordering key of the entry which avoids an additional indirect lookup. We should replace more objects with "protected objects" as we go. Differential Revision: https://reviews.llvm.org/D121057	2022-03-25 11:38:54 -05:00
Jonas Devlieghere	61efe14e21	[lldb] Add a fuzzer for target creation This patch adds a generic fuzzer that interprets inputs as object files and uses them to create a target in lldb. It is very similar to the llvm-dwarfdump fuzzer which found a bunch of issues in libObject. Differential revision: https://reviews.llvm.org/D122461	2022-03-25 09:34:00 -07:00
Tue Ly	6168b42225	[libc] Improve the performance of expf. Reduce the polynomial's degree from 7 down to 4. Currently we use a degree-7 minimax polynomial on an interval of length 2^-7 around 0 to compute `expf`. Based on the suggestion of @santoshn and the RLIBM project (https://github.com/rutgers-apl/rlibm-all/blob/main/source/float/exp.c) and the improvement we made with `exp2f` in https://reviews.llvm.org/D122346, it is possible to have a good polynomial of degree-4 on a subinterval of length 2^(-7) to approximate e^x. We did try to either reduce the degree of the polynomial down to 3 or increase the interval size to 2^(-6), but in both cases the number of exceptional values exploded. So we settle with using a degree-4 polynomial of the interval of size 2^(-7) around 0. Reviewed By: sivachandra, zimmermann6, santoshn Differential Revision: https://reviews.llvm.org/D122418	2022-03-25 12:20:20 -04:00
Dávid Bolvanský	39d348c602	[NFCI] Fix set-but-unused warning in DenseMap.h in some configurations	2022-03-25 17:12:53 +01:00
Dávid Bolvanský	d6c6eb3db5	[NFCI] Fix set-but-unused warning in AArch64AsmParser.cpp	2022-03-25 17:10:37 +01:00
Hongtao Yu	7a316c0a1f	[CSSPGO] Turn on profi and ext-tsp when using probe-based profile. Probe-based profile leads to a better performance when combined with profi and ext-tsp block layout. I'm turning them on by default. Reviewed By: wenlei Differential Revision: https://reviews.llvm.org/D122442	2022-03-25 09:09:21 -07:00
Philip Reames	d9756fa723	[slp] Factor out a lambda to avoid duplicating code a third time in upcoming patch [nfc]	2022-03-25 09:02:39 -07:00
Ben Shi	49b0b5f0fa	[AVR][NFC] Fix incorrect register states in expanding pseudo instructions Reviewed By: aykevl Differential Revision: https://reviews.llvm.org/D118354	2022-03-25 16:02:15 +00:00
LLVM GN Syncbot	bb48c3a9e7	[gn build] Port `39b80c8380`	2022-03-25 15:50:53 +00:00
Philip Reames	a121458edc	[test,slp] Add another stacksave related dependence test	2022-03-25 08:48:17 -07:00
Richard	39b80c8380	[clang-tidy] Add modernize-macro-to-enum check This check performs basic analysis of macros and replaces them with an anonymous unscoped enum. Using an unscoped anonymous enum ensures that everywhere the macro token was used previously, the enumerator name may be safely used. Potential macros for replacement must meet the following constraints: - Macros must expand only to integral literal tokens. The unary operators plus, minus and tilde are recognized to allow for positive, negative and bitwise negated integers. - Macros must be defined on sequential source file lines, or with only comment lines in between macro definitions. - Macros must all be defined in the same source file. - Macros must not be defined within a conditional compilation block. - Macros must not be defined adjacent to other preprocessor directives. - Macros must not be used in preprocessor conditions Each cluster of macros meeting the above constraints is presumed to be a set of values suitable for replacement by an anonymous enum. From there, a developer can give the anonymous enum a name and continue refactoring to a scoped enum if desired. Comments on the same line as a macro definition or between subsequent macro definitions are preserved in the output. No formatting is assumed in the provided replacements. The check cppcoreguidelines-macro-to-enum is an alias for this check. Fixes #27408 Differential Revision: https://reviews.llvm.org/D117522	2022-03-25 09:45:55 -06:00
Simon Pilgrim	6a094a6264	[InstCombine] SimplifyDemandedUseBits - remove ashr node if we only demand known sign bits We already do this for SelectionDAG, but we're missing it here. Noticed while re-triaging PR21929 Differential Revision: https://reviews.llvm.org/D122340	2022-03-25 15:39:08 +00:00
Joseph Huber	b9f67d44ba	[OpenMP] Replace device kernel linkage with weak_odr Currently the device kernels all have weak linkage to prevent linkage errors on multiple definitions. However, this prevents some optimizations from adequately analyzing them because of the nature of weak linkage. This patch replaces the weak linkage with weak_odr linkage so we can statically assert that multiple declarations of the same kernel will have the same definition. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D122443	2022-03-25 11:29:15 -04:00
Dmitry Preobrazhensky	5975f1c5f9	[AMDGPU][DOC][NFC] Added GFX1030 assembler syntax description	2022-03-25 18:14:04 +03:00
Iain Sandoe	cf396c56e7	[C++20][Modules] Correct an assert for modules-ts. When adding the support for modules partitions we added an assert that the actual status of Global Module Fragments matches the state machine that is driven by the module; keyword. That does not apply to the modules-ts case, where there is an implicit GMF. Differential Revision: https://reviews.llvm.org/D122394	2022-03-25 14:55:13 +00:00
Adam Czachorowski	7e45912618	[clang] Do not crash on arrow operator on dependent type. There seems to be more than one way to get to that state. I included two example cases in the test, both were noticed recently. There is room for improvement, for example by creating RecoveryExpr in place of the bad initializer, but for now let's stop the crashes. Differential Revision: https://reviews.llvm.org/D121824	2022-03-25 15:48:08 +01:00

419233 Commits
