llvm-project

Commit Graph

Author	SHA1	Message	Date
Iain Sandoe	b826567136	[C++20][Modules] Add a testcase for [basic.link] p10 [NFC]. This adds a testcase based on example 2 from the basic.link section of the standard.	2022-07-25 12:20:02 +01:00
Sam McCall	b2b993a6ae	[pseudo] Eliminate multiple-specified-types ambiguities using guards Motivating case: `foo bar;` is not a declaration of nothing with `foo` and `bar` both types. This is a common and critical ambiguity, clangd/AST.cpp has 20% fewer ambiguous nodes (1674->1332) after this change. Differential Revision: https://reviews.llvm.org/D130337	2022-07-25 12:57:07 +02:00
Matthias Springer	1defec8730	[mlir][tensor][bufferize][NFC] Remove duplicate code InsertSliceOp and ParallelInsertSliceOp are very similar and can share some of the bufferization analysis code. Differential Revision: https://reviews.llvm.org/D130465	2022-07-25 12:34:16 +02:00
Saiyedul Islam	8cbf4a386b	Revert "[Libomptarget] Add checks for AMDGPU TargetID using new image info" This reverts commit `471f2abc62`.	2022-07-25 05:32:59 -05:00
Matthias Springer	664ffa46bb	[mlir][tensor][bufferize] Fix deallocation of GenerateOp/FromElementsOp Both ops allocate a buffer. There were cases in which the buffer was not deallocated. Differential Revision: https://reviews.llvm.org/D130469	2022-07-25 12:25:06 +02:00
Alex Zinenko	333ee218ce	[mlir] Transform dialect: separate dependent and generated dialects In the Transform dialect extensions, provide the separate mechanism to declare dependent dialects (the dialects the transform IR depends on) and the generated dialects (the dialects the payload IR may be transformed into). This allows the Transform dialect clients that are only constructing the transform IR to avoid loading the dialects relevant for the payload IR along with the Transform dialect itself, thus decreasing the build/link time. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D130289	2022-07-25 09:59:53 +00:00
Chuanqi Xu	d35134485a	[C++20] [Modules] Make the linkage consistent for class template and its specialization Previously in D120397, we've handled the linkage for function template and its specialization. But we forgot to handle it for class templates and their specialization. So we make it in the patch with the similar approach.	2022-07-25 17:57:02 +08:00
David Stuttard	b14d7bf750	AMDGPU: Turn off force init 16 input SGPRS for pal Pal uses a different mechanism for user sgprs. Differential Revision: https://reviews.llvm.org/D129566	2022-07-25 10:52:46 +01:00
Saiyedul Islam	471f2abc62	[Libomptarget] Add checks for AMDGPU TargetID using new image info This patch extends the is_valid_binary routine to also check if the binary's target ID matches the one parsed from the system's runtime environment. This should allow us to only use the binary whose compute capability matches, allowing us to support basic multi-architecture binaries for AMDGPU. It also handles compatibility testing of target IDs of the image and the environment. Depends on D127432 Differential Revision: https://reviews.llvm.org/D127769	2022-07-25 04:44:36 -05:00
Matthias Springer	5f5f71e737	[mlir][tensor][bufferize] Load dependent dialects Load dialects that will be generated by the extension. (Except for BufferizationDialect and MemrefDialect which are loaded already.) Differential Revision: https://reviews.llvm.org/D130463	2022-07-25 11:36:10 +02:00
jacquesguan	d8800ead62	[RISCV] Scalarize binop followed by extractelement. This patch adds shouldScalarizeBinop to RISCV target in order to convert an extract element of a vector binary operation into an extract element followed by a scalar binary operation. Differential Revision: https://reviews.llvm.org/D129545	2022-07-25 17:23:31 +08:00
Nikita Popov	aaeb951b27	[DebugInfo] Avoid ptrtoint expression in generic test After D130366 the ptrtoint expression this used results in an error on 32-bit targets (the IR was already invalid previously, but only produced the error with -filetype=obj). Tweak the IR so it is correct for both 32-bit and 64-bit targets, and can thus continue working as a generic test.	2022-07-25 11:20:36 +02:00
Rajas Vanjape	d146b51162	[mlir][sparse][nfc] Fix memory leak in sparse sampled matmul integration test https://reviews.llvm.org/D130023 added a memory leak in sparse_sampled_matmul.mlir This diff fixes the memory leak. Testing: Ran integration tests after building with -DLLVM_USE_SANITIZER=Address flag. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D130428	2022-07-25 08:55:12 +00:00
Adrian Kuegel	07628a94e8	[mlir] Apply ClangTidyPerformance findings (NFC)	2022-07-25 10:50:04 +02:00
Sebastian Neubauer	efe1527e28	[CMake] Copy folder without permissions Copying the folder keeps the original permissions by default. This creates problems when the source folder is read-only, e.g. in a packaging environment. Then, the copied folder in the build directory is read-only as well. Later on, other files are copied into that directory (in the build tree), failing when the directory is read-only. Fix that problem by copying the folder without keeping the original permissions. Follow-up to D130254. Differential Revision: https://reviews.llvm.org/D130338	2022-07-25 10:47:04 +02:00
Dmitry Vyukov	7ec308715c	tsan: prevent pathological slowdown for spurious races Prevent the following pathological behavior: Since memory access handling is not synchronized with DoReset, a thread running concurrently with DoReset can leave a bogus shadow value that will be later falsely detected as a race. For such false races RestoreStack will return false and we will not report it. However, consider that a thread leaves a whole lot of such bogus values and these values are later read by a whole lot of threads. This will cause massive amounts of ReportRace calls and lots of serialization. In very pathological cases the resulting slowdown can be >100x. This is very unlikely, but it was presumably observed in practice: https://github.com/google/sanitizers/issues/1552 If this happens, previous access sid+epoch will be the same for all of these false races b/c if the thread will try to increment epoch, it will notice that DoReset has happened and will stop producing bogus shadow values. So, last_spurious_race is used to remember the last sid+epoch for which RestoreStack returned false. Then it is used to filter out races with the same sid+epoch very early and quickly. It is of course possible that multiple threads left multiple bogus shadow values and all of them are read by lots of threads at the same time. In such case last_spurious_race will only be able to deduplicate a few races from one thread, then few from another and so on. An alternative would be to hold an array of such sid+epoch, but we consider such scenario as even less likely. Note: this can lead to some rare false negatives as well: 1. When a legit access with the same sid+epoch participates in a race as the "previous" memory access, it will be wrongly filtered out. 2. When RestoreStack returns false for a legit memory access because it was already evicted from the thread trace, we will still remember it in last_spurious_race. Then if there is another racing memory access from the same thread that happened in the same epoch, but was stored in the next thread trace part (which is still preserved in the thread trace), we will also wrongly filter it out while RestoreStack would actually succeed for that second memory access. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D130269	2022-07-25 10:40:11 +02:00
Dmitry Vyukov	7505cc301f	tsan: remove tracking of racy addresses We used to deduplicate based on the race address to prevent lots of repeated reports about the same race. But now we clear the shadow for the racy address in DoReportRace: // This prevents trapping on this address in future. for (uptr i = 0; i < kShadowCnt; i++) StoreShadow(&shadow_mem[i], i == 0 ? Shadow::kRodata : Shadow::kEmpty); It should have the same effect of not reporting duplicates (and actually better because it's automatically reset when the memory is reallocated). So drop the address deduplication code. Both simpler and faster. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D130240	2022-07-25 10:33:26 +02:00
Balázs Kéri	acd80a29ae	[clang][ASTImporter] Improved handling of functions with auto return type. Avoid a crash if a function is imported that has auto return type that references to a template with an expression-type of argument that references into the function's body. Fixes issue #56047 Reviewed By: martong Differential Revision: https://reviews.llvm.org/D129640	2022-07-25 10:28:01 +02:00
jacquesguan	9e241c70f7	[mlir][Math] Add constant folder for ExpOp. This patch adds constant folder for ExpOp which only supports single and double precision floating-point. Differential Revision: https://reviews.llvm.org/D130318	2022-07-25 16:26:48 +08:00
Fangrui Song	b89407153d	[ELF] --reproduce: support --export-dynamic-symbol-list	2022-07-25 01:20:32 -07:00
David Spickett	3a35bcef22	[llvm][FileCheck] Fix unit tests failures with EXPENSIVE_CHECKS EXPENSIVE_CHECKS enables _GLIBCXX_DEBUG, which makes std::sort check that the compare function is implemented correctly. To do this it calls it with the first item as both sides. Which trips the assert here because we think they're 2 capture ranges that overlap, when it's just the same range twice. Check up front for the two sides being the same item (same address, not just ==). Reviewed By: kazu Differential Revision: https://reviews.llvm.org/D130282	2022-07-25 08:19:28 +00:00
Nikita Popov	fb7caa3c7b	[AsmPrinter] Reject ptrtoint to larger size in lowerConstant() When using a ptrtoint to a size larger than the pointer width in a global initializer, we currently create a ptr & low_bit_mask style MCExpr, which will later result in a relocation error during object file emission. This patch rejects the constant expression already during lowerConstant(), which results in a much clearer error message that references the constant expression at fault. This fixes https://github.com/llvm/llvm-project/issues/56400, for certain definitions of "fix". Differential Revision: https://reviews.llvm.org/D130366	2022-07-25 10:18:27 +02:00
Fangrui Song	cbcdb5248d	[ELF] Simplify --build-id/--color-diagnostics with AliasArgs. NFC	2022-07-25 01:14:53 -07:00
Rosie Sumpter	034a27e688	[AArch64] Add f16 fpimm patterns This patch recognizes f16 immediates as legal and adds the necessary patterns. This allows the fadda folding introduced in `05d424d165` to be applied to the f16 cases. Differential Revision: https://reviews.llvm.org/D129989	2022-07-25 09:08:10 +01:00
Sam McCall	661e0b63f7	[pseudo] Fix minor errors in module grammar	2022-07-25 10:04:56 +02:00
Peter Waller	f8919d2f7e	[NFC][GVN] Put phi-translation of 'add' behind a switch The code in this `#if 0` block appears to be a net benefit. Put it behind a switch defaulting to off to support experimentation and as a request for comment. The codegen impact of enabling this that I'm currently pursuing is that it allows PRE to take place more frequently, particularly in loops with second order recurrences. Preliminary experimental data: Across LNT on AArch64, 54 benchmarks are sped up by >1%, and 42 are regressed by >1%, the geomean (exec_time_enabled / exec_time_disabled) of these 96 "1% or greater significance" benchmarks is 0.991. For the full set of 770 benchmarks it's 0.998. There are two benchmarks which experience a >30% speedup, and the worst slowdown is ~12%, and for every benchmark with a slowdown there is a benchmark which is sped up by a greater factor. Differential Revision: https://reviews.llvm.org/D130241	2022-07-25 07:59:47 +00:00
Kazu Hirata	fac0fb4d96	[flang] Use X->foo() instead of X.getValue().foo() (NFC) Flang C++ Style Guide tells us to use X when the reference is protected by a presence test. However, (X).foo() is a little harder to read, especially when X is a complicated expression. This patch slightly deviates from the guide (but retains the spirit) by using X->foo() instead. Differential Revision: https://reviews.llvm.org/D130413	2022-07-25 00:55:06 -07:00
Kazu Hirata	a010d32abb	[flang] Use value instead of getValue (NFC) Flang C++ Style Guide tells us to use x.value() when no presence test is obviously protecting the reference. Since a failure in EXPECT_TRUE doesn't terminate a given test, I don't count it as "protection" here. Differential Revision: https://reviews.llvm.org/D130410	2022-07-25 00:55:05 -07:00
Nikita Popov	b66ca91fe6	[Docs] Update GEP docs for opaque pointers Update the GEP FAQ to use opaque pointers. This requires more than a syntactic change in some place, because some of the concerns just don't make sense anymore (trying to index past a ptr member in a struct for example). This also fixes uses of incorrect syntax to declare or reference globals. Differential Revision: https://reviews.llvm.org/D130353	2022-07-25 09:52:14 +02:00
Nikita Popov	7ac7ec8202	[LangRef] Update for opaque pointers (NFC) Update LangRef examples to use opaque pointers in most places. I've retained typed pointers in a few cases where opaque pointers don't make much sense, e.g. pointer to pointer bitcasts. Differential Revision: https://reviews.llvm.org/D130356	2022-07-25 09:45:49 +02:00
Fangrui Song	ef03f6623c	[llvm-objcopy] Simplify --compress-debug-sections handling with AliasArgs. NFC	2022-07-25 00:31:00 -07:00
Balázs Kéri	94ca2beccc	[clang][analyzer] Added partial wide character support to CStringChecker Support for functions wmemcpy, wcslen, wcsnlen is added to the checker. Documentation and tests are updated and extended with the new functions. Reviewed By: martong Differential Revision: https://reviews.llvm.org/D130091	2022-07-25 09:23:14 +02:00
Cullen Rhodes	836f790bb1	[AArch64][SVE] Add patterns to select masked add/sub instructions When lowering add(a, select(mask, b, splat(0))) the sel instruction can be removed by using predicated add/sub instructions. Reviewed By: paulwalker-arm Differential Revision: https://reviews.llvm.org/D129751	2022-07-25 07:22:05 +00:00
Cullen Rhodes	c4f356e519	[AArch64][SVE] NFC: Add tests for masked add/sub patterns (D129751)	2022-07-25 07:22:04 +00:00
Fangrui Song	91e2cd4fa9	[llvm-objcopy] Remove getDecompressedSizeAndAlignment. NFC	2022-07-25 00:06:36 -07:00
Max Kazantsev	a053f35990	[SCEV][NFC][CT] Cheaper handling of guards in isBasicBlockEntryGuardedByCond Handle guards uniformly with assumes, rather than iterating through all block instructions in attempt to find them. Differential Revision: https://reviews.llvm.org/D129874 Reviewed By: nikic	2022-07-25 13:38:59 +07:00
Kazu Hirata	9d5a544d34	[Hexagon] Remove isLateInstrFeedsEarlyInstr (NFC) The last use was removed on May 3, 2017 in commit `2af5037d34`. This patch also removes isLateResultInstr and isEarlySourceInstr as they become dead once we remove isLateInstrFeedsEarlyInstr.	2022-07-24 22:55:14 -07:00
Kazu Hirata	95a932fb15	Remove redundaunt override specifiers (NFC) Identified with modernize-use-override.	2022-07-24 22:28:11 -07:00
Fangrui Song	7181c4e10a	[llvm-objcopy] --compress-debug-sections: fix uninitialized ch_reserved for Elf64_Chdr ch_reserved is uninitialized and the output is not deterministic. Fix it. Rewrite and improve compress-debug-sections-zlib.test.	2022-07-24 22:19:00 -07:00
Kazu Hirata	a210f404da	[clang] Remove redundant virtual specifies (NFC) Identified with modernize-use-override.	2022-07-24 22:02:58 -07:00
Kazu Hirata	b5188591a0	[llvm] Remove redundaunt virtual specifiers (NFC) Identified with modernize-use-override.	2022-07-24 21:50:35 -07:00
Kazu Hirata	3650615fb2	[clang] Remove unused forward declarations (NFC)	2022-07-24 20:51:06 -07:00
inclyc	edaae251cc	[clang] better error message for while loops outside of control flow report an error when encountering 'while' token parsing declarator ``` clang/test/Parser/while-loop-outside-function.c:3:1: error: while loop outside of a function while // expected-error {{while loop outside of a function}} ^ clang/test/Parser/while-loop-outside-function.c:7:1: error: while loop outside of a function while // expected-error {{while loop outside of a function}} ^ ``` Fixes: https://github.com/llvm/llvm-project/issues/34462 Differential Revision: https://reviews.llvm.org/D129573	2022-07-25 11:48:24 +08:00
Marius Hillenbrand	1ef32e7828	[mlir][Arithmetic] Fix printing larger integer attributes in arith.const For arith.constant operations of integer type, the operation generates result names that include the value of the constant (i.e., the IntegerAttr that defines the constant's value). That code currently assumes integer widths of 64 bits or less and hits an assert with wider constants or would create truncated and potentially ambiguous names when built with assertions disabled. To enable printing arith.constant ops for arbitrarily wide integer types, change to use the IntegerAttr's function getValue() when generating result names. Also, add a regression test. Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D129930	2022-07-25 08:37:51 +05:30
Fangrui Song	73c84f9c13	[llvm-objcopy] Remove remnant .zdebug code	2022-07-24 18:52:15 -07:00
Warren Ristow	3089b411a4	[Reassociate][NFC] Consistent checking for FastMathFlags suitability In D129523, it was noted that the approach to check whether a value can have FastMathFlags was done in different ways, and they should be made consistent. This patch makes minor changes to fix that. Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D130408	2022-07-24 17:44:30 -07:00
Kazu Hirata	acf648b5e9	Use llvm::less_first and llvm::less_second (NFC)	2022-07-24 16:21:29 -07:00
Amaury Séchet	5e29360743	[NFC] Add parentheses in MathExtra.h The code used to cause a warning: llvm/include/llvm/Support/MathExtras.h:751:39: warning: suggest parentheses around ‘-’ in operand of ‘&’ [-Wparentheses] 751 \| assert(Align != 0 && (Align & Align - 1) == 0 && \|	2022-07-24 22:04:09 +00:00
Kazu Hirata	9e88cbcc40	Use any_of (NFC)	2022-07-24 14:48:11 -07:00
Kazu Hirata	bafeb63448	[Hexagon] Remove unused declaration CanReturnSmallStruct (NFC) The declaration was introduced without a corresponding definition on Dec 12, 2011 in commit `1213a7a57f`.	2022-07-24 14:48:09 -07:00
