For instance, some recent clang emits this code on x86_64:
```
    0x100002b99 <+57>: callq  0x100002b40    ; step_out_of_here at main.cpp:11
->  0x100002b9e <+62>: xorl   %eax, %eax
    0x100002ba0 <+64>: popq   %rbp
    0x100002ba1 <+65>: retq
```
and the "xorl %eax, %eax" is attributed to the same line as the callq. Since
step out is supposed to stop just on returning from the function, you can't guarantee
it will end up on the next line. I changed the test to check that we were either
on the call line or on the next line, since either would be right depending on the
debug information.
These experimental builtin functions and the feature macro they were gated
behind have been removed.
Reviewed By: aheejin
Differential Revision: https://reviews.llvm.org/D98907
For Zvlsseg, we create several tuple register classes. When spilling or
reloading one of these tuple register classes, we need to iterate NF times to
store/load the registers that make up the tuple.
Differential Revision: https://reviews.llvm.org/D98629
On ELF, we place the metadata sections (`__sancov_guards`, `__sancov_cntrs`,
`__sancov_bools`, `__sancov_pcs`) in section groups (either `comdat any` or
`comdat noduplicates`).
With `--gc-sections`, LLD since D96753 and GNU ld with `-z start-stop-gc` may
garbage collect such sections. If every section of one kind is discarded (say,
all `__sancov_guards` input sections), LLD reports
`error: undefined hidden symbol: __start___sancov_guards` (the other sections
behave similarly).
```
% cat a.c
void discarded() {}
% clang -fsanitize-coverage=func,trace-pc-guard -fpic -fvisibility=hidden a.c -shared -fuse-ld=lld -Wl,--gc-sections
...
ld.lld: error: undefined hidden symbol: __start___sancov_guards
>>> referenced by a.c
>>> /tmp/a-456662.o:(sancov.module_ctor_trace_pc_guard)
```
Use the `extern_weak` linkage (lowered to undefined weak symbols) to avoid the
undefined error.
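A minimal sketch of the idea (illustrative only, not the exact SanitizerCoverage code): declare the `__start_`/`__stop_` section-bound symbols with `extern_weak` linkage so that a fully discarded section yields a null weak reference instead of a hard link error.
```cpp
#include "llvm/IR/GlobalVariable.h"
#include "llvm/IR/Module.h"

using namespace llvm;

// Declare a section bound symbol such as "__start___sancov_guards" as an
// extern_weak, hidden global. If the linker discards every section in the
// group, the reference resolves to null instead of producing
// "undefined hidden symbol".
static GlobalVariable *declareSectionBound(Module &M, Type *Ty,
                                           StringRef Name) {
  auto *GV = new GlobalVariable(M, Ty, /*isConstant=*/false,
                                GlobalValue::ExternalWeakLinkage,
                                /*Initializer=*/nullptr, Name);
  GV->setVisibility(GlobalValue::HiddenVisibility);
  return GV;
}
```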
Differential Revision: https://reviews.llvm.org/D98903
We returned the input chain instead of the output chain from the new load,
which bypasses the load in the chain. I haven't found a good way to test this
yet; the IR ordering has so far defeated my attempts at provoking a reordering.
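For readers less familiar with SelectionDAG chains, here is a hypothetical sketch of the pattern involved (the helper name and context are invented, not taken from this patch): the chain to propagate is the new load's own chain result, not the chain that was fed into it.
```cpp
#include <utility>

#include "llvm/CodeGen/SelectionDAG.h"

using namespace llvm;

// Hypothetical helper: create a replacement load and return both the loaded
// value and the chain that callers should continue from. Returning InChain
// here instead of NewLoad.getValue(1) would leave the new load dangling off
// the chain, allowing it to be reordered past later chained operations.
static std::pair<SDValue, SDValue>
createReplacementLoad(SelectionDAG &DAG, const SDLoc &DL, EVT VT,
                      SDValue InChain, SDValue Ptr) {
  SDValue NewLoad = DAG.getLoad(VT, DL, InChain, Ptr, MachinePointerInfo());
  return {NewLoad, NewLoad.getValue(1)}; // value, output chain
}
```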
This only adds support to the DFSan instrumentation pass, not to the runtime.
More RUN lines were added for testing: for each instrumentation test that had a
`-dfsan-fast-16-labels` invocation, a new invocation using fast8 labels was
added.
Reviewed By: stephan.yichao.zhao
Differential Revision: https://reviews.llvm.org/D98734
This adds a tosa.apply_scale operation that handles the scaling operation
common to quantized operations. This scalar operation is lowered
in TosaToStandard.
We use a separate ApplyScale factorization as this is a replicable pattern
within TOSA. ApplyScale can be reused within pool/convolution/mul/matmul
for their quantized variants.
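For reference, the scaling that apply_scale performs is roughly the following fixed-point multiply-and-round, paraphrased from the TOSA specification's apply_scale_32 pseudocode (consult the spec for the authoritative definition, including its range requirements):
```cpp
#include <cstdint>

// Rough sketch of tosa.apply_scale for 32-bit values: multiply in 64 bits,
// add a rounding term, then arithmetic-shift right by `shift`.
int32_t apply_scale_32(int32_t value, int32_t multiplier, int32_t shift,
                       bool double_round = false) {
  int64_t round = INT64_C(1) << (shift - 1);
  if (double_round && shift > 31)
    round += (value >= 0) ? (INT64_C(1) << 30) : -(INT64_C(1) << 30);
  int64_t result = static_cast<int64_t>(value) * multiplier + round;
  return static_cast<int32_t>(result >> shift);
}
```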
Tests are added to both tosa-to-standard and tosa-to-linalg-on-tensors
that verify each pass is correct.
Reviewed By: silvas
Differential Revision: https://reviews.llvm.org/D98753
This reverts commit 962b73dd0f.
This commit was reverted because of some internal SPEC test failures.
It turns out that this wasn't actually relevant to anything in open source, so
it's safe to recommit this.
Includes lowering for tosa.concat, with index computation using subtensor
insert operations. Includes tests concatenating along two different axes.
Differential Revision: https://reviews.llvm.org/D98813
This is the alternative approach to D96931.
In LTO, for each module with an inline asm block, prepend the directive
`.lto_discard <sym>, <sym>*` to the beginning of the inline asm.
`.lto_discard` is both a marker for module inline asm blocks and (optionally)
provides a list of symbols to be discarded.
In MC, while emitting the inline asm, discard symbol bindings and symbol
definitions according to `.lto_discard`.
Reviewed By: MaskRay
Differential Revision: https://reviews.llvm.org/D98762
It is reported that, after enabling the hidden helper thread, the program can
sometimes hit the assertion `new_gtid < __kmp_threads_capacity`. The root cause
is as follows. Let's say the default `__kmp_threads_capacity` is `N`. If the
hidden helper thread is enabled, `__kmp_threads_capacity` is offset to `N+8` by
default. If the number of threads we need exceeds `N+8`, e.g. via the
`num_threads` clause, we need to expand `__kmp_threads`. In
`__kmp_expand_threads`, the expansion starts from `__kmp_threads_capacity` and
repeatedly doubles it until the new capacity meets the requirement. Let's
assume the new requirement is `Y`. If `Y` happens to satisfy `(N+8)*2^X=Y`,
where `X` is the number of doublings, the new capacity is exactly `Y`; since 8
of those slots are reserved for hidden helper threads, it is still not enough.
Here is an example.
```
#include <vector>

int main(int argc, char *argv[]) {
  constexpr const size_t N = 1344;
  std::vector<int> data(N);
#pragma omp parallel for
  for (unsigned i = 0; i < N; ++i) {
    data[i] = i;
  }
#pragma omp parallel for num_threads(N)
  for (unsigned i = 0; i < N; ++i) {
    data[i] += i;
  }
  return 0;
}
```
My CPU is 20C40T, so `__kmp_threads_capacity` is 160. After the offset,
`__kmp_threads_capacity` becomes 168. Since `1344 = (160+8)*2^3`, the assertion
is hit.
Reviewed By: protze.joachim
Differential Revision: https://reviews.llvm.org/D98838
Suppresses an implicit TypeSize-to-uint64_t conversion warning. We might be
able to avoid the offset entirely since we're writing to a fixed stack object,
but I wasn't sure, so I just did what DAGTypeLegalizer::IncrementPointer does.
Reviewed By: sdesmalen
Differential Revision: https://reviews.llvm.org/D98736
In the existing OrcLazy mode, modules go through partitioning and outgoing calls are replaced by reexport stubs that resolve on call-through. In the greedy mode that this patch unlocks for lli, modules materialize as a whole and trigger materialization for all required symbols recursively. This is useful for testing (e.g. D98785) and it's more similar to the way MCJIT works.
This reverts commit 32a744ab20.
CI is broken:
```
test/Dialect/Linalg/bufferize.mlir:274:12: error: CHECK: expected string not found in input
// CHECK: %[[MEMREF:.*]] = tensor_to_memref %[[IN]] : memref<?xf32>
^
```
`-mbranch-protection` protects the LR on the stack with a PAC. When the frames
are walked, the LR needs to be cleared of the PAC. This inline assembly will
later be replaced with a new builtin.
Test: build with `-DCMAKE_C_FLAGS="-mbranch-protection=standard"`.
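Something along these lines (a sketch assuming AArch64, not necessarily the exact code in this patch) is what the inline assembly does until the builtin exists: move the signed return address into LR, strip the PAC with `xpaclri` (encoded as `hint #7`, a NOP on cores without pointer authentication), and read the cleared value back.
```cpp
#include <cstdint>

// Sketch: strip the pointer authentication code from a saved LR value so the
// frame walker sees a plain return address.
static inline uintptr_t StripPACFromLR(uintptr_t lr) {
#if defined(__aarch64__)
  uintptr_t stripped;
  asm volatile(
      "mov x30, %1\n\t"
      "hint #7\n\t" // xpaclri: clears the PAC bits in x30 (LR)
      "mov %0, x30\n\t"
      : "=r"(stripped)
      : "r"(lr)
      : "x30");
  return stripped;
#else
  return lr;
#endif
}
```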
Reviewed By: kubamracek
Differential Revision: https://reviews.llvm.org/D98008
`BufferizeAnyLinalgOp` fails because `FillOp` is not a `LinalgGenericOp` and it fails while reading the operand sizes attribute.
Reviewed By: nicolasvasilache
Differential Revision: https://reviews.llvm.org/D98671
The cause is the non-async-signal-safe printf function (et al.). If the test
managed to interrupt the process and inject a signal before the
printf("@started") call returned (but after it had actually written the
output), that string could end up being printed twice (presumably because the
function did not manage to clear the userspace buffer, so the print call in the
signal handler would print it once again).
This patch fixes the issue by replacing the printf call in the signal handler
with a sprintf+write combo, which should not suffer from that problem (though
I wouldn't go as far as to call it async-signal-safe).
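A minimal sketch of the sprintf+write pattern described above (the handler name and message are illustrative; snprintf is used here for bounds safety):
```cpp
#include <cstdio>
#include <unistd.h>

// Format into a local buffer and emit it with the write() syscall. Unlike
// printf(), this does not touch stdio's userspace buffering, so an
// interrupted printf() in the main program cannot be replayed from here.
static void SignalHandler(int signo) {
  char buf[64];
  int len = std::snprintf(buf, sizeof(buf), "@signal %d\n", signo);
  if (len <= 0)
    return;
  if (static_cast<size_t>(len) >= sizeof(buf))
    len = sizeof(buf) - 1; // snprintf reports the untruncated length
  write(STDERR_FILENO, buf, static_cast<size_t>(len));
}
```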
This reverts commit 6b053c9867.
The build is broken:
```
ld.lld: error: undefined symbol: llvm::VPlan::printDOT(llvm::raw_ostream&) const
>>> referenced by LoopVectorize.cpp
>>>               LoopVectorize.cpp.o:(llvm::LoopVectorizationPlanner::printPlans(llvm::raw_ostream&)) in archive lib/libLLVMVectorize.a
```
The aim is to use the correct vasprintf implementation for z/OS libc++, where a copy of the `va_list` `ap` is needed. In particular, it avoids the possibility that the initial internal call to vsnprintf modifies `ap` and the subsequent call to vsnprintf then uses that modified `ap`.
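As a sketch of the pattern (not the actual libc++ code), the measuring call operates on a copy of `ap`, leaving the original intact for the second formatting call:
```cpp
#include <cstdarg>
#include <cstdio>
#include <cstdlib>

// Sketch: vasprintf built on two vsnprintf calls. The first call only
// measures, and uses a va_copy so that it cannot consume or modify `ap`
// before the second call formats into the allocated buffer.
static int vasprintf_sketch(char **out, const char *fmt, va_list ap) {
  va_list ap_copy;
  va_copy(ap_copy, ap);
  int len = std::vsnprintf(nullptr, 0, fmt, ap_copy); // may clobber ap_copy only
  va_end(ap_copy);
  if (len < 0)
    return -1;
  *out = static_cast<char *>(std::malloc(static_cast<std::size_t>(len) + 1));
  if (*out == nullptr)
    return -1;
  return std::vsnprintf(*out, static_cast<std::size_t>(len) + 1, fmt, ap);
}
```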
Differential Revision: https://reviews.llvm.org/D97473
I foresee two uses for this:
1) It's easier to use these in a debugger.
2) Once we start implementing more VPlan-to-VPlan transformations (especially
inner loop massaging stuff), using the vectorized LLVM IR as CHECK targets in
LIT tests would become too obscure. I can imagine that we'd want to CHECK
against VPlan dumps after multiple transformations instead. That would be
easier with plain-text dumps than with the DOT format.
Reviewed By: fhahn
Differential Revision: https://reviews.llvm.org/D96628
Updates the names (e.g. widen => extend, saturate => sat) and opcodes of all
SIMD instructions to match the finalized SIMD spec. Deliberately does not change
the public interface in wasm_simd128.h yet; that will require more care.
Depends on D98466.
Differential Revision: https://reviews.llvm.org/D98676
Removes the instruction definitions, intrinsics, and builtins for qfma/qfms,
signselect, and prefetch instructions, which were not included in the final
WebAssembly SIMD spec.
Depends on D98457.
Differential Revision: https://reviews.llvm.org/D98466
Replace semantics::SymbolSet with alternatives that clarify
whether the set should order its contents by source position
or not. This matters because positionally-ordered sets must
not be used for Symbols that might be subjected to name
replacement during name resolution, and address-ordered
sets must not be used (without sorting) in circumstances
where the order of their contents affects the output of the
compiler.
All set<> and map<> instances in the compiler that are keyed
by Symbols now have explicit Compare types in their template
instantiations. Symbol::operator< is no more.
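The shape of the change, sketched with hypothetical stand-in types (Flang's actual Symbol and comparator names differ): each set spells out how it is ordered instead of relying on the element's operator<.
```cpp
#include <set>
#include <string_view>

// Stand-in for semantics::Symbol, for illustration only.
struct Sym {
  std::string_view name;
  const char *sourcePos; // position of the name in the source
};

// Address-ordered: fast and stable within a run, but the order must never
// leak into compiler output.
struct ByAddress {
  bool operator()(const Sym *x, const Sym *y) const { return x < y; }
};

// Source-position-ordered: suitable for output, but must not be used for
// symbols whose names may still be replaced during name resolution.
struct BySourcePosition {
  bool operator()(const Sym *x, const Sym *y) const {
    return x->sourcePos < y->sourcePos;
  }
};

using UnorderedSymSet = std::set<const Sym *, ByAddress>;
using SourceOrderedSymSet = std::set<const Sym *, BySourcePosition>;
```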
Differential Revision: https://reviews.llvm.org/D98878
Both libc++ and libc++abi have the option of merging with another archive: libunwind can be merged into libc++abi, and libc++abi can be merged into libc++.
This is realized using add_custom_command with POST_BUILD and the use of the CMake generator expression TARGET_LINKER_FILE in the arguments. For such generator expressions the CMake documentation states: "This target-level dependency does NOT add a file-level dependency that would cause the custom command to re-run whenever the executable is recompiled" [1]
This patch adds a DEPENDS argument to both add_custom_command invocations so that the archives also have a file-level dependency on the target they are merging with. That way, changes in, say, libunwind's source code will be reflected in the libc++abi and/or libc++ static libraries as well.
[1] https://cmake.org/cmake/help/v3.20/command/add_custom_command.html
Differential Revision: https://reviews.llvm.org/D98129
`__cpp_lib_default_template_type_for_algorithm_values` is 52 characters long,
which is long enough to reduce the padding multiplier to less than zero,
producing an empty string between the name of the macro and its numeric value.
Ensure there's always at least one space between the name of the macro and its
value.
Differential Revision: https://reviews.llvm.org/D98869
This documents current regular LLVM sync-ups that are happening in the
Getting Involved section.
I hope this gives a bit more visibility to regular sync-ups that are
happening in the LLVM community, documenting another way communication
in the community happens.
Of course the downside is that this is another location where sync-up
metadata needs to be maintained. That being said, the structure as proposed
means that no changes are needed once a new sync-up is added, apart from maybe
removing an entry once it becomes clear that a particular sync-up series has
been cancelled.
Documenting a few pointers on how current sync-ups happen may also
encourage others to organize useful sync-ups on specific topics.
I've started with adding the sync-ups I'm aware of. There's a good
chance I've missed some.
If most sync-ups end up having a public Google calendar, we could also create
and maintain a public Google calendar that shows all events happening in the
LLVM community, including dev meetings, sync-ups, socials, etc., assuming that
would be valuable.
Differential Revision: https://reviews.llvm.org/D98797
Now that the WebAssembly SIMD specification is finalized and engines are
generally up-to-date, there is no need for a separate target feature for gating
SIMD instructions that engines have not implemented. With this change,
v128.const is now enabled by default with the simd128 target feature.
Differential Revision: https://reviews.llvm.org/D98457
`--shuffle-sections=<seed>` applies to all sections. The new
`--shuffle-sections=<section-glob>=<seed>` makes shuffling selective. To the
best of my knowledge, the option is only used for debugging, so just drop the
original form.
`--shuffle-sections '.init_array*=-1' --shuffle-sections '.fini_array*=-1'`
reverses static constructors/destructors of the same priority. Useful to
detect some static initialization order fiascos.
`--shuffle-sections '.data*=-1'`
reverses `.data*` sections. Useful to detect unfounded reliance on the result
of comparing pointers to two unrelated objects.
If certain sections have an intrinsic order, the old form cannot be used.
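For example, here is the kind of latent bug that the reversed `.data*` layout can surface (illustrative program, not taken from the patch):
```cpp
#include <cstdio>

int a = 1;
int b = 2;

int main() {
  // The result of comparing the addresses of two unrelated globals is not
  // specified; code relying on it may silently change behavior when the
  // linker reverses the order of `.data*` input sections.
  std::printf("%d\n", &a < &b);
  return 0;
}
```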
Differential Revision: https://reviews.llvm.org/D98679
Add register usage information to the runtime metadata so that it can be used during kernel launch (that change will be in a different commit). Add this information to the kernel trace.
Reviewed By: JonChesterfield
Differential Revision: https://reviews.llvm.org/D98829
This fixes how the value profile is annotated after inlining.
In https://reviews.llvm.org/D96806 and https://reviews.llvm.org/D97350, we
use the magic number -1 in the value profile to avoid repeated indirect call
promotion to the same target for an indirect call. Function updateIDTMetaData
is used to mark a target as having been promoted in the value profile with the
magic number. updateIDTMetaData is also used to update the value profile
when an indirect call is inlined and the new inline instance profile should be
applied. For the second case, currently updateIDTMetaData mixes up the
existing value profile of the indirect call with the new profile, leading
to the problematic scenario where a target count is larger than the total
count in the value profile.
The patch fixes the problem. When updateIDTMetaData is used to update the
value profile after inlining, all the values in the existing value profile
will be dropped except the values with the magic number counts.
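A hypothetical sketch of the dropping rule described above (the helper, containers, and constant name are invented for illustration; the real code operates on value-profile metadata):
```cpp
#include <cstdint>
#include <vector>

// The magic number -1 described above, marking an already-promoted target.
static constexpr uint64_t kPromotedMagicCount = static_cast<uint64_t>(-1);

struct ValueCount {
  uint64_t Target;
  uint64_t Count;
};

// When applying the profile of a new inline instance, keep only the
// already-promoted markers from the old profile and take everything else
// from the new one, instead of mixing the two sets of counts.
static std::vector<ValueCount>
updateAfterInlining(const std::vector<ValueCount> &Existing,
                    const std::vector<ValueCount> &InlineInstance) {
  std::vector<ValueCount> Result;
  for (const ValueCount &VC : Existing)
    if (VC.Count == kPromotedMagicCount)
      Result.push_back(VC);
  Result.insert(Result.end(), InlineInstance.begin(), InlineInstance.end());
  return Result;
}
```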
Differential Revision: https://reviews.llvm.org/D98835
This reverts commit 11b70b9e3a.
The bot failure was due to ArgumentPromotion deleting functions
without deleting their analyses. This was separately fixed in 4b1c807.