llvm-project

Commit Graph

Author	SHA1	Message	Date
Heejin Ahn	ed41945faa	[WebAssembly] Fix call unwind mismatches This adds the `delegate` instruction and uses it to fix unwind destination mismatches created by marker placement in CFGStackify. There are two kinds of unwind destination mismatches: - Mismatches caused by throwing instructions (here we call it "call unwind mismatches", even though `throw` and `rethrow` can also cause mismatches) - Mismatches caused by `catch`es, in case a foreign exception is not caught by the nearest `catch` and the next outer `catch` is not the catch it should unwind to. This kind of mismatch didn't exist in the previous version of the spec, because in the previous spec `catch` was effectively `catch_all`, catching all exceptions. This implements routines to fix the first kind of unwind mismatches, which we call "call unwind mismatches". The second mismatch (catch unwind mismatches) will be fixed in a later CL. This also reenables all previously disabled tests in cfg-stackify-eh.ll and updates FileCheck lines to match the new spec. Two tests were deleted because they specifically tested the way we fixed unwind mismatches before using `exnref`s and branches, which we don't do anymore. Reviewed By: tlively Differential Revision: https://reviews.llvm.org/D94048	2021-02-06 07:07:04 -08:00
Sander de Smalen	79a6cfc29e	NFC: Migrate LoopIdiomRecognize to work on InstructionCost This patch migrates cost values and arithmetic to work on InstructionCost. When the interfaces to TargetTransformInfo are changed, any InstructionCost state will propagate naturally. See this patch for the introduction of the type: https://reviews.llvm.org/D91174 See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2020-November/146408.html	2021-02-06 14:39:19 +00:00
David Green	0f435a544a	[AArch64] Correct some tablegen operand types. NFC	2021-02-06 14:34:14 +00:00
Sander de Smalen	ae27274b2f	NFC: Migrate LoopFlatten to work on InstructionCost. This patch migrates cost values and arithmetic to work on InstructionCost. When the interfaces to TargetTransformInfo are changed, any InstructionCost state will propagate naturally. See this patch for the introduction of the type: https://reviews.llvm.org/D91174 See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2020-November/146408.html Reviewed By: david-arm Differential Revision: https://reviews.llvm.org/D96029	2021-02-06 11:47:04 +00:00
Tung D. Le	05c6c648ec	[MLIR] [affine-loop-fusion] Fix a bug about non-result ops in affine-loop-fusion This patch fixes the following bug when calling --affine-loop-fusion. Input program: ```mlir func @should_not_fuse_since_top_level_non_affine_non_result_users( %in0 : memref<32xf32>, %in1 : memref<32xf32>) { %c0 = constant 0 : index %cst_0 = constant 0.000000e+00 : f32 affine.for %d = 0 to 32 { %lhs = affine.load %in0[%d] : memref<32xf32> %rhs = affine.load %in1[%d] : memref<32xf32> %add = addf %lhs, %rhs : f32 affine.store %add, %in0[%d] : memref<32xf32> } store %cst_0, %in0[%c0] : memref<32xf32> affine.for %d = 0 to 32 { %lhs = affine.load %in0[%d] : memref<32xf32> %rhs = affine.load %in1[%d] : memref<32xf32> %add = addf %lhs, %rhs: f32 affine.store %add, %in0[%d] : memref<32xf32> } return } ``` After calling --affine-loop-fusion, we got an incorrect output: ```mlir func @should_not_fuse_since_top_level_non_affine_non_result_users(%arg0: memref<32xf32>, %arg1: memref<32xf32>) { %c0 = constant 0 : index %cst = constant 0.000000e+00 : f32 store %cst, %arg0[%c0] : memref<32xf32> affine.for %arg2 = 0 to 32 { %0 = affine.load %arg0[%arg2] : memref<32xf32> %1 = affine.load %arg1[%arg2] : memref<32xf32> %2 = addf %0, %1 : f32 affine.store %2, %arg0[%arg2] : memref<32xf32> %3 = affine.load %arg0[%arg2] : memref<32xf32> %4 = affine.load %arg1[%arg2] : memref<32xf32> %5 = addf %3, %4 : f32 affine.store %5, %arg0[%arg2] : memref<32xf32> } return } ``` This happened because when analyzing the source and destination nodes, affine loop fusion ignored non-result ops sandwiched between them. In other words, the MemRefDependencyGraph in the affine loop fusion ignored these non-result ops. This patch solves the issue by adding these non-result ops to the MemRefDependencyGraph. Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D95668	2021-02-06 13:30:16 +05:30
Fangrui Song	e44a100942	.gcc_except_table: Set SHF_LINK_ORDER if binutils>=2.36, and drop unneeded unique ID for -fno-unique-section-names GNU ld>=2.36 supports mixed SHF_LINK_ORDER and non-SHF_LINK_ORDER sections in an output section, so we can set SHF_LINK_ORDER if -fbinutils-version=2.36 or above. If -fno-function-sections or older binutils, drop unique ID for -fno-unique-section-names. The users can just specify -fbinutils-version=2.36 or above to allow GC with both GNU ld and LLD. (LLD does not support garbage collection of non-group non-SHF_LINK_ORDER .gcc_except_table sections.)	2021-02-05 21:45:21 -08:00
Fangrui Song	6a1235211d	[ELF] --gc-sections: collect unused SHF_LINK_ORDER .gcc_except_table A SHF_LINK_ORDER .gcc_except_table is similar to a .gcc_except_table in a section group. The associated text section is responsible for retaining it. LLD still does not support GC of non-group non-SHF_LINK_ORDER .gcc_except_table - but that is not necessary because we can teach the compiler to set SHF_LINK_ORDER.	2021-02-05 21:35:27 -08:00
Kazu Hirata	ea3175c15b	[Transforms/Instrumentation] Use range-based for loops (NFC)	2021-02-05 21:02:08 -08:00
Kazu Hirata	aa5c09bead	[llvm] Fix header guards (NFC) Identified with llvm-header-guard.	2021-02-05 21:02:06 -08:00
Kazu Hirata	7725b81822	[AMDGPU] Drop unnecessary const from a return type (NFC) Identified with const-return-type.	2021-02-05 21:02:04 -08:00
Fangrui Song	853a264916	[AsmPrinter] __patchable_function_entries: Set SHF_LINK_ORDER for binutils 2.36 and above This matches GCC behavior when the configure-time binutils is new. GNU ld<2.36 did not support mixed SHF_LINK_ORDER and non-SHF_LINK_ORDER sections in an output section, so we conservatively disable SHF_LINK_ORDER for <2.36.	2021-02-05 19:53:06 -08:00
Wenlei He	801d9cc7b9	[CSSPGO] Use merged base profile for hot threshold calculation Context-sensitive profile effectively splits a function profile into many copies, each representing the CFG profile of a particular calling context. That makes the count distribution look flatter, as we now have more function profiles, each with lower counts, which in turn leads to lower hot thresholds. Now we tell threshold computation to merge context profiles first before calculating percentile-based cutoffs, to compensate for the seemingly flat context profile. This can be controlled by the switch `sample-profile-contextless-threshold`. Earlier measurement showed ~0.4% perf boost with this tuning on spec2k6 for CSSPGO (with pseudo-probe and new inliner). Differential Revision: https://reviews.llvm.org/D95980	2021-02-05 17:51:00 -08:00
Mehdi Amini	d6efb6fc86	Rework ExecutionEngine::invoke() to make it more friendly to use from C++ This new invoke will pack a list of arguments before calling the `invokePacked` method. It accepts the returned value as an output argument wrapped in `ExecutionEngine::Result<T>`, and delegates the packing of arguments to a trait to allow for customization for some types. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D95961	2021-02-06 01:32:50 +00:00
Mehdi Amini	0453d2ddb4	Add a link to the LLVM Dev recording from the MLIR tutorial landing page	2021-02-06 01:26:59 +00:00
Adrian Prantl	79f46a30c2	Have stripDebugInfo() also strip !llvm.loop annotations from all instructions. The !llvm.loop annotations consist of pointers into the debug info, so when stripping the debug info (particularly important when it is malformed!) !llvm.loop annotations need to be stripped as well, or else the malformed debug info stays around. This patch applies the stripping to all instructions, not just terminator instructions. rdar://73687049 Differential Revision: https://reviews.llvm.org/D96181	2021-02-05 17:22:41 -08:00
Joerg Sonnenberger	9179764710	SPARCv9: recognize SIR trap instruction	2021-02-06 01:34:02 +01:00
Greg McGary	c3e4f3b231	[lld-macho] Fix alignment & layout to match ld64 and satisfy kernel & codesign The Mach kernel & codesign on arm64 macOS have strict requirements for alignment and sequence of segments and sections. Dyld probably is just as picky, though kernel & codesign reject malformed Mach-O files before dyld ever has a chance. I developed this diff by incrementally changing alignments & sequences to match the output of ld64. I stopped when my hello-world test program started working: `codesign --verify` succeeded, and `execve(2)` didn't immediately fail with `errno == EBADMACHO` = `"Malformed Mach-O file"`. Differential Revision: https://reviews.llvm.org/D94935	2021-02-05 17:22:03 -07:00
Wouter van Oortmerssen	a872ee2f36	[WebAssembly] ensure .functype applies to right label in assembler We used to require that .functype immediately follow the label it sets the type of, but not all Clang output follows this rule. Now we simply allow it on any symbol, but only assume it's a function start for a defined symbol, which is simpler and more general. Fixes (part of) https://bugs.llvm.org/show_bug.cgi?id=49036 Differential Revision: https://reviews.llvm.org/D96165	2021-02-05 15:36:15 -08:00
Jonas Devlieghere	28c6b1e552	[lldb] Re-enable TestExprsChar on arm64 This test passes on arm64 (Apple Silicon). I assume that "aarch64" still ensures this gets skipped on Linux. I don't have access to such an environment so I'll have to rely on the bot complaining.	2021-02-05 15:21:24 -08:00
Jonas Devlieghere	ac1242bce3	[debugserver] Fix more warnings in DNBArchImplARM64	2021-02-05 15:21:24 -08:00
Siva Chandra	53fcf6bb62	[libc][aarch64] Enable a bunch of math functions. Namely, these are the functions enabled: rint, lrint, llrint, lround, llround, nearbyint. They were previously not enabled because they required rounding mode and FP exception support. Now that rounding mode and FP exception support is available for Aarch64, they can be enabled.	2021-02-05 15:11:17 -08:00
Lei Zhang	7630520ae3	[mlir][vector] Add pattern to shuffle bitcast ops These patterns move vector.bitcast ops to be before insert ops or after extract ops where suitable. With them, bitcast will happen on smaller vectors and there are more chances to share extract/insert ops. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D96040	2021-02-05 17:52:49 -05:00
Fangrui Song	a4fa667dee	[libc++abi] Disable _Unwind_ForcedUnwind + exception tests for ARM EHABI libunwind ARM EHABI does not support _Unwind_ForcedUnwind yet. In addition, ARM EHABI makes `_Unwind_Exception` a typedef so `struct _Unwind_Exception*` cannot be used.	2021-02-05 14:12:27 -08:00
Sam Clegg	38a285885d	[clang][emscripten] Add builtin define for __EMSCRIPTEN_PTHREADS__ Currently the emscripten frontend driver injects this when building with thread support. Moving this into the clang driver itself makes the emscripten python driver less magical. Differential Revision: https://reviews.llvm.org/D96171	2021-02-05 13:53:05 -08:00
Wouter van Oortmerssen	5e5b2cb131	[WebAssembly] Prevent data inside text sections in assembly This is not supported in Wasm, unless the data was encoded instructions, but that wouldn't work with the assembler's other functionality (enforcing nesting etc.). Fixes: https://bugs.llvm.org/show_bug.cgi?id=48971 Differential Revision: https://reviews.llvm.org/D95838	2021-02-05 13:48:25 -08:00
Nico Weber	5f76044c25	[gn build] enable new pass manager more, follow-up to `39ceb5c9cf`	2021-02-05 16:15:14 -05:00
Aaron Ballman	ec04e2850a	Allow SmallPtrSet to be used with a std::insert_iterator Currently, the SmallPtrSet type allows inserting elements but it does not support inserting elements with a positional hint. The lack of this signature means that you cannot use SmallPtrSet with std::insert_iterator or std::inserter(), which makes some code constructs more awkward. This adds an overload of insert() that can be used in these scenarios. The positional hint is unused by SmallPtrSet and the call is equivalent to calling insert() without a hint.	2021-02-05 16:12:47 -05:00
Petr Hosek	9fd9b5a9c9	Don't emit coverage mapping for excluded functions When a function or a file is excluded using -fprofile-list= option, don't emit coverage mapping as doing so confuses users since those functions would always have zero count. This also reduces the binary size considerably in cases where only a few functions or files are being instrumented. Differential Revision: https://reviews.llvm.org/D96000	2021-02-05 13:03:57 -08:00
Sterling Augustine	a34b8b879e	Various minor fixes for python 3 Switch StdTuple printer from python 2-style "next" to python 3. Nested iteration changed enough to make the original bitset iteration code a bit trickier than it needs to be, so unnest. The end node of a map iterator is sometimes hard to detect in isolation, don't fail in that case. Differential Revision: https://reviews.llvm.org/D96167	2021-02-05 13:01:34 -08:00
Martin Storsjö	d4f4e723d0	[libcxx] Implement temp_directory_path using GetTempPath on windows This does roughly the same as the manual implementation, but checks a slightly different set of environment variables and has a more appropriate fallback if no environment variables are available (/tmp isn't a very useful fallback on windows). Differential Revision: https://reviews.llvm.org/D91175	2021-02-05 22:47:33 +02:00
Martin Storsjö	4d292d531b	[libcxx] Use the posix code for directory_entry::__do_refresh This works just fine for windows, as all the functions it calls are implemented and wrapped for windows. Differential Revision: https://reviews.llvm.org/D91173	2021-02-05 22:47:33 +02:00
Eric Schweitz	f6342806db	[flang][fir] Add FIR's vector type. This patch adds support for `!fir.vector`, a rank one, constant length data type. https://github.com/flang-compiler/f18-llvm-project/pull/413 Differential Revision: https://reviews.llvm.org/D96162	2021-02-05 12:44:19 -08:00
Arthur O'Dwyer	85167fb7c2	[libc++] Further improve the contiguous-iterator story, and fix some bugs. - Quality-of-implementation: Avoid calling __unwrap_iter in constexpr contexts. The user might conceivably write a contiguous iterator where normal iterator arithmetic is constexpr-friendly but `std::to_address(it)` isn't. - Bugfix: When you pass contiguous iterators to `std::copy`, you should get back your contiguous iterator type, not a raw pointer. That means that libc++ can't `__unwrap_iter` unless it also does `__rewrap_iter`. Fortunately, this is implementable. - Improve test coverage of the new `contiguous_iterator` test iterator. This catches the bug described above. - Tests: Stop testing that we can `std::copy` //into// an `input_iterator`. Our test iterators may currently support that, but it seems nonsensical to me. Differential Revision: https://reviews.llvm.org/D95983	2021-02-05 15:18:04 -05:00
Yaxun (Sam) Liu	b008ea304d	[CUDA][HIP] Fix device variable linkage For -fgpu-rdc, shadow variables should not be internalized, otherwise they cannot be accessed by other TUs. This is necessary because the shadow variables of external device variables are always emitted as undefined symbols, which need to resolve to global symbols. Managed variables need to be emitted as undefined symbols in device compilations. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D95901	2021-02-05 15:11:12 -05:00
Sanjay Patel	c981f6f8e1	Revert "[Codegen][ReplaceWithVecLib] add pass to replace vector intrinsics with calls to vector library" This reverts commit `2303e93e66`. Investigating bot failures.	2021-02-05 15:10:11 -05:00
Craig Topper	3c767b96dc	[RISCV] Correct types in tablegen multiclasses found by D95874.	2021-02-05 11:55:58 -08:00
Arthur Eubanks	526c0955c0	[NVPTX][NewPM] Temporarily disable NVPTX passes in new PM pipeline These passes are causing numerical discrepancies after being added to the pipeline. Disable while investigating. Reviewed By: rupprecht Differential Revision: https://reviews.llvm.org/D96166	2021-02-05 11:31:07 -08:00
zoecarver	fab194898b	[libc++][docs] Explain noexcept policy for narrow contracts. Adds documentation around libc++'s policy to add noexcept to things that cannot throw but aren't marked as noexcept. Refs LWG 3518 and D95251. Differential Revision: https://reviews.llvm.org/D95821	2021-02-05 11:27:19 -08:00
Sanjay Patel	8d9527a0bf	[PhaseOrdering] add test to show combined result of reassociate+instcombine+vectorizers; NFC	2021-02-05 14:25:19 -05:00
Sanjay Patel	3d10a0bdab	[Reassociate] add test for shl+or; NFC	2021-02-05 14:25:19 -05:00
Lukas Sommer	2303e93e66	[Codegen][ReplaceWithVecLib] add pass to replace vector intrinsics with calls to vector library This patch adds a pass to replace calls to vector intrinsics (i.e., LLVM intrinsics operating on vector operands) with calls to a vector library. Currently, calls to LLVM intrinsics are only replaced with calls to vector libraries when scalar calls to intrinsics are vectorized by the Loop- or SLP-Vectorizer. With this pass, it is now possible to replace calls to LLVM intrinsics already operating on vector operands, e.g., if such code was generated by MLIR. For the replacement, information from the TargetLibraryInfo, e.g., as specified via -vector-library is used. Differential Revision: https://reviews.llvm.org/D95373	2021-02-05 14:25:19 -05:00
Wouter van Oortmerssen	e3c0b0fe09	[WebAssembly] locals can now be indirect in DWARF This for example to indicate that byval args are represented by a pointer to a struct. Followup to https://reviews.llvm.org/D94140 Differential Revision: https://reviews.llvm.org/D94347	2021-02-05 11:14:42 -08:00
Eric Schweitz	a1a1d338e9	[flang][NFC] Make KindTy consistent and consistently used. Differential Revision: https://reviews.llvm.org/D96154	2021-02-05 10:54:58 -08:00
Eric Schweitz	ea35745610	[flang][NFC] Update comments. Differential Revision: https://reviews.llvm.org/D96152	2021-02-05 10:54:58 -08:00
Siva Chandra	c90c8d38d3	[libc] Add aarch64 flavors of floor, round, sqrt and trunc. Only single and double precision flavors have been added. Reviewed By: lntue, sdesmalen Differential Revision: https://reviews.llvm.org/D95999	2021-02-05 10:41:32 -08:00
Thomas Preud'homme	00a62547da	Stop trapping on sNaN in __builtin_isnan __builtin_isnan currently generates a floating-point compare operation which triggers a trap when faced with a signaling NaN in StrictFP mode. This commit uses integer operations instead to not generate any trap in such a case. Reviewed By: kpn Differential Revision: https://reviews.llvm.org/D95948	2021-02-05 18:28:48 +00:00
Amy Huang	34f3249abd	[DebugInfo] Fix error from D95893, where I accidentally used an unsigned int in a loop and it wraps around. Follow up to https://reviews.llvm.org/D95893	2021-02-05 10:25:21 -08:00
Huihui Zhang	1b81117f88	[DAGCombiner][SVE] Fix invalid use of getVectorNumElements() in visitSRA. Make sure scalable property is preserved by using getVectorElementCount(). Reviewed By: paulwalker-arm Differential Revision: https://reviews.llvm.org/D95967	2021-02-05 09:56:49 -08:00
Amy Huang	a740af4de9	[CodeView][DebugInfo] Update the code for removing template arguments from the display name of a codeview function id. Previously the code split the string at the first '<', which incorrectly truncated names like `operator<`. Differential Revision: https://reviews.llvm.org/D95893	2021-02-05 09:49:11 -08:00
Jianzhou Zhao	0f3fd3b281	[dfsan] Add thread registration This is a part of https://reviews.llvm.org/D95835. This change addresses two problems: 1) When recording stacks in origin tracking, libunwind is not async signal safe. Inside signal callbacks, we need to use fast unwind. Fast unwind needs threads. 2) StackDepot used by origin tracking is not async signal safe, so we set a flag per thread inside a signal callback to prevent its use. The thread registration is similar to ASan and MSan. Related MSan changes are * `98f5ea0dba` * `f653cda269` * `5a7c364343` Some changes in the diff are used in the next diffs: 1) The test case pthread.c is not very interesting for now. It will be extended to test origin tracking later. 2) DFsanThread::InSignalHandler will be used by origin tracking later. Reviewed-by: morehouse Differential Revision: https://reviews.llvm.org/D95963	2021-02-05 17:38:59 +00:00
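
Commit `00a62547da` above replaces the floating-point compare in `__builtin_isnan` with integer operations so that a signaling NaN does not trap. The idea can be sketched roughly as follows. This is only an illustration, not the code Clang emits: the helper name `isnan_by_bits` is hypothetical, and the sketch assumes `double` is a 64-bit IEEE-754 binary64 value.

```cpp
#include <cstdint>
#include <cstring>
#include <limits>

// Hypothetical helper (not part of LLVM): test for NaN using only integer
// operations on the bit pattern, assuming IEEE-754 binary64 doubles.
bool isnan_by_bits(double x) {
    static_assert(sizeof(double) == sizeof(std::uint64_t),
                  "sketch assumes a 64-bit double");
    std::uint64_t bits;
    std::memcpy(&bits, &x, sizeof bits);  // reinterpret without a float compare

    // Clear the sign bit, leaving exponent and mantissa.
    const std::uint64_t abs_bits = bits & 0x7FFFFFFFFFFFFFFFULL;
    // Bit pattern of +infinity: exponent all ones, mantissa zero.
    const std::uint64_t inf_bits = 0x7FF0000000000000ULL;

    // A NaN has an all-ones exponent and a non-zero mantissa, so its
    // magnitude bits compare strictly greater than those of infinity.
    return abs_bits > inf_bits;
}
```

Because the comparison is done on integers, no floating-point exception flag should be raised for either quiet or signaling NaNs in this sketch.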

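
Commit `a740af4de9` above notes that splitting a display name at the first `<` truncates names such as `operator<`. One way to avoid that, sketched here under stated assumptions, is to match angle brackets from the end of the name instead. The function `strip_trailing_template_args` is hypothetical and is not the LLVM implementation; it assumes the template argument list, if present, is the final thing in the name, and it does not handle `<` or `>` characters appearing inside argument expressions.

```cpp
#include <cstddef>
#include <string>

// Hypothetical helper (not LLVM's code): drop a trailing "<...>" template
// argument list by matching brackets backward from the end of the name.
std::string strip_trailing_template_args(const std::string &name) {
    // Without a closing '>' at the end there is no argument list to strip.
    if (name.empty() || name.back() != '>')
        return name;

    int depth = 0;
    for (std::size_t i = name.size(); i-- > 0;) {
        if (name[i] == '>') {
            ++depth;
        } else if (name[i] == '<' && --depth == 0) {
            std::string base = name.substr(0, i);
            // If only "operator" is left, the brackets were part of an
            // operator name such as operator<=>, so keep the name whole.
            const std::string op = "operator";
            const bool bare_operator =
                base.size() >= op.size() &&
                base.compare(base.size() - op.size(), op.size(), op) == 0;
            return bare_operator ? name : base;
        }
    }
    // Unbalanced '>' (e.g. operator> or operator>>): nothing to strip.
    return name;
}
```

With this sketch, `operator<<int>` becomes `operator<`, while `operator<`, `operator>>`, and `operator<=>` are returned unchanged.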
379213 Commits

All Branches