llvm-project

Commit Graph

Author	SHA1	Message	Date
Matthias Gehre	bf2dc4b376	compiler-rt: Add udivmodei5 to builtins and add bitint library According to the RFC [0], this review contains the compiler-rt parts of large integer divison for _BitInt. It adds the functions ``` /// Computes the unsigned division of a / b for two large integers /// composed of n significant words. /// Writes the quotient to quo and the remainder to rem. /// /// \param quo The quotient represented by n words. Must be non-null. /// \param rem The remainder represented by n words. Must be non-null. /// \param a The dividend represented by n + 1 words. Must be non-null. /// \param b The divisor represented by n words. Must be non-null. /// \note The word order is in host endianness. /// \note Might modify a and b. /// \note The storage of 'a' needs to hold n + 1 elements because some /// implementations need extra scratch space in the most significant word. /// The value of that word is ignored. COMPILER_RT_ABI void __udivmodei5(su_int quo, su_int rem, su_int a, su_int b, unsigned int n); /// Computes the signed division of a / b. /// See __udivmodei5 for details. COMPILER_RT_ABI void __divmodei5(su_int quo, su_int rem, su_int a, su_int b, unsigned int words); ``` into builtins. In addition it introduces a new "bitint" library containing only those new functions, which is meant as a way to provide those when using libgcc as runtime. [0] https://discourse.llvm.org/t/rfc-add-support-for-division-of-large-bitint-builtins-selectiondag-globalisel-clang/60329 Differential Revision: https://reviews.llvm.org/D120327	2022-04-08 07:43:15 +01:00
River Riddle	36d3efea15	[mlir][NFC] Drop a few unnecessary includes from Pass.h	2022-04-07 23:42:47 -07:00
Zi Xuan Wu	3d4ca8a8c3	[CSKY] Correct the alignment of FPR register The alignment of FPR64 and sFPR64 declared in RegisterClass should be 32 bit.	2022-04-08 14:37:07 +08:00
Markus Böck	0c789db541	[mlir] Add support for operation-produced successor arguments in BranchOpInterface This patch revamps the BranchOpInterface a bit and allows a proper implementation of what was previously `getMutableSuccessorOperands` for operations, which internally produce arguments to some of the block arguments. A motivating example for this would be an invoke op with a error handling path: ``` invoke %function(%0) label ^success ^error(%1 : i32) ^error(%e: !error, %arg0 : i32): ... ``` The advantages of this are that any users of `BranchOpInterface` can still argue over remaining block argument operands (such as `%1` in the example above), as well as make use of the modifying capabilities to add more operands, erase an operand etc. The way this patch implements that functionality is via a new class called `SuccessorOperands`, which is now returned by `getSuccessorOperands`. It basically contains an `unsigned` denoting how many operator produced operands exist, as well as a `MutableOperandRange`, which are the usual forwarded operands we are used to. The produced operands are assumed to the first few block arguments, followed by the forwarded operands afterwards. The role of `SuccessorOperands` is to provide various utility functions to modify and query the successor arguments from a `BranchOpInterface`. Differential Revision: https://reviews.llvm.org/D123062	2022-04-08 08:28:16 +02:00
Michael Forney	795b07f549	[asan] Always skip first object from dl_iterate_phdr All platforms return the main executable as the first dl_phdr_info. FreeBSD, NetBSD, Solaris, and Linux-musl place the executable name in the dlpi_name field of this entry. It appears that only Linux-glibc uses the empty string. To make this work generically on all platforms, unconditionally skip the first object (like is currently done for FreeBSD and NetBSD). This fixes first DSO detection on Linux-musl. It also would likely fix detection on Solaris/Illumos if it were to gain PIE support (since dlpi_addr would not be NULL). Additionally, only skip the Linux VDSO on linux. Finally, use the empty string as the "seen first dl_phdr_info" marker rather than (char *)-1. If there was no other object, we would try to dereference it for a string comparison. Reviewed By: MaskRay, vitalybuka Differential Revision: https://reviews.llvm.org/D119515	2022-04-07 22:35:24 -07:00
Hongtao Yu	8a0406dcc8	[llvm-profgen] Filter out invalid LBR ranges. The profiler can sometimes give us a LBR trace that implicates bogus code ranges. For example, 0xc5acb56/0xc66c6c0 0xc628195/0xf31fbb0 0xc611261/0xc628130 0xc5c1a21/0xc6111c0 0x1f7edfd3/0xc5c3a50 0xc5c154f/0x1f7edec0 0xe8eed07/0xc5c11e0 , note that the first two pairs are supposed to form a linear execution range, in this case, it is [0xf31fbb0, 0xc5acb56] , which doesn't make sense. Such bogus ranges should be ruled out to avoid generating a bad profile. I'm fixing this for both CS and non-CS cases. Reviewed By: wenlei Differential Revision: https://reviews.llvm.org/D123271	2022-04-07 21:42:01 -07:00
Zi Xuan Wu	208f93c1fd	[CSKY] support select instruction in floating type In FPUv3, there is fsel.32/64 instruction to select float/double type data. In FPUv2, split block and use branch and move instruction to select float/double type data.	2022-04-08 12:38:50 +08:00
Senran Zhang	a23652f6f9	[demangler] Support C23 _BitInt type Reviewed By: #libc_abi, aaron.ballman, urnathan Differential Revision: https://reviews.llvm.org/D122530	2022-04-08 12:20:45 +08:00
Stella Laurenzo	497f87bb7b	NFC: Silence unused function 'scaleAndAdd' in release build. Differential Revision: https://reviews.llvm.org/D123354	2022-04-07 21:19:19 -07:00
Kito Cheng	5286c7aef8	[RISCV][NFC] Add missing lit.local.cfg in test/CodeGen/MIR/RISCV/	2022-04-08 12:10:20 +08:00
LLVM GN Syncbot	a5daf81df0	[gn build] Port `690085c9b7`	2022-04-08 04:04:28 +00:00
Ye Luo	c1a6fe196d	[libomptarget] Implement pointer lookup as 5.1 spec. As described in 5.1 spec 2.21.7.2 Pointer Initialization for Device Data Environments Reviewed By: RaviNarayanaswamy Differential Revision: https://reviews.llvm.org/D123093	2022-04-07 23:01:25 -05:00
Kito Cheng	9c5aedfbf5	[RISCV] Fixing stack offset for RVV object with vararg in stack. We found LLVM generate wrong stack offset for RVV object when stack having variable argument, that cause by we didn't count vaarg part during calculate RVV stack objects. Also update the stack layout diagram for including vaarg in the diagram. Stack layout ref: https://github.com/gcc-mirror/gcc/blob/master/gcc/config/riscv/riscv.cc#L3941 Reviewed By: rogfer01 Differential Revision: https://reviews.llvm.org/D123180	2022-04-08 12:01:16 +08:00
Kito Cheng	7a123890c9	[RISCV] Pre-commit for fixing stack offset for RVV object Reviewed By: rogfer01, frasercrmck Differential Revision: https://reviews.llvm.org/D123179	2022-04-08 11:57:57 +08:00
Kito Cheng	690085c9b7	[RISCV] Store/restore RISCVMachineFunctionInfo into MIR YAML file RISCVMachineFunctionInfo has some fields like VarArgsFrameIndex and VarArgsSaveSize are calculated at ISel lowering stage, those info are not contained in MIR files, that cause test cases rely on those field can't not reproduce correctly by MIR dump files. This patch adding the MIR read/write for those fields. Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D123178	2022-04-08 11:55:48 +08:00
Chuanqi Xu	74b56e02bd	[NFC] Remove unused variable in CodeGenModules This eliminates an unused-variable warning	2022-04-08 11:52:31 +08:00
Evgeniy Brevnov	da41214d65	Add support for atomic memory copy lowering Currently, the utility supports lowering of non atomic memory transfer routines only. This patch adds support for atomic version of memcopy. This may be useful for targets not supporting atomic memcopy. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D118443	2022-04-08 10:41:31 +07:00
jacquesguan	5bd7b0efd0	[mlir][LLVMIR] Add more vector predication intrinsic ops. This revision adds float unary, ternary and float/integer reduction intrinsic ops. Differential Revision: https://reviews.llvm.org/D123189	2022-04-08 03:16:37 +00:00
Austin Kerbow	26b14c3ea7	[InferAddressSpaces] Fix assert on invalid bitcast placement Similar to the problem in `0bb25b4603`, bitcasts that are inserted must dominate all uses. When rewriting "values" with "new values" that have the updated address space, we may replace the "new value" with a bitcast if one of the original users is an addresspace cast. This bitcast must be inserted before ALL users, not only before the addresspace cast. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D122964	2022-04-07 20:07:53 -07:00
jacquesguan	a55c19c44b	[RISCV][NFC] Use defvar to simplify pattern definations. Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D123292	2022-04-08 02:51:30 +00:00
Chenbing Zheng	467cbb6249	[InstCombine] fold more constant divisor to select-of-constants divisor By adding a parameter to function FoldOpIntoSelect， we can fold more Ops to Select. For this example, we tend to fold the division instruction, so we no longer care whether SelectInst is one use. This patch slove TODO left in InstCombine/div.ll. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D122967	2022-04-08 10:19:24 +08:00
Jeremy Furtek	21949de62f	[mlir] Width parameterization of BitEnum attributes This diff contains: - Parameterization of bit enum attributes in OpBase.td by bit width (e.g. 32 and 64). Previously, all enums were 32-bits. This brings enum functionality in line with other integer attributes, and allows for bit enums greater than 32 bits. - SPIRV and Vector dialects were updated to use bit enum attributes with an explicit bit width Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D123095	2022-04-08 01:21:29 +00:00
Stella Laurenzo	145574fa2d	NFC: Eliminate warning for unused type alias FnTraitsT in release builds. Differential Revision: https://reviews.llvm.org/D123351	2022-04-07 18:11:11 -07:00
Lang Hames	a76209c265	[ORC] Fix handling of casts in llvm.global_ctors. Removes a bogus dyn_cast_or_null that was breaking cast-expression handling when parsing llvm.global_ctors. The intent of this code was to identify Functions nested within cast expressions, but the offending dyn_cast_or_null was actually blocking that: Since a function is not a cast expression, we would set FuncC to null and break the loop without finding the Function. The cast was not necessary either: Functions are already Constants, and we didn't need to do anything ConstantExpr-specific with FuncC, so we could just drop the cast. Thanks to Jonas Hahnfeld for tracking this down. http://llvm.org/PR54797	2022-04-07 17:06:38 -07:00
David Blaikie	1cee3d9db7	DebugInfo: Consider the type of NTTP when simplifying template names Since the NTTP may need to be cast to the type when rebuilding the name, check that the type can be rebuilt when determining whether a template name can be simplified.	2022-04-08 00:00:46 +00:00
Kevin Athey	0713053e4a	[MSAN] extend prctl interceptor to support PR_SCHED_CORE Reviewed By: eugenis Differential Revision: https://reviews.llvm.org/D122851	2022-04-07 16:49:25 -07:00
Walter Erquinigo	e0cfe20ad2	[trace][intel pt] Create a common accessor for live and postmortem data Some parts of the code have to distinguish between live and postmortem threads to figure out how to get some data, e.g. thread trace buffers. This makes the code less generic and more error prone. An example of that is that we have two different decoders: LiveThreadDecoder and PostMortemThreadDecoder. They exist because getting the trace bufer is different for each case. The problem doesn't stop there. Soon we'll have even more kinds of data, like the context switch trace, whose fetching will be different for live and post- mortem processes. As a way to fix this, I'm creating a common API for accessing thread data, which is able to figure out how to handle the postmortem and live cases on behalf of the caller. As a result of that, I was able to eliminate the two decoders and unify them into a simpler one. Not only that, our TraceSave functionality only worked for live threads, but now it can also work for postmortem processes, which might be useful now, but it might in the future. This common API is OnThreadBinaryDataRead. More information in the inline documentation. Differential Revision: https://reviews.llvm.org/D123281	2022-04-07 15:58:44 -07:00
Walter Erquinigo	6423b50235	[trace][intel pt] Create a class for the libipt decoder wrapper As we soon will need to decode multiple raw traces for the same thread, having a class that encapsulates the decoding of a single raw trace is a stepping stone that will make the coming features easier to implement. So, I'm creating a LibiptDecoder class with that purpose. I refactored the code and it's now much more readable. Besides that, more comments were added. With this new structure, it's also easier to implement unit tests. Differential Revision: https://reviews.llvm.org/D123106	2022-04-07 15:58:34 -07:00
Arthur Eubanks	4713038425	[test][DSE] Precommit more assume tests	2022-04-07 15:37:39 -07:00
Jorge Gorbe Moya	627f55b3ae	Fix format specifier. NFCI. Using a portable format specifier avoids a "format specifies type 'unsigned long long' but the argument has type 'uint64_t' (aka 'unsigned long') [-Werror,-Wformat]" error depending on the exact definition of `uint64_t`.	2022-04-07 15:26:49 -07:00
Zequan Wu	1da67ecefd	[llvm-symbolizer] Fix line offset for inline site. This fixes the issue when the current line offset is actually for next range. Maintain a current code range with current line offset and cache next file/line offset. Update file/line offset after finishing current range. Differential Revision: https://reviews.llvm.org/D123151	2022-04-07 15:17:59 -07:00
Jez Ng	b440c25742	[lld-macho][nfc] Give non-text ConcatOutputSections order-independent finalization This diff is motivated by my work to add proper DWARF unwind support. As detailed in PR50956 functions that need DWARF unwind need to have compact unwind entries synthesized for them. These CU entries encode an offset within `__eh_frame` that points to the corresponding DWARF FDE. In order to encode this offset during `UnwindInfoSectionImpl::finalize()`, we need to first assign values to `InputSection::outSecOff` for each `__eh_frame` subsection. But `__eh_frame` is ordered after `__unwind_info` (according to ld64 at least), which puts us in a bit of a bind: `outSecOff` gets assigned during finalization, but `__eh_frame` is being finalized after `__unwind_info`. But it occurred to me that there's no real need for most ConcatOutputSections to be finalized sequentially. It's only necessary for text-containing ConcatOutputSections that may contain branch relocs which may need thunks. ConcatOutputSections containing other types of data can be finalized in any order. This diff moves the finalization logic for non-text sections into a separate `finalizeContents()` method. This method is called before section address assignment & unwind info finalization takes place. In theory we could call these `finalizeContents()` methods in parallel, but in practice it seems to be faster to do it all on the main thread. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D123279	2022-04-07 18:13:27 -04:00
Stanislav Mekhanoshin	16cf9e6dad	[AMDGPU] Fix handling of gfx10 LDS misaligned access bug It was only handled for FLAT initially because we did not have unaligned DS instructions lowering. Now it is implemented but the bug is not handled. Differential Revision: https://reviews.llvm.org/D123338	2022-04-07 15:08:29 -07:00
Pengxuan Zheng	1c9415806b	[compiler-rt][builtins] Move DMB definition to syn-ops.h Compiler-rt cross-compile for ARMv5 fails because D99282 made it an error if DMB is used for any pre-ARMv6 targets. More specifically, the "#error only supported on ARMv6+" added in D99282 will cause compilation to fail when any source file which includes assembly.h are compiled for pre-ARMv6 targets. Since the only place where DMB is used is syn-ops.h (which is only included by arm/sync_fetch_and_* and these files are excluded from being built for older targets), this patch moves the definition there to avoid the issues described above. Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D123105	2022-04-07 14:57:41 -07:00
Quinn Pham	fef56f79ac	Revert "[PowerPC] Fix EmitPPCBuiltinExpr to emit arguments once" This reverts commit `2aae5b1fac`. Because it breaks tests on windows.	2022-04-07 16:45:19 -05:00
Fangrui Song	be01af4a0f	[ELF] Fix non-relocatable-non-emit-relocs --gc-sections to discard .L symbols This reverts commit `764cd491b1`, which I incorrectly assumed NFC partly because there were no test coverage for the non-relocatable non-emit-relocs case before 9d6d936243fe343abe89323a27c7241b395af541. The interaction of {,-r,--emit-relocs} {,--discard-locals} {,--gc-sections} is complex but without -r/--emit-relocs, --gc-sections does need to discard .L symbols like --no-gc-sections. The behavior matches GNU ld.	2022-04-07 14:34:32 -07:00
Stanislav Mekhanoshin	e66f0edb40	[AMDGPU] Split unaligned LDS access instead of scalarizing There is no need to fully scalarize an unaligned operation in some case, just split it to alignment. Differential Revision: https://reviews.llvm.org/D123330	2022-04-07 14:27:37 -07:00
Fangrui Song	e25c41803f	[ELF][test] Improve discard-locals.s	2022-04-07 14:24:15 -07:00
Florian Hahn	631016a853	[LV] Add test case for PR54427. Reduced test for #54427.	2022-04-07 23:21:21 +02:00
Quinn Pham	2aae5b1fac	[PowerPC] Fix EmitPPCBuiltinExpr to emit arguments once This patch changes `EmitPPCBuiltinExpr` in `CGBuiltin.cpp` to remove the loop at the beginning of the function that emits the arguments and to delay emitting the arguments until inside the switch statement. These changes will put `EmitPPCBuiltinExpr` in line with the strategy of the target independent function `EmitBuiltinExpr`. Also, this patch ensures that arguments are only emitted once. Tests that included builtins affected by these changes have been modified to match expected behaviour. Reviewed By: #powerpc, nemanjai, amyk Differential Revision: https://reviews.llvm.org/D121637	2022-04-07 16:00:12 -05:00
Jonas Devlieghere	8ece6b78c0	[lldb] Use getMainExecutable in SBDebugger::PrintStackTraceOnError Implement Pavel's suggestion to use llvm::sys::fs::getMainExecutable to find the executable name for llvm::sys::PrintStackTraceOnErrorSignal.	2022-04-07 13:53:23 -07:00
Mark de Wever	b05027aaf9	Revert "[libc++][format] Use a helper constant." This reverts commit `82427685ea`. This seems to break the AIX-32 bit build.	2022-04-07 22:40:08 +02:00
Luboš Luňák	c29a51b3a2	[lldb][gui] remove the "expand" diamond for variables where expanding fails If the variables view shows a variable that is a struct that has MightHaveChildren(), the expand diamond is shown, but if trying to expand it and it's not possible (e.g. incomplete type), remove the expand diamond to visualize that it can't be in fact expanded. Otherwise it feels kinda weird that a tree item cannot be expanded even though it "should". I guess the MightHaveChildren() checking means that GetChildren() may be expensive, so also do not call it for rows that are not expanded. Differential Revision: https://reviews.llvm.org/D123008	2022-04-07 21:59:18 +02:00
Luboš Luňák	f42f21746c	[lldb][gui] handle Ctrl+C to stop a running process Differential Revision: https://reviews.llvm.org/D123015	2022-04-07 21:58:37 +02:00
Craig Topper	65942554e2	[ARM] Add missing return to ARMTTIImpl::isLoweredToCall. I assume we meant to return the result of the call to BaseT::isLoweredToCall(F). This might not be a functional change in practice because it would still hit the default case in the switch and call BaseT::isLoweredToCall(F) at the end. Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D123333	2022-04-07 12:52:54 -07:00
Nico Weber	2cb3d28b17	[lld/mac] Add some comments and asserts I was wondering if SymtabSection::emitStabs() should check defined->includeInSymtab. Add asserts and comments explaining why that's not necessary. No behavior change. Differential Revision: https://reviews.llvm.org/D123302	2022-04-07 15:43:28 -04:00
Emil Kieri	da1fc3ae95	[Driver][NFC] Simplify handling of flags in Options.td We aim at improving the readability and maintainability of Options.td, and in particular its handling of 'Flags', by - limiting the extent of 'let Flags = [...] in {'s, and - adding closing comments to matching '}'s. - being more consistent about empty lines around 'let Flags' and '}'. More concretely, - we do not let a 'let Flags' span across several headline comments. When all 'def's in two consecutive headlines share the same flags, we stil close and start a new 'let Flags' at the intermediate headline. - when a 'let Flags' span just one or two 'def's, set 'Flags' within the 'def's instead. - we remove nested 'let Flags'. Note that nested 'let Flags' can be quite confusing, especially when the outer was started long before the inner. Moving a 'def' out of the inner 'let Flags' and setting 'Flags' within the 'def' will not have the intended effect, as those flags will be overridden by the outer 'let Flags'. Reviewed By: awarzynski, jansvoboda11, hans Differential Revision: https://reviews.llvm.org/D123070	2022-04-07 20:38:51 +02:00
River Riddle	af371f9f98	Reland [GreedPatternRewriter] Preprocess constants while building worklist when not processing top down Reland Note: Adds a fix to properly mark a commutative operation as folded if we change the order of its operands. This was uncovered by the fact that we no longer re-process constants. This avoids accidentally reversing the order of constants during successive application, e.g. when running the canonicalizer. This helps reduce the number of iterations, and also avoids unnecessary changes to input IR. Fixes #51892 Differential Revision: https://reviews.llvm.org/D122692	2022-04-07 11:31:42 -07:00
Jez Ng	f004ecf6ec	[lld-macho][nfc] Remove indirection when looking up common section members {D118797} means that we can now check the name/segname of a given section directly, instead of having to look those properties up on one of its subsections. This allows us to simplify our code. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D123275	2022-04-07 14:28:52 -04:00
David Green	fa784f6382	[AArch64] Insert subvector costs An insert subvector under aarch64 can often be done as a single lane mov operation. For example a v4i8 inserted into a v16i8 is a s-reg mov, so long as the index is a multiple of 4. This teaches the cost model that, using code copied over from the X86 backend. Some of the costs (v16i16_4_0) are still high because they get matched as a SK_Select, not an SK_InsertSubvector. D120879 has some codegen tests for inserting subvectors, which I were added as llvm/test/CodeGen/AArch64/insert-subvector.ll. Differential Revision: https://reviews.llvm.org/D120880	2022-04-07 19:27:41 +01:00

1 2 3 4 5 ...

420453 Commits All Branches Search

420453 Commits

All Branches