llvm-project

Commit Graph

Author	SHA1	Message	Date
Hsiangkai Wang	5821a58d8e	[RISCV] Add inline asm constraint 'vr' and 'vm' in Clang for RISC-V 'V'. Add asm constraint 'vr' for vector registers. Add asm constraint 'vm' for vector mask registers. Differential Revision: https://reviews.llvm.org/D98616	2021-03-30 09:47:27 +08:00
Evandro Menezes	fd94cfeeb5	[RISCV] Move scheduling resources for B into a separate file (NFC) Differential Revision: https://reviews.llvm.org/D99557	2021-03-29 20:37:22 -05:00
Adrian Prantl	8573c28a51	Add debug support for set types This commit adds debugging support for set types defined in languages such as Pascal and Modula-2. Patch by Peter McKinna! Differential Revision: https://reviews.llvm.org/D76115	2021-03-29 18:04:48 -07:00
Dave Lee	50a6aa6c0f	[llvm][utils] Fix handling of llvm::None	2021-03-29 17:43:53 -07:00
David Blaikie	bd56e91fdb	Add missing dependency to fix building the jit tests	2021-03-29 17:33:08 -07:00
Thomas Lively	a1b8b0739a	[WebAssembly] Fix i8x16.popcnt opcode When I updated the SIMD opcodes in `f5764a8654`, I accidentally missed updating i8x16.popcnt. This patch fixes the omission. Differential Revision: https://reviews.llvm.org/D99536	2021-03-29 17:23:15 -07:00
Jonas Devlieghere	b19a9efbc9	[dsymutil] s/dwarfdump/llvm-dwarfdump/ in test	2021-03-29 17:14:35 -07:00
Huihui Zhang	ca721042f1	[IPO][SampleContextTracker] Use SmallVector to track context profiles to prevent non-determinism. Use SmallVector instead of SmallSet to track the context profiles mapped. Doing this can help avoid non-determinism caused by iterating over unordered containers. This bug was found with reverse iteration turning on, --extra-llvm-cmake-variables="-DLLVM_REVERSE_ITERATION=ON". Failing LLVM test profile-context-tracker-debug.ll . Reviewed By: MaskRay, wenlei Differential Revision: https://reviews.llvm.org/D99547	2021-03-29 16:37:10 -07:00
Jessica Paquette	247ff26a89	[AArch64][GlobalISel] NFC: Replace IR regbankselect test with MIR test regbank-ceil.ll -> regbank-ceil.mir The IR test was intended to only check register banks. This makes it brittle, especially as we improve load/store combines in GlobalISel. Rewriting this as a MIR test also makes it more consistent with the rest of the testcases in GlobalISel.	2021-03-29 16:32:34 -07:00
Jonas Devlieghere	e0577b3130	[dsymutil] Relocate DW_TAG_label dsymutil is not relocating the DW_AT_low_pc for a DW_TAG_label. This patch fixes that and adds a test. Differential revision: https://reviews.llvm.org/D99534	2021-03-29 15:45:48 -07:00
Jonas Devlieghere	984e2f440a	[lldb] Prints error using WithColor::error in lldb-platform	2021-03-29 15:45:33 -07:00
Greg Clayton	eee309068e	Fix .debug_aranges parsing issues. When LLVM error handling was introduced to the parsing of the .debug_aranges it would cause major issues if any DWARFDebugArangeSet::extract() calls returned any errors. The code in DWARFDebugInfo::GetCompileUnitAranges() would end up calling DWARFDebugAranges::extract() which would return an error if _any_ DWARFDebugArangeSet had any errors, but it default constructed a DWARFDebugAranges object into DWARFDebugInfo::m_cu_aranges_up and populated it partially, and returned an error prior to finishing much needed functionality in the DWARFDebugInfo::GetCompileUnitAranges() function. Subsequent callers to this function would see that the DWARFDebugInfo::m_cu_aranges_up was actually valid and return this partially populated DWARFDebugAranges reference _and_ it would not be sorted or minimized. This above bugs would cause an incomplete .debug_aranges parsing, it would skip manually parsing any compile units for ranges, and would not sort the DWARFDebugAranges in m_cu_aranges_up. This bug would also cause breakpoints set by file and line to fail to set correctly if a symbol context for an address could not be resolved properly, which the incomplete and unsorted DWARFDebugAranges object that DWARFDebugInfo::GetCompileUnitAranges() returned would cause symbol context lookups resolved by address (breakpoint address) to fail to find any DWARF debug info for a given address. This patch fixes all of the issues that I found: - DWARFDebugInfo::GetCompileUnitAranges() no longer returns a "llvm::Expected<DWARFDebugAranges &>", but just returns a "const DWARFDebugAranges &". Why? Because this code contained a fallback that would parse all of the valid DWARFDebugArangeSet objects, and would check which compile units had valid .debug_aranges set entries, and manually build an address ranges table using DWARFUnit::BuildAddressRangeTable(). If we return an error because any DWARFDebugArangeSet has any errors, then we don't do any of this code. Now we parse all DWARFDebugArangeSet objects that have no errors, if any calls to DWARFDebugArangeSet::extract() return errors, we skip that DWARFDebugArangeSet so that we can use the fallback call to DWARFUnit::BuildAddressRangeTable(). Since DWARFDebugInfo::GetCompileUnitAranges() needs to parse what it can from the .debug_aranges and build address ranges tables for any compile units that don't have any .debug_aranges sets, everything now works as expected. - Fix an issue where a DWARFDebugArangeSet contains multiple terminator entries. The LLVM parser and llvm-dwarfdump properly warn about this because it happens with linux compilers and linkers and was the original cause of the bug I am fixing here. We now correctly warn about this issue if "log enable dwarf info" is enabled, but we continue to parse the DWARFDebugArangeSet correctly so we don't lose data that is contained in the .debug_aranges section. - DWARFDebugAranges::extract() no longer returns a llvm::Error because we need to be able to parse all of the valid DWARFDebugArangeSet objects. It also will correctly skip a DWARFDebugArangeSet object that has errors in the middle of the stream by setting the start offsets of each DWARFDebugArangeSet to be calculated by the previous DWARFDebugArangeSet::extract() calculated offset that uses the header which contains the length of the DWARFDebugArangeSet. This means if do we run into real errors while parsing individual DWARFDebugArangeSet objects, we can continue to parse the rest of the validly encoded DWARFDebugArangeSet objects in the .debug_aranges section. This will allow LLDB to parse DWARF that contains a possibly newer .debug_aranges set format than LLDB currently supports because we will error out for the parsing of the DWARFDebugArangeSet, but be able to skip to the next DWARFDebugArangeSet object using the "DWARFDebugArangeSet.m_header.length" field to calculate the next starting offset. Tests were added to cover all new functionality. Differential Revision: https://reviews.llvm.org/D99401	2021-03-29 15:34:36 -07:00
LLVM GN Syncbot	b75018e305	[gn build] Port `5178ffc7cf`	2021-03-29 22:12:00 +00:00
Gulfem Savrun Yeniceri	5178ffc7cf	[Passes] Add relative lookup table converter pass Lookup tables generate non PIC-friendly code, which requires dynamic relocation as described in: https://bugs.llvm.org/show_bug.cgi?id=45244 This patch adds a new pass that converts lookup tables to relative lookup tables to make them PIC-friendly. Differential Revision: https://reviews.llvm.org/D94355	2021-03-29 21:53:32 +00:00
Florian Hahn	482283042f	[AArch64] Remove custom zext/sext legalization code. Currently performExtendCombine assumes that the src-element bitwidth * 2 is a valid MVT. But this is not the case for i1 and it causes a crash on the v64i1 test cases added in this patch. It turns out that this code appears to not be needed; the same patterns are handled by other code and we end up with the same results, even without the custom lowering. I also added additional test cases in `a50037aaa6`. Let's just remove the unneeded code. Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D99437	2021-03-29 22:22:05 +01:00
Jonas Devlieghere	047cbfe2bb	[lldb] Print stack trace when lldb-vscode crashes Print LLVM's pretty stack trace when lldb-vscode crashes. Also removes the unnecessary call to PrintStackTraceOnErrorSignal in lldb-server as it's already part of InitLLVM. Differential revision: https://reviews.llvm.org/D99535	2021-03-29 14:20:59 -07:00
Nikita Popov	7669455df4	[X86][FastISel] Fix with.overflow eflags clobber (PR49587) If the successor block has a phi node, then additional moves may be inserted into predecessors, which may clobber eflags. Don't try to fold the with.overflow result into the branch in that case. This is done by explicitly checking for any phis in successor blocks, not sure if there's some more principled way to address this. Other fused compare and branch patterns avoid the issue by emitting the comparison when handling the branch, so that no instructions may be inserted in between. In this case, the with.overflow call is emitted separately (and I don't think this is avoidable, as it will generally have at least two users). Fixes https://bugs.llvm.org/show_bug.cgi?id=49587. Differential Revision: https://reviews.llvm.org/D98600	2021-03-29 23:08:47 +02:00
Fanbo Meng	bd8dd580ff	[NFC] clang-formatting zos-alignment.c Reviewed By: abhina.sreeskantharajan Differential Revision: https://reviews.llvm.org/D99514	2021-03-29 16:48:10 -04:00
Fangrui Song	1daa48f005	[lsan] realloc: don't deallocate if requested size is too large This is the behavior required by the standards. Differential Revision: https://reviews.llvm.org/D99480	2021-03-29 13:35:10 -07:00
Petr Hosek	188592ff08	Revert "[CMake] Use write_basic_package_version_file for LLVM" This reverts commit `3001d080c8` which seems to have introduced a race condition that's failing the build in some cases.	2021-03-29 13:07:39 -07:00
MaheshRavishankar	c4d5b95617	Fix broken build for commit `9b0517035f` Differential Revision: https://reviews.llvm.org/D99533	2021-03-29 12:48:45 -07:00
Nico Weber	f53dc06ed3	fix comment typo to cycle bots	2021-03-29 15:47:16 -04:00
Stanislav Mekhanoshin	619b88849e	[AMDGPU] Fix "Sequence" spelling. NFC.	2021-03-29 12:11:36 -07:00
Samuel	24339056c8	[llvm-reduce] Remove dso_local when possible Add a new delta pass to llvm-reduce that removes dso_local when possible Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D98673	2021-03-29 12:00:10 -07:00
Petr Hosek	bc4d3ca7bd	[libcxx] Use integer division In Python 3, math.floor returns int when both arguments are ints. In Python 2, math.floor returns float. This leads to a failure because the result of math.floor is used as an array index. While Python 2 is on its way out, it's still used in some places so use an integer division instead. Differential Revision: https://reviews.llvm.org/D99520	2021-03-29 11:59:44 -07:00
Joe Nash	45fd7c02af	Revert "[AMDGPU] Mark additional VOP3 as commutable" This reverts commit `d35d8da7d6`.	2021-03-29 14:48:11 -04:00
Nico Weber	221388f451	fix comment typo to cycle bots	2021-03-29 14:50:17 -04:00
Fangrui Song	59e422c90b	[lsan][test] Add malloc(0) and realloc(p, 0) tests	2021-03-29 11:41:07 -07:00
MaheshRavishankar	9b0517035f	[mlir] Enhance InferShapedTypeOpInterface and move LinalgOps to use them. A new `InterfaceMethod` is added to `InferShapedTypeOpInterface` that allows an operation to return the `Value`s for each dim of its results. It is intended for the case where the `Value` returned for each dim is computed using the operands and operation attributes. This interface method is for cases where the result dim of an operation can be computed independently, and it avoids the need to aggregate all dims of a result into a single shape value. This also implies that this is not suitable for cases where the result type is unranked (for which the existing interface methods is to be used). Also added is a canonicalization pattern that uses this interface and resolves the shapes of the output in terms of the shapes of the inputs. Moving Linalg ops to use this interface, so that many canonicalization patterns implemented for individual linalg ops to achieve the same result can be removed in favor of the added canonicalization pattern. Differential Revision: https://reviews.llvm.org/D97887	2021-03-29 11:39:48 -07:00
Nico Weber	742f663705	fix comment typo to cycle bots	2021-03-29 14:35:57 -04:00
Stella Laurenzo	4ca39dad52	NFC: Update MLIR python bindings docs to install deps via requirements.txt. * Also adds some verbiage about upgrading `pip` itself, since this is a common source of issues. Differential Revision: https://reviews.llvm.org/D99522	2021-03-29 18:32:51 +00:00
Joe Nash	d35d8da7d6	[AMDGPU] Mark additional VOP3 as commutable Note, only src0 and src1 will be commuted if the isCommutable flag is set. This patch does not change that, it just makes it possible to commute src0 and src1 of more instructions. Reviewed By: foad, rampitec Differential Revision: https://reviews.llvm.org/D99376 Change-Id: I61e20490962d95ea429beb355c55f55c024dafdc	2021-03-29 14:22:20 -04:00
Jez Ng	a43f588e01	[lld-macho] Implement -segprot Addresses llvm.org/PR49405. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D99389	2021-03-29 14:08:12 -04:00
Florian Hahn	a50037aaa6	[AArch64] Add a few more vector extension tests.	2021-03-29 18:56:00 +01:00
Raphael Isemann	10d02fb15b	[lldb][NFC] Fix -Wdocumentation issue in ModuleSpec.h/ThreadTrace.h	2021-03-29 19:47:29 +02:00
Raphael Isemann	32f252a765	[lldb][NFC] Fix -Wdocumentation issue in ProcessMinidump	2021-03-29 19:40:41 +02:00
Roger Ferrer Ibanez	489ca73ac4	[PrologEpilogInserter][AMDGPU] Only adjust offset for emergency spill slots if the stack grows down D89239 adjusts the stack offset of emergency spill slots for overaligned stacks. However the adjustment is not valid for targets whose stack grows up (such as AMDGPU). This change makes the adjustment conditional only to those targets whose stack grows down. Fixes https://bugs.llvm.org/show_bug.cgi?id=49686 Differential Revision: https://reviews.llvm.org/D99504	2021-03-29 17:26:58 +00:00
Craig Topper	3dd4aa7d09	[RISCV] When custom iseling masked loads/stores, copy the mask into V0 instead of virtual register. This matches what we do in our isel patterns. In our internal testing we've found this is needed to make the fast register allocator happy at -O0. Otherwise it may assign V0 to an earlier operand and find itself with no registers left when it reaches the mask operand. By using V0 explicitly, the fast register allocator will see it when it checks for phys register usages before it starts allocating vregs. I'll try to update this with a test case. Unfortunately, this does appear to prevent some instruction reordering by the pre-RA scheduler which leads to the increased spills seen in some tests. I suspect that problem could already occur for other instructions that already used V0 directly. There's a lot of repeated code here that could do with some wrapper functions. Not sure if that should be at the level of the new code that deals with V0. That would require multiple output parameters to pass the glue, chain and register back. Maybe it should be at a higher level over the entire set of push_backs. Reviewed By: frasercrmck, HsiangKai Differential Revision: https://reviews.llvm.org/D99367	2021-03-29 10:20:43 -07:00
Peter Steinfeld	a7afc8a514	[flang] Fix CHECK() calls on erroneous procedure declarations When writing tests for a previous problem, I ran across situations where the compiler was failing calls to CHECK(). In these situations, the compiler had inconsistent semantic information because the programs were erroneous. This inconsistent information was causing the calls to CHECK(). I fixed this by avoiding the code that ended up making the failed calls to CHECK() and making sure that we were only avoiding these situations when the associated symbols were erroneous. I also added tests that would cause the calls to CHECK() without these changes. Differential Revision: https://reviews.llvm.org/D99342	2021-03-29 10:12:35 -07:00
Craig Topper	54bacaf311	[X86] Always use rip-relative addressing on 64-bit when rematerializing all zeros/ones registers using a folded load. Previously we only used RIP relative when PIC was enabled. But we know we're in small/kernel code model here so we should be able to always use RIP-relative which will give a smaller encoding. Here's a godbolt link that demonstrates the current codegen https://godbolt.org/z/j3158o Note in the non-PIC version the load from .LCPI0_0 doesn't use RIP-relative addressing, but if you change the constant in the source from 0.0 to 1.0 it will become RIP-relative. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D97208	2021-03-29 10:06:17 -07:00
Roger Ferrer Ibanez	ef76a333fa	[RISCV] Fix offset computation for RVV In D97111 we changed the RVV frame layout when using sp or bp to address the stack slots so we could address the emergency stack slot. The idea is to put the RVV objects as far as possible (in offset terms) from the frame reference register (sp / fp / bp). When using fp this happens naturally because the RVV objects are already the top of the stack and due to the constraints of RVV (VLENB being a power of two >= 128) the stack remains aligned. The rest of this summary does not apply to this case. When using sp / bp we need to skip the non-RVV stack slots. The size of the the non-RVV objects is computed subtracting the callee saved register size (whose computation is added in D97111 itself) to the total size of the stack (which does not account for RVV stack slots). However, when doing so we round to 16 bytes when computing that size and we end emitting a smaller offset that may belong to a scalar stack slot (see D98801). So this change removes that rounding. Also, because we want the RVV objects be between the non-RVV stack slots and the callee-saved register slots, we need to make sure the RVV objects are properly aligned to 8 bytes. Adding a padding of 8 would render the stack unaligned. So when allocating space for RVV (only when we don't use fp) we need to have extra padding that preserves the stack alignment. This way we can round to 8 bytes the offset that skips the non-RVV objects and we do not misalign the whole stack in the way. In some circumstances this means that the RVV objects may have padding before (=lower offsets from sp/bp) and after (before the CSR stack slots). Differential Revision: https://reviews.llvm.org/D98802	2021-03-29 17:03:49 +00:00
Roger Ferrer Ibanez	3abd0bacc2	[NFC][RISCV] Add test showing wrong stack slot for GPR and RVV spilled registers This testcase shows that we attempt to assign the same offset sp + 16 to two different stack objects. The fix will come in a later change. Differential Revision: https://reviews.llvm.org/D98801	2021-03-29 17:03:18 +00:00
Roger Ferrer Ibanez	96d14ff505	[NFC][RISCV] Pass file through update_llc_tests to fix whitespace issues While addressing RVV frame layout issues I found this file had whitespace differences that made diffs noisier than they should be. Differential Revision: https://reviews.llvm.org/D98800	2021-03-29 17:02:47 +00:00
Wenlei He	30b0232336	[CSSPGO][llvm-profgen] Context-sensitive global pre-inliner This change sets up a framework in llvm-profgen to estimate inline decision and adjust context-sensitive profile based on that. We call it a global pre-inliner in llvm-profgen. It will serve two purposes: 1) Since context profile for not inlined context will be merged into base profile, if we estimate a context will not be inlined, we can merge the context profile in the output to save profile size. 2) For thinLTO, when a context involving functions from different modules is not inined, we can't merge functions profiles across modules, leading to suboptimal post-inline count quality. By estimating some inline decisions, we would be able to adjust/merge context profiles beforehand as a mitigation. Compiler inline heuristic uses inline cost which is not available in llvm-profgen. But since inline cost is closely related to size, we could get an estimate through function size from debug info. Because the size we have in llvm-profgen is the final size, it could also be more accurate than the inline cost estimation in the compiler. This change only has the framework, with a few TODOs left for follow up patches for a complete implementation: 1) We need to retrieve size for funciton//inlinee from debug info for inlining estimation. Currently we use number of samples in a profile as place holder for size estimation. 2) Currently the thresholds are using the values used by sample loader inliner. But they need to be tuned since the size here is fully optimized machine code size, instead of inline cost based on not yet fully optimized IR. Differential Revision: https://reviews.llvm.org/D99146	2021-03-29 09:46:14 -07:00
Florian Hahn	d3ff65dc11	[Clang] Fix line numbers in CHECK lines.	2021-03-29 17:37:48 +01:00
Wei Mi	3cbf44190b	[SampleFDO] Do not scale the magic number NOMORE_ICP_MAGICNUM in value profile during profile update. When we inline a function and update the profile, the value profiles of the indirect call in the inliner and inlinee will be scaled. In https://reviews.llvm.org/D96806 and https://reviews.llvm.org/D97350, we start using the magic number NOMORE_ICP_MAGICNUM (-1) to mark targets which have been promoted. The magic number shouldn't be scaled during the profile update. Although the problem has been suppressed by https://reviews.llvm.org/D98187 for SampleFDO, which stops profile update for inlining in sampleFDO, the patch is still wanted since it will be more consistent to handle the magic number properly in profile update. Differential Revision: https://reviews.llvm.org/D99394	2021-03-29 09:34:37 -07:00
Florian Hahn	9320ac9b49	[Clang] Only run test when X86 backend is built. After `c773d0f973` the remark is only emitted if the loop is profitable to vectorize, but cannot be vectorized. Hence, it depends on X86-specific cost-modeling.	2021-03-29 17:27:01 +01:00
Jonas Devlieghere	bf8cbfa65f	[lldb] Move UpdateISAToDescriptorMap into ClassInfoExtractor (NFC) Move UpdateISAToDescriptorMap into ClassInfoExtractor so that all the formerly public functions can be private and remain an implementation detail of the extractor. Differential revision: https://reviews.llvm.org/D99448	2021-03-29 09:23:44 -07:00
Joseph Huber	29338459fb	[OpenMP] Trim error messages in CUDA plugin Summary: Remove some of the error messages printed when the CUDA plugin fails. The current error messages can be confusing because they are the first error messages printed after the async stream finds an error. This means that the printed values aren't related to what caused the issue, but are simply the last asyncronous operation that succeeded on the device. Remove these as they can be misleading. Reviewers: jdoerfert Differential Revision: https://reviews.llvm.org/D99510	2021-03-29 12:20:19 -04:00
MaheshRavishankar	f0a2fe7f79	[mlir][Linalg] Rewrite SubTensors that take a slice out of a unit-extend dimension. Subtensor operations that are taking a slice out of a tensor that is unit-extent along a dimension can be rewritten to drop that dimension. Differential Revision: https://reviews.llvm.org/D99226	2021-03-29 09:19:36 -07:00

1 2 3 4 5 ...

384091 Commits All Branches Search

384091 Commits

All Branches