llvm-project

Commit Graph

Author	SHA1	Message	Date
NAKAMURA Takumi	65ab6b495a	[bazel] Unset REVISION as if LLVM_APPEND_VC_REV=OFF, for now. We could implement retrieving the revision here, but we may avoid "Just the same but only different revision hash string".	2022-05-26 06:24:49 +09:00
NAKAMURA Takumi	801ac2ebf1	[bazel] Bump to 15.0.0git	2022-05-26 06:24:49 +09:00
Snehasish Kumar	ec51971eae	[memprof] Keep and display symbol names in the RawMemProfReader. Extend the Frame struct to hold the symbol name if requested when a RawMemProfReader object is constructed. This change updates the tests and removes the need to pass --debug to obtain the mapping from GUID to symbol names. Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D126344	2022-05-25 21:17:44 +00:00
Alexey Bataev	10f41a2147	[SLP]Fix PR55688: Miscompile due to incorrect nuw/nsw handling. Need to use all ReductionOps when propagating flags for the reduction ops, otherwise transformation is not correct. Plus, need to drop nuw/nsw flags. Differential Revision: https://reviews.llvm.org/D126371	2022-05-25 13:59:06 -07:00
Eric Li	5520c58390	[clang][dataflow] Fix incorrect CXXThisExpr pointee for lambdas When constructing the `Environment`, the `this` pointee is established for a `CXXMethodDecl` by looking at its parent. However, inside of lambdas, a `CXXThisExpr` refers to the captured `this` coming from the enclosing member function. When establishing the `this` pointee for a function, we check whether the function is a lambda, and check for an enclosing member function to establish the `this` pointee storage location. Differential Revision: https://reviews.llvm.org/D126413	2022-05-25 20:58:02 +00:00
Eric Li	33b598a808	[clang][dataflow] Relax assert on existence of `this` pointee storage Support for unions is incomplete (per `99f7d55e`) and the `this` pointee storage location is not set for unions. The assert in `VisitCXXThisExpr` is then guaranteed to trigger when analyzing member functions of a union. This commit changes the assert to an early-return. Any expression may be undefined, and so having a value for the `CXXThisExpr` is not a postcondition of the transfer function. Differential Revision: https://reviews.llvm.org/D126405	2022-05-25 20:58:02 +00:00
Arthur Eubanks	67e2e6e66d	[gn build] Set llvm_have_mallinfo2 to true by default If you are using an old enough glibc which doesn't have mallinfo2, set `llvm_have_mallinfo2 = false`. At this point it's likely that most people using the gn build are compiling against a recent enough glibc (glibc 2.33 which was released in Feb 2021). Reviewed By: peterwaller-arm Differential Revision: https://reviews.llvm.org/D126415	2022-05-25 13:56:05 -07:00
Maksim Panchenko	bed9efed71	[MCDisassembler] Disambiguate Size parameter in tryAddingSymbolicOperand() MCSymbolizer::tryAddingSymbolicOperand() overloaded the Size parameter to specify either the instruction size or the operand size depending on the architecture. However, for proper symbolic disassembly on X86, we need to know both sizes, as an instruction can have two operands, and the instruction size cannot be reliably calculated based on the operand offset and its size. Hence, split Size into OpSize and InstSize. For X86, the new interface allows to fix a couple of issues: * Correctly adjust the value of PC-relative operands. * Set operand size to zero when the operand is specified implicitly. Differential Revision: https://reviews.llvm.org/D126101	2022-05-25 13:44:32 -07:00
zr33	e51a6b7374	[BOLT][DWARF] Convert dwarf5-df-* tests to assembly tests Reviewed By: ayermolo Differential Revision: https://reviews.llvm.org/D126086	2022-05-25 13:41:18 -07:00
Patrick Walton	256a52d9aa	Round up zero-sized symbols to 1 byte in `.debug_aranges`. This commit modifies the AsmPrinter to avoid emitting any zero-sized symbols to the .debug_aranges table, by rounding their size up to 1. Entries with zero length violate the DWARF 5 spec, which states: > Each descriptor is a triple consisting of a segment selector, the beginning > address within that segment of a range of text or data covered by some entry > owned by the corresponding compilation unit, followed by the non-zero length > of that range. In practice, these zero-sized entries produce annoying warnings in lld and cause GNU binutils to truncate the table when parsing it. Other parts of LLVM, such as DWARFDebugARanges in the DebugInfo module (specifically the appendRange method), already avoid emitting zero-sized symbols to .debug_aranges, but not comprehensively in the AsmPrinter. In fact, the AsmPrinter does try to avoid emitting such zero-sized symbols when labels aren't involved, but doesn't when the symbol to emitted is a difference of two labels; this patch extends that logic to handle the case in which the symbol is defined via labels. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D126257	2022-05-25 13:31:36 -07:00
Joseph Huber	b7c8c4d8cf	[Clang] Introduce `--offload-link` option to perform offload device linking The new driver uses an augmented linker wrapper to perform the device linking phase, but to the user looks like a regular linker invocation. Contrary to the old driver, the new driver contains all the information necessary to produce a linked device image in the host object itself. Currently, we infer the usage of the device linker by the user specifying an offloading toolchain, e.g. (--offload-arch=...) or (-fopenmp-targets=...), but this shouldn't be strictly necessary. This patch introduces a new option `--offload-link` to tell the driver to use the offloading linker instead. So a compilation flow can now look like this, ``` clang foo.cu --offload-new-driver -fgpu-rdc --offload-arch=sm_70 -c clang foo.o --offload-link -lcudart ``` I was considering if this could be merged into the `-fuse-ld` option, but because the device linker wraps over the users linker it would conflict with that. In the future it's possible to merge this into `lld` completely or `gold` via a plugin and we would use this option to enable the device linking feature. Let me know what you think for this. Reviewed By: tra Differential Revision: https://reviews.llvm.org/D126398	2022-05-25 16:30:53 -04:00
Matthias Springer	ab249fd87d	[mlir][bufferization][NFC] Remove dead code There were two copies of AlwaysCopyAnalysisState. (Must have been a merge conflict mistake...) Differential Revision: https://reviews.llvm.org/D126414	2022-05-25 22:26:00 +02:00
Zequan Wu	5c9f3ec4ad	[LLDB][NativePDB] Check for missing type info to avoid crash. NativePDB often assumes that all debug info are available. This is one step to make it more pervasive. Differential Revision: https://reviews.llvm.org/D125844	2022-05-25 13:19:27 -07:00
Joe Nash	835e09c4c3	[AMDGPU] gfx11 FLAT Instructions MachineCode Support for FLAT type instructions Contributors: Sebastian Neubauer <sebastian.neubauer@amd.com> Patch 12/N for upstreaming of AMDGPU gfx11 architecture. Depends on D125989 Reviewed By: rampitec, #amdgpu Differential Revision: https://reviews.llvm.org/D125992	2022-05-25 15:29:39 -04:00
Thomas Preud'homme	8e757c6b50	Fix conversion error to Expected On Ubuntu 18.04 with GCC 7.5 Intel trace code fails to build due to failure to convert from lldb_private::process_linux::IntelPTPerThreadProcessTraceUP to Expected<lldb_private::process_linux::IntelPTPerThreadProcessTraceUP>. This commit explicitely marks those unique_ptr values as being moved which fixes the conversion error. Reviewed By: wallace Differential Revision: https://reviews.llvm.org/D126402	2022-05-25 20:51:14 +01:00
Alex Lorenz	79e09af1d6	[clang] Fix the begin location of concepts specialization expression The concept specialization expression should start at the location of the nested qualifiers when it has nested qualifiers. This ensures that libclang reports correct source ranges that include all subexpressions when visiting the expression. Differential Revision: https://reviews.llvm.org/D126332	2022-05-25 12:39:21 -07:00
Chia-hung Duan	c088fbe7de	[mlir][PDLL] Allow numeric result indexing for unregistered op If we don't specify the result index while matching operand with the result of certain operation, it's supposed to match all the results of the operation with the operand. For registered op, it's easy to do that by either indexing with number or name. For unregistered op, this commit enables the numeric result indexing for this use case. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D126330	2022-05-25 19:29:29 +00:00
Joe Nash	ef1ea5ac01	[AMDGPU] gfx11 vinterp instructions MC support A new instruction encoding. Some of these instructions were previously VOP3 encoded. Contributors: Carl Ritson <carl.ritson@amd.com> Patch 11/N for upstreaming of AMDGPU gfx11 architecture. Depends on D125824 Reviewed By: critson Differential Revision: https://reviews.llvm.org/D125989	2022-05-25 14:59:16 -04:00
Joe Nash	1a51ab766f	[AMDGPU] gfx11 export instructions Contributors: Jay Foad <jay.foad@amd.com> Dmitry Preobrazhensky <d-pre@mail.ru> Patch 10/N for upstreaming of AMDGPU gfx11 architecture. Depends on D125822 Reviewed By: dp Differential Revision: https://reviews.llvm.org/D125824	2022-05-25 14:44:09 -04:00
Christian Sigg	c4bc416418	[LLVM] Add rcp.approx.ftz.f32 intrinsic Split out from https://reviews.llvm.org/D126158. Reviewed By: tra Differential Revision: https://reviews.llvm.org/D126369	2022-05-25 21:05:20 +02:00
Paul Kirth	7f5439945b	[compiler-rt][test] Fix flake in symbolize_stack test Addresses tests flakes described in https://github.com/llvm/llvm-project/issues/55460 The test being updated can fail in FileCheck to match when given long enough stack traces. This can be problematic when file system paths become long enough to cause the majority of the long function name to become truncated. We found in our CI that the truncated output would often fail to match, thereby causing the test to fail when it should not. Here we change the test to match on sybolizer output that should be more reliable than matching inside the long function name. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D126102	2022-05-25 19:02:30 +00:00
Peter Klausler	bbad981dab	[flang] Address regression (calls to assumed-length character function dummy procedures) A recent fix beefed up semantics checking to catch the case of a call to an external assumed-length character function; this check has false positives in the case of an assumed-length character function that is a dummy procedure. These do have a length that is passed in extra compiler-created arguments. This patch refines the check and undoes some changes to tests. Differential Revision: https://reviews.llvm.org/D126390	2022-05-25 12:00:07 -07:00
James Y Knight	997b072e10	C++ DR2394: Const-default-constructible for members. Const class members may be initialized with a defaulted default constructor under the same conditions it would be allowed for a const object elsewhere. Differential Revision: https://reviews.llvm.org/D126170	2022-05-25 14:20:11 -04:00
Philip Reames	1f06398e96	Reapply "[RISCV] Enable strict assertions in InsertVSETVLI data flow" be2cb8 fixes the case which triggered the revert. Reapply, and let's see if anything else falls out. Original commit message: These asserts are believed to hold after several recent miscompiles have been fixed. If you see an assertion failure on this change, please toggle the default back and make sure you file a bug with a reproducer. We may have as yet uncaught miscompiles lurking in this code. Differential Revision: https://reviews.llvm.org/D125271	2022-05-25 11:18:55 -07:00
Shoaib Meenai	4baae166ce	[pseudo] Fix pseudo-gen usage when cross-compiling Use the LLVM build system's cross-compilation support for the tool, so that the build works for both host and cross-compilation scenarios. Reviewed By: sammccall Differential Revision: https://reviews.llvm.org/D126397	2022-05-25 11:08:21 -07:00
Daniel Grumberg	504736cedf	[clang][extract-api] Don't emit symbols prefixed with an underscore These symbols are understood to not be used for client API consumption by convention so they should not appear in the generated symbol graph. Differential Revision: https://reviews.llvm.org/D125678	2022-05-25 19:02:18 +01:00
Krzysztof Parzyszek	23bb550eeb	DWARFVerifier: Change vector of IntervalMap to vector of unique_ptrs This is a workaround for compilation issue on FreeBSD. See comments in https://reviews.llvm.org/rG0d8cb8b399ad for more information. This fixes https://github.com/llvm/llvm-project/issues/55414. Differential Revision: https://reviews.llvm.org/D125611	2022-05-25 10:59:46 -07:00
Matthias Springer	0ee1c0388c	[mlir][bufferize] Remove hoisting functionality from One-Shot Bufferize The same functionality is already provided by `-buffer-hoisting` and `-buffer-loop-hoisting`. Differential Revision: https://reviews.llvm.org/D126251	2022-05-25 19:56:18 +02:00
Sanjay Patel	d3187dd5f0	[SLP] add minimum test for miscompile (PR55688); NFC	2022-05-25 13:52:47 -04:00
Philip Reames	be2cb824d0	[riscv] Remove mutation of prior vsetvli from insertion dataflow This moves mutation entirely out of the main algorithm. The immediate trigger is that we hit another case of the same issue I thought we'd fixed in `72925d9`. It turns out we hadn't considered the cross block case. As a brief summary, the issue being fixed is that if we mutate a previous vsetvli in phase 3, there's a possibility that some later use of that vsetvli changes "compatibility". In the cross_block_mutate test, this later vsetvli occurs in another block (and is thus visit order dependent too!). This causes us to fail strict asserts. (To be explicit, the current on by default workaround should compensate. It's only when we turn that off that we have problems.) Now, I want to explicitly call out an alternate workaround. We could leave the mutation in phase 3, and simplify restrict it to the case where the previous vsetvli's GPR result is unused. That covers the case we've actually seen. (I'll note that codegen regressions with a simple form of this were significant. We might have to check specifically for the use outside block case to keep them reasonable, which complicates the workaround slightly.) Personally, I'm at the point where I want the mutation pulled out just for robustness sake. I'm worried there's yet one more form of this bug we haven't thought about. The other motivation for this change is that it does give us a couple of minor codegen wins. None appear to be hugely significant, but improvements never hurt right? Differential Revision: https://reviews.llvm.org/D125270	2022-05-25 10:51:14 -07:00
Aaron Ballman	9368bf9023	Removing this as part of the revert done in `69da3b6aea` This appears to have been added in a follow-up commit that I missed.	2022-05-25 13:45:17 -04:00
Aaron Ballman	69da3b6aea	Revert "[OpenMP] atomic compare fail : Parser & AST support" This reverts commit `232bf8189e`. It broke the sanitize buildbot: https://lab.llvm.org/buildbot/#/builders/5/builds/24074 It also reproduces on Windows debug builds as a crash.	2022-05-25 13:34:34 -04:00
Mike Rice	ba3f85390b	[OpenMP] Add diagnostic for unterminated 'omp [begin] declare target' Warns when end-of-file is reached without seeing all matching 'omp end declare target' directives. The diagnostic shows the location of the related begin directive. Differential Revision: https://reviews.llvm.org/D126331	2022-05-25 10:34:07 -07:00
Joseph Huber	8a1984c25e	[Clang][Docs] Document `-Xoffload-linker` flag Summary: I added the `-Xoffload-linker` flag and did not provide additional documentation. This patch adds it.	2022-05-25 13:33:10 -04:00
Nicolai Hähnle	affa1b1cc5	AMDGPU/GISel: Factor out AMDGPURegisterBankInfo::buildReadFirstLane A later change will add a 3rd user, so factoring out the common code seems useful. Reorganizing the executeInWaterfallLoop causes some more COPYs to be generated, but those all fold away during instruction selection. Generating the comparisons uses generic instructions over machine instructions now which admittedly shouldn't make a difference (though it should make it easier to move the waterfall loop generation to another place). (Resubmit with missing test added.) Differential Revision: https://reviews.llvm.org/D125324	2022-05-25 12:14:01 -05:00
Nicolai Hähnle	afc90101a5	Revert "AMDGPU/GISel: Factor out AMDGPURegisterBankInfo::buildReadFirstLane" This reverts commit `2a28467e53`.	2022-05-25 12:03:23 -05:00
Zequan Wu	a648724921	Reland "[llvm-pdbutil] Add options to only dump symbol record at specified offset and its parents or children with spcified depth." This reverts commit `cfb4e78252`.	2022-05-25 09:57:35 -07:00
Kazu Hirata	4391625255	[lldb] Fix an unused function warning This patch fixes: .../llvm-project/lldb/source/Host/common/PseudoTerminal.cpp:106:20: error: unused function 'use_ptsname' [-Werror,-Wunused-function]	2022-05-25 09:49:05 -07:00
Logan Chien	0c8fdd7230	[mlir] Fix Tensor_InsertSliceOp description This commit fixes `Tensor_InsertSliceOp` `sizes` inputs/attributes description. Before this commit, the description says the `sizes` inputs/attributes denote the size of the return type. But according to the `InsertSliceOpConstantArgumentFolder` in `lib/Dialect/Tensor/IR/TensorOps.cpp`, the `sizes` inputs/attributes actually denote the size of the source type. I had an off-line discussion with the authors of `TensorOps.td` and `TensorOps.cpp`. We concluded that it was a typo in the Op description. This commit updates the Op description to match the actual usage. Differential Revision: https://reviews.llvm.org/D126264	2022-05-25 09:38:06 -07:00
Nicolai Hähnle	2a28467e53	AMDGPU/GISel: Factor out AMDGPURegisterBankInfo::buildReadFirstLane A later change will add a 3rd user, so factoring out the common code seems useful. Reorganizing the executeInWaterfallLoop causes some more COPYs to be generated, but those all fold away during instruction selection. Generating the comparisons uses generic instructions over machine instructions now which admittedly shouldn't make a difference (though it should make it easier to move the waterfall loop generation to another place). Differential Revision: https://reviews.llvm.org/D125324	2022-05-25 11:35:02 -05:00
Takafumi Arakaki	18e6b8234a	Allow pointer types for atomicrmw xchg This adds support for pointer types for `atomic xchg` and let us write instructions such as `atomicrmw xchg i64** %0, i64* %1 seq_cst`. This is similar to the patch for allowing atomicrmw xchg on floating point types: https://reviews.llvm.org/D52416. Differential Revision: https://reviews.llvm.org/D124728	2022-05-25 16:20:26 +00:00
Craig Topper	172149e98c	[RISCV] Preserve fast math flags in lowerVPOp. Update test to check MIR after finalize-isel instead of debug output. This is of course not the only place we should preserve FMF, but it's the most obvious one. Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D126306	2022-05-25 09:16:07 -07:00
Craig Topper	06fee478d2	[X86] Add isSimple check to the load combine in combineExtractVectorElt. I think we need to be sure the load isn't volatile before we duplicate and shrink it. Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D126353	2022-05-25 09:11:11 -07:00
Anubhab Ghosh	9da89651a8	[llvm-objcopy][ObjectYAML][mips] Add MIPS specific ELF section indexes This fixes https://github.com/llvm/llvm-project/issues/53998 and displays correct information in obj2yaml for SHN_MIPS_* sections according to https://refspecs.linuxfoundation.org/elf/mipsabi.pdf Reviewed By: jhenderson, MaskRay Differential Revision: https://reviews.llvm.org/D123902	2022-05-25 09:01:12 -07:00
Martin Boehme	6f4644d194	[clang] Don't parse MS attributes in `ParseExportDeclaration()`. As @rsmith commented on https://reviews.llvm.org/D111548: "That looks like it's simply a bug as far as I can tell, and that call can be removed. MS attributes will be parsed as part of the decl specifier sequence as needed and don't need to be parsed as declaration attributes." Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D126062	2022-05-25 17:15:41 +02:00
Philip Reames	2a3b6f2cba	[RISCV] Hoist VSETVLI vlmax, vtype out of scalable loops This is a straight forward extension of the PRE transform introduced in D124869 to handle the VLMAX case. The test changes here look quite positive. This surprised me until I realized that all the tests are using @llvm.vscale to figure out the VLMAX, not the llvm.riscv.vsetvlmax intrinsic. If they'd used the later, these would have been full redundancy cases and fully handled by the data flow. I'm not really sure if use of vscale here is representative or not. If it is, we should probably look at using VSETVLI to lower vscale rather than a raw read of vlenb and some math. Differential Revision: https://reviews.llvm.org/D126338	2022-05-25 08:00:27 -07:00
Philip Reames	dd336b6891	[RISCV] Restructure comment and add clarifying assert to getFrameIndexReference [NFC] Differential Revision: https://reviews.llvm.org/D126088	2022-05-25 07:59:27 -07:00
Lei Zhang	e0ea1fc6f8	[mlir][spirv] Fix capability check for 64-bit element types Using 64-bit integer/float type in interface storage classes would require Int64/Float64 capability, per the Vulkan spec: ``` shaderInt64 specifies whether 64-bit integers (signed and unsigned) are supported in shader code. If this feature is not enabled, 64-bit integer types must not be used in shader code. This also specifies whether shader modules can declare the Int64 capability. Declaring and using 64-bit integers is enabled for all storage classes that SPIR-V allows with the Int64 capability. ``` This is different from, say, 16-bit element types, where: ``` shaderInt16 specifies whether 16-bit integers (signed and unsigned) are supported in shader code. If this feature is not enabled, 16-bit integer types must not be used in shader code. This also specifies whether shader modules can declare the Int16 capability. However, this only enables a subset of the storage classes that SPIR-V allows for the Int16 SPIR-V capability: Declaring and using 16-bit integers in the Private, Workgroup (for non-Block variables), and Function storage classes is enabled, while declaring them in the interface storage classes (e.g., UniformConstant, Uniform, StorageBuffer, Input, Output, and PushConstant) is not enabled. ``` Reviewed By: hanchung Differential Revision: https://reviews.llvm.org/D126256	2022-05-25 10:57:31 -04:00
Lei Zhang	413fbb045d	[mlir][scf] Retain existing attributes in scf.for transforms These attributes can carry useful information, e.g., pipelines might use them to organize and chain patterns. Reviewed By: hanchung Differential Revision: https://reviews.llvm.org/D126320	2022-05-25 10:53:02 -04:00
Joe Loser	5a2dbe49be	[libc++][test] Verify std::ranges::rbegin, crbegin, rend, crend are CPOs Uncomment the tests to ensure `std::ranges::rbegin` and friends are indeed customization points objects. Differential Revision: https://reviews.llvm.org/D126325	2022-05-25 08:48:22 -06:00

... 4 5 6 7 8 ...

425060 Commits All Branches Search

425060 Commits

All Branches