llvm-project

Commit Graph

Author	SHA1	Message	Date
Alexey Lapshin	e74197bc12	[Reland][Debuginfo][llvm-dwarfutil] Add check for unsupported debug sections. Current DWARFLinker implementation does not support some debug sections (mainly DWARF v5 sections). This patch adds diagnostic for such sections. The warning would be displayed for critical(such that could not be removed) sections and the source file would be skipped. Other unsupported sections would be removed and warning message should be displayed. The zero exit status would be returned for both cases. Reviewed By: JDevlieghere Differential Revision: https://reviews.llvm.org/D123623	2022-07-28 21:29:16 +03:00
Austin Kerbow	0f93a45b11	[AMDGPU] Add isMeta flag to SCHED_GROUP_BARRIER	2022-07-28 11:04:33 -07:00
Fangrui Song	9f0d5330bd	[MC][test] Rename two --compress-debug-sections=zlib tests To be clearer when zstd support is added.	2022-07-28 10:57:56 -07:00
River Riddle	00a52c7565	[mlir:SubElementsInterface] Add support for "skipping" when replacing attributes/types This is used to fix a bug in SymbolTable::replaceAllSymbolUses where we replace symbols that we shouldn't. Differential Revision: https://reviews.llvm.org/D130693	2022-07-28 10:52:12 -07:00
Fangrui Song	c26dc2904b	[llvm-objcopy] Support --{,de}compress-debug-sections for zstd Also, add ELFCOMPRESS_ZSTD (2) from the approved generic-abi proposal: https://groups.google.com/g/generic-abi/c/satyPkuMisk ("Add new ch_type value: ELFCOMPRESS_ZSTD") Link: https://discourse.llvm.org/t/rfc-zstandard-as-a-second-compression-method-to-llvm/63399 ("[RFC] Zstandard as a second compression method to LLVM") Differential Revision: https://reviews.llvm.org/D130458	2022-07-28 10:45:53 -07:00
Austin Kerbow	f5b21680d1	[AMDGPU] Add amdgcn_sched_group_barrier builtin This builtin allows the creation of custom scheduling pipelines on a per-region basis. Like the sched_barrier builtin this is intended to be used either for testing, in situations where the default scheduler heuristics cannot be improved, or in critical kernels where users are trying to get performance that is close to handwritten assembly. Obviously using these builtins will require extra work from the kernel writer to maintain the desired behavior. The builtin can be used to create groups of instructions called "scheduling groups" where ordering between the groups is enforced by the scheduler. __builtin_amdgcn_sched_group_barrier takes three parameters. The first parameter is a mask that determines the types of instructions that you would like to synchronize around and add to a scheduling group. These instructions will be selected from the bottom up starting from the sched_group_barrier's location during instruction scheduling. The second parameter is the number of matching instructions that will be associated with this sched_group_barrier. The third parameter is an identifier which is used to describe what other sched_group_barriers should be synchronized with. Note that multiple sched_group_barriers must be added in order for them to be useful since they only synchronize with other sched_group_barriers. Only "scheduling groups" with a matching third parameter will have any enforced ordering between them. As an example, the code below tries to create a pipeline of 1 VMEM_READ instruction followed by 1 VALU instruction followed by 5 MFMA instructions... // 1 VMEM_READ __builtin_amdgcn_sched_group_barrier(32, 1, 0) // 1 VALU __builtin_amdgcn_sched_group_barrier(2, 1, 0) // 5 MFMA __builtin_amdgcn_sched_group_barrier(8, 5, 0) // 1 VMEM_READ __builtin_amdgcn_sched_group_barrier(32, 1, 0) // 3 VALU __builtin_amdgcn_sched_group_barrier(2, 3, 0) // 2 VMEM_WRITE __builtin_amdgcn_sched_group_barrier(64, 2, 0) Reviewed By: jrbyrnes Differential Revision: https://reviews.llvm.org/D128158	2022-07-28 10:43:14 -07:00
Sunho Kim	c619d4f840	[clang-repl] Support destructors of global objects. Supports destructors of global objects by properly calling jitdylib deinitialize which calls the global dtors of ir modules. This supersedes https://reviews.llvm.org/D127945. There was an issue when calling deinitialize on windows but it got fixed by https://reviews.llvm.org/D128037. Reviewed By: v.g.vassilev Differential Revision: https://reviews.llvm.org/D128589	2022-07-29 02:38:40 +09:00
Xing Xue	aeb1c98f4c	[libc++][AIX] Use non-unique implementation for typeinfo comparison Summary: The AIX linker does not merge typeinfos when shared libraries are involved, which causes address comparison to fail although the types are the same. This patch changes to use the non-unique implementation for typeinfo comparison for AIX. Reviewed by: hubert.reinterpretcast, philnik, libc++ Differential Revision: https://reviews.llvm.org/D130715	2022-07-28 13:17:12 -04:00
Craig Topper	2750873dfe	[RISCV] Update lowerFROUND to use masked instructions. This avoids a vmerge at the end and avoids spurious fflags updates. This isn't used for constrained intrinsic so we technically don't have to worry about fflags, but it doesn't cost much to support it. To support I've extend our FCOPYSIGN_VL node to support a passthru operand. Similar to what was done for VRGATHER*_VL nodes. I plan to do a similar update for trunc, floor, and ceil. Reviewed By: reames, frasercrmck Differential Revision: https://reviews.llvm.org/D130659	2022-07-28 10:05:19 -07:00
Craig Topper	89173dee71	[RISCV] Remove duplicate code. NFC The same operations are part of `FloatingPointVecReduceOps` a little bit earlier.	2022-07-28 10:05:19 -07:00
Louis Dionne	1422a9689d	[libc++] Properly log crashes with the assertion handler on older Androids This reintroduces the same workaround we have in libc++abi for older Androids based on https://reviews.llvm.org/D130507#inline-1255914. Differential Revision: https://reviews.llvm.org/D130708	2022-07-28 12:55:33 -04:00
Mahesh Ravishankar	9fe27bca71	[mlir][Linalg] Allow decompose to handle ops when value of `outs` operand is used in payload. Current implementation of decomposition of Linalg operations wouldnt work if the `outs` operand values were used within the body of the operation. Relax this restriction. This potentially sets the stage for decomposing ops with reduction iterator types (but is not done here since it requires more study). Differential Revision: https://reviews.llvm.org/D130527	2022-07-28 16:42:54 +00:00
Mahesh Ravishankar	6f03a10e4f	[mlir][TilingInterface] Add a method to generate scalar implementation of the op. While The tiling interface provides a mechanism for operations to be tiled into tiled version of the op (or another op at the same level of abstraction), the `generateScalarImplementation` method added here is the "exit point" after all transformations have been done. Ops that implement this method are expected to generate IR that are directly lowerable to backend dialects like LLVM or SPIR-V dialects. Differential Revision: https://reviews.llvm.org/D130612	2022-07-28 16:37:15 +00:00
Amaury Séchet	1e15e24a76	[NFC] Autogenerate CodeGen/PowerPC/pzero-fp-xored.ll	2022-07-28 16:18:43 +00:00
Simon Pilgrim	8c99cef1e7	[DAG] Remove SelectionDAG::GetDemandedBits and use SimplifyMultipleUseDemandedBits directly. GetDemandedBits is mainly a wrapper around SimplifyMultipleUseDemandedBits now, and is only used by DAGCombiner::visitSTORE so I've moved all remaining functionality there. visitSTORE was making use of this to 'simplify' constants for a trunc-store. Just removing this code left to a mixture of regressions and gains - it came down to whether a target preferred a sign or zero extended constant for materialization/truncation. I've just moved the code over for now, but a next step would be to move this to targetShrinkDemandedConstant, but some targets that override the method expect a basic binop, and might react badly to a store node.....	2022-07-28 17:03:44 +01:00
Philip Reames	82c1b136db	[LV] Don't predicate uniform mem op stores unneccessarily We already had the reasoning about uniform mem op loads; if the address is accessed at least once, we know the instruction doesn't need predicated to ensure fault safety. For stores, we do need to ensure that the values visible in memory are the same with and without predication. The easiest sub-case to check for is that all the values being stored are the same. Since we know that at least one lane is active, this tells us that the value must be visible. Warning on confusing terminology: "uniform" vs "uniform mem op" mean two different things here, and this patch is specific to the later. It would not be legal to make this same change for merely "uniform" operations. Differential Revision: https://reviews.llvm.org/D130637	2022-07-28 08:55:52 -07:00
Jon Chesterfield	c214cb6a68	[amdgpu][openmp][nfc] Restore stb_local on DeviceInfo symbol	2022-07-28 16:50:46 +01:00
Prabhdeep Singh Soni	f5efa1892e	[Flang][MLIR][OpenMP] Add support for simdlen clause This supports lowering from parse-tree to MLIR and translation from MLIR to LLVM IR using OMPIRBuilder for OpenMP simdlen clause in SIMD construct. Reviewed By: shraiysh, peixin, arnamoy10 Differential Revision: https://reviews.llvm.org/D130195	2022-07-28 23:49:17 +08:00
Jon Chesterfield	75aa521064	[openmp][amdgpu] Move global DeviceInfo behind call syntax prior to using D130712	2022-07-28 16:40:42 +01:00
Jon Chesterfield	1f9d3974e4	[openmp] Introduce optional plugin init/deinit functions Will allow plugins to migrate away from using global variables to manage lifetime, which will fix a segfault discovered in relation to D127432 Reviewed By: jhuber6 Differential Revision: https://reviews.llvm.org/D130712	2022-07-28 16:21:38 +01:00
LLVM GN Syncbot	59ea2c64d5	[gn build] Port `d52e775b05`	2022-07-28 14:44:36 +00:00
Liqiang Tao	d52e775b05	[llvm][ModuleInliner] Add inline cost priority for module inliner This patch introduces the inline cost priority into the module inliner, which uses the same computation as InlineCost. Reviewed By: kazu Differential Revision: https://reviews.llvm.org/D130012	2022-07-28 22:44:03 +08:00
LLVM GN Syncbot	cf0196db88	[gn build] Port `c113594378`	2022-07-28 14:37:35 +00:00
Liqiang Tao	c113594378	Revert "[llvm][ModuleInliner] Add inline cost priority for module inliner" This reverts commit `bb7f62bbbd`.	2022-07-28 22:36:28 +08:00
Florian Hahn	f912bab111	Revert "[X86][DAGISel] Don't widen shuffle element with AVX512" This reverts commit `5fb4134210`. This patch is causing crashes when building llvm-test-suite when optimizing for CPUs with AVX512. Reproducer crashing with llc: target datalayout = "e-m:o-p270:32:32-p271:32:32-p272:64:64-i64:64-f80:128-n8:16:32:64-S128" target triple = "x86_64-apple-macosx" define i32 @test(<32 x i32> %0) #0 { entry: %1 = mul <32 x i32> %0, <i32 1, i32 1, i32 1, i32 1, i32 1, i32 1, i32 1, i32 1, i32 1, i32 1, i32 1, i32 1, i32 1, i32 1, i32 1, i32 1, i32 1, i32 1, i32 1, i32 1, i32 1, i32 1, i32 1, i32 1, i32 1, i32 1, i32 1, i32 1, i32 1, i32 1, i32 1, i32 1> %2 = tail call i32 @llvm.vector.reduce.add.v32i32(<32 x i32> %1) ret i32 %2 } ; Function Attrs: nocallback nofree nosync nounwind readnone willreturn declare i32 @llvm.vector.reduce.add.v32i32(<32 x i32>) #1 attributes #0 = { "min-legal-vector-width"="0" "target-cpu"="skylake-avx512" } attributes #1 = { nocallback nofree nosync nounwind readnone willreturn }	2022-07-28 15:26:42 +01:00
Simon Pilgrim	be488ba7de	[DAG] DAGCombiner::visitTRUNCATE - remove GetDemandedBits call This should now all be handled by SimplifyDemandedBits.	2022-07-28 15:23:04 +01:00
Chris Bieneman	fe13002bb3	[HLSL] Add __builtin_hlsl_create_handle This is pretty straightforward, it just adds a builtin to return a pointer to a resource handle. This maps to a dx intrinsic. The shape of this builtin and the underlying intrinsic will likely shift a bit as this implementation becomes more feature complete, but this is a good basis to get started. Depends on D128569. Differential Revision: https://reviews.llvm.org/D130016	2022-07-28 09:16:11 -05:00
Chris Bieneman	6e56d0dbe3	Start support for HLSL `RWBuffer` Most of the change here is fleshing out the HLSLExternalSemaSource with builder implementations to build the builtin types. Eventually, I may move some of this code into tablegen or a more managable declarative file but I want to get the AST generation logic ready first. This code adds two new types into the HLSL AST, `hlsl::Resource` and `hlsl::RWBuffer`. The `Resource` type is just a wrapper around a handle identifier, and is largely unused in source. It will morph a bit over time as I work on getting the source compatability correct, but for now it is a reasonable stand-in. The `RWBuffer` type is not ready for use. I'm posting this change for review because it adds a lot of infrastructure code and is testable. There is one change to clang code outside the HLSL-specific logic here, which addresses a behavior change introduced a long time ago in `967d438439`. That change resulted in unintentionally breaking situations where an incomplete template declaration was provided from an AST source, and needed to be completed later by the external AST. That situation doesn't happen in the normal AST importer flow, but can happen when an AST source provides incomplete declarations of templates. The solution is to annotate template specializations of incomplete types with the HasExternalLexicalSource bit from the base template. Depends on D128012. Differential Revision: https://reviews.llvm.org/D128569	2022-07-28 08:49:50 -05:00
Sunho Kim	bd08f413c0	[clang-repl] Disable exception unittest on AIX. AIX platform was not supported but it was not explicitly checked in exception test as it was excluded by isPPC() check.	2022-07-28 22:48:51 +09:00
Simon Pilgrim	ea7f14dad0	[DAG] SelectionDAG::GetDemandedBits - don't simplify opaque constants I'm actually trying to get rid of GetDemandedBits - but while dismantling it I noticed that we were altering opaque constants. Fixing that causes a FP_TO_INT_SAT regression that should be addressed separately - I'll raise a bug.	2022-07-28 14:46:59 +01:00
LLVM GN Syncbot	e293802499	[gn build] Port `bb7f62bbbd`	2022-07-28 13:30:20 +00:00
Liqiang Tao	bb7f62bbbd	[llvm][ModuleInliner] Add inline cost priority for module inliner This patch introduces the inline cost priority into the module inliner, which uses the same computation as InlineCost. Reviewed By: kazu Differential Revision: https://reviews.llvm.org/D130012	2022-07-28 21:28:07 +08:00
David Green	3b09e532ee	[ARM] Remove duplicate fp16 intrinsics These vdup and vmov float16 intrinsics are being defined in both the general section and then again in fp16 under a !aarch64 flag. The vdup_lane intrinsics were being defined in both aarch64 and !aarch64 sections, so have been commoned. They are defined as macros, so do not give duplicate warnings, but removing the duplicates shouldn't alter the available intrinsics.	2022-07-28 14:26:17 +01:00
Simon Pilgrim	69d5a038b9	[DAG] Enable ISD::SRL SimplifyMultipleUseDemandedBits handling inside SimplifyDemandedBits This patch allows SimplifyDemandedBits to call SimplifyMultipleUseDemandedBits in cases where the ISD::SRL source operand has other uses, enabling us to peek through the shifted value if we don't demand all the bits/elts. This is another step towards removing SelectionDAG::GetDemandedBits and just using TargetLowering::SimplifyMultipleUseDemandedBits. There a few cases where we end up with extra register moves which I think we can accept in exchange for the increased ILP. Differential Revision: https://reviews.llvm.org/D77804	2022-07-28 14:10:44 +01:00
Kevin P. Neal	25a83005ef	Precommit tests for D112256 "[FPEnv][EarlyCSE] Add support for CSE of constrained FP intrinsics, take 2"	2022-07-28 08:59:27 -04:00
Amaury Séchet	474a8ee03d	[DAG] Use recursivelyDeleteUnusedNodes in PromoteLoad It simplifies the code overall and removes the need for manual bookkeeping. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D130447	2022-07-28 12:54:52 +00:00
Sebastian Neubauer	50716ba2b3	[CMake][OpenMP] Remove wrong backslash outdir is defined in the line above, it will not exist in the install command, so it should not be escaped.	2022-07-28 14:35:04 +02:00
Amaury Séchet	7920805b27	[DAG] Use recursivelyDeleteUnusedNodes in ReplaceLoadWithPromotedLoad It simplifies the code overall and removes the need for manual bookkeeping. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D130444	2022-07-28 12:32:37 +00:00
Alexander Timofeev	76d9ae924c	[AMDGPU] avoid blind converting to VALU REG_SEQUENCE and PHIs In the `2e29b0138c` we introduce a specific solving algorithm that analyzes the VGPR to SGPR copies use chains and either lowers the copy to v_readfirstlane_b32 or converts the whole chain to VALU forms. Same time we still have the code that blindly converts to VALU REG_SEQUENCE and PHIs in case they produce SGPR but have VGPRs input operands. In case the REG_SEQUENCE and PHIs are in the VGPR to SGPR copy use chain, and this chain was considered long enough to convert copy to v_readfistlane_b32, further lowering them to VALU leads to several kinds of issues. At first, we have v_readfistlane_b32 which is completely useless because most parts of its use chain were moved to VALU forms. Second, we may encounter subtle bugs related to the EXEC-dependent CF because of the weird mixing of SALU and VALU instructions. This change removes the code that moves REG_SEQUENCE and PHIs to VALU. Instead, we use the fact that both REG_SEQUENCE and PHIs have copy semantics. That is, if they define SGPR but have VGPR inputs, we insert VGPR to SGPR copies to make them pure SGPR. Then, the new copies are processed by the common VGPR to SGPR lowering algorithm. This is Part 2 in the series of commits aiming at the massive refactoring of the SIFixSGPRCopies pass. Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D130367	2022-07-28 14:30:29 +02:00
Sunho Kim	3cc3be8fa4	[clang-repl] Add host exception support check utility flag. Add host exception support check utility flag. This is needed to not run tests that require exception support in few buildbots that lacks related symbols for some reason. Reviewed By: lhames Differential Revision: https://reviews.llvm.org/D129242	2022-07-28 21:14:58 +09:00
Sunho Kim	72ea1a721e	[ORC] Fix weak hidden symbols failure on PPC with runtimedyld Fix "JIT session error: Symbols not found: [ DW.ref.__gxx_personality_v0 ] error" which happens when trying to use exceptions on ppc linux. To do this, it expands AutoClaimSymbols option in RTDyldObjectLinkingLayer to also claim weak symbols before they are tried to be resovled. In ppc linux, DW.ref symbols is emitted as weak hidden symbols in the later stage of MC pipeline. This means when using IRLayer (i.e. LLJIT), IRLayer will not claim responsibility for such symbols and RuntimeDyld will skip defining this symbol even though it couldn't resolve corresponding external symbol. Reviewed By: sgraenitz Differential Revision: https://reviews.llvm.org/D129175	2022-07-28 21:12:25 +09:00
Muhammad Usman Shahid	0cc3c184c7	Missing tautological compare warnings due to unary operators The patch mainly focuses on the lack of warnings for -Wtautological-compare. It works fine for positive numbers but doesn't for negative numbers. This is because the warning explicitly checks for an IntegerLiteral AST node, but -1 is represented by a UnaryOperator with an IntegerLiteral sub-Expr. For the below code we have warnings: if (0 == (5 \| x)) {} but not for if (0 == (-5 \| x)) {} This patch changes the analysis to not look at the AST node directly to see if it is an IntegerLiteral, but instead attempts to evaluate the expression to see if it is an integer constant expression. This handles unary negation signs, but also handles all the other possible operators as well. Fixes #42918 Differential Revision: https://reviews.llvm.org/D130510	2022-07-28 07:45:28 -04:00
Dmitry Preobrazhensky	955cc56af4	[AMDGPU][GFX1030][DOC][NFC] Update assembler syntax description Summary of changes: - Update FLAT LDS syntax (see https://reviews.llvm.org/D125126)	2022-07-28 14:36:53 +03:00
Dmitry Preobrazhensky	2b230d69ad	[AMDGPU][MC][GFX90A] Correct MIMG dst size validation Correct validator to enable MIMG dst size checks. Differential Revision: https://reviews.llvm.org/D130512	2022-07-28 14:30:08 +03:00
Adrian Kuegel	ba110cf97a	[mlir] Add getters for DenseArrayAttr. This change adds convenience getters to builders. Differential Revision: https://reviews.llvm.org/D130696	2022-07-28 13:26:27 +02:00
Sanjay Patel	28ad5dc3f7	[InstCombine] try harder to narrow bitwise logic with cast operands This works with any logic + extend: https://alive2.llvm.org/ce/z/vzsqQD The motivating case is from issue #56294, but that's still not optimal (it should simplify completely).	2022-07-28 07:23:22 -04:00
Sanjay Patel	35e8179c47	[InstCombine] add tests for bitwise logic with cast operands; NFC	2022-07-28 07:23:22 -04:00
Dmitry Preobrazhensky	fa7fd8ec31	[AMDGPU][MC][GFX11] Disable SGPRs for src1 of v_fma_mix*_dpp opcodes Differential Revision: https://reviews.llvm.org/D130634	2022-07-28 14:20:05 +03:00
Nico Weber	dd428a571c	[gn build] (manually) port `18b4a8bcf3` more	2022-07-28 07:14:43 -04:00
chendewen	7eeb468ae5	[Aarch64] Add cost for missing extensions. This patch adds a cost estimate for some missing sign extensions. ref: https://reviews.llvm.org/D14730 Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D130565	2022-07-28 17:34:00 +08:00

1 2 3 4 5 ...

431394 Commits All Branches Search

431394 Commits

All Branches