llvm-project

Commit Graph

Author	SHA1	Message	Date
Clement Courbet	3b72448084	[clang-tidy] Add unit tests for `DeclRefExprUtils`. In preparation for D114539.	2021-11-24 16:47:55 +01:00
Quinn Pham	6ee69c0dea	[NFC][X86] Inclusive language: Rename master label in x86-shrink-wrapping.ll [NFC] As part of using inclusive language within the llvm project, this patch renames the master label in `x86-shrink-wrapping.ll` to main. Reviewed By: ZarkoCA Differential Revision: https://reviews.llvm.org/D113940	2021-11-24 09:47:45 -06:00
Paul Robinson	f3bfe1b418	Have yaml2obj describe all options in --help Differential Revision: https://reviews.llvm.org/D114538	2021-11-24 07:44:52 -08:00
Pavel Labath	96beb30fbb	[lldb] Move GetSupportedArchitectureAtIndex to PlatformDarwin All other platforms use GetSupportedArchitectures now.	2021-11-24 15:48:23 +01:00
Phoebe Wang	5ba795c6e8	[X86][MS] Add test cases to show wrong alignment in vector variable arguments. NFC	2021-11-24 22:45:04 +08:00
Roman Lebedev	cd8d219536	[X86][Costmodel] `getReplicationShuffleCost()`: promote 1 bit-wide elements to 32 bit when have AVX512DQ I believe, this effectively completes `X86TTIImpl::getReplicationShuffleCost()` for AVX512, other than the question of handling plain AVX512F, where we end up with some really ugly "shuffles", but then is there any CPU's that support AVX512, but not AVX512DQ/AVX512BW? Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D114315	2021-11-24 17:23:15 +03:00
Guillaume Chatelet	408c0cc4eb	[libc] Fix wrong type for load/store of Repeated elements	2021-11-24 14:13:13 +00:00
Florian Hahn	8b86752c60	[VPlan] Remove unused VPInstruction constructor. (NFC) VPInstruction inherits from VPValue, so the constructor taking ArrayRef<VPValue*> covers all cases that would be covered by the removed constructor.	2021-11-24 14:06:50 +00:00
Bradley Smith	080ef0b6a6	[AArch64][SVE] Recognize all ones mask during fixed mask generation Differential Revision: https://reviews.llvm.org/D114431	2021-11-24 13:55:06 +00:00
Benjamin Kramer	d32787230d	Revert "[DAG] SimplifyDemandedBits - simplify rotl/rotr to shl/srl" This reverts commit `3cf4a2c620`. It makes llc hang on the following test case. ``` target datalayout = "e-m:e-i8:8:32-i16:16:32-i64:64-i128:128-n32:64-S128" target triple = "aarch64-unknown-linux-gnu" define dso_local void @_PyUnicode_EncodeUTF16() local_unnamed_addr #0 { entry: br label %while.body117.i while.body117.i: ; preds = %cleanup149.i, %entry %out.6269.i = phi i16* [ undef, %cleanup149.i ], [ undef, %entry ] %0 = load i16, i16* undef, align 2 %1 = icmp eq i16 undef, -10240 br i1 %1, label %fail.i, label %cleanup149.i cleanup149.i: ; preds = %while.body117.i %or130.i = call i16 @llvm.bswap.i16(i16 %0) #2 store i16 %or130.i, i16* %out.6269.i, align 2 br label %while.body117.i fail.i: ; preds = %while.body117.i ret void } ; Function Attrs: nofree nosync nounwind readnone speculatable willreturn declare i16 @llvm.bswap.i16(i16) #1 attributes #0 = { "target-features"="+neon,+v8a" } attributes #1 = { nofree nosync nounwind readnone speculatable willreturn } attributes #2 = { mustprogress nofree norecurse nosync nounwind readnone uwtable willreturn "frame-pointer"="non-leaf" "min-legal-vector-width"="0" "no-trapping-math"="true" "stack-protector-buffer-size"="8" "target-cpu"="generic" "target-features"="+neon,+v8a" } ```	2021-11-24 14:42:54 +01:00
Florian Hahn	a7648eb2aa	[LV] Use patterns in some induction tests, to make more robust. (NFC)	2021-11-24 13:32:24 +00:00
Simon Pilgrim	73fd36963c	[X86] Add BMI test coverage for for or-lea with no common bits tests Ensure D113970 handles andnot patterns as well.	2021-11-24 13:24:27 +00:00
Omer Aviram	19be7f9702	[X86] Add D113970 tests cases for or-lea with no common bits. Added tests are permutations of the pattern: (X & ~M) or (Y & M). Differential Revision: https://reviews.llvm.org/D114078	2021-11-24 13:22:28 +00:00
Sanjay Patel	b326c05814	[InstSimplify] fold xor logic of 2 variables, part 2 (~a & b) ^ (a \| b) --> a This is the swapped and/or (Demorgan?) sibling fold for the fold added with D114462 ( `892648b18a` ). This case is easier to specify because we are returning a root value, not a 'not': https://alive2.llvm.org/ce/z/SRzj4f	2021-11-24 08:15:47 -05:00
Sanjay Patel	823fc8aa06	[InstSimplify] add tests for xor logic; NFC	2021-11-24 08:15:47 -05:00
Djordje Todorovic	e3d8ebe158	[llvm-dwarfdump][Statistics] Handle LTO cases with cross CU referencing With link-time optimizations enabled, resulting DWARF mayend up containing cross CU references (through the DW_AT_abstract_origin attribute). Consider the following example: // sum.c __attribute__((always_inline)) int sum(int a, int b) { return a + b; } // main.c extern int sum(int, int); int main() { int a = 5, b = 10, c = sum(a, b); return 0; } Compiled as follows: $ clang -g -flto -fuse-ld=lld main.c sum.c -o main Results in the following DWARF: -- sum.c CU: abstract instance tree ... 0x000000b0: DW_TAG_subprogram DW_AT_name ("sum") DW_AT_decl_file ("sum.c") DW_AT_decl_line (1) DW_AT_prototyped (true) DW_AT_type (0x000000d3 "int") DW_AT_external (true) DW_AT_inline (DW_INL_inlined) 0x000000bc: DW_TAG_formal_parameter DW_AT_name ("a") DW_AT_decl_file ("sum.c") DW_AT_decl_line (1) DW_AT_type (0x000000d3 "int") 0x000000c7: DW_TAG_formal_parameter DW_AT_name ("b") DW_AT_decl_file ("sum.c") DW_AT_decl_line (1) DW_AT_type (0x000000d3 "int") ... -- main.c CU: concrete inlined instance tree ... 0x0000006d: DW_TAG_inlined_subroutine DW_AT_abstract_origin (0x00000000000000b0 "sum") DW_AT_low_pc (0x00000000002016ef) DW_AT_high_pc (0x00000000002016f1) DW_AT_call_file ("main.c") DW_AT_call_line (5) DW_AT_call_column (0x19) 0x00000081: DW_TAG_formal_parameter DW_AT_location (DW_OP_reg0 RAX) DW_AT_abstract_origin (0x00000000000000bc "a") 0x00000088: DW_TAG_formal_parameter DW_AT_location (DW_OP_reg2 RCX) DW_AT_abstract_origin (0x00000000000000c7 "b") ... Note that each entry within the concrete inlined instance tree in the main.c CU has a DW_AT_abstract_origin attribute which refers to a corresponding entry within the abstract instance tree in the sum.c CU. llvm-dwarfdump --statistics did not properly report DW_TAG_formal_parameters/DW_TAG_variables from concrete inlined instance trees which had 0% location coverage and which referred to a different CU, mainly because information about abstract instance trees and their parameters/variables was stored locally - just for the currently processed CU, rather than globally - for all CUs. In particular, if the concrete inlined instance tree from the example above was to look like this (i.e. parameter b has 0% location coverage, hence why it's missing): 0x0000006d: DW_TAG_inlined_subroutine DW_AT_abstract_origin (0x00000000000000b0 "sum") DW_AT_low_pc (0x00000000002016ef) DW_AT_high_pc (0x00000000002016f1) DW_AT_call_file ("main.c") DW_AT_call_line (5) DW_AT_call_column (0x19) 0x00000081: DW_TAG_formal_parameter DW_AT_location (DW_OP_reg0 RAX) DW_AT_abstract_origin (0x00000000000000bc "a") llvm-dwarfdump --statistics would have not reported b as such. Patch by Dimitrije Milosevic. Differential revision: https://reviews.llvm.org/D113465	2021-11-24 13:50:47 +01:00
Nemanja Ivanovic	b7bf937bbe	[PowerPC] Provide XL-compatible vec_round implementation The XL implementation of vec_round for vector double uses "round-to-nearest, ties to even" just as the vector float `version does. However clang and gcc use "round-to-nearest-away" for vector double and "round-to-nearest, ties to even" for vector float. The XL behaviour is implemented under the __XL_COMPAT_ALTIVEC__ macro similarly to other instances of incompatibility. Differential revision: https://reviews.llvm.org/D113642	2021-11-24 06:43:56 -06:00
Jeremy Morse	f911c397dc	[DebugInfo] Adjust x86 location-list tests for instruction referencing This patch updates location lists in various x86 tests to reflect what instruction referencing produces. There are two flavours of change: * Not following a register copy immediately, because instruction referencing can make some slightly smarter decisions, * Extended ranges, due to having additional information. The register changes aren't that interesting, it's just a choice between equally legitimate registers that instr-ref does differently. The extended ranges are largely due to following stack restores better. Differential Revision: https://reviews.llvm.org/D114362	2021-11-24 12:30:55 +00:00
Dmitry Vyukov	a68b52e0a3	tsan: add another fork deadlock test The test tries to provoke internal allocator to be locked during fork and then force the child process to use the internal allocator. This test sometimes deadlocks with the new tsan runtime. Depends on D114514. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D114515	2021-11-24 13:25:53 +01:00
Dmitry Vyukov	06677d6a9f	sanitizer_common: remove SANITIZER_USE_MALLOC It was introduced in: `9cffc9550b` tsan: allow to force use of __libc_malloc in sanitizer_common and used in: `512a18e518` tsan: add standalone deadlock detector and later used for Go support. But now both uses are gone. Nothing defines SANITIZER_USE_MALLOC. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D114514	2021-11-24 13:24:19 +01:00
Manuel Klimek	1b5a43ac3f	Clean up clang-format tech debt. Make all code go through FormatTokenSource instead of going around it, which makes changes to TokenSource brittle. Add LLVM_DEBUG in FormatTokenSource to be able to follow the token stream.	2021-11-24 12:58:35 +01:00
Jeremy Morse	2191d502a8	[DebugInfo] Check both instr-ref and DBG_VALUE modes of sdag tests In these test updates for instruction referencing, I've added specific instr-ref RUN lines, and kep thte DBG_VALUE-based variable location check lines too. This is because argument handling is really fiddly, and I figure it's worth duplicating the testing to ensure it's definitely correct. There's also dbg-value-superreg-copy2.mir, a dtest for where varaible locations go when virtual registers are coalesced together. I don't think there's an instruction referencing specific test for this, so have duplicated that to for instruction referencing. Differential Revision: https://reviews.llvm.org/D114262	2021-11-24 11:48:16 +00:00
Simon Pilgrim	3cf4a2c620	[DAG] SimplifyDemandedBits - simplify rotl/rotr to shl/srl If we only demand bits from one half of a rotation pattern, see if we can simplify to a logical shift. For the ARM rev16 patterns, I had to drop a fold to prevent srl(bswap()) -> rotr(bswap) -> srl(bswap) infinite loops. I've replaced this with an isel PatFrag which should do the same task. https://alive2.llvm.org/ce/z/iroxki (rol -> shl by amt iff demanded bits has at least as many trailing zeros as the shift amount) https://alive2.llvm.org/ce/z/4ez_U- (ror -> shl by revamt iff demanded bits has at least as many trailing zeros as the reverse shift amount) https://alive2.llvm.org/ce/z/cD7dR- (ror -> lshr by amt iff demanded bits has at least as many leading zeros as the shift amount) https://alive2.llvm.org/ce/z/_XGHtQ (rol -> lshr by revamt iff demanded bits has at least as many leading zeros as the reverse shift amount) Differential Revision: https://reviews.llvm.org/D114354	2021-11-24 11:28:35 +00:00
Jay Foad	d7e03df719	[AMDGPU] Implement widening multiplies with v_mad_i64_i32/v_mad_u64_u32 Select SelectionDAG ops smul_lohi/umul_lohi to v_mad_i64_i32/v_mad_u64_u32 respectively, with an addend of 0. v_mul_lo, v_mul_hi and v_mad_i64/u64 are all quarter-rate instructions so it is better to use one instruction than two. Further improvements are possible to make better use of the addend operand, but this is already a strict improvement over what we have now. Differential Revision: https://reviews.llvm.org/D113986	2021-11-24 11:25:02 +00:00
Jay Foad	8a52bd82e3	[AMDGPU] Only select VOP3 forms of VOP2 instructions Change VOP_PAT_GEN to default to not generating an instruction selection pattern for the VOP2 (e32) form of an instruction, only for the VOP3 (e64) form. This allows SIFoldOperands maximum freedom to fold copies into the operands of an instruction, before SIShrinkInstructions tries to shrink it back to the smaller encoding. This affects the following VOP2 instructions: v_min_i32 v_max_i32 v_min_u32 v_max_u32 v_and_b32 v_or_b32 v_xor_b32 v_lshr_b32 v_ashr_i32 v_lshl_b32 A further cleanup could simplify or remove VOP_PAT_GEN, since its optional second argument is never used. Differential Revision: https://reviews.llvm.org/D114252	2021-11-24 11:15:30 +00:00
$SYNOPSYS\georgiev$ SYNOPSYS\georgiev	c52ff0cfcb	[LLDB/test] lldbutil check_breakpoint() - check target instance Check test.target instance type before we attempt to get the breakpoint. This fix is suggested by 'clayborg'. Ref: https://reviews.llvm.org/D111899#inline-1090156	2021-11-24 11:13:17 +00:00
Carl Ritson	976f3b3c9e	[AMDGPU] Only allow implicit WQM in pixel shaders Implicit derivatives are only valid in pixel shaders, hence only implicitly enable WQM for pixel shaders. This avoids unintended WQM in other shader types (e.g. compute) when image sampling instructions are used. Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D114414	2021-11-24 20:04:42 +09:00
David Green	581f837355	[ARM] Fold (fadd x, (vselect c, y, -1.0)) into (vselect c, (fadd x, y), x) This is similar to D113574, but as a DAG combine, not tablegen patterns. Doing the fold as a DAG combine allows the fadd to be folded with a fmul, finally producing a predicated vfma. It performs the same fold of fadd(x, vselect(p, y, -0.0)) to vselect p, (fadd x, y), x) using -0.0 as the identity value of a fadd. Differential Revision: https://reviews.llvm.org/D113584	2021-11-24 10:41:00 +00:00
Matthias Springer	ca9d149e07	[mlir][linalg][bufferize][NFC] Move vector interface impl to new build target This makes ComprehensiveBufferize entirely independent of the vector dialect. Differential Revision: https://reviews.llvm.org/D114218	2021-11-24 19:36:12 +09:00
David Sherwood	cf40ca026f	[NFC] Tidy up SelectionDAGBuilder::visitIntrinsicCall to use existing sdl debug loc In quite a few places we were calling getCurSDLoc() to get the debug location, but this is already a local variable `sdl`. Differential Revision: https://reviews.llvm.org/D114447	2021-11-24 10:35:49 +00:00
Jeremy Morse	b8f68ad9cd	[DebugInfo][InstrRef] Avoid crash when values optimised out late in sdag It appears that we can emit all the instructions for a function, including debug instructions, and then optimise some of the values out late. Specifically, in the attached test case, an argument gets optimised out after DBG_VALUE / DBG_INSTR_REFs are created. This confuses MachineFunction::finalizeDebugInstrRefs, which expects to be able to find a defining instruction, and crashes instead. Fix this by identifying when there's no defining instruction, and translating that instead into a DBG_VALUE $noreg. Differential Revision: https://reviews.llvm.org/D114476	2021-11-24 10:34:48 +00:00
David Green	d9af9c2c5a	[ARM] Fold floating point select(binop) patterns Similar to D84091 which added extra predicated folds for integer operations using the identity element of the operation, this adds them for floating point operations for the form `BinOp(x, select(p, y, Identity))`. They are folded back to predicated versions of the operator, with fadd having the identity -0.0, fsub using the identity 0.0 and fmul using 1.0. Differential Revision: https://reviews.llvm.org/D113574	2021-11-24 10:22:20 +00:00
Dmitry Vyukov	764b35d89f	tsan: extend mmap test Test size larger than clear_shadow_mmap_threshold, which is handled differently. Depends on D114348. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D114366	2021-11-24 10:57:21 +01:00
David Green	734e2386ff	[ARM] Add fma and update fadd/fmul predicated select tests. NFC	2021-11-24 09:51:33 +00:00
mydeveloperday	93fc91610f	[clang-format] NFC - recent changes caused clang-format to no longer be clang-formatted. The following 2 commits caused files in clang-format to no longer be clang-formatted. we would lose our "clean" status https://releases.llvm.org/13.0.0/tools/clang/docs/ClangFormattedStatus.html `c2271926a4` - Make clang-format fuzz through Lexing with asserts enabled (https://github.com/llvm/llvm-project/commit/c2271926a4fc ) `84bf5e3286` - Fix various problems found by fuzzing. (https://github.com/llvm/llvm-project/commit/84bf5e328664) Reviewed By: HazardyKnusperkeks, owenpan Differential Revision: https://reviews.llvm.org/D114430	2021-11-24 09:45:32 +00:00
Matthias Springer	bb273a35a0	[mlir][linalg][bufferize][NFC] Move tensor interface impl to new build target This makes ComprehensiveBufferize entirely independent of the tensor dialect. Differential Revision: https://reviews.llvm.org/D114217	2021-11-24 18:25:17 +09:00
Florian Hahn	8ef460fc51	[llvm-reduce] Add parallel chunk processing. This patch adds parallel processing of chunks. When reducing very large inputs, e.g. functions with 500k basic blocks, processing chunks in parallel can significantly speed up the reduction. To allow modifying clones of the original module in parallel, each clone needs their own LLVMContext object. To achieve this, each job parses the input module with their own LLVMContext. In case a job successfully reduced the input, it serializes the result module as bitcode into a result array. To ensure parallel reduction produces the same results as serial reduction, only the first successfully reduced result is used, and results of other successful jobs are dropped. Processing resumes after the chunk that was successfully reduced. The number of threads to use can be configured using the -j option. It defaults to 1, which means serial processing. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D113857	2021-11-24 09:23:52 +00:00
Pavel Labath	6f82264dbb	[lldb/gdb-remote] Remove more non-stop mode remnants The read thread handling is completely dead code now that non-stop mode no longer exists.	2021-11-24 10:00:43 +01:00
Rosie Sumpter	df32a39dd0	[LoopVectorize][CostModel] Update cost model for fmuladd intrinsic This patch updates the cost model for ordered reductions so that a call to the llvm.fmuladd intrinsic is modelled as a normal fmul instruction plus the cost of an ordered fadd reduction. Differential Revision: https://reviews.llvm.org/D111630	2021-11-24 08:50:05 +00:00
Rosie Sumpter	2d33327f9d	[LoopVectorize] Print fast-math flags for VPReductionRecipe	2021-11-24 08:50:05 +00:00
Rosie Sumpter	991074012a	[LoopVectorize] Propagate fast-math flags for VPInstruction In-loop vector reductions which use the llvm.fmuladd intrinsic involve the creation of two recipes; a VPReductionRecipe for the fadd and a VPInstruction for the fmul. If the call to llvm.fmuladd has fast-math flags these should be propagated through to the fmul instruction, so an interface setFastMathFlags has been added to the VPInstruction class to enable this. Differential Revision: https://reviews.llvm.org/D113125	2021-11-24 08:50:04 +00:00
Rosie Sumpter	c2441b6b89	[LoopVectorize] Add vector reduction support for fmuladd intrinsic Enables LoopVectorize to handle reduction patterns involving the llvm.fmuladd intrinsic. Differential Revision: https://reviews.llvm.org/D111555	2021-11-24 08:50:04 +00:00
Butygin	7f5d9bf13a	[mlir][scf] Canonicalize scf.while with unused results Differential Revision: https://reviews.llvm.org/D114291	2021-11-24 11:11:22 +03:00
Clement Courbet	ba4411e7c6	[clang-tidy] performance-unnecessary-copy-initialization: Fix false negative. `isConstRefReturningMethodCall` should be considering `CXXOperatorCallExpr` in addition to `CXXMemberCallExpr`. Clang considers these to be distinct (`CXXOperatorCallExpr` derives from `CallExpr`, not `CXXMemberCallExpr`), but we don't care in the context of this check. This is important because of `std::vector<Expensive>::operator[](size_t) const`. Differential Revision: https://reviews.llvm.org/D114249	2021-11-24 08:07:21 +01:00
Vitaly Buka	b9fd7247a7	[sanitizer] Add Abs<T>	2021-11-23 22:25:36 -08:00
Abinav Puthan Purayil	078da26b1c	[AMDGPU] Check for unneeded shift mask in shift PatFrags. The existing constrained shift PatFrags only dealt with masked shift from OpenCL front-ends. This change copies the X86DAGToDAGISel::isUnneededShiftMask() function to AMDGPU and uses it in the shift PatFrag predicates. Differential Revision: https://reviews.llvm.org/D113448	2021-11-24 10:53:12 +05:30
Igor Kudrin	8cdf1c1edb	[ELF] Support the "read-only" memory region attribute The attribute 'r' allows (or disallows for the negative case) read-only sections, i.e. ones without the SHF_WRITE flag, to be assigned to the memory region. Before the patch, lld could put a section in the wrong region or fail with "error: no memory region specified for section". Differential Revision: https://reviews.llvm.org/D113771	2021-11-24 12:17:09 +07:00
Vitaly Buka	55792b5ac4	[sanitizer] Fail instead of crash without real_pthread_create	2021-11-23 20:32:09 -08:00
Bixia Zheng	02710413a3	Accept symmetric sparse matrix in Matrix Market Exchange Format. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D114402	2021-11-23 19:53:17 -08:00
Weverything	1150f02c77	Revert "tsan: new runtime (v3)" This reverts commit `ebd47b0fb7`. This was causing unexpected behavior in programs.	2021-11-23 18:32:32 -08:00

1 2 3 4 5 ...

405662 Commits All Branches Search

405662 Commits

All Branches