llvm-project

Commit Graph

Author	SHA1	Message	Date
Simon Pilgrim	fc7c1cebbc	[X86] LowerFunnelShift - pull out repeated EltSizeInBits variable. NFC.	2021-11-15 17:11:44 +00:00
Roman Lebedev	bc35d5fe2f	[NFC][X86][Costmodel] Add i1 replication shuffle costmodel test coverage	2021-11-15 20:02:52 +03:00
Chris Lattner	a3ee67a685	[PatternMatch] Add a new m_Any that binds a value. This is analogous to what LLVM's PatternMatch.h supports, but LLVM calls it m_Value for both the binding and nonbinding versions. This is an upstream from CIRCT and is used there. Differential Revision: https://reviews.llvm.org/D113905	2021-11-15 08:38:07 -08:00
Zarko Todorovski	44a64afd43	[llvm][ubsan] Inclusive language: replace use of blacklist HandleLLVMOptions.cmake This patch changes it to ignorelist and contains a filename change for the .txt file that's called. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D113689	2021-11-15 16:22:03 +00:00
Sanjay Patel	3d01507c2d	[x86] fold vector (X > -1) & Y to shift+andn (2nd try) The first try at this patch ( `bf5748a1af` ) was reverted ( `5be64d4164` ) because it could crash. The cause of that problem was failing to account for the optional peek-through-bitcast in the enclosing function. This version of the patch adds a clause to avoid the fold in case of bitcasts because it is unlikely to be profitable in that scenario. A test case based on https://llvm.org/PR52504 was added to make sure we don't have that problem again. Original commit message: and (pcmpgt X, -1), Y --> pandn (vsrai X, BitWidth-1), Y This avoids the -1 constant vector in favor of an arithmetic shift instruction if it exists (the ISA is still not complete after all these years...). We catch this pattern late in combining by matching PCMPGT, so it should not interfere with more general folds. Differential Revision: https://reviews.llvm.org/D113603	2021-11-15 11:09:32 -05:00
Sanjay Patel	6efe64cf9f	[x86] add test for vector signbit mask fold (PR52504); NFC This goes with D113603 - which was reverted because it could crash on this and similar examples.	2021-11-15 11:09:31 -05:00
Roman Lebedev	5c7255fe3a	[X86][Costmodel] `getReplicationShuffleCost()`: promote 8 bit-wide elements to 32 bit when no AVX512VBMI Currently `X86TTIImpl::getInterleavedMemoryOpCostAVX512()` asks about i8 elt type, so this change does affect vectorization. In the end, it will ask about i1. We should also try to promote to i16 if we have AVX512BW, i'll do that in a follow-up. All costs here look good, i've added the missing truncation costs in preparatory patches. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D113853	2021-11-15 19:04:02 +03:00
Roman Lebedev	a468c39c90	[X86][Costmodel] `trunc v32i16 to v64i8` can appear after legalization, cost is same as for `trunc v32i16 to v32i8` Some of the costs get larger here, but i suppose that makes sense since we'd previously query scalarization costs that may not be really representative of the reality. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D113852	2021-11-15 19:04:02 +03:00
Roman Lebedev	9e57d9b09d	[X86][Costmodel] `trunc v8i64 to v16i8/v32i8/v64i8` can appear after legalization, cost is same as for `trunc v8i64 to v8i8` While this one is trivial and identical to the previous patch, there is a weird cost change in a follow-up patch that i'm not sure about. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D113851	2021-11-15 19:04:02 +03:00
Roman Lebedev	0116c708c6	[X86][Costmodel] `trunc v16i32 to v32i8/v64i8` can appear after legalization, cost is same as for `trunc v16i32 to v16i8` While this one is trivial and identical to the previous patch, there is a weird cost change in a follow-up patch that i'm not sure about. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D113850	2021-11-15 19:04:02 +03:00
Kiran Chandramohan	49c08a22ed	[Flang] Add the FIR LLVMPointer Type Add a fir.llvm_ptr type to allow any level of indirections Currently, fir pointer types (fir.ref, fir.ptr, and fir.heap) carry a special Fortran semantics, and cannot be freely combined/nested. When implementing some features, lowering sometimes needs more liberty regarding the number of indirection levels. Add a fir.llvm_ptr that has no constraints. Allow its usage in fir.coordinate_op, fir.load, and fir.store. Convert the FIR LLVMPointer to an LLVMPointer in the LLVM dialect. Reviewed By: clementval Differential Revision: https://reviews.llvm.org/D113755 Co-authored-by: Jean Perier <jperier@nvidia.com>	2021-11-15 15:57:59 +00:00
Jon Chesterfield	0e738323a9	[openmp][amdgpu] Add comment warning that libm may be broken Using llvm-link to add rocm device-libs probably doesn't work Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D112639	2021-11-15 15:56:01 +00:00
Simon Pilgrim	ea9e6aa423	[X86] getAVX512Node() - find constant broadcasts to encourage load-folding If an operand is a bitcasted or widended constant, try to more aggressively create broadcastable constants for folding, which in particular helps non-VLX modes. I've refactored getAVX512Node so that VLX targets can make better use of this as well. NOTE: In the future, I think we should consider removing the broadcast of constant data from DAG entirely and move this to either X86InstrInfo::foldMemoryOperand or a new pass - AVX1/2 targets has similar problems with missed (whole vector) folds that need to be improved as well. Differential Revision: https://reviews.llvm.org/D113845	2021-11-15 15:52:03 +00:00
Alexey Bataev	036207d5f2	[SLP]Improve splat detection. A bunch of scalars can be treated as a splat not only if all elements are the same but also if some of them are undefvalues. Differential Revision: https://reviews.llvm.org/D113774	2021-11-15 07:50:34 -08:00
Ella Ma	da168dd875	[clang] Allow clang-check to customize analyzer output file or dir name Required by https://stackoverflow.com/questions/58073606 As the output argument is stripped out in the clang-check tool, it seems impossible for clang-check users to customize the output file name, even with -extra-args and -extra-arg-before. This patch adds the -analyzer-output-path argument to allow users to adjust the output name. And if the argument is not set or the analyzer is not enabled, the original strip output adjuster will remove the output arguments. Differential Revision: https://reviews.llvm.org/D97265	2021-11-15 16:49:41 +01:00
Valentin Clement	677df8c709	[fir] Add fir.global_len conversion placeholder As for D113662, this patch just add a place holder for the fir.global_len operation conversion. This operation is part of F20xx and is not implemented yet. This patch is part of the upstreaming effort from fir-dev branch. Reviewed By: awarzynski Differential Revision: https://reviews.llvm.org/D113887	2021-11-15 16:47:20 +01:00
Andrzej Warzynski	14867ffc7c	[flang][CodeGen] Transform `fir.unboxchar` to a sequence of LLVM MLIR This patch extends the `FIRToLLVMLowering` pass in Flang by adding a hook to transform `fir.unboxchar` to a sequence of LLVM MLIR instructions. This is part of the upstreaming effort from the `fir-dev` branch in [1]. [1] https://github.com/flang-compiler/f18-llvm-project Differential Revision: https://reviews.llvm.org/D113747 Originally written by: Co-authored-by: Eric Schweitz <eschweitz@nvidia.com> Co-authored-by: Jean Perier <jperier@nvidia.com>	2021-11-15 15:34:55 +00:00
Valentin Clement	37c7211f11	[fir] Remove extra return in SelectTypeOpConversion Extra commit after D113878	2021-11-15 16:20:53 +01:00
Louis Dionne	855a419b92	[libc++] Add missing _LIBCPP_HIDE_FROM_ABI to __rewrap_iter	2021-11-15 10:10:33 -05:00
Matt Devereau	485c193aa1	Regenerate acle_st1.c tests Regenerate acle_st1.c tests using update_cc_test_checks.py	2021-11-15 15:07:52 +00:00
Mehrnoosh Heidarpour	a7f7cf115b	[NFC][InstSimplify] add test cases with base results for or-xor fold This patch adds tests with baseline results as a pre-commit for D113861 Differential Revision: https://reviews.llvm.org/D113860	2021-11-15 10:08:31 -05:00
Alexey Bataev	b85152f8b1	[SLP][NFC]Use `isa_and_nonnull` and fix comment, NFC.	2021-11-15 06:49:33 -08:00
Kirstóf Umann	d896c9f40a	Fix an unused variable warning	2021-11-15 15:45:43 +01:00
ksyx	72b5138d37	Revert "[GVN][NFC] Remove redundant check" This reverts commit `c35e8185d8`. mstorsjo reported in the revision thread that one VNCoercion assertion is violated and seemly in relate to this commit. As per "If a test case that demonstrates a problem is reported in the commit thread, please revert and investigate offline", this commit is reverted.	2021-11-15 09:14:13 -05:00
Alexey Bataev	6fb5bed7d1	[SLP]Do not create unused gather nodes for scalar arguments of vector intrinsics. If the vector intrinsic has scalar argument, we currently still create a tree entry for this argument. This entry is not used, just consumes resources and increases the cost of the tree. Differential Revision: https://reviews.llvm.org/D113806	2021-11-15 06:11:19 -08:00
Simon Tatham	00ff774fca	[CMake] Allow passing extra options to extract_symbols.py. When cross-compiling LLVM in an environment where there //is// an objdump binary available but it does not understand the target platform's object file format, extract_symbols.py fails, because its initial check for tool availability decides that the existence of objdump at all is good enough to settle on it as the tool of choice. In such an environment it's useful to work around this by telling extract_symbols.py to use llvm-readobj instead. The script itself has an option for that, but its invocation in AddLLVM.cmake wasn't providing a mechanism to add extra options passed through for the cmake command line. Reviewed By: DavidSpickett Differential Revision: https://reviews.llvm.org/D113557	2021-11-15 14:01:22 +00:00
Andy Yankovsky	95102b7dc3	[lldb] Unwrap the type when dereferencing the value The value type can be a typedef of a reference (e.g. `typedef int& myint`). In this case `GetQualType(type)` will return `clang::Typedef`, which cannot be casted to `clang::ReferenceType`. Fix a regression introduced in https://reviews.llvm.org/D103532. Reviewed By: teemperor Differential Revision: https://reviews.llvm.org/D113673	2021-11-15 14:48:19 +01:00
Valentin Clement	2a299e4f06	[fir] Add fir.select_type conversion placeholder As for D113662, this patch just add a place holder for the `fir.select_type` operation conversion. This operation is part of F20xx and is not implemented yet. This patch is part of the upstreaming effort from fir-dev branch. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D113878	2021-11-15 14:35:57 +01:00
Florian Hahn	112c1c346a	[IVDescriptor] Make sure the sign is included for negative extension. At the moment, computeRecurrenceType does not include any sign bits in the maximum bit width. If the value can be negative, this means the sign bit will be missing and the sext won't properly extend the value. If the value can be negative, increment the bitwidth by one to make sure there is at least one sign bit in the result value. Note that the increment is also needed if the value is known to be negative, as a sign bit needs to be preserved for the sext to work. Note that this at the moment prevents vectorization, because the analysis computes i1 as type for the recurrence when looking through the AND in lookThroughAnd. Fixes PR51794, PR52485. Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D113056	2021-11-15 13:12:57 +00:00
Mikhail Maltsev	6938270fa6	[libcxx] Fix enable_if condition of std::reverse_iterator::operator= The template std::is_assignable<T, U> checks that T is assignable from U. Hence, the order of operands in the instantiation of std::is_assignable in the std::reverse_iterator::operator= condition should be reversed. This issue remained unnoticed because std::reverse_iterator has an implicit conversion constructor. This patch adds a test to check that the assignment operator is used directly, without any implicit conversions. The patch also adds a similar test for std::move_iterator. Reviewed By: Quuxplusone, ldionne, #libc Differential Revision: https://reviews.llvm.org/D113417	2021-11-15 13:08:36 +00:00
James Henderson	254aa65d04	[llvm-nm][test] Move X86 lit.local.cfg into the X86 subfolder The file seems to have been put in the wrong place in its original commit. This had the effect of marking all llvm-nm tests as unsupported, unless X86 was enabled, even for tests that weren't X86 specific. Fixes https://bugs.llvm.org/show_bug.cgi?id=52506. Reviewed by: mstorsjo Differential Revision: https://reviews.llvm.org/D113882	2021-11-15 13:04:42 +00:00
Nicolas Vasilache	641fe70776	[mlir][Linalg] Fix and improve vectorization of depthwise convolutions. When trying to connect the vectorization of depthwise convolutions to e2e execution a number of problems surfaced. Fix an off-by-one error on the size of the input vector (similary to what was previously done for regular conv). Rewrite the lowering to vector.fma instead of vector.contract: the KW reduction dimension has already been unrolled and vector.contract requires a reduction dimension to be valid. Differential Revision: https://reviews.llvm.org/D113884	2021-11-15 12:58:05 +00:00
Nicolas Vasilache	ee80ffbf9a	[mlir][Linalg] Add bounded recursion declaration to FMAOp -> LLVM conversion. FMAOp -> LLVM conversion is done progressively by peeling off 1 dimension from FMAOp at each pattern iteration. Add the recursively bounded property declaration to the pattern so that the rewriter can apply it multiple times. Without this, FMAOps with 3+D do not lower to LLVM. Differential Revision: https://reviews.llvm.org/D113886	2021-11-15 12:41:52 +00:00
Alexander Belyaev	9b1d90e8ac	[mlir] Move min/max ops from Std to Arith. Differential Revision: https://reviews.llvm.org/D113881	2021-11-15 13:19:17 +01:00
Kristóf Umann	29a8d45c5a	[clang-tidy] Fix a crash in modernize-loop-convert around conversion operators modernize-loop-convert checks and fixes when a loop that iterates over the elements of a container can be rewritten from a for(...; ...; ...) style into the "new" C++11 for-range format. For that, it needs to parse the elements of that loop, like its init-statement, such as ItType it = cont.begin(). modernize-loop-convert checks whether the loop variable is initialized by a begin() member function. When an iterator is initialized with a conversion operator (e.g. for (const_iterator it = non_const_container.begin(); ...), attempts to retrieve the name of the initializer expression resulted in an assert, as conversion operators don't have a valid IdentifierInfo. I fixed this by making digThroughConstructors dig through conversion operators as well. Differential Revision: https://reviews.llvm.org/D113201	2021-11-15 13:11:29 +01:00
Butygin	2a3878ea16	[mlir] DialectConversion: fix OperationLegalizer::isIllegal result when legality callback returns None OperationLegalizer::isIllegal returns false if operation legality wasn't registered by user and we expect same behaviour when dynamic legality callback return None, but instead true was returned. Differential Revision: https://reviews.llvm.org/D113267	2021-11-15 14:53:06 +03:00
Nicolas Vasilache	f1c86b8354	[mlir][Linalg] Fix off-by-one error in conv vector size computation. Differential Revision: https://reviews.llvm.org/D113877	2021-11-15 11:37:44 +00:00
Hans Wennborg	5be64d4164	Revert "[x86] fold vector (X > -1) & Y to shift+andn" This casued assertion failures: llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp:9446: void llvm::SelectionDAG::ReplaceAllUsesWith(llvm::SDNode , llvm::SDNode ): Assertion `(!From->hasAnyUseOfValue(i) \|\| From->getValueType(i) == To->getValueType(i)) && "Cannot use this version of ReplaceAllUsesWith!"' failed. See comment on the code review. (Had to update some expectations in test/CodeGen/X86/vselect-zero.ll manually due to other changes having landed after the reverted one.) > and (pcmpgt X, -1), Y --> pandn (vsrai X, BitWidth-1), Y > > This avoids the -1 constant vector in favor of an arithmetic shift > instruction if it exists (the ISA is still not complete after all > these years...). > > We catch this pattern late in combining by matching PCMPGT, so it > should not interfere with more general folds. > > Differential Revision: https://reviews.llvm.org/D113603 This reverts commit `bf5748a1af`.	2021-11-15 12:35:49 +01:00
Matheus Izvekov	9fec50f001	[cmake] use project relative paths when generating ASTNodeAPI.json Signed-off-by: Matheus Izvekov <mizvekov@gmail.com> Reviewed By: stephenneuendorffer Differential Revision: https://reviews.llvm.org/D113664	2021-11-15 12:35:34 +01:00
Andrzej Warzynski	1e77b09538	[flang][CodeGen] Transform `fir.emboxchar` to a sequence of LLVM MLIR This patch extends the `FIRToLLVMLowering` pass in Flang by adding a hook to transform `fir.emboxchar` to a sequence of LLVM MLIR instructions. This is part of the upstreaming effort from the `fir-dev` branch in [1]. [1] https://github.com/flang-compiler/f18-llvm-project Differential Revision: https://reviews.llvm.org/D113666 Patch originally written by: Co-authored-by: Eric Schweitz <eschweitz@nvidia.com>	2021-11-15 11:09:12 +00:00
Simon Pilgrim	7bac1985f4	[DAG] SimplifyVBinOp - add SDLoc() argument Pass in SDLoc instead of (repeated) local creations in SimplifyVBinOp and scalarizeBinOpOfSplats	2021-11-15 10:43:56 +00:00
Simon Pilgrim	8658d20724	[DAG] SimplifyVBinOp - pull out repeated getValueType() call. NFC.	2021-11-15 10:43:55 +00:00
Matthias Springer	8835a1924e	[mlir][linalg][bufferize] Allow non-tensor mappings in BufferizationState This change makes it possible to set up custom mappings in a PostAnalysisStep. Some users of Comprehensive Bufferize have custom tensor types and it is most convenient to just reuse the same bvm. Also add some more assertions. Differential Revision: https://reviews.llvm.org/D113726	2021-11-15 19:40:30 +09:00
Nicolas Vasilache	c1a2985d7f	[mlir] NFC - Add VectorType::Builder to more easily build vector types from existing ones Differential Revision: https://reviews.llvm.org/D113875	2021-11-15 10:36:55 +00:00
Matthias Springer	542a8cfba7	[mlir][linalg][bufferize] Fix insertion point of result buffers Differential Revision: https://reviews.llvm.org/D113723	2021-11-15 19:27:33 +09:00
Jay Foad	4119da2f7c	[MachineVerifier] Live interval for a subreg must have subranges MachineVerifier verified the subranges of a live interval if they existed, but did not complain if they did not exist. This patch changes the verifier to complain if there are no subranges in the live interval for a subreg operand (so long as MachineRegisterInfo says we should be tracking subreg liveness for that register). This matches the conditions for LiveIntervalCalc to create subranges in the first place. Differential Revision: https://reviews.llvm.org/D112556	2021-11-15 10:13:35 +00:00
Pavel Labath	5e20cd6568	[lldb/test] Fix std-module vector tests to work with both kinds of vector layouts D112976 changed the layout and `0d62e31c45` andjusted the test expectations to match. This patch changes the tests to expect both versions, so that one can run the test suite against older libc++ versions as well.	2021-11-15 11:12:05 +01:00
Dmitry Preobrazhensky	91f4650ebb	[AMDGPU][MC][GFX10] Corrected global_atomic_fcmpswap* Corrected src data size of global_atomic_fcmpswap and global_atomic_fcmpswap_x2 opcodes. Differential Revision: https://reviews.llvm.org/D113746	2021-11-15 12:51:12 +03:00
David Green	4c3bfdc7f1	[ARM] Fix GatherScatter AddLikeOr condition	2021-11-15 09:44:41 +00:00
Matt Kulukundis	2d9bdd9dba	Fix a deadlock in __cxa_guard_abort in tsan hat tip: @The_Whole_Daisy for helping to isolate Reviewed By: dvyukov, fowles Differential Revision: https://reviews.llvm.org/D113713	2021-11-15 10:39:08 +01:00

... 2 3 4 5 6 ...

404910 Commits All Branches Search

404910 Commits

All Branches