llvm-project

Commit Graph

Author	SHA1	Message	Date
Zakk Chen	b1b22b4a85	[RISCV][Clang] Support policy functions for vneg, vnot, vncvt, vwcvt, vwcvtu, vfabs and vfneg. We will switch all UndefValue to PoisonValue in follow up patches. Reviewed By: kito-cheng Differential Revision: https://reviews.llvm.org/D126744	2022-08-02 17:27:55 +00:00
Fangrui Song	d884eb2bce	[test] Remove -fsanitize-coverage-whitelist=	2022-08-02 10:25:44 -07:00
Andrey Tretyakov	8468e67495	[SPIRV] Add tests to improve test coverage Differential Revision: https://reviews.llvm.org/D130597	2022-08-02 20:22:40 +03:00
Vladislav Dzhidzhoev	f6d9f00031	[DebugInfo] Test commit: update irrelevant comments Differential Revision: https://reviews.llvm.org/D130998	2022-08-02 20:21:24 +03:00
Slava Gurevich	d735307aa2	[LLDB][Reliability] Remove dead code. Remove redundant code that can never execute due to preceeding logic checks in the code. Differential Revision: https://reviews.llvm.org/D130929	2022-08-02 10:09:45 -07:00
Guozhi Wei	85a6dd50ad	[MIPS] Expose the ZERO register as a constant physical register The ZERO register should be exposed as a constant physical register through the interface TargetRegisterInfo::isConstantPhysReg. Differential Revision: https://reviews.llvm.org/D130932	2022-08-02 17:04:52 +00:00
Mark de Wever	da38bcfd52	[libc++][format] Improves generated files. This improves the formatting of the generated files. That allows it to remove the clang-format step in D129668. Reviewed By: ldionne, #libc Differential Revision: https://reviews.llvm.org/D130911	2022-08-02 18:56:02 +02:00
Mark de Wever	679169b7dd	[libc++][format] Enables feature-test macro. The macro is only enabled when the Clang is used with -fexperimental-library. Reviewed By: ldionne, #libc Differential Revision: https://reviews.llvm.org/D130792	2022-08-02 18:43:27 +02:00
Craig Topper	ae6877836e	[RISCV] Add scheduler classes to PseudoVMV*R_V. I think these pseudos will exist when the post-RA scheduler runs so they should have sched classes. Reviewed By: monkchiang Differential Revision: https://reviews.llvm.org/D130945	2022-08-02 09:38:32 -07:00
Craig Topper	2e5c516a3d	[RISCV] Add scheduler class to PseudoReadVLENB. Reviewed By: monkchiang Differential Revision: https://reviews.llvm.org/D130938	2022-08-02 09:38:32 -07:00
Alexander Timofeev	a321d95b59	[AMDGPU] avoid blind converting to VALU REG_SEQUENCE and PHIs In the `2e29b0138c` we introduce a specific solving algorithm that analyzes the VGPR to SGPR copies use chains and either lowers the copy to v_readfirstlane_b32 or converts the whole chain to VALU forms. Same time we still have the code that blindly converts to VALU REG_SEQUENCE and PHIs in case they produce SGPR but have VGPRs input operands. In case the REG_SEQUENCE and PHIs are in the VGPR to SGPR copy use chain, and this chain was considered long enough to convert copy to v_readfistlane_b32, further lowering them to VALU leads to several kinds of issues. At first, we have v_readfistlane_b32 which is completely useless because most parts of its use chain were moved to VALU forms. Second, we may encounter subtle bugs related to the EXEC-dependent CF because of the weird mixing of SALU and VALU instructions. This change removes the code that moves REG_SEQUENCE and PHIs to VALU. Instead, we use the fact that both REG_SEQUENCE and PHIs have copy semantics. That is, if they define SGPR but have VGPR inputs, we insert VGPR to SGPR copies to make them pure SGPR. Then, the new copies are processed by the common VGPR to SGPR lowering algorithm. This is Part 2 in the series of commits aiming at the massive refactoring of the SIFixSGPRCopies pass. Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D130367	2022-08-02 18:37:57 +02:00
Jay Foad	e301e071ba	[AMDGPU] Remove IR SpeculativeExecution pass from codegen pipeline This pass seems to have very little effect because all it does is hoist some instructions, but it is followed later in the codegen pipeline by the IR CodeSinking pass which does the opposite. Differential Revision: https://reviews.llvm.org/D130258	2022-08-02 17:35:20 +01:00
John Regehr	71d1bd1457	llvm-reduce: reorder passes to run the ones first that delete function bodies; this makes reductions go faster	2022-08-02 10:32:49 -06:00
Jay Foad	c24d68fff1	[AMDGPU] Take advantage of VOP3 literals in convertToThreeAddress This improves a corner case where v_fmac can be converted to v_fma on GFX10+ even if it has a literal operand. Differential Revision: https://reviews.llvm.org/D130992	2022-08-02 17:27:11 +01:00
Alok Kumar Sharma	5ec6ea3dfd	[clang][OpenMP][DebugInfo] Mark OpenMP generated functions as artificial The Clang compiler generates internal functions for OpenMP. Current patch marks these functions as artificial. Reviewed By: aprantl Differential Revision: https://reviews.llvm.org/D111521	2022-08-02 21:24:46 +05:30
Philip Reames	0b47615fcf	[LV] Recognize store of invariant value to invariant address as uniform This extends the handling of uniform memory operations to handle the case where a store is storing a loop invariant value. Unlike the general case of a store to an invariant address where we must use the last active lane, in this case we can use any lane since all lanes must produce the same result. For context, the basic structure of the existing code and how the change fits in: * First, we select a widening strategy. (The result is irrelevant for this patch.) * Then we determine if a computation is uniform within all lanes of VF. (Note this is the uniform-per-part definition, not LAI's uniform across all unrolled iterations definition.) * If it is, we overrule the widening strategy, and unconditionally scalarize. * VPReplicationRecipe - which is what actually does the scalarization - knows how to handle unform-per-part values including for scalable vectors. However, we do need to know that the expression is safe to execute without predication - e.g. the uniform mem op was unconditional in the original loop. (This part was split off and already landed.) An obvious question is why not simply implement the generic case? The answer is that I'm going to, but doing so without a canonicalization towards uniform causes regressions due to bad interaction with scalarization/uniformity of values feeding the uniform mem-op. This patch is needed to avoid those regressions. Differential Revision: https://reviews.llvm.org/D130364	2022-08-02 08:09:49 -07:00
Peixin Qiao	48b6f5c708	[flang] Add some semantic checks for derived type with BIND attribute This supports checks in C1801-C1805 for derived type with BIND attribute. The other compilers such as 'gfortran' and 'ifort' do not report error for C1802 and C1805, so emit warnings for them. Reviewed By: klausler Differential Revision: https://reviews.llvm.org/D130438	2022-08-02 23:07:02 +08:00
Peixin Qiao	1f9212d8d5	[flang] Support extention intrinsic ABORT The semantic checks and runtime have been supported. This supports the lowering of intrinsic ABORT. `gfortran` prints a backtrace before abort, unless `-fno-backtrace` is given. This is good to use. The intrinsic BACKTRACE is not supported yet, so add TODO in the runtime. This extention is needed in SPEC2017 521.wrf_r in https://github.com/llvm/llvm-project/issues/55955. Reviewed By: klausler Differential Revision: https://reviews.llvm.org/D130439	2022-08-02 23:02:12 +08:00
Peixin Qiao	9b867928f4	[flang] Support lowering of intrinsic module procedure `c_loc` As Fortran 2018 18.2.3.6, the intrinsic `c_loc(x)` gets the C address of argument `x`. It returns the scalar of type C_PTR. As defined in iso_c_binding in `flang/module/__fortran_builtins.f90`, C_PTR is the derived type with only one component of integer 64. This supports the lowering of intrinsic module procedure `c_loc` by converting the address of argument into integer 64, where the argument is lowered as Box and the address is generated using fir.box_addr. The lowering of intrinsic `c_funloc` has the similar characteristic and will be supported later. The execution tests for various data types are in issue https://github.com/llvm/llvm-project/issues/56552. Reviewed By: Jean Perier Differential Revision: https://reviews.llvm.org/D129659	2022-08-02 22:59:28 +08:00
Phoebe Wang	23021d4d8c	[X86][FP16] Fix vector_shuffle and lowering without f16c feature problems The problem Alexander reported on D127982 was caused by an optimization for AVX512-FP16 instruction. We must limit it to the feature enabled only. During the investigation, I found we didn't expand for fp_round/fp_extend without F16C. This may result runtime crash, so change them too. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D130817	2022-08-02 22:26:41 +08:00
Pavel Labath	6093a77caf	[lldb] Create an enum to specify the kind of ArchSpec matching s/true/ArchSpec::ExactMatch s/false/ArchSpec::CompatibleMatch Differential Revision: https://reviews.llvm.org/D121290	2022-08-02 16:20:13 +02:00
Gabriel Ravier	5dbd8faad5	[lld] Fixed a number of typos I went over the output of the following mess of a command: `(ulimit -m 2000000; ulimit -v 2000000; git ls-files -z \| parallel --xargs -0 cat \| aspell list --mode=none --ignore-case \| grep -E '^[A-Za-z][a-z]*$' \| sort \| uniq -c \| sort -n \| grep -vE '.{25}' \| aspell pipe -W3 \| grep : \| cut -d' ' -f2 \| less)` and proceeded to spend a few days looking at it to find probable typos and fixed a few hundred of them in all of the llvm project (note, the ones I found are not anywhere near all of them, but it seems like a good start). Differential Revision: https://reviews.llvm.org/D130982	2022-08-02 09:52:31 -04:00
Aaron Ballman	c783ca0de1	Revert "Missing tautological compare warnings due to unary operators" This reverts commit `0cc3c184c7`. The changes did not account for templated code where one instantiation may trigger the diagnostic but other instantiations will not, as in: ``` template <int I, class T> void foo(int x) { bool b1 = (x & sizeof(T)) == 8; bool b2 = (x & I) == 8; bool b3 = (x & 4) == 8; } void run(int x) { foo<4, int>(8); } ```	2022-08-02 09:39:36 -04:00
Simon Tatham	07e6eb6e75	[yaml2obj] Add a `-E` flag to preprocess only. If you're having trouble getting a yaml2obj macro expansion to do what you want, it's useful to be able to print the output of the preprocessing to see what your macros expanded to //before// going into the YAML processing phase. yaml2obj has its own preprocessing system which isn't the same as any other well-known thing like cpp. So there's no way to do this macro expansion via another tool: yaml2obj will have to do it itself. In this commit I add an `-E` flag to yaml2obj to do that. Differential Revision: https://reviews.llvm.org/D130981	2022-08-02 13:56:27 +01:00
Jay Foad	bb2832410e	[IRBuilder] CreateIntrinsic with implicit mangling Add a new IRBuilderBase::CreateIntrinsic which takes the return type and argument values for the intrinsic call but does not take an explicit list of types to mangle. Instead the builder works this out from the intrinsic declaration and the types of the supplied arguments. This means that the mangling is hidden from the client, which in turn means that intrinsic definitions can change which arguments are mangled without requiring any changes to the client code. Differential Revision: https://reviews.llvm.org/D130776	2022-08-02 13:08:35 +01:00
David Green	1206f72e31	[AArch64] Fold Mul(And(Srl(X, 15), 0x10001), 0xffff) to CMLTz This folds a v4i32 Mul(And(Srl(X, 15), 0x10001), 0xffff) into a v8i16 CMLTz instruction. The Srl and And extract the top bit (whether the input is negative) and the Mul sets all values in the i16 half to all 1/0 depending on if that top bit was set. This is equivalent to a v8i16 CMLTz instruction. The same applies to other sizes with equivalent constants. Differential Revision: https://reviews.llvm.org/D130874	2022-08-02 13:01:59 +01:00
Muhammad Omair Javaid	a1bf0c0894	[LLDB] Skip buildbot failures AArch64/Windows TestInlineStepping.py is flaky while TestUseSourceCache.py fails on Windows 11 only. Marked them skipped to make buildbot happy.	2022-08-02 16:59:12 +05:00
jacquesguan	008ea1c201	[mlir][Math] Add constant folder for TanhOp. This patch adds constant folder for TanhOp which only supports single and double precision floating-point. Differential Revision: https://reviews.llvm.org/D130960	2022-08-02 19:37:27 +08:00
David Green	29f97ec845	[AArch64] Mul fold tests for D130874. NFC	2022-08-02 12:29:40 +01:00
Alex Zinenko	64bb0ae75f	[mlir] add TOC to top-level documents Multiple top-level MLIR documents did not have a table of contents tag, making them harder to nagivate.	2022-08-02 13:22:40 +02:00
Igor Zhukov	8be1197285	[libc++] Implement P2499R0 (`string_view` range constructor should be `explicit`) Reviewed By: #libc, philnik, Mordante Spies: Mordante, jloser, philnik, libcxx-commits Differential Revision: https://reviews.llvm.org/D130785	2022-08-02 12:47:45 +02:00
Gabriel Ravier	c23e2c015f	[libcxx] Fixed a number of typos I went over the output of the following mess of a command: `(ulimit -m 2000000; ulimit -v 2000000; git ls-files -z \| parallel --xargs -0 cat \| aspell list --mode=none --ignore-case \| grep -E '^[A-Za-z][a-z]*$' \| sort \| uniq -c \| sort -n \| grep -vE '.{25}' \| aspell pipe -W3 \| grep : \| cut -d' ' -f2 \| less)` and proceeded to spend a few days looking at it to find probable typos and fixed a few hundred of them in all of the llvm project (note, the ones I found are not anywhere near all of them, but it seems like a good start). Reviewed By: #libc, philnik Spies: philnik, libcxx-commits, mgorny, arichardson Differential Revision: https://reviews.llvm.org/D130905	2022-08-02 12:42:34 +02:00
Simon Pilgrim	b651fdff79	[DAG] matchRotateSub - ensure the (pre-extended) shift amount is wide enough for the amount mask (PR56859) matchRotateSub is given shift amounts that will already have stripped any/zero-extend nodes from - so make sure those values are wide enough to take a mask.	2022-08-02 11:38:52 +01:00
Adrian Kuegel	b395c0f0cd	[mlir] Update comment now that DenseArrayAttr has Tensor type.	2022-08-02 12:31:22 +02:00
Andrzej Warzynski	b13448c56c	[flang][docs][nfc] Refine FlangOptionsDocs.td Currently, FlangOptionsDocs.td doesn't specify `ExcludedFlags` which means that in the generated documentation file we expose flags that: * we don't necessarily won't to advertise to our users (e.g. hidden flags), or * are not supported altogether (e.g. CL options). This patch defines `ExcludeFlags` to fix that. The definition of `ExcludeFlags` was copied from Clang so that LLVM frontends have consistent documentation. It might be a bit counter-intuitive that IncludeFlags alone is not sufficient here. However, the current logic in ClangOptionDocEmitter.cpp will parse IncludeFlags and print all options that contains one of the included flags, as well as their aliases. So, for example, for -fopenmp (which is a supported Flang option), one would also get /fopenmp (i.e. CL mode equivalent for -fopenmp). By adding ExcludeFlags, we make sure that such aliases are excluded. I've also taken the liberty and moved FlangOptionsDocs.td. Originally it was located in Flang's "flang/include" directory, but there shouldn't be any implementation/documentation files there. Instead, I'm moving it to the "flang/docs" directory. Differential Revision: https://reviews.llvm.org/D130558	2022-08-02 10:04:17 +00:00
Dawid Jurczak	78650b7861	[NFC] Remove some boilerplate from SmallVector header In SmallVector header we use couple of times exactly same enable_if, in this change we de-duplicate it and declare only once. Extracted from: https://reviews.llvm.org/D129990 Differential Revision: https://reviews.llvm.org/D130779	2022-08-02 11:45:55 +02:00
Stephan Herhut	09ca1c0656	[mlir] Use EXPECT_DEBUG_DEATH in unit test This allows the tests to also pass when compiled in opt mode.	2022-08-02 11:16:26 +02:00
David Sherwood	4ef9cb6c17	[AArch64][LoopVectorize] Disable tail-folding for SVE when loop has interleaved accesses If we have interleave groups in the loop we want to vectorise then we should fall back on normal vectorisation with a scalar epilogue. In such cases when tail-folding is enabled we'll almost certainly go on to create vplans with very high costs for all vector VFs and fall back on VF=1 anyway. This is likely to be worse than if we'd just used an unpredicated vector loop in the first place. Once the vectoriser has proper support for analysing all the costs for each combination of VF and vectorisation style, then we should be able to remove this. Added an extra test here: Transforms/LoopVectorize/AArch64/sve-tail-folding-option.ll Differential Revision: https://reviews.llvm.org/D128342	2022-08-02 09:52:33 +01:00
Alex Bradbury	5ad59c9e59	[RISCV][NFCI] Set TransientStackAlignment and rely on it rather than RVV-specific logic on RVV-less functions * TargetFrameLowering has a TransientStackAlignment field that "returns the number of bytes to which the stack pointer must be aligned at all times, even between calls. * As explained in the [RISC-V calling convention](https://github.com/riscv-non-isa/riscv-elf-psabi-doc/blob/master/riscv-cc.adoc), the stack pointer must remain fully aligned throughout execution for compliant code. This is important for embedded targets that might avoid realigning the stack pointer for interrupt service routines. Systems running full OSes may always realign the stack anyway. * TransientStackAlignment is used in estimateStackSize in MachineFrameInfo and in PEI::calculateFrameObjectOffsets. * estimateStackSize is only used in the RISC-V backend for scavenging slots. It may be possible to craft a function where the difference is observable, but it wouldn't be a meaningful test. * calculateFrameObjectOffsets makes use of TransientStackAlignment, but then sets the stack alignment to the max of that alignment and MaxAlign, which is unconditionally set to 16 in RISCVFrameLowering::processFunctionBeforeFrameFinalized * I've changed this logic to only set MaxAlign if there are RVV frame objects. There should be no functional change here for either RVV targets (MaxAlign is set as before) or non-RVV targets (TransientStackAlign is now 16 anyway). Differential Revision: https://reviews.llvm.org/D130068	2022-08-02 09:46:06 +01:00
Tim Northover	b586dc21a7	Outliner: add "target-cpu" feature from source function to outlined The CPU is used to determine which inline asm instructions are allowed, so needs to be copied across in case the outlined function contains any.	2022-08-02 09:33:29 +01:00
wanglian	e208bab55f	[RISCV][NFC] Use defined variable instead some code. Reviewed By: asb Differential Revision: https://reviews.llvm.org/D130687	2022-08-02 16:26:33 +08:00
Martin Storsjö	3f25ad335b	[OpenMP] Fix warnings about unused expressions when OMPT_LOOP_DISPATCH is a no-op. NFC. This fixes warnings like these: ../runtime/src/kmp_dispatch.cpp:2159:24: warning: left operand of comma operator has no effect [-Wunused-value] OMPT_LOOP_DISPATCH(p_lb, p_ub, pr->u.p.st, status); ^~~~~ ../runtime/src/kmp_dispatch.cpp:2159:31: warning: left operand of comma operator has no effect [-Wunused-value] OMPT_LOOP_DISPATCH(p_lb, p_ub, pr->u.p.st, status); ^~~~~ ../runtime/src/kmp_dispatch.cpp:2159:46: warning: left operand of comma operator has no effect [-Wunused-value] OMPT_LOOP_DISPATCH(p_lb, p_ub, pr->u.p.st, status); ~~~~~~~ ^~ ../runtime/src/kmp_dispatch.cpp:2159:50: warning: expression result unused [-Wunused-value] OMPT_LOOP_DISPATCH(p_lb, p_ub, pr->u.p.st, status); ^~~~~~	2022-08-02 11:16:23 +03:00
Tom Stellard	e0a3964aff	workflows: Fix error when searching for backport reviewers	2022-08-02 01:09:02 -07:00
jacquesguan	e38af7ba95	[LV] Refactor getExtendedAddReductionCost to support other extended reduction more than Add. Now the API getExtendedAddReductionCost is used to determine the cost of extended Add reduction with optional Mul. For Arm, it could cover the cases. But for other target, for example: RISCV, they support other kinds of extended recution, such as FAdd. This patch does the following changes: 1, Split getExtendedAddReductionCost into 2 new API: getExtendedReductionCost which handles the extended reduction with addtional input of Opcode; getMulAccReductionCost which handle the MLA cases the getExtendedAddReductionCost. 2, Refactor getReductionPatternCost, add some contraint condition to make sure the getMulAccReductionCost should only handle the reuction of Add + Mul. Differential Revision: https://reviews.llvm.org/D130868	2022-08-02 16:02:38 +08:00
Martin Storsjö	112499f35f	[mlir] Fix calling the native mlir-tblgen tool when cross compiling flang When the mlir-tblgen tool is set up, the `MLIR_TABLEGEN_EXE` variable is set, which either points to the mlir-tblgen tool built in the current cmake build, or points to one built in a nested cmake build (if cross conpiling, or if building with e.g. `LLVM_OPTIMIZED_TABLEGEN`. The `MLIR_TABLEGEN_EXE` variable is only set within the scope of the mlir/CMakeLists.txt file, so it's unavailable in sibling level projects such as flang. Set the `MLIR_TABLEGEN_EXE` and the `MLIR_TABLEGEN_TARGET` variables as global, so that flang can use them properly without guessing. Differential Revision: https://reviews.llvm.org/D130350	2022-08-02 10:58:24 +03:00
Martin Storsjö	46b5ab15cd	[flang] Don't try to run the newly built flang-new when cross compiling If CMAKE_CROSSCOMPILING, then the newly built flang-new executable was cross compiled and thus can't be executed on the build system, and thus can't be used for generating module files. Differential Revision: https://reviews.llvm.org/D130349	2022-08-02 10:57:57 +03:00
Martin Storsjö	b7c5683fac	[lldb] Silence a GCC warning about missing returns after a fully covered switch. NFC.	2022-08-02 10:57:33 +03:00
Martin Storsjö	7f24fd26a8	[OpenMP] Only include CMAKE_DL_LIBS on unix platforms CMAKE_DL_LIBS is documented as "Name of library containing dlopen and dlclose". On Windows platforms, there's no system provided dlopen/dlclose, but it can be argued that if you really intend to call dlopen/dlclose, you're going to be using a third party compat library like https://github.com/dlfcn-win32/dlfcn-win32, and CMAKE_DL_LIBS should expand to its name. This has been argued upstream in CMake in https://gitlab.kitware.com/cmake/cmake/-/issues/17600 and https://gitlab.kitware.com/cmake/cmake/-/merge_requests/1642, that CMAKE_DL_LIBS should expand to "dl" on mingw platforms. The merge request wasn't merged though, as it caused some amount of breakage, but in practice, Fedora still carries a custom CMake patch with the same effect. Thus, this patch fixes cross compiling OpenMP for mingw targets on Fedora with their custom-patched CMake. Differential Revision: https://reviews.llvm.org/D130892	2022-08-02 10:56:30 +03:00
Fangrui Song	afb785f511	[Driver] Remove Separate form for XRay options Supporting something like `-fxray-instruction-threshold= 1` is not intended.	2022-08-02 00:47:37 -07:00
Nikita Popov	62e4ee2def	[LangRef] Fix typo in GEP docs Introduced in D130356, reported here: https://reviews.llvm.org/rG7ac7ec820296#inline-7690	2022-08-02 09:30:17 +02:00

... 2 3 4 5 6 ...

431965 Commits All Branches Search

431965 Commits

All Branches