llvm-project

Commit Graph

Author	SHA1	Message	Date
Georgii Rymar	6ae5b9e405	[llvm-readobj] - Make decode_relrs() don't return Expected<>. NFCI. The `decode_relrs` helper is declared as: `Expected<std::vector<Elf_Rel>> decode_relrs(Elf_Relr_Range relrs) const;` it never returns an error though and hence can be simplified to return a vector. Differential revision: https://reviews.llvm.org/D85302	2020-08-05 17:05:47 +03:00
Joel E. Denny	5ab43989c3	[OpenMP] Fix `omp target update` for array extension OpenMP TR8 sec. 2.15.6 "target update Construct", p. 183, L3-4 states: > If the corresponding list item is not present in the device data > environment and there is no present modifier in the clause, then no > assignment occurs to or from the original list item. L10-11 states: > If a present modifier appears in the clause and the corresponding > list item is not present in the device data environment then an > error occurs and the program termintates. (OpenMP 5.0 also has the first passage but without mention of the present modifier of course.) In both passages, I assume "is not present" includes the case of partially but not entirely present. However, without this patch, the target update directive misbehaves in this case both with and without the present modifier. For example: ``` #pragma omp target enter data map(to:arr[0:3]) #pragma omp target update to(arr[0:5]) // might fail on data transfer #pragma omp target update to(present:arr[0:5]) // might fail on data transfer ``` The problem is that `DeviceTy::getTgtPtrBegin` does not return a null pointer in that case, so `target_data_update` sees the data as fully present, and the data transfer then might fail depending on the target device. However, without the present modifier, there should never be a failure. Moreover, with the present modifier, there should always be a failure, and the diagnostic should mention the present modifier. This patch fixes `DeviceTy::getTgtPtrBegin` to return null when `target_data_update` is the caller. I'm wondering if it should do the same for more callers. Reviewed By: grokos, jdoerfert Differential Revision: https://reviews.llvm.org/D85246	2020-08-05 10:03:31 -04:00
Joel E. Denny	03bb545b68	[OpenMP][Docs] Mark `present` map type modifier as done	2020-08-05 10:03:31 -04:00
Joel E. Denny	26cf9c1704	[OpenMP][Docs] Add map clause reordering status as unclaimed	2020-08-05 10:03:31 -04:00
Joel E. Denny	002d61db2b	[OpenMP] Fix `present` for exit from `omp target data` Without this patch, the following example fails but shouldn't according to OpenMP TR8: ``` #pragma omp target enter data map(alloc:i) #pragma omp target data map(present, alloc: i) { #pragma omp target exit data map(delete:i) } // fails presence check here ``` OpenMP TR8 sec. 2.22.7.1 "map Clause", p. 321, L23-26 states: > If the map clause appears on a target, target data, target enter > data or target exit data construct with a present map-type-modifier > then on entry to the region if the corresponding list item does not > appear in the device data environment an error occurs and the > program terminates. There is no corresponding statement about the exit from a region. Thus, the `present` modifier should: 1. Check for presence upon entry into any region, including a `target exit data` region. This behavior is already implemented correctly. 2. Should not check for presence upon exit from any region, including a `target` or `target data` region. Without this patch, this behavior is not implemented correctly, breaking the above example. In the case of `target data`, this patch fixes the latter behavior by removing the `present` modifier from the map types Clang generates for the runtime call at the end of the region. In the case of `target`, we have not found a valid OpenMP program for which such a fix would matter. It appears that, if a program can guarantee that data is present at the beginning of a `target` region so that there's no error there, that data is also guaranteed to be present at the end. This patch adds a comment to the runtime to document this case. Reviewed By: grokos, RaviNarayanaswamy, ABataev Differential Revision: https://reviews.llvm.org/D84422	2020-08-05 10:03:31 -04:00
Denis Antrushin	d21ce40821	[Statepoints] Operand folding in presense of tied registers. Implement proper folding of statepoint meta operands (deopt and GC) when statepoint uses tied registers. For deopt operands it is just about properly preserving tiedness in new instruction. For tied GC operands folding is a little bit more tricky. We can fold tied GC operands only from InlineSpiller, because it knows how to properly reload tied def after it was turned into memory operand. Other users (e.g. peephole) cannot properly fold such operands as they do not know how (or when) to reload them from memory. We do this by un-tieing operand we want to fold in InlineSpiller and allowing to fold only untied operands in foldPatchpoint.	2020-08-05 20:18:28 +07:00
Bruno Ricci	4dcbb9cef7	[clang] Add -fno-delayed-template-parsing to the added unit tests in DeclPrinterTest.cpp	2020-08-05 14:13:05 +01:00
Alex Zinenko	75f239e975	[mlir] Initial version of C APIs Introduce an initial version of C API for MLIR core IR components: Value, Type, Attribute, Operation, Region, Block, Location. These APIs allow for both inspection and creation of the IR in the generic form and intended for wrapping in high-level library- and language-specific constructs. At this point, there is no stability guarantee provided for the API. Reviewed By: stellaraccident, lattner Differential Revision: https://reviews.llvm.org/D83310	2020-08-05 15:04:08 +02:00
Roman Lebedev	f5df5cd558	Recommit "[InstCombine] Negator: -(X << C) --> X * (-1 << C)" This reverts commit `ac70b37a00` which reverted commit `8aeb2fe13a` because codegen tests got broken and i needed time to investigate. This shows some regressions in tests, but they are all around GEP's, so i'm not really sure how important those are. https://rise4fun.com/Alive/1Gn	2020-08-05 15:59:13 +03:00
Nico Weber	cc26121858	[gn build] (manually) merge `3ab01550b` This reverts commit `0bbaacc8ca` and `2ad56119f5` which merged `10b1b4a23` (and follow-ups), since that change was reverted in `3ab01550b`.	2020-08-05 08:56:14 -04:00
Bruno Ricci	f7a039de7a	[clang][NFC] DeclPrinter: use NamedDecl::getDeclName instead of NamedDecl::printName to print the name of enumerations, namespaces and template parameters. NamedDecl::printName will print the pretty-printed name of the entity, which is not what we want here (we should print "enum { e };" instead of "enum (unnamed enum at input.cc:1:5) { e };"). For now only DecompositionDecl and MDGuidDecl have an overloaded printName so this does not result in any functional change, but this change is needed since I will be adding overloads to better handle unnamed entities in diagnostics.	2020-08-05 13:54:38 +01:00
Bruno Ricci	94b43118e2	[clang][NFCI] Get rid of ConstantMatrixTypeBitfields to avoid increasing the size of every type. sizeof(ConstantMatrixTypeBitfields) > 8 which increases the size of every type. This was not detected because no corresponding static_assert for its size was added. To prevent this from occuring again replace the various static_asserts for the size of each of the bit-field classes by a single static_assert for the size of Type. I have left ConstantMatrixType::MaxElementsPerDimension unchanged since the limit is exercised by multiple tests.	2020-08-05 13:54:37 +01:00
Bruno Ricci	19701458d4	[clang][nearly-NFC] Remove some superfluous uses of NamedDecl::getNameAsString `OS << ND->getDeclName();` is equivalent to `OS << ND->getNameAsString();` without the extra temporary string. This is not quite a NFC since two uses of `getNameAsString` in a diagnostic are replaced, which results in the named entity being quoted with additional "'"s (ie: 'var' instead of var).	2020-08-05 13:54:37 +01:00
Bruno Ricci	6f2fa9d312	[clang][NFC] Document NamedDecl::printName	2020-08-05 13:54:37 +01:00
Bruno Ricci	bc29634b93	[clang][NFC] Remove an old workaround for MSVC 2013	2020-08-05 13:54:37 +01:00
Bruno Ricci	98b4b45705	[clang][NFC] Add a test showcasing an unnamed template parameter in a diagnostic	2020-08-05 13:54:36 +01:00
Bruno Ricci	00b89f66f9	[clang][NFC] Remove spurious +x flag on DeclTemplate.cpp and DeclTemplate.h	2020-08-05 13:54:30 +01:00
Alex Zinenko	4e491570b5	[mlir] Remove LLVMTypeTestDialect This dialect was introduced during the bring-up of the new LLVM dialect type system for testing purposes. The main LLVM dialect now uses the new type system and the test dialect is no longer necessary, so remove it. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D85224	2020-08-05 14:39:36 +02:00
Yaxun (Sam) Liu	45f2a56856	[CUDA][HIP] Support accessing static device variable in host code for -fno-gpu-rdc nvcc supports accessing file-scope static device variables in host code by host APIs like cudaMemcpyToSymbol etc. CUDA/HIP let users access device variables in host code by shadow variables. In host compilation, clang emits a shadow variable for each device variable, and calls __RegisterVariable to register it in init function. The address of the shadow variable and the device side mangled name of the device variable is passed to __RegisterVariable. Runtime looks up the symbol by name in the device binary to find the address of the device variable. The problem with static device variables is that they have internal linkage, therefore their name may be changed by the linker if there are multiple symbols with the same name. Also they end up as local symbols in the elf file, whereas the runtime only looks up the global symbols. Another reason for making the static device variables external linkage is that they may be initialized externally by host code and their final value may be accessed by host code after kernel execution, therefore they actually have external linkage. Giving them internal linkage will cause incorrect optimizations on them. To support accessing static device var in host code for -fno-gpu-rdc mode, change the intnernal linkage to external linkage. The name does not need change since there is only one TU for -fno-gpu-rdc mode. Also the externalization is done only if the device static var is referenced by host code. Differential Revision: https://reviews.llvm.org/D80858	2020-08-05 07:57:38 -04:00
Sam Parker	f2675ab45f	[ARM][CostModel] Implement getCFInstrCost As with other targets, set the throughput cost of control-flow instructions to free so that we don't miss out of vectorization opportunities. Differential Revision: https://reviews.llvm.org/D85283	2020-08-05 12:44:51 +01:00
Simon Pilgrim	7b993903e0	DWARFVerifier.h - remove unnecessary forward declarations and includes. NFCI.	2020-08-05 12:42:44 +01:00
Alex Zinenko	bdb9295664	[mlir] Fix convert-to-llvmir.mlir test broken due to syntax change The syntax of the LLVM dialect types changed between the time the code was written and it was submitted, leading to a test failure. Update the syntax.	2020-08-05 13:29:35 +02:00
Xing GUO	bd7f3f8a3e	[obj2yaml] Add support for dumping the .debug_aranges section. This patch adds support for dumping DWARF sections to obj2yaml. The .debug_aranges section is used to illustrate the basic idea. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D85094	2020-08-05 19:19:05 +08:00
Alex Cameron	a44161692a	Support member expressions in bugprone-bool-pointer-implicit-conversion. This addresses PR45189.	2020-08-05 07:14:28 -04:00
Simon Pilgrim	315e1daf7f	GISelWorkList.h - remove unnecessary includes. NFCI.	2020-08-05 12:00:28 +01:00
Simon Pilgrim	e3d3657b9b	CallLowering.h - remove unnecessary CCState forward declaration. NFCI. Already defined in CallingConvLower.h	2020-08-05 12:00:28 +01:00
Simon Pilgrim	300899b9c4	[X86][AVX] Add test showing unnecessary duplicate HADD instructions Taken from internal fuzz test	2020-08-05 12:00:27 +01:00
Hans Wennborg	3ab01550b6	Revert "[CMake] Simplify CMake handling for zlib" This quietly disabled use of zlib on Windows even when building with -DLLVM_ENABLE_ZLIB=FORCE_ON. > Rather than handling zlib handling manually, use find_package from CMake > to find zlib properly. Use this to normalize the LLVM_ENABLE_ZLIB, > HAVE_ZLIB, HAVE_ZLIB_H. Furthermore, require zlib if LLVM_ENABLE_ZLIB is > set to YES, which requires the distributor to explicitly select whether > zlib is enabled or not. This simplifies the CMake handling and usage in > the rest of the tooling. > > This is a reland of `abb0075` with all followup changes and fixes that > should address issues that were reported in PR44780. > > Differential Revision: https://reviews.llvm.org/D79219 This reverts commit `10b1b4a231` and follow-ups `64d99cc6ab` and `f9fec0447e`.	2020-08-05 12:31:44 +02:00
Paul Walker	927fc536ca	[SVE] Add lowering for fixed length vector and, or & xor operations. Since there are no ill effects when performing these operations with undefined elements, they are lowered to the already supported unpredicated scalable vector equivalents. Differential Revision: https://reviews.llvm.org/D85117	2020-08-05 11:28:34 +01:00
Simon Pilgrim	4aaf301fb8	[DAG] Fold vector (aext (load x)) -> (zext (truncate (zextload x))) We currently don't do anything to fold any_extend vector loads as no target has such an instruction. Instead I've added support for folding to a zextload, SimplifyDemandedBits does a good job of adjusting the zext(truncate(()) stages as required later on. We still need the custom scalar extload handling instead of using the tryToFoldExtOfLoad helper as it has different legality tests - we can probably tweak that to reduce most of the code duplication. Fixes the regression I mentioned in rG99a971cadff7 Differential Revision: https://reviews.llvm.org/D85129	2020-08-05 11:22:23 +01:00
Luboš Luňák	188187f062	[lldb] expect TestGuiBasicDebug.py failure on aarch64 http://lab.llvm.org:8011/builders/lldb-aarch64-ubuntu/builds/7287/steps/test/logs/stdio fails, and the output suggests that gui 'finish' (='thread step-out') is broken on aarch64.	2020-08-05 12:21:05 +02:00
Arpith C. Jacob	fab4b59961	[mlir] Conversion of ViewOp with memory space to LLVM. Handle the case where the ViewOp takes in a memref that has an memory space. Reviewed By: ftynse, bondhugula, nicolasvasilache Differential Revision: https://reviews.llvm.org/D85048	2020-08-05 12:19:52 +02:00
Alexander Belyaev	a3d427d30c	[mlir] Lower RankOp to LLVM for unranked memrefs. Differential Revision: https://reviews.llvm.org/D85273	2020-08-05 12:13:43 +02:00
Georgii Rymar	f97019ad6e	[llvm-readobj/elf] - Add a testing for --stackmap and refine the implementation. Currently, we only test the `--stackmap` option here: https://github.com/llvm/llvm-project/blob/master/llvm/test/Object/stackmap-dump.test it uses a precompiled MachO binary currently and I've found no tests for this option for ELF. The implementation also has issues. For example, it might assert on a wrong version of the .llvm-stackmaps section. Or it might crash on an empty or truncated section. This patch introduces a new tools/llvm-readobj/ELF test file as well as implements a few basic checks to catch simple crashes/issues It also eliminates `unwrapOrError` calls in `printStackMap()`. Differential revision: https://reviews.llvm.org/D85208	2020-08-05 13:09:04 +03:00
Benjamin Kramer	c558c22cab	[llvm-symbolizer] Add legacy aliases -demangle=true and -demangle=false. This is used in the wild, don't break compatibility for no good reason. https://github.com/google/pprof/blob/master/internal/binutils/addr2liner_llvm.go	2020-08-05 12:07:46 +02:00
Jordan Rupprecht	4963ca4658	[docs] Document pattern of using CHECK-SAME to skip irrelevant lines This came up during the review for D67656. It's nice but also subtle, so documenting it as an idiom will make tests easier to understand. Reviewed By: probinson Differential Revision: https://reviews.llvm.org/D68061	2020-08-05 11:03:56 +01:00
Luboš Luňák	138281904b	Revert "[lldb] temporary commit to see why a test is failing only on lldb-aarch64-ubuntu" This reverts commit `21f142ce1d`.	2020-08-05 11:55:02 +02:00
Luboš Luňák	21f142ce1d	[lldb] temporary commit to see why a test is failing only on lldb-aarch64-ubuntu	2020-08-05 11:54:14 +02:00
Frederik Gossen	4cd923784e	[MLIR][Shape] Expose extent tensor type builder The extent tensor type is a `tensor<?xindex>` that is used in the shape dialect. To facilitate the use of this type when working with the shape dialect, we expose the helper function for its construction. Differential Revision: https://reviews.llvm.org/D85121	2020-08-05 09:42:57 +00:00
Pierre Gousseau	14948a08f3	[compiler-rt] Normalize some in/out doxygen parameter in interface headers. NFC. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D84945	2020-08-05 10:17:25 +01:00
Tatyana Krasnukha	bc056b3aa7	[lldb] Suppress MSVC warning C4065 MSVC reports "switch statement contains 'default' but no 'case' labels". Suppress, as this was intended behavior.	2020-08-05 11:59:48 +03:00
Tatyana Krasnukha	75012a8044	[lldb] Use PyUnicode_GetLength instead of PyUnicode_GetSize PyUnicode_GetSize is deprecated since Python version 3.3.	2020-08-05 11:59:47 +03:00
Tatyana Krasnukha	cc68c122cd	[lldb/TestingSupport] Manually disable GTEST_HAS_TR1_TUPLE Gtest 1.8.0 uses tr1::tuple which is deprecated on MSVC. We have to force it off to avoid the compiler warnings, which will become errors after switching on C++17 (https://devblogs.microsoft.com/cppblog/c17-feature-removals-and-deprecations).	2020-08-05 11:59:46 +03:00
David Turner	ba0e71432a	Do not map read-only data memory sections with EXECUTE flags. The code in SectionMemoryManager.cpp unnecessarily maps read-only data sections with the READ+EXECUTE flags. This is undesirable from a security stand-point. Moreover, on the Fuchsia platform, which is now very strict about mapping pages with the EXECUTE permission, this simply fails, because the section's pages were initially allocated with only the READ+WRITE flags. A more detailed description of the issue can be found in this public SwiftShader bug: https://issuetracker.google.com/issues/154586551 This patch just restrict the mapping to the READ flag for ROData sections. Code sections are still mapped with READ+EXECUTE as expected. Reviewed By: lhames Differential Revision: https://reviews.llvm.org/D78574	2020-08-05 10:51:48 +02:00
Sander de Smalen	f2916636f8	[AArch64][SVE] Disable tail calls if callee does not preserve SVE regs. This fixes an issue triggered by the following code, where emitEpilogue got confused when trying to restore the SVE registers after the call, whereas the call to bar() is implemented as a TCReturn: int non_sve(); int sve(svint32_t x) { return non_sve(); } Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D84869	2020-08-05 09:38:54 +01:00
George Mitenkov	159806704b	[MLIR][SPIRVToLLVM] Updated LLVM types in the documentation Updated the documentation with new MLIR LLVM types for vectors, pointers, arrays and structs. Also, changed remaining tabs to spaces. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D85277	2020-08-05 11:18:52 +03:00
Jay Foad	8cbf4a17ac	[AMDGPU] Propagate fast math flags in frem lowering Differential Revision: https://reviews.llvm.org/D84518	2020-08-05 09:09:38 +01:00
Jay Foad	1bb07e1b91	[AMDGPU] Precommit tests for D84518 Propagate fast math flags in frem lowering	2020-08-05 09:09:02 +01:00
Jay Foad	04cf4a5a65	[AMDGPU] Lower frem f16 Without this it would fail to select on subtargets that have 16-bit instructions. Differential Revision: https://reviews.llvm.org/D84517	2020-08-05 09:08:40 +01:00
Andrzej Warzynski	621681e3e5	[Flang] Fix multi-config generator builds Based on https://reviews.llvm.org/D84022 with additional changes to maintain out-of-tree builds. Original commit message: Currently the binaries are output directly into the bin subdirectory of the build directory. This doesn't work correctly with multi-config generators which should output the binaries into <CONFIG_NAME>/bin instead. The original patch was implemented by David Truby and the additional changes added here were also proposed by David Truby. Differential Revision: https://reviews.llvm.org/D85078/ Co-authored-by: David Truby <david.truby@arm.com>	2020-08-05 08:59:11 +01:00

... 3 4 5 6 7 ...

362706 Commits All Branches Search

362706 Commits

All Branches