llvm-project

Commit Graph

Author	SHA1	Message	Date
peter klausler	27c3db32c0	[flang] Remove unused variable Fixes the clang build of flang/runtime/transformational.cpp.	2021-05-20 14:08:43 -07:00
Yaxun (Sam) Liu	4cb42564ec	[CUDA][HIP] Fix device variables used by host variables emitted on both host and device side with different addresses when ODR-used by host function should not cause device side counter-part to be force emitted. This fixes the regression caused by https://reviews.llvm.org/D102237 Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D102801	2021-05-20 17:04:29 -04:00
Min-Yih Hsu	dccf5c7dfb	[M68k] Support for inline asm operands w/ simple constraints This patch adds supports for inline assembly operands and some simple operand constraints, including register and constant operands. Differential Revision: https://reviews.llvm.org/D102585	2021-05-20 14:00:09 -07:00
Min-Yih Hsu	e620bea211	[M68k] Allow user to preserve certain registers Add `-ffixed-a[0-6]` and `-ffixed-d[0-7]` and the corresponding subtarget features to prevent certain register from being allocated. Differential Revision: https://reviews.llvm.org/D102805	2021-05-20 13:57:22 -07:00
peter klausler	854367240d	[flang] Fix broken build of flang with clang A recent commit (e00a170) inadvertently exposed the Reshape runtime function as being extern "C", for which it is not yet ready, leading to a valid warning from clang that we treat as an error. Patch.	2021-05-20 13:49:31 -07:00
peter klausler	c1db35f0c2	[flang] Implement more transformational intrinsic functions in runtime Define APIs, naively implement, and add basic sanity unit tests for the transformational intrinsic functions CSHIFT, EOSHIFT, PACK, SPREAD, TRANSPOSE, and UNPACK. These are the remaining transformational intrinsic functions that rearrange data without regard to type (except for default boundary values in EOSHIFT); RESHAPE was already in place as a stress test for the runtime's descriptor handling facilities. Code is in place to create copies of allocatable/automatic components when transforming arrays of derived type, but it won't do anything until we have derived type information being passed to the runtime from the frontend. Differential Revision: https://reviews.llvm.org/D102857	2021-05-20 13:22:01 -07:00
Nicolas Vasilache	e3cf7c88c4	[mlir][MemRef] NFC - Drop MemRef EDSC usage Drop the MemRef dialect EDSC subdirectory and update all uses. Differential Revision: https://reviews.llvm.org/D102868	2021-05-20 20:13:58 +00:00
Heejin Ahn	3eb12b0ae1	[WebAssembly] Warn on exception spec for Emscripten EH It turns out we have not correctly supported exception spec all along in Emscripten EH. Emscripten EH supports `throw()` but not `throw` with types. See https://bugs.llvm.org/show_bug.cgi?id=50396. Wasm EH also only supports `throw()` but not `throw` with types, and we have been printing a warning message for the latter. This prints the same warning message for `throw` with types when Emscripten EH is used, or more precisely, when Wasm EH is not used. (So this will print the warning messsage even when `-fno-exceptions` is used but I think that should be fine. It's cumbersome to do a complilcated option checking in CGException.cpp and options checkings are mostly done in elsewhere.) Reviewed By: dschuff, kripken Differential Revision: https://reviews.llvm.org/D102791	2021-05-20 13:00:20 -07:00
Jan Kratochvil	6d19c84cd9	[lldb] Improve invalid DWARF DW_AT_ranges error reporting In D98289#inline-939112 @dblaikie said: Perhaps this could be more informative about what makes the range list index of 0 invalid? "index 0 out of range of range list table (with range list base 0xXXX) with offset entry count of XX (valid indexes 0-(XX-1))" Maybe that's too verbose/not worth worrying about since this'll only be relevant to DWARF producers trying to debug their DWARFv5, maybe no one will ever see this message in practice. Just a thought. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D102851	2021-05-20 21:37:01 +02:00
Jessica Clarke	e10958c807	[SelectionDAG][Mips][PowerPC][RISCV][WebAssembly] Teach computeKnownBits/ComputeNumSignBits about atomics Unlike normal loads these don't have an extension field, but we know from TargetLowering whether these are sign-extending or zero-extending, and so can optimise away unnecessary extensions. This was noticed on RISC-V, where sign extensions in the calling convention would result in unnecessary explicit extension instructions, but this also fixes some Mips inefficiencies. PowerPC sees churn in the tests as all the zero extensions are only for promoting 32-bit to 64-bit, but these zero extensions are still not optimised away as they should be, likely due to i32 being a legal type. This also simplifies the WebAssembly code somewhat, which currently works around the lack of target-independent combines with some ugly patterns that break once they're optimised away. Re-landed with correct handling in ComputeNumSignBits for Tmp == VTBits, where zero-extending atomics were incorrectly returning 0 rather than the (slightly confusing) required return value of 1. Re-landed again after D102819 fixed PowerPC to correctly zero-extend all of its atomics as it claimed to do, since the combination of that bug and this optimisation caused buildbot regressions. Reviewed By: RKSimon, atanasyan Differential Revision: https://reviews.llvm.org/D101342	2021-05-20 20:34:23 +01:00
Jon Chesterfield	68b88ae670	[libomptarget] Improve dlwrap compile time error diagnostic [libomptarget] Improve dlwrap compile time error diagnostic The dlwrap interface takes an explict arity, e.g. DLWRAP(cuAlloc, 2); This probably can't be eliminated as it controls the argument list of an external symbol, not an inline header function. If the arity given is too big, the error from clang referring to the line is in the middle of implementation details. /usr/lib/gcc/x86_64-linux-gnu/10/../../../../include/c++/10/tuple:1277:7: error: static_assert failed due to requirement '0UL < tuple_size<std::tuple<>>::value' "tuple index is in range" static_assert(__i < tuple_size<tuple<>>::value, ^ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /usr/lib/gcc/x86_64-linux-gnu/10/../../../../include/c++/10/tuple:1260:7: ... /usr/lib/gcc/x86_64-linux-gnu/10/../../../../include/c++/10/tuple:1260:7: ... /home/amd/llvm-project/openmp/libomptarget/include/dlwrap.h:93:27 ... /home/amd/llvm-project/openmp/libomptarget/plugins/cuda/dynamic_cuda/cuda.cpp:34:1: note: in instantiation of template class 'dlwrap::trait<cudaError_enum ()(unsigned long , unsigned long)>::arg<2>' requested here DLWRAP(cuMemAlloc, 3); ^ /home/amd/llvm-project/openmp/libomptarget/include/dlwrap.h:51:31: ... /home/amd/llvm-project/openmp/libomptarget/include/dlwrap.h:166:3: ... /home/amd/llvm-project/openmp/libomptarget/include/dlwrap.h:133:3: ... /home/amd/llvm-project/openmp/libomptarget/include/dlwrap.h:186:37: ... If the arity is too small, the diagnostic is better: cuda/dynamic_cuda/cuda.cpp:34:1: error: too few arguments to function call, expected 2, have 1 DLWRAP(cuMemAlloc, 1); This patch changes the diagnostic to: cuda/dynamic_cuda/cuda.cpp:34:1: error: static_assert failed due to requirement '1 == trait<cudaError_enum ()(unsigned long , unsigned long)>::nargs' "Arity Error" DLWRAP(cuMemAlloc, 1); or cuda/dynamic_cuda/cuda.cpp:34:1: error: static_assert failed due to requirement '3 == trait<cudaError_enum ()(unsigned long , unsigned long)>::nargs' "Arity Error" DLWRAP(cuMemAlloc, 3); Reviewed By: tianshilei1992 Differential Revision: https://reviews.llvm.org/D102858	2021-05-20 20:33:36 +01:00
Louis Dionne	31bcdb6635	[libc++] Translate the test suite sanitizer support to the DSL Differential Revision: https://reviews.llvm.org/D102108	2021-05-20 15:30:48 -04:00
LLVM GN Syncbot	efea813445	[gn build] Port `0af3105b64`	2021-05-20 19:20:25 +00:00
Jon Roelofs	0af3105b64	Revert "[Remarks] Add analysis remarks for memset/memcpy/memmove lengths" This reverts commit `4bf69fb52b`. This broke spec2k6/403.gcc under -global-isel. Details to follow once I've reduced the problem.	2021-05-20 12:19:16 -07:00
Nico Weber	e1ffcfcee7	[gn build] try reverting code part of `f05fbb7795` Maybe `aa8fe8fe6c` was all that was needed to fix the build and we can keep the code with fewer conditionals after all.	2021-05-20 15:08:39 -04:00
Nico Weber	aa8fe8fe6c	[gn build] attempt again to unbreak linux after `fc9696130c`	2021-05-20 15:01:35 -04:00
Nico Weber	f3f592ac89	[gn build] use PEP-8 indents in symbol_exports.py	2021-05-20 15:00:24 -04:00
Nico Weber	f05fbb7795	[gn build] attempt to unbreak linux after `fc9696130c` Only emit `global:` if there are any exported symbols. While here, `chmod +x` the symbol_exports.py script.	2021-05-20 14:55:40 -04:00
zoecarver	c40b02608e	[libcxx][ranges] Implement `ranges::borrowed_range`. Differential Revision: https://reviews.llvm.org/D102426	2021-05-20 11:51:44 -07:00
Nico Weber	fc9696130c	[gn build] Use .export files Just fixing an old TODO, no dramatic behavior change. Differential Revision: https://reviews.llvm.org/D102843	2021-05-20 14:48:12 -04:00
Kevin P. Neal	f21f1eea05	[FPEnv] EarlyCSE support for constrained intrinsics, default FP environment edition EarlyCSE cannot distinguish between floating point instructions and constrained floating point intrinsics that are marked as running in the default FP environment. Said intrinsics are supposed to behave exactly the same as the regular FP instructions. Teach EarlyCSE to handle them in that case. Differential Revision: https://reviews.llvm.org/D99962	2021-05-20 14:40:51 -04:00
Mitch Phillips	f4ccbaf310	[scudo] Add supported architectures. Adds extra supported architectures that were available for vanilla scudo, in preparation for D102543. Hopefully the dust has settled and `7d0a81ca38` is no longer an issue. Reviewed By: cryptoad, vitalybuka Differential Revision: https://reviews.llvm.org/D102648	2021-05-20 11:22:51 -07:00
Fraser Cormack	c74ab891fc	[RISCV] Ensure small mask BUILD_VECTORs aren't expanded The default expansion for BUILD_VECTORs -- save for going through shuffles -- is to go through the stack. This method only works when the type is at least byte-sized, so for v2i1 and v4i1 we would crash. This patch ensures that small mask-type BUILD_VECTORs are always handled without crashing. We lower to a SETCC of the equivalent i8 type. This also exposes some pre-existing issues where the lowering when optimizing for size results in larger code than without. Those will be tackled in future patches. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D102767	2021-05-20 19:12:29 +01:00
Reid Kleckner	8f20ac9595	[PGO] Don't reference functions unless value profiling is enabled This reduces the size of chrome.dll.pdb built with optimizations, coverage, and line table info from 4,690,210,816 to 2,181,128,192, which makes it possible to fit under the 4GB limit. This change can greatly reduce binary size in coverage builds, which do not need value profiling. IR PGO builds are unaffected. There is a minor behavior change for frontend PGO. PGO and coverage both use InstrProfiling to create profile data with counters. PGO records the address of each function in the __profd_ global. It is used later to map runtime function pointer values back to source-level function names. Coverage does not appear to use this information. Recording the address of every function with code coverage drastically increases code size. Consider this program: void foo(); void bar(); inline void inlineMe(int x) { if (x > 0) foo(); else bar(); } int getVal(); int main() { inlineMe(getVal()); } With code coverage, the InstrProfiling pass runs before inlining, and it captures the address of inlineMe in the __profd_ global. This greatly increases code size, because now the compiler can no longer delete trivial code. One downside to this approach is that users of frontend PGO must apply the -mllvm -enable-value-profiling flag globally in TUs that enable PGO. Otherwise, some inline virtual method addresses may not be recorded and will not be able to be promoted. My assumption is that this mllvm flag is not popular, and most frontend PGO users don't enable it. Differential Revision: https://reviews.llvm.org/D102818	2021-05-20 11:09:24 -07:00
Mitch Phillips	577a80bff8	[scudo] Disable secondary cache-unmap tests on arm32. Looks like secondary pointers don't get unmapped on one of the arm32 bots. In the interests of landing some dependent patches, disable this test on arm32 so that it can be tested in isolation later. Reviewed By: cryptoad, vitalybuka Split from differential patchset (1/2): https://reviews.llvm.org/D102648	2021-05-20 11:07:45 -07:00
Lang Hames	d22b27cfde	[ORC-RT] Add string_view and span utilities for use by the ORC runtime. These are substitutes for std::string_view (and llvm::StringRef) and std::span (and llvm::ArrayRef) for use by the ORC runtime.	2021-05-20 11:02:44 -07:00
Simon Pilgrim	a26288e803	[X86][Atom] Fix vector fadd/fcmp/fmul resource/throughputs Match whats documented in the Intel AOM - these are all fadd/fcmp use Port1 and fmul uses Port1, but in many cases BOTH ports are required - this was being incorrectly modelled as EITHER port. Discovered while investigating the correct fptoui costs to fix the regressions in D101555. Now that we can use in-order models in llvm-mca, the atom model is a good "worst case scenario" analysis for x86.	2021-05-20 18:56:58 +01:00
Tamar Christina	68d5235cb5	libsanitizer: Remove cyclades inclusion in sanitizer The Linux kernel has removed the interface to cyclades from the latest kernel headers[1] due to them being orphaned for the past 13 years. libsanitizer uses this header when compiling against glibc, but glibcs itself doesn't seem to have any references to cyclades. Further more it seems that the driver is broken in the kernel and the firmware doesn't seem to be available anymore. As such since this is breaking the build of libsanitizer (and so the GCC bootstrap[2]) I propose to remove this. [1] https://lkml.org/lkml/2021/3/2/153 [2] https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100379 Reviewed By: eugenis Differential Revision: https://reviews.llvm.org/D102059	2021-05-20 18:55:26 +01:00
Alex Orlov	752385b128	Add support for DWARF embedded source to llvm-symbolizer. This patch adds DWARF embedded source printout to llvm-symbolizer. Reviewed By: jhenderson, dblaikie Differential Revision: https://reviews.llvm.org/D102355	2021-05-20 21:40:28 +04:00
Stefan Pintilie	45ad207e45	[PowerPC] Add fix to partword atomic operations Partword atomic binaries are not zero extended as they should be. This patch fixes them to ensure that they are zero extended. Reviewed By: nemanjai, #powerpc Differential Revision: https://reviews.llvm.org/D102819	2021-05-20 12:36:37 -05:00
Fraser Cormack	26bd2250c1	[RISCV] Ensure shuffle splat operands are type-legal The use of `SelectionDAG::getSplatValue` isn't guaranteed to return a type-legal splat value as it may implicitly extract a vector element from another shuffle. It is not permitted to introduce an illegal type when lowering shuffles. This patch addresses the crash by adding a boolean flag to `getSplatValue`, defaulting to false, which when set will ensure a type-legal return value. If it is unable to do that it will fail to return a splat value. I've been through the existing uses of `getSplatValue` in other targets and was unable to find a need or test cases showing a need to update their uses. In some cases, the call is made during `LegalizeVectorOps` which may still produce illegal scalar types. In other situations, the illegally-typed splat value may be quickly patched up to a legal type (such as any-extending the returned `extract_vector_elt` up to a legal type) before `LegalizeDAG` notices. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D102687	2021-05-20 18:00:03 +01:00
Wouter van Oortmerssen	3a293cbf13	[WebAssembly] Fix PIC/GOT codegen for wasm64 __table_base is know 64-bit, since in LLVM it represents a function pointer offset __table_base32 is a copy in wasm32 for use in elem init expr, since no truncation may be used there. New reloc R_WASM_TABLE_INDEX_REL_SLEB64 added Differential Revision: https://reviews.llvm.org/D101784	2021-05-20 09:59:31 -07:00
Steven Wu	5b6cae5524	[IR][AutoUpgrade] Drop alignment from non-pointer parameters and returns This is a follow-up of D102201. After some discussion, it is a better idea to upgrade all invalid uses of alignment attributes on function return values and parameters, not just limited to void function return types. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D102726	2021-05-20 09:54:38 -07:00
Jamie Schmeiser	136ced498b	When vector is found as a type or non-type id, check if it is really the altivec vector token. Summary: Call TryAltiVecVectorToken when an identifier is seen in the parser before annotating the token. This checks the next token where necessary to ensure that vector is properly handled as the altivec token. Author: Jamie Schmeiser <schmeise@ca.ibm.com> Reviewed By: ZarkoCA (Zarko Todorovski) Differential Revision: https://reviews.llvm.org/D100991	2021-05-20 12:39:04 -04:00
Stephen Tozer	cf725dde9c	[DebugInfo] Handle DIArgList in FastISel or GlobalIsel Currently, variadic dbg.values (i.e. those using a DIArgList as part of their location) are not handled properly by FastISel or GlobalISel, and will produce invalid DBG_VALUE instructions if they encounter them. This patch fixes this issue by emitting undef DBG_VALUE instructions for variadic dbg.values, so that no incorrect instruction is produced and any prior variable location is terminated. This is simply a quick-fix to prevent errors; a correct implementation should come later for these ISel pipelines to ensure that we do not drop debug information unnecessarily. Differential Revision: https://reviews.llvm.org/D102500	2021-05-20 17:37:28 +01:00
Jean Perier	943839870a	[flang] simplify derived type info table format - Replace class(*) member by a c_ptr member to avoid having to handle polymorphic components in the type info table generation. Polymorphic entity handling will require these very tables to be lowered properly. Note: keep the init as NullPointer/Designators. This is technically invalid Fortran, the init should have c_ptr type. But wrapping this in a C_LOC intrinsic call would make runtime generation and lowering more complex with no real benefits. - ComponentIterator is crashing when used on the generated derived types in GetScope. This patch makes GetScope more robust, but it is not entirely clear to me why this is only happening with the generated derived types. - The type of generated character globals was incorrect because Scope::FindType was matching character types with different length. Add a CharacterTypeSpec == operator to fix this. Differential Revision: https://reviews.llvm.org/D102768	2021-05-20 18:26:53 +02:00
Peter Waller	2d574a1104	[CodeGen][AArch64][SVE] Canonicalize intrinsic rdffr{ => _z} Follow up to D101357 / `3fa6510f6`. Supersedes D102330. Goal: Use flags setting rdffrs instead of rdffr + ptest. Problem: RDFFR_P doesn't have have a flags setting equivalent. Solution: in instcombine, canonicalize to RDFFR_PP at the IR level, and rely on RDFFR_PP+PTEST => RDFFRS_PP optimization in AArch64InstrInfo::optimizePTestInstr. While here: * Test that rdffr.z+ptest generates a rdffrs. * Use update_{test,llc}_checks.py on the tests. * Use sve attribute on functions. Differential Revision: https://reviews.llvm.org/D102623	2021-05-20 16:22:50 +00:00
Sanjay Patel	f34311c402	[GlobalOpt] recompute alignments for loads and stores of updated globals GlobalOpt can slice structs/arrays and change GEPs in the process, but it was not updating alignments for load/store users. This eventually causes the crashing seen in: https://llvm.org/PR49661 https://llvm.org/PR50253 On x86, this required SLP+codegen to create an aligned vector store on an invalid address. The bugs would be easier to demonstrate on a target with stricter alignment requirements. I'm not sure if this is a complete solution. The alignment updating code is adapted from InstCombine, so I assume that part is tested and good. Differential Revision: https://reviews.llvm.org/D102552	2021-05-20 12:12:21 -04:00
Sanjay Patel	ee4055cf23	[GlobalOpt] adjust test to show load problems; NFC Goes with D102552	2021-05-20 12:12:21 -04:00
Raphael Isemann	8ebaa19501	[lldb] Adjust DumpDataExtractorTest.Formats for Windows Not sure if that's the ostringstream or our conversion code, but this is returning the wrong results on Windows.	2021-05-20 18:00:02 +02:00
Alexey Bataev	182162b616	[SLP]Try to vectorize tiny trees with shuffled gathers of extractelements. If we gather extract elements and they actually are just shuffles, it might be profitable to vectorize them even if the tree is tiny. Differential Revision: https://reviews.llvm.org/D101460	2021-05-20 08:36:16 -07:00
Nicolas Vasilache	4519ca3d2e	[mlir][Linalg] NFC - Drop Linalg EDSC usage Drop the Linalg dialect EDSC subdirectory and update all uses. Differential Revision: https://reviews.llvm.org/D102848	2021-05-20 15:33:56 +00:00
Aaron Ballman	beb5a3a298	Correct some thread safety analysis diagnostics; NFC. The diagnostics were not following the usual style rules.	2021-05-20 11:30:21 -04:00
Jon Chesterfield	d18fb09c69	[libomptarget][amdgpu] Remove majority of fatal errors [libomptarget][amdgpu] Remove majority of fatal errors Replaces most calls to exit() with returning an error to the library entry point. Minor changes to error handling for clear bugs, remove some dead code. Each exit() call site replaced is either in a library entry point or a function that already returns error codes on some paths. The existing handling is not well tested but replacing exit() with a fallback path should be a strict improvement. Remaining two early exit points are an abort() from a callback and exit() from within msgpack. Fixes for those are less obvious and left for a later patch. Reviewed By: pdhaliwal Differential Revision: https://reviews.llvm.org/D102346	2021-05-20 16:26:43 +01:00
Raphael Isemann	48780527dd	[lldb][NFC] Add more Float16 unit tests	2021-05-20 17:07:23 +02:00
Daniel Kiss	801ab71032	[ARM][AArch64] SLSHardening: make non-comdat thunks possible Linker scripts might not handle COMDAT sections. SLSHardeing adds new section for each __llvm_slsblr_thunk_xN. This new option allows the generation of the thunks into the normal text section to handle these exceptional cases. ,comdat or ,noncomdat can be added to harden-sls to control the codegen. -mharden-sls=[all\|retbr\|blr],nocomdat. Reviewed By: kristof.beyls Differential Revision: https://reviews.llvm.org/D100546	2021-05-20 17:07:05 +02:00
Louis Dionne	5c26f895b6	[libc++] Fix documentation build failure	2021-05-20 11:01:10 -04:00
Louis Dionne	cb82e8ea33	[libc++] Remove workaround for PR28391 (ODR violations with ASAN) This is not an issue anymore since we don't build the libc++ dylib with C++14 anymore (see https://llvm.org/PR28391) for details. Differential Revision: https://reviews.llvm.org/D102106	2021-05-20 10:48:36 -04:00
Jon Chesterfield	ea68ad6e26	[libomptarget] Disable test bug49334 on amdgpu [libomptarget] Disable test bug49334 on amdgpu Hangs on amdgpu, do not know why. Disable to unblock build. Reviewed By: ye-luo Differential Revision: https://reviews.llvm.org/D102017	2021-05-20 15:46:56 +01:00
Louis Dionne	b274728b1a	[libc++] Switch a few CI jobs to the minimal Lit configuration Eventually, this should become the default way of running the tests. For now, only move a few CI nodes to it, and keep a node that runs the legacy configuration. Differential Revision: https://reviews.llvm.org/D97565	2021-05-20 10:46:59 -04:00

1 2 3 4 5 ...

389032 Commits All Branches Search

389032 Commits

All Branches