llvm-project

Commit Graph

Author	SHA1	Message	Date
Arthur Eubanks	ee7c9b8f14	[gn build] Manually add file Since bot is broken	2021-07-29 10:37:28 -07:00
Alfonso Gregory	09529892b5	[Support] Remove LLVM_ATTRIBUTE_NORETURN Code should use C++11 [[noreturn]] or C11 _Noreturn instead. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D106899	2021-07-29 10:10:37 -07:00
Fangrui Song	72a83674dd	Replace LLVM_ATTRIBUTE_NORETURN with C++11 [[noreturn]]. NFC [[noreturn]] can be used since Oct 2016 when the minimum compiler requirement was bumped to GCC 4.8/MSVC 2015.	2021-07-29 09:59:45 -07:00
Fangrui Song	172a55e7a4	[lldb] Fix FunctionDecl::Create after D102343	2021-07-29 09:57:10 -07:00
Melanie Blower	fd251d903b	[clang][patch] Remove erroneous run line committed in D102343	2021-07-29 12:42:04 -04:00
Dawid Jurczak	5c315bee8c	[DSE] Transform memset + malloc --> calloc (PR25892) After this change DSE can eliminate malloc + memset and emit calloc. It's https://reviews.llvm.org/D101440 follow-up. Differential Revision: https://reviews.llvm.org/D103009	2021-07-29 18:34:10 +02:00
Melanie Blower	bc5b5ea037	[clang][patch][FPEnv] Make initialization of C++ globals strictfp aware @kpn pointed out that the global variable initialization functions didn't have the "strictfp" metadata set correctly, and @rjmccall said that there was buggy code in SetFPModel and StartFunction, this patch is to solve those problems. When Sema creates a FunctionDecl, it sets the FunctionDeclBits.UsesFPIntrin to "true" if the lexical FP settings (i.e. a combination of command line options and #pragma float_control settings) correspond to ConstrainedFP mode. That bit is used when CodeGen starts codegen for a llvm function, and it translates into the "strictfp" function attribute. See bugs.llvm.org/show_bug.cgi?id=44571 Reviewed By: Aaron Ballman Differential Revision: https://reviews.llvm.org/D102343	2021-07-29 12:02:37 -04:00
Joachim Protze	4acc2f29a2	[OpenMP][Tools][Tests][NFC] Address flaky archer tests Adding more concurrent threads significantly increases the chance that the data race can be observed during testing.	2021-07-29 17:56:44 +02:00
Jessica Clarke	95ef464ac9	Handle subregs and superregs in callee-saved register mask If a target lists both a subreg and a superreg in a callee-saved register mask, the prolog will spill both aliasing registers. Instead, don't spill the subreg if a superreg is being spilled. This case is hit by the PowerPC SPE code, as well as a modified RISC-V backend for CHERI I maintain out of tree. Reviewed By: jhibbits Differential Revision: https://reviews.llvm.org/D73170	2021-07-29 16:53:29 +01:00
bakhtiyar	9a5bc83660	Add an escape-hatch for conversion of funcs with blocking awaits to coroutines. Currently TFRT does not support top-level coroutines, so this functionality will allow to have a single blocking await at the top level until TFRT implements the necessary functionality. Reviewed By: ezhulenev Differential Revision: https://reviews.llvm.org/D106730	2021-07-29 08:52:28 -07:00
Tobias Gysi	2a342c7c1e	[mlir][linalg] Format bufferization debug print outs (NFC). Change the formatting of the debug print outs to elide unnecessary information. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D106661	2021-07-29 15:48:58 +00:00
Dmitry Vyukov	9e9599ef78	tsan: introduce LazyInitialize We call non-inlinable Initialize from all interceptors/syscalls, but most of the time runtime is already initialized and this just introduces unnecessary overhead. Add LazyInitialize that (1) inlinable, (2) does nothing if .preinit_array is enabled (expected case on Linux). Depends on D107071. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D107072	2021-07-29 17:19:29 +02:00
Dmitry Vyukov	17f650cb00	tsan: s/CHECK/DCHECK/ in tsan_interface_java.cpp We are very paranoid with CHECKs in all Java entry points. These CHECKs were added along with Java support. At that point it wasn't clear what exactly to expect from JVM part and if JVM part is correct or not. Thus CHECK paranoia. These CHECKs never fired in practice, but we pay runtime cost in every entry point all the time. Replace CHECKs with DCHECKs. Depends on D107069. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D107071	2021-07-29 17:13:19 +02:00
Dmitry Vyukov	0bc10d9a8e	tsan: restore Initialize call in Java entry points We used to call Initialize in every Java point. That was removed in `6563bb53b5` ("tsan: don't use caller/current PC in Java interfaces"). The intention was to add a single Initialize to __tsan_java_init instead. Do that. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D107069	2021-07-29 17:13:09 +02:00
Jez Ng	a26bb9cc05	[lld-macho][nfc] Simplify common-symbol-coalescing test	2021-07-29 11:07:50 -04:00
Jez Ng	e49374f9e0	[lld-macho] Support common symbols in bitcode (but differently from ld64) ld64 seems to handle common symbols in bitcode rather bizarrely. They follow entirely different precedence rules from their non-bitcode counterparts. I initially tried to emulate ld64 in D106597, but I'm not sure the extra complexity is worth it, especially given that common symbols are not, well, very common. This diff accords common bitcode symbols the same precedence as regular common symbols, just as we treat all other pairs of bitcode and non-bitcode symbol types. The tests document ld64's behavior in detail, just in case we want to revisit this. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D107027	2021-07-29 11:07:50 -04:00
Dmitry Vyukov	f821a55c5e	tsan: add intrusive doubly-linked list Add intrusive doubly-linked list container template, IList. It will be used in the new tsan runtime. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D107050	2021-07-29 17:07:27 +02:00
Rosie Sumpter	fab5659c79	Revert "[LoopFlatten] Fix missed LoopFlatten opportunity" This reverts commit `2df8bf9339`. Reverting because it causes an assertion failure.	2021-07-29 15:52:45 +01:00
Kai Luo	e4902e69e9	[PowerPC] Fix return type of XL compat CAS `__compare_and_swap*` should return `i32` rather than `i1`. Reviewed By: jsji Differential Revision: https://reviews.llvm.org/D107077	2021-07-29 14:49:26 +00:00
Dmitry Vyukov	5697841f66	tsan: add another test for atomics Add a test where atomic-release happens while another thread spins calling load-acquire. This can expose some interesting interleavings of release and acquire. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D107055	2021-07-29 16:43:24 +02:00
Guillaume Chatelet	cd2f5d5b49	[libc] rewrite aarch64 memcmp implementation This patch is simply rearranging the code layout so it's easier to understand. Differential Revision: https://reviews.llvm.org/D106641	2021-07-29 14:41:12 +00:00
Kim-Anh Tran	2e9853e0e9	[DWARF5] Only fallback to manual index if no entry was found If we succeed at gathering global variables for a compile unit, there is no need to fallback to generating a manual index. Reviewed By: jankratochvil Differential Revision: https://reviews.llvm.org/D106355	2021-07-29 16:16:42 +02:00
Kazu Hirata	416f3ff803	[CodeGen] Remove getPseudoProbeAttribute and addPseudoProbeAttribute (NFC) The last uses of these functions were removed on Jun 17, 2021 in commit `bd52495518`.	2021-07-29 07:14:37 -07:00
Jessica Clarke	cfaa5bf4ce	[ELF] Align the first section of a PT_TLS even if its type is SHT_NOBITS This is somewhat of a repeat of D66658 but for sections in PT_TLS segments. Although such sections don't need to be aligned such that address and offset are congruent modulo the page size, they do need to be congruent modulo the segment alignment, otherwise the whole PT_TLS will be unaligned. We therefore use the normal calculation to determine the section's address within the PT_LOAD rather than bailing out early due to being SHT_NOBITS. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D106987	2021-07-29 15:14:00 +01:00
Jessica Clarke	b96bb7899f	[ELF] Add two new tests showing broken .tbss alignment if first in PT_TLS This is a similar problem to D66658, where we are too aggressive in not aligning NOBITS sections, and the tests are based on the ones added for that fix. If a .tbss section is first in a PT_TLS segment (i.e. there is no .tdata section) then, although it doesn't need to be aligned such that address and offset are congruent modulo the page size, they do need to be congruent modulo the segment alignment, otherwise the whole PT_TLS will be unaligned. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D106986	2021-07-29 15:13:52 +01:00
Jamie Schmeiser	c3c1826c31	Set TargetCPUName for AIX to default to pwr7. Summary: Set the TargetCPUName for AIX to default to pwr7, removing the setting of it based on the major/minor of the OS version, which previously set it to pwr4 for AIX 7.1 and earlier. The old code would also set it to pwr4 when the OS version was not specified and with the change, it will default it to pwr7 in all cases. Author: Jamie Schmeiser <schmeise@ca.ibm.com> Reviewed By:hubert.reinterpretcast (Hubert Tong) Differential Revision: https://reviews.llvm.org/D107063	2021-07-29 09:59:24 -04:00
Jon Chesterfield	a90da62adb	[libomptarget][amdgpu] Update printed plugin name	2021-07-29 14:46:42 +01:00
Augusto Noronha	77e9d10f0f	[lldb] Assert filecache and live memory match on debug under a setting	2021-07-29 10:29:34 -03:00
Sanjay Patel	fa6b2c9915	[DAGCombiner] don't try to partially reduce add-with-overflow ops This transform was added with D58874, but there were no tests for overflow ops. We need to change this one way or another because it can crash as shown in: https://llvm.org/PR51238 Note that if there are no uses of an overflow op's bool overflow result, we reduce it to a regular math op, so we continue to fold that case either way. If we have uses of both the math and the overflow bool, then we are likely not saving anything by creating an independent sub instruction as seen in the test diffs here. This patch makes the behavior in SDAG consistent with what we do in instcombine AFAICT. Differential Revision: https://reviews.llvm.org/D106983	2021-07-29 08:51:54 -04:00
Stefan Gränitz	058935145d	[Orc][examples] Adopt ExecutorProcessControl API and re-enable LLJITWithRemoteDebugging The API change originated from D104694. The LLJITWithRemoteDebugging example and the test for it were disabled while it was in the works.	2021-07-29 14:40:42 +02:00
Sanjay Patel	e427077ec1	[AArch64][x86] add tests for add-with-overflow folds; NFC There's a generic combine for these, but no test coverage. It's not clear if this is actually a good fold. The combine was added with D58874, but it has a bug that can cause crashing ( https://llvm.org/PR51238 ).	2021-07-29 07:58:13 -04:00
Andrew Savonichev	bcc83a2e83	[MCA] Use LSU for the in-order pipeline Load/Store unit is used to enforce order of loads and stores if they alias (controlled by --noalias=false option). Fixes PR50483 - [MCA] In-order pipeline doesn't track memory load/store dependencies. Differential Revision: https://reviews.llvm.org/D103955	2021-07-29 14:40:23 +03:00
Bradley Smith	191831e380	[AArch64][SVE] Fix incorrect mask type when lowering fixed type SVE gather/scatter An incorrect mask type when lowering an SVE gather/scatter was causing a codegen fault which manifested as the incorrect predicate size being used for an SVE gather/scatter, (e.g.. p0.b rather than p0.d). Fixes PR51182. Differential Revision: https://reviews.llvm.org/D106943	2021-07-29 11:22:17 +00:00
Sebastian Neubauer	4864893127	[Utils] Do not remove comments in llc test script When checking if two prefixes can be merged for a function, update_llc_test_checks.py removed IR comments before comparing llc outputs of different RUN lines. This means, if one RUN line emited lines starting with ';' and another RUN line emited the same lines except the ones starting with ';', both RUNs would be merged (if they share a prefix). However, CHECK-NEXT lines check the comments, otherwise they fail, so the script should not merge RUNs if they contain different comments. Differential Revision: https://reviews.llvm.org/D101312	2021-07-29 13:03:05 +02:00
Jesse Towner	68546c9d6f	bugprone-forwarding-reference-overload: support non-type template parameters Many concepts emulation libraries, such as the one found in Range v3, tend to use non-type template parameters for the enable_if type expression, due to their versatility in template functions and constructors containing variadic template parameter packs. Unfortunately the bugprone-forwarding-reference-overload check does not handle non-type template parameters, as was first noted in this bug report: https://bugs.llvm.org/show_bug.cgi?id=38081 This patch fixes this long standing issue and allows for the check to be suppressed with the use of a non-type template parameter containing enable_if or enable_if_t in the type expression, so long as it has a default literal value.	2021-07-29 07:01:19 -04:00
Jeremy Morse	2537120c87	Follow-up to D105207, only salvage affine SCEVs to avoid a crash SCEVToIterCountExpr only expects to be fed affine expressions, but DbgRewriteSalvageableDVIs is feeding it non-affine induction variables. Following this up with an obvious fix, will add test coverage too if this avoids D105207 being reverted.	2021-07-29 11:48:08 +01:00
Dmitry Vyukov	0d68cfc996	tsan: store ThreadRegistry in Context by value It's unclear why we allocate ThreadRegistry separately, I assume it's some historical leftover. Embed ThreadRegistry into Context. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D107045	2021-07-29 12:44:44 +02:00
Dmitry Vyukov	4e15ee2867	sanitizer_common: remove BlockingMutex and RWMutex Remove the legacy typedefs and use Mutex/Lock types directly. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D107043	2021-07-29 12:38:53 +02:00
Dmitry Vyukov	b8f4232823	tsan: rename deadlock detector Mutex to UserMutex It conflicts with sanitizer_common Mutex. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D107053	2021-07-29 12:38:14 +02:00
Andrzej Warzynski	8bf0a40608	[flang][driver] Forward `-fopenmp`/`-fopenacc` to the host compiler This patch only modifies `flang` - the bash wrapper script. `-fopenmp`/`-fopenacc` are required to enable the OpenMP/OpenACC extension in the frontend and to make sure that the required libraries are linked when generating the final binary. This patch makes sure that `-fopnemp`/`-fopenacc` is used for both unparsing and the code generation (via the host compiler). Differential Revision: https://reviews.llvm.org/D106871	2021-07-29 11:21:56 +01:00
Cullen Rhodes	08d92dbbff	[AArch64][AsmParser] NFC: Parser.getTok() -> getTok() Reviewed By: david-arm Differential Revision: https://reviews.llvm.org/D106949	2021-07-29 10:18:54 +00:00
Amara Emerson	da61ab8475	[AArch64][GlobalISel] More widenToNextPow2 changes, this time for arithmetic/bitwise ops.	2021-07-29 03:02:29 -07:00
Kirill Bobyrev	d8fd2146da	NFC: Change quotes from Unicode to ASCII This was causing some problems for Python scripts that we have. Context: https://reviews.llvm.org/D106792	2021-07-29 11:37:10 +02:00
David Spickett	d510b5f199	[lldb][AArch64] Annotate synchronous tag faults In the latest Linux kernels synchronous tag faults include the tag bits in their address. This change adds logical and allocation tags to the description of synchronous tag faults. (asynchronous faults have no address) Process 1626 stopped * thread #1, name = 'a.out', stop reason = signal SIGSEGV: sync tag check fault (fault address: 0x900fffff7ff9010 logical tag: 0x9 allocation tag: 0x0) This extends the existing description and will show as much as it can on the rare occasion something fails. This change supports AArch64 MTE only but other architectures could be added by extending the switch at the start of AnnotateSyncTagCheckFault. The rest of the function is generic code. Tests have been added for synchronous and asynchronous MTE faults. Reviewed By: omjavaid Differential Revision: https://reviews.llvm.org/D105178	2021-07-29 10:26:37 +01:00
Mirko Brkusanin	971f4173f8	[AMDGPU][GlobalISel] Insert an and with exec before s_cbranch_vccnz if necessary While v_cmp will AND inactive lanes with 0, that is not the case for logical operations. This fixes a Vulkan CTS test that would hang otherwise. Differential Revision: https://reviews.llvm.org/D105709	2021-07-29 11:20:49 +02:00
Benjamin Kramer	d81a843846	[mlir] Put back virtual ~ConversionTarget(), some users started relying on it	2021-07-29 11:10:07 +02:00
Benjamin Kramer	1c9c2c91d4	[mlir] Remove the default isDynamicallyLegal hook This is redundant with the callback variant and untested. Also remove the callback-less methods for adding a dynamically legal op, as they are no longer useful. Differential Revision: https://reviews.llvm.org/D106786	2021-07-29 11:00:57 +02:00
Tomasz Kamiński	f1ab60e40d	Fix FindZ3.cmake to support static libraries and Windows Use absolute path to link z3 to allow builds both on windows and linux since the library name is platform dependent for Z3 (libz3 on Windows and z3 on Linux) and MSVC does not recognized -L and -l options. Fix CMAKE_CROSSCOMPILING that does not work correctly since it uses Z3_BUILD_VERSION instead of Z3_BUILD_NUMBER Fix building with the static version of z3 library (supersedes D80227). - Build the Z3 version detection code as C++, since the static library brings in libstdc++ symbols - Detect threading support and link against threading, in the (likely) case Z3 was built with threads Exposed compilation error from building a program that is used to detect z3 version in the warning message, to simplify troubleshooting. Reviewed By: JDevlieghere Differential Revision: https://reviews.llvm.org/D106131	2021-07-29 10:55:44 +02:00
Rosie Sumpter	2df8bf9339	[LoopFlatten] Fix missed LoopFlatten opportunity When the trip count of the inner loop is a constant, the InstCombine pass now causes the transformation e.g. imcp ult i32 %inc, tripcount -> icmp ult %j, tripcount-step (where %j is the inner loop induction variable and %inc is add %j, step), which is now accounted for when identifying the trip count of the loop. This is also an acceptable use of %j (provided the step is 1) so is ignored as long as the compare that it's used in is also the condition of the inner branch. Differential Revision: https://reviews.llvm.org/D105802	2021-07-29 09:47:41 +01:00
Fraser Cormack	02dd4b59bc	[RISCV] Optimize floating-point "dominant value" BUILD_VECTORs This patch aims to improve the performance of BUILD_VECTORs which are identified as containing a dominant element. Given that most floating-point constants themselves require a load from the constant pool, it was possible for the optimization to actually increase the number of individual loads on small vectors. The exception is the zero constant -- +0.0 -- which can be materialized efficiently. While this optimization could do with a proper cost model to weigh the benfits of a single vector load vs. the manipulation of individual elements -- even for integer vectors which often require several instructions to materialize -- without a concrete RVV implementation to work with any heuristic is likely to be both more obtuse and inaccurate. Until then, this patch fixes at least one known obvious deficiency. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D106963	2021-07-29 09:22:34 +01:00

1 2 3 4 5 ...

395253 Commits All Branches Search

395253 Commits

All Branches