llvm-project

Commit Graph

Author	SHA1	Message	Date
Simon Pilgrim	47f2affa08	Fix MSVC "result of 32-bit shift implicitly converted to 64 bits" warning. NFCI.	2021-08-26 15:08:12 +01:00
Balazs Benics	379b6394d9	Revert "[analyzer] Extend the documentation of MallocOverflow" This reverts commit `6097a41924`.	2021-08-26 15:29:32 +02:00
Balazs Benics	6097a41924	[analyzer] Extend the documentation of MallocOverflow Previously by following the documentation it was not immediately clear what the capabilities of this checker are. In this patch, I add some clarification on when does the checker issue a report and what it's limitations are. I'm also advertising suppressing such reports by adding an assertion, as demonstrated by the test3(). I'm highlighting that this checker might produce an extensive amount of findings, but it might be still useful for code audits. Reviewed By: martong Differential Revision: https://reviews.llvm.org/D107756	2021-08-26 15:20:41 +02:00
Andrew Wei	99c4336374	[LoopDataPrefetch] Add missed LoopSimplify dependence for prefetch pass SCEVExpander::expandCodeFor may expand add recurrences for loop with a preheader, so we should make LoopDataPrefetch dependent on LoopSimplify. This patch will try to fix : https://bugs.llvm.org/show_bug.cgi?id=43784 Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D108448	2021-08-26 21:01:59 +08:00
Jessica Clarke	8f89e2f6c9	[AMDGPU] Remove dead and broken ComplexPatterns SelectADDRParam was discovered as being dead 5 years ago and removed in `7b4ef068c6` but the unused ComplexPattern definition was left behind. SelectADDRDWord has never existed as far as I can tell, even back when AMDGPU was R600-only and called that. Reviewed By: foad Differential Revision: https://reviews.llvm.org/D108758	2021-08-26 12:48:32 +01:00
Jessica Clarke	2cbdf7e131	[SelectionDAG] Remove unused SDTConvertOp This was used by CONVERT_RNDSAT, which was removed in `def496c04b`, so the profile is now unused. Reviewed By: xgupta Differential Revision: https://reviews.llvm.org/D108508	2021-08-26 12:48:17 +01:00
Andrea Di Biagio	4a5b191703	[X86][MCA] Address the latest issues with MULX reported in PR51495. It turns out that SchedWrite WriteIMulH was always assigned to the low half of the result of a MULX (rather than to the high half). To avoid confusion, this patch swaps the two MULX writes in the tablegen definition of MULX32/64. That way, write names better describe what they actually refer to; this also avoids further complications if in future we decide to reuse the same MulH writes to also model other scalar integer multiply instructions. I also had to swap the latency values for the two MULX writes to make sure that the change is effectively an NFC. In fact, none of the existing x86 tests were affected by this small refactoring. This patch also fixes a bug in MCA: a wrong latency value was propagated for instructions that perform multiple writes to a same register. This last issue was found by Roman while testing MULX on targets that define a different latency for the Low/High part of the result. Differential Revision: https://reviews.llvm.org/D108727	2021-08-26 12:08:20 +01:00
Alex Richardson	b475ce39e8	[sanitizer] Fix build on FreeBSD RISC-V We have to avoid calling renameat2 and clone on FreeBSD. Additionally, the mcontext structure has different members. Reviewed By: jrtc27, luismarques Differential Revision: https://reviews.llvm.org/D103886	2021-08-26 12:05:37 +01:00
Sindhu Chittireddy	de15979bc3	Assert pointer cannot be null; NFC Klocwork static code analysis exposed this concern: Pointer 'SubExpr' returned from call to getSubExpr() function which may return NULL from 'cast_or_null<Expr>(Operand)', which will be dereferenced in the statement following it Add an assert on SubExpr to make it clear this pointer cannot be null.	2021-08-26 06:58:56 -04:00
Matthew Devereau	9b830c798e	[AArch64][SVE] Teach cost model masked gathers/scatters are cheap Tell the cost model to use the scalable calculation for non-neon fixed vector. This results in a cheaper cost for fixed-length SVE masked gathers/scatters allowing the vectorizor to emit them more frequently.	2021-08-26 11:17:47 +01:00
Benjamin Kramer	bd7ece4e06	[X86] Don't write to the source directory in test	2021-08-26 12:11:20 +02:00
Roman Lebedev	564d85e090	The maximal representable alignment in LLVM IR is 1GiB, not 512MiB In LLVM IR, `AlignmentBitfieldElementT` is 5-bit wide But that means that the maximal alignment exponent is `(1<<5)-2`, which is `30`, not `29`. And indeed, alignment of `1073741824` roundtrips IR serialization-deserialization. While this doesn't seem all that important, this doubles the maximal supported alignment from 512MiB to 1GiB, and there's actually one noticeable use-case for that; On X86, the huge pages can have sizes of 2MiB and 1GiB (!). So while this doesn't add support for truly huge alignments, which i think we can easily-ish do if wanted, i think this adds zero-cost support for a not-trivially-dismissable case. I don't believe we need any upgrade infrastructure, and since we don't explicitly record the IR version, we don't need to bump one either. As @craig.topper speculates in D108661#2963519, this might be an artificial limit imposed by the original implementation of the `getAlignment()` functions. Differential Revision: https://reviews.llvm.org/D108661	2021-08-26 12:53:39 +03:00
Benjamin Kramer	5ece556271	[libunwind] Don't include cet.h/immintrin.h unconditionally These may not exist when CET isn't available.	2021-08-26 11:37:07 +02:00
Alex Richardson	581613413c	Make Value::MaxAlignment(Exponent) constexpr This avoids references to the variables be generated when using e.g. max(). Reviewed By: dexonsmith Differential Revision: https://reviews.llvm.org/D95050	2021-08-26 10:09:40 +01:00
Alex Richardson	7cab90a7b1	Fix __attribute__((annotate("")) with non-zero globals AS The existing code attempting to bitcast from a value in the default globals AS to i8 addrspace(0)* was triggering an assertion failure in our downstream fork. I found this while compiling poppler for CHERI-RISC-V (we use AS200 for all globals). The test case uses AMDGPU since that is one of the in-tree targets with a non-zero default globals address space. The new test previously triggered a "Invalid constantexpr bitcast!" assertion and now correctly generates code with addrspace(1) pointers. Reviewed By: rjmccall Differential Revision: https://reviews.llvm.org/D105972	2021-08-26 10:09:40 +01:00
Alex Richardson	bf66b0eefc	Fix LLVM_ENABLE_THREADS check from `26a92d5852` We should be using #if instead of #ifdef here since LLVM_ENABLE_THREADS is set using #cmakedefine01 so is always defined. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D108110	2021-08-26 10:09:39 +01:00
Florian Hahn	aa5b6c9779	[ConstraintElimination] Initial support for using info from assumes. This patch adds initial support to use facts from @llvm.assume calls. It intentionally does not handle all possible cases to keep things simple initially. For now, the condition from an assume is made available on entry to the containing block, if the assume is guaranteed to execute. Otherwise it is only made available in the successor blocks.	2021-08-26 10:08:00 +01:00
Florian Hahn	dd1ec869b0	[ConstraintElimination] Add more assume tests.	2021-08-26 10:07:47 +01:00
David Green	6ffc6951a3	[AArch64] Remove unpredictable from narrowing instructions. Like other similar instructions the xtn2 family do not have side effects, and explicitly marking them as such can help improve scheduling freedom.	2021-08-26 09:43:44 +01:00
David Green	9474b03d41	[AArch64] Add a Cortex-A55 NEON scheduler test case.	2021-08-26 09:43:44 +01:00
Jay Foad	985eb25546	[MachineScheduler] Fix tracing Consistently print a newline before "RegionInstrs:".	2021-08-26 09:27:01 +01:00
LLVM GN Syncbot	6894552a74	[gn build] Port `21b25a1fb3`	2021-08-26 08:14:37 +00:00
gejin	21b25a1fb3	[libunwind] Support stack unwind in CET environment Control-flow Enforcement Technology (CET), published by Intel, introduces shadow stack feature aiming to ensure a return from a function is directed to where the function was called. In a CET enabled system, each function call will push return address into normal stack and shadow stack, when the function returns, the address stored in shadow stack will be popped and compared with the return address, program will fail if the 2 addresses don't match. In exception handling, the control flow may skip some stack frames and we must adjust shadow stack to avoid violating CET restriction. In order to achieve this, we count the number of stack frames skipped and adjust shadow stack by this number before jumping to landing pad. Reviewed By: hjl.tools, compnerd, MaskRay Differential Revision: https://reviews.llvm.org/D105968 Signed-off-by: gejin <ge.jin@intel.com>	2021-08-26 16:20:38 +08:00
Jean Perier	9016b2a1ca	[flang] Take result length into account in ApplyElementwise folding ApplyElementwise on character operation was always creating a result ArrayConstructor with the length of the left operand. This is not correct for concatenation and SetLength operations. Compute and thread the length to the spot creating the ArrayConstructor so that the length is correct for those character operations. Differential Revision: https://reviews.llvm.org/D108711	2021-08-26 09:46:14 +02:00
LLVM GN Syncbot	fdefde4965	[gn build] Port `3373e84539`	2021-08-26 07:29:05 +00:00
Gabor Bencze	3373e84539	[clang-tidy] Add bugprone-suspicious-memory-comparison check The check warns on suspicious calls to `memcmp`. It currently checks for comparing types that do not have unique object representations or are non-standard-layout. Based on https://wiki.sei.cmu.edu/confluence/display/c/EXP42-C.+Do+not+compare+padding+data https://wiki.sei.cmu.edu/confluence/display/c/FLP37-C.+Do+not+use+object+representations+to+compare+floating-point+values and part of https://wiki.sei.cmu.edu/confluence/display/cplusplus/OOP57-CPP.+Prefer+special+member+functions+and+overloaded+operators+to+C+Standard+Library+functions Add alias `cert-exp42-c` and `cert-flp37-c`. Some tests are currently failing at head, the check depends on D89649. Originally started in D71973 Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D89651	2021-08-26 09:23:37 +02:00
Gabor Bencze	ad59735f9d	Fix __has_unique_object_representations with no_unique_address Fix incorrect behavior of `__has_unique_object_representations` when using the no_unique_address attribute. Based on the bug report: https://bugs.llvm.org/show_bug.cgi?id=47722 Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D89649	2021-08-26 09:23:37 +02:00
Esme-Yi	b21ed75e10	[llvm-readobj][XCOFF] Add support for `--needed-libs` option. Summary: This patch is trying to add support for llvm-readobj --needed-libs option under XCOFF. For XCOFF, the needed libraries can be found from the Import File ID Name Table of the Loader Section. Currently, I am using binary inputs in the test since yaml2obj does not yet support for writing the Loader Section and the import file table. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D106643	2021-08-26 07:17:06 +00:00
Lin Sun	d280a76908	[Driver][Linux] Fix regression when -DLIBCXX_LIBDIR_SUFFIX=64 This patch allows an installed (`ninja install-clang`) Clang to find `../lib64/libc++.so` Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D108286	2021-08-25 23:50:17 -07:00
Jan Svoboda	6da811fd5c	[clang][deps] Reset non-modular language and preprocessor options There are a number of language and preprocessor options that are reset in the `CompilerInvocation` that describes the build of an implicit module. This patch uses the logic for explicit modules as well. Reviewed By: dexonsmith Differential Revision: https://reviews.llvm.org/D108710	2021-08-26 08:43:21 +02:00
Aart Bik	6b26857dbf	[mlir][sparse] add asCOO() functionality to sparse tensor object This prepares general sparse to sparse conversions. The code that needs to be generated using this new feature is now simply: (1) coo = sparse_tensor_1->asCOO(); // source format1 (2) sparse_tensor_2 = newSparseTensor(coo); // destination format2 By using COO as an intermediate, we can do all conversions without having to implement the full O(N^2) conversion matrix. Note that we can always improve particular conversions individually if a faster solution is required. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D108681	2021-08-25 21:50:39 -07:00
Wenlei He	a45d72e024	[CSSPGO] Add switch for sample loader to honor global pre-inliner decision from llvm-profgen The change adds a switch to allow sample loader to use global pre-inliner's decision instead. The pre-inliner in llvm-profgen makes inline decision globally based on whole program profile and function byte size as cost proxy. Since pre-inliner also adjusts/merges context profile based on its inline decision, honoring its inline decision in sample loader would lead to better post-inline profile quality especially for thinlto where cross module profile merging isn't possible without pre-inliner. Minor fix in profile reader is also included. When pre-inliner is use, we now also turn off the default merging and trimming logic unless it's explicitly asked. Differential Revision: https://reviews.llvm.org/D108677	2021-08-25 17:20:15 -07:00
Fangrui Song	4a66a11286	[LLVMgold.so][test] Make comdat-nodeduplicate.ll work with binutils<2.27	2021-08-25 16:59:06 -07:00
Sam Clegg	c05d30e444	[clang][Emscripten] Define __unix family of macros This will allow us to remove these from the downstream driver: `57270ce815/emcc.py (L860-L863)` Differential Revision: https://reviews.llvm.org/D108735	2021-08-25 19:24:47 -04:00
Arthur Eubanks	1bdeafeaf4	[gn build] Unbreak non-clang host builds `eecd5d0a` broke non-clang host builds. Some crt code is not always built with the just-built clang. `0da172b` checked if the compiler is clang, not assert that the compiler is clang.	2021-08-25 16:14:45 -07:00
Alexey Bataev	1c7dda9095	[SLP][NFC]Add a test for non-optimal PHIs vectorization, NFC.	2021-08-25 15:55:11 -07:00
Heejin Ahn	e849d99df1	[WebAssembly] Use entry block only for initializations in EmSjLj Emscripten SjLj transformation is done in four steps. This will be mostly the same for the soon-to-be-added Wasm SjLj; the step 1, 3, and 4 will be shared and there will be separate way of doing step 2. 1. Initialize `setjmpTable` and `setjmpTableSize` in the entry BB 2. Handle `setjmp` callsites 3. Handle `longjmp` callsites 4. Cleanup and update SSA We initialize `setjmpTable` and `setjmpTableSize` in the entry BB. But if the entry BB contains a `setjmp` call, some `setjmp` handling transformation will also happen in the entry BB, such as calling `saveSetjmp`. This is fine for Emscripten SjLj but not for Wasm SjLj, because in Wasm SjLj we will add a dispatch BB that contains a `switch` right after the entry BB, from which we jump to one of post-`setjmp` BBs. And this dispatch BB should precede all `setjmp` calls. Emscripten SjLj (current): ``` entry: %setjmpTable = ... %setjmpTableSize = ... ... call @saveSetjmp(...) ``` Wasm SjLj (follow-up): ``` entry: %setjmpTable = ... %setjmpTableSize = ... setjmp.dispatch: ... ; Jump to the right post-setjmp BB, if we are returning from a ; longjmp. If this is the first setjmp call, go to %entry.split. switch i32 %no, label %entry.split [ i32 1, label %post.setjmp1 i32 2, label %post.setjmp2 ... i32 N, label %post.setjmpN ] entry.split: ... call @saveSetjmp(...) ``` So in Wasm SjLj we split the entry BB to make the entry block only for `setjmpTable` and `setjmpTableSize` initialization and insert a `setjmp.dispatch` BB. (This part is not in this CL. This will be a follow-up.) But note that Emscripten SjLj and Wasm SjLj share all steps except for the step 2. If we only split the entry BB only for Wasm SjLj, there will be one more `if`-`else` and the code will be more complicated. So this CL splits the entry BB in Emscripten SjLj and put only initialization stuff there as follows: Emscripten SjLj (this CL): ``` entry: %setjmpTable = ... %setjmpTableSize = ... br %entry.split entry.split: ... call @saveSetjmp(...) ``` This is just done to share code with Wasm SjLj. It adds an unnecessary branch but this will be removed in later optimization passes anyway. This is in effect NFC, meaning the program behavior will not change, but existing ll tests files have changed because the entry block was split. The reason I upload this in a separate CL is to make the Wasm SjLj diff tidier, because this changes many existing Emscripten SjLj tests, which can be confusing for the follow-up Wasm SjLj CL. Reviewed By: tlively Differential Revision: https://reviews.llvm.org/D108729	2021-08-25 15:46:57 -07:00
Heejin Ahn	2f88a30ca6	[WebAssembly] Extract longjmp handling in EmSjLj to a function (NFC) Emscripten SjLj and (soon-to-be-added) Wasm SjLj transformation share many steps: 1. Initialize `setjmpTable` and `setjmpTableSize` in the entry BB 2. Handle `setjmp` callsites 3. Handle `longjmp` callsites 4. Cleanup and update SSA 1, 3, and 4 are identical for Emscripten SjLj and Wasm SjLj. Only the step 2 is different. This CL extracts the current Emscripten SjLj's longjmp callsites handling into a function. The reason to make this a separate CL is, without this, the diff tool cannot compare things well in the presence of moved code and added code in the followup Wasm SjLj CL, and it ends up mixing them together, making the diff unreadable. Also fixes some typos and variable names. So far we've been calling the buffer argument to `setjmp` and `longjmp` `jmpbuf`, but the name used in the man page for those functions is `env`, so updated them to be consistent. Reviewed By: tlively Differential Revision: https://reviews.llvm.org/D108728	2021-08-25 15:45:38 -07:00
Dimitry Andric	18da6db838	[libc++][NFC] Remove duplicate ranges entry in CMakeLists.txt. The second entry got added accidentally as part of `5a3309f825`. Reviewed By: cjdb Differential Revision: https://reviews.llvm.org/D108726	2021-08-25 23:49:43 +02:00
Reid Kleckner	db3d029fbe	Effectively revert `33c3d8a916` / D33782 This change would treat the token `or` in system headers as an identifier, and elsewhere as an operator. As reported in llvm.org/pr42427, many users classify their third party library headers as "system" headers to suppress warnings. There's no clean way to separate Windows SDK headers from user headers. Clang is still able to parse old Windows SDK headers if C++ operator names are disabled. Traditionally this was controlled by `-fno-operator-names`, but is now also enabled with `/permissive` since D103773. This change will prevent `clang-cl` from parsing <query.h> from the Windows SDK out of the box, but there are multiple ways to work around that: - Pass `/clang:-fno-operator-names` - Pass `/permissive` - Pass `-DQUERY_H_RESTRICTION_PERMISSIVE` In all of these modes, the operator names will consistently be available or not available, instead of depending on whether the code is in a system header. I added a release note for this, since it may break straightforward users of the Windows SDK. Fixes PR42427 Differential Revision: https://reviews.llvm.org/D108720	2021-08-25 14:41:26 -07:00
Vitaly Buka	23a1e9f70b	[sanitizer] Add new line to the test	2021-08-25 14:33:06 -07:00
Vitaly Buka	c92631a59a	[sanitizer] Fix VReport of symbol version Version is already a string and does not need stringizing.	2021-08-25 14:32:15 -07:00
Vitaly Buka	ea575598f5	[sanitizers] Basic realpath test	2021-08-25 14:32:15 -07:00
Craig Topper	ccd364286b	[RISCV] Fix the check prefixes in some B extension tests. NFC Looks like a bad merge happened after these were renamed in D107992.	2021-08-25 14:26:51 -07:00
Ricky Taylor	f659b6b1fa	[M68k][NFC] Rename M68kOperand::Kind to KindTy Rename the M68kOperand::Type enumeration to KindTy to avoid ambiguity with the Kind field when referencing enumeration values e.g. `Kind::Value`. This works around a compilation error under GCC 5, where GCC won't lookup enum class values if you have a similarly named field (see https://gcc.gnu.org/bugzilla/show_bug.cgi?id=60994). The error in question is: `M68kAsmParser.cpp:857:8: error: 'Kind' is not a class, namespace, or enumeration` Differential Revision: https://reviews.llvm.org/D108723	2021-08-25 22:24:43 +01:00
Heejin Ahn	c2c9a3fd9c	[WebAssembly] Rename wasm.catch.exn intrinsic back to wasm.catch The plan was to use `wasm.catch.exn` intrinsic to catch exceptions and add `wasm.catch.longjmp` intrinsic, that returns two values (setjmp buffer and return value), later to catch longjmps. But because we decided not to use multivalue support at the moment, we are going to use one intrinsic that returns a single value for both exceptions and longjmps. And even if it's not for that, I now think the naming of `wasm.catch.exn` is a little weird, because the intrinsic can still take a tag immediate, which means it can be used for anything, not only exceptions, as long as that returns a single value. This partially reverts D107405. Reviewed By: tlively Differential Revision: https://reviews.llvm.org/D108683	2021-08-25 14:19:22 -07:00
Vitaly Buka	5213f307ab	Revert "Problem with realpath interceptor" Breaks realpath(, nullptr) for all sanitizers. Somehow INTERCEPT_FUNCTION and INTERCEPT_FUNCTION_VER return false even if everything seemingly right. And this is the issue for COMMON_INTERCEPT_FUNCTION_GLIBC_VER_MIN. There is a check in every sanitlizer: if (!INTERCEPT_FUNCTION_VER(name, ver) && !INTERCEPT_FUNCTION(name)) For non-versioned interceptors when INTERCEPT_FUNCTION returns false it's not considered fatal, and it just prints a warning. However INTERCEPT_FUNCTION_VER in this case will fallback to INTERCEPT_FUNCTION replacing realpath with wrong version. We need to investigate that before relanding the patch. This reverts commit `faef0d042f`.	2021-08-25 13:55:23 -07:00
Omar Emara	3c11e5722c	[LLDB][GUI] Add initial searcher support This patch adds a new type of reusable UI components. Searcher Windows contain a text field to enter a search keyword and a list of scrollable matches are presented. The target match can be selected and executed which invokes a user callback to do something with the match. This patch also adds one searcher delegate, which wraps the common command completion searchers for simple use cases. Reviewed By: clayborg Differential Revision: https://reviews.llvm.org/D108545	2021-08-25 13:55:11 -07:00
Andrea Di Biagio	6181427bb9	[X86][MCA] Add more tests for MULX (PR51495). llvm-mca still reports a wrong latency for the case where the two destination registers of MULX are the same.	2021-08-25 21:28:21 +01:00
Justas Janickas	9dc92bba6c	[OpenCL][NFC] Fix code example in __remove_address_space documentation.	2021-08-25 21:24:32 +01:00

1 2 3 4 5 ...

397473 Commits All Branches Search

397473 Commits

All Branches