Now that these instructions are no longer prototypes, we do not need to be
careful about keeping them opt-in and can use the standard LLVM infrastructure
for them. This commit removes the bespoke intrinsics we were using to represent
these operations in favor of the corresponding target-independent intrinsics.
The clang builtins are preserved because there is no standard way to easily
represent these operations in C/C++.
For consistency with the scalar codegen in the Wasm backend, the intrinsic used
to represent {f32x4,f64x2}.nearest is @llvm.nearbyint even though
@llvm.roundeven better captures the semantics of the underlying Wasm
instruction. Replacing our use of @llvm.nearbyint with use of @llvm.roundeven is
left to a potential future patch.
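For reference, a minimal sketch of how these operations remain reachable from
C/C++ via the preserved clang builtins (the wasm_simd128.h helper name here is
an assumption; the lowering comment reflects the nearbyint choice above):

#include <wasm_simd128.h>

v128_t round_nearest(v128_t v) {
  // f32x4.nearest; currently represented as @llvm.nearbyint.v4f32.
  return wasm_f32x4_nearest(v);
}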
Differential Revision: https://reviews.llvm.org/D100411
Some of Microsoft's unit tests in D70631 fail because libc++'s
implementation of std::chars_format isn't a proper bitmask type. This patch
adds the required operators to make std::chars_format a proper bitmask type.
Implements parts of P0067: Elementary string conversions
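A sketch of the operator set a bitmask type needs, modeled on the standard's
[bitmask.types] requirements (illustrative, not the exact libc++ spelling):

enum class chars_format {
  scientific = 0x1,
  fixed = 0x2,
  hex = 0x4,
  general = fixed | scientific
};

constexpr chars_format operator|(chars_format x, chars_format y) {
  return chars_format(int(x) | int(y));
}
constexpr chars_format& operator|=(chars_format& x, chars_format y) {
  return x = x | y;
}
// ...and likewise operator&, operator^, operator~, operator&=, operator^=.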
Differential Revision: https://reviews.llvm.org/D97115
Multiple lines importing from the same URL can be merged:
import {X} from 'a';
import {Y} from 'a';
Merge to:
import {X, Y} from 'a';
This change implements this merge operation. It takes care not to merge in
various corner cases (default imports, star imports).
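For example, default imports are left alone rather than merged into a named
import list:

import X from 'a';
import {Y} from 'a';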
Differential Revision: https://reviews.llvm.org/D100466
Consider the following set of files:
a.cc:
#include "a.h"
a.h:
#ifndef A_H
#define A_H
#include "b.h"
#include "c.h" // This gets "skipped".
#endif
b.h:
#ifndef B_H
#define B_H
#include "c.h"
#endif
c.h:
#ifndef C_H
#define C_H
void c();
#endif
And the output of the -H option:
$ clang -c -H a.cc
. ./a.h
.. ./b.h
... ./c.h
Note that the include of c.h in a.h is not shown in the output (GCC does the
same). This is because of the include guard optimization: clang knows c.h is
covered by an include guard which is already defined, so when it sees the
include in a.h, it skips it. The same would have happened if #pragma once were
used instead of include guards.
However, a.h *does* include c.h, and it may be useful to show that in the -H
output. This patch adds a flag for doing that.
Differential revision: https://reviews.llvm.org/D100480
This transformation is fundamentally broken when it comes to dominance;
it just happened to work when the source of the memcpy could be moved into
the place of the alloca. The bug shows up a lot more often since
077bff39d4 allows the source to be a
switch.
It would be possible to check dominance of the source and all its
operands, but that seems very heavy for instcombine.
Rename the option for the "LDS lowering" pass from `amdgpu-disable-lower-module-lds`
to `amdgpu-enable-lower-module-lds`, as the latter is consistent and reads better.
Reviewed By: JonChesterfield
Differential Revision: https://reviews.llvm.org/D100441
In the fold SHUFFLE(BINOP(X,Y),BINOP(Z,W)) -> BINOP(SHUFFLE(X,Z),SHUFFLE(Y,W)), check that both X/Z AND Y/W have at least one mergeable shuffle, in which case the total number of shuffles should still fall.
Helps with instruction count regressions we saw while fixing PR48823
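A quick accounting sketch: the fold trades 1 shuffle + 2 binops for
2 shuffles + 1 binop. If only one side (say X/Z) contains a shuffle that
SHUFFLE(X,Z) can merge with, we start with 2 shuffles and end with 2; only
when both the X/Z and Y/W sides can each merge does the count drop, e.g.
from 3 shuffles down to 2.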
Only attempt to propagateIRFlags if we have both SelectInsts - AFAICT we shouldn't have matched a min/max reduction without both SelectInsts, but the static analyzer doesn't know that.
The existing BTI placement pass avoids inserting "BTI c" when the
function has local linkage and is only directly called. However,
even in this case, there is a (small) chance that the linker later
adds a thunk with an indirect call to the function, e.g. if the
function is placed in a separate section and moved far away from
its callers. Make sure to add BTI for these functions too.
Differential Revision: https://reviews.llvm.org/D99417
This refactors SCCP and creates a SCCPSolver interface and class so that it can
be used by other passes and transformations. We will use this in D93838, which
adds a function specialisation pass.
This is based on an early version by Vinay Madhusudan.
Differential Revision: https://reviews.llvm.org/D93762
ICC permits this, and after some extensive testing it looks like we can
support this with very little trouble. We intentionally don't choose to
do this with attribute-target (despite it likely working as well!)
because GCC does not support that, and introducing said
incompatibility doesn't seem worth it.
Stepping through callstacks in the example from D99759 reveals
this potential compile-time improvement.
The savings come from avoiding ValueTracking's computing known
bits if we have already dealt with special-case patterns.
Further improvements in this direction seem possible.
This makes a degenerate test based on PR49785 about 40x faster
(25 sec -> 0.6 sec), but it does not address the larger question
of how to limit computeKnownBitsFromAssume(). I.e., the original
test there is still infinite-time for all practical purposes.
Differential Revision: https://reviews.llvm.org/D100408
Otherwise, when the stack slot offset is big, it reuses the same
register for storing the offset.
Differential Revision: https://reviews.llvm.org/D100461
The start value can't be null for something to be a non-zero
recurrence, so hoist that common check out of the switch.
Subsequent checks may be incomplete or over-specified as noted in:
D100408
These were added in 37935405ef,
but they fail on macOS (and on Windows with MSYS based tools, before
relanding D98859). Remove the tests that exercise "not not echo", as
the primary thing to test is the plain echo patterns above.
Aggregate types over 16 bytes are passed by reference.
Contrary to the x86_64 ABI, smaller structs with an odd (non-power-of-two)
size are padded and passed in registers.
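Illustrative (target-agnostic) examples of the rule:

struct Odd3  { char c[3]; };  // 3 bytes, non-power-of-two: padded, passed in a register
struct Big24 { char c[24]; }; // over 16 bytes: passed by reference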
Differential Revision: https://reviews.llvm.org/D100374
By checking for cpu and toolchain features ahead
of time we don't need the custom return codes.
Reviewed By: omjavaid
Differential Revision: https://reviews.llvm.org/D97684
After 077bff39d4,
isDereferenceableForAllocaSize() can recurse into selects,
which is causing a problem for the new test case,
reduced from https://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20210412/904154.html,
because the replacement (the select) is defined after the first use
of an alloca, so we'd end up with a verifier error.
Now, this new check is too restrictive.
We likely can handle *some* cases, by trying to sink all uses of an alloca
to after the def.
As an extension to rG74f98391a7a4, we can also include any of the upper (known-zero) bits in the comparison in the shuffle removal fold, as long as we demand all the elements of the movmsk source vector.
This avoids breaking clang-tidy/infrastructure/validate-check-names.cpp
if 'not' is evaluated as a lit internal tool (making TestRunner
invoke 'grep' directly in that test, instead of invoking 'not', which
then invokes 'grep').
The quoting of arguments is still brittle if the executable is an
MSYS based tool though, as MSYS based tools incorrectly unescape
backslashes in quoted arguments (contrary to regular win32 argument
parsing rules), see D99406 and
https://github.com/msys2/msys2-runtime/issues/36 for more examples
of the issues.
Differential Revision: https://reviews.llvm.org/D99938
This fixes breakage on Windows/ARM64 after D94355.
Modelled after the corresponding code for X86; not entirely familiar
with those aspects of that layer otherwise.
Differential Revision: https://reviews.llvm.org/D99572
Also remove a superfluous semicolon after the braces for a switch
statement (that wasn't warned about).
Differential Revision: https://reviews.llvm.org/D100447
According to the i386 System V ABI:
1. when __m256 are required to be passed on the stack, the stack pointer must be aligned on a 0 mod 32 byte boundary at the time of the call.
2. when __m512 are required to be passed on the stack, the stack pointer must be aligned on a 0 mod 64 byte boundary at the time of the call.
Clang currently passes __m512 parameters as follows:
1. when target supports avx512, passing it with 64 byte alignment;
2. when target supports avx, passing it with 32 byte alignment;
3. Otherwise, passing it with 16 byte alignment.
__m256 parameters are passed as follows:
1. when target supports avx or avx512, passing it with 32 byte alignment;
2. Otherwise, passing it with 16 byte alignment.
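A sketch of the kind of call site these rules constrain (function names are
illustrative; compiled for an i386 target with AVX-512 enabled):

#include <immintrin.h>

__m512 add3(__m512 a, __m512 b, __m512 c) {
  return _mm512_add_ps(_mm512_add_ps(a, b), c);
}
// If any __m512 argument is passed on the stack for an i386 target, the
// caller must align the stack pointer to a 0 mod 64 byte boundary at the call.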
This patch passes __m128/__m256/__m512 following the i386 System V ABI and
applies it to Linux only, since other System V OSes (e.g. Darwin, PS4 and
FreeBSD) don't want to spend any effort dealing with the ramifications of ABI
breaks at present.
Differential Revision: https://reviews.llvm.org/D78564