llvm-project

Commit Graph

Author	SHA1	Message	Date
Erich Keane	09233412ed	Revert part of D112349 to allow ifunc resolvers be declarations. The patch in D112349 added a previously nonexistant restriction on ifunc resolvers that they MUST be defintions. However, the function multiversioning depends on being able to resolve these resolvers at link-time, so this additional restriction was breaking.	2021-11-03 07:15:16 -07:00
David Sherwood	c0f2774973	[NFC][LoopVectorize] Simple tidy-up in InnerLoopVectorizer::createVectorIntOrFpInductionPHI Use getSignedIntOrFpConstant instead of creating int or FP constants manually.	2021-11-03 14:05:21 +00:00
David Spickett	fac3f20de5	Reland "[lldb] Remove non address bits when looking up memory regions" This reverts commit `5fbcf67734`. ProcessDebugger is used in ProcessWindows and NativeProcessWindows. I thought I was simplifying things by renaming to DoGetMemoryRegionInfo in ProcessDebugger but the Native process side expects "GetMemoryRegionInfo". Follow the pattern that WriteMemory uses. So: * ProcessWindows::DoGetMemoryRegioninfo calls ProcessDebugger::GetMemoryRegionInfo * NativeProcessWindows::GetMemoryRegionInfo does the same	2021-11-03 13:56:51 +00:00
Peter Waller	7a34145f40	Reland "[AArch64][SVE][InstCombine] Combine contiguous gather/scatter to load/store" This reverts commit `753eba6421`. Contiguous gather => masked load: (sve.ld1.gather.index Mask BasePtr (sve.index IndexBase 1)) => (masked.load (gep BasePtr IndexBase) Align Mask undef) Contiguous scatter => masked store: (sve.ld1.scatter.index Value Mask BasePtr (sve.index IndexBase 1)) => (masked.store Value (gep BasePtr IndexBase) Align Mask) Tests with <vscale x 2 x double>: [Gather, Scatter] for each [Positive test (index=1), Negative test (index=2), Alignment propagation]. Differential Revision: https://reviews.llvm.org/D112076	2021-11-03 13:42:14 +00:00
Peter Waller	753eba6421	Revert "[AArch64][SVE][InstCombine] Combine contiguous gather/scatter to load/store" This reverts commit `1febf42f03`, which has a use-of-uninitialized-memory bug. See: https://reviews.llvm.org/D112076	2021-11-03 13:39:38 +00:00
David Spickett	5fbcf67734	Revert "[lldb] Remove non address bits when looking up memory regions" This reverts commit `6f5ce43b43` due to build failure on Windows.	2021-11-03 13:27:41 +00:00
Florian Hahn	64bc31ee93	[LV] Drop unneeded use of getVPSingleValue (NFC). VPReductionPHIRecipe inherits from VPValue, so there's no need to call getVPSingleValue.	2021-11-03 14:26:15 +01:00
Konstantin Boyarinov	d7ac595fc5	[libcxx][test][NFC] More tests for containers comparisons Add more missing tests for comparisons to improve code coverage (follow-up for D111738) Reviewed By: ldionne, rarutyun, #libc Differential Revision: https://reviews.llvm.org/D112424	2021-11-03 16:15:10 +03:00
Sanjay Patel	ff30394de8	[PhaseOrdering] add tests for x86 abs/max using SSE intrinsics (PR34047); NFC D113035	2021-11-03 09:13:23 -04:00
Florian Hahn	8e44bdd12a	[VPlan] Make VPWidenCanonicalIVRecipe a VPValue (NFC). The recipe produces exactly one VPValue and can inherit directly from it. This is in line with other recipes and avoids having to use getVPSingleValue.	2021-11-03 14:11:01 +01:00
Andrew Savonichev	123ad720f1	[NVPTX] Mark special registers as reserved A reserved register: - is not allocatable - is considered always live - is ignored by liveness tracking NVPTX special registers match the criteria, and marking them as reserved helps to avoid machine verifier error: * Bad machine code: Using an undefined physical register * - function: foo - basic block: %bb.0 (0x557bb178b708) - instruction: %0:int32regs = MOV_SPECIAL $envreg0 - operand 1: $envreg0 Differential Revision: https://reviews.llvm.org/D113008	2021-11-03 15:48:04 +03:00
Clement Courbet	1427742750	[Sema][NFC] Improve test coverage for builtin operators. In preparation for D112453.	2021-11-03 13:32:48 +01:00
Pavel Labath	30f922741a	[lldb] Remove ConstString from plugin names in PluginManager innards This completes de-constification of plugin names.	2021-11-03 13:14:21 +01:00
Cullen Rhodes	d968b173d3	[TableGen] Emit a warning for unused template args Add a warning to TableGen for unused template arguments in classes and multiclasses, for example: multiclass Foo<int x> { def bar; } $ llvm-tblgen foo.td foo.td:1:20: warning: unused template argument: Foo::x multiclass Foo<int x> { ^ A flag '--no-warn-on-unused-template-args' is added to disable the warning. The warning is disabled for LLVM and sub-projects if 'LLVM_ENABLE_WARNINGS=OFF'. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D109359	2021-11-03 11:55:07 +00:00
Cullen Rhodes	6c5a897c44	[mlir][nvvm] NFC: Fix unused template arg tablegen warning Identified in D109359.	2021-11-03 11:55:06 +00:00
Butygin	1cb13fddb9	[mlir] spirv: Add some atomic ops Differential Revision: https://reviews.llvm.org/D112812	2021-11-03 14:47:12 +03:00
Andrew Savonichev	0e70785538	[NVPTX] Add MoveParam instruction for TargetExternalSymbol operand TargetExternalSymbol is considered to be an immediate and not a register, so machine verifier emits an error: * Bad machine code: Expected a register operand. * - function: static_offset - basic block: %bb.0 bb (0x560e9b306028) - instruction: %3:int64regs = MoveParamI64 &static_offset_param_1 - operand 1: &static_offset_param_1 The patch adds variants of this instruction with an immediate operand for byval arguments on 64-bit and 32-bit targets. Differential Revision: https://reviews.llvm.org/D113006	2021-11-03 14:43:41 +03:00
David Green	3bc586b9aa	[ARM] Treat MVE gather add-like-or's like adds LLVM has the habit of turning adds with no common bits set into ors, which means we need to detect them and treat them like adds again in the MVE gather/scatter lowering pass. Differential Revision: https://reviews.llvm.org/D112922	2021-11-03 11:41:06 +00:00
David Spickett	6f5ce43b43	[lldb] Remove non address bits when looking up memory regions On AArch64 we have various things using the non address bits of pointers. This means when you lookup their containing region you won't find it if you don't remove them. This changes Process GetMemoryRegionInfo to a non virtual method that uses the current ABI plugin to remove those bits. Then it calls DoGetMemoryRegionInfo. That function does the actual work and is virtual to be overriden by Process implementations. A test case is added that runs on AArch64 Linux using the top byte ignore feature. Reviewed By: omjavaid Differential Revision: https://reviews.llvm.org/D102757	2021-11-03 11:10:42 +00:00
Peter Waller	1febf42f03	[AArch64][SVE][InstCombine] Combine contiguous gather/scatter to load/store Contiguous gather => masked load: (sve.ld1.gather.index Mask BasePtr (sve.index IndexBase 1)) => (masked.load (gep BasePtr IndexBase) Align Mask undef) Contiguous scatter => masked store: (sve.ld1.scatter.index Value Mask BasePtr (sve.index IndexBase 1)) => (masked.store Value (gep BasePtr IndexBase) Align Mask) Tests with <vscale x 2 x double>: [Gather, Scatter] for each [Positive test (index=1), Negative test (index=2), Alignment propagation]. Differential Revision: https://reviews.llvm.org/D112076	2021-11-03 11:02:44 +00:00
David Green	d36dd1f842	[ARM] Push gather/scatter shl index updates out of loops This teaches the MVE gather scatter lowering pass that SHL is essentially the same as Mul, where we are able to optimize the induction of a gather/scatter address by pushing them out of loops. https://alive2.llvm.org/ce/z/wG4VyT Differential Revision: https://reviews.llvm.org/D112920	2021-11-03 11:00:05 +00:00
David Spickett	52615df0f2	[libcxx][utils] Note read only mount and ptrace permission in container script Reviewed By: ldionne, #libc Differential Revision: https://reviews.llvm.org/D110938	2021-11-03 10:09:15 +00:00
Qiu Chaofan	741aeda97d	[PowerPC] Implement longdouble pack/unpack builtins Implement two builtins to pack/unpack IBM extended long double float, according to GCC 'Basic PowerPC Builtin Functions Available ISA 2.05'. Reviewed By: jsji Differential Revision: https://reviews.llvm.org/D112055	2021-11-03 17:57:25 +08:00
David Sherwood	9da8dde7fd	[NFC][LoopVectorize] Add test for tail-folding loop with conditional uniform load I've added a test for a loop containing a conditional uniform load for a target that supports masked loads. The test just ensures that we correctly use gather instructions and have the correct mask. Differential Revision: https://reviews.llvm.org/D112619	2021-11-03 09:51:11 +00:00
Alex Zinenko	34f72d9125	[mlir][python] expose the shape property of shaped types This has been missing in the original definition of shaped types. Reviewed By: gysit Differential Revision: https://reviews.llvm.org/D113025	2021-11-03 10:49:12 +01:00
Alex Zinenko	fc7594cc4a	[mlir][python] improve usability of Python affine construct bindings - Provide the operator overloads for constructing (semi-)affine expressions in Python by combining existing expressions with constants. - Make AffineExpr, AffineMap and IntegerSet hashable in Python. - Expose the AffineExpr composition functionality. Reviewed By: gysit, aoyal Differential Revision: https://reviews.llvm.org/D113010	2021-11-03 10:48:01 +01:00
rkayaith	f78fe0b7b8	[mlir][python] Make Operation and Value hashable This allows operations and values to be used as dict keys Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D112669	2021-11-03 10:40:03 +01:00
Andrew Savonichev	30a3a17df8	[NVPTX] Copy machine operand flags in TII::insertBranch Before this patch, flags such as undef were dropped by TII::insertBranch (used by BranchFolding pass), resulting in the following error from machine verifier: * Bad machine code: Reading virtual register without a def * - function: hoge - basic block: %bb.0 bb (0x562e9c240e68) - instruction: CBranch %2:int1regs, %bb.3 - operand 0: %2:int1regs Differential Revision: https://reviews.llvm.org/D113001	2021-11-03 12:38:27 +03:00
Yi Kong	803d4f8a35	[ARM][AsmParser] Don't emit "deprecated instruction in IT block" warning if requested Also fixed formatting in AsmMatcherEmitter because it was confusing. Differential Revision: https://reviews.llvm.org/D112993	2021-11-03 17:18:04 +08:00
Valentin Clement	3c7ff45cbb	[fir] Add substr information to fircg.ext_embox and fircg.ext_rebox operations This patch adds the substring information to the fircg.ext_embox and fircg.ext_rebox operations. Substring is used for CHARACTER types. This patch is part of the upstreaming effort from fir-dev branch. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D112807 Co-authored-by: Eric Schweitz <eschweitz@nvidia.com>	2021-11-03 10:15:10 +01:00
Andrew Savonichev	a8083d42b1	[X86][clang] Disable long double type for -mno-x87 option This patch attempts to fix a compiler crash that occurs when long double type is used with -mno-x87 compiler option. The option disables x87 target feature, which in turn disables x87 registers, so CG cannot select them for x86_fp80 LLVM IR type. Long double is lowered as x86_fp80 for some targets, so it leads to a crash. The option seems to contradict the SystemV ABI, which requires long double to be represented as a 80-bit floating point, and it also requires to use x87 registers. To avoid that, `long double` type is disabled when -mno-x87 option is set. In addition to that, `float` and `double` also use x87 registers for return values on 32-bit x86, so they are disabled as well. Differential Revision: https://reviews.llvm.org/D98895	2021-11-03 12:08:39 +03:00
Kazushi (Jam) Marukawa	3d32218d1a	[VE] Change to omitting the frame pointer on leaf functions Change to omitting the frame pointer on leaf functions by default for VE. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D113087	2021-11-03 17:45:18 +09:00
Piotr Sobczak	03961709ed	[InstCombine] Extend pattern to replace shuffle's insertelement operand In D71220 a pattern was added to replace shuffle's insertelement operand if inserted scalar is not demanded. The pattern was added only for the case where the shuffle's mask size is equal to element's vector size. However, that condition is not required because the pattern does not change the shuffle vector size. This patch extends the pattern to also include cases where shuffle's mask size is not equal to element's vector size. Differential Revision: https://reviews.llvm.org/D112318	2021-11-03 09:43:04 +01:00
Nicolas Vasilache	9c4971740b	[mlir][Linalg] Refactor vectorization of conv1d more aggressively. This better decouples transfer read/write from vector-only rewrite of conv. This form is close to ready to plop into a new vector.conv op and the vector.transfer operations to be generalized as part of generic vectorization once the properties ConvolutionOpInterface are inferred from the indexing maps. This also results in a nice perf boost in the dw == 1 cases. Differential revision: https://reviews.llvm.org/D112822	2021-11-03 08:18:01 +00:00
Nicolas Vasilache	7b09f157e1	[mlir][Linalg] Refactor conv vectorization to decouple memory from vector ops. This refactoring prepares conv1d vectorization for a future integration into the generic codegen path. Once transfer_read / transfer_write vectorization also supports sliding windows, the special pattern for conv can disappear. This will also likely need a vector.conv operation. Differential Revision: https://reviews.llvm.org/D112797	2021-11-03 08:03:40 +00:00
Fangrui Song	c977564fc2	Revert "[ELF] Try appeasing --target=armv7-linux-androideabi24 sanitizer symbolization tests" This reverts commit `5cbec88cbf`. Vitaly said that `2faac77f26` actually works. Sanitizer's armv7-linux-androideabi24 configuration has other issues which haven't been identified yet, but that's unrelated to the empty symbol name issue.	2021-11-03 00:56:09 -07:00
Markus Böck	24f80d94b4	[mlir] Fix typos in comments in DebugAction.h	2021-11-03 08:54:47 +01:00
Ben Shi	59c3b48d99	Revert "[AArch64] Optimize add/sub with immediate" This reverts commit `3de3ca3137`.	2021-11-03 14:15:21 +08:00
Chen Zheng	5a8b196340	[PowerPC] handle more splat loads without stack operation This mostly improves splat loads code generation on Power7 Reviewed By: jsji Differential Revision: https://reviews.llvm.org/D106555	2021-11-03 05:17:41 +00:00
Johannes Doerfert	d61aac76bf	[OpenMP][FIX] Do not signal SPMD-mode but then keep generic-mode If we assume SPMD-mode during the fixpoint iteration we have to execute the kernel in SPMD-mode. If we change our mind during manifest there is the chance of a mismatch between the simplification, e.g., of `__kmpc_is_spmd_exec_mode` calls, and the execution mode. This problem was introduced in D109438. This patch is compromise to resolve the problem purely in OpenMP-opt while trying to keep the benefits of D109438 around. This might not always work, see `get_hardware_num_threads_in_block_fold` but it often does. At the same time we do keep value specialization and execution mode in sync. Proper solutions to this problem should be considered. I believe a new execution mode is the easiest way forward (Singleton-SPMD). Alternatively, SPMD-mode execution can be used with a way to provide a new thread_limit (here 1) to the runtime. This is more general and could be useful if we see `num_threads` clauses or workshared loops with small trip counts in the kernel. In either proposal we need to disable the guarding for the kernel (which was the motivation for D109438). Reviewed By: jhuber6 Differential Revision: https://reviews.llvm.org/D112894	2021-11-02 23:22:04 -05:00
Johannes Doerfert	73720c8059	[OpenMP][FIX] Introduce and use a simple generic-mode barrier Before we had aligned barriers the `__kmpc_barrier_simple_spmd` was OK to be used in the custom state machine. Now that SPMD barriers are assumed to be aligned we need to use a "generic" barrier in places that are not aligned. Reviewed By: tianshilei1992 Differential Revision: https://reviews.llvm.org/D112893	2021-11-02 23:22:01 -05:00
Johannes Doerfert	c690c1c977	[NVVM] Update intrinsic definitions to include more attributes A lot of NVVM intrinsics can use the default intrinsic attributes (e.g., nosync, nofree, ...) as well as `speculatable`. The latter is important if we want to recompute intrinsics results instead of communicating them via memory. I did use default attributes for almost all `readnone` attributes but speculatable only where I had reasonable confidence they cannot experience UB. That said, someone should double check. TODO: There seem to be various intrinsics marked `Commutative` which should not, e.g., fma and div. Reviewed By: tra Differential Revision: https://reviews.llvm.org/D109987	2021-11-02 23:21:57 -05:00
Johannes Doerfert	e6e440ae5f	[OpenMP][FIX] Ensure guarding uses proper global name Global symbols cannot have any name so we need to sanitize the string first. Also remove an assertion that is not actually necessary nor true in general. Reviewed By: ggeorgakoudis Differential Revision: https://reviews.llvm.org/D112892	2021-11-02 23:21:53 -05:00
Johannes Doerfert	ccb5d2726a	[OpenMP][FIX] Avoid a race between initialization and first state reads When we pick state 0 to initialize state but thread N is going to be the "main thread", in generic mode, we would require extra synchronization. Instead, we should pick the main thread to initialize state in generic mode and any thread in SPMD mode. Reviewed By: tianshilei1992 Differential Revision: https://reviews.llvm.org/D112874	2021-11-02 23:21:49 -05:00
Abinav Puthan Purayil	fbe61fb0aa	[AMDGPU] Fix SGPR checks in S_MOV_B64_IMM_PSEUDO generation. The function to generate S_MOV_B64_IMM_PSEUDO was recently modified to optimize AGPR to AGPR copy but it missed checking for the SGPR clobbering for the S_MOV_B64_IMM_PSEUDO generation. Differential Revision: https://reviews.llvm.org/D113005	2021-11-03 09:09:24 +05:30
Ben Shi	3de3ca3137	[AArch64] Optimize add/sub with immediate Optimize ([add\|sub] r, imm) -> ([ADD\|SUB] ([ADD\|SUB] r, #imm0, lsl #12), #imm1), if imm == (imm0<<12)+imm1. and both imm0 and imm1 are non-zero 12-bit unsigned integers. Optimize ([add\|sub] r, imm) -> ([SUB\|ADD] ([SUB\|ADD] r, #imm0, lsl #12), #imm1), if imm == -(imm0<<12)-imm1, and both imm0 and imm1 are non-zero 12-bit unsigned integers. Reviewed By: jaykang10, dmgreen Differential Revision: https://reviews.llvm.org/D111034	2021-11-03 03:06:43 +00:00
wlei	dc9f037955	[llvm-profgen] Refactor the code of getHashCode Refactor to generate hash code lazily. Tested on clang self build, no observable generating time regression. Reviewed By: hoy, wenlei Differential Revision: https://reviews.llvm.org/D113059	2021-11-02 19:56:20 -07:00
wlei	138202a8c3	[llvm-profgen] Warn on invalid range and show warning summary Two things in this diff: 1) Warn on the invalid range, currently three types of checking, see the detailed message in the code. 2) In some situation, llvm-profgen gives lots of warnings on the truncated stacks which is noisy. This change provides a switch to `--show-detailed-warning` to skip the warnings. Alternatively, we use a summary for those warning and show the percentage of cases with those issues. Example of warning summary. ``` warning: 0.05%(1120/2428958) cases with issue: Profile context truncated due to missing probe for call instruction. warning: 0.00%(2/178637) cases with issue: Range does not belong to any functions, likely from external function. ``` Reviewed By: hoy Differential Revision: https://reviews.llvm.org/D111902	2021-11-02 19:55:55 -07:00
Liren Peng	57e093162e	[ScalarEvolution] Infer loop max trip count from array accesses Data references in a loop should not access elements over the statically allocated size. So we can infer a loop max trip count from this undefined behavior. Reviewed By: reames, mkazantsev, nikic Differential Revision: https://reviews.llvm.org/D109821	2021-11-03 10:40:18 +08:00
Phoebe Wang	8f101971b6	[X86][VARARG] Assign MMO earlier to avoid prolog insert point been sunk across VASTART_SAVE_XMM_REGS The changes in D80163 defered the assignment of MachineMemOperand (MMO) until the X86ExpandPseudo pass. This will result in crash due to prolog insert point been sunk across the pseudo instruction VASTART_SAVE_XMM_REGS. Moving the assignment to the creation of the node can avoid the problem. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D112859	2021-11-03 10:13:32 +08:00

1 2 3 4 5 ...

403617 Commits All Branches Search

403617 Commits

All Branches