llvm-project

Commit Graph

Author	SHA1	Message	Date
rkayaith	f78fe0b7b8	[mlir][python] Make Operation and Value hashable This allows operations and values to be used as dict keys Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D112669	2021-11-03 10:40:03 +01:00
Andrew Savonichev	30a3a17df8	[NVPTX] Copy machine operand flags in TII::insertBranch Before this patch, flags such as undef were dropped by TII::insertBranch (used by BranchFolding pass), resulting in the following error from machine verifier: * Bad machine code: Reading virtual register without a def * - function: hoge - basic block: %bb.0 bb (0x562e9c240e68) - instruction: CBranch %2:int1regs, %bb.3 - operand 0: %2:int1regs Differential Revision: https://reviews.llvm.org/D113001	2021-11-03 12:38:27 +03:00
Yi Kong	803d4f8a35	[ARM][AsmParser] Don't emit "deprecated instruction in IT block" warning if requested Also fixed formatting in AsmMatcherEmitter because it was confusing. Differential Revision: https://reviews.llvm.org/D112993	2021-11-03 17:18:04 +08:00
Valentin Clement	3c7ff45cbb	[fir] Add substr information to fircg.ext_embox and fircg.ext_rebox operations This patch adds the substring information to the fircg.ext_embox and fircg.ext_rebox operations. Substring is used for CHARACTER types. This patch is part of the upstreaming effort from fir-dev branch. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D112807 Co-authored-by: Eric Schweitz <eschweitz@nvidia.com>	2021-11-03 10:15:10 +01:00
Andrew Savonichev	a8083d42b1	[X86][clang] Disable long double type for -mno-x87 option This patch attempts to fix a compiler crash that occurs when long double type is used with -mno-x87 compiler option. The option disables x87 target feature, which in turn disables x87 registers, so CG cannot select them for x86_fp80 LLVM IR type. Long double is lowered as x86_fp80 for some targets, so it leads to a crash. The option seems to contradict the SystemV ABI, which requires long double to be represented as a 80-bit floating point, and it also requires to use x87 registers. To avoid that, `long double` type is disabled when -mno-x87 option is set. In addition to that, `float` and `double` also use x87 registers for return values on 32-bit x86, so they are disabled as well. Differential Revision: https://reviews.llvm.org/D98895	2021-11-03 12:08:39 +03:00
Kazushi (Jam) Marukawa	3d32218d1a	[VE] Change to omitting the frame pointer on leaf functions Change to omitting the frame pointer on leaf functions by default for VE. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D113087	2021-11-03 17:45:18 +09:00
Piotr Sobczak	03961709ed	[InstCombine] Extend pattern to replace shuffle's insertelement operand In D71220 a pattern was added to replace shuffle's insertelement operand if inserted scalar is not demanded. The pattern was added only for the case where the shuffle's mask size is equal to element's vector size. However, that condition is not required because the pattern does not change the shuffle vector size. This patch extends the pattern to also include cases where shuffle's mask size is not equal to element's vector size. Differential Revision: https://reviews.llvm.org/D112318	2021-11-03 09:43:04 +01:00
Nicolas Vasilache	9c4971740b	[mlir][Linalg] Refactor vectorization of conv1d more aggressively. This better decouples transfer read/write from vector-only rewrite of conv. This form is close to ready to plop into a new vector.conv op and the vector.transfer operations to be generalized as part of generic vectorization once the properties ConvolutionOpInterface are inferred from the indexing maps. This also results in a nice perf boost in the dw == 1 cases. Differential revision: https://reviews.llvm.org/D112822	2021-11-03 08:18:01 +00:00
Nicolas Vasilache	7b09f157e1	[mlir][Linalg] Refactor conv vectorization to decouple memory from vector ops. This refactoring prepares conv1d vectorization for a future integration into the generic codegen path. Once transfer_read / transfer_write vectorization also supports sliding windows, the special pattern for conv can disappear. This will also likely need a vector.conv operation. Differential Revision: https://reviews.llvm.org/D112797	2021-11-03 08:03:40 +00:00
Fangrui Song	c977564fc2	Revert "[ELF] Try appeasing --target=armv7-linux-androideabi24 sanitizer symbolization tests" This reverts commit `5cbec88cbf`. Vitaly said that `2faac77f26` actually works. Sanitizer's armv7-linux-androideabi24 configuration has other issues which haven't been identified yet, but that's unrelated to the empty symbol name issue.	2021-11-03 00:56:09 -07:00
Markus Böck	24f80d94b4	[mlir] Fix typos in comments in DebugAction.h	2021-11-03 08:54:47 +01:00
Ben Shi	59c3b48d99	Revert "[AArch64] Optimize add/sub with immediate" This reverts commit `3de3ca3137`.	2021-11-03 14:15:21 +08:00
Chen Zheng	5a8b196340	[PowerPC] handle more splat loads without stack operation This mostly improves splat loads code generation on Power7 Reviewed By: jsji Differential Revision: https://reviews.llvm.org/D106555	2021-11-03 05:17:41 +00:00
Johannes Doerfert	d61aac76bf	[OpenMP][FIX] Do not signal SPMD-mode but then keep generic-mode If we assume SPMD-mode during the fixpoint iteration we have to execute the kernel in SPMD-mode. If we change our mind during manifest there is the chance of a mismatch between the simplification, e.g., of `__kmpc_is_spmd_exec_mode` calls, and the execution mode. This problem was introduced in D109438. This patch is compromise to resolve the problem purely in OpenMP-opt while trying to keep the benefits of D109438 around. This might not always work, see `get_hardware_num_threads_in_block_fold` but it often does. At the same time we do keep value specialization and execution mode in sync. Proper solutions to this problem should be considered. I believe a new execution mode is the easiest way forward (Singleton-SPMD). Alternatively, SPMD-mode execution can be used with a way to provide a new thread_limit (here 1) to the runtime. This is more general and could be useful if we see `num_threads` clauses or workshared loops with small trip counts in the kernel. In either proposal we need to disable the guarding for the kernel (which was the motivation for D109438). Reviewed By: jhuber6 Differential Revision: https://reviews.llvm.org/D112894	2021-11-02 23:22:04 -05:00
Johannes Doerfert	73720c8059	[OpenMP][FIX] Introduce and use a simple generic-mode barrier Before we had aligned barriers the `__kmpc_barrier_simple_spmd` was OK to be used in the custom state machine. Now that SPMD barriers are assumed to be aligned we need to use a "generic" barrier in places that are not aligned. Reviewed By: tianshilei1992 Differential Revision: https://reviews.llvm.org/D112893	2021-11-02 23:22:01 -05:00
Johannes Doerfert	c690c1c977	[NVVM] Update intrinsic definitions to include more attributes A lot of NVVM intrinsics can use the default intrinsic attributes (e.g., nosync, nofree, ...) as well as `speculatable`. The latter is important if we want to recompute intrinsics results instead of communicating them via memory. I did use default attributes for almost all `readnone` attributes but speculatable only where I had reasonable confidence they cannot experience UB. That said, someone should double check. TODO: There seem to be various intrinsics marked `Commutative` which should not, e.g., fma and div. Reviewed By: tra Differential Revision: https://reviews.llvm.org/D109987	2021-11-02 23:21:57 -05:00
Johannes Doerfert	e6e440ae5f	[OpenMP][FIX] Ensure guarding uses proper global name Global symbols cannot have any name so we need to sanitize the string first. Also remove an assertion that is not actually necessary nor true in general. Reviewed By: ggeorgakoudis Differential Revision: https://reviews.llvm.org/D112892	2021-11-02 23:21:53 -05:00
Johannes Doerfert	ccb5d2726a	[OpenMP][FIX] Avoid a race between initialization and first state reads When we pick state 0 to initialize state but thread N is going to be the "main thread", in generic mode, we would require extra synchronization. Instead, we should pick the main thread to initialize state in generic mode and any thread in SPMD mode. Reviewed By: tianshilei1992 Differential Revision: https://reviews.llvm.org/D112874	2021-11-02 23:21:49 -05:00
Abinav Puthan Purayil	fbe61fb0aa	[AMDGPU] Fix SGPR checks in S_MOV_B64_IMM_PSEUDO generation. The function to generate S_MOV_B64_IMM_PSEUDO was recently modified to optimize AGPR to AGPR copy but it missed checking for the SGPR clobbering for the S_MOV_B64_IMM_PSEUDO generation. Differential Revision: https://reviews.llvm.org/D113005	2021-11-03 09:09:24 +05:30
Ben Shi	3de3ca3137	[AArch64] Optimize add/sub with immediate Optimize ([add\|sub] r, imm) -> ([ADD\|SUB] ([ADD\|SUB] r, #imm0, lsl #12), #imm1), if imm == (imm0<<12)+imm1. and both imm0 and imm1 are non-zero 12-bit unsigned integers. Optimize ([add\|sub] r, imm) -> ([SUB\|ADD] ([SUB\|ADD] r, #imm0, lsl #12), #imm1), if imm == -(imm0<<12)-imm1, and both imm0 and imm1 are non-zero 12-bit unsigned integers. Reviewed By: jaykang10, dmgreen Differential Revision: https://reviews.llvm.org/D111034	2021-11-03 03:06:43 +00:00
wlei	dc9f037955	[llvm-profgen] Refactor the code of getHashCode Refactor to generate hash code lazily. Tested on clang self build, no observable generating time regression. Reviewed By: hoy, wenlei Differential Revision: https://reviews.llvm.org/D113059	2021-11-02 19:56:20 -07:00
wlei	138202a8c3	[llvm-profgen] Warn on invalid range and show warning summary Two things in this diff: 1) Warn on the invalid range, currently three types of checking, see the detailed message in the code. 2) In some situation, llvm-profgen gives lots of warnings on the truncated stacks which is noisy. This change provides a switch to `--show-detailed-warning` to skip the warnings. Alternatively, we use a summary for those warning and show the percentage of cases with those issues. Example of warning summary. ``` warning: 0.05%(1120/2428958) cases with issue: Profile context truncated due to missing probe for call instruction. warning: 0.00%(2/178637) cases with issue: Range does not belong to any functions, likely from external function. ``` Reviewed By: hoy Differential Revision: https://reviews.llvm.org/D111902	2021-11-02 19:55:55 -07:00
Liren Peng	57e093162e	[ScalarEvolution] Infer loop max trip count from array accesses Data references in a loop should not access elements over the statically allocated size. So we can infer a loop max trip count from this undefined behavior. Reviewed By: reames, mkazantsev, nikic Differential Revision: https://reviews.llvm.org/D109821	2021-11-03 10:40:18 +08:00
Phoebe Wang	8f101971b6	[X86][VARARG] Assign MMO earlier to avoid prolog insert point been sunk across VASTART_SAVE_XMM_REGS The changes in D80163 defered the assignment of MachineMemOperand (MMO) until the X86ExpandPseudo pass. This will result in crash due to prolog insert point been sunk across the pseudo instruction VASTART_SAVE_XMM_REGS. Moving the assignment to the creation of the node can avoid the problem. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D112859	2021-11-03 10:13:32 +08:00
Fangrui Song	5cbec88cbf	[ELF] Try appeasing --target=armv7-linux-androideabi24 sanitizer symbolization tests	2021-11-02 18:57:04 -07:00
Mircea Trofin	34f4fe3a90	[NFC][Regalloc] Ensure Query::interferingVRegs is accurate. To correctly use Query, one had to first call collectInterferingVRegs to pre-cache the query result, then call interferingVRegs. Failing the former, interferingVRegs could be stale. This did cause a bug which was addressed in D98232, but the underlying usability issue of the Query API wasn't. This patch addresses the latter by making collectInterferingVRegs an implementation detail, and having interferingVRegs play both roles. One side-effect of this is that interferingVRegs is not const anymore. Differential Revision: https://reviews.llvm.org/D112882	2021-11-02 18:26:54 -07:00
Kazu Hirata	1b108ab975	[Transforms] Use make_early_inc_range (NFC)	2021-11-02 18:13:23 -07:00
Hongtao Yu	d0eb472f33	[llvm-profdata] Print out section flags for FunctionMetadata section As titled. Reviewed By: wenlei, wlei Differential Revision: https://reviews.llvm.org/D113064	2021-11-02 17:59:22 -07:00
Fangrui Song	2faac77f26	[ARM] Make empty name symbols SF_FormatSpecific to try appeasing --target=armv7-linux-androideabi24 sanitizer symbolization tests This is similar to D98669 but I don't know whether and why ARM needs it. I suspect this may fix the sanitizer-x86_64-linux-android bot which runs --target=armv7-linux-androideabi24 tests (https://lab.llvm.org/buildbot/#/builders/77) Someone needs to investigate the root cause and I am not sure this hack fixes the bot.	2021-11-02 17:10:42 -07:00
Arthur Eubanks	f2e807797e	Revert "[gn build] Manually port 6fd2db04" This reverts commit `43390d38f0`. Corresponding commit was reverted.	2021-11-02 16:51:56 -07:00
Vitaly Buka	ee4634f7fe	[NFC][asan] Fix confusing variable name There is no such thing as ModuleUseAfterScope.	2021-11-02 16:49:15 -07:00
Vitaly Buka	eb9423ae0e	[NFC][asan] Simplify AddressSanitizerOptions	2021-11-02 16:49:15 -07:00
MaheshRavishankar	3ecc2a63eb	[mlir][Linalg] Allow transformation filter to match by default. The current setup of LinalgTransformationFilter allows a transformation to trigger when either 1) The StringAttr is not set and no filter identifier is specified. 2) The StringAttr is set and its value matches (one of) the provided identifier. This misses the case where the transformation should trigger either when the attribute is not set or its value matches (one of) the provided identifier. Since `Identifier` does not allow empty strings, add a boolean option to match when the attribute is not set. This option is by default off. Differential Revision: https://reviews.llvm.org/D113057	2021-11-02 15:59:56 -07:00
Mehdi Amini	ba7a6b314f	Fix iterator_adaptor_base/enumerator_iter to allow composition of llvm::enumerate with llvm::make_filter_range * Properly specify reference type in enumerator_iter * Fix constness of iterator_adaptor_base::operator* Differential Revision: https://reviews.llvm.org/D112981	2021-11-02 22:49:43 +00:00
Mehdi Amini	ca0ed40e00	Remove builder that takes SSA value instead of Attribute on ExtractValueOp, InsetValueOp, and InsertOnRangeOp This builder exposed a somehow "unsafe" API: it pretends we can construct an InsertOnRangeOp from a range of SSA values, even though this will crash if these aren't the result of `arith.constant` since the operation actually needs attribute values (a build method can't fail gracefully). That means that the caller must check for the producer, at which point they can just assemble the attribute array directly and call the existing builder. The existing call-sites were even in a worse state here: they would actually create a constant operation that wouldn't be used and only serve to carry the attribute through the builder API. Differential Revision: https://reviews.llvm.org/D112946	2021-11-02 22:35:47 +00:00
Nicolas Vasilache	885072820c	[mlir][Vector] Add a pattern to lower 2-D vector.transpose to shape_cast+shuffle. The 2-D case can be rewritten to generate quite fewer instructions and a single vector.shuffle which seems to provide a nice performance boost. Add this arrow to our quiver by exposing it with a new vector transform option. Differential Revision: https://reviews.llvm.org/D113062	2021-11-02 22:12:46 +00:00
Eli Friedman	c964afb2c8	[AArch64] Diagnose large adrp offset on Windows. On Windows, this relocation can only encode a 21-bit offset. Make sure we emit an error, instead of silently truncating the offset. Found investigating https://bugs.llvm.org/show_bug.cgi?id=52378 Differential Revision: https://reviews.llvm.org/D113051	2021-11-02 15:11:22 -07:00
Kirill Stoimenov	bab3f32d6b	[mlir] Fixed a typo. Reviewed By: kda Differential Revision: https://reviews.llvm.org/D113053	2021-11-02 21:39:10 +00:00
Lawrence D'Anna	7f01f78593	[lldb] update TestEchoCommands Followup to https://reviews.llvm.org/D112988 Sorry, I broke this test. The test was verifying the bad behavior of --source-quietly that the previous change fixed -- namely that it still echos the initial list of startup commands while sourcing them. Updated the test to verify that --source-quietly is quiet, rather than loud. Reviewed By: JDevlieghere Differential Revision: https://reviews.llvm.org/D113047	2021-11-02 14:30:08 -07:00
Jonas Devlieghere	50b40b0518	[lldb] Improve error reporting in `lang objc tagged-pointer info` Improve error handling for the lang objc tagged-pointer info. Rather than failing silently, report an error if we couldn't convert an argument to an address or resolve the class descriptor. (lldb) lang objc tagged-pointer info 0xbb6404c47a587764 error: could not get class descriptor for 0xbb6404c47a587764 (lldb) lang objc tagged-pointer info n1 error: could not convert 'n1' to a valid address Differential revision: https://reviews.llvm.org/D112945	2021-11-02 14:25:42 -07:00
Florian Hahn	e515d3a433	[LV] Add test case from PR51794 for over-eager truncation. This patch adds a test case for PR51794 where reductions are performed on types that are too small.	2021-11-02 22:15:09 +01:00
Aart Bik	b3175fc2da	[mlir][sparse] bazel correction after filename change Reviewed By: GMNGeoffrey, rdzhabarov Differential Revision: https://reviews.llvm.org/D113052	2021-11-02 14:09:45 -07:00
Rich Lowe	de6f7252da	[sanitizer_common] Fix readlink error handling in sanitizer_procmaps_solaris.cpp As pointed out in Bug 52371, the Solaris version of `MemoryMappingLayout::Next` completely failed to handle `readlink` errors or properly NUL-terminate the result. This patch fixes this. Originally provided in the PR with slight formatting changes. Tested on `amd64-pc-solaris2.11`. Differential Revision: https://reviews.llvm.org/D112998	2021-11-02 22:06:17 +01:00
Yaxun (Sam) Liu	60a085beb0	Revert "[clang] deprecate frelaxed-template-template-args, make it on by default" This reverts commit `2d7fba5f95`. The patch was reverted because it caused regression with rocThrust due to ambiguity of template specialization. For details please see https://reviews.llvm.org/D109496	2021-11-02 17:02:19 -04:00
Vy Nguyen	37f96cb478	Revert "[lld-macho] Change bitfield types to be identical." This reverts commit `ae31f9fbad`. Reason: bitfields can't be merged across parent/child classes anyway. So this change doesn't help.	2021-11-02 16:57:51 -04:00
HarrietAkot	8a91bc7bf4	[mlir][sparse] Rename SparseUtils.cpp file to SparseTensorUtils.cpp Bug 52304 - Rename the sparse runtime support library cpp file Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D113043	2021-11-02 13:54:33 -07:00
Nikita Popov	c00e9c6345	[BasicAA] Check known access sizes earlier (NFC) All heuristics for variable accesses require both access sizes to be known, so check this once at the start, rather than for each particular heuristic.	2021-11-02 21:26:26 +01:00
Nikita Popov	0b6ed92c8a	[BasicAA] Use early returns (NFC) Reduce nesting in aliasGEP() a bit by returning early.	2021-11-02 21:17:36 +01:00
Simon Pilgrim	53900a19fd	[X86][AVX] combineConcatVectorOps - use getBROADCAST_LOAD helper for splat of normal vector loads. NFCI. Reapplied from rG1cfecf4fc427 with fix for PR51226 - ensure the load is a normal (non-ext) load.	2021-11-02 20:03:25 +00:00
Martin Storsjö	dd5ce506f7	[libcxx] [test] Remove a LIBCXX-WINDOWS-FIXME, don't test an unsupported strftime() pattern Testing the unsupported pattern can trigger the invalid parameter handler, which depending on CRT configuration can abort the process. Differential Revision: https://reviews.llvm.org/D112352	2021-11-02 21:53:15 +02:00

... 2 3 4 5 6 ...

403741 Commits All Branches Search

403741 Commits

All Branches