llvm-project

Commit Graph

Author	SHA1	Message	Date
Simon Pilgrim	a1b43d2bc9	[LazyValueInfo] getPredicateAt - remove unnecessary null pointer check. NFC. We already dereference the CxtI pointer several times before reaching the "if(CxtI)", we have no need to check it again. Fixes a coverity warning.	2021-10-16 11:20:19 +01:00
Simon Pilgrim	c288241795	[ConstantFolding] ConstantFoldScalarCall2 - early-out if getLibFunc fails. NFC.	2021-10-16 11:20:19 +01:00
Simon Pilgrim	c18cf10a04	[ConstantFolding] Use getValueAPF const ref value where possible. NFC. Don't copy the value if we can avoid it.	2021-10-16 11:20:19 +01:00
Simon Pilgrim	76ca0d67ab	[ConstantFolding] ConstantFoldScalarCall1 - early-out if getLibFunc fails. NFC.	2021-10-16 11:20:18 +01:00
Roman Lebedev	d137f1288e	[X86][LV] X86 does not prefer vectorized addressing And another attempt to start untangling this ball of threads around gather. There's `TTI::prefersVectorizedAddressing()`hoop, which confusingly defaults to `true`, which tells LV to try to vectorize the addresses that lead to loads, but X86 generally can not deal with vectors of addresses, the only instructions that support that are GATHER/SCATTER, but even those aren't available until AVX2, and aren't really usable until AVX512. This specializes the hook for X86, to return true only if we have AVX512 or AVX2 w/ fast gather. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D111546	2021-10-16 12:32:18 +03:00
Ben Shi	9bf6bef995	[AArch64] Optimize add/sub with immediate Optimize ([add\|sub] r, imm) -> ([ADD\|SUB] ([ADD\|SUB] r, #imm0, lsl #12), #imm1), if imm == (imm0<<12)+imm1. and both imm0 and imm1 are non-zero 12-bit unsigned integers. Optimize ([add\|sub] r, imm) -> ([SUB\|ADD] ([SUB\|ADD] r, #imm0, lsl #12), #imm1), if imm == -(imm0<<12)-imm1, and both imm0 and imm1 are non-zero 12-bit unsigned integers. Reviewed By: jaykang10, dmgreen Differential Revision: https://reviews.llvm.org/D111034	2021-10-16 08:50:39 +00:00
Carlos Galvez	f0711106dc	[clang-tidy] Fix false positive in cppcoreguidelines-virtual-class-destructor Incorrectly triggers for template classes that inherit from a base class that has virtual destructor. Any class inheriting from a base that has a virtual destructor will have their destructor also virtual, as per the Standard: https://timsong-cpp.github.io/cppwp/n4140/class.dtor#9 > If a class has a base class with a virtual destructor, > its destructor (whether user- or implicitly-declared) is virtual. Added unit tests to prevent regression. Fixes bug https://bugs.llvm.org/show_bug.cgi?id=51912 Differential Revision: https://reviews.llvm.org/D110614	2021-10-16 08:27:08 +00:00
Matthias Springer	e7bb8dd929	[mlir][linalg][bufferize] Relax rules for extract_slice/insert_slice matching The rules were too restrictive, causing out-of-place bufferization when the result of two ExtractSliceOp is fed into an InsertSliceOp. Differential Revision: https://reviews.llvm.org/D111861	2021-10-16 17:08:47 +09:00
Craig Topper	64591f217d	[TableGen] Replace static_cast with llvm's cast. NFC These all appear next to an isa<> and cast<> is much more common in these cases.	2021-10-16 00:27:53 -07:00
Juneyoung Lee	37ca7a795b	Fix missing failures in clang-ppc64be* and retry fixing clang-x64-windows-msvc	2021-10-16 16:20:14 +09:00
Groverkss	52d6c5df85	[MLIR] Generalize Affine dependence analysis using Affine Relations This patch removes code very specific to affine dependence analysis and refactors it as a FlatAfffineRelation. A FlatAffineRelation represents a set of ordered pairs (domain -> range) where "domain" and "range" are tuples of identifiers. These relations are used to represent an "access relation" for memory access on a memref. An access relation maps elements of an iteration domain to the element(s) of an array domain accessed by that iteration of the associated statement through some array reference. The dependence relation representing the dependence constraints between two memory accesses can be built by composing the access relation of the destination access by the inverse of the access relation of source access. This patch does not change the functionality of the existing dependence analysis in checkMemrefAccessDependence, but refactors it to use FlatAffineRelations to deduplicate code and enable code reuse for future development of features like scheduling, value-based dependence analysis, etc. Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D110563	2021-10-16 12:16:42 +05:30
Juneyoung Lee	9aa6c72b92	Fix lit test failures in clang-ppc* and clang-x64-windows-msvc	2021-10-16 14:33:59 +09:00
Juneyoung Lee	705387c507	Resolve lit failures in clang after 8ca4b3e's land	2021-10-16 13:51:50 +09:00
Juneyoung Lee	8ca4b3ef19	[Clang/Test]: Rename enable_noundef_analysis to disable-noundef-analysis and turn it off by default (2) This patch updates test files after D105169. Autogenerated test codes are changed by `utils/update_cc_test_checks.py,` and non-autogenerated test codes are changed as follows: (1) I wrote a python script that (partially) updates the tests using regex: {F18594904} The script is not perfect, but I believe it gives hints about which patterns are updated to have `noundef` attached. (2) The remaining tests are updated manually. Reviewed By: eugenis Differential Revision: https://reviews.llvm.org/D108453	2021-10-16 12:01:41 +09:00
Juneyoung Lee	80dba72a66	[Clang/Test]: Rename enable_noundef_analysis to disable-noundef-analysis and turn it off by default Turning on `enable_noundef_analysis` flag allows better codegen by removing freeze instructions. I modified clang by renaming `enable_noundef_analysis` flag to `disable-noundef-analysis` and turning it off by default. Test updates are made as a separate patch: D108453 Reviewed By: eugenis Differential Revision: https://reviews.llvm.org/D105169	2021-10-16 12:01:37 +09:00
Michael Kruse	da2e1f622d	[Polly][docs] Fix Sphinx warning. ReStructured Text is not Markdown.	2021-10-15 21:13:43 -05:00
Craig Topper	f6cd43c098	[X86] Add more tests for D111858. NFC Add tests with sub instead of neg.	2021-10-15 17:51:43 -07:00
Zhi An Ng	da07942834	[WebAssembly] Add prototype relaxed laneselect instructions Add i8x16, i16x8, i32x4, i64x2 laneselect instructions. These are only exposed as builtins, and require user opt-in.	2021-10-15 17:45:09 -07:00
Jacques Pienaar	965ec6dbe7	[mlir] Add folder for shape.add	2021-10-15 17:30:17 -07:00
Aart Bik	e9b1c974be	[mlir][sparse] run less combinations of SpMM in test (to reduce runtime) This revision also adds a few passes to the sparse compiler part to unify the transformation sequence with all other paths we currently use. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D111900	2021-10-15 16:04:01 -07:00
Geoffrey Martin-Noble	efc6fe963c	[MLIR][TOSA] Drop "OnTensors" suffix This is the only lowering to Linalg Tosa has, so it's needlessly verbose. Likely this was a carry over from IREE's usage where we originally lowered to linalg on buffers (the only linalg that existed at the time), so the everything on tensors needed the suffix. We're dropping it in IREE also, having transitioned entirely to using Linalg on tensors. Reviewed By: sjarus Differential Revision: https://reviews.llvm.org/D111911	2021-10-15 16:01:19 -07:00
Fangrui Song	f8ee74fc13	[ELF] Require two-dash form for --pack-dyn-relocs LLD specific options can be more rigid. Also add a test.	2021-10-15 15:36:30 -07:00
Matheus Izvekov	489561d463	[clang] fix typo correction not looking for candidates in base classes. RecordMemberExprValidator was not looking through ElaboratedType nodes when looking for candidates which occur in base classes. Signed-off-by: Matheus Izvekov <mizvekov@gmail.com> Reviewed By: rsmith Differential Revision: https://reviews.llvm.org/D111830	2021-10-16 00:35:22 +02:00
Anshil Gandhi	1830ec94ac	Revert "[HIP] [AlwaysInliner] Disable AlwaysInliner to eliminate undefined symbols" This reverts commit `03375a3fb3`.	2021-10-15 16:16:18 -06:00
$Lawrence D'\''Anna$ Lawrence D'\''Anna	4594f81165	Fix Xcode project for debugserver It seems StringConvert.cpp was moved, and the Xcode project file wasn't updated. Reviewed By: JDevlieghere Differential Revision: https://reviews.llvm.org/D111910	2021-10-15 15:08:06 -07:00
Nikita Popov	587493b441	[ConstantRange] Compute precise shl range for single elements For the common case where the shift amount is constant (a single element range) we can easily compute a precise range (up to unsigned envelope), so do that.	2021-10-15 23:44:41 +02:00
Anshil Gandhi	f92db6d3ff	[HIP] Relax conditions for address space cast in builtin args Allow (implicit) address space casting between LLVM-equivalent target address spaces. Reviewed By: yaxunl, tra Differential Revision: https://reviews.llvm.org/D111734	2021-10-15 15:35:52 -06:00
Arthur Eubanks	2a2432e95f	[NFC] Make Assume2KnowledgeMap's typedef more precise	2021-10-15 14:34:17 -07:00
Sanjay Patel	a49f5386ce	[InstCombine] generalize fold for mask-with-signbit-splat, part 2 This removes an over-specified fold. The more general transform was added with: `727e642e97` There's a difference on an existing test that shows a potentially unnecessary use limit on an icmp fold. That fold is in InstCombinerImpl::foldICmpSubConstant(), and IIRC there was some back-and-forth on it and similar folds because they could cause analysis/passes (SCEV, LSR?) to miss optimizations. Differential Revision: https://reviews.llvm.org/D111410	2021-10-15 17:11:29 -04:00
Stanislav Mekhanoshin	cd538a6b14	[AMDGPU] Precommit fused-bitlogic.ll test. NFC.	2021-10-15 13:56:24 -07:00
Nikita Popov	9eb8040a28	[ConstantRange] Support checking optimality for subset of inputs (NFC) We always want to check correctness, but for some operations we can only guarantee optimality for a subset of inputs. Accept an additional predicate that determines whether optimality for a given pair of ranges should be checked.	2021-10-15 22:48:07 +02:00
Anshil Gandhi	53fc5100e0	Revert "[HIP] Relax conditions for address space cast in builtin args" This reverts commit `3b48e1170d`.	2021-10-15 14:42:28 -06:00
Sanjay Patel	727e642e97	[InstCombine] generalize fold for mask-with-signbit-splat (iN X s>> (N-1)) & Y --> (X < 0) ? Y : 0 https://alive2.llvm.org/ce/z/qeYhdz I was looking at a missing abs() transform and found my way to this generalization of an existing fold that was added with D67799. As discussed in that review, we want to make sure codegen handles this difference well, and for all of the targets/types that I spot-checked, it looks good. I am leaving the existing fold in place in this commit because it covers a potentially missing icmp fold, but I plan to remove that as a follow-up commit as suggested during review. Differential Revision: https://reviews.llvm.org/D111410	2021-10-15 16:25:48 -04:00
Anshil Gandhi	3b48e1170d	[HIP] Relax conditions for address space cast in builtin args Allow (implicit) address space casting between LLVM-equivalent target address spaces. Reviewed By: yaxunl Differential Revision: https://reviews.llvm.org/D111734	2021-10-15 14:06:47 -06:00
Nikita Popov	0c52c271a5	[BasicAA] Rename ExtendedValue to CastedValue (NFC) As suggested on D110977, rename ExtendedValue to CastedValue, because it will contain more than just extensions in the future.	2021-10-15 21:56:54 +02:00
Nikita Popov	82e858d1bf	[ConstantRange] Better diagnostic for correctness test failure (NFC) Print a friendly error message including the inputs, result and not-contained element if an exhaustive correctness test fails, same as we do if the optimality test fails.	2021-10-15 21:52:17 +02:00
Volodymyr Sapsai	d0e7bdc208	[modules] Make a module map referenced by a system map a system one too. Mimic the behavior of including headers where a system includer makes an includee a system header too. rdar://84049469 Differential Revision: https://reviews.llvm.org/D111476	2021-10-15 12:46:51 -07:00
Florian Hahn	4a1d63d7d0	[VectorCombine] Add option to only run scalarization transforms. This patch adds a pass option to only run transforms that scalarize vector operations and do not create new vector instructions. When running VectorCombine early in the pipeline introducing new vector operations can have negative effects, like blocking loop or SLP vectorization. To avoid regressions, restrict the early VectorCombine run (when using -enable-matrix) to only perform scalarization and not introduce new vector operations. This is done as option to the pass directly, which is then set when adding the pass to the pipeline. This is done for the new pass manager only. Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D111800	2021-10-15 20:35:58 +01:00
Pirama Arumuga Nainar	69708477be	[compiler-rt/profile] Hide __llvm_profile_raw_version Hide __llvm_profile_raw_version so as not to resolve reference from a dependent shared object. Since libclang_rt.profile is added later in the command line, a definition of __llvm_profile_raw_version is not included if it is provided from an earlier object, e.g. from a shared dependency. This causes an extra dependence edge where if libA.so depends on libB.so and both are coverage-instrumented, libA.so uses libB.so's definition of __llvm_profile_raw_version. This leads to a runtime link failure if the libB.so available at runtime does not provide this symbol (but provides the other dependent symbols). Such a scenario can occur in Android's mainline modules. E.g.: ld -o libB.so libclang_rt.profile-x86_64.a ld -o libA.so -l B libclang_rt.profile-x86_64.a libB.so has a global definition of __llvm_profile_raw_version. libA.so uses libB.so's definition of __llvm_profile_raw_version. At runtime, libB.so may not be coverage-instrumented (i.e. not export __llvm_profile_raw_version) so runtime linking of libA.so will fail. Marking this symbol as hidden forces each binary to use the definition of __llvm_profile_raw_version from libclang_rt.profile. Differential Revision: https://reviews.llvm.org/D111759	2021-10-15 11:56:16 -07:00
Sam Clegg	659a08399a	[WebAssembly] Add import info to `dylink` section of shared libraries See https://github.com/WebAssembly/tool-conventions/pull/175 Differential Revision: https://reviews.llvm.org/D111345	2021-10-15 11:49:16 -07:00
Mingming Liu	cfd155c41b	[SelectionDAG] Fix typo in option help Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D111867	2021-10-15 11:27:40 -07:00
Anshil Gandhi	03375a3fb3	[HIP] [AlwaysInliner] Disable AlwaysInliner to eliminate undefined symbols By default clang emits complete contructors as alias of base constructors if they are the same. The backend is supposed to emit symbols for the alias, otherwise it causes undefined symbols. @yaxunl observed that this issue is related to the llvm options `-amdgpu-early-inline-all=true` and `-amdgpu-function-calls=false`. This issue is resolved by only inlining global values with internal linkage. The `getCalleeFunction()` in AMDGPUResourceUsageAnalysis also had to be extended to support aliases to functions. inline-calls.ll was corrected appropriately. Reviewed By: yaxunl, #amdgpu Differential Revision: https://reviews.llvm.org/D109707	2021-10-15 11:39:15 -06:00
Nico Weber	4e572db0c2	[lld/mac] Mark private externs with GOT relocs as LOCAL in indirect symbtab prepareSymbolRelocation() in Writer.cpp adds both symbols that need binding and symbols relocated with a pointer relocation to the got. Pointer relocations are emitted for non-movq GOTPCREL(%rip) loads. (movqs become GOT_LOADs so that the linker knows they can be relaxed to leaqs, while others, such as addq, become just GOT -- a pointer relocation -- since they can't be relaxed in that way). For example, this C file produces a private_extern GOT relocation when compiled with -O2 with clang: extern const char kString[]; const char* g(int a) { return kString + a; } Linkers need to put pointer-relocated symbols into the GOT, but ld64 marks them as LOCAL in the indirect symbol table. This matters, since `strip -x` looks at the indirect symbol table when deciding what to strip. The indirect symtab emitting code was assuming that only symbols that need binding are in the GOT, but pointer relocations where there too. Hence, the code needs to explicitly check if a symbol is a private extern. Fixes https://crbug.com/1242638, which has some more information in comments 14 and 15. With this patch, the output of `nm -U` on Chromium Framework after stripping now contains just two symbols when using lld, just like with ld64. Differential Revision: https://reviews.llvm.org/D111852	2021-10-15 13:24:47 -04:00
Michael Liao	bacddf47a8	[amdgpu] Fix a crash case when preserving MDT in SILowerControlFlow - When a redundant MBB is being erased from MDT, check whether its single successor is dominiated by it. If yes, update that successor's idom before erasing MBB; otherwise, it implies MBB is a leaf node and could be erased directly. Reviewed By: foad Differential Revision: https://reviews.llvm.org/D111831	2021-10-15 13:21:53 -04:00
Vitaly Buka	e0f3a3b228	[ubsan] Remove REQUIRED from some TestCases It's not obvious why they are needed, and tests pass. Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D111859	2021-10-15 10:20:34 -07:00
Arthur Eubanks	47eb99aa44	[clang] Pass -clear-ast-before-backend in Clang::ConstructJob() This clears the memory used for the Clang AST before we run LLVM passes. https://llvm-compile-time-tracker.com/compare.php?from=d0a5f61c4f6fccec87fd5207e3fcd9502dd59854&to=b7437fee79e04464dd968e1a29185495f3590481&stat=max-rss shows significant memory savings with no slowdown (in fact -O0 slightly speeds up). For more background, see https://lists.llvm.org/pipermail/cfe-dev/2021-September/068930.html. Turn this off for the interpreter since it does codegen multiple times. Differential Revision: https://reviews.llvm.org/D111270	2021-10-15 10:13:17 -07:00
Jonas Paulsson	ccbfcfda1e	[SystemZ] Handle huge immediates in SystemZInstrInfo::loadImmediate(). This is needed during isel pseudo expansion in order not to crash on huge immediates. Review: Ulrich Weigand	2021-10-15 19:08:45 +02:00
Kazu Hirata	6a154e606e	[clang] Use llvm::is_contained (NFC)	2021-10-15 10:07:08 -07:00
Jessica Paquette	59b94c4a60	NFC: Remove wayward MIR tests from lib/Target These were put in lib/Target instead of tests. Thankfully dupes of them already existed in the tests directory. So, just delete them.	2021-10-15 09:59:00 -07:00
Raphael Isemann	ff4c98c055	[lldb] Harden TestCompletion against new settings in 'target.process' This test starts failing when people add a setting starting with `target.process.t` which of course can easily happen. Make it a bit more resistant by only requiring that `target.process.thr` has a unique completion.	2021-10-15 18:50:21 +02:00

1 2 3 4 5 ...

402092 Commits All Branches Search

402092 Commits

All Branches