llvm-project

Commit Graph

Author	SHA1	Message	Date
Ivan Kosarev	75950be836	[AMDGPU][NFC] Validate G_MERGE_VALUES as we match zero-extended 32-bit scalars. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D130001	2022-07-21 14:49:57 +01:00
Jez Ng	241f62d8d3	[lld-macho] Fix assertion when two symbols at same addr have unwind info If there are multiple symbols at the same address, our unwind info implementation assumes that we always register unwind entries to a single canonical symbol. This assumption was violated by the `registerEhFrame` code. Fixes #56570. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D130208	2022-07-21 09:44:49 -04:00
Erich Keane	1da3119025	Revert "Rewording the "static_assert" to static assertion" Looks like we again are going to have problems with libcxx tests that are overly specific in their dependency on clang's diagnostics. This reverts commit `6542cb55a3`.	2022-07-21 06:40:14 -07:00
Daniel Bertalan	888d0a5ef2	[lld-macho][NFC] Remove redundant StringRef construction It's only used in one branch, so we were unnecessarily calculating the length of many symbol names. Tiny speedup when linking chromium_framework on my M1 Mac mini: x before.txt + after.txt N Min Max Median Avg Stddev x 10 3.9917109 4.0418 4.0318099 4.0203902 0.021459873 + 10 3.944725 4.053988 3.9708955 3.9825602 0.037257609 Difference at 95.0% confidence -0.03783 +/- 0.0285663 -0.940953% +/- 0.710536% (Student's t, pooled s = 0.0304028) Differential Revision: https://reviews.llvm.org/D130234	2022-07-21 15:36:56 +02:00
Muhammad Usman Shahid	6542cb55a3	Rewording the "static_assert" to static assertion This patch is basically the rewording of the static assert statement's output(error) on screen after failing. Failing a _Static_assert in C should not report that static_assert failed. It’d probably be better to reword the diagnostic to be more like GCC and say “static assertion” failed in both C and C++. consider a c file having code _Static_assert(0, "oh no!"); In clang the output is like: <source>:1:1: error: static_assert failed: oh no! _Static_assert(0, "oh no!"); ^ ~ 1 error generated. Compiler returned: 1 Thus here the "static_assert" is not much good, it will be better to reword it to the "static assertion failed" to more generic. as the gcc prints as: <source>:1:1: error: static assertion failed: "oh no!" 1 \| _Static_assert(0, "oh no!"); \| ^~~~~~~~~~~~~~ Compiler returned: 1 The above can also be seen here. This patch is about rewording the static_assert to static assertion. Differential Revision: https://reviews.llvm.org/D129048	2022-07-21 06:34:14 -07:00
Joseph Huber	bc33c2fa0c	[Binary] Hard-code the alignment of the offloading binary Summary: We previously used `alignof` to get the necessary alignment of the binary header. However this was different on 32-bit platforms and caused a few tests to fail because of it. This patch just changes this to be a hard-coded constant of 8.	2022-07-21 09:28:26 -04:00
Jay Foad	716ca2e3ef	[AMDGPU] Pre-sink IR input for some tests Edit the IR input for some codegen tests to simulate what the IR code sinking pass would do to it. This makes the tests immune to the presence or absence of the code sinking pass in the codegen pass pipeline, which does not belong there. Differential Revision: https://reviews.llvm.org/D130169	2022-07-21 14:25:44 +01:00
Michael Buch	140bcd369b	[LLDB][ClangExpression] Fix initialization of static enum alias members `IntegerLiteral::Create` operates on integer types. For that reason when we parse DWARF into an AST, when we encounter a constant initialized enum member variable, we try to determine the underlying integer type before creating the `IntegerLiteral`. However, we currently don't desugar the type and for enum typedefs `dyn_cast<EnumType>` fails. In debug builds this triggers following assert: ``` Assertion failed: (type->isIntegerType() && "Illegal type in IntegerLiteral"), function IntegerLiteral, file Expr.cpp, line 892 ``` This patch turns the `dyn_cast<EnumType>` into a `getAs<EnumType>` which `dyn_cast`s the canonical type, allowing us to get to the underlying integer type. Testing * API test * Manual repro is fixed Differential Revision: https://reviews.llvm.org/D130213	2022-07-21 14:23:41 +01:00
Michael Buch	6703812688	[LLDB][DataFormatter] Add support for std::__map_const_iterator This patch adds support for formatting `std::map::const_iterator`. It's just a matter of adding `const_` to the existing regex. Testing * Added test case to existing API tests Differential Revision: https://reviews.llvm.org/D129962	2022-07-21 14:21:12 +01:00
Matt Arsenault	5a5439cb73	AMDGPU: Refine user-sgpr-init16-bug It only applies to gfx1100 and gfx1102, and for wave32.	2022-07-21 08:57:00 -04:00
Nikita Popov	1f69503107	[MemoryBuiltins] Add getReallocatedOperand() function (NFC) Replace the value-accepting isReallocLikeFn() overload with a getReallocatedOperand() function, which returns which operand is the one being reallocated. Currently, this is always the first one, but once allockind(realloc) is respected, the reallocated operand will be determined by the allocptr parameter attribute.	2022-07-21 14:54:16 +02:00
Nikita Popov	46e6dd84b7	[MemoryBuiltins] Remove isFreeCall() function (NFC) Remove isFreeCall() in favor of getFreedOperand(). Replace the two remaining uses with a getFreedOperand() != nullptr check, as they only care that something is getting freed. (The usage in DSE is correct as such. The allocator-related checks in CFLGraph look rather questionable in general.)	2022-07-21 14:44:23 +02:00
Nikita Popov	5e856a8578	[InstCombine] Use getFreedOperand() (NFC) Use getFreedOperand() instead of isFreeCall() to remove the implicit assumption that any pointer operand to a free function is the operand being freed. This won't actually matter until we handle allockind(free).	2022-07-21 14:33:55 +02:00
Nikita Popov	3ac8587a2b	[Attributor] Use getFreedOperand() (NFC) Track which operand is actually freed, to avoid the implicit assumption that it is the first call argument.	2022-07-21 14:26:47 +02:00
Thomas Symalla	fd64a857ee	[AMDGPU] Combine s_or_saveexec, s_xor instructions. This patch merges a consecutive sequence of s_or_saveexec s_o, s_i s_xor exec, exec, s_o into a single s_andn2_saveexec s_o, s_i instruction. This patch also cleans up the SIOptimizeExecMasking pass a bit. Reviewed By: nhaehnle Differential Revision: https://reviews.llvm.org/D129073	2022-07-21 14:16:37 +02:00
Haojian Wu	65c8e24622	[pseudo] Fix an invalid assertion on recoveryBrackets. The `Begin` is not the index of the left bracket, `Begin-1` is, otherwise the assertion will be triggered on case `Foo().call();`.	2022-07-21 14:02:11 +02:00
Andrzej Warzynski	ce824078de	Revert "[Flang] Generate documentation for compiler flags" This reverts commit `396e944d82`. Failing bot: https://lab.llvm.org/buildbot/#/builders/89/builds/30096	2022-07-21 11:54:49 +00:00
Dylan Fleming	396e944d82	[Flang] Generate documentation for compiler flags This patch aims to create a webpage to document Flang's command line options on https://flang.llvm.org/docs/ in a similar way to Clang's https://clang.llvm.org/docs/ClangCommandLineReference.html This is done by using clang_tablegen to generate an .rst file from Options.td (which is current shared with Clang) For this to work, ClangOptionDocEmitter.cpp was updated to allow specific Flang flags to be included, rather than bulk excluding clang flags. Reviewed By: awarzynski Differential Revision: https://reviews.llvm.org/D129864	2022-07-21 11:33:19 +00:00
Alexey Lapshin	8bb4451a65	[Reland][DebugInfo][llvm-dwarfutil] Combine overlapped address ranges. DWARF files may contain overlapping address ranges. f.e. it can happen if the two copies of the function have identical instruction sequences and they end up sharing. That looks incorrect from the point of view of DWARF spec. Current implementation of DWARFLinker does not combine overlapped address ranges. It would be good if such ranges would be handled in some useful way. Thus, this patch allows DWARFLinker to combine overlapped ranges in a single one. Depends on D86539 Reviewed By: aprantl Differential Revision: https://reviews.llvm.org/D123469	2022-07-21 14:15:39 +03:00
Matt Devereau	cd3d7bf15d	[AArch64][SVE] Add DAG-Combine to push bitcasts from floating point loads after DUPLANE128 This patch lowers duplane128(insert_subvector(undef, bitcast(op(128bitsubvec)), 0), 0) to bitcast(duplane128(insert_subvector(undef, op(128bitsubvec), 0), 0)). This enables floating-point loads to match patterns added in https://reviews.llvm.org/D130010 Differential Revision: https://reviews.llvm.org/D130013	2022-07-21 11:00:10 +00:00
Matt Devereau	e0fbd990c9	[AArch64][SVE] Add ISel pattern to lower DUPLANE128 to LD1RQD Following on from https://reviews.llvm.org/D128902, lower DUPLANE128 to LD1RQD for integer load types from instruction selection. Differential Revision: https://reviews.llvm.org/D130010	2022-07-21 10:56:43 +00:00
Simon Pilgrim	2feb99b02c	[AArch64] Add i128 parity test AArch64 has custom i128 ctpop handling, so match this in the parity tests Added as part of triaging Issue #56531	2022-07-21 11:46:35 +01:00
Jay Foad	9383b09858	[AMDGPU][GlobalISel] Fix subtarget checks for combining to v_med3_i16 Differential Revision: https://reviews.llvm.org/D130243	2022-07-21 11:41:31 +01:00
Alexey Lapshin	3aad49082c	Revert "[DebugInfo][llvm-dwarfutil] Combine overlapped address ranges." This reverts commit `d2a4d6bf9c`.	2022-07-21 13:40:20 +03:00
Nikita Popov	c81dff3c30	[MemoryBuiltins] Add getFreedOperand() function (NFCI) We currently assume in a number of places that free-like functions free their first argument. This is true for all hardcoded free-like functions, but with the new attribute-based design, the freed argument is supposed to be indicated by the allocptr attribute. To make sure we handle this correctly once allockind(free) is respected, add a getFreedOperand() helper which returns the freed argument, rather than just indicating whether the call frees some argument. This migrates most but not all users of isFreeCall() to the new API. The remaining users are a bit more tricky.	2022-07-21 12:39:35 +02:00
Alexey Lapshin	d2a4d6bf9c	[DebugInfo][llvm-dwarfutil] Combine overlapped address ranges. DWARF files may contain overlapping address ranges. f.e. it can happen if the two copies of the function have identical instruction sequences and they end up sharing. That looks incorrect from the point of view of DWARF spec. Current implementation of DWARFLinker does not combine overlapped address ranges. It would be good if such ranges would be handled in some useful way. Thus, this patch allows DWARFLinker to combine overlapped ranges in a single one. Depends on D86539 Reviewed By: aprantl Differential Revision: https://reviews.llvm.org/D123469	2022-07-21 13:15:18 +03:00
Dmitry Vyukov	b988d8ddc2	tsan: remove unnecessary brackets Reviewed By: melver Differential Revision: https://reviews.llvm.org/D130236	2022-07-21 12:11:44 +02:00
Nikita Popov	8d58c8e57b	Reapply [InstCombine] Don't check for alloc fn before fetching alloc size Reapply the patch with getObjectSize() replaced by getAllocSize(). The former will also look through calls that return their argument, and we'll end up placing dereferenceable attributes on intrinsics like llvm.launder.invariant.group. While this isn't wrong, it also doesn't seem to be particularly useful. For now, use getAllocSize() instead, which sticks closer to the original behavior of this code. ----- This code is just interested in the allocsize, not any other allocator properties.	2022-07-21 11:48:24 +02:00
Nikita Popov	d144ae6e1b	[MemoryBuiltins] Default to trivial mapper in getAllocSize() (NFC) Default getAllocSize() to use the trivial mapper. Also switch from using std::function to function_ref. Furthermore, update the doc comment to point out a subtle difference between getAllocSize() and getObjectSize(): The latter may also return something for calls that return their argument (via "returned" attribute or special intrinsics like invariant groups).	2022-07-21 11:43:48 +02:00
jacquesguan	e60eb7053d	recommit "[DAGCombiner] Teach scalarizeBinOpOfSplats handle scalable splat." With fix for AArch64 and Hexgon test cases.	2022-07-21 17:34:34 +08:00
Nikita Popov	235fb602ed	[MemoryBuiltins] Don't query TLI for non-pointer functions (NFC) Fetching allocation data for calls is a rather hot operation, and TLI lookups are slow. We can greatly reduce the number of calls for which TLI is queried by checking that they return a pointer value first, as this is a requirement for allocation functions anyway.	2022-07-21 11:28:36 +02:00
Chuanqi Xu	ea623af7c9	[C++20] [Modules] Avoid inifinite loop when iterating default args Currently, clang may meet an infinite loop in a very tricky case when it iterates the default args. This patch tries to fix this by adding a `fixed` check.	2022-07-21 17:25:05 +08:00
Andrzej Warzynski	7c49f56956	[flang][nfc] Add missing `REQUIRES: asserts` in tests Tests that use `--mlir-pass-statistics-display=` from MLIR require the following condition to hold: (extracted from LLVM's Statistics.h): ``` #define LLVM_ENABLE_STATS 1 ``` This is normally enforced with `REQUIRES: asserts`. This patch updates relevant Flang tests accordingly. For "Release" builds (with assertions disabled), the affected tests will be failing without this change. Differential Revision: https://reviews.llvm.org/D130185	2022-07-21 09:22:01 +00:00
Ivan Butygin	d4217e6cc8	[mlir][memref] Missing type conversion in memref.reshape llvm lowering Shape can be memref of index type, so memref::LoadOp result need to be converted into llvm type. Differential Revision: https://reviews.llvm.org/D129965	2022-07-21 11:15:35 +02:00
Nikita Popov	70056d04e2	Revert "[InstCombine] Don't check for alloc fn before fetching object size" This reverts commit `c72c22c04d`. This affected an Analysis test that I missed. Reverting for now.	2022-07-21 10:59:12 +02:00
Nikita Popov	c72c22c04d	[InstCombine] Don't check for alloc fn before fetching object size This code is just interested in the allocsize, not any other allocator properties.	2022-07-21 10:45:03 +02:00
Qiu Chaofan	708084ec37	[PowerPC] Support x86 compatible intrinsics on AIX These headers used to be guarded only on PowerPC64 Linux or FreeBSD, but they can also be enabled for AIX OS target since it's big-endian ready. Reviewed By: shchenz Differential Revision: https://reviews.llvm.org/D129461	2022-07-21 16:33:41 +08:00
Chen Zheng	bc5c637376	enable P10 vector builtins test on AIX 64 bit; NFC Verify that P10 vector builtins with type `vector signed __int128` and `vector unsigned __int128` work well on AIX 64 bit.	2022-07-21 04:23:02 -04:00
Iain Sandoe	97af17c5ca	re-land [C++20][Modules] Update handling of implicit inlines [P1779R3] re-land fixes an unwanted interaction with module-map modules, seen in Greendragon testing. This provides updates to [class.mfct]: Pre C++20 [class.mfct]p2: A member function may be defined (8.4) in its class definition, in which case it is an inline member function (7.1.2) Post C++20 [class.mfct]p1: If a member function is attached to the global module and is defined in its class definition, it is inline. and [class.friend]: Pre-C++20 [class.friend]p5 A function can be defined in a friend declaration of a class . . . . Such a function is implicitly inline. Post C++20 [class.friend]p7 Such a function is implicitly an inline function if it is attached to the global module. We add the output of implicit-inline to the TextNodeDumper, and amend a couple of existing tests to account for this, plus add tests for the cases covered above. Differential Revision: https://reviews.llvm.org/D129045	2022-07-21 09:17:01 +01:00
lorenzo chelini	2ed7c3fd84	[MLIR][SCF] Enable better bufferization for `TileConsumerAndFuseProducersUsingSCFForOp` Replace iterators of the outermost loop with region arguments of the innermost one. The changes avoid later `bufferization` passes to insert allocation within the body of the innermost loop. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D130083	2022-07-21 10:14:26 +02:00
Haojian Wu	2955192df8	[pseudo] Make sure we rebuild pseudo_gen tool.	2022-07-21 10:09:21 +02:00
Daniel Bertalan	54e18b2397	[lld-macho] Optimize rebase opcode generation This commit reduces the size of the emitted rebase sections by generating the REBASE_OPCODE_DO_REBASE_ADD_ADDR_ULEB and REBASE_OPCODE_DO_REBASE_ULEB_TIMES_SKIPPING_ULEB opcodes. With this change, chromium_framework's rebase section is a 40% smaller 197 kilobytes, down from the previous 320 kB. That is 6 kB smaller than what ld64 produces for the same input. Performance figures from my M1 Mac mini: x before + after N Min Max Median Avg Stddev x 10 4.2269349 4.3300061 4.2689675 4.2690016 0.031151669 + 10 4.219331 4.2914009 4.2398136 4.2448277 0.023817308 No difference proven at 95.0% confidence Differential Revision: https://reviews.llvm.org/D130180	2022-07-21 10:00:39 +02:00
Zi Xuan Wu (Zeson)	08db089124	[CSKY] Fix the testcase error due to the verifyInstructionPredicates - Test cases for arch only has 16-bit instruction such as ck801/ck802 need compile with -mattr=+btst16 - Fix the GPR copy instruction with MOV16 for 16-bit only arch.	2022-07-21 15:53:50 +08:00
Chen Zheng	ecdeabef38	enable P10 vector builtins test on AIX 64 bit; NFC Verify that P10 vector builtins with type `vector signed __int128` and `vector unsigned __int128` work well on AIX 64 bit.	2022-07-21 03:51:30 -04:00
lorenzo chelini	7f1c03171d	Revert "[RFC][MLIR][SCF] Enable better bufferization for `TileConsumerAndFuseProducersUsingSCFForOp`" This reverts commit `9e65850305`.	2022-07-21 09:40:30 +02:00
Nikita Popov	f45ab43332	[MemoryBuiltins] Avoid isAllocationFn() call before checking removable alloc Alloc directly checking whether a given call is a removable allocation, instead of first checking whether it is an allocation first.	2022-07-21 09:39:19 +02:00
Rainer Orth	3776db9a4f	[sanitizer_common] Support Solaris < 11.4 in GetStaticTlsBoundary This patch, on top of D120048 <https://reviews.llvm.org/D120048>, supports GetTls on Solaris 11.3 and Illumos that lack `dlpi_tls_modid`. It's the same method originally used in D91605 <https://reviews.llvm.org/D91605>, but integrated into `GetStaticTlsBoundary`. Tested on `amd64-pc-solaris2.11`, `sparcv9-sun-solaris2.11`, and `x86_64-pc-linux-gnu`. Differential Revision: https://reviews.llvm.org/D120059	2022-07-21 09:18:10 +02:00
David Green	23d6186be0	[SelectionDAG] Fix fptoi.sat scalable vector lowering Vector fptosi_sat and fptoui_sat were being expanded by unrolling the vector operation. This doesn't work for scalable vector, so this patch adds a call to TLI.expandFP_TO_INT_SAT if the vector is scalable. Scalable tests are added for AArch64 and RISCV. Some of the AArch64 fptoi_sat operations should be legal, but that will be handled in another patch. Differential Revision: https://reviews.llvm.org/D130028	2022-07-21 08:00:22 +01:00
lorenzo chelini	9e65850305	[RFC][MLIR][SCF] Enable better bufferization for `TileConsumerAndFuseProducersUsingSCFForOp` Replace iterators of the outermost loop with region arguments of the innermost one. The changes avoid later `bufferization` passes to insert allocation within the body of the innermost loop. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D130083	2022-07-21 08:56:50 +02:00
Luo, Yuanke	cc72af4e13	[X86] Add test case for shuffle	2022-07-21 14:42:03 +08:00

1 2 3 4 5 ...

430548 Commits All Branches Search

430548 Commits

All Branches