llvm-project

Commit Graph

Author	SHA1	Message	Date
Kerry McLaughlin	f7185b271f	[SVE][CodeGen] Lower floating point -> integer conversions This patch adds new ISD nodes, FCVTZS_MERGE_PASSTHRU & FCVTZU_MERGE_PASSTHRU, which are used to lower scalable vector FP_TO_SINT/FP_TO_UINT operations and the following intrinsics: - llvm.aarch64.sve.fcvtzu - llvm.aarch64.sve.fcvtzs Reviewed By: efriedma, paulwalker-arm Differential Revision: https://reviews.llvm.org/D87232	2020-09-17 14:04:22 +01:00
Georgii Rymar	279943edf8	[obj2yaml] - Don't emit EM_NONE. When ELF header's `e_machine == 0`, we emit: ``` Machine: EM_NONE ``` We can avoid doing this, because yaml2obj sets the `e_machine` field to `EM_NONE` by default. Differential revision: https://reviews.llvm.org/D87829	2020-09-17 15:58:44 +03:00
Georgii Rymar	0dca1ac617	[llvm-readelf/obj][test] - Document what we print in various places for unnamed section symbols. We have an issue with `ELFDumper<ELFT>::getSymbolSectionName`: 1) It is used deeply for both LLVM/GNU styles and might return LLVM-style only values to describe symbols: "Undefined", "Processor Specific", "Absolute", etc. 2) `getSymbolSectionName` is used by `getFullSymbolName` and these special values might appear instead of symbol names in many places. This occurs for unnamed section symbols. It was not noticed because for most cases I've found it is unexpected to have an unnamed section symbol. This patch documents the existing behavior, adds tests and FIXMEs. Differential revision: https://reviews.llvm.org/D87763	2020-09-17 15:56:51 +03:00
Sanjay Patel	03783f19dc	[SLP] sort candidates to increase chance of optimal compare reduction This is one (small) part of improving PR41312: https://llvm.org/PR41312 As shown there and in the smaller tests here, if we have some member of the reduction values that does not match the others, we want to push it to the end (bring the matching members forward and together). In the regression tests, we have 5 candidates for the 4 slots of the reduction. If the one "wrong" compare is grouped with the others, it prevents forming the ideal v4i1 compare reduction. Differential Revision: https://reviews.llvm.org/D87772	2020-09-17 08:49:27 -04:00
Jessica Clarke	788c7d2ec1	[clang][docs] Fix documentation of -O D79916 changed the behaviour from -O2 to -O1 but the documentation was not updated to reflect this.	2020-09-17 13:44:01 +01:00
Simon Pilgrim	aa896a0b3a	Remove unnecessary forward declarations. NFCI. All of these forward declarations are fully defined in headers that are directly included.	2020-09-17 13:31:52 +01:00
Mikael Holmen	bb037c2a76	[ConstraintSystem] Remove local variable that is set but not read [NFC] gcc 7.4 warns about it.	2020-09-17 14:26:48 +02:00
mydeveloperday	40e771c1c0	[clang-format][regression][PR47461] ifdef causes catch to be seen as a function https://bugs.llvm.org/show_bug.cgi?id=47461 The following change {D80940} caused a regression in code which ifdef's around the try and catch block cause incorrect brace placement around the catch ``` try { } catch (...) { // This is not a small function bar = 1; } } ``` The brace after the catch will be placed on a newline Reviewed By: curdeius Differential Revision: https://reviews.llvm.org/D87291	2020-09-17 13:23:06 +01:00
Simon Pilgrim	abe0d8551d	MetadataLoader.cpp - remove unnecessary StringRef include. NFCI. Already included in MetadataLoader.h	2020-09-17 13:18:54 +01:00
Simon Pilgrim	ed53ff4cde	SymbolizableObjectFile.h - remove unnecessary includes. NFCI. Use forward declarations where possible, move includes down to SymbolizableObjectFile.cpp and avoid duplicate includes.	2020-09-17 13:18:53 +01:00
Sam Parker	97a476eb56	[NFC][ARM] Tail fold test changes Run update script on one test and add another.	2020-09-17 13:09:10 +01:00
David Spickett	c65627a1fe	Revert "[lldb] Don't send invalid region addresses to lldb server" This reverts commit `c687af0c30` due to a test failure on Windows.	2020-09-17 13:07:44 +01:00
David Green	fece1489d1	[ARM] Additional tests for qr intrinsics in loops. NFC	2020-09-17 12:39:21 +01:00
Simon Pilgrim	572e542c5e	DwarfStringPool.cpp - remove unnecessary StringRef include. NFCI. Already included in DwarfStringPool.h	2020-09-17 12:18:27 +01:00
Simon Pilgrim	71f237506b	DwarfFile.h - remove unnecessary includes. NFCI. Use forward declarations where possible, move includes down to DwarfFile.cpp and avoid duplicate includes.	2020-09-17 12:12:18 +01:00
David Green	a615226743	[ARM] Extra fp16 bitcast tests. NFC	2020-09-17 12:10:23 +01:00
Alex Zinenko	68cfb02668	[mlir] turn clang-format back on in C API test C API test uses FileCheck comments inside C code and needs to temporarily switch off clang-format to prevent it from messing with FileCheck directives. A recently landed commit forgot to turn it back on after a block of FileCheck comments. Fix that.	2020-09-17 12:59:57 +02:00
Nico Weber	504697e6f4	[gn build] (manually) port `c9af34027b`	2020-09-17 06:33:24 -04:00
Vincent Zhao	f108e71437	[MLIR] Turns swapId into a FlatAffineConstraints member func `swapId` used to be a static function in `AffineStructures.cpp`. This diff makes it accessible from the external world by turning it into a member function of `FlatAffineConstraints`. This will be very helpful for other projects that need to manipulate the content of `FlatAffineConstraints`. Differential Revision: https://reviews.llvm.org/D87766	2020-09-17 11:22:10 +01:00
Simon Pilgrim	550b1a6fd4	[AsmPrinter] DwarfDebug - use DebugLoc const references where possible. NFC. Avoid unnecessary copies.	2020-09-17 10:45:54 +01:00
Simon Pilgrim	8adf92e2d1	[AMDGPU] Remove orphan SITargetLowering::LowerINT_TO_FP declaration. NFCI. Method implementation no longer exists.	2020-09-17 10:45:53 +01:00
Simon Pilgrim	4ae1bb193a	[AsmPrinter] Remove orphan DwarfUnit::shareAcrossDWOCUs declaration. NFCI. Method implementation no longer exists.	2020-09-17 10:45:52 +01:00
Jakub Lichman	347d59b16c	[mlir][Linalg] Convolution tiling added to ConvOp vectorization pass ConvOp vectorization supports now only convolutions of static shapes with dimensions of size either 3(vectorized) or 1(not) as underlying vectors have to be of static shape as well. In this commit we add support for convolutions of any size as well as dynamic shapes by leveraging existing matmul infrastructure for tiling of both input and kernel to sizes accepted by the previous version of ConvOp vectorization. In the future this pass can be extended to take "tiling mask" as a user input which will enable vectorization of user specified dimensions. Differential Revision: https://reviews.llvm.org/D87676	2020-09-17 09:39:41 +00:00
Cullen Rhodes	9218f92838	[clang][aarch64] ACLE: Support implicit casts between GNU and SVE vectors This patch adds support for implicit casting between GNU vectors and SVE vectors when `__ARM_FEATURE_SVE_BITS==N`, as defined by the Arm C Language Extensions (ACLE, version 00bet5, section 3.7.3.3) for SVE [1]. This behavior makes it possible to use GNU vectors with ACLE functions that operate on VLAT. For example: typedef int8_t vec __attribute__((vector_size(32))); vec f(vec x) { return svasrd_x(svptrue_b8(), x, 1); } Tests are also added for implicit casting between GNU and fixed-length SVE vectors created by the 'arm_sve_vector_bits' attribute. This behavior makes it possible to use VLST with existing interfaces that operate on GNUT. For example: typedef int8_t vec1 __attribute__((vector_size(32))); void f(vec1); #if __ARM_FEATURE_SVE_BITS==256 && __ARM_FEATURE_SVE_VECTOR_OPERATORS typedef svint8_t vec2 __attribute__((arm_sve_vector_bits(256))); void g(vec2 x) { f(x); } // OK #endif The `__ARM_FEATURE_SVE_VECTOR_OPERATORS` feature macro indicates interoperability with the GNU vector extension. This is the first patch providing support for this feature, which once complete will be enabled by the `-msve-vector-bits` flag, as the `__ARM_FEATURE_SVE_BITS` feature currently is. [1] https://developer.arm.com/documentation/100987/latest Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D87607	2020-09-17 09:35:30 +00:00
David Spickett	c687af0c30	[lldb] Don't send invalid region addresses to lldb server Previously when <addr> in "memory region <addr>" didn't parse correctly, we'd print an error then also ask lldb-server for a region containing LLDB_INVALID_ADDRESS. (lldb) memory region not_an_address error: invalid address argument "not_an_address"... error: Server returned invalid range Only send the command to lldb-server if the address parsed correctly. (lldb) memory region not_an_address error: invalid address argument "not_an_address"... Reviewed By: labath Differential Revision: https://reviews.llvm.org/D87694	2020-09-17 10:26:16 +01:00
Rainer Orth	a9cbe5cf30	[X86] Fix stack alignment on 32-bit Solaris/x86 On Solaris/x86, several hundred 32-bit tests `FAIL`, all in the same way: env ASAN_OPTIONS=halt_on_error=false ./halt_on_error_suppress_equal_pcs.cpp.tmp Segmentation Fault (core dumped) They segfault during startup: Thread 2 received signal SIGSEGV, Segmentation fault. [Switching to Thread 1 (LWP 1)] 0x080f21f0 in __sanitizer::internal_mmap(void*, unsigned long, int, int, int, unsigned long long) () at /vol/llvm/src/llvm-project/dist/compiler-rt/lib/sanitizer_common/sanitizer_solaris.cpp:65 65 int prot, int flags, int fd, OFF_T offset) { 1: x/i $pc => 0x80f21f0 <_ZN11__sanitizer13internal_mmapEPvmiiiy+16>: movaps 0x30(%esp),%xmm0 (gdb) p/x $esp $3 = 0xfeffd488 The problem is that `movaps` expects 16-byte alignment, while 32-bit Solaris/x86 only guarantees 4-byte alignment following the i386 psABI. This patch updates `X86Subtarget::initSubtargetFeatures` accordingly, handles Solaris/x86 in the corresponding testcase, and allows for some variation in address alignment in `compiler-rt/test/ubsan/TestCases/TypeCheck/vptr.cpp`. Tested on `amd64-pc-solaris2.11` and `x86_64-pc-linux-gnu`. Differential Revision: https://reviews.llvm.org/D87615	2020-09-17 11:17:11 +02:00
Douglas Yung	b03c2b8395	Revert "Re-land: Add new hidden option -print-changed which only reports changes to IR" The test added in this commit is failing on Windows bots: http://lab.llvm.org:8011/builders/llvm-clang-win-x-armv7l/builds/1269 This reverts commit `f9e6d1edc0` and follow-up commit `6859d95ea2`.	2020-09-17 01:32:29 -07:00
Roman Lebedev	aadf55d1ce	[NFC] EliminateDuplicatePHINodes(): small-size optimization: if there are <= 32 PHI's, O(n^2) algo is faster (geomean -0.08%) This is functionally equivalent to the old implementation. As per https://llvm-compile-time-tracker.com/compare.php?from=5f4e9bf6416e45eba483a4e5e263749989fdb3b3&to=4739e6e4eb54d3736e6457249c0919b30f6c855a&stat=instructions this is a clear geomean compile-time regression-free win with overall geomean of `-0.08%` 32 PHI's appears to be the sweet spot; both the 16 and 64 performed worse: https://llvm-compile-time-tracker.com/compare.php?from=5f4e9bf6416e45eba483a4e5e263749989fdb3b3&to=c4efe1fbbfdf0305ac26cd19eacb0c7774cdf60e&stat=instructions https://llvm-compile-time-tracker.com/compare.php?from=5f4e9bf6416e45eba483a4e5e263749989fdb3b3&to=e4989d1c67010d3339d1a40ff5286a31f10cfe82&stat=instructions If we have more PHI's than that, we fall-back to the original DenseSet-based implementation, so the not-so-fast cases will still be handled. However compile-time isn't the main motivation here. I can name at least 3 limitations of this CSE: 1. Assumes that all PHI nodes have incoming basic blocks in the same order (can be fixed while keeping the DenseMap) 2. Does not special-handle `undef` incoming values (i don't see how we can do this with hashing) 3. Does not special-handle backedge incoming values (maybe can be fixed by hashing backedge as some magical value) Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D87408	2020-09-17 11:29:03 +03:00
Jay Foad	6f6d389da5	[SplitKit] Only copy live lanes When splitting a live interval with subranges, only insert copies for the lanes that are live at the point of the split. This avoids some unnecessary copies and fixes a problem where copying dead lanes was generating MIR that failed verification. The test case for this is test/CodeGen/AMDGPU/splitkit-copy-live-lanes.mir. Without this fix, some earlier live range splitting would create %430: %430 [256r,848r:0)[848r,2584r:1) 0@256r 1@848r L0000000000000003 [848r,2584r:0) 0@848r L0000000000000030 [256r,2584r:0) 0@256r weight:1.480938e-03 ... 256B undef %430.sub2:vreg_128 = V_LSHRREV_B32_e32 16, %20.sub1:vreg_128, implicit $exec ... 848B %430.sub0:vreg_128 = V_AND_B32_e32 %92:sreg_32, %20.sub1:vreg_128, implicit $exec ... 2584B %431:vreg_128 = COPY %430:vreg_128 Then RAGreedy::tryLocalSplit would split %430 into %432 and %433 just before 848B giving: %432 [256r,844r:0) 0@256r L0000000000000030 [256r,844r:0) 0@256r weight:3.066802e-03 %433 [844r,848r:0)[848r,2584r:1) 0@844r 1@848r L0000000000000030 [844r,2584r:0) 0@844r L0000000000000003 [844r,844d:0)[848r,2584r:1) 0@844r 1@848r weight:2.831776e-03 ... 256B undef %432.sub2:vreg_128 = V_LSHRREV_B32_e32 16, %20.sub1:vreg_128, implicit $exec ... 844B undef %433.sub0:vreg_128 = COPY %432.sub0:vreg_128 { internal %433.sub2:vreg_128 = COPY %432.sub2:vreg_128 848B } %433.sub0:vreg_128 = V_AND_B32_e32 %92:sreg_32, %20.sub1:vreg_128, implicit $exec ... 2584B %431:vreg_128 = COPY %433:vreg_128 Note that the copy from %432 to %433 at 844B is a curious bundle-without-a-BUNDLE-instruction that SplitKit creates deliberately, and it includes a copy of .sub0 which is not live at this point, and that causes it to fail verification: * Bad machine code: No live subrange at use * - function: zextload_global_v64i16_to_v64i64 - basic block: %bb.0 (0x7faed48) [0B;2848B) - instruction: 844B undef %433.sub0:vreg_128 = COPY %432.sub0:vreg_128 - operand 1: %432.sub0:vreg_128 - interval: %432 [256r,844r:0) 0@256r L0000000000000030 [256r,844r:0) 0@256r weight:3.066802e-03 - at: 844B Using real bundles with a BUNDLE instruction might also fix this problem, but the current fix is less invasive and also avoids some unnecessary copies. https://bugs.llvm.org/show_bug.cgi?id=47492 Differential Revision: https://reviews.llvm.org/D87757	2020-09-17 09:26:11 +01:00
Jay Foad	d49707cf4b	[AMDGPU] Generate test checks for splitkit-copy-bundle.mir This is a pre-commit for D87757 "[SplitKit] Only copy live lanes".	2020-09-17 09:26:09 +01:00
Sjoerd Meijer	6637d72ddd	[Lint] Add check for intrinsic get.active.lane.mask As @efriedma pointed out in D86301, this "not equal to 0 check" of get.active.lane.mask's second operand needs to live here in Lint and not the Verifier. Differential Revision: https://reviews.llvm.org/D87228	2020-09-17 09:22:03 +01:00
Qiu Chaofan	a2fb5446be	[SelectionDAG] Check any use of negation result before removal `2508ef01` fixed a bug in constant removal during negation, but after a sanitizing check I found there was still an issue with it, so it was reverted. Temporary nodes are removed if they are useless in negation. Before the removal, they are checked for uses by any other nodes, so the removal was moved after getNode. However, in rare cases the node to be removed is the same as the result of getNode. That case was missed and is fixed by this patch. Reviewed By: steven.zhang Differential Revision: https://reviews.llvm.org/D87614	2020-09-17 16:00:54 +08:00
Tres Popp	b05629230e	[mlir] Remove redundant shape.cstr_broadcastable canonicalization. These canonicalizations are already handled by folding which will occur in a superset of situations, so they are being removed. Differential Revision: https://reviews.llvm.org/D87706	2020-09-17 09:01:13 +02:00
Fangrui Song	c16417f65f	[llvm-cov gcov] Add --demangled-names (-m) gcov 4.9 introduced the option.	2020-09-16 23:18:50 -07:00
Artur Bialas	4ce84b0e70	[mlir][spirv] Add GroupNonUniformBroadcastOp Added GroupNonUniformBroadcastOp to spirv dialect. Differential Revision: https://reviews.llvm.org/D87688	2020-09-16 23:13:06 -07:00
Igor Kudrin	027d47d1c7	[DebugInfo] Simplify DIEInteger::SizeOf(). An AsmPrinter should always be provided to the method because some forms depend on its parameters. The only place in the codebase which passed a nullptr value was found in the unit tests, so the patch updates it to use some dummy AsmPrinter instead. Differential Revision: https://reviews.llvm.org/D85293	2020-09-17 12:47:38 +07:00
Fangrui Song	e69092be52	[llvm-cov gcov][test] Move tests to gcov/ And rename llvm-cov.test (misnomer) to basic.test	2020-09-16 22:42:49 -07:00
Craig Topper	c9af34027b	Add __divmodti4 to match libgcc. gcc has used this on x86-64 since at least version 7. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D80506	2020-09-16 21:56:01 -07:00
Jonas Devlieghere	57dd92746a	[lldb] Return FileSP and StreamFileSP by value in IOHandler (NFC) Smart pointers should be returned by value.	2020-09-16 21:15:05 -07:00
Jianzhou Zhao	aec80c5cfd	Fix the arguments of std::min fixing `11201315d5`	2020-09-17 04:03:31 +00:00
Jianzhou Zhao	352a55ef06	Add the header of std::min fixing `11201315d5`	2020-09-17 03:48:36 +00:00
Jianzhou Zhao	11201315d5	Flush bitcode incrementally for LTO output By default, the bitcode writer does not flush its buffer until the end. This is fine for small bitcode files. When -flto,--plugin-opt=emit-llvm,-gmlt are used, the final bitcode file is large, for example, >8G. Keeping all data in memory consumes a lot of memory. This change allows the bitcode writer to flush data to disk early when the buffered data size is above some threshold. This is only enabled when lld emits LLVM bitcode. One issue to address is backpatching bitcode: subblock lengths, function body indexes and metadata indexes need to be backfilled. Because the buffer can be flushed partially, we introduce raw_fd_stream, which supports read/seek/write and enables backpatching bitcode already flushed to disk. Reviewed-by: tejohnson, MaskRay Differential Revision: https://reviews.llvm.org/D86905	2020-09-17 03:32:31 +00:00
LLVM GN Syncbot	0dd4d70ec2	[gn build] Port `a895040eb0`	2020-09-17 03:02:00 +00:00
Stella Stamenova	a895040eb0	Revert "[IRSim] Adding IR Instruction Mapper" This reverts commit `b04c1a9d31`.	2020-09-16 20:00:43 -07:00
David Blaikie	6a07f1edf8	debug_rnglists/symbolizing: reduce memory usage by not caching rnglists This matches the debug_ranges behavior - though is currently implemented differently. (the debug_ranges parsing was handled by creating a new ranges parser during DIE address querying, and just destroying it after the query - whereas the rnglists parser is a member of the DWARFUnit currently - so the API doesn't cache anymore) I think this could/should be improved by not parsing debug_rnglists headers at all when dumping debug_info or symbolizing - do it the way DWARF (roughly) intended: take the rnglists_base, add addr*index to it, read the offset, parse the list at rnglists_base+offset. This would have no error checking for valid index (because the number of valid indexes is stored in the header, which has a negative offset from rnglists_base - and is sort of only intended for use by dumpers, not by parsers going from debug_info to a rnglist) or out of contribution bounds access (since it wouldn't know the length of the contribution, also in the header) - nor any error-checking that the rnglist contribution was using the same properties as the debug_info (version, DWARF32/64, address size, etc).	2020-09-16 19:36:07 -07:00
Eric Christopher	c140322819	Use zu rather than llu format specifier for size_t (-Wformat warning fix).	2020-09-16 19:28:05 -07:00
Qiu Chaofan	ebfbdebe96	[PowerPC] Fix store-fptoi combine of f128 on Power8 llc would crash for (store (fptosi-f128-i32)) when -mcpu=pwr8, we should not generate FP_TO_(S\|U)INT_IN_VSR for f128 types at this time. This patch fixes it. Reviewed By: steven.zhang Differential Revision: https://reviews.llvm.org/D86686	2020-09-17 10:21:35 +08:00
Chen Zheng	5782ab0f52	[MachineSink] add one more mir case - nfc	2020-09-16 22:03:06 -04:00
Ryan Prichard	fb1abe0063	[libunwind][DWARF] Fix end of .eh_frame calculation * When .eh_frame is located using .eh_frame_hdr (PT_GNU_EH_FRAME), the start of .eh_frame is known, but not the size. In this case, the unwinder must rely on a terminator present at the end of .eh_frame. Set dwarf_section_length to UINTPTR_MAX to indicate this. * Add a new field, text_segment_length, that the FrameHeaderCache uses to track the size of the PT_LOAD segment indicated by dso_base. * Compute ehSectionEnd by adding sectionLength to ehSectionStart, never to fdeHint. Fixes PR46829. Differential Revision: https://reviews.llvm.org/D87750	2020-09-16 19:00:57 -07:00
LLVM GN Syncbot	436a43afb2	[gn build] Port `b04c1a9d31`	2020-09-17 01:54:10 +00:00
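
The small-size strategy described in commit `aadf55d1ce` (pairwise comparison for up to 32 PHI nodes, a hash set beyond that) can be illustrated with a minimal standalone sketch. This is not LLVM's implementation: `Phi`, `PhiHash`, `PhiEq` and `findDuplicatePhis` are hypothetical names, and a PHI node is modelled only as an ordered list of (incoming block id, incoming value id) pairs. Like the CSE discussed in the commit message, the sketch assumes all PHIs list their incoming blocks in the same order.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <unordered_set>
#include <utility>
#include <vector>

// Hypothetical stand-in for a PHI node: ordered (block id, value id) pairs.
struct Phi {
    std::vector<std::pair<int, int>> incoming;
};

// Hash over the incoming pairs, used only on the large-input path.
struct PhiHash {
    std::size_t operator()(const Phi *p) const {
        std::size_t h = 0;
        for (const auto &in : p->incoming) {
            h = h * 31 + std::hash<int>{}(in.first);
            h = h * 31 + std::hash<int>{}(in.second);
        }
        return h;
    }
};

// Two PHIs are duplicates when their incoming lists match element-wise.
struct PhiEq {
    bool operator()(const Phi *a, const Phi *b) const {
        return a->incoming == b->incoming;
    }
};

// Returns the indices of PHIs that duplicate an earlier PHI in the list.
// smallLimit = 32 mirrors the threshold named in the commit message.
std::vector<std::size_t> findDuplicatePhis(const std::vector<Phi> &phis,
                                           std::size_t smallLimit = 32) {
    std::vector<std::size_t> dups;
    if (phis.size() <= smallLimit) {
        // Small input: O(n^2) pairwise scan, no hashing or allocation.
        for (std::size_t i = 0; i < phis.size(); ++i) {
            for (std::size_t j = 0; j < i; ++j) {
                if (phis[j].incoming == phis[i].incoming) {
                    dups.push_back(i);
                    break;
                }
            }
        }
    } else {
        // Large input: fall back to a hash set keyed on the incoming list.
        std::unordered_set<const Phi *, PhiHash, PhiEq> seen;
        for (std::size_t i = 0; i < phis.size(); ++i) {
            if (!seen.insert(&phis[i]).second)
                dups.push_back(i);
        }
    }
    return dups;
}
```

Both paths report the same indices for the same input; only the cost profile differs, which is the trade-off the commit message measures. Passing a `smallLimit` of 0 forces the hashed path, which is convenient for checking that the two paths agree.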

366469 Commits

All Branches