llvm-project

Commit Graph

Author	SHA1	Message	Date
Florian Hahn	bc41653a1f	[ThreadPool] Use auto again for future with ENABLE_THREADS=Off. This fixes a build failure with LLVM_ENABLE_THREADS=Off.	2021-11-25 20:18:33 +00:00
Michal Terepeta	cc311a155a	[mlir][Vector] Support 0-D vectors in `VectorPrintOpConversion` Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D114549	2021-11-25 20:12:18 +00:00
Jake Egan	0796869e4e	[AIX] Disable unsupported offloading gpu tests GPUs are not supported on AIX, so this patch sets these tests as unsupported. Reviewed By: stevewan Differential Revision: https://reviews.llvm.org/D114381	2021-11-25 15:10:51 -05:00
Florian Hahn	8cb1af73c6	Recommit [ThreadPool] Support returning futures with results. This reverts commit `71a7c55f0f`. The revert broken building llvm-reduce and it is not clear it fixes an issue with LLVM_ENABLE_THREADS=OFF. See discussion in https://reviews.llvm.org/D114183 for more details.	2021-11-25 20:07:53 +00:00
Jesses Gott	813d486cbc	[clang-format] Extend AllowShortBlocksOnASingleLine for else blocks Extend AllowShortBlocksOnASingleLine for else blocks. See https://bugs.llvm.org/show_bug.cgi?id=49722 Reviewed By: HazardyKnusperkeks, owenpan, MyDeveloperDay Differential Revision: https://reviews.llvm.org/D114320	2021-11-25 19:45:07 +00:00
Quinn Pham	a712b661eb	[NFC][llvm] Inclusive language: replace master in llvm docs [NFC] As part of using inclusive language within the llvm project, this patch removes instances of master in these files. Reviewed By: ZarkoCA Differential Revision: https://reviews.llvm.org/D114187	2021-11-25 13:36:51 -06:00
mydeveloperday	3c8666ef9a	[clang-format] NFC update LLVM overall clang-formatted status Whilst the % clang-formatted remains the same, the number of files added to the LLVM project has risen by almost by 259. - 190 of them have been added clang-format clean. - 69 files have been added unformatted. (lit tests should be excluded from this number) - 291 files have been added to the list of files that are clang-format clean - 101 files have either become unclean or have been removed As this updates the clang-formatted-files there are now 8139 files that are clean which we can be used as a regression test when making changes to clang-format. ``` clang-format -verbose -n -files ./clang/docs/tools/clang-formatted-files.txt ```	2021-11-25 19:35:45 +00:00
Quinn Pham	5c162ec545	[NFC][compiler-rt] Inclusive language: replace master/slave with primary/secondary [NFC] As part of using inclusive language within the llvm project, this patch replaces master and slave with primary and secondary respectively in `sanitizer_mac.cpp`. Reviewed By: ZarkoCA Differential Revision: https://reviews.llvm.org/D114255	2021-11-25 13:30:56 -06:00
Uday Bondhugula	c89fc1eec3	[MLIR] NFC. Rename MLIR CAPI ExecutionEngine target for consistency Rename MLIR CAPI ExecutionEngine target for consistency: MLIRCEXECUTIONENGINE -> MLIRCAPIExecutionEngine in line with other targets. Differential Revision: https://reviews.llvm.org/D114596	2021-11-26 00:23:17 +05:30
Amy Kwan	150681f2f3	[PowerPC] Prevent the optimizer from producing wide vector types in IR. This patch prevents the optimizer from producing wide vectors in the IR, specifically the MMA types (v256i1, v512i1). The idea is that on Power, we only want to be producing these types only if the vector_pair and vector_quad types are used specifically. To prevent the optimizer from producing these types in the IR, vectorCostAdjustmentFactor() is updated to return an instruction cost factor or an invalid instruction cost if the current type is that of an MMA type. An invalid instruction cost returned by this function signifies to other cost computing functions to return the maximum instruction cost to inform the optimizer that producing these types within the IR is expensive, and should not be produced in the first place. This issue was first seen in the test case included within this patch. Differential Revision: https://reviews.llvm.org/D113900	2021-11-25 12:35:26 -06:00
Quinn Pham	34303d3db7	[NFC][llvm] Inclusive language: replace master with main in dbg-call-site-spilled-arg.mir [NFC] As part of using inclusive language within the llvm project, this patch replaces master with main in `dbg-call-site-spilled-arg.mir`. Reviewed By: ZarkoCA Differential Revision: https://reviews.llvm.org/D114097	2021-11-25 12:34:39 -06:00
Pavel Kosov	1aab5e653d	[LLDB] Provide target specific directories to libclang On Linux some C++ and C include files reside in target specific directories, like /usr/include/x86_64-linux-gnu. Patch adds them to libclang, so LLDB jitter has more chances to compile expression. OS Laboratory. Huawei Russian Research Institute. Saint-Petersburg Reviewed By: teemperor Differential Revision: https://reviews.llvm.org/D110827	2021-11-25 21:27:02 +03:00
Louis Dionne	151a7dafd3	[libc++] Fix ssize test that made an assumption about ptrdiff_t being 'long' On some platforms like armv7m, the size() method of containers returns unsigned long, while ptrdiff_t is just int. Hence, std::ssize_t ends up being long, which is not the same as ptrdiff_t. This is usually not an issue because std::ptrdiff_t is long, so everything works out, but it breaks on some more exotic architectures. Differential Revision: https://reviews.llvm.org/D114563	2021-11-25 13:12:47 -05:00
Lei Huang	1db1cb028d	[CMake] Add new cmake option to control adding comments in GenDAGISel Add new cmake option `LLVM_OMIT_DAGISEL_COMMENTS` to control adding of comments in GenDAGISel for none debug builds Ref: https://reviews.llvm.org/D78884 Reviewed By: nemanjai, MaskRay, #powerpc Differential Revision: https://reviews.llvm.org/D114122	2021-11-25 12:11:35 -06:00
Dmitry Vyukov	66d4ce7e26	tsan: new runtime (v3) This change switches tsan to the new runtime which features: - 2x smaller shadow memory (2x of app memory) - faster fully vectorized race detection - small fixed-size vector clocks (512b) - fast vectorized vector clock operations - unlimited number of alive threads/goroutimes Depends on D112602. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D112603	2021-11-25 18:32:04 +01:00
Daniel McIntosh	71a7c55f0f	Revert "[ThreadPool] Support returning futures with results." This reverts commit `6149e57dc1`. The offending commit broke building with LLVM_ENABLE_THREADS=OFF.	2021-11-25 12:19:35 -05:00
Quinn Pham	c3dc6b081d	[NFC][clang-tools-extra] Inclusive language: replace master with main [NFC] As part of using inclusive language within the llvm project, this patch replaces master with main in `SubModule2.h`. Reviewed By: sammccall Differential Revision: https://reviews.llvm.org/D114100	2021-11-25 11:01:16 -06:00
Kazu Hirata	bfd5dd1568	[llvm] Use range-based for loops (NFC)	2021-11-25 08:55:16 -08:00
Joe Loser	3e7452a812	[libc++] Avoid overload resolution in path comparison operators Rework `std::filesystem::path::operator==` and friends to avoid overload resolution and atomic constraint caching issues shown from https://reviews.llvm.org/D113161. Always call `__compare(string_view)` from the comparison operators which avoids overload resolution. Differential Revision: https://reviews.llvm.org/D114570	2021-11-25 11:40:55 -05:00
Joe Loser	68e7e76a9b	[libc++] Fix constraints for string_view's iterator/sentinel constructor The `string_view` constructor taking an iterator/sentinel uses concepts instead of type traits like the Standard states. Using `same_as` instead of `is_same_v` should be harmless. Prefer `std::is_same_v` instead which is cheaper to compile. Replace `convertible_to` with `is_convertible_v` as well. This observation came up while working on https://reviews.llvm.org/D113161 Differential Revision: https://reviews.llvm.org/D114561	2021-11-25 11:39:59 -05:00
Dmitry Vyukov	976bb4724c	tsan: fix another potential deadlock in fork Linux/fork_deadlock.cpp currently hangs in debug mode in the following stack. Disable memory access handling in OnUserAlloc/Free around fork. 1 0x000000000042c54b in __sanitizer::internal_sched_yield () at sanitizer_linux.cpp:452 2 0x000000000042da15 in __sanitizer::StaticSpinMutex::LockSlow (this=0x57ef02 <__sanitizer::internal_allocator_cache_mu>) at sanitizer_mutex.cpp:24 3 0x0000000000423927 in __sanitizer::StaticSpinMutex::Lock (this=0x57ef02 <__sanitizer::internal_allocator_cache_mu>) at sanitizer_mutex.h:32 4 0x000000000042354c in __sanitizer::GenericScopedLock<__sanitizer::StaticSpinMutex>::GenericScopedLock (this=this@entry=0x7ffcabfca0b8, mu=0x1) at sanitizer_mutex.h:367 5 0x0000000000423653 in __sanitizer::RawInternalAlloc (size=size@entry=72, cache=cache@entry=0x0, alignment=8, alignment@entry=0) at sanitizer_allocator.cpp:52 6 0x00000000004235e9 in __sanitizer::InternalAlloc (size=size@entry=72, cache=0x1, cache@entry=0x0, alignment=4, alignment@entry=0) at sanitizer_allocator.cpp:86 7 0x000000000043aa15 in __sanitizer::SymbolizedStack::New (addr=4802655) at sanitizer_symbolizer.cpp:45 8 0x000000000043b353 in __sanitizer::Symbolizer::SymbolizePC (this=0x7f578b77a028, addr=4802655) at sanitizer_symbolizer_libcdep.cpp:90 9 0x0000000000439dbe in __sanitizer::(anonymous namespace)::StackTraceTextPrinter::ProcessAddressFrames (this=this@entry=0x7ffcabfca208, pc=4802655) at sanitizer_stacktrace_libcdep.cpp:36 10 0x0000000000439c89 in __sanitizer::StackTrace::PrintTo (this=this@entry=0x7ffcabfca2a0, output=output@entry=0x7ffcabfca260) at sanitizer_stacktrace_libcdep.cpp:109 11 0x0000000000439fe0 in __sanitizer::StackTrace::Print (this=0x18) at sanitizer_stacktrace_libcdep.cpp:132 12 0x0000000000495359 in __sanitizer::PrintMutexPC (pc=4802656) at tsan_rtl.cpp:774 13 0x000000000042e0e4 in __sanitizer::InternalDeadlockDetector::Lock (this=0x7f578b1ca740, type=type@entry=2, pc=pc@entry=4371612) at sanitizer_mutex.cpp:177 14 0x000000000042df65 in __sanitizer::CheckedMutex::LockImpl (this=<optimized out>, pc=4) at sanitizer_mutex.cpp:218 15 0x000000000042bc95 in __sanitizer::CheckedMutex::Lock (this=0x600001000000) at sanitizer_mutex.h:127 16 __sanitizer::Mutex::Lock (this=0x600001000000) at sanitizer_mutex.h:165 17 0x000000000042b49c in __sanitizer::GenericScopedLock<__sanitizer::Mutex>::GenericScopedLock (this=this@entry=0x7ffcabfca370, mu=0x1) at sanitizer_mutex.h:367 18 0x000000000049504f in __tsan::TraceSwitch (thr=0x7f578b1ca980) at tsan_rtl.cpp:656 19 0x000000000049523e in __tsan_trace_switch () at tsan_rtl.cpp:683 20 0x0000000000499862 in __tsan::TraceAddEvent (thr=0x7f578b1ca980, fs=..., typ=__tsan::EventTypeMop, addr=4499472) at tsan_rtl.h:624 21 __tsan::MemoryAccessRange (thr=0x7f578b1ca980, pc=4499472, addr=135257110102784, size=size@entry=16, is_write=true) at tsan_rtl_access.cpp:563 22 0x000000000049853a in __tsan::MemoryRangeFreed (thr=thr@entry=0x7f578b1ca980, pc=pc@entry=4499472, addr=addr@entry=135257110102784, size=16) at tsan_rtl_access.cpp:487 23 0x000000000048f6bf in __tsan::OnUserFree (thr=thr@entry=0x7f578b1ca980, pc=pc@entry=4499472, p=p@entry=135257110102784, write=true) at tsan_mman.cpp:260 24 0x000000000048f61f in __tsan::user_free (thr=thr@entry=0x7f578b1ca980, pc=4499472, p=p@entry=0x7b0400004300, signal=true) at tsan_mman.cpp:213 25 0x000000000044a820 in __interceptor_free (p=0x7b0400004300) at tsan_interceptors_posix.cpp:708 26 0x00000000004ad599 in alloc_free_blocks () at fork_deadlock.cpp:25 27 __tsan_test_only_on_fork () at fork_deadlock.cpp:32 28 0x0000000000494870 in __tsan::ForkBefore (thr=0x7f578b1ca980, pc=pc@entry=4904437) at tsan_rtl.cpp:510 29 0x000000000046fcb4 in syscall_pre_fork (pc=1) at tsan_interceptors_posix.cpp:2577 30 0x000000000046fc9b in __sanitizer_syscall_pre_impl_fork () at sanitizer_common_syscalls.inc:3094 31 0x00000000004ad5f5 in myfork () at syscall.h:9 32 main () at fork_deadlock.cpp:46 Depends on D114595. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D114597	2021-11-25 17:08:00 +01:00
Dmitry Vyukov	b584741d06	tsan: fix Java heap block begin in reports We currently use a wrong value for heap block (only works for C++, but not for Java). Use the correct value (we already computed it before, just forgot to use). Depends on D114593. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D114595	2021-11-25 17:07:53 +01:00
Dmitry Vyukov	debac0ef37	tsan: add a benchmark for vector memory accesses Depends on D114592. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D114593	2021-11-25 17:07:46 +01:00
Dmitry Vyukov	5cac2b956b	tsan: add a test for vector memory accesses Add a basic test that checks races between vector/non-vector read/write accesses of different sizes/offsets in different orders. This gives coverage of __tsan_read/write16 callbacks. Depends on D114591. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D114592	2021-11-25 17:07:18 +01:00
Dmitry Vyukov	d841086ae6	tsan: enable -msse4 when compiling tests Vector SSE accesses make compiler emit __tsan_[unaligned_]read/write16 callbacks. Make it possible to test these. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D114591	2021-11-25 17:07:02 +01:00
David Green	fbb61adb70	[ARM] Convert fptoi.sat to fixed point multiply This is a very small addition to the existing MVE fixed point vcvt code to also create them from FP_TO_SINT_SAT and FP_TO_UINT_SAT nodes, which should be equally valid for native saturating converts under MVE. Differential Revision: https://reviews.llvm.org/D114360	2021-11-25 15:43:45 +00:00
Zarko Todorovski	890e3c55b5	[llvm][ubsan] Inclusive language: replace use of blacklist HandleLLVMOptions.cmake but use old option name Retry at https://reviews.llvm.org/D113689, this time with using the old option name to support older versions of clang. Reviewed By: bjope Differential Revision: https://reviews.llvm.org/D114033	2021-11-25 10:23:14 -05:00
Jeremy Morse	102d2a8a99	[DebugInfo][InstrRef] Track variable assignments in out-of-scope blocks DBG_INSTR_REF's and DBG_VALUE's can end up in blocks that aren't in the lexical scope of their variable. It's arguable as to what we should do about this, however VarLocBasedLDV permits such variable locations to be propagated, so let's allow it in InstrRefBasedLDV. It's necessary for the modified test to work. Differential Revision: https://reviews.llvm.org/D114578	2021-11-25 14:52:11 +00:00
David Green	e6cca3125d	[ARM] Add fptosi.sat variants of the fixed point vcvt tests. NFC	2021-11-25 14:41:20 +00:00
Alok Kumar Sharma	36cb7477d1	[clang][OpenMP][DebugInfo] Debug support for private variables inside an OpenMP task construct Currently variables appearing inside private/firstprivate/lastprivate clause of openmp task construct are not visible inside lldb debugger. This is because compiler does not generate debug info for it. Please consider the testcase debug_private.c attached with patch. ``` 28 #pragma omp task shared(res) private(priv1, priv2) firstprivate(fpriv) 29 { 30 priv1 = n; 31 priv2 = n + 2; 32 printf("Task n=%d,priv1=%d,priv2=%d,fpriv=%d\n",n,priv1,priv2,fpriv); 33 -> 34 res = priv1 + priv2 + fpriv + foo(n - 1); 35 } 36 #pragma omp taskwait 37 return res; (lldb) p priv1 error: <user expression 0>:1:1: use of undeclared identifier 'priv1' priv1 ^ (lldb) p priv2 error: <user expression 1>:1:1: use of undeclared identifier 'priv2' priv2 ^ (lldb) p fpriv error: <user expression 2>:1:1: use of undeclared identifier 'fpriv' fpriv ^ ``` After the current patch, lldb is able to show the variables ``` (lldb) p priv1 (int) $0 = 10 (lldb) p priv2 (int) $1 = 12 (lldb) p fpriv (int) $2 = 14 ``` Reviewed By: djtodoro Differential Revision: https://reviews.llvm.org/D114504	2021-11-25 19:55:22 +05:30
Tres Popp	6eca1957ee	Don't store nullptrs in mlir::FuncOp::getAll*Attrs' result These methods for results and arguments would create an ArrayRef full of nullptrs when there were no argument attributes. This is problematic because this result could not be passed to the FuncOp::build creator without causing a segfault. Now the list will have empty attributes. Differential Revision: https://reviews.llvm.org/D114358	2021-11-25 15:12:29 +01:00
Simon Pilgrim	a25e08dd3c	[PowerPC/ Regenerate fp128-bitcast-after-operation test checks	2021-11-25 13:39:57 +00:00
Alexey Bataev	4675a1654c	Revert "[SLP]Improve analysis/emission of vector operands for alternate nodes." This reverts commit `496254cf80` to fix compiler crashes reported in D114101#3152982.	2021-11-25 05:19:49 -08:00
seongwon bang	35c1e6ac1a	[MLIR] [docs] Fix misguided examples in memref.subview operation. The examples in `memref.subview` operation are misguided in that subview's strides operands mean "memref-rank number of strides that compose multiplicatively with the base memref strides in each dimension.". So the below examples should be changed from `Strides: [64, 4, 1]` to `Strides: [1, 1, 1]` Before changes ``` // Subview with constant offsets, sizes and strides. %1 = memref.subview %0[0, 2, 0][4, 4, 4][64, 4, 1] : memref<8x16x4xf32, (d0, d1, d2) -> (d0 * 64 + d1 * 4 + d2)> to memref<4x4x4xf32, (d0, d1, d2) -> (d0 * 64 + d1 * 4 + d2 + 8)> ``` After changes ``` // Subview with constant offsets, sizes and strides. %1 = memref.subview %0[0, 2, 0][4, 4, 4][1, 1, 1] : memref<8x16x4xf32, (d0, d1, d2) -> (d0 * 64 + d1 * 4 + d2)> to memref<4x4x4xf32, (d0, d1, d2) -> (d0 * 64 + d1 * 4 + d2 + 8)> ``` Also I fixed some syntax issues in docs related with memref layout map and added detailed explanation in subview rank reducing case. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D114500	2021-11-25 21:24:10 +09:00
Zarko Todorovski	7f7dac7126	[NFC][llvm] Inclusive language: reword uses of sanity test and check Part of continuing work to use more inclusive language. Reworded uses of sanity check and sanity test in llvm/test/	2021-11-25 07:21:42 -05:00
Kirill Bobyrev	59e4a67081	[clangd] Move IncludeCleaner tracer to the actual computation This way we won't get results with 0 ms for all the users with disabled IncludeCleaner.	2021-11-25 13:19:01 +01:00
mydeveloperday	c2fe2b5a63	[clang-format] [C++20] [Module] clang-format couldn't recognize partitions https://bugs.llvm.org/show_bug.cgi?id=52517 clang-format is butchering modules, this could easily become a barrier to entry for modules given clang-formats wide spread use. Prevent the following from adding spaces around the `:` (cf was considering the ':' as an InheritanceColon) Reviewed By: HazardyKnusperkeks, owenpan, ChuanqiXu Differential Revision: https://reviews.llvm.org/D114151	2021-11-25 11:51:21 +00:00
Pavel Labath	165545c7a4	[lldb/gdb-remote] Ignore spurious ACK packets Although I cannot find any mention of this in the specification, both gdb and lldb agree on sending an initial + packet after establishing the connection. OTOH, gdbserver and lldb-server behavior is subtly different. While lldb-server expects the initial ack, and drops the connection if it is not received, gdbserver will just ignore a spurious ack at _any_ point in the connection. This patch changes lldb's behavior to match that of gdb. An ACK packet is ignored at any point in the connection (except when expecting an ACK packet, of course). This is inline with the "be strict in what you generate, and lenient in what you accept" philosophy, and also enables us to remove some special cases from the server code. I've extended the same handling to NAK (-) packets, mainly because I don't see a reason to treat them differently here. (The background here is that we had a stub which was sending spurious + packets. This bug has since been fixed, but I think this change makes sense nonetheless.) Differential Revision: https://reviews.llvm.org/D114520	2021-11-25 12:34:08 +01:00
Pavel Labath	a6fedbf20c	[lldb/gdb-remote] Remove initial pipe-draining workaround This code, added in rL197579 (Dec 2013) is supposed to work around what was presumably a qemu bug, where it would send unsolicited stop-reply packets after the initial connection. At present, qemu does not exhibit such behavior. Also, the 10ms delay introduced by this code is sufficient to mask bugs in other stubs, but it is not sufficient to reliably mask those bugs. This resulted in flakyness in one of our stubs, which was (incorrectly) sending a + packet at the start of the connection, resulting in a small-but-annoying number of dropped connections. Differential Revision: https://reviews.llvm.org/D114529	2021-11-25 12:33:23 +01:00
Simon Pilgrim	63b1e58f07	[DAG] SimplifyDemandedBits - simplify rotl/rotr to shl/srl (REAPPLIED) If we only demand bits from one half of a rotation pattern, see if we can simplify to a logical shift. For the ARM/AArch64 rev16/32 patterns, I had to drop a fold to prevent srl(bswap()) -> rotr(bswap) -> srl(bswap) infinite loops. I've replaced this with an isel PatFrag which should do the same task. Reapplied with fix for AArch64 rev patterns to matching the ARM fix. https://alive2.llvm.org/ce/z/iroxki (rol -> shl by amt iff demanded bits has at least as many trailing zeros as the shift amount) https://alive2.llvm.org/ce/z/4ez_U- (ror -> shl by revamt iff demanded bits has at least as many trailing zeros as the reverse shift amount) https://alive2.llvm.org/ce/z/cD7dR- (ror -> lshr by amt iff demanded bits has at least as many leading zeros as the shift amount) https://alive2.llvm.org/ce/z/_XGHtQ (rol -> lshr by revamt iff demanded bits has at least as many leading zeros as the reverse shift amount) Differential Revision: https://reviews.llvm.org/D114354	2021-11-25 11:14:15 +00:00
mydeveloperday	d44f2a6db2	[clang-format]NFC improve the comment to match the code Missing from {D114519}	2021-11-25 11:11:30 +00:00
mydeveloperday	c94667a810	[clang-format] [PR52595] clang-format does not recognize rvalue references to array https://bugs.llvm.org/show_bug.cgi?id=52595 missing space between `T(&&)` but not between `T (&` due to && being incorrectly thought of as `UnaryOperator` rather than `PointerOrReference` ``` int operator()(T (&)[N]) { return 0; } int operator()(T(&&)[N]) { return 1; } ``` Existing Unit tests are changed because actually I think they are originally incorrect, and are inconsistent with the (&) cases that are 4 or 5 lines above them. Reviewed By: curdeius Differential Revision: https://reviews.llvm.org/D114519	2021-11-25 11:05:46 +00:00
Alexander Belyaev	57470abc41	[mlir] Move memref.[tensor_load\|buffer_cast\|clone] to "bufferization" dialect. https://llvm.discourse.group/t/rfc-dialect-for-bufferization-related-ops/4712 Differential Revision: https://reviews.llvm.org/D114552	2021-11-25 11:50:39 +01:00
Tobias Gysi	43dc6d5d57	[mlir][linalg] Cleanup hoisting test (NFC). Rename the check prefixes to HOIST21 and HOIST32 to clarify the different flag configurations. Depends On D114438 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D114442	2021-11-25 10:42:24 +00:00
Tobias Gysi	4b03906346	[mlir][linalg] Perform checks early in hoist padding. Instead of checking for unexpected operations (any operation with a region except for scf::For and `padTensorOp` or operations with a memory effect) while cloning the packing loop nest perform the checks early. Update `dropNonIndexDependencies` to check for unexpected operations. Additionally, check all of these operations have index type operands only. Depends On D114428 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D114438	2021-11-25 10:37:12 +00:00
Tobias Gysi	fd723eaa92	[mlir][linalg] Limit hoist padding to constant paddings. Limit hoist padding to pad tensor ops that depend only on a constant value. Supporting arbitrary padding values that depend on computations part of the backward slice to hoist require complex analysis to ensure the computation can be hoisted. Depends On D114420 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D114428	2021-11-25 10:31:39 +00:00
Tobias Gysi	ed7c1fb9b0	[mlir][linalg] Add backward slice filtering in hoist padding. Adapt hoist padding to filter the backward slice before cloning the packing loop nest. The filtering removes all operations that are not used to index the hoisted pad tensor op and its extract slice op. The filtering is needed to support the more complex loop nests created after fusion. For example, fusing the producer of an output operand can added linalg ops and pad tensor ops to the backward slice. These operations have regions and currently prevent hoisting. The following example demonstrates the effect of the newly introduced `dropNonIndexDependencies` method that filters the backward slice: ``` %source = linalg.fill(%cst, %arg0) scf.for %i %unrelated = linalg.fill(%cst, %arg1) // not used to index %source! scf.for %j (%arg2 = %unrelated) scf.for %k // not used to index %source! %ubi = affine.min #map(%i) %ubj = affine.min #map(%j) %slice = tensor.extract_slice %source [%i, %j] [%ubi, %ubj] %padded_slice = linalg.pad_tensor %slice ``` dropNonIndexDependencies(%padded_slice, %slice) removes [scf.for %k, linalg.fill(%cst, %arg1)] from backwardSlice. Depends On D114175 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D114420	2021-11-25 10:30:10 +00:00
Sheldon Neuberger	e2cad4df22	[clangd] Add ObjC method support to prepareCallHierarchy This fixes "textDocument/prepareCallHierarchy" in clangd for ObjC methods. Details at https://github.com/clangd/vscode-clangd/issues/247. clangd uses Decl::isFunctionOrFunctionTemplate to check if the decl given in a prepareCallHierarchy request is eligible for prepareCallHierarchy. We change to use isFunctionOrMethod which includes functions and ObjC methods. Reviewed By: kadircet Differential Revision: https://reviews.llvm.org/D114058	2021-11-25 11:23:24 +01:00
David Green	3a700cabdc	[SDAG] Allow Unknown sizes when refining MMO alignments. NFC The changes in D113888 / `32b6c17b29` altered the memory size of a masked store, as it will store an unknown number of bytes not the full vector size. We can have situations where the masked stores is legalized and then turned to a normal store, as the mask is known to be all ones. This creates a store with an unknown size MMO that was hitting this assert. The store created can be given a better size in a followup patch. This currently adjusts the assert to handle unknown sizes.	2021-11-25 10:19:29 +00:00
Matthias Springer	48107eaa07	[mlir][linalg][bufferize][NFC] Move SCF interface impl to new build target This makes ComprehensiveBufferize entirely independent of the SCF dialect. Differential Revision: https://reviews.llvm.org/D114221	2021-11-25 19:00:17 +09:00

1 2 3 4 5 ...

405768 Commits All Branches Search

405768 Commits

All Branches