llvm-project

Commit Graph

Author	SHA1	Message	Date
MaheshRavishankar	42444d0cf0	[mlir][Linalg] NFC: Verify tiling on linalg.generic operation on tensors. With the recent changes to linalg on tensor semantics, the tiling operations works out-of-the-box for generic operations. Add a test to verify that and some minor refactoring. Differential Revision: https://reviews.llvm.org/D93077	2021-01-14 16:17:08 -08:00
MaheshRavishankar	774c9c6ef3	[mlir][Linalg] Add canonicalization of linalg op -> dim op. Add canonicalization to replace use of the result of a linalg operation on tensors in a dim operation, to use one of the operands of the linalg operations instead. This allows the linalg op itself to be deleted when all its non-dim uses are removed (say through tiling, etc.) Differential Revision: https://reviews.llvm.org/D93076	2021-01-14 16:17:08 -08:00
Shilei Tian	547b032ccc	[OpenMP] Remove omptarget-nvptx from deps as it is no longer a valid target `omptarget-nvptx` is still a dependence for `check-libomptarget-nvtpx` although it has been removed by D94573. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D94725	2021-01-14 19:16:11 -05:00
Sanjay Patel	b21905dfe3	[SLP] remove unnecessary state in matching reductions This is NFC-intended. I'm still trying to figure out how the loop where this is used works. It does not seem like we require this data at all, but it's hard to confirm given the complicated predicates.	2021-01-14 18:32:37 -05:00
MaheshRavishankar	722ae10907	[mlir][Linalg] Add canonicalization to remove no-op linalg operations. linalg.generic/indexed_generic operations on tensors whose body is just yielding the (non-induction variable) arguments of the operation can be canonicalized by replacing uses of the result with the corresponding arguments. Differential Revision: https://reviews.llvm.org/D94581	2021-01-14 14:59:24 -08:00
Sam McCall	4183999e0f	[clangd] Reduce logspam for CDB scanning	2021-01-14 23:55:02 +01:00
Andrew Young	a55a0a3056	[mlir] Remove over specified memory effects The standard and gpu dialect both have `alloc` operations which use the memory effect `MemAlloc`. In both cases, it is specified on both the operation itself and on the result. This results in two memory effects being created for these operations. When `MemAlloc` is defined on an operation, it represents some background effect which the compiler cannot reason about, and inhibits the ability of the compiler to remove dead `std.alloc` operations. This change removes the uneeded `MemAlloc` effect from these operations and leaves the effect on the result, which allows dead allocs to be erased. There is the same problem, but to a lesser extent, with MemFree, MemRead and MemWrite. Over-specifying these traits is not currently inhibiting any optimization. Differential Revision: https://reviews.llvm.org/D94662	2021-01-14 14:49:41 -08:00
Sam Elliott	8a53a7375a	[RISCV][NFC] Regenerate Calling Convention Tests This regenerates these tests using utils/update_llc_test_checks.py so that future changes in this area don't have the noise of lots of `@plt` lines being added. I also removed the `nounwind`s from the stack-realignment.ll test to increase coverage on the generated call frame information.	2021-01-14 22:35:17 +00:00
Teresa Johnson	5b42fd8dd4	[LTO] Test format fix (NFC) As requested in D91583, use ';;' instead of ';' to preceed comments in lld test. I did this in the equivalent gold test as well.	2021-01-14 14:09:50 -08:00
Alexandre Ganea	4fcb25583c	Re-land [Support] On Windows, take the affinity mask into account The number of hardware threads available to a ThreadPool can be limited if setting an affinity mask. For example: > start /B /AFFINITY 0xF lld-link.exe ... Would let LLD only use 4 hyper-threads. Previously, there was an outstanding issue on Windows Server 2019 on dual-CPU machines, which was preventing from using both CPU sockets. In normal conditions, when no affinity mask was set, ProcessorGroup::AllThreads was different from ProcessorGroup::UsableThreads. The previous code in llvm/lib/Support/Windows/Threading.inc L201 was improperly assuming those two values to be equal, and consequently was limiting the execution to only one CPU socket. Differential Revision: https://reviews.llvm.org/D92419	2021-01-14 17:03:22 -05:00
Craig Topper	b894a9fb23	[RISCV] Optimize select_cc after fp compare expansion Some FP compares expand to a sequence ending with (xor X, 1) to invert the result. If the consumer is a select_cc we can likely get rid of this xor by fixing up the select_cc condition. This patch combines (select_cc (xor X, 1), 0, setne, trueV, falseV) - (select_cc X, 0, seteq, trueV, falseV) if we can prove X is 0/1. Reviewed By: lenary Differential Revision: https://reviews.llvm.org/D94546	2021-01-14 13:41:40 -08:00
Douglas Yung	f85b153166	Add -fexceptions to test as it uses them and fails on platforms where it is not on by default (like the PS4).	2021-01-14 13:27:06 -08:00
Roland McGrath	e7228062b2	[libc] Use #undef isascii in specific header Standard C allows all standard headers to declare macros for all their functions. So after possibly including any standard header like <ctype.h>, it's perfectly normal for any and all of the functions it declares to be defined as macros. Standard C requires explicit `#undef` before using that identifier in a way that is not compatible with function-like macro definitions. The C standard's rules for this are extended to POSIX as well for the interfaces it defines, and it's the expected norm for nonstandard extensions declared by standard C library headers too. So far the only place this has come up for llvm-libc's code is with the isascii function in Fuchsia's libc. But other cases can arise for any standard (or common extension) function names that source code in llvm-libc is using in nonstandard ways, i.e. as C++ identifiers. The only correct and robust way to handle the possible inclusion of standard C library headers when building llvm-libc source code is to use `#undef` explicitly for each identifier before using it. The easy and obvious place to do that is in the per-function header. This requires that all code, such as test code, that might include any standard C library headers, e.g. via utils/UnitTest/Test.h, make sure to include those first before the per-function header. This change does that for isascii and its test. But it should be done uniformly for all the code and documented as a consistent convention so new implementation files are sure to get this right. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D94642	2021-01-14 13:25:05 -08:00
Nico Weber	0975604cc0	[gn build] (manually) port `387d3c2479`	2021-01-14 16:19:25 -05:00
Jinsong Ji	0f588ac03e	[PowerPC] Only use some extend mne if assembler is modern enough Legacy AIX assembly might not support all extended mnes, add one feature bit to control the generation in MC, and avoid generating them by default on AIX. Reviewed By: sfertile Differential Revision: https://reviews.llvm.org/D94458	2021-01-14 20:36:10 +00:00
Sean Silva	e2d7d3cb0e	[mlir][docs] Bring bufferization docs up to date. This spilts out BufferDeallocationInternals.md, since buffer deallocation is not part of bufferization per se. Differential Revision: https://reviews.llvm.org/D94351	2021-01-14 12:28:35 -08:00
Adam Czachorowski	a71877edfb	[clang] Do not crash when CXXRecordDecl has a non-CXXRecordDecl base. This can happen on some invalid code, like the included test case. Differential Revision: https://reviews.llvm.org/D94704	2021-01-14 21:20:06 +01:00
Tres Popp	5cf2696317	[mlir] Remove TosaToLinalg dependency on all Passes TosaToLinalg was depending on its header file indirectly through Passes.h rather than directly. This removes that indirection. Differential Revision: https://reviews.llvm.org/D94706	2021-01-14 21:08:32 +01:00
River Riddle	c8fb6ee341	[mlir][PatternRewriter] Add a new hook to selectively replace uses of an operation This revision adds a new `replaceOpWithIf` hook that replaces uses of an operation that satisfy a given functor. If all uses are replaced, the operation gets erased in a similar manner to `replaceOp`. DialectConversion support will be added in a followup as this requires adjusting how replacements are tracked there. Differential Revision: https://reviews.llvm.org/D94632	2021-01-14 11:58:21 -08:00
Craig Topper	387d3c2479	[RISCV] Merge Utils library into MCTargetDesc MCTargetDesc includes headers from Utils and Utils includes headers from MCTargetDesc. So from a library layering perspective it makes sense for them to be in the same library. I guess the other option might be to move the tablegen includes from RISCVMCTargetDesc.h to RISCVBaseInfo.h so that RISCVBaseInfo.h didn't need to include RISCVMCTargetDesc.h. Everything else that depends on Utils also depends on MCTargetDesc so having one library seemed simpler. Differential Revision: https://reviews.llvm.org/D93168	2021-01-14 11:47:30 -08:00
Fangrui Song	e3b9af92a4	[Driver] -gsplit-dwarf: Produce .dwo regardless of -gN for IR input This generalizes D94647 to IR input, as suggested by @tejohnson. Ideally the driver should just forward split dwarf options, but doing this currently will cause `clang -gsplit-dwarf -c a.c` to create a .dwo with just `.strtab`. Reviewed By: dblaikie, tejohnson Differential Revision: https://reviews.llvm.org/D94655	2021-01-14 11:46:22 -08:00
River Riddle	93592b726c	[mlir][OpFormatGen] Format enum attribute cases as keywords when possible In the overwhelmingly common case, enum attribute case strings represent valid identifiers in MLIR syntax. This revision updates the format generator to format as a keyword in these cases, removing the need to wrap values in a string. The parser still retains the ability to parse the string form, but the printer will use the keyword form when applicable. Differential Revision: https://reviews.llvm.org/D94575	2021-01-14 11:35:49 -08:00
River Riddle	00a61b327d	[mlir][ODS] Add new RangedTypesMatchWith operation predicate This is a variant of TypesMatchWith that provides support for variadic arguments. This is necessary because ranges generally can't use the default operator== comparators for checking equality. Differential Revision: https://reviews.llvm.org/D94574	2021-01-14 11:35:49 -08:00
Nikita Popov	a3904cc77f	[BasicAA] Handle recursive queries more efficiently An alias query currently works out roughly like this: * Look up location pair in cache. * Perform BasicAA logic (including cache lookup and insertion...) * Perform a recursive query using BestAAResults. * Look up location pair in cache (and thus do not recurse into BasicAA) * Query all the other AA providers. * Query all the other AA providers. This is a lot of unnecessary work, all ultimately caused by the BestAAResults query at the end of aliasCheck(). The reason we perform it, is that aliasCheck() is getting called recursively, and we of course want those recursive queries to also make use of other AA providers, not just BasicAA. We can solve this by making the recursive queries directly use BestAAResults (which will check both BasicAA and other providers), rather than recursing into aliasCheck(). There are some tradeoffs: * We can no longer pass through the precomputed underlying object to aliasCheck(). This is not a major concern, because nowadays getUnderlyingObject() is quite cheap. * Results from other AA providers are no longer cached inside BasicAA. The way this worked was already a bit iffy, in that a result could be cached, but if it was MayAlias, we'd still end up re-querying other providers anyway. If we want to cache non-BasicAA results, we should do that in a more principled manner. In any case, despite those tradeoffs, this works out to be a decent compile-time improvment. I think it also simplifies the mental model of how BasicAA works. It took me quite a while to fully understand how these things interact. Differential Revision: https://reviews.llvm.org/D90094	2021-01-14 20:32:41 +01:00
Mehdi Amini	d8113cda78	Add newline to terminate debug message (NFC)	2021-01-14 19:29:18 +00:00
Rob Suderman	1d973b7ded	[MLIR][TOSA] First lowerings from Tosa to Linalg Initial commit to add support for lowering from TOSA to Linalg. The focus is on the essential infrastructure for these lowerings and integration with existing passes. Includes lowerings for a subset of operations including: abs, add, sub, pow, and, or, xor, left shift, right shift, tanh Lit tests are used to validate correctness. Differential Revision: https://reviews.llvm.org/D94247	2021-01-14 11:24:23 -08:00
Erich Keane	9e53c94d8d	[NFC] Update test to not check for 'opaque' in the file name. The intent presumably is to avoid generating 'opaque' in the IR, but the header contains the filename. Thus, having the workspace in a directory with opaque in it causes this test to fail. This just adds a 'CHECK' line on target-triple, which is the last line of the IR-header.	2021-01-14 11:24:06 -08:00
Valentin Clement	ca98baa042	[openacc] Rename generated file from ACC.cpp.inc to ACC.inc to match D92955 This patch rename the tablegen generated file ACC.cpp.inc to ACC.inc in order to match what was done in D92955. This file is included in header file as well as .cpp file so it make more sense. Reviewed By: sameeranjoshi Differential Revision: https://reviews.llvm.org/D93485	2021-01-14 14:19:53 -05:00
Mitch Phillips	a8520f6970	[GWP-ASan] Minor refactor of optional components. In preparation for the inbuilt options parser, this is a minor refactor of optional components including: - Putting certain optional elements in the right header files, according to their function and their dependencies. - Cleaning up some old and mostly-dead code. - Moving some functions into anonymous namespaces to prevent symbol export. Reviewed By: cryptoad, eugenis Differential Revision: https://reviews.llvm.org/D94117	2021-01-14 11:14:11 -08:00
Shilei Tian	64e9e9aeee	[OpenMP] Dropped unnecessary define when compiling deviceRTLs for NVPTX The comment said CUDA 9 header files use the `nv_weak` attribute which `clang` is not yet prepared to handle. It's three years ago and now things have changed. Based on my test, removing the definition doesn't have any problem on my machine with CUDA 11.1 installed. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D94700	2021-01-14 13:55:12 -05:00
Hiroshi Yamauchi	202d359753	[X86] Add the FSRM feature (Fast Short Rep Mov) to Zen3. Note -x86-use-fsrm-for-memcpy is still disabled by default and there's no default behavior change. Differential Revision: https://reviews.llvm.org/D94436	2021-01-14 10:47:33 -08:00
Zequan Wu	4fffbc150c	[clang][MSVC] Fix missing MSInheritanceAttr in template specialization. Fix PR48687. Differential Revision: https://reviews.llvm.org/D94646	2021-01-14 10:37:35 -08:00
Shilei Tian	763c1f9933	[OpenMP] Drop the static library libomptarget-nvptx For NVPTX target, OpenMP provides a static library `libomptarget-nvptx` built by NVCC, and another bitcode `libomptarget-nvptx-sm_{$sm}.bc` generated by Clang. When compiling an OpenMP program, the `.bc` file will be fed to `clang` in the second run on the program that compiles the target part. Then the generated PTX file will be fed to `ptxas` to generate the object file, and finally the driver invokes `nvlink` to generate the binary, where the static library will be appened to `nvlink`. One question is, why do we need two libraries? The only difference is, the static library contains `omp_data.cu` and the bitcode library doesn't. It's unclear why they were implemented in this way, but per D94565, there is no issue if we also include the file into the bitcode library. Therefore, we can safely drop the static library. This patch is about the change in OpenMP. The driver will be updated as well if this patch is accepted. Reviewed By: jdoerfert, JonChesterfield Differential Revision: https://reviews.llvm.org/D94573	2021-01-14 13:34:25 -05:00
Arjun P	6ebeba88f5	Support emptiness checks for unbounded FlatAffineConstraints. With this, we have complete support for emptiness checks. This also paves the way for future support to check if two FlatAffineConstraints are equal. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D94272	2021-01-14 19:33:37 +01:00
Aaron En Ye Shi	be40c12040	[HIP] Add signbit(long double) decl An _MSC_VER version of signbit(long double) is required for MSVC headers. Fixes: SWDEV-256409 Differential Revision: https://reviews.llvm.org/D93062	2021-01-14 18:23:37 +00:00
Joseph Tremoulet	85dfcaadc5	[LLDB] MinidumpParser: Prefer executable module even at higher address When a program maps one of its own modules for reading, and then crashes, breakpad can emit two entries for that module in the ModuleList. We have logic to identify this case by checking permissions on mapped memory regions and report just the module with an executable region. As currently written, though, the check is asymmetric -- the entry with the executable region must be the second one encountered for the preference to kick in. This change makes the logic symmetric, so that the first-encountered module will similarly be preferred if it has an executable region but the second-encountered module does not. This happens for example when the module in question is the executable itself, which breakpad likes to report first -- we need to ignore the other entry for that module when we see it later, even though it may be mapped at a lower virtual address. Reviewed By: clayborg Differential Revision: https://reviews.llvm.org/D94629	2021-01-14 13:17:57 -05:00
Jay Foad	868da2ea93	[SelectionDAG] Remove an early-out from computeKnownBits for smin/smax Even if we know nothing about LHS, it can still be useful to know that smax(LHS, RHS) >= RHS and smin(LHS, RHS) <= RHS. Differential Revision: https://reviews.llvm.org/D87145	2021-01-14 18:15:17 +00:00
Jon Chesterfield	5d165f0b89	[libomptarget][amdgpu] Fix kernel launch tracing to match previous behavior Restore control of kernel launch tracing to be >= 1 as it was before export LIBOMPTARGET_KERNEL_TRACE=1 Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D94695	2021-01-14 18:13:22 +00:00
Simon Pilgrim	b99782cf78	[X86][AVX] Adjust unsigned saturation downconvert negative test D87145 was showing that this test (added in D45315) could always be constant folded (with suitable value tracking). What we actually needed was smax(smin()) negative test coverage, the invert of negative_test2_smax_usat_trunc_wb_256_mem, so I've tweaked the test to provide that instead.	2021-01-14 17:51:23 +00:00
Arthur Eubanks	a03ffa9850	[NewPM] Fix placement of LoopFlatten https://reviews.llvm.org/D90402 was inconsistent with where it put LoopFlatten between the two pass managers. It also missed adding it to the non-O1 function simplification pipeline. PR48738 Reviewed By: SjoerdMeijer Differential Revision: https://reviews.llvm.org/D94650	2021-01-14 09:49:31 -08:00
Mircea Trofin	35c8a6cbf5	[NFC] Disallow unused prefixes under MC/AArch64 Differential Revision: https://reviews.llvm.org/D94616	2021-01-14 09:46:13 -08:00
peter klausler	4864d9f7e9	[flang] Fix some module file issues exposed by Whizard Generic type-bound interfaces for user-defined operators need to be formatted as "OPERATOR(.op.)", not just ".op." PRIVATE generics need to be marked as such. Declaration ordering: when a generic interface shadows a derived type of the same name, it needs to be emitted to the module file at the point of definition of the derived type; otherwise, the derived type's definition may appear after its first use. The module symbol for a module read from a module file needs to be marked as coming from a module file before semantic processing is performed on the contents of the module so that any special handling for declarations in module files can be properly activated. IMPORT statements were sometimes missing for use-associated symbols in surrounding scopes; fine-tune NeedImport(). Differential Revision: https://reviews.llvm.org/D94636	2021-01-14 09:44:50 -08:00
LLVM GN Syncbot	b4e083b0ef	[gn build] Port `2f395b7092`	2021-01-14 17:39:58 +00:00
Utkarsh Saxena	8b09cf7956	[clangd] Trivial: Documentation fix in ASTSignals.	2021-01-14 18:38:42 +01:00
Utkarsh Saxena	2f395b7092	[clangd] Make AST-based signals available to runWithPreamble. Many useful signals can be derived from a valid AST which is regularly updated by the ASTWorker. `runWithPreamble` does not have access to the ParsedAST but it can be provided access to some signals derived from a (possibly stale) AST. Differential Revision: https://reviews.llvm.org/D94424	2021-01-14 18:34:50 +01:00
Mircea Trofin	e21bf875c0	[NFC] Disallow unused prefixes under MC/ARM Differential Revision: https://reviews.llvm.org/D94620	2021-01-14 08:56:45 -08:00
Andrzej Warzynski	0afdbb4d2d	[flang][driver] Use __FLANG_VERISION__ in f18.cpp (nfc) Just a minor improvement suggested in a post-commit review here: https://reviews.llvm.org/D94422	2021-01-14 16:51:49 +00:00
Sam Elliott	7c9c2a2ea5	Revert "[RISCV] Legalize select when Zbt extension available" We found issues with this patch in additional testing. Backing out while we work on a fix. This reverts commit `71ed4b6ce5`.	2021-01-14 16:44:34 +00:00
Sam McCall	17fb21f875	[clangd] Remove another option that was effectively always true. NFC	2021-01-14 17:19:47 +01:00
Simon Pilgrim	0a59647ee4	[SystemZ] misched-cutoff tests can only be tested on non-NDEBUG (assertion) builds Fixes clang-with-thin-lto-ubuntu buildbot after D94383/rGddd03842c347	2021-01-14 15:46:27 +00:00

1 2 3 4 5 ...

377077 Commits All Branches Search

377077 Commits

All Branches