llvm-project

Commit Graph

Author	SHA1	Message	Date
Craig Topper	7977fee43c	[X86] Autogenerate complete checks. NFC	2020-12-12 16:37:28 -08:00
Amara Emerson	21de99d43c	[[GlobalISel][IRTranslator] Fix a crash when the use of an extractvalue is a non-dominated metadata use. We don't expect uses to come before defs in the CFG, so allocateVRegs() asserted. Fixes PR48211	2020-12-12 14:58:54 -08:00
Roman Lebedev	d38205144f	[SimplifyCFG] FoldBranchToCommonDest(): bonus instrns must only be used by PHI nodes in successors (PR48450) In particular, if the successor block, which is about to get a new predecessor block, currently only has a single predecessor, then the bonus instructions will be directly used within said successor, which is fine, since the block with bonus instructions dominates that successor. But once there's a new predecessor, the IR is no longer valid, and we don't fix it, because we only update PHI nodes. Which means, the live-out bonus instructions must be exclusively used by the PHI nodes in successor blocks. So we have to form trivial PHI nodes. which will then be successfully updated to recieve cloned bonus instns. This all works fine, except for the fact that we don't have access to the dominator tree, and we don't ignore unreachable code, so we sometimes do end up having to deal with some weird IR. Fixes https://bugs.llvm.org/show_bug.cgi?id=48450	2020-12-13 00:06:57 +03:00
Zarko Todorovski	ce4040a43d	[PPC] Check for PPC64 when emitting 64bit specific VSX nodes when pattern matching built vectors Some of the pattern matching in PPCInstrVSX.td and node lowering involving vectors assumes 64bit mode. This patch disables some of the unsafe pattern matching and lowering of BUILD_VECTOR in 32bit mode. Reviewed By: Xiangling_L Differential Revision: https://reviews.llvm.org/D92789	2020-12-12 15:28:28 -05:00
Alexey Bader	a500a43587	[CodeGen][AMDGPU] Fix ICE for static initializer IR generation Differential Revision: https://reviews.llvm.org/D92782	2020-12-12 23:26:54 +03:00
Nico Weber	956034c6c8	[mac/arm] XFAIL two more tests on arm64-apple Part of PR46644	2020-12-12 15:20:50 -05:00
Nikita Popov	afbb6d97b5	[CVP] Simplify and generalize switch handling CVP currently handles switches by checking an equality predicate on all edges from predecessor blocks. Of course, this can only work if the value being switched over is defined in a different block. Replace this implementation with a call to getPredicateAt(), which also does the predecessor edge predicate check (if not defined in the same block), but can also do quite a bit more: It can reason about phi-nodes by checking edge predicates for incoming values, it can reason about assumes, and it can reason about block values. As such, this makes the implementation both simpler and more powerful. The compile-time impact on CTMark is in the noise.	2020-12-12 21:12:27 +01:00
Nico Weber	a5c65de295	mac/arm: XFAIL the last 3 failing tests We should fix them, but let's XFAIL them for now so that we can start running check-clang on bots and lock in the passing tests. Part of 46644.	2020-12-12 15:09:17 -05:00
Nikita Popov	ff523aa441	[CVP] Add additional switch tests (NFC) These cover cases handled by getPredicateAt(), but not by the current implementation: * Assumes based on context instruction. * Value from phi node in same block (using per-pred reasoning). * Value from non-phi node in same block (using block-val reasoning).	2020-12-12 20:58:00 +01:00
Krzysztof Parzyszek	baf931a842	[Hexagon] Reconsider getMask fix, return original mask, convert later The getPayload/getMask/getPassThrough functions should return values that could be composed into a masked load/store without any additional type casts. The previous fix violated that. Instead, convert scalar mask to a vector right before rescaling.	2020-12-12 13:27:22 -06:00
Tony	7beee561e2	[AMDGPU] Add missing targets to target-invalid-cpu-note.c Differential Revision: https://reviews.llvm.org/D93018	2020-12-12 18:19:03 +00:00
Tony	92ab6ed667	[AMDGPU] Add missing targets to amdgpu-features.cl Differential Revision: https://reviews.llvm.org/D93017	2020-12-12 18:19:02 +00:00
Tony	87a4e14e40	[NFC][AMDGPU] AMDGPUUsage updates - Document which processors are supported by which runtimes. - Add missing mappings for code object V2 note records Differential Revision: https://reviews.llvm.org/D93016	2020-12-12 18:19:02 +00:00
Brian Gesiak	09b0e0884a	[mlir] Print bad size in AttrSizedOperandSegments When printing verification errors for ops with the incorrect number of operand segments, print the required number as well as the actual number. Split off from D93005. Differential Revision: https://reviews.llvm.org/D93145	2020-12-12 13:12:31 -05:00
Kazu Hirata	9293b251b5	[Analysis/Interval] Remove isLoop (NFC) The last use of isLoop was removed on Apr 29, 2002 in commit `09bbb5c015` as part of an effort to remove "old induction varaible cannonicalization pass built on top of interval analysis".	2020-12-12 10:09:35 -08:00
Kazu Hirata	215c1b1935	[Transforms] Use is_contained (NFC)	2020-12-12 09:37:49 -08:00
Krzysztof Parzyszek	2cf5310471	[Hexagon] Create vector masks for scalar loads/stores AlignVectors treats all loaded/stored values as vectors of bytes, and masks as corresponding vectors of booleans, so make getMask produce a 1-element vector for scalars from the start.	2020-12-12 11:12:17 -06:00
Harald van Dijk	67c97ed4a5	[UpdateTestChecks] Add --(no-)x86_scrub_sp option. This makes it possible to use update_llc_test_checks to manage tests that check for incorrect x86 stack offsets. It does not yet modify any test to make use of this new option.	2020-12-12 17:11:13 +00:00
Harald van Dijk	f61e5ecb91	[X86] Avoid data16 prefix for lea in x32 mode The ABI demands a data16 prefix for lea in 64-bit LP64 mode, but not in 64-bit ILP32 mode. In both modes this prefix would ordinarily be ignored, but the instructions may be changed by the linker to instructions that are affected by the prefix. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D93157	2020-12-12 17:05:24 +00:00
David Green	a4823377fd	[ARM] Add basic masked load/store costs This adds some basic MVE masked load/store costs, notably changing the cost of legal loads/stores to the MVECostFactor and the cost of scalarized instructions to 8*NumElts. Differential Revision: https://reviews.llvm.org/D86538	2020-12-12 15:26:32 +00:00
David Green	ab97c9bdb7	[LV] Fix scalar cost for tail predicated loops When it comes to the scalar cost of any predicated block, the loop vectorizer by default regards this predication as a sign that it is looking at an if-conversion and divides the scalar cost of the block by 2, assuming it would only be executed half the time. This however makes no sense if the predication has been introduced to tail predicate the loop. Original patch by Anna Welker Differential Revision: https://reviews.llvm.org/D86452	2020-12-12 14:21:40 +00:00
Nikita Popov	d716eab197	[BasicAA] Make non-equal index handling simpler to extend (NFC)	2020-12-12 15:00:47 +01:00
Nikita Popov	b0ce2b72e8	[BasicAA] Add tests for non-zero var index (NFC)	2020-12-12 15:00:46 +01:00
Melanie Blower	320af6b138	Create SPIRABIInfo to enable SPIR_FUNC calling convention. Background: Call to library arithmetic functions for div is emitted by the compiler and it set wrong “C” calling convention for calls to these functions, whereas library functions are declared with `spir_function` calling convention. InstCombine optimization replaces such calls with “unreachable” instruction. It looks like clang lacks SPIRABIInfo class which should specify default calling conventions for “system” function calls. SPIR supports only SPIR_FUNC and SPIR_KERNEL calling convention. Reviewers: Erich Keane, Anastasia Differential Revision: https://reviews.llvm.org/D92721	2020-12-12 05:48:20 -08:00
Tatyana Krasnukha	a01b26fb51	[lldb] Make CommandInterpreter's execution context the same as debugger's one. Currently, the interpreter's context is not updated until a command is executed. This has resulted in the behavior of SB-interface functions and some commands depends on previous user actions. The interpreter's context can stay uninitialized, point to a currently selected target, or point to one of previously selected targets. This patch removes any usages of CommandInterpreter::UpdateExecutionContext. CommandInterpreter::HandleCommand* functions still may override context temporarily, but now they always restore it before exiting. CommandInterpreter saves overriden contexts to the stack, that makes nesting commands possible. Added test reproduces one of the issues. Without this fix, the last assertion fails because interpreter's execution context is empty until running "target list", so, the value of the global property was updated instead of process's local instance. Differential Revision: https://reviews.llvm.org/D92164	2020-12-12 16:40:59 +03:00
Tatyana Krasnukha	7832d7e95a	[lldb] Modernize TargetList for-loops, NFC Replace loops with standard algorithms or range-based loops.	2020-12-12 16:40:58 +03:00
Tatyana Krasnukha	2634ec6ce9	[lldb] "target create" shouldn't save target if the command failed TargetList::CreateTarget automatically adds created target to the list, however, CommandObjectTargetCreate does some additional preparation after creating a target and which can fail. The command should remove created target if it failed. Since the function has many ways to return, scope guard does this work safely. Changes to the TargetList make target adding and selection more transparent. Other changes remove unnecessary SetSelectedTarget after CreateTarget. Differential Revision: https://reviews.llvm.org/D93052	2020-12-12 16:40:58 +03:00
Luo, Yuanke	e52bc1d2bb	[X86] Add chain in ISel for x86_tdpbssd_internal intrinsic.	2020-12-12 21:14:38 +08:00
Nathan James	0e5bfffb13	[YAML] Support extended spellings when parsing bools. Support all the spellings of boolean datatypes according to https://yaml.org/type/bool.html Reviewed By: silvas Differential Revision: https://reviews.llvm.org/D92755	2020-12-12 12:50:34 +00:00
David Green	f6e885ad2a	[ARM] Test for showing scalar vector costs. NFC	2020-12-12 11:43:14 +00:00
Jan Svoboda	adf3c27742	[clang][cli] Revert accidental access-control flag rename This commit <https://reviews.llvm.org/rGe5158b52730d323bb8cd2cba6dc6c89b90cba452> introduced an accidental change, which renames `-faccess-control` and `-fno-access-control` to `-fno-access-control` and `-fno-no-access-control`. Reviewed By: dexonsmith, MaskRay Differential Revision: https://reviews.llvm.org/D93104	2020-12-12 11:26:53 +01:00
Jan Svoboda	6baa9769ed	[clang][cli] Add flexible TableGen multiclass for boolean options This introduces more flexible multiclass for declaring two flags controlling the same boolean keypath. Compared to existing Opt{In,Out}FFlag multiclasses, the new syntax makes it easier to read option declarations and reason about the keypath. This also makes specifying common properties of both flags possible. I'm open to suggestions on the class names. Not 100% sure the benefits are worth the added complexity. Depends on D92774. Reviewed By: dexonsmith Differential Revision: https://reviews.llvm.org/D92775	2020-12-12 10:53:28 +01:00
Jan Svoboda	10f40576f7	[clang][cli] Don't always emit -f[no-]legacy-pass-manager We don't need to always generate `-f[no-]experimental-new-pass-manager`. This patch does not change the behavior of any other command line flag. (For example `-triple` is still being always generated.) Reviewed By: dexonsmith, Bigcheese Differential Revision: https://reviews.llvm.org/D92857	2020-12-12 10:11:23 +01:00
Jan Svoboda	6f26a6de48	Reland "[clang][cli] CompilerInvocationTest: add tests for boolean options" Add more tests of the command line marshalling infrastructure. The new tests now make a "round-trip": from arguments, to CompilerInvocation instance to arguments again in a single test case. The TODOs are resolved in a follow-up patch. Depends on D92830. Reviewed By: dexonsmith Differential Revision: https://reviews.llvm.org/D92774	2020-12-12 09:46:20 +01:00
Kazu Hirata	eb44682d67	[Analysis] Use is_contained (NFC)	2020-12-11 21:19:31 -08:00
Mircea Trofin	f76b7f22f0	[MLGO] Fix build break as result of new InstructionCost (D91174)	2020-12-11 20:28:39 -08:00
Giorgis Georgakoudis	e007b32864	[OpenMP] Add time profiling for libomptarget Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D93055	2020-12-11 18:53:37 -08:00
Fangrui Song	7698a01808	[llvm-cov gcov] Replace Donald B. Johnson's cycle enumeration with iterative cycle finding gcov computes the line execution count as the sum of (a) counts from predecessors on other lines and (b) the sum of loop execution counts of blocks on the same line (think of loops on one line). For (b), we use Donald B. Johnson's cycle enumeration algorithm and perform cycle cancelling for each cycle. This number of candidate cycles were exponential and D93036 made it polynomial by skipping zero count cycles. The time complexity is high (O(VE^2) (it could be O(E^2) but the linear `Blocks` check made it higher) and the implementation is complex. We could just identify loops and sum all back edges. However, this requires a dominator tree construction which is more complex. The time complexity can be decreased to almost linear, though. This patch just performs cycle cancelling iteratively. Add two members `traversable` and `incoming` to GCOVArc. There are 3 states: `!traversable`: blocks not on this line or explored blocks * `traversable && incoming == nullptr`: unexplored blocks * `traversable && incoming != nullptr`: blocks which are being explored (on the stack) If an arc points to a block being explored, a cycle has been found. Let E be the number of arcs. Every time a cycle is found, at least one arc is saturated (`edgeCount` reduced to 0), so there are at most E cycles. Finding one cycle takes O(E) time, so the overall time complexity is O(E^2). Note that we always augment through a back edge and never need to augment its reverse edge so reverse edges in traditional flow networks are not needed. Reviewed By: xinhaoyuan Differential Revision: https://reviews.llvm.org/D93073	2020-12-11 18:28:16 -08:00
Fangrui Song	3b3bc5d45a	[Kaleidoscope] Migrate DebugInfo::get to DILocation::get	2020-12-11 18:01:04 -08:00
River Riddle	e9987ad878	[mlir][docs] Tidy up the pass infrastructure documentation The doc has grown stale and is missing some recent changes to the infrastructure. Differential Revision: https://reviews.llvm.org/D93081	2020-12-11 17:53:33 -08:00
Duncan P. N. Exon Smith	e095959e0c	Fixup for `8c86197de3` to avoid making it platform-dependent	2020-12-11 17:34:00 -08:00
Duncan P. N. Exon Smith	8c86197de3	clang-import-test: Clean up error output for files that cannot be found Pass on the filesystem error string `FileManager::getFileRef` in `clang-import-test`'s `ParseSource` function. Also include "error:" and a newline in the output. As a side effect, migrate to the `FileEntryRef` overload of `SourceManager::createFileID`. No real functionality change here, just slightly better output on error. Differential Revision: https://reviews.llvm.org/D92971	2020-12-11 17:07:58 -08:00
Duncan P. N. Exon Smith	a600432199	Frontend: Migrate to FileEntryRef in TextDiagnosticTest, NFC Migrate over to the `FileEntryRef` overloads of `SourceManager::createFileID` and `overrideFileContents` (using `getVirtualFileRef`) in `TextDiagnostic`'s `ShowLine` test. No functionality change. Differential Revision: https://reviews.llvm.org/D92968	2020-12-11 17:06:28 -08:00
Jonas Paulsson	42f628c842	Reapply "[SystemZFrameLowering] Don't overrwrite R1D (backchain) when probing." Fixed to properly compute the live-in lists of new blocks. Review: Ulrich Weigand Differential Revision: https://reviews.llvm.org/D92803	2020-12-11 18:25:47 -06:00
Jonas Paulsson	0c2d23933f	[SystemZTTIImpl] Allow some non-prefetched accesses in getMinPrefetchStride(). The performance improvement on LBM previously achieved with improved software prefetching (`36d4421`) have gone lost recently with `e00f189`. There now is one memory access in the loop that LoopDataPrefetch cannot handle (while before there was none) which the heuristic rejects. This patch adds a small margin by allowing 1 non-prefetched memory access for every 32 prefetched ones, so that the heuristic doesn't bail in this type of case. Review: Ulrich Weigand Differential Revision: https://reviews.llvm.org/D92985	2020-12-11 18:06:07 -06:00
diggerlin	7c8072ce2d	[AIX] Fixed a link error. Summary: "Speculative fix for link failure on bots" with a mention of "the clang-ppc64le-rhel bot fails on link: http://lab.llvm.org:8011/#/builders/57/builds/2307/steps/6/logs/stdio". PPCAsmPrinter.cpp:(.text._ZN12_GLOBAL__N_116PPCAIXAsmPrinter19emitFunctionBodyEndEv+0x2f8): undefined reference to `llvm::XCOFF::getNameForTracebackTableLanguageId(llvm::XCOFF::TracebackTable::LanguageID)' PPCAsmPrinter.cpp:(.text._ZN12_GLOBAL__N_116PPCAIXAsmPrinter19emitFunctionBodyEndEv+0x2170): undefined reference to `llvm::XCOFF::parseParmsType(unsigned int, unsigned int)'	2020-12-11 18:53:10 -05:00
Craig Topper	6e9e53895c	[LoopIdiomRecognize] Autogenerate complete checks for the X86 ctlz/cttz tests. NFC Preparation for D92745 which will add more tests to these files.	2020-12-11 15:35:37 -08:00
Nikita Popov	8d4b139e9d	Revert "Consider reference, pointer, and pointer-to-member TemplateArguments to be different if they have different types." This reverts commit `7b3470baf8`. Causes a crash while building tramp3d-v4 from test-suite.	2020-12-12 00:04:10 +01:00
diggerlin	997d286f2d	[AIX][XCOFF] emit traceback table for function in aix SUMMARY: 1. added a new option -xcoff-traceback-table to control whether generate traceback table for function. 2. implement the functionality of emit traceback table of a function. Reviewers: hubert.reinterpretcast, Jason Liu Differential Revision: https://reviews.llvm.org/D92398	2020-12-11 17:50:25 -05:00
Mehdi Amini	aadcb26ee1	Store a MlirIdentifier instead of a MlirStringRef in MlirNameAttribute This mirror the C++ API for NamedAttribute, and has the advantage or internalizing earlier in the Context and not requiring the caller to keep the StringRef alive beyong this call. Differential Revision: https://reviews.llvm.org/D93133	2020-12-11 22:38:48 +00:00

1 2 3 4 5 ...

374688 Commits All Branches Search

374688 Commits

All Branches