llvm-project

Commit Graph

Author	SHA1	Message	Date
Nikita Popov	719354a571	Revert "[SCEV] Fix and validate ValueExprMap/ExprValueMap consistency" This reverts commit `bee8dcda1f`. Some sanitizer buildbots fail with: > Attempt to use a SCEVCouldNotCompute object! For example: https://lab.llvm.org/buildbot/#/builders/85/builds/7020/steps/9/logs/stdio	2021-11-26 22:18:23 +01:00
Erik Desjardins	a68af62b42	[InstSimplify] baseline tests for icmp of lshr/udiv fold (NFC) Precommits tests for https://reviews.llvm.org/D114279 Differential Revision: https://reviews.llvm.org/D114280	2021-11-26 15:57:04 -05:00
Peter Klausler	45a8caf1cd	[flang] Fix reversed comparison in RESHAPE() runtime RESHAPE() fails inappropriately at runtime if the source array is larger than the result -- which is perfectly valid -- because of an obviously reversed comparison of their numbers of elements is activating the runtime asserts meant for the opposite case (source smaller than result). Differential Revision: https://reviews.llvm.org/D114474	2021-11-26 12:34:00 -08:00
Kazu Hirata	803cec0268	[mlir] Fix a warning This patch fixes: mlir/lib/IR/MLIRContext.cpp:1020:3: error: use of the 'nodiscard' attribute is a C++17 extension [-Werror,-Wc++17-extensions]	2021-11-26 12:27:11 -08:00
Nikita Popov	bfa91f38a9	[DAG] Restore dropped condition This was dropped in `fcee33bd5a`, presumably accidentally.	2021-11-26 21:18:54 +01:00
Nikita Popov	bee8dcda1f	[SCEV] Fix and validate ValueExprMap/ExprValueMap consistency Relative to the previous landing attempt, this makes insertValueToMap() resilient against the value already being present in the map -- previously I only checked this for the createSimpleAffineAddRec() case, but the same issue can also occur for the general createNodeForPHI(). In both cases, the addrec may be constructed and added to the map in a recursive query trying to create said addrec. In this case, this happens due to the invalidation when the BE count is computed, which ends up clearing out the symbolic name as well. ----- This adds validation for consistency of ValueExprMap and ExprValueMap, and fixes identified issues: * Addrec construction directly wrote to ValueExprMap in a few places, without updating ExprValueMap. Add a helper to ensures they stay consistent. The adjustment in forgetSymbolicName() explicitly drops the old value from the map, so that we don't rely on it being overwritten. * forgetMemoizedResultsImpl() was dropping the SCEV from ExprValueMap, but not dropping the corresponding entries from ValueExprMap. Differential Revision: https://reviews.llvm.org/D113349	2021-11-26 20:57:47 +01:00
Fangrui Song	3b4dd68de5	[ELF][PPC64] Make --power10-stubs/--no-power10-stubs proper aliases for --power10-stubs={auto,no} This allows --power10-stubs= and --[no-]power10-stubs to override each other (they are position dependent in GNU ld). Also improve --help messages and the manpage. Note: GNU ld's default "auto" mode uses heuristics to decide whether Power10 instructions are used. Arguably it is a design mistake of R_PPC64_REL24_NOTOC (acked by the relevant folks on a libc-alpha discussion). We don't implement "auto", so the default --power10-stubs is the same as "yes".	2021-11-26 11:51:45 -08:00
Arnab Dutta	c2280b5517	[MLIR] Avoid creation of buggy affine maps when incorrect values of number of dimensions and number of symbols are provided. We check whether the maximum index of dimensional identifier present in the result expressions is less than dimCount (number of dimensional identifiers) argument passed in the AffineMap::get() and the maximum index of symbolic identifier present in the result expressions is less than symbolCount (number of symbolic identifiers) argument passed in AffineMap::get(). Reviewed By: nicolasvasilache, bondhugula Differential Revision: https://reviews.llvm.org/D114238	2021-11-27 00:37:08 +05:30
Arnab Dutta	e4e4da86af	[MLIR] Prevent creation of buggy affine map after linearizing collapsed dimensions of source map Initially we were passing wrong numSymbols argument while calling AffineMap::get() for creaating affine map with linearized result expressions. The main problems was the number of symbols of the newly to be created map may be different from that of the source map, as new symbolic identifiers may be introduced while creating strided layout linearized expressions. Reviewed By: nicolasvasilache, bondhugula Differential Revision: https://reviews.llvm.org/D114240	2021-11-27 00:32:58 +05:30
Fangrui Song	09401dfcf1	[ELF] Rename fetch to extract The canonical term is "extract" (GNU ld documentation, Solaris's `-z *extract` options). Avoid inventing a term and match --why-extract. (ld64 prefers "load" but the word is overloaded too much) Mostly MFC, except for --help messages and the header row in --print-archive-stats output.	2021-11-26 10:58:50 -08:00
Simon Pilgrim	fcee33bd5a	[DAG] Pull out repeated isLittleEndian() calls. NFC.	2021-11-26 18:41:56 +00:00
Chris Jones	344eee6f38	[MLIR] Allow `Idempotent` trait to be applied to binary ops. Add `Idempotent` trait to `arith.{andi,ori}`. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D114574	2021-11-26 18:22:49 +00:00
Louis Dionne	5c454033dd	[libc++] Trigger rebuild of the Docker image so we get a new nightly Clang	2021-11-26 12:57:30 -05:00
Michal Terepeta	7e65fc9a60	[mlir][Vector] Support 0-D vectors in `BroadcastOp` This changes the op to produce `AnyVectorOfAnyRank` following mostly the code for 1-D vectors. Depends On D114598 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D114550	2021-11-26 17:17:18 +00:00
Michal Terepeta	d0f927121e	[mlir][Standard] Support 0-D vectors in `SplatOp` This changes the op to produce `AnyVectorOfAnyRank` and implements this by just inserting the element (skipping the shuffle that we do for the 1-D case). Depends On D114549 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D114598	2021-11-26 17:05:15 +00:00
Arjun P	ad34ce94d5	[MLIR] Simplex: fix a bug when rolling back a Simplex with no solutions Previously, when adding a constraint to a Simplex that is already marked as having no solutions (marked empty), the Simplex would be marked empty again, and a second UnmarkEmpty entry would be pushed to the undo log. When rolling back, Simplex should be unmarked empty only after rolling back past the creation of the first constraint that made it empty. Reviewed By: Groverkss Differential Revision: https://reviews.llvm.org/D114613	2021-11-26 22:33:48 +05:30
Zarko Todorovski	715d2dc126	[llvm-cov][NFC] Add missing character to fix docs buildbot break.	2021-11-26 11:57:10 -05:00
Siva Chandra Reddy	7b59fcb7de	[libc] Make string entrypoints mutualy exclusive. For example, strcpy does not pull memcpy now. Reviewed By: gchatelet Differential Revision: https://reviews.llvm.org/D114300	2021-11-26 16:29:22 +00:00
Kazu Hirata	562356d6e3	[Target] Use range-based for loops (NFC)	2021-11-26 08:23:01 -08:00
Arjun P	f074bbb04a	[MLIR] Simplex::pivot: also update the redundant rows when pivoting Previously, the pivot function would only update the non-redundant rows when pivoting. This is incorrect because in some cases, when rolling back past a `detectRedundant` call, the basis being used could be different from that which was used at the time of returning from the `detectRedundant` call. Therefore, it is important to update the redundant rows as well during pivots. This could also be triggered by pivots that occur when testing successive constraints for being redundant in `detectRedundant` after some initial constraints are marked redundant. Reviewed By: Groverkss Differential Revision: https://reviews.llvm.org/D114614	2021-11-26 21:42:41 +05:30
Zarko Todorovski	e714394ab8	[LLVM][llvm-cov] Inclusive language: rename option -name-whitelist to -name-allowlist Renamed the option for llvm-cov and changed variable names to use more inclusive terms. Also changed the binary for the test. Reviewed By: alanphipps Differential Revision: https://reviews.llvm.org/D112816	2021-11-26 11:08:01 -05:00
Louis Dionne	f18f9ce366	[libc++] Properly handle errors happening during Lit configuration Instead of silently swallowing errors that happen during Lit configuration (for example trying to obtain compiler macros but compiling fails), raise an exception with some amount of helpful information. This should avoid the possibility of silently configuring Lit in a bogus way, and also provides more helpful information when things fail. Note that this requires a bit more finesse around how we handle some failing configuration checks that we would previously return None for. Differential Revision: https://reviews.llvm.org/D114010	2021-11-26 11:03:15 -05:00
Louis Dionne	7dc9a03cfd	[libc++] Add missing __format__ attributes -Wformat-nonliteral was turned on in https://reviews.llvm.org/D112927, however we forgot to apply some __format__ attributes in Linux specific code paths, which led to warnings when building on Linux. This patch addresses that oversight. Differential Revision: https://reviews.llvm.org/D113876	2021-11-26 11:03:14 -05:00
David Spickett	0df522969a	Revert "Reland "[lldb] Remove non address bits when looking up memory regions"" This reverts commit `fac3f20de5`. I found this has broken how we detect the last memory region in GetMemoryRegions/"memory region" command. When you're debugging an AArch64 system with pointer authentication, the ABI plugin will remove the top bit from the end address of the last user mapped area. (lldb) [0x0000fffffffdf000-0x0001000000000000) rw- [stack] ABI plugin removes anything above the 48th bit (48 bit virtual addresses by default on AArch64, leaving an address of 0. (lldb) [0x0000000000000000-0x0000000000400000) --- You get back a mapping for 0 and get into an infinite loop.	2021-11-26 15:35:02 +00:00
Kirill Bobyrev	34cc210aa8	[clangd] IncludeCleaner: Attribute symbols from non self-contained headers to their parents When a symbol comes from the non self-contained header, we recursively uplift the file we consider used to the first includer that has a header guard. We need to do this while we still have FileIDs because every time a non self-contained header is included, it gets a new FileID but is later deduplicated by HeaderID and it's not possible to understand where it was included from. Based on D114370. Reviewed By: sammccall Differential Revision: https://reviews.llvm.org/D114623	2021-11-26 16:20:48 +01:00
Alexey Bataev	fc0aacf324	[SLP]Improve analysis/emission of vector operands for alternate nodes. Compiler has an analysis for perfect diamond matching but it does not support nodes with main/alternate opcodes. The problem is that the scalars themselves are different and might not match directly with other nodes, but operands and main/alternate opcodes might match and compiler might reuse some previously emitted vector instructions. Need to include this analysis in the cost model and actual vector instructions emission process. Differential Revision: https://reviews.llvm.org/D114101	2021-11-26 06:38:02 -08:00
Ruslan Arutyunyan	f824bb0e36	[pstl] Fix incorrect usage of std::invoke_result std::invoke_result takes function object type and arguments separately (unlike std::result_of) so, std::invoke_result_t<F()> usage is incorrect. On the other hand, we don't need std::invoke() semantics here at all. So just simplifying the code without extra dependency and use trailing return type as the fix. Reviewed By: MikeDvorskiy Differential Revision: https://reviews.llvm.org/D114624	2021-11-26 17:29:08 +03:00
Mats Petersson	30238c3676	[mlir][OpenMP] Add support for SIMD modifier Add support for SIMD modifier in OpenMP worksharing loops. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D111051	2021-11-26 14:04:46 +00:00
Alexey Bataev	6263982172	[SLP][NFC]Add a test for gathered instructions in loop, NFC.	2021-11-26 05:52:48 -08:00
Venkata Ramanaiah Nalamothu	7f05ff8be4	[Bug 49018][lldb] Fix incorrect help text for 'memory write' command Certain commands like 'memory write', 'register read' etc all use the OptionGroupFormat options but the help usage text for those options is not customized to those commands. One such example is: (lldb) help memory read -s <byte-size> ( --size <byte-size> ) The size in bytes to use when displaying with the selected format. (lldb) help memory write -s <byte-size> ( --size <byte-size> ) The size in bytes to use when displaying with the selected format. This patch allows such commands to overwrite the help text for the options in the OptionGroupFormat group as needed and fixes help text of memory write. llvm.org/pr49018. Reviewed By: DavidSpickett Differential Revision: https://reviews.llvm.org/D114448	2021-11-26 19:14:26 +05:30
Florian Hahn	b927aa69bf	[SCEV] Turn check in createSimpleAffineAddRec to assertion. (NFC) Accum is guaranteed to be defined outside L (via Loop::isLoopInvariant checks above). I think that should guarantee that the more powerful ScalarEvolution::isLoopInvariant also determines that the value is loop invariant. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D114634	2021-11-26 13:23:48 +00:00
Matthias Springer	b62b21b980	[mlir][linalg][bufferize][NFC] InsertSliceOp no-copy detection as PostAnalysis There is special logic for InsertSliceOp to check if a memcpy is needed. This change extracts that piece of code and makes it a PostAnalysisStep. The purpose of this change is to untangle `bufferize` from BufferizationAliasInfo. (Not fully there yet.) Differential Revision: https://reviews.llvm.org/D114513	2021-11-26 22:19:29 +09:00
Kirill Bobyrev	cd0ca5a0ea	[clangd] Record information about non self-contained headers in IncludeStructure This will be useful for IncludeCleaner. Reviewed By: sammccall Differential Revision: https://reviews.llvm.org/D114370	2021-11-26 14:12:54 +01:00
Benjamin Kramer	8521850f20	Provide a definition for OperationPosition::kDown This isn't necessary in C++17, but C++14 still requires it.	2021-11-26 14:11:59 +01:00
Benjamin Kramer	1b0312d280	[PDL] fix unused variable warning in Release builds	2021-11-26 14:11:58 +01:00
Benjamin Kramer	0e099a64be	[tsan] Relax atexit5.cpp a bit more so it's not as dependent on the standard library implementation	2021-11-26 14:02:34 +01:00
Jan Svoboda	97e504cff9	[clang][deps] NFC: Extract function This commits extracts a couple of nested conditions into a separate function with early returns, making the control flow easier to understand.	2021-11-26 14:01:24 +01:00
Stanislav Funiak	d35f119094	Added line numbers to the debug output of PDL bytecode. This is a small diff that splits out the debug output for PDL bytecode. When running bytecode with debug output on, it is useful to know the line numbers where the PDLIntepr operations are performed. Usually, these are in a single MLIR file, so it's sufficient to print out the line number rather than the entire location (which tends to be quite verbose). This debug output is gated by `LLVM_DEBUG` rather than `#ifndef NDEBUG` to make it easier to test. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D114061	2021-11-26 18:11:37 +05:30
Stanislav Funiak	a76ee58f3c	Multi-root PDL matching using upward traversals. This is commit 4 of 4 for the multi-root matching in PDL, discussed in https://llvm.discourse.group/t/rfc-multi-root-pdl-patterns-for-kernel-matching/4148 (topic flagged for review). This PR integrates the various components (root ordering algorithm, nondeterministic execution of PDL bytecode) to implement multi-root PDL matching. The main idea is for the pattern to specify mulitple candidate roots. The PDL-to-PDLInterp lowering selects one of these roots and "hangs" the pattern from this root, traversing the edges downwards (from operation to its operands) when possible and upwards (from values to its uses) when needed. The root is selected by invoking the optimal matching multiple times, once for each candidate root, and the connectors are determined form the optimal matching. The costs in the directed graph are equal to the number of upward edges that need to be traversed when connecting the given two candidate roots. It can be shown that, for this choice of the cost function, "hanging" the pattern an inner node is no better than from the optimal root. The following three main additions were implemented as a part of this PR: 1. OperationPos predicate has been extended to allow tracing the operation accepting a value (the opposite of operation defining a value). 2. Predicate checking if two values are not equal - this is useful to ensure that we do not traverse the edge back downwards after we traversed it upwards. 3. Function for for building the cost graph among the candidate roots. 4. Updated buildPredicateList, building the predicates optimal branching has been determined. Testing: unit tests (an integration test to follow once the stack of commits has landed) Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D108550	2021-11-26 18:11:37 +05:30
Stanislav Funiak	6df7cc7f47	Implementation of the root ordering algorithm This is commit 3 of 4 for the multi-root matching in PDL, discussed in https://llvm.discourse.group/t/rfc-multi-root-pdl-patterns-for-kernel-matching/4148 (topic flagged for review). We form a graph over the specified roots, provided in `pdl.rewrite`, where two roots are connected by a directed edge if the target root can be connected (via a chain of operations) in the underlying pattern to the source root. We place a restriction that the path connecting the two candidate roots must only contain the nodes in the subgraphs underneath these two roots. The cost of an edge is the smallest number of upward traversals (edges) required to go from the source to the target root, and the connector is a `Value` in the intersection of the two subtrees rooted at the source and target root that results in that smallest number of such upward traversals. Optimal root ordering is then formulated as the problem of finding a spanning arborescence (i.e., a directed spanning tree) of minimal weight. In order to determine the spanning arborescence (directed spanning tree) of minimum weight, we use the [Edmonds' algorithm](https://en.wikipedia.org/wiki/Edmonds%27_algorithm). The worst-case computational complexity of this algorithm is O(_N_^3) for a single root, where _N_ is the number of specified roots. The `pdl`-to-`pdl_interp` lowering calls this algorithm as a subroutine _N_ times (once for each candidate root), so the overall complexity of root ordering is O(_N_^4). If needed, this complexity could be reduced to O(_N_^3) with a more efficient algorithm. However, note that the underlying implementation is very efficient, and _N_ in our instances tends to be very small (<10). Therefore, we believe that the proposed (asymptotically suboptimal) implementation will suffice for now. Testing: a unit test of the algorithm Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D108549	2021-11-26 18:11:37 +05:30
Stanislav Funiak	3eb1647af0	Introduced iterative bytecode execution. This is commit 2 of 4 for the multi-root matching in PDL, discussed in https://llvm.discourse.group/t/rfc-multi-root-pdl-patterns-for-kernel-matching/4148 (topic flagged for review). This commit implements the features needed for the execution of the new operations pdl_interp.get_accepting_ops, pdl_interp.choose_op: 1. The implementation of the generation and execution of the two ops. 2. The addition of Stack of bytecode positions within the ByteCodeExecutor. This is needed because in pdl_interp.choose_op, we iterate over the values returned by pdl_interp.get_accepting_ops until we reach finalize. When we reach finalize, we need to return back to the position marked in the stack. 3. The functionality to extend the lifetime of values that cross the nondeterministic choice. The existing bytecode generator allocates the values to memory positions by representing the liveness of values as a collection of disjoint intervals over the matcher positions. This is akin to register allocation, and substantially reduces the footprint of the bytecode executor. However, because with iterative operation pdl_interp.choose_op, execution "returns" back, so any values whose original liveness cross the nondeterminstic choice must have their lifetime executed until finalize. Testing: pdl-bytecode.mlir test Reviewed By: rriddle, Mogball Differential Revision: https://reviews.llvm.org/D108547	2021-11-26 18:11:37 +05:30
Igor Kirillov	08d45e6f4d	[AArch64][SVEIntrinsicOpts] Fix: predicated SVE mul/fmul are not commutative We can not swap multiplicand and multiplier because the sve intrinsics are predicated. Imagine lanes in vectors having the following values: pg = 0 multiplicand = 1 (from dup) multiplier = 2 The resulting value should be 1, but if we swap multiplicand and multiplier it will become 2, which is incorrect. Differential Revision: https://reviews.llvm.org/D114577	2021-11-26 12:41:27 +00:00
Bhumitram Kumar	a3b099b68c	[Docs] Removed /Zd flag still mentioned in documentation https://reviews.llvm.org/D93458 removed the /Zd flag as MSVC doesn't support that syntax. Instead users should be using -gline-tables-only. The /Zd flag is still mentioned at https://clang.llvm.org/docs/UsersManual.html#clang-cl : /Zd Emit debug line number tables only. Fix PR52571 Reviewed By: xgupta Differential Revision: https://reviews.llvm.org/D114632	2021-11-26 18:08:06 +05:30
Stanislav Funiak	842b6861c0	Defines new PDLInterp operations needed for multi-root matching in PDL. This is commit 1 of 4 for the multi-root matching in PDL, discussed in https://llvm.discourse.group/t/rfc-multi-root-pdl-patterns-for-kernel-matching/4148 (topic flagged for review). These operations are: * pdl.get_accepting_ops: Returns a list of operations accepting the given value or a range of values at the specified position. Thus if there are two operations `%op1 = "foo"(%val)` and `%op2 = "bar"(%val)` accepting a value at position 0, `%ops = pdl_interp.get_accepting_ops of %val : !pdl.value at 0` will return both of them. This allows us to traverse upwards from a value to operations accepting the value. * pdl.choose_op: Iteratively chooses one operation from a range of operations. Therefore, writing `%op = pdl_interp.choose_op from %ops` in the example above will select either `%op1`or `%op2`. Testing: Added the corresponding test cases to mlir/test/Dialect/PDLInterp/ops.mlir. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D108543	2021-11-26 17:59:22 +05:30
Daniel Kiss	632acec737	[libunwind][ARM] Handle end of stack during unwind When unwind step reaches the end of the stack that means the force unwind should notify the stop function. This is not an error, it could mean just the thread is cleaned up completely. Reviewed By: #libunwind, mstorsjo Differential Revision: https://reviews.llvm.org/D109856	2021-11-26 13:26:49 +01:00
Carl Ritson	8967d044fc	[AMDGPU] Add SIMemoryLegalizer comments to clarify bit usage Attempt to further document the intended cache policies requested by different combinations of GLC, SLC and DLC bits. GFX10 non-temporal stores are updated to set GLC. Reviewed By: t-tye Differential Revision: https://reviews.llvm.org/D114351	2021-11-26 21:05:58 +09:00
Abinav Puthan Purayil	4af45f10cc	[GlobalISel] Fold or of shifts to funnel shift. This change folds a basic funnel shift idiom: - (or (shl x, amt), (lshr y, sub(bw, amt))) -> fshl(x, y, amt) - (or (shl x, sub(bw, amt)), (lshr y, amt)) -> fshr(x, y, amt) This also helps in folding to rotate shift if x and y are equal since we already have a funnel shift to rotate combine. Differential Revision: https://reviews.llvm.org/D114499	2021-11-26 17:05:29 +05:30
David Sherwood	e20391fc5d	[LoopVectorize] When tail-folding, don't always predicate uniform loads In VPRecipeBuilder::handleReplication if we believe the instruction is predicated we then proceed to create new VP region blocks even when the load is uniform and only predicated due to tail-folding. I have updated isPredicatedInst to avoid treating a uniform load as predicated when tail-folding, which means we can do a single scalar load and a vector splat of the value. Tests added here: Transforms/LoopVectorize/AArch64/tail-fold-uniform-memops.ll Differential Revision: https://reviews.llvm.org/D112552	2021-11-26 11:30:54 +00:00
Jan Svoboda	12eafd944e	[clang][deps] NFC: Clean up wording (ignored vs minimized) The filesystem used during dependency scanning does two things: it caches file entries and minimizes source file contents. We use the term "ignored file" in a couple of places, but it's not clear what exactly that means. This commit clears up the semantics, explicitly spelling out this relates to minimization.	2021-11-26 12:18:37 +01:00
Jan Svoboda	d8a3538788	[clang][deps] NFC: Remove else after early return	2021-11-26 12:18:37 +01:00

... 4 5 6 7 8 ...

406005 Commits All Branches Search

406005 Commits

All Branches