llvm-project

Commit Graph

Author	SHA1	Message	Date
Nikita Popov	61cfdf636d	[BasicAA] Model implicit trunc of GEP indices GEP indices larger than the GEP index size are implicitly truncated to the index size. BasicAA currently doesn't model this, resulting in incorrect alias analysis results. Fix this by explicitly modelling truncation in CastedValue in the same way we do zext and sext. Additionally we need to disable a number of optimizations for truncated values, in particular "non-zero" and "non-equal" may no longer hold after truncation. I believe the constant offset heuristic is also not necessarily correct for truncated values, but wasn't able to come up with a test for that one. A possible followup here would be to use the new mechanism to model explicit trunc as well (which should be much more common, as it is the canonical form). This is straightforward, but omitted here to separate the correctness fix from the analysis improvement. (Side note: While I say "index size" above, BasicAA currently uses the pointer size instead. Something for another day...) Differential Revision: https://reviews.llvm.org/D110977	2021-10-22 23:47:02 +02:00
Peter Klausler	a5a493e192	[flang] Speed common runtime cases of DOT_PRODUCT & MATMUL Look for contiguous numeric argument arrays at runtime and use specialized code for them. Differential Revision: https://reviews.llvm.org/D112239	2021-10-22 14:36:13 -07:00
peter klausler	14b90d1fe0	[flang] Fix generic resolution case Don't try to convert INTEGER argument expressions to the kind of the dummy argument when performing generic resolution; specific procedures may be distinguished only by their kinds. Differential Revision: https://reviews.llvm.org/D112240	2021-10-22 13:38:47 -07:00
peter klausler	b8452dba28	[flang] Support NAMELIST input of short arrays NAMELIST array input does not need to fully define an array. If another input item begins after at least one element, it ends input into the array and the remaining items are not modified. The tricky part of supporting this feature is that it's not always easy to determine whether the next non-blank thing in the input is a value or the next item's name, esp. in the case of logical data where T and F can be names. E.g., &group logicalArray = t f f t = 1 / should read three elements into "logicalArray" and then read an integer or real variable named "t". So the I/O runtime has to do some look-ahead to determine whether the next thing in the input is a name followed by '=', '(', or '%'. Since the '=' may be on a later record, possibly with intervening NAMELIST comments, the runtime has to support a general form of saving and restoring its current position. The infrastructure in the I/O runtime already has to support repositioning for list-directed repetition, even on non-positionable input sources like terminals and sockets; this patch adds an internal RAII API to make it easier to save a position and then do arbitrary look-ahead. Differential Revision: https://reviews.llvm.org/D112245	2021-10-22 13:38:11 -07:00
Vy Nguyen	236197e2d0	[lld-macho] Implement -oso_prefix https://bugs.llvm.org/show_bug.cgi?id=50229 Differential Revision: https://reviews.llvm.org/D112291	2021-10-22 16:32:42 -04:00
Nicolas Vasilache	89d55d3c86	[mlir][Linalg] Retire CodegenStrategy::transform Instead each pass should constructed a nested OpPassManager and runPipeline on that. Differential Revision: https://reviews.llvm.org/D112308	2021-10-22 20:27:14 +00:00
Jason Molenda	e4235bdf26	Fix locals naming in DNBArchMachARM64::GetGPRState for 32-bit builds The local variables names used for logging when built on armv7k weren't unique, resulting in build error. rdar://84274006	2021-10-22 13:24:25 -07:00
Matt Arsenault	ec57b37551	AMDGPU: Use attributor to propagate amdgpu-flat-work-group-size This can merge the acceptable ranges based on the call graph, rather than the simple application of the attribute. Remove the handling from the old pass.	2021-10-22 16:23:50 -04:00
Matt Arsenault	8d4b74ac3f	AMDGPU: Don't consider whether amdgpu-flat-work-group-size was set It should be semantically identical if it was set to the same value as the default. Also improve the documentation.	2021-10-22 16:23:50 -04:00
Craig Topper	cd824f9e39	[X86] Fix bad formatting. NFC	2021-10-22 13:16:35 -07:00
Louis Dionne	a41837d1da	[libc++][NFC] Remove duplicate Python imports	2021-10-22 16:16:28 -04:00
Duncan P. N. Exon Smith	2410fb4616	Support: Use Expected<T>::moveInto() in a few places These are some usage examples for `Expected<T>::moveInto()`. Differential Revision: https://reviews.llvm.org/D112280	2021-10-22 12:40:10 -07:00
peter klausler	c4ba1108dd	[flang] Extension to distinguish specific procedures Allocatable dummy arguments can be used to distinguish two specific procedures in a generic interface when it is the case that exactly one of them is polymorphic or exactly one of them is unlimited polymorphic. The standard requires that an actual argument corresponding to an (unlimited) polymorphic allocatable dummy argument must also be an (unlimited) polymorphic allocatable, so an actual argument that's acceptable to one procedure must necessarily be a bad match for the other. Differential Revision: https://reviews.llvm.org/D112237	2021-10-22 12:38:21 -07:00
Matt Arsenault	7d962f9ca3	AMDGPU: Regenerate MIR test checks Recently this started using -NEXT checks, so regenerate these to avoid extra test churn in a future change.	2021-10-22 15:36:50 -04:00
Matt Arsenault	ae698f89b8	AMDGPU: Fix hardcoded registers in tests	2021-10-22 15:36:50 -04:00
Nicolas Vasilache	489fec2777	[mlir][Linalg] NFC - Drop Optional in favor of FailureOr Differential revision: https://reviews.llvm.org/D112332	2021-10-22 19:28:18 +00:00
Jay Foad	58e7ec471c	[AMDGPU] Run SIShrinkInstructions before post-RA scheduling Run post-RA SIShrinkInstructions just before post-RA scheduling, instead of afterwards. After the fixes in D112305 and D112317 this seems to make no difference, but it paves the way for scheduler tweaks that are sensitive to the e32 vs e64 encoding of VALU instructions. Differential Revision: https://reviews.llvm.org/D112341	2021-10-22 20:24:03 +01:00
Med Ismail Bennani	42e4959253	[lldb/Formatters] Remove space from vector type string summaries (NFCI) This patch changes the string summaries for vector types by removing the space between the type and the bracket, conforming to `277623f4d5`. This should also fix TestCompactVectors failure. Differential Revision: https://reviews.llvm.org/D112340 Signed-off-by: Med Ismail Bennani <medismail.bennani@gmail.com>	2021-10-22 21:18:54 +02:00
Jay Foad	3f34f75a68	[AMDGPU] Fix latency for implicit vcc_lo operands on GFX10 wave32 As described in the comment, the way we change vcc to vcc_lo in these operands confuses addPhysRegDataDeps into treating them as implicit pseudo operands. Fix this by setting the correct latency from the SchedModel after addPhysRegDataDeps wrongly set it to 0. Differential Revision: https://reviews.llvm.org/D112317	2021-10-22 20:03:29 +01:00
Jay Foad	2915889d74	[ScheduleDAGInstrs] Call adjustSchedDependency in more cases This removes a condition and the corresponding FIXME comment, because the Hexagon assertion it refers to has apparently been fixed, probably by D76134. NFCI. This just gives targets the opportunity to adjust latencies that were set to 0 by the generic code because they involve "implicit pseudo" operands. Differential Revision: https://reviews.llvm.org/D112306	2021-10-22 20:03:29 +01:00
Stanislav Mekhanoshin	55f7cc1a9a	[InstCombine] Precommit new and-xor-or.ll tests. NFC.	2021-10-22 11:59:15 -07:00
Duncan P. N. Exon Smith	27181cad0d	Support: Add Expected<T>::moveInto() to avoid extra names Expected<T>::moveInto() takes as an out parameter any `OtherT&` that's assignable from `T&&`. It moves any stored value before returning takeError(). Since moveInto() consumes both the Error and the value, it's only anticipated that we'd use call it on temporaries/rvalues, with naming the Expected first likely to be an anti-pattern of sorts (either you want to deal with both at the same time, or you don't). As such, starting it out as `&&`-qualified... but it'd probably be fine to drop that if there's a good use case for lvalues that appears. There are two common patterns that moveInto() cleans up: ``` // If the variable is new: Expected<std::unique_ptr<int>> ExpectedP = makePointer(); if (!ExpectedP) return ExpectedP.takeError(); std::unique_ptr<int> P = std::move(ExpectedP); // If the target variable already exists: if (Expected<T> ExpectedP = makePointer()) P = std::move(ExpectedP); else return ExpectedP.takeError(); ``` moveInto() takes less typing and avoids needing to name (or leak into the scope) an extra variable. ``` // If the variable is new: std::unique_ptr<int> P; if (Error E = makePointer().moveInto(P)) return E; // If the target variable already exists: if (Error E = makePointer().moveInto(P)) return E; ``` It also seems useful for unit tests, to log errors (but continue) when there's an unexpected failure. E.g.: ``` // Crash on error, or undefined in non-asserts builds. std::unique_ptr<MemoryBuffer> MB = cantFail(makeMemoryBuffer()); // Avoid crashing on error without moveInto() :(. Expected<std::unique_ptr<MemoryBuffer>> ExpectedMB = makeMemoryBuffer(); ASSERT_THAT_ERROR(ExpectedMB.takeError(), Succeeded()); std::unique_ptr<MemoryBuffer> MB = std::move(ExpectedMB); // Avoid crashing on error with moveInto() :). std::unique_ptr<MemoryBuffer> MB; ASSERT_THAT_ERROR(makeMemoryBuffer().moveInto(MB), Succeeded()); ``` Differential Revision: https://reviews.llvm.org/D112278	2021-10-22 11:47:03 -07:00
Nikita Popov	4d692daa3a	[ConstantFolding] Drop misleading comment (NFC) As pointed out by Philip, this part of the comment is misleading, as it describes undef rather than poison behavior. Just mentioning poison should be sufficient.	2021-10-22 20:45:25 +02:00
Stephen Tozer	6cf6917917	[Dexter] Add DexFinishTest command to conditionally early-exit a test program This patch adds a command, DexFinishTest, that allows a Dexter test to be conditionally finished at a given breakpoint. This command has the same set of arguments as DexLimitSteps, except that it does not allow a line range (from_line, to_line), only a single line (on_line). Reviewed By: Orlando Differential Revision: https://reviews.llvm.org/D111988	2021-10-22 19:41:07 +01:00
Louis Dionne	e6f39c8b48	[libunwind] Fix path to libunwind for per-target-runtime-dir builds We recently introduced a from-scratch config to run the libunwind tests. However, that config was always looking for libunwind in <install>/lib, and never in <install>/<target>/lib, which is necessary for tests to work when the per-target-runtime-dir configuration is enabled. This commit fixes that. I believe this is what caused the CI failures we saw after `5a8ad80b6f` and caused it to be reverted. Differential Revision: https://reviews.llvm.org/D112322	2021-10-22 14:36:14 -04:00
peter klausler	f2360e1156	[flang] Enforce rest of semantic constraint C919 A reference to an allocatable or pointer component must be applied to a scalar base object. (This is the second part of constraint C919; the first part is already checked.) Differential Revision: https://reviews.llvm.org/D112241	2021-10-22 11:35:58 -07:00
Jeremy Morse	e7084ceab3	[DebugInfo][Instr] Track subregisters across stack spills/restores Sometimes we generate code that writes to a subregister, then spills / restores a super-register to the stack, for example: $eax = MOV32ri 0 MOV64mr $rsp, 1, $noreg, 16, $noreg, $rax $rcx = MOV64rm $rsp, 1, $noreg, 8, $noreg This patch takes a different approach: it adds another index to MLocTracker that identifies a size/offset within a stack slot. A location on the stack is then a pari of {FrameIndex, SlotNum}. Spilling and restoring now involves pairing up the src/dest register numbers, and the dest/src stack position to be transferred to/from. Location coverage improves as a result, compile-time performance decreases, alas. One limitation is that if a PHI occurs inside a stack slot: DBG_PHI %stack.0, 1 We don't know how large the resulting value is, and so might have difficulty picking which value to use. DBG_PHI might need to be augmented in the future with such a size. Unit tests added ensure that spills and restores correctly transfer to positions in the Location => Value map, and that different register classes written to the stack will correctly clobber all other positions in the stack slot. Differential Revision: https://reviews.llvm.org/D112133	2021-10-22 19:20:55 +01:00
peter klausler	b03628d986	[flang] Emit unformatted headers & footers even with RECL= The runtime library was emitting unformatted record headers and footers when an external unit had no fixed RECL=. This is wrong for sequential files, which should have headers & footers even with RECL. Change to omit headers & footers from unformatted I/O only for direct access files. Differential Revision: https://reviews.llvm.org/D112243	2021-10-22 11:07:01 -07:00
Craig Topper	93139a3c32	[LegalizeTypes] Only expand CTLZ/CTTZ/CTPOP during type promotion if the new type is legal. We might be promoting a large non-power of 2 type and the new type may need to be split. Once we split it we may have a ctlz/cttz/ctpop instruction for the split type. I'm also concerned that we may create large shifts with shift amounts that are too small.	2021-10-22 11:02:35 -07:00
peter klausler	d5074c7166	[flang] Fix bogus folding error for ISHFT(x, negative) Negative shift counts are of course valid for ISHFT when shifting to the right. This patch decouples the folding of ISHFT from that of SHIFTA/L/R and adds tests. Differential Revision: https://reviews.llvm.org/D112244	2021-10-22 10:38:27 -07:00
David Green	d4da71282f	[InstCombine] Various tests for truncating saturates and related patterns.	2021-10-22 18:36:08 +01:00
Simon Pilgrim	a5f56342b0	[DAG] narrowExtractedVectorLoad - EXTRACT_SUBVECTOR indices are always constant EXTRACT_SUBVECTOR indices are always constant, we don't need to check for ConstantSDNode, we should just use getConstantOperandVal which will assert for the constant.	2021-10-22 18:32:14 +01:00
Philip Reames	412eb07edd	[indvars] Use fact loop must exit to canonicalize to unsigned conditions The logic in this patch is that if we find a comparison which would be unsigned except for when the loop is infinite, and we can prove that an infinite loop must be ill defined, we can still make the predicate unsigned. The eventual goal (combined with a follow on patch) is to use the fact the loop exits to remove the zext (see tests) entirely. A couple of points worth noting: * We loose the ability to prove the loop unreachable by committing to the must exit interpretation. If instead, we later proved that rhs was definitely outside the range required for finiteness, we could have killed the loop entirely. (We don't currently implement this transform, but could in theory, do so.) * simplifyAndExtend has a very limited list of users it walks. In particular, in the examples is stops at the zext and never visits the icmp. (Because we can't fold the zext to an addrec yet in SCEV.) Being willing to visit when we haven't simplified regresses multiple tests (seemingly because of less optimal results when computing trip counts). D112170 explores fixing that, but - at least so far - appears to be too expensive compile time wise. Differential Revision: https://reviews.llvm.org/D111836	2021-10-22 10:31:36 -07:00
Jeremy Morse	d9eebe3cd7	[DebugInfo][InstrRef] Add unit tests for transfer-function building This patch adds some unit tests for the machine-location transfer-function building parts of InstrRefBasedLDV: i.e., test that if we feed some MIR into the transfer-function building code, does it create the correct transfer function. There are a number of minor defects that get corrected in the process: * The unit test was selecting the x86 (i.e. 32 bit) backend rather than x86_64's 64 bit backend, * COPY instructions weren't actually having their subregister values correctly represented in the transfer function. Subregisters were being defined by the COPY, rather than taking the value in the source register. * SP aliases were at risk of being clobbered, if an SP subregister was clobbered. Differential Revision: https://reviews.llvm.org/D112006	2021-10-22 18:29:03 +01:00
Craig Topper	04c184bba7	[TargetLowering] Simplify the interface of expandABS. NFC Instead of returning a bool to indicate success and a separate SDValue, return the SDValue and have the callers check if it is null. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D112331	2021-10-22 10:22:23 -07:00
Jonas Devlieghere	0f12cf7eba	[lldb] Pass the target triple when determining the DWARF version When targeting iOS, the default dwarf version is 2 and not 4. Currently, the test suite does not pick up on that because it invokes the test compiler without a target triple. This patch fixes that and now correctly skips tests that have a dwarf version specified in a skipIf decorator. rdar://84530477 Differential revision: https://reviews.llvm.org/D112325	2021-10-22 10:13:24 -07:00
Quinn Pham	950f22a5e1	[llvm]Inclusive language: replace master with main [NFC] This patch fixes a url in a testcase due to the renaming of the branch.	2021-10-22 11:56:44 -05:00
Louis Dionne	7ea5409e42	[libc++] Fix tests after `aee4925507`	2021-10-22 12:50:29 -04:00
Nikita Popov	3a10fe2d89	[Loads] Use more powerful constant folding API This follows up on D111023 by exporting the generic "load value from constant at given offset as given type" and using it in the store to load forwarding code. We now need to make sure that the load size is smaller than the store size, previously this was implicitly ensured by ConstantFoldLoadThroughBitcast(). Differential Revision: https://reviews.llvm.org/D112260	2021-10-22 18:33:03 +02:00
Nikita Popov	5bb7562962	[Attributor] Generalize GEP construction Make use of the getGEPIndicesForOffset() helper for creating GEPs. This handles arrays as well, uses correct GEP index types and reduces code duplication. Differential Revision: https://reviews.llvm.org/D112263	2021-10-22 18:30:43 +02:00
Craig Topper	0766aef3f3	[LegalizeTypes][RISCV][PowerPC] Expand CTLZ/CTTZ/CTPOP instead of promoting if they'll be expanded later. Expanding these requires multiple constants. If we promote during type legalization when they'll end up getting expanded in LegalizeDAG, we'll use larger constants. These constants may be harder to materialize. For example, 64-bit constants on 64-bit RISCV are very expensive. This is similar to what has already been done to BSWAP and BITREVERSE. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D112268	2021-10-22 09:10:01 -07:00
Steven Wan	28ef8052d2	[AIX] Enable rtl for plugins test On AIX, the plugins are linked with `-WL,-G`, which produces shared objects enabled for use with the run-time linker. This patch sets the run-time linker at the main executable link step to allow symbols from the plugins shared objects to be properly bound. Reviewed By: daltenty Differential Revision: https://reviews.llvm.org/D112275	2021-10-22 12:08:22 -04:00
Craig Topper	ce7b8343be	[RISCV] Merge vector tests for rv32 and rv64 into a single test file These tests have nearly identical content the only difference is that the rv64 test has a signext attribute on some parameters. That attribute should be harmless on rv32. Merge them into a single test file with 2 RUN lines. Differential Revision: https://reviews.llvm.org/D112242	2021-10-22 09:04:30 -07:00
Vladimir Inđić	ba02586fbe	[OpenMP][OMPT][GOMP] task frame support in KMP_API_NAME_GOMP_PARALLEL_SECTIONS KMP_API_NAME_GOMP_PARALLEL_SECTIONS function was missing the task frame support. This patch introduced a fix responsible to set properly the exit_frame of the innermost implicit task that corresponds to the parallel section construct, as well as the enter_frame of the task that encloses the mentioned implicit task. This patch also introduced a simple test case sections_serialized.c that contains serialized parallel section construct and validates whether the mentioned task frames are set correctly. Differential Revision: https://reviews.llvm.org/D112205	2021-10-22 11:01:10 -05:00
Kazu Hirata	6fe949c4ed	[Target, Transforms] Use StringRef::contains (NFC)	2021-10-22 08:52:33 -07:00
Jonas Paulsson	12b44bf5ee	[SystemZ] Give the EXRL_Pseudo a size value of 6 bytes. This pseudo is expanded very late (AsmPrinter) and therefore has to have a correct size value, or the branch relaxation pass may make a wrong decision. Review: Ulrich Weigand	2021-10-22 17:38:51 +02:00
Mark de Wever	7593f68a05	[libc++][nfc] Remove double spaces. Based on the comment of @Quuxplusone in D111961. It seems no tests are affected, but give it a run on the CI to be sure. Reviewed By: #libc, ldionne Differential Revision: https://reviews.llvm.org/D112231	2021-10-22 17:26:13 +02:00
Mark de Wever	09dc8ab74c	[libc++][doc] Fixes FeatureTestMacroTable.html. `utils/generate_feature_test_macro_components.py` uses the wrong indentation. `:name: feature-status-table :widths: auto` is rendered as text instead of being used by Sphinx to render the table properly. This fixes the identation in the souce and updates the generated output. Reviewed By: #libc, ldionne Differential Revision: https://reviews.llvm.org/D112251	2021-10-22 17:25:43 +02:00
Piotr Sobczak	7457fe3dd4	[InstCombine][NFC] Precommit new tests	2021-10-22 17:15:53 +02:00
Konstantin Boyarinov	c87a4a46b2	[libc++][test][NFC] Add tests for std::vector comparisons Add missing tests for std::vector operator==, !=, <, <=, >, >= Reviewed By: ldionne, rarutyun, Quuxplusone, Mordante, #libc Differential Revision: https://reviews.llvm.org/D111738	2021-10-22 18:11:04 +03:00

1 2 3 4 5 ...

402672 Commits All Branches Search

402672 Commits

All Branches