llvm-project

Commit Graph

Author	SHA1	Message	Date
Craig Topper	023b243d1d	[RISCV] Clean up the spec version references around fmaxnum/fminnum. This previously made references to 2.3-draft, which was a short-lived version number in 2017. It was replaced by date-based versions leading up to ratification. This patch uses the latest ratified version number and just says what the behavior is. Nothing here is in flux. Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D100878	2021-04-21 14:50:29 -07:00
Craig Topper	a8822caa1b	[RISCV] Temporary in vmsge(u).vx pseudo instructions can't be V0. This was checked in some asserts, but not enforced by the instruction matching. There's still a second bug that we don't check that vt and vd are different registers, but that will require custom checking. Differential Revision: https://reviews.llvm.org/D100928	2021-04-21 14:50:29 -07:00
Petr Hosek	7a718e1630	[MC] Use COMDAT for LSDA only if IR comdat type is any This fixes an issue introduced in `16af973933` and `796feb6163`. Differential Revision: https://reviews.llvm.org/D100909	2021-04-21 14:41:39 -07:00
Olle Fredriksson	f5446b769a	[MemCpyOpt] Allow variable lengths in memcpy optimizer This makes the memcpy-memcpy and memcpy-memset optimizations work for variable sizes as long as they are equal, relaxing the old restriction that they are constant integers. If they're not equal, the old requirement that they are constant integers with certain size restrictions is used. The implementation works by pushing the length tests further down in the code, which reveals some places where it's enough that the lengths are equal (but not necessarily constant). Differential Revision: https://reviews.llvm.org/D100870	2021-04-21 23:23:38 +02:00
Arthur Eubanks	b606e2df4d	[Evaluator] Bitcast result of pointer stripping Trying to evaluate a GEP would assert with "Ty == cast<PointerType>(C->getType()->getScalarType())->getElementType()" because the type of the pointer we would evaluate the GEP argument to would be a different type than the GEP was expecting. We should treat pointer stripping as a bitcast. The test adds a redundant GEP that would crash due to type mismatch. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D100970	2021-04-21 13:32:29 -07:00
Arthur Eubanks	24e12e0726	[LLParser] Print mismatched types in error message Helps with debugging invalid handcrafted IR. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D100990	2021-04-21 13:10:37 -07:00
Nikita Popov	24e9fbc1a3	Revert "[InstCombine] Fold multiuse shr eq zero" This reverts commit `9423f78240`. A performance regression with this patch has been reported at https://reviews.llvm.org/rG9423f78240a2#990953. Reverting for now.	2021-04-21 21:40:52 +02:00
sstefan1	62cdcd6c5a	[FuncAttrs] Don't infer willreturn for nonexact definitions Discovered during attributor testing comparing stats with and without the attributor. Willreturn should not be inferred for nonexact definitions. Differential Revision: https://reviews.llvm.org/D100988	2021-04-21 21:26:09 +02:00
sstefan1	656ebd519e	[SimplifyLibCalls] Don't change alignment when creating memset Fix for PR49984. This was discovered during Attributor testing. Memset was always created with an alignment of 1, and when the strncpy alignment was changed it triggered an assertion in the AttrBuilder. Memset will now be created with the appropriate alignment. Differential Revision: https://reviews.llvm.org/D100875	2021-04-21 20:34:13 +02:00
Sanjay Patel	5e6dc5e404	[InstSimplify] generalize ctlz-of-shifted-constant https://alive2.llvm.org/ce/z/zWL_VQ	2021-04-21 14:23:55 -04:00
Simon Pilgrim	a511b55cfd	[X86][SSE] getFauxShuffleMask - don't decode OR(SHUFFLE,SHUFFLE) containing UNDEFs. (PR50049) PR50049 demonstrated an infinite loop between OR(SHUFFLE,SHUFFLE) <-> BLEND(SHUFFLE,SHUFFLE) patterns. The UNDEF elements were allowing a combined shuffle mask to be widened, which lost the undef element, resulting in us needing to use the BLEND pattern (as the undef element would need to be zero for the OR pattern). But then bitcast folds would re-expose the undef element, allowing us to use OR again...	2021-04-21 18:47:00 +01:00
Hongtao Yu	b6db6f5530	[CSSPGO] Exclude pseudo probe from slotindex verification.	2021-04-21 09:17:12 -07:00
Stanislav Mekhanoshin	f9d0d0d7e0	[AMDGPU] Lower regbanks reassign threshold to 15000 Let it work on very small kernels only. Measurements showed the performance benefit is not worth the compile time. Differential Revision: https://reviews.llvm.org/D100904	2021-04-21 08:34:11 -07:00
dfukalov	a8b35e0f52	[TTI] NFC: Change getVectorSplitCost to return InstructionCost This patch migrates the TTI cost interfaces to return an InstructionCost. See this patch for the introduction of the type: https://reviews.llvm.org/D91174 See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2020-November/146408.html Reviewed By: sdesmalen Differential Revision: https://reviews.llvm.org/D100952	2021-04-21 17:32:02 +03:00
Anirudh Prasad	8f6185c713	[AsmParser][ms][X86] Fix possible misbehaviour in parsing of special tokens at start of string. - Previously, https://reviews.llvm.org/D72680 introduced a new attribute called `AllowSymbolAtNameStart` (in relation to the MAsmParser changes) in `MCAsmInfo.h` which (according to the comment in the header) allows the following behaviour: ``` /// This is true if the assembler allows $ @ ? characters at the start of /// symbol names. Defaults to false. ``` - However, the usage of this field in AsmLexer.cpp doesn't seem completely accurate* for a couple of reasons. ``` default: if (MAI.doesAllowSymbolAtNameStart()) { // Handle Microsoft-style identifier: [a-zA-Z_$.@?][a-zA-Z0-9_$.@#?]* if (!isDigit(CurChar) && isIdentifierChar(CurChar, MAI.doesAllowAtInName(), AllowHashInIdentifier)) return LexIdentifier(); } ``` 1. The Dollar and At tokens, when occurring at the start of the string, are treated as separate tokens (AsmToken::Dollar and AsmToken::At respectively) and not lexed as an Identifier. 2. I'm not too sure why `MAI.doesAllowAtInName()` is used when `AllowAtInIdentifier` could be used. For X86 platforms, afaict, this shouldn't be an issue, since the `CommentString` attribute isn't "@". (alternatively the call to the setter can be set anywhere else as needed). The `AllowAtInName` does have an additional important meaning, but in the context of AsmLexer, shouldn't mean anything different compared to `AllowAtInIdentifier` My proposal is the following: - Introduce 3 new fields called `AllowQuestionTokenAtStartOfString`, `AllowDollarTokenAtStartOfString` and `AllowAtTokenAtStartOfString` in MCAsmInfo.h which will encapsulate the previously documented behaviour of "allowing $, @, ? characters at the start of symbol names") - Introduce these fields where "$", "@" are lexed, and treat them as identifiers depending on whether `Allow[Dollar\|At]TokenAtStartOfString` is set. - For the sole case of "?", append it to the existing logic for treating a "default" token as an Identifier. z/OS (HLASM) will also make use of some of these fields in follow up patches. completely accurate* - This was based on the comments and the intended behaviour of the code. I might have completely misinterpreted it, and if that is the case my sincere apologies. We can close this patch if necessary, if there are no changes to be made :) Depends on https://reviews.llvm.org/D99374 Reviewed By: Jonathan.Crowther Differential Revision: https://reviews.llvm.org/D99889	2021-04-21 10:21:09 -04:00
Nico Weber	ba7a92c01e	[Support] Don't include VirtualFileSystem.h in CommandLine.h CommandLine.h is indirectly included in ~50% of TUs when building clang, and VirtualFileSystem.h is large. (Already remarked by jhenderson on D70769.) No behavior change. Differential Revision: https://reviews.llvm.org/D100957	2021-04-21 10:19:01 -04:00
Caroline Concatto	ca9b7e2e2f	[AArch64][SVE] Fix crash with icmp+select This patch changes the lowering of SELECT_CC from Legal to Expand for scalable vectors and adds support for scalable vectors in performSelectCombine. When selecting the nodes to lower in visitSELECT it checks if it is possible to use SELECT_CC in cases where SETCC is followed by SELECT. visitSELECT checks if SELECT_CC is legal or custom to replace SELECT by SELECT_CC. SELECT_CC used to be legal for scalable vectors, so the node changes to SELECT_CC. This used to crash the compiler as there is no support for SELECT_CC with scalable vectors. So now the compiler lowers to VSELECT instead of SELECT_CC. Differential Revision: https://reviews.llvm.org/D100485	2021-04-21 14:16:27 +01:00
Matt Arsenault	70ab76a81b	AMDGPU: Fix indirect tail calls Fix a selection error on uniform callees, and use a regular call if divergent.	2021-04-21 09:15:24 -04:00
Simon Tatham	77e170db86	[ARM][Driver][Windows] Allow command-line upgrade to Armv8. If you gave clang the options `--target=arm-pc-windows-msvc` and `-march=armv8-a+crypto` together, the crypto extension would not be enabled in the compilation, and you'd see the following warning message suggesting that the 'armv8-a' had been ignored: clang: warning: ignoring extension 'crypto' because the 'armv7-a' architecture does not support it [-Winvalid-command-line-argument] This happens because Triple::getARMCPUForArch(), for the Win32 OS, unconditionally returns "cortex-a9" (an Armv7 CPU) regardless of MArch, which overrides the architecture setting on the command line. I don't think that the combination of Windows and AArch32 _should_ unconditionally outlaw the use of the crypto extension. MSVC itself doesn't think so: you can perfectly well compile Thumb crypto code using its AArch32-targeted compiler. All the other default CPUs in the same switch statement are conditional on a particular MArch setting; this is the only one that returns a particular CPU _regardless_ of MArch. So I've fixed this one by adding a condition, so that if you ask for an architecture above v7, the default of Cortex-A9 no longer overrides it. Reviewed By: mstorsjo Differential Revision: https://reviews.llvm.org/D100937	2021-04-21 11:20:05 +01:00
Simon Pilgrim	d860bf2d0e	[DAG] TargetLowering.cpp - breakup if-else chains where each block returns. NFCI. Match style guide that requests that if+return blocks are separate.	2021-04-21 11:17:27 +01:00
Fraser Cormack	c141bd3cf9	[DAGCombiner] Support all-ones/all-zeros SPLAT_VECTOR in more combines This patch adds incrementally-better support for SPLAT_VECTOR in a handful of vector combines by changing a few more isBuildVectorAllOnes/isBuildVectorAllZeros to the equivalent isConstantSplatVectorAllOnes/Zeros calls. Reviewed By: paulwalker-arm Differential Revision: https://reviews.llvm.org/D100851	2021-04-21 11:05:37 +01:00
Fraser Cormack	3f02d26943	[RISCV] Further fixes for RVV stack offset computation This patch fixes a case missed out by D100574, in which RVV scalable stack offset computations may require three live registers in the case where the offset's fixed component is 12 bits or larger and has a scalable component. Instead of adding an additional emergency spill slot, this patch further optimizes the scalable stack offset computation sequences to reduce register usage. By emitting the sequence to compute the scalable component before the fixed component, we can free up one scratch register to be reallocated by the sequence for the fixed component. Doing this saves one register and thus one additional emergency spill slot. Compare: $x5 = LUI 1 $x1 = ADDIW killed $x5, -1896 $x1 = ADD $x2, killed $x1 $x5 = PseudoReadVLENB $x6 = ADDI $x0, 50 $x5 = MUL killed $x5, killed $x6 $x1 = ADD killed $x1, killed $x5 versus: $x5 = PseudoReadVLENB $x1 = ADDI $x0, 50 $x5 = MUL killed $x5, killed $x1 $x1 = LUI 1 $x1 = ADDIW killed $x1, -1896 $x1 = ADD $x2, killed $x1 $x1 = ADD killed $x1, killed $x5 Reviewed By: HsiangKai Differential Revision: https://reviews.llvm.org/D100847	2021-04-21 10:51:07 +01:00
David Sherwood	57ca65e21e	[AArch64] Add instruction costs for FP_TO_UINT and FP_TO_SINT with half types We were missing some instruction costs when converting vectors of floating point half types into integers, so I've added those here. I also manually generated assembly code for each FP->int case and looked at the number of instructions generated, which meant adjusting some of the existing costs too. I've updated an existing test to reflect the new costs: Analysis/CostModel/AArch64/sve-fptoi.ll Differential Revision: https://reviews.llvm.org/D99935	2021-04-21 09:39:45 +01:00
Yang Fan	4307446e9f	[SCEV] Fix -Wunused-variable warning (NFC) GCC warning: ``` /llvm-project/llvm/lib/Analysis/ScalarEvolution.cpp: In member function ‘const llvm::SCEV* llvm::ScalarEvolution::getLosslessPtrToIntExpr(const llvm::SCEV, unsigned int)::SCEVPtrToIntSinkingRewriter::visitUnknown(const llvm::SCEVUnknown)’: /llvm-project/llvm/lib/Analysis/ScalarEvolution.cpp:1152:13: warning: unused variable ‘ExprPtrTy’ [-Wunused-variable] 1152 \| Type *ExprPtrTy = Expr->getType(); \| ^~~~~~~~~ ```	2021-04-21 16:01:46 +08:00
Nikita Popov	de18fa9e52	Revert "[InstSimplify] Bypass no-op `and`-mask, using known bits (PR49543)" This reverts commit `ea1a0d7c9a`. While this is strictly more powerful, it is also strictly slower. InstSimplify intentionally does not perform many folds that it is allowed to perform, if doing so requires a KnownBits calculation that will be repeated in InstCombine. Maybe it's worthwhile to do this here, but that needs a more explicitly stated motivation, evaluated in a review.	2021-04-21 09:55:25 +02:00
Zakk Chen	ad0fe5db2f	[RISCV][MC] Mask load should not have VMConstraint. Add a test, dest register could be v0. Reviewed By: HsiangKai Differential Revision: https://reviews.llvm.org/D100825	2021-04-21 15:21:37 +08:00
Serge Pavlov	d20a2376d8	[RISCV] Introduce floating point control and state registers New registers FRM, FFLAGS and FCSR were defined. They represent corresponding system registers. The new registers are necessary to properly order floating point instructions in non-default modes. Differential Revision: https://reviews.llvm.org/D99083	2021-04-21 12:55:30 +07:00
Zi Xuan Wu	ca31b43ae8	[NFC][CSKY] Resort the instruction description in td Resort the instruction description in td to make it easy to upstream more instructions and add predicts later.	2021-04-21 12:36:07 +08:00
George Balatsouras	79b5280a6c	[dfsan] Enable origin tracking with fast8 mode All related instrumentation tests have been updated. Reviewed By: stephan.yichao.zhao Differential Revision: https://reviews.llvm.org/D100903	2021-04-20 18:10:32 -07:00
Adrian Prantl	81cad0be68	Make sure PHIElimination doesn't copy debug locations across basic blocks. PHIElimination may insert copy instructions in multiple basic blocks. Moving debug locations across basic block boundaries would be misleading as illustrated by the test case. rdar://75463656 Differential Revision: https://reviews.llvm.org/D100886	2021-04-20 17:03:29 -07:00
Sam Clegg	103956170b	[WebAssembly] Update README. NFC. This is just a cleanup of the very high level stuff. I'm sure there is more to update here but I'll leave that to others and/or a followup. Differential Revision: https://reviews.llvm.org/D100888	2021-04-20 16:59:08 -07:00
Arthur Eubanks	326da4adcb	[FuncAttrs] Always preserve FunctionAnalysisManagerCGSCCProxy FunctionAnalysisManagerCGSCCProxy should not be preserved if any of its keys may be invalid. Since we are not removing/adding functions in FuncAttrs, it's fine to preserve it. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D100893	2021-04-20 16:37:45 -07:00
Reid Kleckner	91f7a4fff7	Revert "[InstCombine] Recognize `((x * y) s/ x) !=/== y` as an signed multiplication overflow check (PR48769)" This reverts commit `13ec913bdf`. This commit introduces new uses of the overflow checking intrinsics that depend on implementations in compiler-rt, which Windows users generally do not link against. I filed an issue (somewhere) to make clang auto-link the builtins library to resolve this situation, but until that happens, it isn't reasonable for the optimizer to introduce new link time dependencies.	2021-04-20 15:53:34 -07:00
Philip Reames	4824d876f0	Revert "Allow invokable sub-classes of IntrinsicInst" This reverts commit `d87b9b81cc`. Post commit review raised concerns, reverting while discussion happens.	2021-04-20 15:38:38 -07:00
Roman Lebedev	5a654bfeab	Revert "[InstCombine] `sext(trunc(x)) --> sext(x)` iff trunc is NSW (PR49543)" I forgot about the case where we sign-extend to width smaller than the original. This reverts commit `1e6ca23ab8`.	2021-04-21 01:11:15 +03:00
Roman Lebedev	1e68d338c1	Revert "[InstCombine] "Bypass" NUW trunc of lshr if we are going to sext the result (PR49543)" I forgot about the case where we sign-extend to width smaller than the original. This reverts commit `41b71f718b`.	2021-04-21 01:11:14 +03:00
Philip Reames	d87b9b81cc	Allow invokable sub-classes of IntrinsicInst It used to be that all of our intrinsics were call instructions, but over time, we've added more and more invokable intrinsics. According to the verifier, we're up to 8 right now. As IntrinsicInst is a sub-class of CallInst, this puts us in an awkward spot where the idiomatic means to check for an intrinsic has a false negative if the intrinsic is invoked. This change switches IntrinsicInst from being a sub-class of CallInst to being a subclass of CallBase. This allows invoked intrinsics to be instances of IntrinsicInst, at the cost of requiring a few more casts to CallInst in places where the intrinsic really is known to be a call, not an invoke. After this lands and has baked for a couple days, planned cleanups: Make GCStatepointInst an IntrinsicInst subclass. Merge intrinsic handling in InstCombine and use idiomatic visitIntrinsicInst entry point for InstVisitor. Do the same in SelectionDAG. Do the same in FastISEL. Differential Revision: https://reviews.llvm.org/D99976	2021-04-20 15:03:49 -07:00
Roman Lebedev	41b71f718b	[InstCombine] "Bypass" NUW trunc of lshr if we are going to sext the result (PR49543) This is a more convoluted form of the same pattern "sext of NSW trunc", but in this case the operand of trunc was a right-shift, and the truncation chops off just the zero bits that were shifted-in.	2021-04-21 00:31:46 +03:00
Roman Lebedev	ea1a0d7c9a	[InstSimplify] Bypass no-op `and`-mask, using known bits (PR49543) We already special-cased a few interesting patterns, but that is strictly less powerful than using KnownBits. So instead get the known bits for the operand of `and`, and iff all the unset bits of the `and`-mask are known to be zeros in the operand, we can omit said `and`.	2021-04-21 00:31:46 +03:00
Roman Lebedev	1e6ca23ab8	[InstCombine] `sext(trunc(x)) --> sext(x)` iff trunc is NSW (PR49543) If we can tell that trunc only chops off sign bits, and not all of them, then we can simply sign-extend the trunc's source.	2021-04-21 00:31:45 +03:00
Sanjay Patel	1e202e8f39	[InstCombine] fold shift-of-srem-by-2 to mask+shift There are several potential srem-by-2 folds because the result is known {-1,0,1}. https://alive2.llvm.org/ce/z/LuVyeK	2021-04-20 17:10:16 -04:00
Sam Clegg	d2de2d1724	[WebAssembly] Remove unused known_gcc_test_failures.txt. NFC Differential Revision: https://reviews.llvm.org/D100887	2021-04-20 14:07:25 -07:00
Alexey Bataev	673e2f1b70	[COST][AARCH64] Improve cost of reverse shuffles for AArch64. Introduced the cost of the reverse shuffles for AArch64, currently just copied the costs for PermuteSingleSrc. Differential Revision: https://reviews.llvm.org/D100871	2021-04-20 13:47:56 -07:00
Philip Reames	6792e26c0d	Reapply "Look through invertible recurrences in isKnownNonEqual" I'd reverted this in commit `3b6acb1797` due to buildbot failures. This patch contains the fix for said issue. I'd forgotten to handle the case where two phis in the same block have different operand order. We canonicalize away from this, but it's still valid IR. The tests included in this change (as opposed to simply having test output changed), crashed without the fix. Original commit message follows... This extends the phi handling in isKnownNonEqual with a special case based on invertible recurrences. If we can prove the recurrence is invertible (which many common ones are), we can recurse through the start operands of the recurrence skipping the phi cycle. (Side note: Instcombine currently does not push back through these cases. I will implement that in a follow up change w/separate review.) Differential Revision: https://reviews.llvm.org/D99912	2021-04-20 12:47:59 -07:00
Jon Roelofs	167da6c9e8	[AArch64][GlobalISel] Clarify fallback debug print ... to only print when that fallback actually happens.	2021-04-20 12:41:14 -07:00
Thomas Lively	693d767c60	[WebAssembly] More codegen for f64x2.convert_low_i32x4_{s,u} `af7925b4dd` added a custom DAG combine for recognizing fp-to-ints of extract_subvectors that could be lowered to f64x2.convert_low_i32x4_{s,u} instructions. This commit extends the combines to recognize equivalent extract_subvectors of fp-to-ints as well. Differential Revision: https://reviews.llvm.org/D100790	2021-04-20 12:37:13 -07:00
Philip Reames	3b6acb1797	Revert "Look through invertible recurrences in isKnownNonEqual" This reverts commit `be20eae25f`. It appears to have caused a crash on a buildbot (https://lab.llvm.org/buildbot#builders/77/builds/5653). Reverting while investigating.	2021-04-20 11:47:10 -07:00
Philip Reames	9c1a145aeb	Rearrange code to reduce diff for D99687 [nfc] Adding the switches to reduce diffs. I'm about to split that into an lshr part and an ashr part, doing the NFC part first makes it easier to maintain both diffs.	2021-04-20 11:40:15 -07:00
Roman Lebedev	13ec913bdf	[InstCombine] Recognize `((x * y) s/ x) !=/== y` as an signed multiplication overflow check (PR48769) We already had support for its unsigned variant, so simply extend it to also handle the signed variant. Fixes https://bugs.llvm.org/show_bug.cgi?id=48769	2021-04-20 21:29:43 +03:00
Roman Lebedev	7186764884	[NFC][SCEV] Split getLosslessPtrToIntExpr out of getPtrToIntExpr()	2021-04-20 21:29:21 +03:00
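
The `ea1a0d7c9a` entry in the table above (later reverted by `de18fa9e52`) describes dropping an `and` when every bit cleared by the mask is already known to be zero in the operand. A minimal sketch of that condition follows. The `KnownZeroBits` struct and the helper names are hypothetical simplifications for illustration, not LLVM's `KnownBits` API, and the sketch only models 32-bit values.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical, simplified stand-in for a known-bits analysis result:
// records only which bits of a 32-bit value are proven to be zero.
struct KnownZeroBits {
  std::uint32_t Zero;
};

// Known-zero bits of a logical right shift by ShAmt: the top ShAmt bits.
// Assumes 0 <= ShAmt < 32.
constexpr KnownZeroBits knownBitsOfLshr(unsigned ShAmt) {
  return KnownZeroBits{~(0xFFFFFFFFu >> ShAmt)};
}

// `X & Mask` is a no-op when all unset bits of Mask are known zero in X.
constexpr bool isNoOpAndMask(KnownZeroBits Known, std::uint32_t Mask) {
  return (~Mask & ~Known.Zero) == 0;
}

// (Y >> 24) & 0xFF can drop the mask; (Y >> 24) & 0x7F cannot, because
// bit 7 of the shifted value is not known to be zero.
static_assert(isNoOpAndMask(knownBitsOfLshr(24), 0xFFu), "mask is redundant");
static_assert(!isNoOpAndMask(knownBitsOfLshr(24), 0x7Fu), "mask is needed");

// Runtime spot check on concrete values: when the predicate holds,
// applying the mask leaves the value unchanged.
inline void checkAgainstConcreteValues() {
  const std::uint32_t Samples[] = {0u, 1u, 0x80000000u, 0xDEADBEEFu,
                                   0xFFFFFFFFu};
  for (std::uint32_t Y : Samples) {
    std::uint32_t X = Y >> 24;
    assert((X & 0xFFu) == X);
  }
}
```

The check is a subset test: the bits cleared by the mask (`~Mask`) must all fall inside the known-zero set. The revert message in the table notes that computing known bits makes this strictly more powerful than pattern matching but also slower, which this sketch does not model.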

146191 Commits