llvm-project

Commit Graph

Author	SHA1	Message	Date
Sanjay Patel	a0f967418f	[VectorCombine] give invalid index value a name; NFC	2020-06-24 11:10:36 -04:00
Matt Arsenault	c5d240093b	WebAssembly: Don't store MachineFunction in MachineFunctionInfo Soon it will be disallowed to depend on MachineFunction state in the constructor. This was only being used to get the MachineRegisterInfo for an assert, which I'm not sure is necessarily worth it. I would think any missing defs would be caught by the verifier later instead.	2020-06-24 10:52:58 -04:00
Tim Corringham	c3b3b999ec	[AMDGPU] Avoid redundant mode register writes Summary: The SIModeRegister pass attempts to generate the minimal number of writes to the mode register. However it was failing to correctly deal with some loops, resulting in some redundant setreg instructions being inserted. This change amends the pass to avoid generating these redundant instructions. Subscribers: arsenm, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82215	2020-06-24 14:11:29 +01:00
Simon Pilgrim	bf77c7ef2d	Loads.h - reduce AliasAnalysis.h include to forward declarations. NFC. Fix implicit include dependencies in source files.	2020-06-24 13:49:04 +01:00
Florian Hahn	4e62c6359c	[DSE] Eliminate stores at the end of the function. This patch add support for eliminating MemoryDefs that do not have any aliasing users, which indicates that there are no reads/writes to the memory location until the end of the function. To eliminate such defs, we have to ensure that the underlying object is not visible in the caller and does not escape via returning. We need a separate check for that, as InvisibleToCaller does not consider returns. Reviewers: dmgreen, rnk, efriedma, bryant, asbirlea, Tyker, george.burgess.iv Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D72631	2020-06-24 12:58:20 +01:00
sstefan1	0f426935bb	[OpenMPOpt] ICV macro definitions Summary: This defines some basic information about ICVs in `OMPKinds.def`. We also emit remarks with initial values for each function (which are default for now) as a way to test this. Reviewers: jdoerfert, JonChesterfield, hamax97, jhuber6 Subscribers: yaxunl, hiraditya, guansong, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82193	2020-06-24 13:43:35 +02:00
Simon Pilgrim	90ad37646f	ObjCARC.h - remove unnecessary includes. NFC. Add implicit InstIterator.h dependency in ObjCARCContract.cpp	2020-06-24 12:30:59 +01:00
Cullen Rhodes	26502ad609	[AArch64][SVE] Add bfloat16 support to perm and select intrinsics Summary: Added for following intrinsics: * zip1, zip2, zip1q, zip2q * trn1, trn2, trn1q, trn2q * uzp1, uzp2, uzp1q, uzp2q * splice * rev * sel Reviewed By: david-arm Differential Revision: https://reviews.llvm.org/D82182	2020-06-24 10:04:51 +00:00
Kerry McLaughlin	3d6cab271c	[AArch64][SVE] Add bfloat16 support to load intrinsics Summary: Bfloat16 support added for the following intrinsics: - LD1 - LD1RQ - LDNT1 - LDNF1 - LDFF1 Reviewers: sdesmalen, c-rhodes, efriedma, stuij, fpetrogalli, david-arm Reviewed By: fpetrogalli Subscribers: tschuett, kristof.beyls, hiraditya, rkruppe, psnobl, danielkiss, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D82298	2020-06-24 10:32:19 +01:00
alex-t	521ac0b5ce	[AMDGPU] Enable compare operations to be selected by divergence Summary: Details: This patch enables SETCC to be selected to S_CMP_* if uniform and V_CMP_* if divergent. Reviewers: rampitec, arsenm Reviewed By: rampitec Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82194	2020-06-24 11:50:40 +03:00
Simon Tatham	b769eb02b5	[ARM][BFloat] Legalize bf16 type even without fullfp16. Summary: This change permits scalar bfloats to be loaded, stored, moved and used as function call arguments and return values, whenever the bf16 feature is supported by the subtarget. Previously that was only supported in the presence of the fullfp16 feature, because the code generation strategy depended on instructions from that extension. This change adds alternative code generation strategies so that those operations can be done even without fullfp16. The strategy for loads and stores is to replace VLDRH/VSTRH with integer LDRH/STRH plus a move between register classes. I've written isel patterns for those, conditional on //not// having the fullfp16 feature (so that in the fullfp16 case, the existing patterns will still be used). For function arguments and returns, instead of writing isel patterns to match `VMOVhr` and `VMOVrh`, I've avoided generating those SDNodes in the first place, by factoring out the code that constructs them into helper functions `MoveToHPR` and `MoveFromHPR` which have a fallback for non-fullfp16 subtargets. The current output code is not especially pretty: in the new test file you can see unnecessary store/load pairs implementing no-op bitcasts, and lots of pointless moves back and forth between FP registers and GPRs. But it at least works, which is an improvement on the previous situation. Reviewers: dmgreen, SjoerdMeijer, stuij, chill, miyuki, labrinea Reviewed By: dmgreen, labrinea Subscribers: labrinea, kristof.beyls, hiraditya, danielkiss, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82372	2020-06-24 09:36:26 +01:00
Craig Topper	8172ed91f8	[X86] Speculatively fix to X86AvoidStoreForwardingBlocks not deference a machine mem operand if there isn't one present. Eric Christopher informed me that FastISel memcpy handling creates load/store instructions without mem operands. We should fix that, but I doubt that's the only case of missed mem operands so seems better to be defensive here. I don't have a test case yet, but I'll try to add one if i get a test from Eric.	2020-06-24 00:13:58 -07:00
Craig Topper	31c40f2d6b	[X86] Add mayLoad/mayStore flags to some X87 instructions that don't have isel patterns to infer them from. Should remove part of the differences in D81833 due to some some of these getting isel patterns.	2020-06-23 23:40:30 -07:00
Eli Friedman	b5740105d2	[BitcodeReader] Fix DelayedShuffle handling for ConstantExpr shuffles. The indexing was messed up, so the result was completely broken. Shuffle constant exprs are rare in practice; without vscale types, constant folding generally elminates them. So sort of hard to trip over. Fixes regression from D72467. Differential Revision: https://reviews.llvm.org/D80330	2020-06-23 19:50:30 -07:00
Amara Emerson	fceadbcb33	[AArch64][GlobalISel] Improve codegen for some constant vectors by using constant pool loads. There's more smarts in AArch64ISelLowering that we don't have yet, but this change incrementally improves some of the more common patterns. I think future iterations will want to use some combination of PostLegalizerCombiner and the selector to catch the other cases. Differential Revision: https://reviews.llvm.org/D82340	2020-06-23 19:23:47 -07:00
Eli Friedman	a2caa3b614	Remove GlobalValue::getAlignment(). This function is deceptive at best: it doesn't return what you'd expect. If you have an arbitrary GlobalValue and you want to determine the alignment of that pointer, Value::getPointerAlignment() returns the correct value. If you want the actual declared alignment of a function or variable, GlobalObject::getAlignment() returns that. This patch switches all the users of GlobalValue::getAlignment to an appropriate alternative. Differential Revision: https://reviews.llvm.org/D80368	2020-06-23 19:13:42 -07:00
Vedant Kumar	f8bd6a75ed	[SimplifyCFG] Drop debug loc in SpeculativelyExecuteBB Summary: According to HowToUpdateDebugInfo.rst: ``` Preserving the debug locations of speculated instructions can make it seem like a condition is true when it's not (or vice versa), which leads to a confusing single-stepping experience ``` This patch follows the recommendation to drop debug locations on speculated instructions. Reviewers: aprantl, davide Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82420	2020-06-23 18:25:52 -07:00
Matt Arsenault	a162048a47	AMDGPU/GlobalISel: Fix fixed ABI special VGPR function arguments I forgot to copy the new fixed function ABI into GlobalISel, so this was mismatched with the DAG compiled calling function. This was allocating part of the argument list to v31, which was supposed to be reserved for the workitem IDs.	2020-06-23 21:21:35 -04:00
Eli Friedman	e9d4e34ab8	[AArch64][SVE] Add legalization support for i32/i64 vector srem/urem Implement them on top of sdiv/udiv, similar to what we do for integer types. Potential future work: implementing i8/i16 srem/urem, optimizations for constant divisors, optimizing the mul+sub to mls. Differential Revision: https://reviews.llvm.org/D81511	2020-06-23 16:27:52 -07:00
Eli Friedman	90ad786947	[IR] Prefer scalar type for struct indexes in GEP constant expressions. This has two advantages: one, it's simpler, and two, it doesn't require heroic pattern matching with scalable vectors. Also includes a small fix to DataLayout to allow the scalable vector testcase to work correctly. Differential Revision: https://reviews.llvm.org/D82061	2020-06-23 16:14:36 -07:00
Sam Clegg	e49584a34a	[WebAssembly] Fix for use of uninitialized member in WasmObjectWriter.cpp Currently, section indices may be passed uninitialized by value if writing the section fails. Removes section indices form class initialization and returns them from the write{Code,Data}Section function calls instead. Patch by Gui Andrade! Differential Revision: https://reviews.llvm.org/D81702	2020-06-23 15:26:18 -07:00
David Green	d604cc6e9a	[ARM] Mark more integer instructions as not having side effects. LDRD and STRD along with UBFX and SBFX are selected from DAGToDAG transforms, so do not have tblgen patterns. They don't get marked as having side effects so cannot be scheduled as efficiently as you would like. This specifically marks then as not having side effects. Differential Revision: https://reviews.llvm.org/D82358	2020-06-23 22:45:51 +01:00
Christopher Tetreault	433c9adf7b	[SVE] Remove calls to VectorType::getNumElements from AsmParser Reviewers: efriedma, RKSimon, c-rhodes, fpetrogalli Reviewed By: fpetrogalli Subscribers: tschuett, hiraditya, rkruppe, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82208	2020-06-23 14:31:49 -07:00
Zequan Wu	6a822e20ce	[ASan][MSan] Remove EmptyAsm and set the CallInst to nomerge to avoid from merging. Summary: `nomerge` attribute was added at D78659. So, we can remove the EmptyAsm workaround in ASan the MSan and use this attribute. Reviewers: vitalybuka Reviewed By: vitalybuka Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82322	2020-06-23 14:22:53 -07:00
Ryan Santhiraraja	f64dc4e686	Preserve GlobalsAA analysis result in InjectTLIMappings InjectTLIMappings fails to preserve the analysis result of GlobalsAA. Not preserving the analysis might affect benchmark performance. This change fixes this issue. Patch by: Ryan Santhiraraja <rsanthir@quicinc.com> Reviewers: fpetrogalli, joerg, fhahn Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D82343	2020-06-23 22:05:42 +01:00
Nikita Popov	6904c7129b	[IR] Remove MSVC warning workaround (NFC) While LLVM does fold this to x+1, GCC does not. As this is hot code, let's try to avoid that. According to https://developercommunity.visualstudio.com/content/problem/211134/unsigned-integer-overflows-in-constexpr-functionsa.html this spurious warning in MSVC has been fixed in Visual Studio 2019 Version 16.4. Let's see if there are any build bots running old MSVC versions with warnings treated as errors...	2020-06-23 22:33:57 +02:00
Christopher Tetreault	e6d8636935	[SVE] Remove calls to VectorType::getNumElements from Bitcode Reviewers: efriedma, evgeny777, tejohnson, david-arm, kmclaughlin Reviewed By: david-arm Subscribers: tschuett, hiraditya, rkruppe, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82209	2020-06-23 13:21:40 -07:00
Nikita Popov	52e86797ba	[IR] Remove unnecessary uint64_t casts (NFC) As pointed out by foad, it's not necessary to work on uint64_t here. The values used here fit uint8_t.	2020-06-23 22:20:15 +02:00
Florian Hahn	ff4de8683a	[DSE,MSSA] Treat `store 0` after calloc as noop stores. This patch extends storeIsNoop to also detect stores of 0 to an calloced object. This basically ports the logic from legacy DSE to the MemorySSA backed version. It triggers in a few cases on MultiSource, SPEC2000, SPEC2006 with -O3 LTO: Same hash: 218 (filtered out) Remaining: 19 Metric: dse.NumNoopStores Program base patch2 diff test-suite...CFP2000/177.mesa/177.mesa.test 1.00 15.00 1400.0% test-suite...6/482.sphinx3/482.sphinx3.test 1.00 14.00 1300.0% test-suite...lications/ClamAV/clamscan.test 2.00 28.00 1300.0% test-suite...CFP2006/433.milc/433.milc.test 1.00 8.00 700.0% test-suite...pplications/oggenc/oggenc.test 2.00 9.00 350.0% test-suite.../CINT2000/176.gcc/176.gcc.test 6.00 6.00 0.0% test-suite.../CINT2006/403.gcc/403.gcc.test NaN 137.00 nan% test-suite...libquantum/462.libquantum.test NaN 3.00 nan% test-suite...6/464.h264ref/464.h264ref.test NaN 7.00 nan% test-suite...decode/alacconvert-decode.test NaN 2.00 nan% test-suite...encode/alacconvert-encode.test NaN 2.00 nan% test-suite...ications/JM/ldecod/ldecod.test NaN 9.00 nan% test-suite...ications/JM/lencod/lencod.test NaN 39.00 nan% test-suite.../Applications/lemon/lemon.test NaN 2.00 nan% test-suite...pplications/treecc/treecc.test NaN 4.00 nan% test-suite...hmarks/McCat/08-main/main.test NaN 4.00 nan% test-suite...nsumer-lame/consumer-lame.test NaN 3.00 nan% test-suite.../Prolangs-C/bison/mybison.test NaN 1.00 nan% test-suite...arks/mafft/pairlocalalign.test NaN 30.00 nan% Reviewers: efriedma, zoecarver, asbirlea Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D82204	2020-06-23 21:01:39 +01:00
Your Name	cc9d693856	[AMDGPU/MemOpsCluster] Implement new heuristic for computing max mem ops cluster size Summary: Make use of both the - (1) clustered bytes and (2) cluster length, to decide on the max number of mem ops that can be clustered. On an average, when loads are dword or smaller, consider `5` as max threshold, otherwise `4`. This heuristic is purely based on different experimentation conducted, and there is no analytical logic here. Reviewers: foad, rampitec, arsenm, vpykhtin Reviewed By: rampitec Subscribers: llvm-commits, kerbowa, hiraditya, t-tye, Anastasia, tpr, dstuttard, yaxunl, nhaehnle, wdng, jvesely, kzhuravl, thakis Tags: #llvm Differential Revision: https://reviews.llvm.org/D82393	2020-06-24 00:39:41 +05:30
Christopher Tetreault	4d1fd33561	[SVE] Remove calls to VectorType::getNumElements from FuzzMutate Reviewers: efriedma, bkramer, kmclaughlin, sdesmalen Reviewed By: sdesmalen Subscribers: tschuett, hiraditya, rkruppe, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82212	2020-06-23 11:02:20 -07:00
Simon Pilgrim	e7e204a373	[X86][AVX] Attempt to lower v16i32/v16f32 shuffles with lowerShuffleAsRepeatedMaskAndLanePermute Avoids prematurely creating permps/permd variable shuffles. Fixes PR46249	2020-06-23 18:33:50 +01:00
Simon Pilgrim	ddc6ec9470	WithColor.h - reduce CommandLine.h include to forward declaration. NFC. WithColor.h is one of the most common headers, we can severely reduce its frontend impact (in ClangBuildAnalyzer reports) by removing the bulky CommandLine.h include, forward declaring llvm:🆑:OptionCategory and just including raw_ostream.h instead.	2020-06-23 17:07:53 +01:00
Xing GUO	45fa936855	[ObjectYAML][DWARF] Remove unused context. NFC. The context is unused. This patch helps remove it. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D82351	2020-06-24 00:02:51 +08:00
Xing GUO	fad54c50e4	[ObjectYAML][ELF] Add support for emitting the .debug_pubtypes section. This patch helps add support for emitting the .debug_pubtypes section. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D82347	2020-06-24 00:01:07 +08:00
Momchil Velikov	adf7973fd3	[ARM] Describe defs/uses of VLLDM and VLSTM The VLLDM and VLSTM instructions are incompletely specified. They (potentially) write (or read, respectively) registers Q0-Q7, VPR, and FPSCR, but the compiler is unaware of it. In the new test case `cmse-vlldm-no-reorder.ll` case the compiler missed an anti-dependency and reordered a `VLLDM` ahead of the instruction, which stashed the return value from the non-secure call, effectively clobbering said value. This test case does not fail with upstream LLVM, because of scheduling differences and I couldn't find a test case for the VLSTM either. Differential Revision: https://reviews.llvm.org/D81586	2020-06-23 16:04:23 +01:00
Valentin Clement	d90443b1d9	[openmp] Base of tablegen generated OpenMP common declaration Summary: As discussed previously when landing patch for OpenMP in Flang, the idea is to share common part of the OpenMP declaration between the different Frontend. While doing this it was thought that moving to tablegen instead of Macros will also give a cleaner and more powerful way of generating these declaration. This first part of a future series of patches is setting up the base .td file for DirectiveLanguage as well as the OpenMP version of it. The base file is meant to be used by other directive language such as OpenACC. In this first patch, the Directive and Clause enums are generated with tablegen instead of the macros on OMPConstants.h. The next pacth will extend this to other enum and move the Flang frontend to use it. Reviewers: jdoerfert, DavidTruby, fghanim, ABataev, jdenny, hfinkel, jhuber6, kiranchandramohan, kiranktp Reviewed By: jdoerfert, jdenny Subscribers: arphaman, martong, cfe-commits, mgorny, yaxunl, hiraditya, guansong, jfb, sstefan1, aaron.ballman, llvm-commits Tags: #llvm, #openmp, #clang Differential Revision: https://reviews.llvm.org/D81736	2020-06-23 10:32:32 -04:00
Mikhail Maltsev	3f353a2e5a	[BFloat] Add convert/copy instrinsic support This patch is part of a series implementing the Bfloat16 extension of the Armv8.6-a architecture, as detailed here: https://community.arm.com/developer/ip-products/processors/b/processors-ip-blog/posts/arm-architecture-developments-armv8-6-a Specifically it adds intrinsic support in clang and llvm for Arm and AArch64. The bfloat type, and its properties are specified in the Arm Architecture Reference Manual: https://developer.arm.com/docs/ddi0487/latest/arm-architecture-reference-manual-armv8-for-armv8-a-architecture-profile The following people contributed to this patch: - Alexandros Lamprineas - Luke Cheeseman - Mikhail Maltsev - Momchil Velikov - Luke Geeson Differential Revision: https://reviews.llvm.org/D80928	2020-06-23 14:27:05 +00:00
Matt Arsenault	db777eaea3	AMDGPU/GlobalISel: Fix asserts on non-s32 sitofp/uitofp sources The combine to form cvt_f32_ubyte0 was assuming the source type was always 32-bit, but this needs to tolerate any legal source type.	2020-06-23 10:00:35 -04:00
Xing GUO	8c7775e9a7	[ObjectYAML][ELF] Add support for emitting the .debug_pubnames section. This patch helps add support for emitting the .debug_pubnames section to yaml2elf. Known issues: - Current implementation doesn't support emitting multiple sets of entries. - Doesn't support DWARF64. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D82296	2020-06-23 20:40:33 +08:00
Mikhail Maltsev	9c579540ff	[ARM] BFloat MatMul Intrinsics&CodeGen Summary: This patch adds support for BFloat Matrix Multiplication Intrinsics and Code Generation from __bf16 to AArch32. This includes IR intrinsics. Tests are provided as needed. This patch is part of a series implementing the Bfloat16 extension of the Armv8.6-a architecture, as detailed here: https://community.arm.com/developer/ip-products/processors/b/processors-ip-blog/posts/arm-architecture-developments-armv8-6-a The bfloat type and its properties are specified in the Arm Architecture Reference Manual: https://developer.arm.com/docs/ddi0487/latest/arm-architecture-reference-manual-armv8-for-armv8-a-architecture-profile The following people contributed to this patch: - Luke Geeson - Momchil Velikov - Mikhail Maltsev - Luke Cheeseman - Simon Tatham Reviewers: stuij, t.p.northover, SjoerdMeijer, sdesmalen, fpetrogalli, LukeGeeson, simon_tatham, dmgreen, MarkMurrayARM Reviewed By: MarkMurrayARM Subscribers: MarkMurrayARM, danielkiss, kristof.beyls, hiraditya, cfe-commits, llvm-commits, chill, miyuki Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D81740	2020-06-23 12:06:37 +00:00
hsmahesha	5832950adb	[AMDGPU/MemOpsCluster] Compute `width` for `MIMG` instruction class. Summary: `width` computation is missing for newly added `MIMG` instruction class. Add it. Reviewers: foad, rampitec, arsenm Reviewed By: foad Subscribers: MatzeB, javed.absar, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D81649	2020-06-23 17:32:17 +05:30
Georgii Rymar	1e820e82b1	[DebugInfo/DWARF] - Do not hang when CFI are truncated. Currently when the .eh_frame section is truncated so that CFI instructions can't be read, it is possible to enter an infinite loop. It happens because `CFIProgram::parse` does not handle errors properly. This patch fixes the issue. Differential revision: https://reviews.llvm.org/D82017	2020-06-23 14:39:24 +03:00
Simon Pilgrim	cdceef4a4f	[Analysis] Ensure we include CommandLine.h if we declare any cl::opt flags. NFC.	2020-06-23 12:29:51 +01:00
Sander de Smalen	121e585ec8	[AArch64][SVE] ACLE: Add bfloat16 to struct load/stores. This patch contains: - Support in LLVM CodeGen for bfloat16 types for ld2/3/4 and st2/3/4. - New bfloat16 ACLE builtins for svld(2\|3\|4)[_vnum] and svst(2\|3\|4)[_vnum] Reviewers: stuij, efriedma, c-rhodes, fpetrogalli Reviewed By: fpetrogalli Tags: #clang, #lldb, #llvm Differential Revision: https://reviews.llvm.org/D82187	2020-06-23 12:12:35 +01:00
Simon Pilgrim	36bc10e74a	[Transforms] Ensure we include CommandLine.h if we declare any cl::opt flags	2020-06-23 12:11:51 +01:00
Kerry McLaughlin	5080503174	[SVE][CodeGen] Legalisation of vsetcc with scalable types Summary: Changes SplitVecOp_VSETCC to use getVectorElementCount() Reviewers: sdesmalen, efriedma, dancgr Reviewed By: efriedma Subscribers: david-arm, tschuett, hiraditya, rkruppe, psnobl, huihuiz, cfe-commits, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79167	2020-06-23 11:56:29 +01:00
Roman Lebedev	d57e9aca01	[IndVarSimplify] Don't replace IV user with unsafe loop-invariant (PR45360) Summary: As [[ https://bugs.llvm.org/show_bug.cgi?id=45360 \| PR45360 ]] reports, with new cost-model we can sometimes end up being able to expand `udiv`/`urem` instructions. And that exposes at least one instance of when we do that regardless of whether or not it is safe to do. In this particular case, it's `SimplifyIndvar::replaceIVUserWithLoopInvariant()`. It seems to me, we simply need to check with `isSafeToExpandAt()` first. The test isn't great. I'm not sure how to make it only run `-indvars`. Fixes [[ https://bugs.llvm.org/show_bug.cgi?id=45360 \| PR45360 ]]. Reviewers: mkazantsev, reames, helloqirun Reviewed By: mkazantsev Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82108	2020-06-23 13:53:15 +03:00
Chen Zheng	7ab05d9a60	[PowerPC] fold addi's imm operand to its imm form consumer's displacement This patch adds a function to do following transformation: %0:g8rc_and_g8rc_nox0 = ADDI8 %5:g8rc_and_g8rc_nox0, 144 STD killed %7:g8rc, 16, %0:g8rc_and_g8rc_nox0 :: (store 8 into %ir.8) ------> STD killed %7:g8rc, 160, %5:g8rc_and_g8rc_nox0 :: (store 8 into %ir.8) Reviewed By: steven.zhang Differential Revision: https://reviews.llvm.org/D81723	2020-06-23 06:28:18 -04:00
Simon Pilgrim	4c257bb44e	[X86] truncateVectorWithPACK - fix outdated comment. NFC. We perform PACKSS/PACKUS on AVX512 targets if the calling function wants to.	2020-06-23 10:45:27 +01:00

1 2 3 4 5 ...

135948 Commits