llvm-project

Commit Graph

Author	SHA1	Message	Date
Florian Hahn	4837daf883	[DSE,MSSA] Check if Def is removable only wen we try to remove it. Non-removable MemoryDefs can still eliminate other defs. Update the isRemovable checks to only candidates for removal.	2020-06-25 14:01:10 +01:00
Tyker	c95ffadb24	[AssumeBundles] Use operand bundles to encode alignment assumptions Summary: NOTE: There is a mailing list discussion on this: http://lists.llvm.org/pipermail/llvm-dev/2019-December/137632.html Complemantary to the assumption outliner prototype in D71692, this patch shows how we could simplify the code emitted for an alignemnt assumption. The generated code is smaller, less fragile, and it makes it easier to recognize the additional use as a "assumption use". As mentioned in D71692 and on the mailing list, we could adopt this scheme, and similar schemes for other patterns, without adopting the assumption outlining. Reviewers: hfinkel, xbolva00, lebedev.ri, nikic, rjmccall, spatel, jdoerfert, sstefan1 Reviewed By: jdoerfert Subscribers: yamauchi, kuter, fhahn, merge_guards_bot, hiraditya, bollu, rkruppe, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D71739	2020-06-25 12:59:44 +02:00
Sam Tebbs	187f627a50	[ARM] Allow tail predication on sadd_sat and uadd_sat intrinsics This patch stops the sadd_sat and uadd_sat intrinsics from blocking tail predication. Differential revision: https://reviews.llvm.org/D82377	2020-06-25 11:54:29 +01:00
Simon Pilgrim	e367c0081c	FPEnv.h - reduce includes to forward declarations. NFC. Ensure FPEnv.cpp includes FPEnv.h first to check for hidden dependencies.	2020-06-25 11:40:45 +01:00
Piotr Sobczak	0045786f14	[AMDGPU] Select s_cselect Summary: Add patterns to select s_cselect in the isel. Handle more cases of implicit SCC accesses in si-fix-sgpr-copies to allow new patterns to work. Subscribers: arsenm, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, asbirlea, kerbowa, llvm-commits Tags: #llvm Re-commit D81925 with a bugfix D82370. Differential Revision: https://reviews.llvm.org/D81925 Differential Revision: https://reviews.llvm.org/D82370	2020-06-25 10:38:23 +02:00
David Sherwood	ee26a31e7b	[SVE] Make ConstantFoldGetElementPtr work for scalable vectors of indices This patch fixes a compiler crash that was hit when trying to simplify the following code: getelementptr [2 x i64], [2 x i64]* null, i64 0, <vscale x 2 x i64> zeroinitializer For the case where we have a null pointer value like above, we just need to ensure we don't assume the indices are always fixed width. Differential Revision: https://reviews.llvm.org/D82183	2020-06-25 07:28:19 +01:00
Max Kazantsev	1eeb714787	[InstCombine] Combine select & Phi by same condition This patch transforms ``` p = phi [x, y] s = select cond, z, p ``` with ``` s = phi[x, z] ``` if we can prove that the Phi node takes values basing on select's condition. Differential Revision: https://reviews.llvm.org/D82072 Reviewed By: nikic	2020-06-25 10:44:10 +07:00
Craig Topper	a5041987ed	[X86] Emit a reg-reg copy for fast isel of vector bitcasts. Previously we just updated a map and moved on. But it possible we cached known bits information with the vreg that can be used by another basic block. If the other basic block has a different view of the VT these known bits won't make sense. By emitting a copy we ensure we have different vregs before and after the bitcast. This prevents the known bits from being used with the wrong type. Differential Revision: https://reviews.llvm.org/D82517	2020-06-24 20:15:21 -07:00
Wang, Pengfei	b2eb1c5793	[X86] Fix a typo error. Summary: This will result opcode MULX32Hrm been emitted to MULX32Hrr. Reviewed by: craig.topper Differential Revision: https://reviews.llvm.org/D82472	2020-06-25 10:06:27 +08:00
Amara Emerson	090c108d04	Don't inline dynamic allocas that simplify to huge static allocas. Some sequences of optimizations can generate call sites which may never be executed during runtime, and through constant propagation result in dynamic allocas being converted to static allocas with very large allocation amounts. The inliner tries to move these to the caller's entry block, resulting in the stack limits being reached/bypassed. Avoid inlining functions if this would result. The threshold of 64k currently doesn't get triggered on the test suite with an -Os LTO build on arm64, care should be taken in changing this in future to avoid needlessly pessimising inlining behaviour. Differential Revision: https://reviews.llvm.org/D81765	2020-06-24 17:39:03 -07:00
Xing GUO	93bc571d47	[DWARFYAML][debug_gnu_*] 'Descriptor' field should be 1-byte. NFC. The 'Descriptor' field of .debug_gnu_pubnames and .debug_gnu_pubtypes section should be 1-byte rather than 4-byte. This patch helps resolve this issue.	2020-06-25 08:21:13 +08:00
Kirill Naumov	7f094f7f9d	[InlineCost] PrinterPass prints constants to which instructions are simplified This patch enables printing of constants to see which instructions were constant-folded. Needed for tests and better visiual analysis of inliner's work. Reviewers: apilipenko, mtrofin, davidxl, fedor.sergeev Reviewed By: mtrofin Differential Revision: https://reviews.llvm.org/D81024	2020-06-24 22:52:31 +00:00
Scott Linder	4d81aec40c	[MIR] Fix CFI_INSTRUCTION escape printing Summary: The printer seems to intend to not print the trailing comma but has a copy-paste error for the last value in the escape, and the parser enforces having no trailing comma, but somehow a test was never included to actually confirm it. Reviewers: thegameg, arsenm Reviewed By: thegameg, arsenm Subscribers: wdng, arsenm, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82478	2020-06-24 18:15:28 -04:00
Roman Lebedev	8911a35180	[SROA] convertValue(): we can have <N x iK*> to <M x iQ> cast Provided test case crashes otherwise. Much like to the opposite case.	2020-06-25 00:58:54 +03:00
Roman Lebedev	07a23c06dd	[SROA] convertValue(): we can have <N x iK> to <M x iQ*> cast Provided test case crashes otherwise. If NewTy is already DL.getIntPtrType(NewTy), CreateBitCast() won't actually create any bitcast, so we are better off just doing the general thing.	2020-06-25 00:58:53 +03:00
Roman Lebedev	2b8d706b19	[IR] GetUnderlyingObject(), stripPointerCastsAndOffsets(): don't crash on `bitcast <1 x i8> to i8` I'm not sure how to write standalone tests for each of two changes here. If either one of these two fixes is missing, the test fill crash.	2020-06-25 00:58:53 +03:00
Roman Lebedev	381054a989	[InstCombine] visitBitCast(): do not crash on weird `bitcast <1 x i8> to i8` Even if we know that RHS of a bitcast is a pointer, we can't assume LHS is, because it might be a single-element vector of pointer.	2020-06-25 00:58:53 +03:00
Roman Lebedev	1e2691fe23	[NFCI] SCEV: promote ScalarEvolutionDivision into an publicly usable class This makes it usable from outside of SCEV, while previously it was internal to the ScalarEvolution.cpp In particular, i want to use it in an WIP alloca promotion helper pass, to analyze if some SCEV is a multiple of some other SCEV.	2020-06-25 00:58:53 +03:00
Yuanfang Chen	ebc88811b5	Remove Passes dependency on CodeGen The dependency was introduced in `5134020ea6`. The only functional change from this removal would be the new PM interface for the two codegen passes. This is not necessary since we don't have codegen pipeline using new PM yet. This removal is to break the potential circular dependency between Passes and CodeGen once the codegen begins to gain new PM support.	2020-06-24 14:52:46 -07:00
Fangrui Song	c6d01ed046	[TextAPI/MachO] Fix style issues. NFC See https://llvm.org/docs/CodingStandards.html#use-namespace-qualifiers-to-implement-previously-declared-functions	2020-06-24 14:43:45 -07:00
Mitch Phillips	10045cbe01	Revert "[BitcodeReader] Fix DelayedShuffle handling for ConstantExpr shuffles." Patch has a memory leak bug that broke the ASan buildbots. More info available at: https://reviews.llvm.org/D80330 This reverts commit `b5740105d2`.	2020-06-24 14:40:45 -07:00
Stefan Agner	b7d41a11cd	[ARM] Make cp10 and cp11 usage a warning The ARM ARM considers p10/p11 valid arguments for MCR/MRC instructions. MRC instructions with p10 arguments are also used in kernel code which is shared for different architectures. Turn usage of p10/p11 to warnings for ARMv7/ARMv8-M. Reviewers: rengolin, olista01, t.p.northover, efriedma, psmith, simon_tatham Reviewed By: simon_tatham Subscribers: hiraditya, danielkiss, jcai19, tpimh, nickdesaulniers, peter.smith, javed.absar, kristof.beyls, jdoerfert, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D59733	2020-06-24 23:37:54 +02:00
Kirill Naumov	6a5d7d498c	[InlineCost] InlineCostAnnotationWriterPass introduced This class allows to see the inliner's decisions for better optimization verifications and tests. To use, use flag "-passes="print<inline-cost>"". This is the second attempt to integrate the patch. The problem from the first try has been discussed and fixed in D82205. Reviewers: apilipenko, mtrofin, davidxl, fedor.sergeev Reviewed By: mtrofin Differential revision: https://reviews.llvm.org/D81743	2020-06-24 21:27:07 +00:00
Amy Kwan	d82f26cc4b	[PowerPC][Power10] Implement Count Leading/Trailing Zeroes Builtins under bit Mask in LLVM/Clang This patch implements builtins for the following prototypes: unsigned long long __builtin_cntlzdm (unsigned long long, unsigned long long) unsigned long long __builtin_cnttzdm (unsigned long long, unsigned long long) vector unsigned long long vec_cntlzm (vector unsigned long long, vector unsigned long long) vector unsigned long long vec_cnttzm (vector unsigned long long, vector unsigned long long) Differential Revision: https://reviews.llvm.org/D80941	2020-06-24 16:03:45 -05:00
Jinsong Ji	81b2d1d112	[NFC][PowerPC] Fix some typos in MachineCombiner comments	2020-06-24 20:40:57 +00:00
Christopher Tetreault	3d123e17d8	[SVE] Remove calls to VectorType::getNumElements from IPO Reviewers: efriedma, jdoerfert, sdesmalen, kmclaughlin Reviewed By: efriedma, jdoerfert Subscribers: tschuett, hiraditya, rkruppe, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82219	2020-06-24 13:38:51 -07:00
dfukalov	7ddee0922f	[NFCI][CostModel] Add const to Value*. Summary: Get back `const` partially lost in one of recent changes. Additionally specify explicit qualifiers in few places. Reviewers: samparker Reviewed By: samparker Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82383	2020-06-24 23:16:08 +03:00
Kirill Naumov	ca899bf90a	[InlineCost] Added InlineCostCallAnalyzer::print() For the upcoming changes, we need to have an ability to dump InlineCostCallAnalyzer info in non-debug builds as well. Reviewed-By: mtrofin Differential Revision: https://reviews.llvm.org/D82205	2020-06-24 20:07:27 +00:00
Florian Hahn	35bb9bfbb0	[SLP] Limit GEP lists based on width of index computation. D68667 introduced a tighter limit to the number of GEPs to simplify together. The limit was based on the vector element size of the pointer, but the pointers themselves are not actually put in vectors. IIUC we try to vectorize the index computations here, so we should base the limit on the vector element size of the computation of the index. This restores the test regression on AArch64 and also restores the vectorization for a important pattern in SPEC2006/464.h264ref on AArch64 (@test_i16_extend). We get a large benefit from doing a single load up front and then processing the index computations in vectors. Note that we could probably even further improve the AArch64 codegen, if we would do zexts to i32 instead of i64 for the sub operands and then do a single vector sext on the result of the subtractions. AArch64 provides dedicated vector instructions to do so. Sketch of proof in Alive: https://alive2.llvm.org/ce/z/A4xYAB Reviewers: craig.topper, RKSimon, xbolva00, ABataev, spatel Reviewed By: ABataev, spatel Differential Revision: https://reviews.llvm.org/D82418	2020-06-24 19:56:53 +01:00
Simon Pilgrim	6c6adde84f	InstCombineInternal.h - reduce AliasAnalysis.h include to forward declaration. NFC. Fix implicit include dependencies in source files and replace legacy AliasAnalysis typedef with AAResults where necessary.	2020-06-24 19:27:38 +01:00
Simon Pilgrim	a53dddb3e9	Local.h - reduce includes to forward declarations. NFC. Fix implicit include dependencies in source files and replace legacy AliasAnalysis typedef with AAResults where necessary.	2020-06-24 19:27:37 +01:00
tatz.j@northeastern.edu	af5e61bf4f	[NVPTX] Fix for NVPTX module asm regression Currently module asm ends up emitted twice and at the wrong place in the PTX. This patch moves module asm generation into emitStartOfAsmFile() which puts at the correct location in the generated PTX. Differential Revision: https://reviews.llvm.org/D82280	2020-06-24 11:17:09 -07:00
Teresa Johnson	d291bd510e	[WPD] Allow virtual calls to be analyzed with multiple type tests Summary: In D52514 I had fixed a bug with WPD after indirect call promotion, by checking that a type test being analyzed dominates potential virtual calls. With that fix I included a small effiency enhancement to avoid processing a devirt candidate multiple times (when there are multiple type tests). This latter change wasn't in response to any measured efficiency issues, it was merely theoretical. Unfortuantely, it turns out to limit optimization opportunities after inlining. Specifically, consider code that looks like: class A { virtual void foo(); }; class B : public A { void foo(); } void callee(A a) { a->foo(); // Call 1 } void caller(B b) { b->foo(); // Call 2 callee(b); } After inlining callee into caller, because of the existing call to b->foo() in caller there will be 2 type tests in caller for the vtable pointer of b: the original type test against B from Call 2, and the inlined type test against A from Call 1. If the code was compiled with -fstrict-vtable-pointers, then after optimization WPD will see that both type tests are associated with the inlined virtual Call 1. With my earlier change to only process a virtual call against one type test, we may only consider virtual Call 1 against the base class A type test, which can't be devirtualized. With my change here to remove this restriction, it also gets considered for the type test against the derived class B type test, where it can be devirtualized. Note that if caller didn't include it's own earlier virtual call b->foo() we will not be able to devirtualize after inlining callee even after this fix, since there would not be a type test against B in the IR. As a future enhancement we can consider inserting type tests at call sites that pass pointers to classes with virtual calls, to enable context-sensitive devirtualization after inlining. Reviewers: pcc, vitalybuka, evgeny777 Subscribers: Prazek, hiraditya, steven_wu, dexonsmith, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79235	2020-06-24 10:51:24 -07:00
Craig Topper	8dc92142e3	[X86] Replace PROC macros with an enum and a lookup table of processor information. This patch removes the PROC macro in favor of CPUKind enum and a table that contains information about CPUs. The current information in the table is the CPU name, CPUKind enum value, key feature for target multiversioning, and Is64Bit capable. For the strings that are aliases, I've duplicated the information in the table. This means there are more rows in the table than CPUKind enums. This replaces multiple StringSwitch's with loops through the table. They are linear searches due to the table being more logically ordered than alphabetical. The StringSwitch's would have also been linear. I've used StringLiteral on the strings in the table so we can quickly check the length while searching. I contemplated having a CPUKind for each string so there was a 1:1 mapping, but didn't want to spread more names to the places that use the enum. My ultimate goal here is to store the features for each CPU as a bitset within the table. Hoping to use constexpr to make this composable so we can group features and inherit them. After the table lookup we can turn the bitset into a list of strings for the frontend. The current switch we have for selecting features for CPUs has become difficult to maintain while trying to express inheritance relationships. Differential Revision: https://reviews.llvm.org/D82414	2020-06-24 10:46:25 -07:00
Simon Pilgrim	c18b753686	LoopUtils.h - reduce AliasAnalysis.h include to forward declarations. NFC. Fix implicit include dependencies in source files and replace legacy AliasAnalysis typedef with AAResults where necessary.	2020-06-24 17:58:38 +01:00
dstuttar	e8775c8d81	[AMDGPU] Make sure to fix implicit operands on insertBranch Summary: Without fixImplicitOperands we may end up creating default implicit operands that are the wrong wave size Includes simple test that provokes insertBranch in the correct way to expose the issue being fixed. Change-Id: I92bdcdee9fcb7b4d91529b84e76a48ac8218483e Subscribers: arsenm, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, tpr, t-tye, hiraditya, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82459	2020-06-24 16:50:48 +01:00
Matt Arsenault	a448670752	AMDGPU/GlobalISel: Legalize 64-bit G_SDIV/G_SREM Now all the divisions should be complete, although we should fix emitting the entire common part for div/rem when you use both.	2020-06-24 11:39:45 -04:00
Matt Arsenault	b5c4e6c148	AMDGPU/GlobalISel: Invert parameter for div/rem lowering function	2020-06-24 11:39:45 -04:00
Ikhlas Ajbar	085701b8b0	[Hexagon] Reducing minimum alignment requirement This patch reduces minimum alignment requirement to 1 byte for arguments passed by value on stack.	2020-06-24 10:28:37 -05:00
Matt Arsenault	778351df77	Revert "[AMDGPU] Enable compare operations to be selected by divergence" This reverts commit `521ac0b5ce`. Reported to break thousands of piglit tests.	2020-06-24 11:21:30 -04:00
Arthur Eubanks	b5979a383a	[NewPM] Add SimpleLoopUnswitchPass to PassRegistry.def Summary: Seems to just be missing from PassRegistry.def. Makes the number of check-llvm failures under new PM go from 2619 to 2581. Reviewers: hans, ychen, asbirlea, leonardchan Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82422	2020-06-24 08:20:34 -07:00
Arthur Eubanks	fcf0741262	[NewPM] Handle -simplifycfg in opt Summary: -simplifycfg is the legacy pass name for SimplifyCFGPass. There is already -simplify-cfg in FUNCTION_PASS_WITH_PARAMS which handles options for SimplifyCFGPass. Maybe that should be renamed to -simplifycfg as well? This reduces the number of check-llvm failures under NewPM from 2619 to 2392. Reviewers: hans, leonardchan, asbirlea, ychen Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82421	2020-06-24 08:20:08 -07:00
Mircea Trofin	bdceefe95b	[llvm] Release-mode ML InlineAdvisor Summary: This implementation uses a pre-trained model which is statically compiled into a native function. RFC: http://lists.llvm.org/pipermail/llvm-dev/2020-April/140763.html Reviewers: davidxl, jdoerfert, dblaikie Subscribers: mgorny, eraman, hiraditya, arphaman, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D81515	2020-06-24 08:18:42 -07:00
Sanjay Patel	a0f967418f	[VectorCombine] give invalid index value a name; NFC	2020-06-24 11:10:36 -04:00
Matt Arsenault	c5d240093b	WebAssembly: Don't store MachineFunction in MachineFunctionInfo Soon it will be disallowed to depend on MachineFunction state in the constructor. This was only being used to get the MachineRegisterInfo for an assert, which I'm not sure is necessarily worth it. I would think any missing defs would be caught by the verifier later instead.	2020-06-24 10:52:58 -04:00
Tim Corringham	c3b3b999ec	[AMDGPU] Avoid redundant mode register writes Summary: The SIModeRegister pass attempts to generate the minimal number of writes to the mode register. However it was failing to correctly deal with some loops, resulting in some redundant setreg instructions being inserted. This change amends the pass to avoid generating these redundant instructions. Subscribers: arsenm, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82215	2020-06-24 14:11:29 +01:00
Simon Pilgrim	bf77c7ef2d	Loads.h - reduce AliasAnalysis.h include to forward declarations. NFC. Fix implicit include dependencies in source files.	2020-06-24 13:49:04 +01:00
Florian Hahn	4e62c6359c	[DSE] Eliminate stores at the end of the function. This patch add support for eliminating MemoryDefs that do not have any aliasing users, which indicates that there are no reads/writes to the memory location until the end of the function. To eliminate such defs, we have to ensure that the underlying object is not visible in the caller and does not escape via returning. We need a separate check for that, as InvisibleToCaller does not consider returns. Reviewers: dmgreen, rnk, efriedma, bryant, asbirlea, Tyker, george.burgess.iv Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D72631	2020-06-24 12:58:20 +01:00
sstefan1	0f426935bb	[OpenMPOpt] ICV macro definitions Summary: This defines some basic information about ICVs in `OMPKinds.def`. We also emit remarks with initial values for each function (which are default for now) as a way to test this. Reviewers: jdoerfert, JonChesterfield, hamax97, jhuber6 Subscribers: yaxunl, hiraditya, guansong, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82193	2020-06-24 13:43:35 +02:00
Simon Pilgrim	90ad37646f	ObjCARC.h - remove unnecessary includes. NFC. Add implicit InstIterator.h dependency in ObjCARCContract.cpp	2020-06-24 12:30:59 +01:00

1 2 3 4 5 ...

135991 Commits