llvm-project

Commit Graph

Author	SHA1	Message	Date
Jonas Paulsson	5ed4d4638f	[CodeGenPrepare] Handle all debug calls in dupRetToEnableTailCallOpts() This patch makes sure that a debug value that is after the bitcast in dupRetToEnableTailCallOpts() is also skipped. The reduced test case is from SPEC-2006 on SystemZ. Review: Vedant Kumar, Wolfgang Pieb https://reviews.llvm.org/D57050 llvm-svn: 352462	2019-01-29 09:03:35 +00:00
Jeremy Morse	27631cc670	Fix an incorrectly configured test. This should have had a target triple in it, my mistake. llvm-svn: 352460	2019-01-29 08:41:44 +00:00
Philip Reames	3cfd351efc	Correct contents for r352453 I had a local change I hadn't realized when submitting that auto-update. As such, the auto-update was wrong. This should fix it, and with that, it's clearly time to stop submitting changes and go to bed. llvm-svn: 352454	2019-01-29 06:40:02 +00:00
Philip Reames	2ddf96db50	[Tests] Regen to remove future test diffs This file appears to have been manually editted at some point after being auto-updated. A future change adjusts this file slightly, and all of the updates makes the diff super confusing. llvm-svn: 352453	2019-01-29 06:34:46 +00:00
Philip Reames	3846b9b443	[Test] Add tests for gather/maked.load demanded elements, and convert the whole file to auto generated checks. llvm-svn: 352452	2019-01-29 05:58:32 +00:00
Max Kazantsev	468ad52213	[SCEV] Take correct loop in AddRec simplification. PR40420 The code of AddRec simplification is using wrong loop when it creates a new AddRecExpr. It should be using AddRecLoop which we have saved and against which all gate checks are made, and not calling AddRec->getLoop() over and over again because AddRec may change and become an AddRecurrency from outer loop during the transform iterations. Considering this change trivial, commiting for postcommit review. llvm-svn: 352451	2019-01-29 05:37:59 +00:00
Max Kazantsev	d4de606ddb	[NFC] Merge failing test from PR40420 llvm-svn: 352450	2019-01-29 05:12:40 +00:00
Teresa Johnson	87cc05055a	Try to make new test more resilient to different orderings New test added in r352441 getting a bot failure which I believe is due to different ordering in the dumping which isn't being handled well. Try to make test more resilient to ordering differences. llvm-svn: 352446	2019-01-29 02:04:01 +00:00
Sam Clegg	b54927cc48	[WebAssembly] Handle more types of uses in WebAssemblyAddMissingPrototypes Previously we were only handling bitcast operations, however prototypeless functions can also appear in other places such as comparisons and as function params. Switch to using replaceAllUsesWith() to replace the prototype-less function uses. This new approach results in some redundant bitcasting but is much simpler and handles all cases. Differential Revision: https://reviews.llvm.org/D56938 llvm-svn: 352445	2019-01-29 00:30:46 +00:00
Thomas Lively	33f87b8aef	[WebAssembly] Expand BUILD_PAIR nodes Reviewers: aheejin Subscribers: dschuff, sbc100, jgravelle-google, hiraditya, sunfish Differential Revision: https://reviews.llvm.org/D57276 llvm-svn: 352442	2019-01-28 23:44:31 +00:00
Teresa Johnson	2f616e479b	[ThinLTO] Add option to dump per-module summary dot graph Summary: I found that there currently isn't a way to invoke exportToDot from the command line for a per-module summary index, and therefore no testing of that case. Add an internal option and use it to test dumping of per module summary indexes. In particular, I am looking at fixing the limitation that causes the aliasee GUID in the per-module summary to be 0, and want to be able to test that change. Reviewers: evgeny777 Subscribers: mehdi_amini, inglorion, eraman, steven_wu, dexonsmith, llvm-commits Differential Revision: https://reviews.llvm.org/D57206 llvm-svn: 352441	2019-01-28 23:43:26 +00:00
Philip Reames	6c5341bc5a	Demanded elements support for vector GEPs GEPs can produce either scalar or vector results. If we're extracting only a subset of the vector lanes, simplifying the operands is helpful in eliminating redundant computation, and (eventually) allowing further optimizations Differential Revision: https://reviews.llvm.org/D57177 llvm-svn: 352440	2019-01-28 23:24:49 +00:00
Sanjay Patel	a36a293a56	[CGP] auto-generate complete checks for add overflow tests; NFC llvm-svn: 352437	2019-01-28 22:07:37 +00:00
Craig Topper	390ac61b93	Recommit r352255 "[SelectionDAG][X86] Don't use SEXTLOAD for promoting masked loads in the type legalizer" This did not cause the buildbot failure it was previously reverted for. Original commit message: I'm not sure why we were using SEXTLOAD. EXTLOAD seems more appropriate since we don't care about the upper bits. This patch changes this and then modifies the X86 post legalization combine to emit a extending shuffle instead of a sign_extend_vector_inreg. Could maybe use an any_extend_vector_inre On AVX512 targets I think we might be able to use a masked vpmovzx and not have to expand this at all. llvm-svn: 352433	2019-01-28 21:38:47 +00:00
Jessica Paquette	2d73ecd0a3	[GlobalISel][AArch64] Add legalization for G_FLOG This adds support for legalizing G_FLOG into a RTLib call. It adds a legalizer test, and updates the existing floating point tests. https://reviews.llvm.org/D57347 llvm-svn: 352429	2019-01-28 21:27:23 +00:00
Sanjay Patel	8965411619	[InstCombine] add another saturating uadd test (no undefs); NFC I forgot that our undef matching hasn't been completed in the previous commit. llvm-svn: 352424	2019-01-28 20:37:18 +00:00
Sanjay Patel	dc543300a9	[InstCombine] add tests for saturating uadd with constant; NFC llvm-svn: 352423	2019-01-28 20:32:48 +00:00
Matt Arsenault	cdd191d9db	AMDGPU: Add DS append/consume intrinsics Since these pass the pointer in m0 unlike other DS instructions, these need to worry about whether the address is uniform or not. This assumes the address is dynamically uniform, and just uses readfirstlane to get a copy into an SGPR. I don't know if these have the same 16-bit add for the addressing mode offset problem on SI or not, but I've just assumed they do. Also includes some misc. changes to avoid test differences between the LDS and GDS versions. llvm-svn: 352422	2019-01-28 20:14:49 +00:00
Jessica Paquette	c49428a97d	[GlobalISel][AArch64] Add instruction selection support for @llvm.log10 This adds instruction selection support for @llvm.log10 in AArch64. It teaches GISel to lower it to a library call, updates the relevant tests, and adds a legalizer test for log10. https://reviews.llvm.org/D57341 llvm-svn: 352418	2019-01-28 19:53:14 +00:00
Scott Linder	b5d6292822	[MC] Do not consider .ifdef/.ifndef as a use This is allowed by GAS and seems correct. Differential Revision: https://reviews.llvm.org/D55439 llvm-svn: 352414	2019-01-28 19:32:08 +00:00
Francis Visoiu Mistrih	556ea7d2e0	[AArch64] Add 'apple-latest' CPU alias The 'apple-latest' alias is supposed to provide a CPU that contains the latest Apple processor model supported by LLVM. This is supposed to be used by tools like lldb to provide a target that supports most of the CPU features. For now, this is mapped to Cyclone. Differential Revision: https://reviews.llvm.org/D56384 llvm-svn: 352412	2019-01-28 19:27:33 +00:00
Jessica Paquette	2e35dc5185	[GlobalISel] Add ISel support for @llvm.lifetime.start and @llvm.lifetime.end This adds ISel support for lifetime markers in opt levels above O0. It also updates the arm64-irtranslator test, and updates some AArch64 tests that use them for added coverage. It also adds a testcase taken from the X86 codegen tests which verified a bug caused by lifetime markers + stack colouring in the past. This is intended to make sure that GISel doesn't re-introduce the bug. (This is basically a straight copy from what SelectionDAG does in SelectionDAGBuilder.cpp) https://reviews.llvm.org/D57187 llvm-svn: 352410	2019-01-28 19:22:29 +00:00
Nikita Popov	8e1a464e6a	[CodeGen][X86] Expand UADDSAT to NOT+UMIN+ADD Followup to D56636, this time handling the UADDSAT case by expanding uadd.sat(a, b) to umin(a, ~b) + b. Differential Revision: https://reviews.llvm.org/D56869 llvm-svn: 352409	2019-01-28 19:19:09 +00:00
Vedant Kumar	1c3694a4d4	[CodeExtractor] Add support for the `swifterror` attribute When passing a `swifterror` argument or alloca as an input to an extraction region, mark the input parameter `swifterror`. llvm-svn: 352408	2019-01-28 19:13:37 +00:00
Jessica Paquette	7db82d7257	[GlobalISel][AArch64] Add instruction selection support for G_FCOS and G_FSIN This contains all of the legalizer changes from D57197 necessary to select G_FCOS and G_FSIN. It also updates several existing IR tests in test/CodeGen/AArch64 that verify that we correctly lower the G_FCOS and G_FSIN instructions. https://reviews.llvm.org/D57197 3/3 llvm-svn: 352402	2019-01-28 18:34:18 +00:00
Jessica Paquette	296f19b3d9	[GlobalISel][AArch64] Add IRTranslator support for G_FCOS and G_FSIN This adds IRTranslator support for the G_FCOS and G_FSIN generic instructions. https://reviews.llvm.org/D57197 2/3 llvm-svn: 352401	2019-01-28 18:34:17 +00:00
Jessica Paquette	9f6afad913	[GlobalISel] Add G_FSIN and G_FCOS generic instructions This introduces generic instrutions for floating point sin and cos, G_FCOS and G_FSIN. It updates the tests, etc. https://reviews.llvm.org/D57197 1/3 llvm-svn: 352400	2019-01-28 18:34:16 +00:00
Simon Pilgrim	2c17512456	[X86][AVX] Remove lowerShuffleByMerging128BitLanes 2-lane restriction First step towards adding support for 64-bit unary "sublane" handling (a bit like lowerShuffleAsRepeatedMaskAndLanePermute). This allows us to add lowerV64I8Shuffle handling. llvm-svn: 352389	2019-01-28 17:02:35 +00:00
Sanjay Patel	94cca60b82	[x86] allow more shuffle splitting to avoid vpermps (PR40434) This is tricky to make optimal: sometimes we're better off using a single wider op, but other times it makes more sense to combine a narrow ops to achieve the same result. This solves the case from: https://bugs.llvm.org/show_bug.cgi?id=40434 There's potentially a similar change for vectors with 64-bit elements, but it needs adjustments similar to rL352333 to avoid creating infinite loops. llvm-svn: 352380	2019-01-28 15:51:34 +00:00
George Rimar	7d6fd6d73d	[llvm-objdump] - Update test after r352366. NFC. Change the column name. llvm-svn: 352379	2019-01-28 15:49:41 +00:00
George Rimar	3168496822	[obj2yaml] - Dump the sh_entsize section field. I faced with the fact that obj2yaml does not dump the sh_entsize field. A problem arose when I tried to dump ELF versioning sections. This is close to what D50235 did, but D50235 did the change for yaml2obj, and now I had to do the same for obj2yaml. Differential revision: https://reviews.llvm.org/D57229 llvm-svn: 352373	2019-01-28 15:05:10 +00:00
Jordan Rupprecht	b2702d6a45	[llvm-objcopy] Fix crash when writing empty binary output Summary: When using llvm-objcopy -O binary and the resulting file will be empty (e.g. removing the only section that would be written, or using --only-keep with a section that doesn't exist/isn't SHF_ALLOC), we crash because FileOutputBuffer expects Size > 0. Add a regression test, and change Buffer to open/truncate the output file in this case. Reviewers: alexshap, jhenderson, jakehehrlich, espindola Reviewed By: alexshap, jhenderson Subscribers: jfb, llvm-commits, emaste, arichardson Differential Revision: https://reviews.llvm.org/D56806 llvm-svn: 352371	2019-01-28 15:02:40 +00:00
Aleksandar Beserminji	6c5dfcb89e	[mips] Support for +abs2008 attribute Instruction abs.[ds] is not generating correct result when working with NaNs for revisions prior mips32r6 and mips64r6. To generate a sequence which always produce a correct result, but also to allow user more control on how his code is compiled, attribute +abs2008 is added, so user can choose legacy or 2008. By default legacy mode is used on revisions prior R6. Mips32r6 and mips64r6 use abs2008 mode by default. Differential Revision: https://reviews.llvm.org/D35983 llvm-svn: 352370	2019-01-28 14:59:30 +00:00
George Rimar	87fa2e66e7	[llvm-objdump] - Print LMAs when dumping section headers. When --section-headers is used, GNU objdump prints both LMA and VMA for sections. llvm-objdump does not do that what makes it's output be slightly inconsistent. Patch teaches llvm-objdump to print LMA/VMA for ELF file formats. The behavior for other formats remains unchanged. Differential revision: https://reviews.llvm.org/D57146 llvm-svn: 352366	2019-01-28 14:11:35 +00:00
Tim Corringham	824ca3f3dd	[AMDGPU] Add intrinsics for 16 bit interpolation Summary: Added the intrinsics llvm.amdgcn.interp.p1.f16() and llvm.amdgcn.interp.p2.f16() and related LIT test. The p1 intrinsic generates code appropriate for both 16 and 32 bank LDS. Reviewers: #amdgpu, dstuttard, arsenm, tpr Reviewed By: #amdgpu, arsenm Subscribers: jvesely, mgorny, arsenm, kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D46754 llvm-svn: 352357	2019-01-28 13:48:59 +00:00
Petar Avramovic	7cecadb9af	[MIPS GlobalISel] Select sub Lower G_USUBO and G_USUBE. Add narrowScalar for G_SUB. Legalize and select G_SUB for MIPS 32. Differential Revision: https://reviews.llvm.org/D53416 llvm-svn: 352351	2019-01-28 12:10:17 +00:00
Jeremy Morse	8ebffb4b82	[DebugInfo][DAG] Avoid re-ordering of DBG_VALUEs This patch improves the placement of DBG_VALUEs when by SelectionDAG, which as documented in PR40427 can go very wrong. At the core of this is ProcessSourceNode, which assumes the last instruction in a BB is the start of the last processed IR instruction, which isn't always true. Instead, use a helper function to call InstrEmitter::EmitNode, that records before-and-after iterators and determines the first of any new instruction created during emission. This is passed to ProcessSourceNode, which can then make more elightened decisions about ordering for DBG_VALUE placement. Differential revision: https://reviews.llvm.org/D57163 llvm-svn: 352350	2019-01-28 12:08:31 +00:00
George Rimar	4c3b297621	[llvm-objdump] - Implement the --adjust-vma option. GNU objdump's help says: "--adjust-vma: Add OFFSET to all displayed section addresses" In real life what it does is a bit more complicated (and IMO not always reasonable. For example, GNU objdump prints not only VMA, but also LMA for sections. And with --adjust-vma it adjusts LMA, but only when a section has relocations. llvm-objsump does not seem to support printing LMAs yet, but GNU's logic anyways does not make sense for me here). This patch tries to adjust VMA. I tried to implement a reasonable approach. I am not adjusting sections that are not allocatable. As, for example, adjusting debug sections VA's and rel[a] sections VA's should not make sense. This behavior seems to be GNU compatible. Differential revision: https://reviews.llvm.org/D57051 llvm-svn: 352347	2019-01-28 10:44:01 +00:00
Diana Picus	574e0c5e32	[ARM GlobalISel] Support integer division for Thumb2 Support G_SDIV, G_UDIV, G_SREM and G_UREM. The only significant difference between arm and thumb mode is that we need to check a different subtarget feature. llvm-svn: 352346	2019-01-28 10:37:30 +00:00
Craig Topper	453150bc18	[X86] Add new variadic avx512 compress/expand intrinsics that use vXi1 types for the mask argument. Remove and autoupgrade the old intrinsics llvm-svn: 352343	2019-01-28 07:03:03 +00:00
Craig Topper	b23d5ccafc	[X86] Add vbmi2 compressstore and expandload tests that aren't fast-isel tests. These got removed when we autoupgraded to target independent intrinsics, but we didn't have coverage anywhere else. The avx512f/avx512vl versions do have coverage. Also move some tests back from the upgrade file that aren't really upgraded. llvm-svn: 352342	2019-01-28 05:42:39 +00:00
Amara Emerson	fd31bf95c1	[AArch64][GlobalISel] Teach RBS about G_FNEG default mapping. llvm-svn: 352340	2019-01-28 03:21:14 +00:00
Amara Emerson	0bfa2faccc	[AArch64][GlobalISel] Add some missing vector support for FP arithmetic ops. Moved the fneg lowering legalization test from AArch64 to X86, as we want to specify that it's already legal. llvm-svn: 352338	2019-01-28 02:28:22 +00:00
Amara Emerson	92ffb305cc	[AArch64][GlobalISel] Add some vector support for fp <-> int conversions. Some unrelated, but benign, test changes as well due to the test update script. llvm-svn: 352337	2019-01-28 02:27:59 +00:00
Matt Arsenault	cfca2a7adf	GlobalISel: Don't reduce elements for atomic load/store This is invalid for the same reason as in the narrowScalar handling for load. llvm-svn: 352334	2019-01-27 22:36:24 +00:00
Sanjay Patel	ebe6b43aec	[x86] add restriction for lowering to vpermps This transform was added with rL351346, and we had an escape for shufps, but we also want one for unpckps vs. vpermps because vpermps doesn't take an immediate shuffle index operand. llvm-svn: 352333	2019-01-27 21:53:33 +00:00
Sanjay Patel	9ceaf2932a	[x86] add tests for extract/extract/unpack; NFC llvm-svn: 352331	2019-01-27 21:34:51 +00:00
Simon Pilgrim	670a6971f8	[X86][SSE] Add UNDEF handling to combineSelect ISD::USUBSAT matching (PR40083) llvm-svn: 352330	2019-01-27 21:01:23 +00:00
Simon Pilgrim	e5cf884018	[X86][SSE] Add UNDEF test case for combineSelect ISD::USUBSAT matching (PR40083) llvm-svn: 352329	2019-01-27 20:52:34 +00:00
Simon Pilgrim	f10b6623cc	[X86][SSE] Permit UNDEFs in combineAddToSUBUS matching (PR40083) llvm-svn: 352328	2019-01-27 20:36:37 +00:00

1 2 3 4 5 ...

58936 Commits