llvm-project

Commit Graph

Author	SHA1	Message	Date
Matt Arsenault	18619afe1d	GlobalISel: Fix narrowScalar for load/store with different mem size This was ignoring the memory size, and producing multiple loads/stores if the operand size was different from the memory size. I assume this is the intent of not having an explicit G_ANYEXTLOAD (although I think that would probably be better). llvm-svn: 352523	2019-01-29 18:13:02 +00:00
Sanjay Patel	cd6b240303	[x86] add tests for vector bool math; NFC llvm-svn: 352520	2019-01-29 17:00:47 +00:00
Sanjay Patel	22dd34b0ec	[AArch64] add tests for vector bool math; NFC llvm-svn: 352519	2019-01-29 17:00:07 +00:00
Andrea Di Biagio	815cdbff29	[X86][Btver2] Improved latency/throughput model for scalar int-to-float conversions. Account for bypass delays when computing the latency of scalar int-to-float conversions. On Jaguar we need to account for an extra 6cy latency (see AMD fam16h SOG). This patch also fixes the number of micro opcodes for the register-memory variants of scalar int-to-float conversions. Differential Revision: https://reviews.llvm.org/D57148 llvm-svn: 352518	2019-01-29 16:47:27 +00:00
Sanjay Patel	2e87df9112	[InstCombine] regenerate test checks; NFC llvm-svn: 352517	2019-01-29 16:44:05 +00:00
Sanjay Patel	f044d1884d	[InstCombine] add tests for ext-of-bool + add/sub; NFC We should choose one of these as canonical: %z = zext i1 %cmp to i32 %r = sub i32 %x, %z => %s = sext i1 %cmp to i32 %r = add i32 %x, %s The test comments assume that the zext form is better, but we can adjust that if we decide to go the other way. llvm-svn: 352515	2019-01-29 16:39:23 +00:00
James Y Knight	5d71fc5d7b	Adjust documentation for git migration. This fixes most references to the paths: llvm.org/svn/ llvm.org/git/ llvm.org/viewvc/ github.com/llvm-mirror/ github.com/llvm-project/ reviews.llvm.org/diffusion/ to instead point to https://github.com/llvm/llvm-project. This is not a trivial substitution, because additionally, all the checkout instructions had to be migrated to instruct users on how to use the monorepo layout, setting LLVM_ENABLE_PROJECTS instead of checking out various projects into various subdirectories. I've attempted to not change any scripts here, only documentation. The scripts will have to be addressed separately. Additionally, I've deleted one document which appeared to be outdated and unneeded: lldb/docs/building-with-debug-llvm.txt Differential Revision: https://reviews.llvm.org/D57330 llvm-svn: 352514	2019-01-29 16:37:27 +00:00
Jordan Rupprecht	c892741e74	[llvm-objcopy] Implement --set-section-flags. Summary: --set-section-flags is used to change the section flags (e.g. SHF_ALLOC) for given sections. The flags allowed are the same from the existing --rename-section=.old=.new[,flags] feature. Additionally, make sure that --set-section-flag cannot be used with --rename-section (either the source or destination), since --rename-section accepts flags. This avoids ambiguity for something like "--rename-section=.foo=.bar,alloc --set-section-flag=.bar,code". Reviewers: jhenderson, jakehehrlich, alexshap, espindola Reviewed By: jhenderson, jakehehrlich Subscribers: llvm-commits, emaste, arichardson Differential Revision: https://reviews.llvm.org/D57198 llvm-svn: 352505	2019-01-29 15:05:38 +00:00
Ayonam Ray	a1f6973ade	Reversing the checkin for version 352484 as tests are failing. llvm-svn: 352504	2019-01-29 15:00:50 +00:00
Neil Henning	0799352026	[AMDGPU] Fix a weird WWM intrinsic issue. I found a really strange WWM issue through a very convoluted shader that essentially boils down to a bug in SIInstrInfo where canReadVGPR did not correctly identify that WWM is like a copy and can have a VGPR as its source. Differential Revision: https://reviews.llvm.org/D56002 llvm-svn: 352500	2019-01-29 14:28:17 +00:00
Ayonam Ray	4272af9b3e	[CodeGen] Omit range checks from jump tables when lowering switches with unreachable default During the lowering of a switch that would result in the generation of a jump table, a range check is performed before indexing into the jump table, for the switch value being outside the jump table range and a conditional branch is inserted to jump to the default block. In case the default block is unreachable, this conditional jump can be omitted. This patch implements omitting this conditional branch for unreachable defaults. Review ID: D52002 Reviewers: Hans Wennborg, Eli Freidman, Roman Lebedev llvm-svn: 352484	2019-01-29 12:01:32 +00:00
Simon Pilgrim	4293ad8ab2	[X86] Add PR40483 test case llvm-svn: 352480	2019-01-29 10:58:42 +00:00
Dan Gohman	4684f824d4	[WebAssembly] Re-enable main-function signature rewriting Re-enable the code to rewrite main-function signatures into "int main(int argc, char *argv[])", but limited to only handling the case of "int main(void)", so that it doesn't silently strip an argument in the "int main(int argc, char *argv[], char *envp[])" case. This allows main to be called by C startup code, since WebAssembly requires caller and callee signatures to match, so it can't rely on passing main a different number of arguments than it expects. Differential Revision: https://reviews.llvm.org/D57323 llvm-svn: 352479	2019-01-29 10:53:42 +00:00
Simon Pilgrim	06a342b2d6	[X86] Fix linux32 pic tests to use correct relocation model (PR39684) Differential Revision: https://reviews.llvm.org/D57301 llvm-svn: 352476	2019-01-29 10:41:48 +00:00
David Green	54b0115547	[ARM] Use sub for negative offset load/store in thumb1 This attempts to optimise negative values used in load/store operands a little. We currently try to select them as rr, materialising the negative constant using a MOV/MVN pair. This instead selects ri with an immediate of 0, forcing the add node to become a simpler sub. Differential Revision: https://reviews.llvm.org/D57121 llvm-svn: 352475	2019-01-29 10:40:31 +00:00
Simon Pilgrim	0b7fce6d72	[X86] Regenerate abi-isel.ll test Adds note requested in D57301 and fixes some missing GOTPCREL addressmath checks llvm-svn: 352474	2019-01-29 10:39:02 +00:00
David Green	5c33c5da1a	[ARM] Add extra testcases for D57121. NFC llvm-svn: 352472	2019-01-29 10:25:56 +00:00
Jeremy Morse	ba467024f4	Remove 'XFAIL: powerpc64' from a debuginfo test This test started XPASSing with r352467, and the change in behaviour performed by that patch does appear to fix the cause of the original XFAIL (missing FrameIndex DBG_VALUE), which I've replicated locally with -mtriple=powerpc64--. I'll write this up in PR21881 which documents the XFAIL, and seek confirmation I haven't overlooked something here. llvm-svn: 352471	2019-01-29 10:23:43 +00:00
Bjorn Pettersson	d014d576a9	[IPCP] Don't crash due to arg count/type mismatch between caller/callee Summary: This patch avoids an assert in IPConstantPropagation when there is an argument count/type mismatch between the caller and the callee. While this is actually UB on C-level (clang emits a warning), the IR verifier seems to accept it. I'm not sure what other frontends/languages might think about this, so simply bailing out to avoid hitting an assert (in CallSiteBase<>::getArgOperand or Value::doRAUW) seems like a simple solution. The problem is exposed by the fact that AbstractCallSites will look through a bitcast at the callee position of a call/invoke. Reviewers: jdoerfert, reames, efriedma Reviewed By: jdoerfert, efriedma Subscribers: eli.friedman, efriedma, llvm-commits Differential Revision: https://reviews.llvm.org/D57052 llvm-svn: 352469	2019-01-29 10:19:44 +00:00
Jeremy Morse	66ac86b58d	[DebugInfo][DAG] Process FrameIndex dbg.values unconditionally A FrameIndex should be valid throughout a block regardless of what instructions get selected in that block -- therefore we shouldn't harness dbg.values that refer to FrameIndexes to an SDNode. There are numerous codegen reasons why an SDNode never appears or doesn't become a location that a DBG_VALUE can refer to. None of them actually affect the variable location. Therefore, before any other tests to encode dbg_values in a SelectionDAG, identify FrameIndex operands and encode them unattached to any SDNode. Differential Revision: https://reviews.llvm.org/D57328 llvm-svn: 352467	2019-01-29 09:40:05 +00:00
Martin Storsjo	f5884d255e	[COFF, ARM64] Don't put jump table into a separate COFF section for EK_LabelDifference32 Windows ARM64 has PIC relocation model and uses jump table kind EK_LabelDifference32. This produces jump table entry as ".word LBB123 - LJTI1_2" which represents the distance between the block and jump table. A new relocation type (IMAGE_REL_ARM64_REL32) is needed to do the fixup correctly if they are in different COFF section. This change saves the jump table to the same COFF section as the associated code. An ideal fix could be utilizing IMAGE_REL_ARM64_REL32 relocation type. Patch by Tom Tan! Differential Revision: https://reviews.llvm.org/D57277 llvm-svn: 352465	2019-01-29 09:36:48 +00:00
Jonas Paulsson	5ed4d4638f	[CodeGenPrepare] Handle all debug calls in dupRetToEnableTailCallOpts() This patch makes sure that a debug value that is after the bitcast in dupRetToEnableTailCallOpts() is also skipped. The reduced test case is from SPEC-2006 on SystemZ. Review: Vedant Kumar, Wolfgang Pieb https://reviews.llvm.org/D57050 llvm-svn: 352462	2019-01-29 09:03:35 +00:00
Jeremy Morse	27631cc670	Fix an incorrectly configured test. This should have had a target triple in it, my mistake. llvm-svn: 352460	2019-01-29 08:41:44 +00:00
Philip Reames	3cfd351efc	Correct contents for r352453 I had a local change I hadn't realized when submitting that auto-update. As such, the auto-update was wrong. This should fix it, and with that, it's clearly time to stop submitting changes and go to bed. llvm-svn: 352454	2019-01-29 06:40:02 +00:00
Philip Reames	2ddf96db50	[Tests] Regen to remove future test diffs This file appears to have been manually edited at some point after being auto-updated. A future change adjusts this file slightly, and all of the updates make the diff super confusing. llvm-svn: 352453	2019-01-29 06:34:46 +00:00
Philip Reames	3846b9b443	[Test] Add tests for gather/masked.load demanded elements, and convert the whole file to auto generated checks. llvm-svn: 352452	2019-01-29 05:58:32 +00:00
Max Kazantsev	468ad52213	[SCEV] Take correct loop in AddRec simplification. PR40420 The code of AddRec simplification is using the wrong loop when it creates a new AddRecExpr. It should be using AddRecLoop which we have saved and against which all gate checks are made, and not calling AddRec->getLoop() over and over again because AddRec may change and become an AddRecurrency from outer loop during the transform iterations. Considering this change trivial, committing for postcommit review. llvm-svn: 352451	2019-01-29 05:37:59 +00:00
Max Kazantsev	d4de606ddb	[NFC] Merge failing test from PR40420 llvm-svn: 352450	2019-01-29 05:12:40 +00:00
Teresa Johnson	87cc05055a	Try to make new test more resilient to different orderings New test added in r352441 getting a bot failure which I believe is due to different ordering in the dumping which isn't being handled well. Try to make test more resilient to ordering differences. llvm-svn: 352446	2019-01-29 02:04:01 +00:00
Sam Clegg	b54927cc48	[WebAssembly] Handle more types of uses in WebAssemblyAddMissingPrototypes Previously we were only handling bitcast operations, however prototypeless functions can also appear in other places such as comparisons and as function params. Switch to using replaceAllUsesWith() to replace the prototype-less function uses. This new approach results in some redundant bitcasting but is much simpler and handles all cases. Differential Revision: https://reviews.llvm.org/D56938 llvm-svn: 352445	2019-01-29 00:30:46 +00:00
Thomas Lively	33f87b8aef	[WebAssembly] Expand BUILD_PAIR nodes Reviewers: aheejin Subscribers: dschuff, sbc100, jgravelle-google, hiraditya, sunfish Differential Revision: https://reviews.llvm.org/D57276 llvm-svn: 352442	2019-01-28 23:44:31 +00:00
Teresa Johnson	2f616e479b	[ThinLTO] Add option to dump per-module summary dot graph Summary: I found that there currently isn't a way to invoke exportToDot from the command line for a per-module summary index, and therefore no testing of that case. Add an internal option and use it to test dumping of per module summary indexes. In particular, I am looking at fixing the limitation that causes the aliasee GUID in the per-module summary to be 0, and want to be able to test that change. Reviewers: evgeny777 Subscribers: mehdi_amini, inglorion, eraman, steven_wu, dexonsmith, llvm-commits Differential Revision: https://reviews.llvm.org/D57206 llvm-svn: 352441	2019-01-28 23:43:26 +00:00
Philip Reames	6c5341bc5a	Demanded elements support for vector GEPs GEPs can produce either scalar or vector results. If we're extracting only a subset of the vector lanes, simplifying the operands is helpful in eliminating redundant computation, and (eventually) allowing further optimizations Differential Revision: https://reviews.llvm.org/D57177 llvm-svn: 352440	2019-01-28 23:24:49 +00:00
Sanjay Patel	a36a293a56	[CGP] auto-generate complete checks for add overflow tests; NFC llvm-svn: 352437	2019-01-28 22:07:37 +00:00
Craig Topper	390ac61b93	Recommit r352255 "[SelectionDAG][X86] Don't use SEXTLOAD for promoting masked loads in the type legalizer" This did not cause the buildbot failure it was previously reverted for. Original commit message: I'm not sure why we were using SEXTLOAD. EXTLOAD seems more appropriate since we don't care about the upper bits. This patch changes this and then modifies the X86 post legalization combine to emit an extending shuffle instead of a sign_extend_vector_inreg. Could maybe use an any_extend_vector_inre On AVX512 targets I think we might be able to use a masked vpmovzx and not have to expand this at all. llvm-svn: 352433	2019-01-28 21:38:47 +00:00
Jessica Paquette	2d73ecd0a3	[GlobalISel][AArch64] Add legalization for G_FLOG This adds support for legalizing G_FLOG into a RTLib call. It adds a legalizer test, and updates the existing floating point tests. https://reviews.llvm.org/D57347 llvm-svn: 352429	2019-01-28 21:27:23 +00:00
Sanjay Patel	8965411619	[InstCombine] add another saturating uadd test (no undefs); NFC I forgot that our undef matching hasn't been completed in the previous commit. llvm-svn: 352424	2019-01-28 20:37:18 +00:00
Sanjay Patel	dc543300a9	[InstCombine] add tests for saturating uadd with constant; NFC llvm-svn: 352423	2019-01-28 20:32:48 +00:00
Matt Arsenault	cdd191d9db	AMDGPU: Add DS append/consume intrinsics Since these pass the pointer in m0 unlike other DS instructions, these need to worry about whether the address is uniform or not. This assumes the address is dynamically uniform, and just uses readfirstlane to get a copy into an SGPR. I don't know if these have the same 16-bit add for the addressing mode offset problem on SI or not, but I've just assumed they do. Also includes some misc. changes to avoid test differences between the LDS and GDS versions. llvm-svn: 352422	2019-01-28 20:14:49 +00:00
Jessica Paquette	c49428a97d	[GlobalISel][AArch64] Add instruction selection support for @llvm.log10 This adds instruction selection support for @llvm.log10 in AArch64. It teaches GISel to lower it to a library call, updates the relevant tests, and adds a legalizer test for log10. https://reviews.llvm.org/D57341 llvm-svn: 352418	2019-01-28 19:53:14 +00:00
Scott Linder	b5d6292822	[MC] Do not consider .ifdef/.ifndef as a use This is allowed by GAS and seems correct. Differential Revision: https://reviews.llvm.org/D55439 llvm-svn: 352414	2019-01-28 19:32:08 +00:00
Francis Visoiu Mistrih	556ea7d2e0	[AArch64] Add 'apple-latest' CPU alias The 'apple-latest' alias is supposed to provide a CPU that contains the latest Apple processor model supported by LLVM. This is supposed to be used by tools like lldb to provide a target that supports most of the CPU features. For now, this is mapped to Cyclone. Differential Revision: https://reviews.llvm.org/D56384 llvm-svn: 352412	2019-01-28 19:27:33 +00:00
Jessica Paquette	2e35dc5185	[GlobalISel] Add ISel support for @llvm.lifetime.start and @llvm.lifetime.end This adds ISel support for lifetime markers in opt levels above O0. It also updates the arm64-irtranslator test, and updates some AArch64 tests that use them for added coverage. It also adds a testcase taken from the X86 codegen tests which verified a bug caused by lifetime markers + stack colouring in the past. This is intended to make sure that GISel doesn't re-introduce the bug. (This is basically a straight copy from what SelectionDAG does in SelectionDAGBuilder.cpp) https://reviews.llvm.org/D57187 llvm-svn: 352410	2019-01-28 19:22:29 +00:00
Nikita Popov	8e1a464e6a	[CodeGen][X86] Expand UADDSAT to NOT+UMIN+ADD Followup to D56636, this time handling the UADDSAT case by expanding uadd.sat(a, b) to umin(a, ~b) + b. Differential Revision: https://reviews.llvm.org/D56869 llvm-svn: 352409	2019-01-28 19:19:09 +00:00
Vedant Kumar	1c3694a4d4	[CodeExtractor] Add support for the `swifterror` attribute When passing a `swifterror` argument or alloca as an input to an extraction region, mark the input parameter `swifterror`. llvm-svn: 352408	2019-01-28 19:13:37 +00:00
Jessica Paquette	7db82d7257	[GlobalISel][AArch64] Add instruction selection support for G_FCOS and G_FSIN This contains all of the legalizer changes from D57197 necessary to select G_FCOS and G_FSIN. It also updates several existing IR tests in test/CodeGen/AArch64 that verify that we correctly lower the G_FCOS and G_FSIN instructions. https://reviews.llvm.org/D57197 3/3 llvm-svn: 352402	2019-01-28 18:34:18 +00:00
Jessica Paquette	296f19b3d9	[GlobalISel][AArch64] Add IRTranslator support for G_FCOS and G_FSIN This adds IRTranslator support for the G_FCOS and G_FSIN generic instructions. https://reviews.llvm.org/D57197 2/3 llvm-svn: 352401	2019-01-28 18:34:17 +00:00
Jessica Paquette	9f6afad913	[GlobalISel] Add G_FSIN and G_FCOS generic instructions This introduces generic instructions for floating point sin and cos, G_FCOS and G_FSIN. It updates the tests, etc. https://reviews.llvm.org/D57197 1/3 llvm-svn: 352400	2019-01-28 18:34:16 +00:00
Simon Pilgrim	2c17512456	[X86][AVX] Remove lowerShuffleByMerging128BitLanes 2-lane restriction First step towards adding support for 64-bit unary "sublane" handling (a bit like lowerShuffleAsRepeatedMaskAndLanePermute). This allows us to add lowerV64I8Shuffle handling. llvm-svn: 352389	2019-01-28 17:02:35 +00:00
Sanjay Patel	94cca60b82	[x86] allow more shuffle splitting to avoid vpermps (PR40434) This is tricky to make optimal: sometimes we're better off using a single wider op, but other times it makes more sense to combine narrow ops to achieve the same result. This solves the case from: https://bugs.llvm.org/show_bug.cgi?id=40434 There's potentially a similar change for vectors with 64-bit elements, but it needs adjustments similar to rL352333 to avoid creating infinite loops. llvm-svn: 352380	2019-01-28 15:51:34 +00:00


58957 Commits
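
Commit 8e1a464e6a above expands uadd.sat(a, b) to umin(a, ~b) + b. The following is a minimal Python sketch of why that identity holds for unsigned integers of a fixed width. It is not LLVM code: the function names and the `bits` parameter are hypothetical and exist only for illustration.

```python
def uadd_sat(a: int, b: int, bits: int = 8) -> int:
    """Reference semantics: unsigned add that clamps at the maximum value."""
    max_val = (1 << bits) - 1
    return min(a + b, max_val)


def uadd_sat_expanded(a: int, b: int, bits: int = 8) -> int:
    """The expansion named in the commit message: umin(a, ~b) + b.

    For an unsigned value, ~b equals max_val - b, so umin(a, ~b) caps a at
    the largest amount that can be added to b without wrapping.
    """
    max_val = (1 << bits) - 1
    not_b = ~b & max_val  # bitwise NOT truncated to `bits` bits
    return (min(a, not_b) + b) & max_val


def forms_agree(bits: int = 8) -> bool:
    """Exhaustively compare both forms over every pair of `bits`-wide inputs."""
    n = 1 << bits
    return all(
        uadd_sat(a, b, bits) == uadd_sat_expanded(a, b, bits)
        for a in range(n)
        for b in range(n)
    )
```

Calling `forms_agree(8)` compares the two forms over all 65536 pairs of 8-bit inputs. The final mask in `uadd_sat_expanded` never changes the result, because min(a, ~b) + b cannot exceed the maximum value; it is there only to mirror fixed-width arithmetic.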