llvm-project

Commit Graph

Author	SHA1	Message	Date
Petr Hosek	b16ed493dd	[Fuchsia] Rely on linker switch rather than dead code ref for profile runtime Follow the model used on Linux, where the clang driver passes the linker a -u switch to force the profile runtime to be linked in, rather than having every TU emit a dead function with a reference. Differential Revision: https://reviews.llvm.org/D79835	2020-06-04 15:47:05 -07:00
Petr Hosek	e1ab90001a	Revert "[Fuchsia] Rely on linker switch rather than dead code ref for profile runtime" This reverts commit `d510542174` since it broke several bots.	2020-06-04 15:44:10 -07:00
Craig Topper	3ad8fbd205	[Reassociate] Teach ConvertShiftToMul to preserve nsw flag if the shift amount is not bitwidth - 1. Multiply and shl have different signed overflow behavior in some cases. But it looks like we should be ok as long as the shift amount is less than bitwidth - 1. Alive2: http://volta.cs.utah.edu:8080/z/MM4WZP Differential Revision: https://reviews.llvm.org/D81189	2020-06-04 14:51:34 -07:00
Matt Arsenault	1657f0ebc2	AMDGPU: Fix overriding global FP atomic feature predicates Global TableGen let override blocks are pretty dangerous and override any local special cases. In this case, the broader HasFlatGlobalInsts was overriding the more specific predicate for FeatureAtomicFaddInsts. Make sure HasFlatGlobalInsts is implied by FeatureAtomicFaddInsts, and make sure the right predicate is used. One issue with independently setting the subtarget features on incompatible targets is all of the encoding families do not define all opcodes. This will hit an assert on gfx10 for example, since we set the encoding independently based on the generation and not based on a feature.	2020-06-04 17:50:38 -04:00
Matt Arsenault	651c36b508	AMDGPU: Select strict_fmul	2020-06-04 17:49:00 -04:00
Matt Arsenault	483d4daa5e	AMDGPU: Select strict_fma Like with strict_fadd, the legalization is scalarizing the v4f16 when it should split.	2020-06-04 17:49:00 -04:00
Matt Arsenault	ae26c064ce	AMDGPU: Select strict_fadd	2020-06-04 17:49:00 -04:00
Matt Arsenault	d259668731	AMDGPU: Set mayRaiseFPException This may be missing a few overrides to set it off still in some special cases. Since the flags set during selection should now be reliably preserved, this should not change codegen for non-strictfp functions.	2020-06-04 17:35:27 -04:00
Sanjay Patel	192cb71836	[InstCombine] avoid crashing on select-shuffle detection As mentioned in the post-commit comments of D81013 - the mask check API has to assume the shuffle is not length-changing, but we have not ruled that out in this code. Use the ShuffleVectorInst call instead.	2020-06-04 17:27:14 -04:00
Petr Hosek	d510542174	[Fuchsia] Rely on linker switch rather than dead code ref for profile runtime Follow the model used on Linux, where the clang driver passes the linker a -u switch to force the profile runtime to be linked in, rather than having every TU emit a dead function with a reference. Patch By: mcgrathr Differential Revision: https://reviews.llvm.org/D79835	2020-06-04 14:25:19 -07:00
Matt Arsenault	fe0d5121fa	AMDGPU/GlobalISel: Fix making LDS FP atomics legal on SI/CI	2020-06-04 16:50:19 -04:00
Thomas Lively	a07c08f74f	[WebAssembly] Lower llvm.debugtrap properly Summary: Unlike normal traps, debug traps are allowed to return and can have additional instructions in the same basic block. Without explicit backend support for debug traps, they are lowered in ISel as normal traps. Since normal traps are lowered in the WebAssembly backend to the UNREACHABLE instruction, which is a terminator, using debug traps could lead to invalid MBBs when there are additional instructions after the trap. This patch fixes the issue by lowering debug traps to a new version of the UNREACHABLE instruction, DEBUG_UNREACHABLE, that is not a terminator. An alternative approach would have been to make UNREACHABLE not a terminator, but that breaks a large number of tests. In particular, it would require removing the traps inserted after noreturn calls to @llvm.wasm.throw because otherwise the terminator throw would be followed by a non-terminator UNREACHABLE and we would be back to having invalid MBBs. Overall the approach in this patch seems simpler. Reviewers: aheejin, dschuff Subscribers: sbc100, jgravelle-google, hiraditya, sunfish, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D81055	2020-06-04 13:25:10 -07:00
Hiroshi Yamauchi	e52a38db07	[PGO] Enable the working set size scaling under the partial sample PGO. Summary: Following up D79831. Reviewers: davidxl Subscribers: eraman, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80939	2020-06-04 11:30:54 -07:00
Sanjay Patel	8a96c1f627	[InstCombine] move vector select ahead of select-shuffle select Cond, (shuf_sel X, Y), X --> shuf_sel X, (select Cond, Y, X) A select of a select-shuffle ("blend" in x86 lingo) can be reversed so that the select is done first. This is a more limited version of what I was trying in D80658, but it enables existing demanded bits transforms to catch some of the motivating cases. The tricky bit in that seems to be that by moving the shuffle later, we can always guarantee that poison is correctly inhibited by the shuffle mask in the final value. Alive2 checks for the basic tests: http://volta.cs.utah.edu:8080/z/Qqd3RK http://volta.cs.utah.edu:8080/z/S4wchM http://volta.cs.utah.edu:8080/z/wf9zPL http://volta.cs.utah.edu:8080/z/wJeEGk Differential Revision: https://reviews.llvm.org/D81013	2020-06-04 14:29:13 -04:00
Amara Emerson	e53f558057	[AArch64][GlobalISel] Move GlobalISel source files to a dedicated subdir. Differential Revision: https://reviews.llvm.org/D81116	2020-06-04 10:51:38 -07:00
Layton Kifer	7381fcdf62	[TRE] Allow accumulator elimination when base case returns non-constant Remove the requirement, that when performing accumulator elimination, all other cases must return the same dynamic constant. We can do this by initializing the accumulator with the identity value of the accumulation operation, and inserting an additional operation before any return. Differential Revision: https://reviews.llvm.org/D80844	2020-06-04 10:34:42 -07:00
Huihui Zhang	bd43f78c76	[LSR][SCEVExpander] Avoid blind cast 'Factor' to SCEVConstant in FactorOutConstant. Summary: In SCEVExpander FactorOutConstant(), when GEP indexing into/over scalable vector, it is legal for the 'Factor' in a MulExpr to be the size of a scalable vector instead of a compile-time constant. Current upstream crash with the test attached. Reviewers: efriedma, sdesmalen, sanjoy.google, mkazantsev Reviewed By: efriedma Subscribers: hiraditya, javed.absar, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80973	2020-06-04 10:33:39 -07:00
Christopher Tetreault	c2625f330f	[SVE] Eliminate calls to default-false VectorType::get() from SystemZ Reviewers: efriedma, jnspaulsson, kmclaughlin, sdesmalen, samparker, uweigand Reviewed By: uweigand Subscribers: tschuett, hiraditya, rkruppe, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80329	2020-06-04 10:05:38 -07:00
Fangrui Song	9be3567df2	[llvm-dwarfdump] Add a table header for -debug-line -verbose output Like non-verbose output, so that it is easy to recognize the `Line,Column,File,ISA,Discriminator` column values. Reviewed By: JDevlieghere, jhenderson Differential Revision: https://reviews.llvm.org/D80874	2020-06-04 08:56:17 -07:00
Matt Arsenault	af867b7850	DAG: Change computeKnownBitsForFrameIndex to be usable by GISel This wasn't getting much value from the DAG or depth arguments, since it's only called on the frame index root nodes. FrameIndexes can also only return a scalar value, so it also didn't need DemandedElts.	2020-06-04 10:50:26 -04:00
Matt Arsenault	931a68f26b	RegAllocFast: Remove dead code	2020-06-04 09:38:31 -04:00
Sanjay Patel	652b3757c8	[x86] add test/code comment for chain value use (PR46195); NFC	2020-06-04 09:15:17 -04:00
Pavel Labath	48cd9d9dd8	[Support] Use outs() in ToolOutputFile Summary: If the output filename was specified as "-", the ToolOutputFile class would create a brand new raw_ostream object referring to the stdout. This patch changes it to reuse the llvm::outs() singleton. At the moment, this change should be "NFC", but it does enable other enhancements, like the automatic stdout/stderr synchronization as discussed on D80803. I've checked the history, and I did not find any indication that this class has to use a brand new stream object instead of outs() -- indeed, it is special-casing "-" in a number of places already, so this change fits the pattern pretty well. I suspect the main reason for the current state of affairs is that the class was originally introduced (r111595, in 2010) as a raw_fd_ostream subclass, which made any other solution impossible. Another potential benefit of this patch is that it makes it possible to move the raw_ostream class out of the business of special-casing "-" for stdout handling. That state of affairs does not seem appropriate because "-" is a valid filename (albeit hard to access with a lot of command line tools) on most systems. Handling "-" in ToolOutputFile seems more appropriate. To make this possible, this patch changes the return type of llvm::outs() and errs() to raw_fd_ostream&. Previously the functions were constructing objects of that type, but returning a generic raw_ostream reference. This makes it possible for new ToolOutputFile and other code to use raw_fd_ostream methods like error() on the outs() object. This does not seem like a bad thing (since stdout is a file descriptor which can be redirected to anywhere, it makes sense to ask it whether the writing was successful or if it supports seeking), and indeed a lot of code was already depending on this fact via the ToolOutputFile "back door". Reviewers: dblaikie, JDevlieghere, MaskRay, jhenderson Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D81078	2020-06-04 14:56:35 +02:00
Simon Moll	a0dfdda4e5	[VP][Fix] canIgnoreVectorLength for scalable types This patch fixes VPIntrinsic::canIgnoreVectorLength when used on a VPIntrinsic with scalable vector types. Also includes new unittest cases for the '<vscale x 1 x whatever>' and '%evl == vscale' corner cases.	2020-06-04 14:17:42 +02:00
Max Kazantsev	9bdb918890	[InstCombine][NFC] Factor out constant check We plan to add more transforms here. Besides, this check should be done in the beginning just from function's name.	2020-06-04 18:54:23 +07:00
Georgii Rymar	9d739a9157	[ObjectYAML] - Remove unused function. NFC. Was introduced in D81005 by mistake. Caught by BB: http://lab.llvm.org:8011/builders/clang-ppc64le-rhel/builds/4070/steps/build%20stage%201/logs/stdio	2020-06-04 14:22:51 +03:00
Paul Walker	ed9df8621a	[FileCheck] Implement equality operators for ExpressionValue. Subscribers: hiraditya, thopre, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D81094	2020-06-04 11:18:35 +00:00
Yvan Roux	6b9e102243	[ARM][MachineOutliner] Remove unneeded dynamic allocation.	2020-06-04 13:12:26 +02:00
Simon Pilgrim	adf10dcf2e	[DAG] scalarizeBinOpOfSplats - extract from the source of splat vector (PR46189) D79003/rG9fa58d1bf2f8 exposed an issue with scalarizeBinOpOfSplats that we were extracting from the splatted vector result instead of the source, the splat index is only valid for the source vector not the result, which may contain undefs, including at the splat index.	2020-06-04 11:58:59 +01:00
Tim Northover	87e24c3200	Revert "[DAGCombiner] avoid unnecessary indirection from SDNode/SDValue; NFCI" This reverts commit `21dadd774f`. In at least PromoteIntBinOps, they wanted to know about users of all values produced by the node not just the integer being promoted. For example not replacing chain users if the operation was a load breaks the ordering of the DAG.	2020-06-04 11:53:14 +01:00
Georgii Rymar	c781e7370e	[yaml2obj] - Add a way to exclude specified sections from the section header. This implements a new "Excluded" key that can be used to exclude entries from section header: ``` SectionHeaderTable: Sections: ... Excluded: - Name: .foo ``` Differential revision: https://reviews.llvm.org/D81005	2020-06-04 13:50:35 +03:00
Djordje Todorovic	7fbbc82057	[CSInfo][MIPS] Describe parameter value loaded by ADDiu Describe parameter's value loaded by MIPS ADDiu instruction. When parameter's value is loaded into a register by mips ADDiu/DADDiu instruction, it could be described correctly and emitted as DW_AT_GNU_call_site_value. Patch by Nikola Tesic Differential revision: https://reviews.llvm.org/D78108	2020-06-04 12:39:56 +02:00
Georgii Rymar	5750f12b82	Revert "[yaml2obj] - Allocate the file space for SHT_NOBITS sections in some cases." This reverts commit `aa3a85cdaa`. There are problems with it. See here: https://reviews.llvm.org/D80629	2020-06-04 13:17:48 +03:00
Vitaly Buka	af6e054730	[StackSafety] Rename testing opts	2020-06-04 02:39:16 -07:00
Vitaly Buka	81826c7ac6	[StackSafety,NFC] Remove SCEVRewriteVisitor Summary: Depends on D80956. Reviewers: eugenis Reviewed By: eugenis Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80976	2020-06-04 02:32:36 -07:00
Jay Foad	590964c835	[AMDGPU] More accurate gfx10 latencies Differential Revision: https://reviews.llvm.org/D81012	2020-06-04 10:29:32 +01:00
Jay Foad	9ce0f7eed6	[AMDGPU] Introduce new sched classes for transcendental instructions This is in preparation for scheduling them slightly differently on gfx10. NFC. Differential Revision: https://reviews.llvm.org/D81011	2020-06-04 10:29:32 +01:00
Kazushi (Jam) Marukawa	52ed34deeb	[VE] Clean SDNodeXForm stuff Summary: Gather definitions of SDNodeXForm and change them to call C functions instead of copying C expressions in td files. Doing this solved some bugs in mimm detections. Differential Revision: https://reviews.llvm.org/D81132	2020-06-04 11:28:24 +02:00
Qiu Chaofan	7a001a2d92	[PowerPC] Require nsz flag for c-a*b to FNMSUB On PowerPC, FNMSUB (both VSX and non-VSX version) means -(a*b-c). But the backend used to generate these instructions regardless whether nsz flag exists or not. If a*b-c==0, such transformation changes sign of zero. This patch introduces PPC specific FNMSUB ISD opcode, which may help improving combined FMA code sequence. Reviewed By: steven.zhang Differential Revision: https://reviews.llvm.org/D76585	2020-06-04 16:41:27 +08:00
Yevgeny Rouban	dcfa78a4cc	Extend InvokeInst !prof branch_weights metadata to unwind branches Allow InvokeInst to have the second optional prof branch weight for its unwind branch. InvokeInst is a terminator with two successors. It might have its unwind branch taken many times. If so the BranchProbabilityInfo unwind branch heuristic can be inaccurate. This patch allows a higher accuracy calculated with both branch weights set. Changes: - A new section about InvokeInst is added to the BranchWeightMetadata page. It states the old information that missed in the doc and adds new about the second branch weight. - Verifier is changed to allow either 1 or 2 branch weights for InvokeInst. - A new test is written for BranchProbabilityInfo to demonstrate the main improvement of the simple fix in calcMetadataWeights(). - Several new testcases are created for Inliner. Those check that both weights are accounted for invoke instruction weight calculation. - PGOUseFunc::setBranchWeights() is fixed to be applicable to InvokeInst. Reviewers: davidxl, reames, xur, yamauchi Tags: #llvm Differential Revision: https://reviews.llvm.org/D80618	2020-06-04 15:37:15 +07:00
Yevgeny Rouban	417bcb8827	[Instruction] Remove setProfWeight() Remove the function Instruction::setProfWeight() and make use of Instruction::copyMetadata(.., {LLVMContext::MD_prof}). This is correct for all use cases of setProfWeight() as it is applied to CallBase instructions only. This change results in prof metadata copied intact even if the source has "VP". The old pair of calls extractProfTotalWeight() + setProfWeight() resulted in setting branch_weights if the source had "VP" data. Reviewers: yamauchi, davidxl Tags: #llvm Differential Revision: https://reviews.llvm.org/D80987	2020-06-04 15:10:55 +07:00
Mikael Holmen	2f671c4225	[WebAssembly] Fix gcc warning [NFC] gcc 7.4 complained with ../lib/Target/WebAssembly/WebAssemblyFixBrTableDefaults.cpp:125:23: warning: extra ';' [-Wpedantic] false); ^	2020-06-04 10:03:13 +02:00
Sam Parker	6f24ebc4ba	[NFCI][CostModel][AMDGPU] Simplify getUserCost Casts and intrinsics are now handled by the default implementation of getUserCost, so remove them from the backends switch statement. https://reviews.llvm.org/D80994	2020-06-04 08:51:28 +01:00
Kazu Hirata	347a599e5f	[Inlining] Introduce -enable-npm-pgo-inline-deferral Summary: Experiments show that inline deferral past pre-inlining slightly pessimizes the performance. This patch introduces an option to control inline deferral during PGO. The option defaults to true for now (that is, NFC). Reviewers: davidxl Reviewed By: davidxl Subscribers: eraman, hiraditya, haicheng, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80776	2020-06-04 00:40:58 -07:00
Craig Topper	7eff1a7136	[X86] Remove (V)MOVHPDrm patterns that involve bitcast+scalar_to_vec+loadi64. I think these are left over from when we used to type legalize v2f32 loads using bitcast+scalar_to_vec+loadi64 on 64-bit targets. These days we use loadf64. If this becomes a problem a better solution would be a DAG combine to turn it into scalar_to_vec+loadf64.	2020-06-04 00:31:47 -07:00
Kazushi (Jam) Marukawa	6b461ba459	[VE] Change to use EXTRACT_SUBREG instead of COPY_TO_REGCLASS Summary: Change to use EXTRACT_SUBREG instead of COPY_TO_REGCLASS in order to remove unnecessary copy instructions. Differential Revision: https://reviews.llvm.org/D81129	2020-06-04 09:05:36 +02:00
David Sherwood	a3e3986be1	[SVE] Fix ubsan issues in DecodeIITType In an earlier patch I removed the need for IITDescriptor::ScalableVecArgument, which involved changing DecodeIITType to pull out the last IIT_Info from the list. However, it turns out this is unsafe and causes ubsan failures. I've tried to fix this a different way by simply passing the last IIT_Info as an additional argument to DecodeIITType. Differential Revision: https://reviews.llvm.org/D81057	2020-06-04 07:58:24 +01:00
Madhur Amilkanthwar	b3cff3c720	Utility to dump .dot representation of SelectionDAG without firing viewer Summary: This patch adds support for dumping .dot representation of SelectionDAG. It is inspired by the fact that a developer may want to just dump the graph at a predictable path with a simple name to compare. The existing utilities (i.e. viewGraph) are overkill for this purpose, hence this patch adds the required support while using the core routines from GraphWriter. Example usage: DAG.dumpDotGraph("/tmp/graph.dot", "MyGraph") will create /tmp/graph.dot file when DAG is an object of SelectionDAG class. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D80711	2020-06-04 11:51:48 +05:30
Fangrui Song	1a2d4bf34e	[gcov] Don't error 'unexpected end of memory buffe'	2020-06-03 22:05:15 -07:00
Fangrui Song	904b971aac	[gcov] Make `Creating 'filename'` compatible with gcov And clean up llvm-cov.test a bit	2020-06-03 21:48:01 -07:00
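
The Reassociate entry in the table above (3ad8fbd205) says that shl and mul differ in signed-overflow behaviour unless the shift amount is below bitwidth - 1. The sketch below is not code from that commit. It is a small brute-force model over 8-bit values, with hypothetical helper names (`fitsI8`, `shlNswOk`, `mulNswOk`, `nswCarriesOver`), assuming that "no signed wrap" means the exact mathematical result fits in the type.

```cpp
#include <cstdint>

// True if the exact value v is representable as a signed 8-bit integer.
bool fitsI8(int v) { return v >= -128 && v <= 127; }

// Model of `shl nsw i8 x, s`: no signed wrap iff x * 2^s fits in i8.
bool shlNswOk(std::int8_t x, unsigned s) { return fitsI8(int(x) * (1 << s)); }

// Model of `mul nsw i8 x, C`, where C is the i8 bit pattern of 1 << s.
// For s == 7 that pattern is 0x80, which reads as -128 when signed.
bool mulNswOk(std::int8_t x, unsigned s) {
  int c = (s == 7) ? -128 : (1 << s);
  return fitsI8(int(x) * c);
}

// True if, for shift amount s, every x whose shift does not wrap
// also has a multiply that does not wrap (so nsw could be kept).
bool nswCarriesOver(unsigned s) {
  for (int x = -128; x <= 127; ++x) {
    auto v = static_cast<std::int8_t>(x);
    if (shlNswOk(v, s) && !mulNswOk(v, s)) return false;
  }
  return true;
}
```

In this model `nswCarriesOver(s)` holds for s from 0 to 6 and fails for s = 7, where x = -1 shifts to -128 without wrapping while -1 * -128 does not fit in 8 bits.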

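The PowerPC entry in the table above (7a001a2d92) states that rewriting c - a*b as -(a*b - c) changes the sign of zero when a*b - c == 0. A minimal sketch of that effect in plain C++ follows. The function names are hypothetical, and it assumes default IEEE-754 round-to-nearest arithmetic without fast-math style options.

```cpp
#include <cmath>

// The source-level expression: c - a*b.
double subMul(double a, double b, double c) { return c - a * b; }

// The form the commit message attributes to FNMSUB: -(a*b - c).
double negMulSub(double a, double b, double c) { return -(a * b - c); }

// True if x and y carry the same sign bit (distinguishes +0.0 from -0.0).
bool sameSignBit(double x, double y) {
  return std::signbit(x) == std::signbit(y);
}
```

With a = 2, b = 3, c = 6 both functions return a zero that compares equal, yet `subMul` yields +0.0 and `negMulSub` yields -0.0, so `sameSignBit` reports a difference. For inputs where the result is nonzero, such as c = 7, the two forms agree.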

135183 Commits