llvm-project

Commit Graph

Author	SHA1	Message	Date
Simon Pilgrim	505c4dabe2	isImmPCRel/isImmSigned - both functions should return bool not unsigned. NFCI.	2019-11-02 21:04:07 +00:00
Simon Pilgrim	f722071a9e	X86_MC::createX86MCSubtargetInfo - X86_MC::ParseX86Triple never returns an empty string. NFCI. PVS Studio was complaining that the expression '!ArchFS.empty()' is always true.	2019-11-02 18:46:53 +00:00
Simon Pilgrim	2cbb9653d8	X86Operand::print - fix SymName shadow variable warning. NFCI.	2019-11-02 18:46:53 +00:00
Simon Pilgrim	79818f8c70	X86AsmPrinter - fix uninitialized variable warnings. NFCI.	2019-11-02 18:03:21 +00:00
Simon Pilgrim	579a56bec3	TargetMachine - fix uninitialized variable warning. NFCI. TargetPassConfig::addCoreISelPasses() always initializes O0WantsFastISel but it appeases static analyzers that complain that O0WantsFastISel isn't initialized in the constructor.	2019-11-02 16:11:01 +00:00
Simon Pilgrim	254b8461ac	[X86] Move computeZeroableShuffleElements before getTargetShuffleAndZeroables. NFCI. Prep work toward merging some of the functionality.	2019-11-02 13:38:35 +00:00
Craig Topper	83503ad119	[X86] Remove FeatureSSE3 from the implies list of HasFastHorizontalOps. HasFastHorizontalOps is a tuning flag. It shouldn't imply an ISA flag.	2019-11-01 23:17:53 -07:00
Pengfei Wang	02728f49da	[X86] Model MXCSR for MMX FP instructions Summary: This patch models MXCSR and adds flag "mayRaiseFPException" for MMX FP instructions. Reviewers: craig.topper, andrew.w.kaylor, RKSimon, cameron.mcinally Reviewed By: craig.topper Subscribers: hiraditya, llvm-commits, LiuChen3 Tags: #llvm Differential Revision: https://reviews.llvm.org/D69702	2019-11-01 21:21:46 -07:00
Pengfei Wang	af3a7de20c	[X86] add mayRaiseFPException flag and FPCW registers for X87 instructions Summary: This patch adds flag "mayRaiseFPException" , FPCW and FPSW for X87 instructions which could raise float exception. Reviewers: pengfei, RKSimon, andrew.w.kaylor, uweigand, kpn, spatel, cameron.mcinally, craig.topper Reviewed By: craig.topper Subscribers: thakis, hiraditya, llvm-commits Patch by LiuChen. Differential Revision: https://reviews.llvm.org/D68854	2019-11-01 21:12:43 -07:00
Michael Liao	4531aee2ac	[amdgpu] Fix known bits compuation on `MUL_I24`/`MUL_U24`. Reviewers: arsenm, rampitec Subscribers: kzhuravl, jvesely, wdng, nhaehnle, dstuttard, tpr, t-tye, hiraditya, llvm-commits, yaxunl Tags: #llvm Differential Revision: https://reviews.llvm.org/D69735	2019-11-01 17:06:17 -04:00
Craig Topper	eeeb18cd07	[X86] Change the behavior of canWidenShuffleElements used by lowerV2X128Shuffle to match the behavior in lowerVectorShuffle with regards to zeroable elements. Previously we marked zeroable elements in a way that prevented the widening check from recognizing that it could widen. Now we only mark them zeroable if V2 is an all zeros vector. This matches what we do for widening elements in lowerVectorShuffle. Fixes PR43866.	2019-11-01 13:06:03 -07:00
Fangrui Song	45ee0d6de6	[MIPS GlobalISel] Fix -Wunused-variable in -DLLVM_ENABLE_ASSERTIONS=off builds after D69663	2019-11-01 11:07:50 -07:00
Simon Pilgrim	9b0dfdf5e1	[X86][AVX] Add support for and/or scalar bool reduction with AVX512 mask registers combineBitcastvxi1 only handles bitcast->MOVMSK combines, with mask registers we use BITCAST directly.	2019-11-01 17:55:31 +00:00
Thomas Lively	935c84c3c2	[WebAssembly] Add experimental SIMD dot product instruction Summary: This instruction is not merged to the spec proposal, but we need it to be implemented in the toolchain to experiment with it. It is available only on an opt-in basis through a clang builtin. Defined in https://github.com/WebAssembly/simd/pull/127. Depends on D69696. Reviewers: aheejin Subscribers: dschuff, sbc100, jgravelle-google, hiraditya, sunfish, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D69697	2019-11-01 10:45:48 -07:00
Thomas Lively	ecb7daf68f	Reland "[WebAssembly] Expand setcc of v2i64" This reverts commit `e5cae5692b`, which reverted `11850a6305`. The original revert was done because of breakage that was actually in a separate commit, `2ab1b8c1ec`, which was also reverted and has since been fixed and relanded.	2019-11-01 10:34:01 -07:00
Simon Pilgrim	ea27d82814	[X86] isFNEG - use switch() instead of if-else tree. NFCI. In a future patch this will avoid some checks which don't need to be done for some opcodes.	2019-11-01 17:09:04 +00:00
Oliver Stannard	a3f4745428	Revert "[AArch64][MachineOutliner] Return address signing for outlined functions" This is causing faults when an instruction which modifies SP is outlined, causing the PAC and AUT instructions to not match. This reverts commits `70caa1fc30` and `55314d3237`.	2019-11-01 16:06:09 +00:00
Momchil Velikov	7849862f46	[AArch64] Output the pseudo SPACE in asm and object files Summary: It outputs nothing, but is useful for writing tests, checking asm output. Reviewers: t.p.northover, ostannard, tellenbach Reviewed By: tellenbach Subscribers: tellenbach, kristof.beyls, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D69185 Change-Id: I6b58310e9e5632f0976d2000ce975ee28df90ebe	2019-11-01 15:01:53 +00:00
Petar Avramovic	d32a6f0812	[MIPS GlobalISel] Improve reg bank handling in MipsInstructionSelector Introduce helper methods and refactor pieces of code related to register banks in MipsInstructionSelector. Add a few detailed asserts in order to get a better overview of LLT, register bank combinations that are supported at the moment and reduce need to look at other files. Differential Revision: https://reviews.llvm.org/D69663	2019-11-01 13:24:07 +01:00
Kerry McLaughlin	5ec34dfdf7	[AArch64][SVE] Implement several floating-point arithmetic intrinsics Summary: Adds intrinsics for the following: - fabd, fadd, fsub & fsubr - fmul, fmulx, fdiv & fdivr - fmax, fmaxnm, fmin & fminnm - fscale & ftsmul Reviewers: huntergr, sdesmalen, dancgr Reviewed By: sdesmalen Subscribers: tschuett, kristof.beyls, hiraditya, rkruppe, psnobl, cameron.mcinally, cfe-commits, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D69657	2019-11-01 10:40:36 +00:00
Matt Arsenault	19e7f8a21d	AMDGPU: Add default denormal mode to MachineFunctionInfo The default FP mode should really be a property of a specific function, and not a subtarget. Introduce the necessary fields to the SIMachineFunctionInfo to help move towards this goal.	2019-11-01 00:03:39 -07:00
David Zarzycki	cb6822c9de	[X86] Reland: Enable YMM memcmp with AVX1 Update TargetTransformInfo to allow AVX1 to use YMM registers for memcmp. This is a follow up to D68632 which enabled XOR compares which made this possible. This also updates the memcmp-optsize.ll test unlike the first patch. https://reviews.llvm.org/D69658	2019-11-01 08:58:48 +02:00
Matt Arsenault	6221767055	DAG: Add DAG argument to isFPExtFoldable For AMDGPU this is dependent on the FP mode, which should eventually not be a property of the subtarget.	2019-10-31 22:32:45 -07:00
Thomas Lively	a07019a275	[WebAssembly] SIMD integer min and max instructions Summary: Introduces a clang builtins and LLVM intrinsics representing integer min/max instructions. These instructions have not been merged to the SIMD spec proposal yet, so they are currently opt-in only via builtins and not produced by general pattern matching. If these instructions are accepted into the spec proposal the builtins and intrinsics will be replaced with normal pattern matching. Defined in https://github.com/WebAssembly/simd/pull/27. Reviewers: aheejin Reviewed By: aheejin Subscribers: dschuff, sbc100, jgravelle-google, hiraditya, sunfish, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D69696	2019-10-31 20:22:11 -07:00
Thomas Lively	3479fd25b9	Reland "[WebAssembly] Handle multiple loads of splatted loads" This reverts commit `92a25fbf11` and fixes the ambiguous method call that was causing build failures.	2019-10-31 20:02:24 -07:00
Vlad Tsyrklevich	92a25fbf11	Revert "[WebAssembly] Handle multiple loads of splatted loads" This reverts commit `2ab1b8c1ec`, it is causing build failures on numerous bots, including sanitizer-x86_64-linux-bootstrap-ubsan. My previous revert was for the wrong commit.	2019-10-31 16:52:44 -07:00
Vlad Tsyrklevich	e5cae5692b	Revert "[WebAssembly] Expand setcc of v2i64" This reverts commit `11850a6305`, it was causing build failures on numerous bots, including sanitizer-x86_64-linux-bootstrap-ubsan.	2019-10-31 16:44:09 -07:00
Nico Weber	a5bf48b84c	Revert "[X86] add mayRaiseFPException flag and FPCW registers for X87 instructions" This reverts commit `a678677da4`. It broke CodeGen/ms-inline-asm.c on most bots.	2019-10-31 19:14:42 -04:00
Craig Topper	a678677da4	[X86] add mayRaiseFPException flag and FPCW registers for X87 instructions This patch adds flag "mayRaiseFPException" , FPCW and FPSW for X87 instructions which could raise float exception. Patch by LiuChen. With a couple small fixes from me. Differential Revision: https://reviews.llvm.org/D68854	2019-10-31 15:05:29 -07:00
Thomas Lively	2ab1b8c1ec	[WebAssembly] Handle multiple loads of splatted loads Summary: Fixes an ISel failure when a splatted load is used more than once. The failure was due to the hacks we were doing in ISel lowering to preserve the original load as the operand of a LOAD_SPLAT node. The fix is to properly lower the splatted use of the load to a separate LOAD_SPLAT node. Reviewers: aheejin Subscribers: dschuff, sbc100, jgravelle-google, hiraditya, sunfish, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D69640	2019-10-31 14:59:30 -07:00
Thomas Lively	11850a6305	[WebAssembly] Expand setcc of v2i64 Summary: The SIMD spec does not include i64x2 comparisons, so they need to be expanded. Using setOperationAction to expand them also causes f64x2 comparisons to be expanded, so setCondCodeAction needs to be used instead. But since there are no legal condition codes, the legalizer does not know how to expand the comparisons. We therefore manually unroll the operation, taking care to fill each lane with -1 or 0 rather than 1 or 0 for consistency with the other vector comparisons. Reviewers: aheejin Subscribers: dschuff, sbc100, jgravelle-google, hiraditya, sunfish, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D69604	2019-10-31 14:22:30 -07:00
Craig Topper	a0aef63208	[X86] Remove FSIN/FCOS isel patterns and the pseudo instructions that they selected for the FP stackifier. We always expand these to libcalls so get rid of the last vestiges of using the instructions.	2019-10-31 13:42:01 -07:00
Evandro Menezes	f9af4ccb8a	[AArch64] Update for Exynos Fix the costs of `add` and `orr` with an immediate operand.	2019-10-31 15:25:22 -05:00
Simon Pilgrim	04813ded98	Revert rG0e252ae19ff8d99a59d64442c38eeafa5825d441 : [X86] Enable YMM memcmp with AVX1 Breaks build bots Differential Revision: https://reviews.llvm.org/D69658	2019-10-31 19:05:04 +00:00
David Zarzycki	0e252ae19f	[X86] Enable YMM memcmp with AVX1 Update TargetTransformInfo to allow AVX1 to use YMM registers for memcmp. This is a follow up to D68632 which enabled XOR compares which made this possible. https://reviews.llvm.org/D69658	2019-10-31 20:07:07 +02:00
Simon Pilgrim	3842b94c4e	Revert rG57ee0435bd47f23f3939f402914c231b4f65ca5e - [TII] Use optional destination and source pair as a return value; NFC This is breaking MSVC builds: http://lab.llvm.org:8011/builders/llvm-clang-x86_64-expensive-checks-win/builds/20375	2019-10-31 18:00:29 +00:00
David Green	2179867ddc	[AArch64] Select saturating Neon instructions This adds some extra patterns to select AArch64 Neon SQADD, UQADD, SQSUB and UQSUB from the existing target independent sadd_sat, uadd_sat, ssub_sat and usub_sat nodes. It does not attempt to replace the existing int_aarch64_neon_uqadd intrinsic nodes as they are apparently used for both scalar and vector, and need to be legal on scalar types for some of the patterns to work. The int_aarch64_neon_uqadd on scalar would move the two integers into floating point registers, perform a Neon uqadd and move the value back. I don't believe this is good idea for uadd_sat to do the same as the scalar alternative is simpler (an adds with a csinv). For signed it may be smaller, but I'm not sure about it being better. So this just adds some extra patterns for the existing vector instructions, matching on the _sat nodes. Differential Revision: https://reviews.llvm.org/D69374	2019-10-31 17:28:36 +00:00
Matt Arsenault	1725f28841	DAG: Add new control for ISD::FMAD formation For AMDGPU this depends on whether denormals are enabled in the default FP mode for the function. Currently this is treated as a subtarget feature, so FMAD is selectively legal based on that. I want to move this out of the subtarget features so this can be controlled with a denormal mode attribute. Additionally, this will allow folding based on a future ftz fast math flag.	2019-10-31 07:51:38 -07:00
Matt Arsenault	bc56166281	AMDGPU: Simplify getAddressSpace calls These can be directly taken from the GlobalValue instead of going through the type.	2019-10-31 07:51:38 -07:00
Djordje Todorovic	57ee0435bd	[TII] Use optional destination and source pair as a return value; NFC Refactor usage of isCopyInstrImpl, isCopyInstr and isAddImmediate methods to return optional machine operand pair of destination and source registers. Patch by Nikola Prica Differential Revision: https://reviews.llvm.org/D69622	2019-10-31 15:34:49 +01:00
Simon Pilgrim	a780b94cd1	[X86][SSE] Convert computeZeroableShuffleElements to emit KnownUndef and KnownZero	2019-10-31 11:21:39 +00:00
David Candler	92aa0c2dbc	[cfi] Add flag to always generate .debug_frame This adds a flag to LLVM and clang to always generate a .debug_frame section, even if other debug information is not being generated. In situations where .eh_frame would normally be emitted, both .debug_frame and .eh_frame will be used. Differential Revision: https://reviews.llvm.org/D67216	2019-10-31 09:48:30 +00:00
Ehsan Amiri	ed7bcb2cb1	[AArch64][SVE] Add patterns for some integer vector instructions Add pattern matching for SVE vector instructions: -- add, sub, and, or, xor instructions -- sqadd, uqadd, sqsub, uqsub target-independent intrinsics -- bic intrinsics -- predicated add, sub, subr intrinsics Patch Review: https://reviews.llvm.org/D69128 Patch authored by: dancgr (Danilo Carvalho Grael)	2019-10-30 21:52:19 -04:00
Craig Topper	8f48ba993b	[X86] Model MXCSR for all SSE instructions This patch adds MXCSR as a reserved physical register and models its use by X86 SSE instructions. It also adds flag "mayRaiseFPException" for the instructions that possibly can raise FP exception according to the architecture definition. Following what SystemZ and other targets does, only the current rounding modes and the IEEE exception masks are modeled. Changes of the MXCSR due to exceptions are not modeled. Patch by Pengfei Wang Differential Revision: https://reviews.llvm.org/D68121	2019-10-30 15:07:49 -07:00
Matt Arsenault	d9e0a2942a	AMDGPU: Disallow spill folding with m0 copies readlane and writelane instructions are not allowed to use m0 as the data operand, so spilling them is tricky and would require an intermediate SGPR to spill it. Constrain the virtual register class in this caes to disallow the inline spiller from folding the m0 operand directly into the spill instruction. I copied this hack from AArch64 which has the same problem for $sp.	2019-10-30 14:56:33 -07:00
Matt Arsenault	edca9ac0de	AMDGPU: Don't fold S_NOPs with implicit operands	2019-10-30 14:40:56 -07:00
Craig Topper	6cb181f086	[X86] Rewrite hasReassociableOperands and setSpecialOperandAttr to not hardcode number of operands or position of the EFLAGS operand. This makes the code immune to the MXCSR addition in D68121.	2019-10-30 14:34:10 -07:00
Evandro Menezes	215da6606c	[clang][llvm] Obsolete Exynos M1 and M2	2019-10-30 15:02:59 -05:00
Evandro Menezes	42c8fae9d1	[AArch64] Remove overlapping scheduling definitions (NFC) The scheduling definitions for ASIMD transpose and zipping overlapped with others a few lines below. Somehow, they didn't raise errors before. There seem to be other overlapping definitions. Somehow, they still don't raise errors. Differential revision: https://reviews.llvm.org/D68353	2019-10-30 13:57:27 -05:00
Simon Pilgrim	f25f3d39df	[X86] Add FIXME comment to merge more of computeZeroableShuffleElements and getTargetShuffleAndZeroables	2019-10-30 18:30:01 +00:00

1 2 3 4 5 ...

54540 Commits