llvm-project

Commit Graph

Author	SHA1	Message	Date
Tim Northover	1f1b2756a4	ARM: make sure ARM-mode pseudo-inst requires IsARM I'd forgotten that "Requires" blocks override rather than add to the constraints, so my pseudo-instruction was being selected in Thumb mode leading to nonsense instructions. rdar://problem/14817358 llvm-svn: 189096	2013-08-23 10:16:39 +00:00
Daniel Sanders	3c9a0ad444	[mips][msa] Split MSA128 regset into size-specific sets containing the same registers. llvm-svn: 189095	2013-08-23 10:10:13 +00:00
Jakob Stoklund Olesen	0c00704f27	Use register masks on SPARC call instructions. llvm-svn: 189085	2013-08-23 02:33:47 +00:00
Jakob Stoklund Olesen	a8960a1f7c	Add an OtherPreserved field to the CalleeSaved TableGen class. This field specifies registers that are preserved across function calls, but that should not be included in the generates SaveList array. This can be used ot generate regmasks for architectures that save registers through other means, like SPARC's register windows. llvm-svn: 189084	2013-08-23 02:25:47 +00:00
Tom Stellard	15e4811455	R600/SI: Fix another case of illegal VGPR to SGPR copy This fixes a crash in Unigine Tropics. https://bugs.freedesktop.org/show_bug.cgi?id=68389 llvm-svn: 189057	2013-08-22 20:21:02 +00:00
Joey Gouly	881eab53be	[ARMv8] Add CodeGen support for VSEL. This uses the ARMcmov pattern that Tim cleaned up in r188995. Thanks to Simon Tatham for his floating point help! llvm-svn: 189024	2013-08-22 15:29:11 +00:00
Mihai Popa	5500c0ff89	Fix ARM vcvt encoding when the number of fractional bits is zero. The instruction to convert between floating point and fixed point representations takes an immediate operand for the number of fractional bits of the fixed point value. ARMARM specifies that when that number of bits is zero, the assembler should encode floating point/integer conversion instructions. This patch adds the necessary instruction aliases to achieve this behaviour. llvm-svn: 189009	2013-08-22 13:16:07 +00:00
Joey Gouly	e1de9e9c33	[ARM] Constrain some register classes in EmitAtomicBinary64 so that we pass these tests with -verify-machineinstrs. llvm-svn: 189006	2013-08-22 12:19:24 +00:00
Elena Demikhovsky	c35219e3ee	AVX-512: Added masked SHIFT commands, more encoding tests llvm-svn: 189005	2013-08-22 12:18:28 +00:00
Logan Chien	2361f51e82	Fix ARM FastISel PIC function call. The function call to external function should come with PLT relocation type if the PIC relocation model is used. llvm-svn: 189002	2013-08-22 12:08:04 +00:00
Tim Northover	421804420d	ARM: use TableGen patterns to select CMOV operations. Back in the mists of time (2008), it seems TableGen couldn't handle the patterns necessary to match ARM's CMOV node that we convert select operations to, so we wrote a lot of fairly hairy C++ to do it for us. TableGen can deal with it now: there were a few minor differences to CodeGen (see tests), but nothing obviously worse that I could see, so we should probably address anything that does come up in a localised manner. llvm-svn: 188995	2013-08-22 09:57:11 +00:00
Tim Northover	2ddeeed096	ARM: respect tied 64-bit inlineasm operands when printing The code for 'Q' and 'R' operand modifiers needs to look through tied operands to discover the register class. llvm-svn: 188990	2013-08-22 06:51:04 +00:00
Jim Grosbach	6a7a727174	ARM: R9 is not safe to use for tcGPR. Indirect tail-calls shouldn't use R9 for the branch destination, as it's not reliably a call-clobbered register. rdar://14793425 llvm-svn: 188967	2013-08-22 00:14:24 +00:00
Tom Stellard	f6d8023ca4	R600: Remove unnecessary casts Spotted by Bill Wendling. llvm-svn: 188942	2013-08-21 22:14:17 +00:00
Bill Wendling	0cb8c0b1c2	Remove use of forbidden 'iostream' header. Also obsessively reorder the headers to be in something closer to alphabetical order. llvm-svn: 188928	2013-08-21 20:36:42 +00:00
Hao Liu	546bcd2f50	A minor change for an obvous problem caused by r188451: def imm0_63 : Operand<i32>, ImmLeaf<i32, [{ return Imm >= 0 && Imm < 63;}]>{ As it seems Imm <63 should be Imm <= 63. ImmLeaf is used in pattern match, but there is already a function check the shift amount range, so just remove ImmLeaf. Also add a test to check 63. llvm-svn: 188911	2013-08-21 17:47:53 +00:00
Mihai Popa	ae1112bae5	Make "mov" work for all Thumb2 MOV encodings According to the ARM specification, "mov" is a valid mnemonic for all Thumb2 MOV encodings. To achieve this, the patch adds one instruction alias with a special range condition to avoid collision with the Thumb1 MOV. llvm-svn: 188901	2013-08-21 13:14:58 +00:00
Elena Demikhovsky	33d447a2d6	AVX-512: Added SHIFT instructions. llvm-svn: 188899	2013-08-21 09:36:02 +00:00
Richard Sandiford	7d86e47d04	[SystemZ] Define remainig *MUL_LOHI patterns The initial port used MLG(R) for i64 UMUL_LOHI but left the other three combinations as not-legal-or-custom. Although 32x32->{32,32} multiplications exist, they're not as quick as doing a normal 64-bit multiplication, so it didn't seem like i32 SMUL_LOHI and UMUL_LOHI would be useful. There's also no direct instruction for i64 SMUL_LOHI, so it needs to be implemented in terms of UMUL_LOHI. However, not defining these patterns means that we don't convert division by a constant into multiplication, so this patch fills in the other cases. The new i64 SMUL_LOHI sequence is simpler than the one that we used previously for 64x64->128 multiplication, so int-mul-08.ll now tests the full sequence. llvm-svn: 188898	2013-08-21 09:34:56 +00:00
Daniel Sanders	41194e3f9e	[mips][msa] Matheus Almeida pointed out a silly mistake in r188893. Fixed it. I accidentally changed the encoding of the MSA registers to zero instead of 0 to 31. This change restores the encoding the registers had prior to r188893. This didn't show up in the existing tests because direct-object emission isn't implemented yet for MSA. llvm-svn: 188896	2013-08-21 09:09:52 +00:00
Richard Sandiford	af5f66ac9e	[SystemZ] Use FI[EDX]BRA for codegen llvm-svn: 188895	2013-08-21 09:04:20 +00:00
Richard Sandiford	8e92c389e4	[SystemZ] Add FI[EDX]BRA These are extensions of the existing FI[EDX]BR instructions, but use a spare bit to suppress inexact conditions. llvm-svn: 188894	2013-08-21 08:58:08 +00:00
Daniel Sanders	ec12322a28	[mips][msa] Define registers using foreach No functional change llvm-svn: 188893	2013-08-21 08:48:25 +00:00
Craig Topper	77df9cdd0b	Synchronize VEX JIT encoding code with the MCJIT version. Fix a bug in the MCJIT code where CurOp was being incremented even if the operand it was pointing at wasn't used. Maybe only matters if there are any EVEX_K instructions that aren't VEX_4V. llvm-svn: 188868	2013-08-21 05:57:45 +00:00
Nadav Rotem	7efc04cb40	In LLVM FMA3 operands are dst, src1, src2, src3, however dst is not encoded as it is always src1. This was causing the encoding of the operands to be off by one. Patch by Chris Bieneman. llvm-svn: 188866	2013-08-21 05:03:10 +00:00
Craig Topper	5c94bb8551	Rename mattr names for AVX-512 to from avx-512 -> avx512f, avx-512-pfi -> av512pf, avx-512-cdi -> avx512cd, avx-512-eri->avx512er. This matches better with official docs and what gcc patches appearto be using. I didn't touch the has* functions or the feature flag names to avoid change the td and lowering file while commits are still happening. llvm-svn: 188859	2013-08-21 03:57:57 +00:00
NAKAMURA Takumi	de8880a23d	X86TargetMachine.cpp: Clarify to emit GOT in i686-{cygming\|win32}-elf for mcjit. I suppose all "lli -use-mcjit i686-*" should require GOT, (and to fail.) llvm-svn: 188856	2013-08-21 02:37:25 +00:00
Akira Hatanaka	39f915b58a	[micromips] Print instruction alias "not" if the last operand of a nor is zero. llvm-svn: 188851	2013-08-21 01:18:46 +00:00
Akira Hatanaka	9a1fb6b9fc	[mips] Add support for mfhc1 and mthc1. llvm-svn: 188848	2013-08-20 23:47:25 +00:00
Akira Hatanaka	bfb6624797	[mips] Add support for calling convention CC_MipsO32_FP64, which is used when the size of floating point registers is 64-bit. Test case will be added when support for mfhc1 and mthc1 is added. llvm-svn: 188847	2013-08-20 23:38:40 +00:00
Akira Hatanaka	8dd951bc9f	[mips] Remove predicates that were incorrectly or unnecessarily added. llvm-svn: 188845	2013-08-20 23:21:55 +00:00
Akira Hatanaka	14e31a2fe7	[mips] Define register class FGRH32 for the high half of the 64-bit floating point registers. We will need this register class later when we add definitions for instructions mfhc1 and mthc1. Also, remove sub-register indices sub_fpeven and sub_fpodd and use sub_lo and sub_hi instead. llvm-svn: 188842	2013-08-20 22:58:56 +00:00
Akira Hatanaka	6781fc1648	[mips] Resolve register classes dynamically using ptr_rc to reduce the number of load/store instructions defined. Previously, we were defining load/store instructions for each pointer size (32 and 64-bit), but now we need just one definition. llvm-svn: 188830	2013-08-20 21:08:22 +00:00
Reed Kotler	d8f3362557	Add an option which permits the user to specify using a bitmask, that various functions be compiled as mips32, without having to add attributes. This is useful in certain situations where you don't want to have to edit the function attributes in the source. For now it's only an option used for the compiler developers when debugging the mips16 port. llvm-svn: 188826	2013-08-20 20:53:09 +00:00
Akira Hatanaka	a43b56d9af	[mips] Guard micromips instructions with predicate InMicroMips. Also, fix assembler predicate HasStdEnd so that it is false when the target is micromips. llvm-svn: 188824	2013-08-20 20:46:51 +00:00
Jim Grosbach	71a78f962b	ARM: Fix fast-isel copy/paste-o. Update testcase to be more careful about checking register values. While regexes are general goodness for these sorts of testcases, in this example, the registers are constrained by the calling convention, so we can and should check their explicit values. rdar://14779513 llvm-svn: 188819	2013-08-20 19:12:42 +00:00
Elena Demikhovsky	540d582594	AVX-512: Added more patterns for VMOVSS, VMOVSD, VMOVD, VMOVQ llvm-svn: 188786	2013-08-20 11:00:29 +00:00
Daniel Sanders	4260527f5f	[mips][msa] Removed fcge, fcgt, fsge, fsgt These instructions were present in a draft spec but were removed before publication. llvm-svn: 188782	2013-08-20 09:41:47 +00:00
Richard Sandiford	2bf7b8cc4e	[SystemZ] Update README We now use MVST, CLST and SRST for the obvious cases. llvm-svn: 188781	2013-08-20 09:40:35 +00:00
Richard Sandiford	6f6d55161b	[SystemZ] Use SRST to optimize memchr SystemZTargetLowering::emitStringWrapper() previously loaded the character into R0 before the loop and made R0 live on entry. I'd forgotten that allocatable registers weren't allowed to be live across blocks at this stage, and it confused LiveVariables enough to cause a miscompilation of f3 in memchr-02.ll. This patch instead loads R0 in the loop and leaves LICM to hoist it after RA. This is actually what I'd tried originally, but I went for the manual optimisation after noticing that R0 often wasn't being hoisted. This bug forced me to go back and look at why, now fixed as r188774. We should also try to optimize null checks so that they test the CC result of the SRST directly. The select between null and the SRST GPR result could then usually be deleted as dead. llvm-svn: 188779	2013-08-20 09:38:48 +00:00
Daniel Sanders	f2a0f1d133	[mips][msa] Added insve llvm-svn: 188777	2013-08-20 09:22:54 +00:00
Tim Northover	f79c3a5aef	ARM: implement some simple f64 materializations. Previously we used a const-pool load for virtually all 64-bit floating values. Actually, we can get quite a few common values (including 0.0, 1.0) via "vmov" instructions of one stripe or another. llvm-svn: 188773	2013-08-20 08:57:11 +00:00
Daniel Sanders	869bdad93a	[mips][msa] Added and.v, bmnz.v, bmz.v, bsel.v, nor.v, or.v, xor.v llvm-svn: 188767	2013-08-20 08:38:21 +00:00
Craig Topper	7a8cf01090	Fix formatting. No functional change. llvm-svn: 188746	2013-08-20 05:23:59 +00:00
Craig Topper	e13a066c94	Add AVX-512 and related features to the CPUID detection code. llvm-svn: 188745	2013-08-20 05:22:42 +00:00
Craig Topper	fd2b389263	Move AVX and non-AVX replication inside a couple multiclasses to avoid repeating each instruction for both individually. llvm-svn: 188743	2013-08-20 04:24:14 +00:00
Bill Schmidt	f381afc906	[PowerPC] More refactoring prior to real PPC emitPrologue/Epilogue changes. (Patch committed on behalf of Mark Minich, whose log entry follows.) This is a continuation of the refactorings performed in svn rev 188573 (see that rev's comments for more detail). This is my stage 2 refactoring: I combined the emitPrologue() & emitEpilogue() PPC32 & PPC64 code into a single flow, simplifying a lot of the code since in essence the PPC32 & PPC64 code generation logic is the same, only the instruction forms are different (in most cases). This simplification is necessary because my functional changes (yet to come) add significant complexity, and without the simplification of my stage 2 refactoring, the overall complexity of both emitPrologue() & emitEpilogue() would have become almost intractable for most mortal programmers (like me). This submission was intended to be a pure refactoring (no functional changes whatsoever). However, in the process of combining the PPC32 & PPC64 flows, I spotted a difference that I believe is a bug (see svn rev 186478 line 863, or svn rev 188573 line 888): This line appears to be restoring the BP with the original FP content, not the original BP content. When I merged the 32-bit and 64-bit code, I used the corresponding code from the 64-bit flow, which I believe uses the correct offset (BPOffset) for this operation. llvm-svn: 188741	2013-08-20 03:12:23 +00:00
Venkatraman Govindaraju	f625773bca	[Sparc] Use HWEncoding instead of unused Num field in Sparc register definitions. Also, correct the definitions of RETL and RET instructions. llvm-svn: 188738	2013-08-20 01:26:14 +00:00
Hal Finkel	0c5c01aa4a	Add a llvm.copysign intrinsic This adds a llvm.copysign intrinsic; We already have Libfunc recognition for copysign (which is turned into the FCOPYSIGN SDAG node). In order to autovectorize calls to copysign in the loop vectorizer, we need a corresponding intrinsic as well. In addition to the expected changes to the language reference, the loop vectorizer, BasicTTI, and the SDAG builder (the intrinsic is transformed into an FCOPYSIGN node, just like the function call), this also adds FCOPYSIGN to a few lists in LegalizeVector{Ops,Types} so that vector copysigns can be expanded. In TargetLoweringBase::initActions, I've made the default action for FCOPYSIGN be Expand for vector types. This seems correct for all in-tree targets, and I think is the right thing to do because, previously, there was no way to generate vector-values FCOPYSIGN nodes (and most targets don't specify an action for vector-typed FCOPYSIGN). llvm-svn: 188728	2013-08-19 23:35:46 +00:00
Hal Finkel	1cf48ab811	Don't form PPC CTR-based loops around a copysignl call copysign/copysignf never become function calls (because the SDAG expansion code does not lower to the corresponding function call, but rather directly implements the associated logic), but copysignl almost always is lowered into a call to the requested libm functon (and, thus, might clobber CTR). llvm-svn: 188727	2013-08-19 23:35:24 +00:00

1 2 3 4 5 ...

25314 Commits