llvm-project

Commit Graph

Author	SHA1	Message	Date
Akira Hatanaka	a66d676b20	Define ADJCALLSTACKDOWN/UP nodes. These nodes are emitted regardless of whether or not it is in mips16 mode. Define MipsPseudo (mode-independant pseudo) and PseudoSE (mips32/64 pseudo) classes. llvm-svn: 161071	2012-07-31 19:13:07 +00:00
Akira Hatanaka	3a810eda91	Change name of class MipsInst to InstSE to distinguish it from mips16's instruction class. SE stands for standard encoding. llvm-svn: 161069	2012-07-31 18:55:01 +00:00
Akira Hatanaka	beda2241a4	When store nodes or memcpy nodes are created to copy the function call arguments to the stack in MipsISelLowering::LowerCall, use stack pointer and integer offset operands rather than frame object operands. llvm-svn: 161068	2012-07-31 18:46:41 +00:00
Chad Rosier	710be7df71	[x86 frame lowering] In 32-bit mode, use ESI as the base pointer. Previously, we were using EBX, but PIC requires the GOT to be in EBX before function calls via PLT GOT pointer. llvm-svn: 161066	2012-07-31 18:29:21 +00:00
Akira Hatanaka	4ce7c4060d	Fix type of LUXC1 and SUXC1. These instructions were incorrectly defined as single-precision load and store. Also avoid selecting LUXC1 and SUXC1 instructions during isel. It is incorrect to map unaligned floating point load/store nodes to these instructions. llvm-svn: 161063	2012-07-31 18:16:49 +00:00
Craig Topper	c2efce404e	Make INSTRUCTION_SPECIFIER_FIELDS match X86DisassemblerCommon.h. Also remove trailing whitespace. llvm-svn: 161029	2012-07-31 05:18:26 +00:00
Craig Topper	fb39f97d4c	Tidy up trailing whitespace llvm-svn: 161027	2012-07-31 04:58:05 +00:00
Craig Topper	5f33d90214	Tidy up trailing whitespace llvm-svn: 161026	2012-07-31 04:38:27 +00:00
Kevin Enderby	5c490f1b8f	Fix a bug in ARMMachObjectWriter::RecordRelocation() in ARMMachObjectWriter.cpp where the other_half of the movt and movw relocation entries needs to get set and only with the 16 bits of the other half. rdar://10038370 llvm-svn: 160978	2012-07-30 18:46:15 +00:00
Craig Topper	efd97044a3	Mark MOVZX16/MOVSX16 as neverHasSideEffects/mayLoad llvm-svn: 160953	2012-07-30 07:14:07 +00:00
Craig Topper	c6b7ef61f4	Mark MOVZX32_NOREX as isCodeGenOnly and neverHasSideEffects. The isCodeGenOnly change allows special detection of _NOREX instructions to be removed from tablegen disassembler code. llvm-svn: 160951	2012-07-30 06:48:11 +00:00
Craig Topper	14eac5dda8	Give VCVTTPD2DQ priority over CVTTPD2DQ. llvm-svn: 160942	2012-07-30 02:20:32 +00:00
Craig Topper	f881d385da	Fix patterns for CVTTPS2DQ to specify SSE2 instead of SSE1. llvm-svn: 160941	2012-07-30 02:14:02 +00:00
Craig Topper	415b3586d0	Fix up patterns for VCVTSS2SD. Specifically give it priority over SSE form. Add an OptForSpeed to explicitly pair up with an OptForSize that was already on another pattern. llvm-svn: 160939	2012-07-30 01:38:57 +00:00
Craig Topper	28402efcb6	Fix load types on intrinsic forms of SS2SD and SD2SS AVX/SSE convert instruction patterns. llvm-svn: 160938	2012-07-29 23:26:34 +00:00
Craig Topper	b6767f3acd	Move more SSE/AVX convert instruction patterns into their definitions. llvm-svn: 160937	2012-07-29 22:30:06 +00:00
Manman Ren	f87dd7c01b	Revert r160920 and r160919 due to dragonegg and clang selfhost failure llvm-svn: 160927	2012-07-29 02:44:09 +00:00
Craig Topper	fc93281c07	Fold patterns for some of the SSE/AVX convert instructions into their instruction definitions. llvm-svn: 160922	2012-07-28 18:59:19 +00:00
Craig Topper	024797b9a2	Mark some of the SSE/AVX convert instructions as mayLoad/neverHasSideEffects. llvm-svn: 160921	2012-07-28 18:36:39 +00:00
Manman Ren	0fa3ab88ba	X86 Peephole: fold loads to the source register operand if possible. Machine CSE and other optimizations can remove instructions so folding is possible at peephole while not possible at ISel. rdar://10554090 and rdar://11873276 llvm-svn: 160919	2012-07-28 16:48:01 +00:00
Craig Topper	44f9b5343d	Make CVTSS2SI instruction definition consistent with CVTSD2SI. llvm-svn: 160914	2012-07-28 08:28:23 +00:00
Craig Topper	1c1aef07b8	Fix up memory load types for SSE scalar convert intrinsic patterns. llvm-svn: 160913	2012-07-28 07:59:59 +00:00
Manman Ren	32367c063b	X86 Peephole: fix PR13475 in optimizeCompare. It is possible that an instruction can use and update EFLAGS. When checking the safety, we should check the usage of EFLAGS first before declaring it is safe to optimize due to the update. llvm-svn: 160912	2012-07-28 03:15:46 +00:00
Akira Hatanaka	97ba7696f8	Pass the correct call frame size to callseq_start node. This is needed to replace uses of function getMaxCallFrameSize defined in MipsFunctionInfo with the one MachineFrameInfo has. llvm-svn: 160841	2012-07-26 23:27:01 +00:00
Jakob Stoklund Olesen	7cd08536c2	Remove the X86 sub_ss and sub_sd sub-register indexes completely. llvm-svn: 160833	2012-07-26 23:07:20 +00:00
Jakob Stoklund Olesen	77cd55b4ee	Remove the last mentions of sub_ss and sub_sd from patterns. I'll remove these two sub-register indexes shortly. llvm-svn: 160831	2012-07-26 23:03:08 +00:00
Jakob Stoklund Olesen	b96d0b4e08	Eliminate sub_ss, sub_sd from broadcast patterns. The (COPY_TO_REGCLASS GR32:$src, VR128) pattern looks odd, but copyPhysReg does the right thing with it. (The old pattern would eventually produce the same cross-class copy). llvm-svn: 160830	2012-07-26 22:59:06 +00:00
Jakob Stoklund Olesen	206b825f5c	Eliminate more sub_ss / sub_sd patterns. This gets rid of some more INSERT_SUBREG - IMPLICIT_DEF patterns, simplifying the emitted code a bit. llvm-svn: 160820	2012-07-26 22:30:18 +00:00
Jakob Stoklund Olesen	75d17b0577	Eliminate some SUBREG_TO_REG patterns with sub_ss and sub_sd. The SUBREG_TO_REG instruction has magic semantics asserting that the source value was defined by an instruction that cleared the high half of the register. Those semantics are never actually exploited for xmm registers. llvm-svn: 160818	2012-07-26 22:03:21 +00:00
Jakob Stoklund Olesen	ceee4a9d0c	Eliminate a batch of uses of sub_ss and sub_sd in the X86 target. These idempotent sub-register indices don't do anything --- They simply map XMM registers to themselves. They no longer affect register classes either since the SubRegClasses field has been removed from Target.td. This patch replaces XMM->XMM EXTRACT_SUBREG and INSERT_SUBREG patterns with COPY_TO_REGCLASS patterns which simply become COPY instructions. The number of IMPLICIT_DEF instructions before register allocation is reduced, and that is the cause of the test case changes. llvm-svn: 160816	2012-07-26 21:40:42 +00:00
Craig Topper	c7690ac7ac	Make l/q suffixes on AVX forms of scalar convert instructions consistent with their non-AVX forms. llvm-svn: 160775	2012-07-26 07:48:28 +00:00
Akira Hatanaka	64626fc20f	Fix call setup for PIC. Patch by Reed Kotler. llvm-svn: 160774	2012-07-26 02:24:43 +00:00
Jim Grosbach	6df755cc4e	ARM: Don't assume an SDNode is a constant. Before accessing a node as a ConstandSDNode, make sure it actually is one. No testcase of non-trivial size. rdar://11948669 llvm-svn: 160735	2012-07-25 17:02:47 +00:00
Nuno Lopes	89702e94b5	make all Emit*() functions consult the TargetLibraryInfo information before creating a call to a library function. Update all clients to pass the TLI information around. Previous draft reviewed by Eli. llvm-svn: 160733	2012-07-25 16:46:31 +00:00
Rafael Espindola	73173c55c2	Fix typos. Thanks to Matt Beaumont-Gay for noticing it. llvm-svn: 160731	2012-07-25 15:42:45 +00:00
Rafael Espindola	11c38b9657	When a return struct pointer is passed in registers, the called has nothing to pop. llvm-svn: 160725	2012-07-25 13:41:10 +00:00
Rafael Espindola	2caee7f4d2	Factor a long list of conditions into a predicate function. No functionality change. llvm-svn: 160724	2012-07-25 13:35:45 +00:00
Akira Hatanaka	5a69c235ae	Eliminate the stack slot used to save the global base register. The long branch pass (fixed in r160601) no longer uses the global base register to compute addresses of branch destinations, so it is not necessary to reserve a slot on the stack. llvm-svn: 160703	2012-07-25 03:16:47 +00:00
Kevin Enderby	216ac31971	Fix a bug in the x86 disassembler's symbolic disassembly support for Jcc-Jump if Condition Is Met instuctions that was not correctly determining the target instruction. So for a jne rel32 instruction: % cat x.s .byte 0x0f, 0x85, 0x09, 0x00, 0x00, 0x00 % as x.s it was incorrectly deterining the target: % otool -q -tv a.out a.out: (__TEXT,__text) section 0000000000000000 jne 0xd and with the fix it gets this correct as: % otool -q -tv a.out a.out: (__TEXT,__text) section 0000000000000000 jne 0xf rdar://11505997 llvm-svn: 160694	2012-07-24 21:40:01 +00:00
Nuno Lopes	342cf787ef	add a few more functions to TargetLibraryInfo: fputc, memchr, memcmp, putchar, puts, strchr, strncmp llvm-svn: 160690	2012-07-24 21:00:36 +00:00
David Chisnall	5b8c1680de	ELF does not imply GNU/Linux. Do not assume GNU conventions just because we are targeting an ELF platform. Only fold gs-relative (and fs-relative) loads if it is actually sensible to do so for the target platform. This fixes PR13438. llvm-svn: 160687	2012-07-24 20:04:16 +00:00
Nuno Lopes	20f5a7aeb7	TargetLibraryInfo: add strn?cat, strn?cpy, and strn?len llvm-svn: 160678	2012-07-24 17:25:06 +00:00
Akira Hatanaka	45da9e2653	Fix function MipsCodeEmitter::emitExternalSymbolAddress to pass test ExecutionEngine/test-fp.ll. Patch by Petar Jovanovic. llvm-svn: 160653	2012-07-24 00:08:26 +00:00
Akira Hatanaka	26e9ecb7a3	Add basic ability to setup call frame, and make procedure calls. Hello world will compile and execute with this patch. Patch by Reed Kotler. llvm-svn: 160651	2012-07-23 23:45:54 +00:00
Akira Hatanaka	adec58c091	Add comment for relocations MO_HIGHER and HIGHEST in MipsBaseInfo.h. llvm-svn: 160636	2012-07-23 19:19:20 +00:00
Micah Villmow	9eedce1e7c	Test revert of test changes. llvm-svn: 160632	2012-07-23 16:42:45 +00:00
Micah Villmow	780c24f19c	Test commit. llvm-svn: 160631	2012-07-23 16:37:24 +00:00
Sylvestre Ledru	35521e2310	Fix a typo (the the => the) llvm-svn: 160621	2012-07-23 08:51:15 +00:00
Akira Hatanaka	f72efdb62f	Fix Mips long branch pass. This pass no longer requires that the global pointer value be saved to the stack or register since it uses bal instruction to compute branch distance. llvm-svn: 160601	2012-07-21 03:30:44 +00:00
Akira Hatanaka	6035fe78c7	Add HIGHER and HIGHEST relocations to Mips backend. llvm-svn: 160599	2012-07-21 03:09:04 +00:00
Akira Hatanaka	b49c68a65d	Revert accidental commit. llvm-svn: 160598	2012-07-21 02:20:33 +00:00
Akira Hatanaka	f73e362758	Add VK_Mips_HIGHER and VK_Mips_HIGHEST to MCSymbolRefExpr::VariantKind. Test case will be added later when long branch patch is checked in. llvm-svn: 160597	2012-07-21 02:15:19 +00:00
Craig Topper	0b94e46ce3	Don't use implicit register operands to calculate L-bit for AVX instructions. Needed because super reg defs and kills are added as implicit operands on 128-bit instructions. Fixes PR13349. Patch by Jose Fonseca. llvm-svn: 160543	2012-07-20 07:03:46 +00:00
Preston Gurd	8e082688a1	Adds the family codes for the Midview Atom processors so that the Atom buildbot will auto-detect Atom. llvm-svn: 160521	2012-07-19 19:05:37 +00:00
Sebastian Pop	221e07e140	default to use -mv4 when no version of Hexagon has been specified This fixes a bunch of make check failures of the form: Unknown Architecture Version. UNREACHABLE executed at ../lib/Target/Hexagon/HexagonSubtarget.cpp:60! llvm-svn: 160518	2012-07-19 18:24:50 +00:00
Jush Lu	e67e07b901	[arm-fast-isel] Add support for vararg function calls. llvm-svn: 160500	2012-07-19 09:49:00 +00:00
Bill Wendling	723444e767	Remove tabs. llvm-svn: 160483	2012-07-19 00:25:04 +00:00
Bill Wendling	318f03f56f	Remove tabs. llvm-svn: 160479	2012-07-19 00:15:11 +00:00
Bill Wendling	ea6397f67b	Remove tabs. llvm-svn: 160477	2012-07-19 00:11:40 +00:00
Bill Wendling	2b07965042	Remove tabs. llvm-svn: 160476	2012-07-19 00:06:06 +00:00
Manman Ren	d0a4ee8427	X86: remove redundant cmp against zero. Updated OptimizeCompare in peephole to remove redundant cmp against zero. We only remove Compare if CF and OF are not used. rdar://11855129 llvm-svn: 160454	2012-07-18 21:40:01 +00:00
Preston Gurd	f0a48ec8f1	This patch fixes 8 out of 20 unexpected failures in "make check" when run on an Intel Atom processor. The failures have arisen due to changes elsewhere in the trunk over the past 8 weeks or so. These failures were not detected by the Atom buildbot because the CPU on the Atom buildbot was not being detected as an Atom CPU. The fix for this problem is in Host.cpp and X86Subtarget.cpp, but shall remain commented out until the current set of Atom test failures are fixed. Patch by Andy Zhang and Tyler Nowicki! llvm-svn: 160451	2012-07-18 20:49:17 +00:00
Andrew Trick	a22cdb713b	Fix ARMTargetLowering::isLegalAddImmediate to consider thumb encodings. Based on Evan's suggestion without a commitable test. llvm-svn: 160441	2012-07-18 18:34:27 +00:00
Andrew Trick	bc325168c3	whitespace llvm-svn: 160440	2012-07-18 18:34:24 +00:00
Nadav Rotem	4c12245b3a	The vbroadcast family of instructions has 'fallback patterns' in case where the load source operand is used by multiple nodes. The v2i64 broadcast was emulated by shuffling the two lower i32 elements to the upper two. We had a bug in the immediate used for the broadcast. Replacing 0 to 0x44. 0x44 means [01\|00\|01\|00] which corresponds to the correct lane. Patch by Michael Kuperstein. llvm-svn: 160430	2012-07-18 08:14:48 +00:00
Jack Carter	a62ba82825	Mips specific inline asm operand modifier 'M': Print the high order register of a double word register operand. In 32 bit mode, a 64 bit double word integer will be represented by 2 32 bit registers. This modifier causes the high order register to be used in the asm expression. It is useful if you are using doubles in assembler and continue to control register to variable relationships. This patch also fixes a related bug in a previous patch: case 'D': // Second part of a double word register operand case 'L': // Low order register of a double word register operand case 'M': // High order register of a double word register operand I got 'D' and 'M' confused. The second part of a double word operand will only match 'M' for one of the endianesses. I had 'L' and 'D' be the opposite twins when 'L' and 'M' are. llvm-svn: 160429	2012-07-18 06:41:36 +00:00
Craig Topper	6bf3ed454a	Remove tab characters. llvm-svn: 160425	2012-07-18 04:59:16 +00:00
Craig Topper	8532423268	Fix typo in error message and remove some tab characters. llvm-svn: 160423	2012-07-18 04:36:35 +00:00
Craig Topper	01deb5f2df	Make x86 asm parser to check for xmm vs ymm for index register in gather instructions. Also fix Intel syntax for gather instructions to use 'DWORD PTR' or 'QWORD PTR' to match gas. llvm-svn: 160420	2012-07-18 04:11:12 +00:00
Joel Jones	b84f7bea09	More replacing of target-dependent intrinsics with target-indepdent intrinsics. The second instruction(s) to be handled are the vector versions of count set bits (ctpop). The changes here are to clang so that it generates a target independent vector ctpop when it sees an ARM dependent vector bits set count. The changes in llvm are to match the target independent vector ctpop and in VMCore/AutoUpgrade.cpp to update any existing bc files containing ARM dependent vector pop counts with target-independent ctpops. There are also changes to an existing test case in llvm for ARM vector count instructions and to a test for the bitcode upgrade. <rdar://problem/11892519> There is deliberately no test for the change to clang, as so far as I know, no consensus has been reached regarding how to test neon instructions in clang; q.v. <rdar://problem/8762292> llvm-svn: 160410	2012-07-18 00:02:16 +00:00
Akira Hatanaka	f640f040d1	Clean up Mips16InstrFormats.td and Mips16InstrInfo.td. Patch by Reed Kotler. llvm-svn: 160403	2012-07-17 22:55:34 +00:00
Evan Cheng	e6a3b03ee0	Back out r160101 and instead implement a dag combine to recover from instcombine transformation. llvm-svn: 160387	2012-07-17 18:54:11 +00:00
Evan Cheng	780f9b5f92	Implement r160312 as target indepedenet dag combine. llvm-svn: 160354	2012-07-17 08:31:11 +00:00
Evan Cheng	f579beca6d	This is another case where instcombine demanded bits optimization created large immediates. Add dag combine logic to recover in case the large immediates doesn't fit in cmp immediate operand field. int foo(unsigned long l) { return (l>> 47) == 1; } we produce %shr.mask = and i64 %l, -140737488355328 %cmp = icmp eq i64 %shr.mask, 140737488355328 %conv = zext i1 %cmp to i32 ret i32 %conv which codegens to movq $0xffff800000000000,%rax andq %rdi,%rax movq $0x0000800000000000,%rcx cmpq %rcx,%rax sete %al movzbl %al,%eax ret TargetLowering::SimplifySetCC would transform (X & -256) == 256 -> (X >> 8) == 1 if the immediate fails the isLegalICmpImmediate() test. For x86, that's immediates which are not a signed 32-bit immediate. Based on a patch by Eli Friedman. PR10328 rdar://9758774 llvm-svn: 160346	2012-07-17 06:53:39 +00:00
Evan Cheng	75315b877c	For something like uint32_t hi(uint64_t res) { uint_32t hi = res >> 32; return !hi; } llvm IR looks like this: define i32 @hi(i64 %res) nounwind uwtable ssp { entry: %lnot = icmp ult i64 %res, 4294967296 %lnot.ext = zext i1 %lnot to i32 ret i32 %lnot.ext } The optimizer has optimize away the right shift and truncate but the resulting constant is too large to fit in the 32-bit immediate field. The resulting x86 code is worse as a result: movabsq $4294967296, %rax ## imm = 0x100000000 cmpq %rax, %rdi sbbl %eax, %eax andl $1, %eax This patch teaches the x86 lowering code to handle ult against a large immediate with trailing zeros. It will issue a right shift and a truncate followed by a comparison against a shifted immediate. shrq $32, %rdi testl %edi, %edi sete %al movzbl %al, %eax It also handles a ugt comparison against a large immediate with trailing bits set. i.e. X > 0x0ffffffff -> (X >> 32) >= 1 rdar://11866926 llvm-svn: 160312	2012-07-16 19:35:43 +00:00
Tom Stellard	1be1aa84ec	Revert "AMDGPU: Add core backend files for R600/SI codegen v6" This reverts commit 4ea70107c5e51230e9e60f0bf58a0f74aa4885ea. llvm-svn: 160303	2012-07-16 18:19:53 +00:00
Tom Stellard	95bd0be903	Revert "Build script changes for R600/SI Codegen v6" This reverts commit e3013202259ed1e006c21817c63cf25d75982721. llvm-svn: 160301	2012-07-16 18:19:46 +00:00
Tom Stellard	151dc338e4	Revert "Target/AMDGPU/R600KernelParameters.cpp: Fix two includes, <llvm/IRBuilder.h> and <llvm/TypeBuilder.h>" This reverts commit 0258a6bdd30802f5cc0e8e57c8e768fde2aef590. llvm-svn: 160299	2012-07-16 18:19:41 +00:00
Tom Stellard	1bd3012505	Revert "Target/AMDGPU: [CMake] Fix dependencies. 1) Add intrinsics_gen. Add AMDGPUCommonTableGen." This reverts commit ebc934ba32ee71abbb8f0f2eb6a0fbaa613ba0d2. llvm-svn: 160298	2012-07-16 18:19:40 +00:00
Tom Stellard	781853e11f	Revert "Target/AMDGPU/R600KernelParameters.cpp: Don't use "and", "or" as conditional operator..." This reverts commit 29f28bc14ad5a907f5dc849f004fafeec0aab33a. llvm-svn: 160297	2012-07-16 18:19:38 +00:00
Tom Stellard	2e007de42d	Revert "Target/AMDGPU/AMDILIntrinsicInfo.cpp: Use llvm_unreachable() in nonreturn function, instead of assert(0)." This reverts commit 4ba4acc1bc2561b944a571edbb6a2dc78e357dfe. llvm-svn: 160296	2012-07-16 18:19:37 +00:00
Tom Stellard	f65e78b2fa	Revert "Target/AMDGPU: Fix includes, or msvc build failed." This reverts commit fef4aa1b16fcf7a472559abbbcf4c1adc9eb5ca6. llvm-svn: 160295	2012-07-16 18:19:32 +00:00
Chad Rosier	10e8207c9e	With r160248 in place this code is no longer needed. llvm-svn: 160293	2012-07-16 17:42:13 +00:00
NAKAMURA Takumi	96cc5e5bf9	Target/AMDGPU: Fix includes, or msvc build failed. llvm-svn: 160280	2012-07-16 15:43:50 +00:00
NAKAMURA Takumi	dc4261794f	Target/AMDGPU/AMDILIntrinsicInfo.cpp: Use llvm_unreachable() in nonreturn function, instead of assert(0). llvm-svn: 160279	2012-07-16 15:43:09 +00:00
NAKAMURA Takumi	5f5fd8e545	Target/AMDGPU/R600KernelParameters.cpp: Don't use "and", "or" as conditional operator... llvm-svn: 160278	2012-07-16 15:42:35 +00:00
Jack Carter	f649043aa5	Doubleword Shift Left Logical Plus 32 Mips shift instructions DSLL, DSRL and DSRA are transformed into DSLL32, DSRL32 and DSRA32 respectively if the shift amount is between 32 and 63 Here is a description of DSLL: Purpose: Doubleword Shift Left Logical Plus 32 To execute a left-shift of a doubleword by a fixed amount--32 to 63 bits Description: GPR[rd] <- GPR[rt] << (sa+32) The 64-bit doubleword contents of GPR rt are shifted left, inserting zeros into the emptied bits; the result is placed in GPR rd. The bit-shift amount in the range 0 to 31 is specified by sa. This patch implements the direct object output of these instructions. llvm-svn: 160277	2012-07-16 15:14:51 +00:00
NAKAMURA Takumi	bb42a5e2cf	Target/AMDGPU: [CMake] Fix dependencies. 1) Add intrinsics_gen. Add AMDGPUCommonTableGen. llvm-svn: 160276	2012-07-16 15:09:11 +00:00
NAKAMURA Takumi	3128d26124	Target/AMDGPU/R600KernelParameters.cpp: Fix two includes, <llvm/IRBuilder.h> and <llvm/TypeBuilder.h> llvm-svn: 160275	2012-07-16 15:08:47 +00:00
Tom Stellard	812e652b43	Build script changes for R600/SI Codegen v6 llvm-svn: 160272	2012-07-16 14:17:16 +00:00
Tom Stellard	bcce80fa95	AMDGPU: Add core backend files for R600/SI codegen v6 llvm-svn: 160270	2012-07-16 14:17:08 +00:00
Nadav Rotem	4968e45b9f	Fix a bug in the 3-address conversion of LEA when one of the operands is an undef virtual register. The problem is that ProcessImplicitDefs removes the definition of the register and marks all uses as undef. If we lose the undef marker then we get a register which has no def, is not marked as undef. The live interval analysis does not collect information for these virtual registers and we crash in later passes. Together with Michael Kuperstein <michael.m.kuperstein@intel.com> llvm-svn: 160260	2012-07-16 10:52:25 +00:00
Alexey Samsonov	dcc1291d17	This CL changes the function prologue and epilogue emitted on X86 when stack needs realignment. It is intended to fix PR11468. Old prologue and epilogue looked like this: push %rbp mov %rsp, %rbp and $alignment, %rsp push %r14 push %r15 ... pop %r15 pop %r14 mov %rbp, %rsp pop %rbp The problem was to reference the locations of callee-saved registers in exception handling: locations of callee-saved had to be re-calculated regarding the stack alignment operation. It would take some effort to implement this in LLVM, as currently MachineLocation can only have the form "Register + Offset". Funciton prologue and epilogue are now changed to: push %rbp mov %rsp, %rbp push %14 push %15 and $alignment, %rsp ... lea -$size_of_saved_registers(%rbp), %rsp pop %r15 pop %r14 pop %rbp Reviewed by Chad Rosier. llvm-svn: 160248	2012-07-16 06:54:09 +00:00
Nadav Rotem	eec74c7279	Teach getTargetVShiftNode about TargetConstant nodes. llvm-svn: 160234	2012-07-15 20:27:43 +00:00
Nadav Rotem	ee3552f88d	Rename VBROADCASTSDrm into VBROADCASTSDYrm to match the naming convention. Allow the folding of vbroadcastRR to vbroadcastRM, where the memory operand is a spill slot. PR12782. Together with Michael Kuperstein <michael.m.kuperstein@intel.com> llvm-svn: 160230	2012-07-15 12:26:30 +00:00
Nadav Rotem	9466e81df6	AVX: Fix a bug in getTargetVShiftNode. The shift amount has to be a 128bit vector with the same element type as the input vector. This is needed because of the patterns we have for the VP[SLL/SRA/SRL][W/D/Q] instructions. llvm-svn: 160222	2012-07-14 22:26:05 +00:00
Joel Jones	43cb87839c	This is one of the first steps at moving to replace target-dependent intrinsics with target-indepdent intrinsics. The first instruction(s) to be handled are the vector versions of count leading zeros (ctlz). The changes here are to clang so that it generates a target independent vector ctlz when it sees an ARM dependent vector ctlz. The changes in llvm are to match the target independent vector ctlz and in VMCore/AutoUpgrade.cpp to update any existing bc files containing ARM dependent vector ctlzs with target-independent ctlzs. There are also changes to an existing test case in llvm for ARM vector count instructions and a new test for the bitcode upgrade. <rdar://problem/11831778> There is deliberately no test for the change to clang, as so far as I know, no consensus has been reached regarding how to test neon instructions in clang; q.v. <rdar://problem/8762292> llvm-svn: 160200	2012-07-13 23:25:25 +00:00
Jakob Stoklund Olesen	ed6c0408fa	Remove variable_ops from call instructions in most targets. Call instructions are no longer required to be variadic, and variable_ops should only be used for instructions that encode a variable number of arguments, like the ARM stm/ldm instructions. llvm-svn: 160189	2012-07-13 20:44:29 +00:00
Jakob Stoklund Olesen	6a81d30269	Remove variable_ops from ARM call instructions. Function argument registers are added to the call SDNode, but InstrEmitter now knows how to make those operands implicit, and the call instruction doesn't have to be variadic. Explicit register operands should only be those that are encoded in the instruction, implicit register operands are for extra dependencies like call argument and return values. llvm-svn: 160188	2012-07-13 20:27:00 +00:00
Jack Carter	5ddcfda8ef	The Mips specific relocation R_MIPS_GOT_DISP is used in cases where global symbols are directly represented in the GOT and we use an offset into the global offset table. This patch adds direct object support for R_MIPS_GOT_DISP. llvm-svn: 160183	2012-07-13 19:15:47 +00:00
Benjamin Kramer	abbfe69356	Make helper functions static. llvm-svn: 160173	2012-07-13 13:25:15 +00:00
Craig Topper	b3bac4908e	Mark VINSERTI128rm as MayLoad=1. Fixes PR13348. llvm-svn: 160162	2012-07-13 05:46:28 +00:00
Benjamin Kramer	4d0916788d	Give the rdrand instructions a SideEffect flag and a chain so MachineCSE and MachineLICM don't touch it. I already had the necessary things in place for IR-level passes but missed the machine passes. llvm-svn: 160137	2012-07-12 18:14:57 +00:00
Benjamin Kramer	0ab2794eda	Add intrinsics for Ivy Bridge's rdrand instruction. The rdrand/cmov sequence is the same that is emitted by both GCC and ICC. Fixes PR13284. llvm-svn: 160117	2012-07-12 09:31:43 +00:00
Craig Topper	f7755df776	Update GATHER instructions to support 2 read-write operands. Patch from myself and Manman Ren. llvm-svn: 160110	2012-07-12 06:52:41 +00:00
Manman Ren	88a0d3313b	ARM: fix typo in comments llvm-svn: 160093	2012-07-11 23:47:00 +00:00
Manman Ren	34cb93e192	ARM: Fix optimizeCompare to correctly check safe condition. It is safe if CPSR is killed or re-defined. When we are done with the basic block, check whether CPSR is live-out. Do not optimize away cmp if CPSR is live-out. llvm-svn: 160090	2012-07-11 22:51:44 +00:00
Jack Carter	570ae0b1f7	Patch for Mips direct object generation. When WriteFragmentData() case FT_align called Asm.getBackend().writeNopData() is called, nothing is done since Mips implementation of writeNopData just returned "true". For some reason this has not caused problems in 32 bit mode, but in 64 bit mode it caused an assert when processing multiple function units. The test case included will assert without this patch. It runs twice with different flags to prevent false positives due to changes in code generation over time. llvm-svn: 160084	2012-07-11 22:17:39 +00:00
Jack Carter	42ebf98b04	This change removes an "initialization" warning. Even though variable in question could not be initialized before use, the code was such that the compiler had no way of knowing that. llvm-svn: 160081	2012-07-11 21:41:49 +00:00
Akira Hatanaka	bb5519154c	In register classes in MipsRegisterInfo.td, list the registers in ascending order of binary encoding. Patch by Vladimir Medic. llvm-svn: 160073	2012-07-11 20:51:50 +00:00
Chad Rosier	8446ede023	[x86 fast-isel] Per discussion with Eric, add all cases to switch with verbose comments. llvm-svn: 160069	2012-07-11 19:58:38 +00:00
Manman Ren	1553ce0e81	X86: Update to peephole optimization to move Movr0 before (Sub, Cmp) pair. When Movr0 is between sub and cmp, we move Movr0 before sub if it enables removal of Cmp. llvm-svn: 160066	2012-07-11 19:35:12 +00:00
Akira Hatanaka	24cf4e36e5	Implement MipsTargetLowering::LowerSELECT_CC to custom lower SELECT_CC. llvm-svn: 160064	2012-07-11 19:32:27 +00:00
Chad Rosier	43218c59c3	[x86 fast-isel] Rather then call llvm_unreachable() have fast-isel fall back to Selection DAG isel. Patch by Andrew Kaylor <andrew.kaylor@intel.com>. llvm-svn: 160055	2012-07-11 17:23:17 +00:00
Nadav Rotem	d2bdcebb14	When ext-loading and trunc-storing vectors to memory, on x86 32bit systems, allow loads/stores of 64bit values from xmm registers. llvm-svn: 160044	2012-07-11 13:27:05 +00:00
Akira Hatanaka	878ad8b28d	Lower RETURNADDR node in Mips backend. Patch by Sasa Stankovic. llvm-svn: 160031	2012-07-11 00:53:32 +00:00
Jack Carter	e8cb2fc616	Mips specific inline asm operand modifier 'L'. Low order register of a double word register operand. Operands are defined by the name of the variable they are marked with in the inline assembler code. This is a way to specify that the operand just refers to the low order register for that variable. It is the opposite of modifier 'D' which specifies the high order register. Example: main() { long long ll_input = 0x1111222233334444LL; long long ll_val = 3; int i_result = 0; __asm__ __volatile__( "or %0, %L1, %2" : "=r" (i_result) : "r" (ll_input), "r" (ll_val)); } Which results in: lui $2, %hi(_gp_disp) addiu $2, $2, %lo(_gp_disp) addiu $sp, $sp, -8 addu $2, $2, $25 sw $2, 0($sp) lui $2, 13107 ori $3, $2, 17476 <-- Low 32 bits of ll_input lui $2, 4369 ori $4, $2, 8738 <-- High 32 bits of ll_input addiu $5, $zero, 3 <-- Low 32 bits of ll_val addiu $2, $zero, 0 <-- High 32 bits of ll_val #APP or $3, $4, $5 <-- or i_result, high 32 ll_input, low 32 of ll_val #NO_APP addiu $sp, $sp, 8 jr $ra If not direction is done for the long long for 32 bit variables results in using the low 32 bits as ll_val shows. There is an existing bug if 'L' or 'D' is used for the destination register for 32 bit long longs in that the target value will be updated incorrectly for the non-specified part unless explicitly set within the inline asm code. llvm-svn: 160028	2012-07-10 22:41:20 +00:00
Chad Rosier	97c2214277	Move [get\|set]BasePtrStackAdjustment() from MachineFrameInfo to X86MachineFunctionInfo as this is currently only used by X86. If this ever becomes an issue on another arch (e.g., ARM) then we can hoist it back out. llvm-svn: 160009	2012-07-10 18:27:15 +00:00
Chad Rosier	bdb08ac50a	Add support for dynamic stack realignment in the presence of dynamic allocas on X86. Basically, this is a reapplication of r158087 with a few fixes. Specifically, (1) the stack pointer is restored from the base pointer before popping callee-saved registers and (2) in obscure cases (see comments in patch) we must cache the value of the original stack adjustment in the prologue and apply it in the epilogue. rdar://11496434 llvm-svn: 160002	2012-07-10 17:45:53 +00:00
Nadav Rotem	d908ddc186	Improve the loading of load-anyext vectors by allowing the codegen to load multiple scalars and insert them into a vector. Next, we shuffle the elements into the correct places, as before. Also fix a small dagcombine bug in SimplifyBinOpWithSameOpcodeHands, when the migration of bitcasts happened too late in the SelectionDAG process. llvm-svn: 159991	2012-07-10 13:25:08 +00:00
Richard Barton	1dc44dcedd	Fix instruction description of VMOV (between two ARM core registers and two single-precision resiters) (and do it properly this time! llvm-svn: 159989	2012-07-10 12:51:09 +00:00
Craig Topper	be41e2daa6	Reverse assembler/disassembler operand order for gather instructions. llvm-svn: 159983	2012-07-10 06:38:33 +00:00
Jim Grosbach	16b43dbbfe	ARM: Allow more flexible patterns in NEON formats. Some NEON instructions want to match against normal SDNodes for some operand types and Intrinsics for others. For example, CTLZ. To enable this, switch from explicitly requiring Intrinsic on the class templates to using SDPatternOperator instead. llvm-svn: 159974	2012-07-10 00:51:13 +00:00
Akira Hatanaka	efff7b763b	Make register Mips::RA allocatable if not in mips16 mode. llvm-svn: 159971	2012-07-10 00:19:06 +00:00
Chad Rosier	aeed158f75	Revert r159938 (and r159945) to appease the buildbots. llvm-svn: 159960	2012-07-09 20:43:34 +00:00
Manman Ren	5f6fa428fa	X86: implement functions to analyze & synthesize CMOV\|SET\|Jcc getCondFromSETOpc, getCondFromCMovOpc, getSETFromCond, getCMovFromCond No functional change intended. If we want to update the condition code of CMOV\|SET\|Jcc, we first analyze the opcode to get the condition code, then update the condition code, finally synthesize the new opcode form the new condition code. llvm-svn: 159955	2012-07-09 18:57:12 +00:00
Akira Hatanaka	9bf2b5677d	Reapply r158846. Access mips register classes via MCRegisterInfo's functions instead of via the TargetRegisterClasses defined in MipsGenRegisterInfo.inc. llvm-svn: 159953	2012-07-09 18:46:47 +00:00
Richard Barton	984d0ba6b6	Some formatting to keep Clang happy llvm-svn: 159948	2012-07-09 18:30:56 +00:00
Richard Barton	5beef2d242	Oops - correct broken disassembly for VMOV llvm-svn: 159945	2012-07-09 18:20:02 +00:00
Richard Barton	c9e1c94fae	Fix instruction description of VMOV (between two ARM core registers and two single-precision resiters) llvm-svn: 159938	2012-07-09 16:41:33 +00:00
Richard Barton	35aceb86fe	Prevent ARM assembler from losing a right shift by #32 applied to a register llvm-svn: 159937	2012-07-09 16:31:14 +00:00
Richard Barton	d56603722e	Spelling! llvm-svn: 159936	2012-07-09 16:14:28 +00:00
Richard Barton	a39625ecc6	Teach the assembler to use the narrow thumb encodings of various three-register dp instructions where permissable. llvm-svn: 159935	2012-07-09 16:12:24 +00:00
Andrew Trick	87255e340e	I'm introducing a new machine model to simultaneously allow simple subtarget CPU descriptions and support new features of MachineScheduler. MachineModel has three categories of data: 1) Basic properties for coarse grained instruction cost model. 2) Scheduler Read/Write resources for simple per-opcode and operand cost model (TBD). 3) Instruction itineraties for detailed per-cycle reservation tables. These will all live side-by-side. Any subtarget can use any combination of them. Instruction itineraries will not change in the near term. In the long run, I expect them to only be relevant for in-order VLIW machines that have complex contraints and require a precise scheduling/bundling model. Once itineraries are only actively used by VLIW-ish targets, they could be replaced by something more appropriate for those targets. This tablegen backend rewrite sets things up for introducing MachineModel type #2: per opcode/operand cost model. llvm-svn: 159891	2012-07-07 04:00:00 +00:00
Manman Ren	bb36074047	X86: Fix optimizeCompare to correctly check safe condition. It is safe if EFLAGS is killed or re-defined. When we are done with the basic block, check whether EFLAGS is live-out. Do not optimize away cmp if EFLAGS is live-out. llvm-svn: 159888	2012-07-07 03:34:46 +00:00
Chad Rosier	73b02825d0	Fix the naming of ensureAlignment. Per the coding standard function names should be camel case, and start with a lower case letter. llvm-svn: 159877	2012-07-06 23:13:38 +00:00
Jim Grosbach	09487775d3	ARM: Add test cleanup entry to the README. llvm-svn: 159864	2012-07-06 21:52:04 +00:00
Akira Hatanaka	b577ff116d	revert r159851. llvm-svn: 159854	2012-07-06 20:16:48 +00:00
Akira Hatanaka	cfa35fa0ff	Reapply r158846. Include file MipsGenRegisterInfo.inc. llvm-svn: 159851	2012-07-06 19:29:11 +00:00
Manman Ren	c965673707	X86: peephole optimization to remove cmp instruction For each Cmp, we check whether there is an earlier Sub which make Cmp redundant. We handle the case where SUB operates on the same source operands as Cmp, including the case where the two source operands are swapped. llvm-svn: 159838	2012-07-06 17:36:20 +00:00
NAKAMURA Takumi	0246724cd6	Revert r159804, "[arm-fast-isel] Add support for vararg function calls." It broke LLVM :: CodeGen/Thumb2/large-call.ll on several hosts. llvm-svn: 159817	2012-07-06 11:12:44 +00:00
Jush Lu	5e6e6264f4	[arm-fast-isel] Add support for vararg function calls. llvm-svn: 159804	2012-07-06 03:02:37 +00:00
Jack Carter	2ab73b13a5	Changes per review of commit 159787 Mips specific inline asm operand modifier D. Comment changes and predicate change. llvm-svn: 159802	2012-07-06 02:44:22 +00:00
Jack Carter	b2af512cef	Mips specific inline asm operand modifier D. Print the second half of a double word operand. The include list was cleaned up a bit as well. Also the test case was modified to test for both big and little patterns. llvm-svn: 159787	2012-07-05 23:58:21 +00:00
Akira Hatanaka	7d33c78e3b	Enclose instruction rdhwr with directives, which are needed when target is mips32 rev1 (the directives are emitted when target is mips32r2 too). llvm-svn: 159770	2012-07-05 19:26:38 +00:00
Jakob Stoklund Olesen	d14101e0b9	Make X86 call and return instructions non-variadic. Function argument and return value registers aren't part of the encoding, so they should be implicit operands. llvm-svn: 159728	2012-07-04 23:53:27 +00:00
Jakob Stoklund Olesen	2dee812445	Ensure CopyToReg nodes are always glued to the call instruction. The CopyToReg nodes that set up the argument registers before a call must be glued to the call instruction. Otherwise, the scheduler may emit the physreg copies long before the call, causing long live ranges for the fixed registers. Besides disabling good register allocation, that can also expose problems when EmitInstrWithCustomInserter() splits a basic block during the live range of a physreg. llvm-svn: 159721	2012-07-04 19:28:31 +00:00
Jakob Stoklund Olesen	49e4d4b3ef	Add early if-conversion support to X86. Implement the TII hooks needed by EarlyIfConversion to create cmov instructions and estimate their latency. Early if-conversion is still not enabled by default. llvm-svn: 159695	2012-07-04 00:09:58 +00:00
Craig Topper	85c938f44f	Remove extra space. llvm-svn: 159647	2012-07-03 06:48:58 +00:00
Craig Topper	f067f9aa51	Change i128mem/i256mem to f128mem/f256mem on some floating point vector instructions. llvm-svn: 159646	2012-07-03 06:11:06 +00:00

1 2 3 4 5 ...

21878 Commits