llvm-project

Commit Graph

Author	SHA1	Message	Date
Benjamin Kramer	93492b8765	Testing vector code without sse doesn't make much sense. Should bring arm and ppc testers back to life (they default to -mcpu=generic) llvm-svn: 149821	2012-02-05 11:19:39 +00:00
Chris Lattner	6987fdb620	Add a test for the miscompilation my recent ConstantDataArray patches introduced, to make sure we don't regress on it in the future. llvm-svn: 149803	2012-02-05 02:37:36 +00:00
Craig Topper	4daa67483d	Remove most of the intrinsics for XOP VPCMOV instruction. They all aliased to the same instruction with different types. This would be better accomplished with casts in the not yet created xopintrin.h header file. llvm-svn: 149795	2012-02-05 00:55:56 +00:00
Hal Finkel	135cac922c	Boost the effective chain depth of loads and stores. By default, boost the chain depth contribution of loads and stores. This will allow a load/store pair to vectorize even when it would not otherwise be long enough to satisfy the chain depth requirement. llvm-svn: 149761	2012-02-04 04:14:04 +00:00
Chad Rosier	6d68c7cf79	[fast-isel] HandlePHINodesInSuccessorBlocks() can promite i8 and i16 types too. llvm-svn: 149730	2012-02-04 00:39:19 +00:00
Chad Rosier	41f0e78b6c	[fast-isel] Add support for FPToUI. Also add test cases for FPToSI. llvm-svn: 149706	2012-02-03 20:27:51 +00:00
Chad Rosier	a8a8ac5d47	[fast-isel] Add support for selecting UIToFP. llvm-svn: 149704	2012-02-03 19:42:52 +00:00
Nadav Rotem	5399f4d6bf	The type-legalizer often scalarizes code. One of the common patterns is extract-and-truncate. In this patch we optimize this pattern and convert the sequence into extract op of a narrow type. This allows the BUILD_VECTOR dag optimizations to construct efficient shuffle operations in many cases. llvm-svn: 149692	2012-02-03 13:18:25 +00:00
Akira Hatanaka	f0b08445f6	Add a new MachineJumpTableInfo entry type, EK_GPRel64BlockAddress, which is needed to emit a 64-bit gp-relative relocation entry. Make changes necessary for emitting jump tables which have entries with directive .gpdword. This patch does not implement the parts needed for direct object emission or JIT. llvm-svn: 149668	2012-02-03 04:33:00 +00:00
Dan Gohman	2d54ce841c	Fix SSAUpdaterImpl's RecordMatchingPHI to record exactly the PHI nodes which were matched, rather than climbing up the original PHI node's operands to rediscover PHI nodes for recording, since the PHI nodes found that are not necessarily part of the matched set. This fixes rdar://10589171. llvm-svn: 149654	2012-02-03 01:07:01 +00:00
Jim Grosbach	0ab54184d7	Revert "Disable InstCombine unsafe folding bitcasts of calls w/ varargs." This reverts commit d0e277d272d517ca1cda368267d199f0da7cad95. llvm-svn: 149647	2012-02-03 00:00:50 +00:00
Matt Beaumont-Gay	1e8d720743	Unix line endings llvm-svn: 149615	2012-02-02 19:00:49 +00:00
NAKAMURA Takumi	9df3de538b	Move test/CodeGen/Generic/2012-02-01-CoalescerBug.ll to CodeGen/ARM, for now. It requires TARGETS=arm. I cannot reproduce a fixed issue with other targets. llvm-svn: 149604	2012-02-02 11:44:58 +00:00
Elena Demikhovsky	fb44980b41	Optimization for SIGN_EXTEND operation on AVX. Special handling was added for v4i32 -> v4i64 and v8i16 -> v8i32 extensions. llvm-svn: 149600	2012-02-02 09:10:43 +00:00
Lang Hames	0269caafa6	Set EFLAGS correctly in EmitLoweredSelect on X86. llvm-svn: 149597	2012-02-02 07:48:37 +00:00
Lang Hames	3a20bc3652	PR11868. The previous loop in LiveIntervals::join would sometimes fall over if more than two adjacent ranges needed to be merged. The new version should be able to handle an arbitrary sequence of adjancent ranges. llvm-svn: 149588	2012-02-02 05:37:34 +00:00
Andrew Trick	8523b16ff5	Instruction scheduling itinerary for Intel Atom. Adds an instruction itinerary to all x86 instructions, giving each a default latency of 1, using the InstrItinClass IIC_DEFAULT. Sets specific latencies for Atom for the instructions in files X86InstrCMovSetCC.td, X86InstrArithmetic.td, X86InstrControl.td, and X86InstrShiftRotate.td. The Atom latencies for the remainder of the x86 instructions will be set in subsequent patches. Adds a test to verify that the scheduler is working. Also changes the scheduling preference to "Hybrid" for i386 Atom, while leaving x86_64 as ILP. Patch by Preston Gurd! llvm-svn: 149558	2012-02-01 23:20:51 +00:00
Mon P Wang	9f05206659	Avoid creating an extract element to an illegal type after LegalizeTypes has run. llvm-svn: 149548	2012-02-01 22:15:20 +00:00
Andrew Trick	d06df96a7c	VLIW specific scheduler framework that utilizes deterministic finite automaton (DFA). This new scheduler plugs into the existing selection DAG scheduling framework. It is a top-down critical path scheduler that tracks register pressure and uses a DFA for pipeline modeling. Patch by Sergei Larin! llvm-svn: 149547	2012-02-01 22:13:57 +00:00
NAKAMURA Takumi	fe8186f8b0	test/CodeGen/X86/avx-minmax.ll: Relax expressions for Win32 targets. YMM arguments are passed as indirect on Win32 x64. llvm-svn: 149505	2012-02-01 14:35:29 +00:00
Elena Demikhovsky	824eed70a6	Passing AVX 256-bit structures in Win64 was wrong. Fixed Win64 calling conventions. llvm-svn: 149494	2012-02-01 10:46:14 +00:00
Elena Demikhovsky	0e48c70ba7	Optimization for "truncate" operation on AVX. Truncating v4i64 -> v4i32 and v8i32 -> v8i16 may be done with set of shuffles. llvm-svn: 149485	2012-02-01 07:56:44 +00:00
Hal Finkel	c34e51132c	Add a basic-block autovectorization pass. This is the initial checkin of the basic-block autovectorization pass along with some supporting vectorization infrastructure. Special thanks to everyone who helped review this code over the last several months (especially Tobias Grosser). llvm-svn: 149468	2012-02-01 03:51:43 +00:00
Jim Grosbach	9fa0481569	Disable InstCombine unsafe folding bitcasts of calls w/ varargs. Changing arguments from being passed as fixed to varargs is unsafe, as the ABI may require they be handled differently (stack vs. register, for example). Remove two tests which rely on the bitcast being folded into the direct call, which is exactly the transformation that's unsafe. llvm-svn: 149457	2012-02-01 00:08:17 +00:00
Kevin Enderby	e1a12cf3ee	Fixed a crash in llvm-mc for Mach-O when a symbol difference expression uses a symbol from an assignment. In this case the symbol did not have a fragment so MCObjectWriter::IsSymbolRefDifferenceFullyResolved() should not have been calling IsSymbolRefDifferenceFullyResolvedImpl() with a NULL fragment and should just have returned false in that case. llvm-svn: 149442	2012-01-31 23:02:57 +00:00
Craig Topper	b85e40f738	Remove pcmpgt/pcmpeq intrinsics as clang is not using them. llvm-svn: 149367	2012-01-31 06:52:44 +00:00
Bill Wendling	d7cd9727ee	Remove all references to the old EH. There was always the current EH. -- Ministry of Truth llvm-svn: 149335	2012-01-31 02:09:07 +00:00
Bill Wendling	f98a6f4aab	Update test to new EH model. llvm-svn: 149333	2012-01-31 02:05:13 +00:00
Bill Wendling	eb4adf3a27	Update test to new EH model. llvm-svn: 149332	2012-01-31 02:04:20 +00:00
Chandler Carruth	2c469ff14a	Chris's constant data sequence refactoring actually enabled printing vectors of all one bits to be printed more cleverly in the AsmPrinter. Unfortunately, the byte value for all one bits is the same with -fsigned-char as the error return of '-1'. Force this to be the unsigned byte value when returning it to avoid this problem, and update the test case for the shiny new behavior. Yay for building LLVM and Clang with -funsigned-char. Chris, please review, and let me know if there is any reason to not desire this change. It seems good on the surface, and certainly intended based on the code written. llvm-svn: 149299	2012-01-30 23:47:44 +00:00
Devang Patel	7cdb2ff6b5	Intel syntax. Adjust special code, used to recognize cmp<comparison code>{ss,sd,ps,pd}, for intel syntax. llvm-svn: 149291	2012-01-30 22:47:12 +00:00
Devang Patel	9a9bb5c5db	Intel syntax. Support .intel_syntax directive. llvm-svn: 149270	2012-01-30 20:02:42 +00:00
Craig Topper	516cba3380	Fix pattern for memory form of PSHUFD for use with FP vectors to remove bitcast to an integer vector that normal code wouldn't have. Also remove bitcasts from code that turns splat vector loads into a shuffle as it was making the broken pattern necessary. llvm-svn: 149232	2012-01-30 07:50:31 +00:00
NAKAMURA Takumi	185de44959	CMake: Promote the testing targets out of folders on IDE. llvm-svn: 149220	2012-01-30 03:15:47 +00:00
James Molloy	b47489d4ef	Ensure .AliasedSymbol() is called on all uses of getSymbol(). Affects ARM and MIPS ELF backends. Fixes PR11877 llvm-svn: 149180	2012-01-28 15:58:32 +00:00
Rafael Espindola	004725876e	Small improvement to the recursion detection logic from the previous commit. llvm-svn: 149175	2012-01-28 06:22:14 +00:00
Rafael Espindola	72f5f170c6	Handle recursive variable definitions directly. This gives us better error messages and allows us to fix PR11865. llvm-svn: 149174	2012-01-28 05:57:00 +00:00
Rafael Espindola	bb893fea6b	Add r149110 back with a fix for when the vector and the int have the same width. llvm-svn: 149151	2012-01-27 23:33:07 +00:00
Rafael Espindola	a4062624d1	Revert r149110 and add a testcase that was crashing since that revision. Unfortunately I also had to disable constant-pool-sharing.ll the code it tests has been updated to use the IL logic. llvm-svn: 149148	2012-01-27 22:42:48 +00:00
Devang Patel	63fe5697f4	Intel Syntax: Parse mem operand with seg reg. QWORD PTR FS:[320] llvm-svn: 149142	2012-01-27 19:48:28 +00:00
Matt Beaumont-Gay	f3a9a38db4	Unix line endings llvm-svn: 149115	2012-01-27 02:31:29 +00:00
Chris Lattner	111d6ee655	enhance constant folding to be able to constant fold bitcast of ConstantVector's to integer type. llvm-svn: 149110	2012-01-27 01:44:03 +00:00
Lang Hames	286fed5c8c	Rewrite instruction operands in AdjustCopiesBackFrom. Fixes PR11861. llvm-svn: 149097	2012-01-27 00:05:42 +00:00
Jakob Stoklund Olesen	fc9dce25f7	Handle call-clobbered ymm registers on Win64. The Win64 calling convention has xmm6-15 as callee-saved while still clobbering all ymm registers. Add a YMM_HI_6_15 pseudo-register that aliases the clobbered part of the ymm registers, and mark that as call-clobbered. This allows live xmm registers across calls. This hack wouldn't be necessary with RegisterMask operands representing the call clobbers, but they are not quite operational yet. llvm-svn: 149088	2012-01-26 22:59:28 +00:00
Chad Rosier	1a1531d65e	Replace the use of isPredicable() with isPredicated() in MachineBasicBlock::canFallThrough(). We're interested in the state of the instruction (i.e., is this a barrier or not?), not if the instruction is predicable or not. rdar://10501092 llvm-svn: 149070	2012-01-26 18:24:25 +00:00
Jakob Stoklund Olesen	8c139a5125	Clear kill flags before propagating a copy. The live range of the source register may be extended when a redundant copy is eliminated. Make sure any kill flags between the two copies are cleared. This fixes PR11765. llvm-svn: 149069	2012-01-26 17:52:15 +00:00
James Molloy	6685c08e5f	Add support for the R_ARM_TARGET1 relocation, which should be given to relocations applied to all C++ constructors and destructors. This enables the linker to match concrete relocation types (absolute or relative) with whatever library or C++ support code is being linked against. llvm-svn: 149057	2012-01-26 09:25:43 +00:00
Victor Umansky	5f29b0e57b	Fix for the following bug in AVX codegen for double-to-int conversions: . "fptosi" and "fptoui" IR instructions are defined with round-to-zero rounding mode. . Currently for AVX mode for <4xdouble> and <8xdouble> the "VCVTPD2DQ.128" and "VCVTPD2DQ.256" instructions are selected (for .fp_to_sint. DAG node operation ) by AVX codegen. However they use round-to-nearest-even rounding mode. . Consequently, the conversion produces incorrect numbers. The fix is to replace selection of VCVTPD2DQ instructions with VCVTTPD2DQ instructions. The latter use truncate (i.e. round-to-zero) rounding mode. As .fp_to_sint. DAG node operation is used only for lowering of "fptosi" and "fptoui" IR instructions, the fix in X86InstrSSE.td definition file doesn.t have an impact on other LLVM flows. The patch includes changes in the .td file, LIT test for the changes and a fix in a legacy LIT test (which produced asm code conflicting with LLVN IR spec). llvm-svn: 149056	2012-01-26 08:51:39 +00:00
Jakob Stoklund Olesen	4864a81aa3	Improve sub-register def handling in ProcessImplicitDefs. This boils down to using MachineOperand::readsReg() more. This fixes PR11829 where a use ended up after the first def when lowering REG_SEQUENCE instructions involving IMPLICIT_DEFs. llvm-svn: 148996	2012-01-25 23:36:27 +00:00
Anton Korobeynikov	7722a2d4e3	Properly emit ctors / dtors with priorities into desired sections and let linker handle the rest. This finally fixes PR5329 llvm-svn: 148990	2012-01-25 22:24:19 +00:00

1 2 3 4 5 ...

15571 Commits