Commit Graph

8171 Commits

Author SHA1 Message Date
Aaron Ballman e67173e718 Adding support for Microsoft's thiscall calling convention. LLVM side of the patch.
llvm-svn: 151123
2012-02-22 03:04:40 +00:00
Ahmed Charles 636a3d618c Remove dead code. Improve llvm_unreachable text. Simplify some control flow.
llvm-svn: 150918
2012-02-19 11:37:01 +00:00
Craig Topper de121a1000 Remove some unneeded includes and fix ordering in X86ISelLowering.cpp. Remove unneeded 'using namespace'.
llvm-svn: 150916
2012-02-19 07:15:48 +00:00
Craig Topper 65a4ceea1e Unify all shuffle mask checking functions to take a mask and VT instead of a VectorShuffleSDNode.
llvm-svn: 150913
2012-02-19 05:41:45 +00:00
Craig Topper 3e5c04e432 Make a bunch of X86ISelLowering shuffle functions static now that they are no longer needed by isel.
llvm-svn: 150908
2012-02-19 02:53:47 +00:00
Jia Liu e1d619691b Some comment fixes for X86 and ARM.
llvm-svn: 150902
2012-02-19 02:03:36 +00:00
Craig Topper 66a3597a4a Add vmfunc instruction to X86 assembler and disassembler.
llvm-svn: 150899
2012-02-19 01:39:49 +00:00
Jia Liu b22310fda6 Emacs-tag and comment fixes for ARM, CellSPU, Hexagon, MBlaze, MSP430, PPC, PTX, Sparc, X86, and XCore.
llvm-svn: 150878
2012-02-18 12:03:15 +00:00
Craig Topper 57d3aaed78 Add X86InstrSVM.td that I forgot to add in r150873.
llvm-svn: 150874
2012-02-18 08:34:12 +00:00
Craig Topper ed7aa46366 Add X86 assembler and disassembler support for AMD SVM instructions. Original patch by Kay Tiong Khoo. Few tweaks by me for code density and to reduce replication.
llvm-svn: 150873
2012-02-18 08:19:49 +00:00
Craig Topper ba172d2d59 Remove the last of the old vector_shuffle patterns from X86 isel.
llvm-svn: 150795
2012-02-17 07:02:34 +00:00
Jakob Stoklund Olesen bc6ba479b6 Remove the YMM_HI_6_15 hack.
Call clobbers are now represented with register mask operands.  The
regmask can easily represent the fact that xmm6 is call-preserved while
ymm6 isn't.  This is automatically computed by TableGen from the
CalleeSavedRegs containing xmm6.

llvm-svn: 150709
2012-02-16 17:56:06 +00:00
Jakob Stoklund Olesen 97e3115dc2 Use the same CALL instructions for Windows as for everything else.
The different calling conventions and call-preserved registers are
represented with regmask operands that are added dynamically.

llvm-svn: 150708
2012-02-16 17:56:02 +00:00
Jakob Stoklund Olesen 8a450cb2fa Enable register mask operands for x86 calls.
Call instructions no longer have a list of 43 call-clobbered registers.
Instead, they get a single register mask operand with a bit vector of
call-preserved registers.

This saves a lot of memory, 42 x 32 bytes = 1344 bytes per call
instruction, and it speeds up building call instructions because those
43 imp-def operands no longer need to be added to use-def lists. (And
removed and shifted and re-added for every explicit call operand).

Passes like LiveVariables, LiveIntervals, RAGreedy, PEI, and
BranchFolding are significantly faster because they can deal with call
clobbers in bulk.

Overall, clang -O2 is between 0% and 8% faster, uniformly distributed
depending on call density in the compiled code.  Debug builds using
clang -O0 are 0% - 3% faster.

I have verified that this patch doesn't change the assembly generated
for the LLVM nightly test suite when building with -disable-copyprop
and -disable-branch-fold.

Branch folding behaves slightly differently in a few cases because call
instructions have different hash values now.

Copy propagation flushes its data structures when it crosses a register
mask operand. This causes it to leave a few dead copies behind, on the
order of 20 instruction across the entire nightly test suite, including
SPEC. Fixing this properly would require the pass to use different data
structures.
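A minimal sketch of the direction this takes (assuming the getCallPreservedMask() hook mentioned in later commits and MachineInstrBuilder::addRegMask; TRI, TII, MBB, InsertPt, DL, CallConv, and Callee are illustrative in-scope names, not the committed code):

    // Instead of appending ~43 implicit-def operands for the call-clobbered
    // registers, attach a single register mask describing the preserved set.
    const uint32_t *Mask = TRI->getCallPreservedMask(CallConv);
    BuildMI(MBB, InsertPt, DL, TII->get(X86::CALL64pcrel32))
        .addGlobalAddress(Callee)   // the call target
        .addRegMask(Mask);          // one operand instead of a long imp-def list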

llvm-svn: 150638
2012-02-16 00:02:50 +00:00
Chad Rosier f0687634c3 Use a temporary variable, rather then a series of redundant calls.
llvm-svn: 150538
2012-02-15 00:36:26 +00:00
Pete Cooper c21ebf5c41 Stop custom lowering for x86 DEC64m from happening if the load in the lowered sequence has more than one user
llvm-svn: 150537
2012-02-15 00:33:37 +00:00
Craig Topper cfad98f745 Remove old movl vector_shuffle patterns. Not needed anymore since vector_shuffles shouldn't reach isel.
llvm-svn: 150462
2012-02-14 08:14:53 +00:00
Craig Topper 8b19d78808 Still more vector_shuffle pattern removal.
llvm-svn: 150365
2012-02-13 07:23:41 +00:00
Ahmed Charles 32e983e4fc Fix various issues (or do cleanups) found by enabling certain MSVC warnings.
- Use unsigned literals when the desired result is unsigned. This mostly allows unsigned/signed mismatch warnings to be less noisy even if they aren't on by default.
- Remove misplaced llvm_unreachable.
- Add static to a declaration of a function on MSVC x86 only.
- Change some instances of calling a static function through a variable to simply calling that function while removing the unused variable.

llvm-svn: 150364
2012-02-13 06:30:56 +00:00
Craig Topper 74650add0e Remove more vector_shuffle patterns for unpack. These should be target specific nodes when they get to isel.
llvm-svn: 150363
2012-02-13 05:48:49 +00:00
Craig Topper 6d471c9e49 Recommit r150328. Previous test failures should be fixed by r150360.
llvm-svn: 150362
2012-02-13 05:10:10 +00:00
Craig Topper 87119fa37f Update CanXFormVExtractWithShuffleIntoLoad to ensure bitcasts of loads only have one use. Matches DAGCombiner and prevents vector_shuffles from reaching isel.
llvm-svn: 150360
2012-02-13 04:30:38 +00:00
NAKAMURA Takumi 0826c17d00 Revert r150328, "Remove more vector_shuffle patterns."
It caused 3 failures on pre-Penryn and non-x86 (generic) hosts.

llvm-svn: 150357
2012-02-13 00:10:15 +00:00
Pete Cooper 71be57bb32 Fixed bug when custom lowering DEC64m on x86.
If the DEC node had more than one user, it was doing this lowering but
leaving the original DEC node around and so decrementing twice.

Fixes PR11964.

llvm-svn: 150356
2012-02-13 00:10:03 +00:00
Craig Topper e24c94af81 Remove more vector_shuffle patterns.
llvm-svn: 150328
2012-02-12 08:14:35 +00:00
Craig Topper d40d9eb2b3 Remove more vector_shuffle patterns.
llvm-svn: 150321
2012-02-12 01:07:34 +00:00
Craig Topper 330ca97700 Remove more vector_shuffle patterns.
llvm-svn: 150314
2012-02-11 23:31:01 +00:00
Anton Korobeynikov c6b4017ce2 Add support for implicit TLS model used with MS VC runtime.
Patch by Kai Nacke!

llvm-svn: 150307
2012-02-11 17:26:53 +00:00
Benjamin Kramer 915e3d9568 Don't mix declarations and code.
llvm-svn: 150305
2012-02-11 16:01:02 +00:00
Benjamin Kramer 428704eb52 Make the EDis tables const.
llvm-svn: 150304
2012-02-11 14:51:07 +00:00
Benjamin Kramer 478e8de8ef Reuse the enum names from X86Desc in the X86Disassembler.
This requires some gymnastics to make it available for C code. Remove the names
from the disassembler tables, making them relocation free.

llvm-svn: 150303
2012-02-11 14:50:54 +00:00
Craig Topper 981c6cf7b3 Remove some patterns for matching vector_shuffle instructions since vector_shuffles should be custom lowered before isel.
llvm-svn: 150299
2012-02-11 07:43:35 +00:00
Craig Topper 11826a6e10 Fix shuffle lowering code to stop creating temporary DAG nodes to do shuffle mask checks on. This seemed to be confusing things such that vector_shuffle ops got through to instruction selection. This is another step towards removing the vector_shuffle handling patterns from isel.
llvm-svn: 150296
2012-02-11 06:24:48 +00:00
Craig Topper a0cd970b81 More tweaks to get the size of the X86 disassembler tables down.
llvm-svn: 150167
2012-02-09 08:58:07 +00:00
Craig Topper 487e744f66 Flatten some of the arrays in the X86 disassembler tables to reduce space needed to store pointers on 64-bit hosts and reduce relocations needed at startup. Part of PR11953.
llvm-svn: 150161
2012-02-09 07:45:30 +00:00
Jakob Stoklund Olesen 4519fd0b21 Handle register masks when searching for EFLAGS clobbers.
Calls clobber the flags, but when using register masks there is no
EFLAGS<imp-def> operand.
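A minimal sketch of the kind of query involved (assuming MachineOperand's isRegMask()/clobbersPhysReg() helpers; the surrounding variable names are illustrative):

    // With register masks, a call clobbers EFLAGS only implicitly: there is
    // no EFLAGS<imp-def> operand, so the mask itself has to be consulted.
    bool ClobbersEFLAGS = false;
    for (unsigned i = 0, e = MI->getNumOperands(); i != e; ++i) {
      const MachineOperand &MO = MI->getOperand(i);
      if (MO.isRegMask() && MO.clobbersPhysReg(X86::EFLAGS)) {
        ClobbersEFLAGS = true;
        break;
      }
    }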

llvm-svn: 150117
2012-02-09 00:17:22 +00:00
Elena Demikhovsky 1adc1d53dd Fixed a bug in printing "cmp" pseudo ops.
> This IR code
> %res = call <8 x float> @llvm.x86.avx.cmp.ps.256(<8 x float> %a0, <8 x float> %a1, i8 14)
> fails with assertion:
>
> llc: X86ATTInstPrinter.cpp:62: void llvm::X86ATTInstPrinter::printSSECC(const llvm::MCInst*, unsigned int, llvm::raw_ostream&): Assertion `0 && "Invalid ssecc argument!"' failed.
> 0  llc             0x0000000001355803
> 1  llc             0x0000000001355dc9
> 2  libpthread.so.0 0x00007f79a30575d0
> 3  libc.so.6       0x00007f79a23a1945 gsignal + 53
> 4  libc.so.6       0x00007f79a23a2f21 abort + 385
> 5  libc.so.6       0x00007f79a239a810 __assert_fail + 240
> 6  llc             0x00000000011858d5 llvm::X86ATTInstPrinter::printSSECC(llvm::MCInst const*, unsigned int, llvm::raw_ostream&) + 119

I added the full testing for all possible pseudo-ops of cmp.
I extended X86AsmPrinter.cpp and X86IntelInstPrinter.cpp.

You'll also see line alignment changes (unrelated to this fix) in X86ISelLowering.cpp from my previous check-in.

llvm-svn: 150068
2012-02-08 08:37:26 +00:00
Craig Topper 172b9243cd Remove a couple unneeded intrinsic patterns
llvm-svn: 150067
2012-02-08 08:29:30 +00:00
Craig Topper 5405571fe0 Remove GCC builtins for vpermilp* intrinsics as clang no longer needs them. Custom lower the intrinsics to the vpermilp target specific node and remove intrinsic patterns.
llvm-svn: 150060
2012-02-08 06:36:57 +00:00
Evan Cheng 1b81fddd65 Use LEA to adjust stack ptr for Atom. Patch by Andy Zhang.
llvm-svn: 150008
2012-02-07 22:50:41 +00:00
Craig Topper b27fd77c3f Add instruction selection for 256-bit VPSHUFD and 128-bit VPERMILPS/VPERMILPD.
llvm-svn: 149968
2012-02-07 06:28:42 +00:00
Derek Schuff 8b2dcad4b5 Enable streaming of bitcode
This CL delays reading of function bodies from initial parse until
materialization, allowing overlap of compilation with bitcode download.

llvm-svn: 149918
2012-02-06 22:30:29 +00:00
Chris Lattner 8213c8af29 Remove some dead code and tidy things up now that vectors use ConstantDataVector
instead of always using ConstantVector.

llvm-svn: 149912
2012-02-06 21:56:39 +00:00
Benjamin Kramer 2496717052 X86: Don't call malloc for 4 bits. No functionality change.
llvm-svn: 149866
2012-02-06 12:06:18 +00:00
Craig Topper 1f71057747 Add shuffle decoding support for 256-bit pshufd. Merge vpermilp* and pshufd decoding.
llvm-svn: 149859
2012-02-06 07:17:51 +00:00
Duncan Sands ae22c60f90 Persuade GCC that there is nothing worth warning about here (there isn't).
llvm-svn: 149834
2012-02-05 14:20:11 +00:00
Chandler Carruth ebd90c58e6 Begin fleshing out more convenience predicates in llvm::Triple and
convert at least one client over to use them. Subsequent patches both to
LLVM and Clang will try to convert more people over to a common set of
predicates.

This round of predicates is focused on OS-categorization predicates.
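A small sketch of the style of client code this enables (isOSDarwin() is used here as a representative OS-categorization predicate; treat the exact predicate name as an assumption):

    #include "llvm/ADT/Triple.h"
    using namespace llvm;

    static bool targetsDarwin(StringRef TT) {
      Triple T(TT);
      // One predicate call instead of comparing T.getOS() against
      // Darwin/MacOSX/IOS enum values in every client.
      return T.isOSDarwin();
    }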

llvm-svn: 149815
2012-02-05 08:26:40 +00:00
Craig Topper c4965bce14 Convert assert(0) to llvm_unreachable
llvm-svn: 149814
2012-02-05 07:21:30 +00:00
Craig Topper 4ed7278ff4 Convert assert(0) to llvm_unreachable in X86 Target directory.
llvm-svn: 149809
2012-02-05 05:38:58 +00:00
Craig Topper 83f3bdaa45 Convert some assert(0) in default of switch statements to llvm_unreachable.
llvm-svn: 149808
2012-02-05 03:43:23 +00:00
Craig Topper 1d471e31ba Add target specific node for PMULUDQ. Change patterns to use it and custom lower intrinsics to it. Use it instead of intrinsic to handle 64-bit vector multiplies.
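A scalar model of why PMULUDQ (a 32x32->64-bit multiply of the low halves of each 64-bit lane) is enough to build a full 64-bit vector multiply; this illustrates only the arithmetic, not the lowering code itself:

    #include <cstdint>
    #include <cstdio>

    // Per 64-bit lane: a*b mod 2^64 from three 32x32->64 partial products.
    // The ahi*bhi term would be shifted by 64 and therefore vanishes mod 2^64.
    uint64_t mul64_via_32bit_products(uint64_t a, uint64_t b) {
      uint64_t alo = a & 0xFFFFFFFFu, ahi = a >> 32;
      uint64_t blo = b & 0xFFFFFFFFu, bhi = b >> 32;
      return alo * blo + ((alo * bhi + ahi * blo) << 32);
    }

    int main() {
      uint64_t a = 0x123456789ULL, b = 0xABCDEF12ULL;
      std::printf("%d\n", mul64_via_32bit_products(a, b) == a * b); // prints 1
    }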
llvm-svn: 149807
2012-02-05 03:14:49 +00:00
Craig Topper 4daa67483d Remove most of the intrinsics for XOP VPCMOV instruction. They all aliased to the same instruction with different types. This would be better accomplished with casts in the not yet created xopintrin.h header file.
llvm-svn: 149795
2012-02-05 00:55:56 +00:00
Andrew Trick f8ea108c05 TargetPassConfig: confine the MC configuration to TargetMachine.
Passes prior to instruction selection are now split into separate configurable stages.
Header dependencies are simplified.
The bulk of this diff is simply removal of the silly DisableVerify flags.

Sorry for the target header churn. Attempting to stabilize them.

llvm-svn: 149754
2012-02-04 02:56:59 +00:00
Craig Topper 47e6d26911 Remove the getShuffleVPERMILPImmediate function; getShuffleSHUFImmediate performs the same calculation.
llvm-svn: 149683
2012-02-03 06:52:33 +00:00
Craig Topper d5ffe0900d Remove unnecessary qualification on 256-bit vector handling in LowerBUILD_VECTOR. Condition was already guaranteed by earlier code.
llvm-svn: 149680
2012-02-03 06:32:21 +00:00
Andrew Trick ccb673659a Added TargetPassConfig. The first little step toward configuring codegen passes.
Allows command line overrides to be centralized in LLVMTargetMachine.cpp.
LLVMTargetMachine can intercept common passes and give precedence to command line overrides.
Allows adding "internal" target configuration options without touching TargetOptions.
Encapsulates the PassManager.
Provides a good point to initialize all CodeGen passes so that Pass ID's can be used in APIs.
Allows modifying the target configuration hooks without rebuilding the world.

llvm-svn: 149672
2012-02-03 05:12:41 +00:00
Andrew Trick 808a7a6ce6 whitespace
llvm-svn: 149671
2012-02-03 05:12:30 +00:00
Lang Hames bb682450f9 Incorporate Chad, Jakob and Evan's suggestions on r149957.
llvm-svn: 149655
2012-02-03 01:13:49 +00:00
Jakob Stoklund Olesen 5e1ac45b93 Require non-NULL register masks.
It doesn't seem worthwhile to give meaning to a NULL register mask
pointer. It complicates all the code using register mask operands.

llvm-svn: 149646
2012-02-02 23:52:57 +00:00
Elena Demikhovsky 6fbb4d2842 Minor change in the signature of getZeroVector().
llvm-svn: 149601
2012-02-02 09:20:18 +00:00
Elena Demikhovsky fb44980b41 Optimization for SIGN_EXTEND operation on AVX.
Special handling was added for v4i32 -> v4i64 and v8i16 -> v8i32
extensions.

llvm-svn: 149600
2012-02-02 09:10:43 +00:00
Francois Pichet 26f302d568 Unbreak the MSVC build.
llvm-svn: 149599
2012-02-02 08:36:09 +00:00
Lang Hames 0269caafa6 Set EFLAGS correctly in EmitLoweredSelect on X86.
llvm-svn: 149597
2012-02-02 07:48:37 +00:00
Andrew Trick 8523b16ff5 Instruction scheduling itinerary for Intel Atom.
Adds an instruction itinerary to all x86 instructions, giving each a default latency of 1, using the InstrItinClass IIC_DEFAULT.

Sets specific latencies for Atom for the instructions in files X86InstrCMovSetCC.td, X86InstrArithmetic.td, X86InstrControl.td, and X86InstrShiftRotate.td. The Atom latencies for the remainder of the x86 instructions will be set in subsequent patches.

Adds a test to verify that the scheduler is working.

Also changes the scheduling preference to "Hybrid" for i386 Atom, while leaving x86_64 as ILP.

Patch by Preston Gurd!

llvm-svn: 149558
2012-02-01 23:20:51 +00:00
Mon P Wang 9f05206659 Avoid creating an extract element to an illegal type after LegalizeTypes has run.
llvm-svn: 149548
2012-02-01 22:15:20 +00:00
Chad Rosier e273cb08c4 Tidy up.
llvm-svn: 149521
2012-02-01 18:45:51 +00:00
Elena Demikhovsky 824eed70a6 Passing AVX 256-bit structures in Win64 was wrong.
Fixed Win64 calling conventions.

llvm-svn: 149494
2012-02-01 10:46:14 +00:00
Elena Demikhovsky 34cca175ab Shortened code in shuffle masks
llvm-svn: 149493
2012-02-01 10:33:05 +00:00
Elena Demikhovsky 0e48c70ba7 Optimization for "truncate" operation on AVX.
Truncating v4i64 -> v4i32 and v8i32 -> v8i16 may be done with a set of shuffles.

llvm-svn: 149485
2012-02-01 07:56:44 +00:00
Craig Topper 9cdb8bdf04 Don't create VBROADCAST nodes if any nodes use the chain result from the load. Fixes PR11900.
llvm-svn: 149478
2012-02-01 06:51:58 +00:00
Devang Patel a173ee56fd Add an assembler dialect attribute to the asm parser, which lets a target-specific asm parser change dialect on the fly.
llvm-svn: 149396
2012-01-31 18:14:05 +00:00
Craig Topper b85e40f738 Remove pcmpgt/pcmpeq intrinsics as clang is not using them.
llvm-svn: 149367
2012-01-31 06:52:44 +00:00
Evan Cheng 4e7992eeba PR11834: Use macros which are defined on Windows. Patch by Marina Yatsina.
llvm-svn: 149294
2012-01-30 23:10:32 +00:00
Devang Patel 7cdb2ff6b5 Intel syntax. Adjust the special code used to recognize cmp<comparison code>{ss,sd,ps,pd} for Intel syntax.
llvm-svn: 149291
2012-01-30 22:47:12 +00:00
Devang Patel 9a9bb5c5db Intel syntax. Support .intel_syntax directive.
llvm-svn: 149270
2012-01-30 20:02:42 +00:00
Benjamin Kramer 396c590818 Fix refacto.
llvm-svn: 149269
2012-01-30 20:01:35 +00:00
Douglas Gregor e577cfe172 Eliminate narrowing conversion in initializer list, to make C++11 happy
llvm-svn: 149254
2012-01-30 16:57:18 +00:00
Benjamin Kramer 20af25f47b X86: Simplify shuffle mask generation code.
llvm-svn: 149248
2012-01-30 15:16:21 +00:00
Craig Topper 516cba3380 Fix the pattern for the memory form of PSHUFD used with FP vectors to remove a bitcast to an integer vector that normal code wouldn't have. Also remove bitcasts from the code that turns splat vector loads into a shuffle, as they were making the broken pattern necessary.
llvm-svn: 149232
2012-01-30 07:50:31 +00:00
Craig Topper ca29bcfc10 Move some XOP patterns into the instruction definition. Replace VPCMOV intrinsic patterns with custom lowering to a target-specific node.
llvm-svn: 149216
2012-01-30 01:10:15 +00:00
Devang Patel 63fe5697f4 Intel Syntax: Parse mem operand with seg reg. QWORD PTR FS:[320]
llvm-svn: 149142
2012-01-27 19:48:28 +00:00
Craig Topper 5639e9e8fb Move some patterns back near their instructions and use AddedComplexity to fix priority. Merge some patterns into their instruction definition.
llvm-svn: 149122
2012-01-27 07:09:40 +00:00
Jim Grosbach 8f28dbdde5 Keep source location information for X86 MCFixup's.
llvm-svn: 149106
2012-01-27 00:51:27 +00:00
Jakob Stoklund Olesen fc9dce25f7 Handle call-clobbered ymm registers on Win64.
The Win64 calling convention has xmm6-15 as callee-saved while still
clobbering all ymm registers.

Add a YMM_HI_6_15 pseudo-register that aliases the clobbered part of the
ymm registers, and mark that as call-clobbered.  This allows live xmm
registers across calls.

This hack wouldn't be necessary with RegisterMask operands representing
the call clobbers, but they are not quite operational yet.

llvm-svn: 149088
2012-01-26 22:59:28 +00:00
Victor Umansky 5f29b0e57b Fix for the following bug in AVX codegen for double-to-int conversions:
- "fptosi" and "fptoui" IR instructions are defined with round-to-zero rounding mode.
- Currently, in AVX mode, for <4 x double> and <8 x double> the "VCVTPD2DQ.128" and "VCVTPD2DQ.256" instructions are selected (for the 'fp_to_sint' DAG node operation) by AVX codegen. However, they use round-to-nearest-even rounding mode.
- Consequently, the conversion produces incorrect numbers.

The fix is to replace selection of VCVTPD2DQ instructions with VCVTTPD2DQ instructions. The latter use truncate (i.e. round-to-zero) rounding mode.
As the 'fp_to_sint' DAG node operation is used only for lowering of "fptosi" and "fptoui" IR instructions, the fix in the X86InstrSSE.td definition file doesn't have an impact on other LLVM flows.
 
The patch includes changes in the .td file, a LIT test for the changes, and a fix in a legacy LIT test (which produced asm code conflicting with the LLVM IR spec).
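A plain C++ illustration of the rounding-mode difference the fix addresses (assumes the default round-to-nearest-even FP environment):

    #include <cmath>
    #include <cstdio>

    int main() {
      double x = 2.7;
      int toward_zero = static_cast<int>(x);                 // 2: truncation, what fptosi requires
      int to_nearest  = static_cast<int>(std::nearbyint(x)); // 3: round-to-nearest-even, what VCVTPD2DQ does
      std::printf("%d %d\n", toward_zero, to_nearest);       // prints "2 3"
      return 0;
    }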

llvm-svn: 149056
2012-01-26 08:51:39 +00:00
Craig Topper 86e44bc829 Add HasXOP predicate check covering a bunch of XOP intrinsic patterns.
llvm-svn: 149054
2012-01-26 07:51:55 +00:00
Craig Topper 1c0e22f57a Fix AVX vs SSE patterns ordering issue for VPCMPESTRM and VPCMPISTRM.
llvm-svn: 149053
2012-01-26 07:31:30 +00:00
Craig Topper b91760eff8 Remove some more patterns by custom lowering intrinsics to target specific nodes.
llvm-svn: 149052
2012-01-26 07:18:03 +00:00
Chris Lattner 33633a90a0 fix a bug I introduced in r148929, this is not a splat!
Thanks to Eli for noticing.

llvm-svn: 148947
2012-01-25 09:56:22 +00:00
Craig Topper 7834900950 Custom lower PSIGN and PSHUFB intrinsics to their corresponding target specific nodes so we can remove the isel patterns.
llvm-svn: 148933
2012-01-25 06:43:11 +00:00
Chris Lattner 47a86bdbe2 use ConstantVector::getSplat in a few places.
llvm-svn: 148929
2012-01-25 06:02:56 +00:00
Craig Topper ce4f9c5668 Custom lower phadd and phsub intrinsics to target specific nodes. Remove the patterns that are no longer necessary.
llvm-svn: 148927
2012-01-25 05:37:32 +00:00
Craig Topper 5bcf070e68 Remove AVX 256-bit unaligned load intrinsics. 128-bit versions had been removed a while ago.
llvm-svn: 148922
2012-01-25 04:42:03 +00:00
Craig Topper 3ad5bc019a Merge the intrinsic-pattern and no-pattern versions of the VCVTSD2SI instruction definitions. Matches the non-AVX version of the same instructions.
llvm-svn: 148914
2012-01-25 03:52:09 +00:00
Devang Patel a410ed3ced Intel syntax: Extend the special hand-coded logic to recognize special instructions for Intel syntax.
llvm-svn: 148864
2012-01-24 21:43:36 +00:00
Elena Demikhovsky 0b0c5d8c4c ZERO_EXTEND operation is optimized for AVX.
v8i16 -> v8i32 and v4i32 -> v4i64, using vpunpck* instructions.

llvm-svn: 148803
2012-01-24 13:54:13 +00:00
Craig Topper 0d8e67aebd Add comments near load pattern fragments indicating that all integer vector loads are promoted to v2i64 or v4i64 so that no one tries to reintroduce pattern fragments for other types.
llvm-svn: 148771
2012-01-24 03:03:17 +00:00
Devang Patel eba7d3dba9 Fix typo.
llvm-svn: 148751
2012-01-23 23:56:33 +00:00
Devang Patel cf893a437e Intel syntax: Robustify parsing of a memory operand's displacement expression.
llvm-svn: 148737
2012-01-23 22:35:25 +00:00
Devang Patel e660fdd953 Intel syntax: Parse memory operand with empty base reg, e.g. DWORD PTR [4*RDI]
llvm-svn: 148721
2012-01-23 20:20:06 +00:00
Devang Patel 880bc1644b Intel syntax: Parse segment registers.
llvm-svn: 148712
2012-01-23 18:31:58 +00:00
Craig Topper edd1d0acfc Custom lower PCMPEQ/PCMPGT intrinsics to target specific nodes and remove the intrinsic patterns.
llvm-svn: 148687
2012-01-23 08:18:28 +00:00
Craig Topper 6b90c5d03e Update more places to use target specific nodes for vector shifts instead of intrinsics.
llvm-svn: 148685
2012-01-23 06:46:22 +00:00
Craig Topper 5e80db4e4f Custom lower vector shift intrinsics to target specific nodes and remove the patterns that are no longer needed.
llvm-svn: 148684
2012-01-23 06:16:53 +00:00
Craig Topper 20c98df340 Remove pattern fragments for v32i8, v16i16, v8i32, v16i8, v8i16, and v4i32 loads. All integer vector loads are promoted to v2i64 or v4i64 so these pattern fragments can never match. Fix or remove patterns that used these fragments.
llvm-svn: 148672
2012-01-23 00:06:44 +00:00
Craig Topper 0b7ad76bd0 Combine X86 CMPPD and CMPPS node types. Simplifies selection code and pattern matching.
llvm-svn: 148670
2012-01-22 23:36:02 +00:00
Craig Topper bd4884371b Merge PCMPEQB/PCMPEQW/PCMPEQD/PCMPEQQ and PCMPGTB/PCMPGTW/PCMPGTD/PCMPGTQ X86 ISD node types into only two node types, simplifying opcode selection and pattern matching.
llvm-svn: 148667
2012-01-22 22:42:16 +00:00
Craig Topper 094626414d Add target specific ISD node types for SSE/AVX vector shuffle instructions and change all the code that used to create intrinsic nodes to create the new nodes instead.
llvm-svn: 148664
2012-01-22 19:15:14 +00:00
Craig Topper a4ed5246d8 Make code a little less verbose.
llvm-svn: 148651
2012-01-22 03:07:48 +00:00
Craig Topper cb3433cd58 Remove unused X86 ISD node type defines.
llvm-svn: 148644
2012-01-22 01:15:56 +00:00
Craig Topper 123adfa0f3 Move some vector shift patterns into their instruction definitions.
llvm-svn: 148643
2012-01-22 00:41:20 +00:00
Craig Topper dcaa5fbd08 Add memory patterns for some of the fp<->integer conversion instructions. Fold some patterns into instruction definitions.
llvm-svn: 148641
2012-01-21 18:37:15 +00:00
Benjamin Kramer 5cff13a3fb Remove unused variables.
llvm-svn: 148635
2012-01-21 10:42:44 +00:00
Craig Topper 39bc1e4d25 Fix PR11819 introduced by r148537. I'd commit the test case, but the generated code is terrible as it gets fully scalarized. Expect a future commit to fix that.
llvm-svn: 148632
2012-01-21 08:49:33 +00:00
Devang Patel ce6a2ca8c8 Intel syntax: Robustify register parsing.
llvm-svn: 148591
2012-01-20 22:32:05 +00:00
David Blaikie 46a9f016c5 More dead code removal (using -Wunreachable-code)
llvm-svn: 148578
2012-01-20 21:51:11 +00:00
Devang Patel d0930fff85 Intel syntax: Parse ... PTR [-8]
llvm-svn: 148570
2012-01-20 21:21:01 +00:00
Devang Patel f36613cb45 Intel syntax: For now, disable ambiguous JMP64pcrel32 for intel syntax.
llvm-svn: 148569
2012-01-20 21:14:06 +00:00
Craig Topper a409479023 Improve 256-bit shuffle splitting to allow 2 sources in each 128-bit lane, as long as only a single lane of the source is used in the destination lane. This makes the splitting match much more closely what happens with 256-bit shuffles when AVX is disabled and only 128-bit XMM is allowed.
llvm-svn: 148537
2012-01-20 09:29:03 +00:00
Craig Topper 3469212c82 Add support for selecting 256-bit PALIGNR.
llvm-svn: 148532
2012-01-20 05:53:00 +00:00
Eli Friedman 32c7c25dcb Support MSVC x86-32 sret convention. PR11688. Patch by Joe Groff.
llvm-svn: 148513
2012-01-20 00:05:46 +00:00
Devang Patel f83dcfd052 Post-process 'and' and 'sub' instructions and select a better encoding, if available.
llvm-svn: 148489
2012-01-19 18:40:55 +00:00
Devang Patel 2529dd9e00 Intel syntax: There is no need to create unary expr for simple negative displacement.
llvm-svn: 148486
2012-01-19 18:15:51 +00:00
Devang Patel 4a62ff9bcb Post-process 'xor', 'or' and 'cmp' instructions and select a better encoding, if available.
llvm-svn: 148485
2012-01-19 17:53:25 +00:00
Craig Topper a875b7ccc7 Folding table additions and fixes for AVX.
llvm-svn: 148467
2012-01-19 08:50:38 +00:00
Craig Topper 80576e8d1f Merge 128-bit and 256-bit SHUFPS/SHUFPD handling.
llvm-svn: 148466
2012-01-19 08:19:12 +00:00
Nick Lewycky ecc0084f72 Add a TargetOption for disabling tail calls.
llvm-svn: 148442
2012-01-19 00:34:10 +00:00
Jakob Stoklund Olesen ff482f733b Add experimental -x86-use-regmask command line option.
It adds register mask operands to x86 call instructions.  Once all the
backend passes support register mask operands, this will be permanently
enabled.

llvm-svn: 148438
2012-01-18 23:52:22 +00:00
Jakob Stoklund Olesen f1fb1d2375 Ignore register mask operands when lowering instructions to MC.
This is similar to implicit register operands.  MC doesn't understand
register liveness and call clobbers.

llvm-svn: 148437
2012-01-18 23:52:19 +00:00
Devang Patel de47cced25 Process instructions after matching to select an alternative encoding, which may be more desirable.
llvm-svn: 148431
2012-01-18 22:42:29 +00:00
Jim Grosbach aba3de99c0 Tidy up. MCAsmBackend naming conventions.
llvm-svn: 148400
2012-01-18 18:52:16 +00:00
Jakob Stoklund Olesen f43b599550 Add a CoveredBySubRegs property to Register descriptions.
When set, this bit indicates that a register is completely defined by
the value of its sub-registers.

Use the CoveredBySubRegs property to infer which super-registers are
call-preserved given a list of callee-saved registers.  For example, the
ARM registers D8-D15 are callee-saved.  This now automatically implies
that Q4-Q7 are call-preserved.

Conversely, Win64 callees save XMM6-XMM15, but the corresponding
YMM6-YMM15 registers are not call-preserved because they are not fully
defined by their sub-registers.

llvm-svn: 148363
2012-01-18 00:16:39 +00:00
Jakob Stoklund Olesen d51a710bde Move X86 callee saved register lists to the X86CallConv .td file.
Add a trivial implementation of the getCallPreservedMask() hook.

llvm-svn: 148347
2012-01-17 22:47:01 +00:00
Devang Patel c9ed518792 Intel syntax: Fix parser match class to check memory operand size.
llvm-svn: 148338
2012-01-17 21:48:03 +00:00
Devang Patel a7143b6a2b Intel syntax: Parse "BYTE PTR [RDX + RCX]"
llvm-svn: 148334
2012-01-17 21:25:10 +00:00
Devang Patel 2ed6718616 Untabify.
llvm-svn: 148322
2012-01-17 19:09:22 +00:00
Devang Patel 8b39be79ad Intel syntax: Do not unnecessarily create a plus expression for the memory operand displacement.
llvm-svn: 148321
2012-01-17 19:08:07 +00:00
Devang Patel 41b9ddeb7a Intel syntax: Robustify memory operand parsing.
llvm-svn: 148312
2012-01-17 18:00:18 +00:00
Nadav Rotem 86c3807b99 Fix warning.
llvm-svn: 148301
2012-01-17 09:31:09 +00:00
Nadav Rotem 86e5390dbf Fix PR11769.
In CanXFormVExtractWithShuffleIntoLoad we assumed that EXTRACT_VECTOR_ELT can be later handled by the DAGCombiner.
However, in some cases on AVX, the EXTRACT_VECTOR_ELT is legalized to EXTRACT_SUBVECTOR + EXTRACT_VECTOR_ELT, which
currently is not handled by the DAGCombiner. In this patch I added a check that we only extract from the XMM part.

llvm-svn: 148298
2012-01-17 09:13:19 +00:00
Craig Topper 9cafcd8baa Remove unnecessary AVX check from an assert. hasSSE2 is enough.
llvm-svn: 148295
2012-01-17 08:23:44 +00:00
Craig Topper 37b10ef250 Fix a crasher when PerformShiftCombine receives a BUILD_VECTOR of all UNDEF. Probably could use better handling in DAG combine or getNode. Fixes PR11772.
llvm-svn: 148285
2012-01-17 04:44:50 +00:00
Eli Friedman 206ca569aa Make sure the non-SSE lowering for fences correctly clobbers EFLAGS. PR11768.
llvm-svn: 148240
2012-01-16 16:42:21 +00:00
Eli Friedman 75e3db4c7a Get rid of unused codegen-only instruction.
llvm-svn: 148239
2012-01-16 16:29:35 +00:00
Craig Topper db8890aedd Give priority to AVX over SSE for 128-bit floating point unpck instructions.
llvm-svn: 148233
2012-01-16 09:56:42 +00:00
Nadav Rotem 57935243bd [AVX] Optimize x86 VSELECT instructions using SimplifyDemandedBits.
We know that the blend instructions only use the MSB, so if the mask is
sign-extended then we can convert it into a SHL instruction. This is a
common pattern because the type legalizer sign-extends the i1 type that
LLVM IR uses for the condition.

Added a new optimization in SimplifyDemandedBits for SIGN_EXTEND_INREG -> SHL.
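A small scalar illustration of why the rewrite is safe: for a condition that started as i1, the sign-extended mask and the shift-left-by-31 mask agree in the sign bit, and the sign bit is all the blend instruction reads (plain C++, illustrative only):

    #include <cstdint>
    #include <cstdio>

    int main() {
      for (uint32_t c : {0u, 1u}) {                // the original i1 condition value
        uint32_t sext_mask = c ? 0xFFFFFFFFu : 0u; // sign-extended mask element
        uint32_t shl_mask  = c << 31;              // the cheaper SHL-by-31 form
        std::printf("c=%u msb(sext)=%u msb(shl)=%u\n",
                    c, sext_mask >> 31, shl_mask >> 31); // sign bits match
      }
      return 0;
    }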

llvm-svn: 148225
2012-01-15 19:27:55 +00:00
Benjamin Kramer 339ced4e34 Return an ArrayRef from ShuffleVectorSDNode::getMask and push it through CodeGen.
llvm-svn: 148218
2012-01-15 13:16:05 +00:00
Craig Topper c10e1abaf3 Fix the memop type on a couple 256-bit AVX instructions that were using f128mem instead of f256mem.
llvm-svn: 148196
2012-01-14 18:29:57 +00:00
Craig Topper d78429f850 Add a bunch of AVX instructions to the folding tables. Also fixed the alignment on 256-bit AVX2 instructions.
llvm-svn: 148194
2012-01-14 18:14:53 +00:00
Chad Rosier 71a185c5c6 Fix pasto from r146196.
llvm-svn: 148167
2012-01-14 01:50:21 +00:00