llvm-project

Commit Graph

Author	SHA1	Message	Date
Elena Demikhovsky	29792e9a80	AVX-512: Added all forms of FP compare instructions for KNL and SKX. Added intrinsics for the instructions. CC parameter of the intrinsics was changed from i8 to i32 according to the spec. By Igor Breger (igor.breger@intel.com) llvm-svn: 236714	2015-05-07 11:24:42 +00:00
Craig Topper	a3776de242	[X86] Add the remaining 11 possible exact ModRM formats. This makes their encodings linear which can then be used to simplify some other code. llvm-svn: 229279	2015-02-15 04:16:44 +00:00
Chandler Carruth	4d31f58c88	[x86] Give movss and movsd execution domains in the x86 backend. This associates movss and movsd with the packed single and packed double execution domains (resp.). While this is largely cosmetic, as we now don't have weird ping-pong-ing between single and double precision, it is also useful because it avoids the domain fixing algorithm from seeing domain breaks that don't actually exist. It will also be much more important if we have an execution domain default other than packed single, as that would cause us to mix movss and movsd with integer vector code on a regular basis, a very bad mixture. llvm-svn: 228135	2015-02-04 10:58:53 +00:00
Craig Topper	b86338f7b2	[X86] Remove the single AdSize indicator and replace it with separate AdSize16/32/64 flags. This removes a hardcoded list of instructions in the CodeEmitter. Eventually I intend to remove the predicates on the affected instructions since in any given mode two of them are valid if we supported addr32/addr16 prefixes in the assembler. llvm-svn: 224809	2014-12-24 06:05:22 +00:00
Cameron McInally	5fb084e798	[AVX512] Add support for 512b variable bit shift intrinsics. llvm-svn: 224028	2014-12-11 17:13:05 +00:00
Michael Liao	5bf9578ce4	[X86] Clean up whitespace as well as minor coding style llvm-svn: 223339	2014-12-04 05:20:33 +00:00
Cameron McInally	04400449c5	[AVX512] Add 512b masked integer shift by immediate patterns. llvm-svn: 222002	2014-11-14 15:43:00 +00:00
Adam Nemet	449b3f0931	[AVX512] Two new attributes in X86VectorVTInfo for subvector insert The new attributes are NumElts and the CD8TupleForm. This prepares the code to enable x8 and x2 inserts. NFC, no change in X86.td.expanded except for the new attributes. llvm-svn: 219871	2014-10-15 23:42:09 +00:00
Robert Khasanov	44241440e1	[AVX512] Refactoring of avx512_binop_rm multiclass through AVX512_masking. Added new argrument for AVX512_masking: InstrItinClass and bit isCommutable. No functional change. llvm-svn: 219310	2014-10-08 14:37:45 +00:00
Adam Nemet	2e91ee58fe	[AVX512] Add masking variant for the FMA instructions This change further evolves the base class AVX512_masking in order to make it suitable for the masking variants of the FMA instructions. Besides AVX512_masking there is now a new base class that instructions including FMAs can use: AVX512_masking_3src. With three-source (destructive) instructions one of the sources is already tied to the destination. This difference from AVX512_masking is captured by this new class. The common bits between _masking and _masking_3src are broken out into a new super class called AVX512_masking_common. As with valign, there is some corresponding restructuring of the underlying format classes. The idea is the same we want to derive from two classes essentially: one providing the format bits and another format-independent multiclass supplying the various masking and non-masking instruction variants. Existing fma tests in avx512-fma.ll provide coverage here for the non-masking variants. For masking, the next patches in the series will add intrinsics and intrinsic tests. For AVX512_masking_3src to work, the (ins ...) dag has to be passed without* the leading source operand that is tied to dst ($src1). This is necessary to properly construct the (ins ...) for the different variants. For the record, I did check that if $src is mistakenly included, you do get a fairly intuitive error message from the tablegen backend. Part of <rdar://problem/17688758> llvm-svn: 215660	2014-08-14 17:13:19 +00:00
Adam Nemet	2e2537f665	[AVX512] Generate masking instruction variants with tablegen After adding the masking variants to several instructions, I have decided to experiment with generating these from the non-masking/unconditional variant. This will hopefully reduce the amount repetition that we currently have in order to define an instruction with all its variants (for a reg/mem instruction this would be 6 instruction defs and 2 Pat<> for the intrinsic). The patch is the first cut that is currently only applied to valignd/q to make the patch small. A few notes on the approach: * In order to stitch together the dag for both the conditional and the unconditional patterns I pass the RHS of the set rather than the full pattern (set dest, RHS). * Rather than subclassing each instruction base class (e.g. AVX512AIi8), with a masking variant which wouldn't scale, I derived the masking instructions from a new base class AVX512 (this is just I<> with Requires<HasAVX512>). The instructions derive from this now, plus a new set of classes that add the format bits and everything else that instruction base class provided (i.e. AVX512AIi8 vs. AVX512AIi8Base). I hope we can go incrementally from here. I expect that: * We will need different variants of the masking class. One example is instructions requiring three vector sources. In this case we tie one of the source operands to dest rather than a new implicit source operand ($src0) * Add the zero-masking variant * Add more AVX512*Base classes as new uses are added I've looked at X86.td.expanded before and after to make sure that nothing got lost for valignd/q. llvm-svn: 215125	2014-08-07 17:53:55 +00:00
Kevin Enderby	0d928a142b	Add support for the X86 secure guard extensions instructions in assembler (SGX). This allows assembling the two new instructions, encls and enclu for the SKX processor model. Note the diffs are a bigger than what might think, but to fit the new MRM_CF and MRM_D7 in things in the right places things had to be renumbered and shuffled down causing a bit more diffs. rdar://16228228 llvm-svn: 214460	2014-07-31 23:57:38 +00:00
Robert Khasanov	bfa0131365	[SKX] Enabling SKX target and AVX512BW, AVX512DQ, AVX512VL features. Enabling HasAVX512{DQ,BW,VL} predicates. Adding VK2, VK4, VK32, VK64 masked register classes. Adding new types (v64i8, v32i16) to VR512. Extending calling conventions for new types (v64i8, v32i16) Patch by Zinovy Nis <zinovy.y.nis@intel.com> Reviewed by Elena Demikhovsky <elena.demikhovsky@intel.com> llvm-svn: 213545	2014-07-21 14:54:21 +00:00
Adam Nemet	4c339abab3	[X86] AVX512: Rename EVEX_CD8V to CD8_Form This is to match the naming of CD8_EltSize, CD8_Scale, etc. No functional change. llvm-svn: 213280	2014-07-17 17:04:52 +00:00
Adam Nemet	54adb0fcbc	[X86] AVX512: Use the TD version of CD8_Scale in the assembler Passes the computed scaling factor in TSFlags rather than the old attributes. Also removes the C++ version of computing the scaling factor (MemObjSize) along with the asserts added by the previous patch. No functional change. llvm-svn: 213279	2014-07-17 17:04:50 +00:00
Adam Nemet	4dc92b9a84	[X86] AVX512: Move compressed displacement logic to TD This does not actually move the logic yet but reimplements it in the Tablegen language. Then asserts that the new implementation results in the same value. The next patch will remove the assert and the temporary use of the TSFlags and remove the C++ implementation. The formula requires a limited form of the logical left and right operators. I implemented these with the bit-extract/insert operator (i.e. blah{bits}). No functional change. llvm-svn: 213278	2014-07-17 17:04:34 +00:00
Craig Topper	c30b81ea06	[x86] Remove some unused instruction format classes. llvm-svn: 202234	2014-02-26 06:06:38 +00:00
Craig Topper	e413b628f8	[x86] Simplify disassembler code slightly. llvm-svn: 202233	2014-02-26 06:01:21 +00:00
Craig Topper	e2347df24d	[x86] Switch PAUSE instruction to use XS prefix instead of HasREPPrefix. Remove HasREPPrefix support from disassembler table generator since its now only used by CodeGenOnly instructions. llvm-svn: 201767	2014-02-20 07:59:43 +00:00
Craig Topper	56f0ed815e	Remove special FP opcode maps and instead add enough MRM_XX formats to handle all the FP operations. This increases format by 1 bit, but decreases opcode map by 1 bit so the TSFlags size doesn't change. llvm-svn: 201649	2014-02-19 08:25:02 +00:00
Craig Topper	8f540272e8	Reduce size of map field in X86 TSFlags since it now requires less bits. llvm-svn: 201646	2014-02-19 07:29:07 +00:00
Craig Topper	2fb696b214	Put some of the X86 formats in a more logical order. llvm-svn: 201645	2014-02-19 06:59:13 +00:00
Craig Topper	0d1fd55c13	Remove A6/A7 opcode maps. They can all be handled with a TB map, opcode of 0xa6/0xa7, and adding MRM_C0/MRM_E0 forms. Removes 376K from the disassembler tables. llvm-svn: 201641	2014-02-19 05:34:21 +00:00
Craig Topper	8755740de0	Add PS prefix to some classes I missed in r201538. llvm-svn: 201551	2014-02-18 08:24:22 +00:00
Craig Topper	5ccb61781f	Add an x86 prefix encoding for instructions that would decode to a different instruction with 0xf2/f3/66 were in front of them, but don't themselves have a prefix. For now this doesn't change any bbehavior, but plan to use it to fix some bugs in the disassembler. llvm-svn: 201538	2014-02-18 00:21:49 +00:00
Craig Topper	a0869dceea	Recommit r201059 and r201060 with hopefully a fix for its original failure. Original commits messages: Add MRMXr/MRMXm form to X86 for use by instructions which treat the 'reg' field of modrm byte as a don't care value. Will allow for simplification of disassembler code. Simplify a bunch of code by removing the need for the x86 disassembler table builder to know about extended opcodes. The modrm forms are sufficient to convey the information. llvm-svn: 201065	2014-02-10 06:55:41 +00:00
Bob Wilson	ebdae7c2ff	Revert r201059 and r201060. r201059 appears to cause a crash in a bootstrapped build of clang. Craig isn't available to look at it right now, so I'm reverting it while he investigates. llvm-svn: 201064	2014-02-10 05:28:30 +00:00
Craig Topper	0d88de8c56	Add MRMXr/MRMXm form to X86 for use by instructions which treat the 'reg' field of modrm byte as a don't care value. Will allow for simplification of disassembler code. llvm-svn: 201059	2014-02-10 00:50:34 +00:00
Craig Topper	fa6298a162	Merge x86 HasOpSizePrefix/HasOpSize16Prefix into a 2-bit OpSize field with 0 meaning no 0x66 prefix in any mode. Rename Opsize16->OpSize32 and OpSize->OpSize16. The classes now refer to their operand size rather than the mode in which they need a 0x66 prefix. Hopefully can merge REX_W into this as OpSize64. llvm-svn: 200626	2014-02-02 09:25:09 +00:00
Craig Topper	d402df3ce8	Merge HasVEXPrefix/HasEVEXPrefix/HasXOPPrefix into a 2-bit 'encoding' field in TSFlags. llvm-svn: 200624	2014-02-02 07:08:01 +00:00
Craig Topper	da7160d6d2	Simplify some x86 format classes and remove some ambiguities in their application. llvm-svn: 200608	2014-02-01 08:17:56 +00:00
Craig Topper	10243c8907	Separate x86 opcode maps and 0x66/0xf2/0xf3 prefixes from each other in the TSFlags. This greatly simplifies the switch statements in the disassembler tables and the code emitters. llvm-svn: 200522	2014-01-31 08:47:06 +00:00
Craig Topper	ec68866f55	Move REP out of the Prefix field of the X86 format. Give it its own bit. It had special handling anyway and this enables a future patch. llvm-svn: 200520	2014-01-31 07:00:55 +00:00
David Woodhouse	0b6c94909e	[x86] Fix signed relocations for i64i32imm operands These should end up (in ELF) as R_X86_64_32S relocs, not R_X86_64_32. Kill the horrid and incomplete special case and FIXME in EncodeInstruction() and set things up so it can infer the signedness from the ImmType just like it can the size and whether it's PC-relative. llvm-svn: 200495	2014-01-30 22:20:41 +00:00
David Woodhouse	9bbf7ca13d	]x86] Allow segment and address-size overrides for CMPS[BWLQ] (PR9385) llvm-svn: 199806	2014-01-22 15:08:36 +00:00
David Woodhouse	b33c2ef215	[x86] Allow address-size overrides for STOS[BWLQ] (PR9385) llvm-svn: 199804	2014-01-22 15:08:21 +00:00
David Woodhouse	2ef8d9c05c	[x86] Allow segment and address-size overrides for LODS[BWLQ] (PR9385) llvm-svn: 199803	2014-01-22 15:08:08 +00:00
Craig Topper	35da3d190a	Allow x86 mov instructions to/from memory with absolute address to be encoded and disassembled with a segment override prefix. Fixes PR16962. llvm-svn: 199364	2014-01-16 07:36:58 +00:00
Elena Demikhovsky	767fc967b4	AVX-512: optimized scalar compare patterns removed AVX512SI format, since it is similar to AVX512BI. llvm-svn: 199217	2014-01-14 15:10:08 +00:00
Craig Topper	ae11aed9d7	Separate the concept of 16-bit/32-bit operand size controlled by 0x66 prefix and the current mode from the concept of SSE instructions using 0x66 prefix as part of their encoding without being affected by the mode. This should allow SSE instructions to be encoded correctly in 16-bit mode which r198586 probably broke. llvm-svn: 199193	2014-01-14 07:41:20 +00:00
David Woodhouse	4e033b0e92	[x86] Fix retq/retl handling in 64-bit mode This finishes the job started in r198756, and creates separate opcodes for 64-bit vs. 32-bit versions of the rest of the RET instructions too. LRETL/LRETQ are interesting... I can't see any justification for their existence in the SDM. There should be no 'LRETL' in 64-bit mode, and no need for a REX.W prefix for LRETQ. But this is what GAS does, and my Sandybridge CPU and an Opteron 6376 concur when tested as follows: asm __volatile__("pushq $0x1234\nmovq $0x33,%rax\nsalq $32,%rax\norq $1f,%rax\npushq %rax\nlretl $8\n1:"); asm __volatile__("pushq $1234\npushq $0x33\npushq $1f\nlretq $8\n1:"); asm __volatile__("pushq $0x33\npushq $1f\nlretq\n1:"); asm __volatile__("pushq $0x1234\npushq $0x33\npushq $1f\nlretq $8\n1:"); cf. PR8592 and commit r118903, which added LRETQ. I only added LRETIQ to match it. I don't quite understand how the Intel syntax parsing for ret instructions is working, despite r154468 allegedly fixing it. Aren't the explicitly sized 'retw', 'retd' and 'retq' supposed to work? I have at least made the 'lretq' work with (and indeed require) the 'q'. llvm-svn: 199106	2014-01-13 14:05:59 +00:00
Elena Demikhovsky	b19c9dc1a1	AVX-512: Embedded Rounding Control - encoding and printing Changed intrinsics for vrcp14/vrcp28 vrsqrt14/vrsqrt28 - aligned with GCC. llvm-svn: 199102	2014-01-13 12:55:03 +00:00
Craig Topper	7c6baa7834	Remove SegOvrBits from X86 TSFlags since they weren't being used. llvm-svn: 198588	2014-01-06 06:51:58 +00:00
Craig Topper	7ceb54a2a1	Add OpSize16 bit, for instructions which need 0x66 prefix in 16-bit mode The 0x66 prefix toggles between 16-bit and 32-bit addressing mode. So in 32-bit mode it is used to switch to 16-bit addressing mode for the following instruction, while in 16-bit mode it's the other way round — it's used to switch to 32-bit mode instead. Thus, emit the 0x66 prefix byte for OpSize only in 32-bit (and 64-bit) mode, and introduce a new OpSize16 bit which is used in 16-bit mode instead. This is just the basic infrastructure for that change; a subsequent patch will add the new OpSize16 bit to the 32-bit instructions that need it. Patch from David Woodhouse. llvm-svn: 198586	2014-01-06 06:02:58 +00:00
Craig Topper	3484fc2161	Add a new x86 specific instruction flag to force some isCodeGenOnly instructions to go through to the disassembler tables without resorting to string matches. Apply flag to all _REV instructions. llvm-svn: 198543	2014-01-05 04:17:28 +00:00
Craig Topper	99f02458e5	Remove MRMInitReg form now that it's last use is gone. llvm-svn: 198257	2013-12-31 03:19:03 +00:00
Eric Christopher	c0a5aaeab0	[x86] Rename In32BitMode predicate to Not64BitMode That's what it actually means, and with 16-bit support it's going to be a little more relevant since in a few corner cases we may actually want to distinguish between 16-bit and 32-bit mode (for example the bare 'push' aliases to pushw/pushl etc.) Patch by David Woodhouse llvm-svn: 197768	2013-12-20 02:04:49 +00:00
Craig Topper	bc749db947	Add in64BitMode/in32BitMode to the MMX/SSE2/AVX maskmovq/dq instructions. This way the asm parser will pick the right one based on the mode. Instruction selection already did the right thing based on the pointer size. llvm-svn: 192266	2013-10-09 02:18:34 +00:00
Elena Demikhovsky	b30371cb6b	AVX-512: Added TB prefix to all instructions without prefixes, otherwise encoding fails after the last change in X86MCCodeEmitter.cpp. llvm-svn: 191812	2013-10-02 06:39:07 +00:00
Yunzhong Gao	b8bbcbfcc8	Adding intrinsics to the llvm backend for TBM instruction set. Phabricator code review is located here: http://llvm-reviews.chandlerc.com/D1750 llvm-svn: 191539	2013-09-27 18:38:42 +00:00
Preston Gurd	3fe264d625	Adds support for Atom Silvermont (SLM) - -march=slm Implements Instruction scheduler latencies for Silvermont, using latencies from the Intel Silvermont Optimization Guide. Auto detects SLM. Turns on post RA scheduler when generating code for SLM. llvm-svn: 190717	2013-09-13 19:23:28 +00:00
Elena Demikhovsky	3ce8dbbac2	AVX-512: Added VMOVD, VMOVQ, VMOVSS, VMOVSD instructions. llvm-svn: 188637	2013-08-18 13:08:57 +00:00
Elena Demikhovsky	003e7d73b9	Added encoding prefixes for KNL instructions (EVEX). Added 512-bit operands printing. Added instruction formats for KNL instructions. llvm-svn: 187324	2013-07-28 08:28:38 +00:00
Kevin Enderby	285da02094	Fix the move to/from accumulator register instructions that use a full 64-bit absolute address encoded in the instruction. rdar://8612627 and rdar://14299221 llvm-svn: 186878	2013-07-22 21:25:31 +00:00
Elena Demikhovsky	89703c06f2	Removed PackedDouble domain from scalar instructions. Added more formats for the scalar stuff. llvm-svn: 183626	2013-06-09 07:37:10 +00:00
Elena Demikhovsky	0dd4025ae9	removed commented lines llvm-svn: 182377	2013-05-21 13:27:44 +00:00
Elena Demikhovsky	fad029202f	Removed SSEPacked domain from all forms (AVX, SSE, signed, unsigned) scalar compare instructions, like COMISS, COMISD. No functional changes. llvm-svn: 182371	2013-05-21 12:04:22 +00:00
Michael Liao	95d9440348	Add CLAC/STAC instruction encoding/decoding support As these two instructions in AVX extension are privileged instructions for special purpose, it's only expected to be used in inlined assembly. llvm-svn: 179266	2013-04-11 04:52:28 +00:00
Nadav Rotem	ff8c45529c	Add the X86 FMAs to the scheduling model. llvm-svn: 178303	2013-03-28 22:54:45 +00:00
Jakob Stoklund Olesen	4d39e81fb8	Remove IIC_DEFAULT from X86Schedule.td All the instructions tagged with IIC_DEFAULT had nothing in common, and we already have a NoItineraries class to represent untagged instructions. llvm-svn: 177937	2013-03-25 23:12:41 +00:00
Dave Zarzycki	656e8515fc	x86 -- add the XTEST instruction llvm-svn: 177888	2013-03-25 18:59:43 +00:00
Benjamin Kramer	fee7d21ae7	X86: Make sure we account for the FMA4 register immediate value, otherwise rip-rel relocations will be off by one byte. PR15040. llvm-svn: 173176	2013-01-22 18:05:59 +00:00
Craig Topper	72beaa6733	Fix execution domain for packed FMA4 instructions. llvm-svn: 168417	2012-11-21 08:08:21 +00:00
Michael Liao	73cffddb95	Add support of RTM from TSX extension - Add RTM code generation support throught 3 X86 intrinsics: xbegin()/xend() to start/end a transaction region, and xabort() to abort a tranaction region llvm-svn: 167573	2012-11-08 07:28:54 +00:00
Michael Liao	bbd10792c2	Introduce 'UseSSEx' to force SSE legacy encoding - Add 'UseSSEx' to force SSE legacy insn not being selected when AVX is enabled. As the penalty of inter-mixing SSE and AVX instructions, we need prevent SSE legacy insn from being generated except explicitly specified through some intrinsics. For patterns supported by both SSE and AVX, so far, we force AVX insn will be tried first relying on AddedComplexity or position in td file. It's error-prone and introduces bugs accidentally. 'UseSSEx' is disabled when AVX is turned on. For SSE insns inherited by AVX, we need this predicate to force VEX encoding or SSE legacy encoding only. For insns not inherited by AVX, we still use the previous predicates, i.e. 'HasSSEx'. So far, these insns fall into the following categories: * SSE insns with MMX operands * SSE insns with GPR/MEM operands only (xFENCE, PREFETCH, CLFLUSH, CRC, and etc.) * SSE4A insns. * MMX insns. * x87 insns added by SSE. 2 test cases are modified: - test/CodeGen/X86/fast-isel-x86-64.ll AVX code generation is different from SSE one. 'vcvtsi2sdq' cannot be selected by fast-isel due to complicated pattern and fast-isel fallback to materialize it from constant pool. - test/CodeGen/X86/widen_load-1.ll AVX code generation is different from SSE one after fixing SSE/AVX inter-mixing. Exec-domain fixing prefers 'vmovapd' instead of 'vmovaps'. llvm-svn: 162919	2012-08-30 16:54:46 +00:00
Craig Topper	f881d385da	Fix patterns for CVTTPS2DQ to specify SSE2 instead of SSE1. llvm-svn: 160941	2012-07-30 02:14:02 +00:00
Craig Topper	7e9415220a	Make CVTDQ2PS instruction use SSE2 predicate instead of SSE1. No functional change because there are no patterns in the instructions. Also fix a typo in a comment. llvm-svn: 159087	2012-06-23 20:52:45 +00:00
Craig Topper	24e3418215	Move CVTPD2DQ to use SSE2 predicate instead of SSE3. Move DQ2PD and PD2DQ to the SSE2 section of the file. llvm-svn: 159086	2012-06-23 20:15:42 +00:00
Craig Topper	bf2409e8aa	Mark several instructions SSE2 instead of SSE3 as they should be. llvm-svn: 158049	2012-06-06 06:45:27 +00:00
Craig Topper	79dbb0c6e4	Rename FMA3 feature flag to just FMA to match gcc so it can be added to clang. llvm-svn: 157903	2012-06-03 18:58:46 +00:00
Benjamin Kramer	a0396e4583	X86: Rename the CLMUL target feature to PCLMUL. It was renamed in gcc/gas a while ago and causes all kinds of confusion because it was named differently in llvm and clang. llvm-svn: 157745	2012-05-31 14:34:17 +00:00
Preston Gurd	09de6ae399	Added X86 Atom latencies to X86InstrMMX.td. llvm-svn: 156615	2012-05-11 14:27:12 +00:00
Preston Gurd	fa3f6cb830	This patch continues the work of adding instruction latencies for X86 Atom, by providing the latencies for the instructions in X86InstrFPStack.td. llvm-svn: 155996	2012-05-02 16:03:35 +00:00
Craig Topper	c0cef32b83	Remove HasSSE2 from AES and CLMUL predicates. It's now implied by the HasAES and HasCLMUL predicates. llvm-svn: 155890	2012-05-01 05:35:02 +00:00
Craig Topper	66a3597a4a	Add vmfunc instruction to X86 assembler and disassembler. llvm-svn: 150899	2012-02-19 01:39:49 +00:00
Jia Liu	b22310fda6	Emacs-tag and some comment fix for all ARM, CellSPU, Hexagon, MBlaze, MSP430, PPC, PTX, Sparc, X86, XCore. llvm-svn: 150878	2012-02-18 12:03:15 +00:00
Craig Topper	ed7aa46366	Add X86 assembler and disassembler support for AMD SVM instructions. Original patch by Kay Tiong Khoo. Few tweaks by me for code density and to reduce replication. llvm-svn: 150873	2012-02-18 08:19:49 +00:00
Andrew Trick	8523b16ff5	Instruction scheduling itinerary for Intel Atom. Adds an instruction itinerary to all x86 instructions, giving each a default latency of 1, using the InstrItinClass IIC_DEFAULT. Sets specific latencies for Atom for the instructions in files X86InstrCMovSetCC.td, X86InstrArithmetic.td, X86InstrControl.td, and X86InstrShiftRotate.td. The Atom latencies for the remainder of the x86 instructions will be set in subsequent patches. Adds a test to verify that the scheduler is working. Also changes the scheduling preference to "Hybrid" for i386 Atom, while leaving x86_64 as ILP. Patch by Preston Gurd! llvm-svn: 149558	2012-02-01 23:20:51 +00:00
Craig Topper	eb8f9e9e5b	Instruction selection priority fixes to remove the XMM/XMMInt/orAVX predicates. Another commit will remove orAVX functions from X86SubTarget. llvm-svn: 147841	2012-01-10 06:30:56 +00:00
Craig Topper	744f6311d3	Don't disable MMX support when AVX is enabled. Fix predicates for MMX instructions that were added along with SSE instructions to check for AVX in addition to SSE level. llvm-svn: 147762	2012-01-09 00:11:29 +00:00
Craig Topper	b910984458	Allow CRC32 instructions to be selected when AVX is enabled. llvm-svn: 147411	2012-01-01 19:51:58 +00:00
Craig Topper	1c064e0a89	Fix sfence, lfence, mfence, and clflush to be able to be selected when AVX is enabled. Fix monitor and mwait to require SSE3 or AVX, previously they worked even if SSE3 was disabled. Make prefetch instructions not set the execution domain since they don't use XMM registers. llvm-svn: 147409	2012-01-01 19:40:22 +00:00
Craig Topper	cd93de93fa	Separate the concept of having memory access in operand 4 from the concept of having the W bit set for XOP instructons. Removes ORing W-bits in the encoder and will similarly simplify the disassembler implementation. llvm-svn: 147366	2011-12-30 04:48:54 +00:00
Craig Topper	d773607eee	Fix execution domains for PS/PD FMA3 instructions. Add SS/SD forms o FMA3 instructions. llvm-svn: 147353	2011-12-29 20:43:40 +00:00
Craig Topper	97f05c5768	Mark non-VEX forms of PCLMUL instructions as requiring SSE2 to be enabled along with CLMUL. That's required for the XMM registers to be valid for integer data. Doesn't change any behavior since the CLMUL instructions don't have patterns yet. llvm-svn: 147345	2011-12-29 18:08:36 +00:00
Craig Topper	1559123c77	Mark non-VEX forms of AES instructions as requiring SSE2 to be enabled along with AES. Since that's required for the XMM registers to be valid for integer data. Doesn't change any behavior though since you can't use an intrinsic with an illegal type anyway. Just makes it consistent with the VEX forms. llvm-svn: 147344	2011-12-29 18:00:08 +00:00
Eli Friedman	64944090ff	Make sure we correctly note the existence of an i8 immediate for vblendvps and friends, so we compute fixups correctly. PR11586. llvm-svn: 146709	2011-12-15 23:46:18 +00:00
Jan Sjödin	7c0face455	XOP instructions and encoding tests. llvm-svn: 146407	2011-12-12 19:37:49 +00:00
Jan Sjödin	6dd2488383	XOP encoding bits and logic. llvm-svn: 146397	2011-12-12 19:12:26 +00:00
Evan Cheng	557cda7f1d	Remove hasSSE1orAVX(). It's the same as hasXMM(). llvm-svn: 146246	2011-12-09 06:32:46 +00:00
Evan Cheng	4d1a2d449f	Many of the SSE patterns should not be selected when AVX is available. This led to the following code in X86Subtarget.cpp if (HasAVX) X86SSELevel = NoMMXSSE; This is so patterns that are predicated on hasSSE3, etc. would not be selected when avx is available. Instead, the AVX variant is selected. However, this breaks instructions which do not have AVX variants. The right way to fix this is for the SSE but not-AVX patterns to predicate on something like hasSSE3() && !hasAVX(). Then we can take out the hack in X86Subtarget.cpp. Patterns which do not have AVX variants do not need to change. However, we need to audit all the patterns before we make the change. This patch is workaround that fixes one specific case, the prefetch instructions. rdar://10538297 llvm-svn: 146163	2011-12-08 19:00:42 +00:00
Bruno Cardoso Lopes	0f9a1f5e6c	This patch contains support for encoding FMA4 instructions and tablegen patterns for scalar FMA4 operations and intrinsic. Also add tests for vfmaddsd. Patch by Jan Sjodin llvm-svn: 145133	2011-11-25 19:33:42 +00:00
Craig Topper	f01f1b5cb9	More AVX2 instructions and their intrinsics. llvm-svn: 143895	2011-11-06 23:04:08 +00:00
Craig Topper	05d1cb98e7	Add more AVX2 instructions and intrinsics. llvm-svn: 143861	2011-11-06 06:12:20 +00:00
Craig Topper	980d59832a	Add X86 RORX instruction llvm-svn: 142741	2011-10-23 07:34:00 +00:00
Craig Topper	96fa597828	Add X86 PEXTR and PDEP instructions. llvm-svn: 142141	2011-10-16 16:50:08 +00:00
Craig Topper	aea148c366	Add X86 BZHI instruction as well as BMI2 feature detection. llvm-svn: 142122	2011-10-16 07:55:05 +00:00
Craig Topper	f18c896337	Add support in the disassembler for ignoring the L-bit on certain VEX instructions. Mark instructions that have this behavior. Fixes PR10676. llvm-svn: 141065	2011-10-04 06:30:42 +00:00
Bruno Cardoso Lopes	123dff0f58	- Handle special scalar_to_vector case: splats. Using a native 128-bit shuffle before inserting on a 256-bit vector. - Add AVX versions of movd/movq instructions - Introduce a few COPY patterns to match insert_subvector instructions. This turns a trivial insert_subvector instruction into a register copy, coalescing the xmm into a ymm and avoid emiting on more instruction. llvm-svn: 136002	2011-07-25 23:05:25 +00:00
Eli Friedman	415412e82f	Add assembler/disassembler support for non-AVX pclmulqdq. While I'm here, use proper aliases for the pclmullqlqdq and friends. PR10269. llvm-svn: 134424	2011-07-05 18:21:20 +00:00

1 2 3 4 5

213 Commits