llvm-project

Commit Graph

Author	SHA1	Message	Date
NAKAMURA Takumi	19675898da	X86JITInfo.cpp: Apply x64 version of X86CompilationCallback() to Cygwin64. For now, (defined(X86_64_JIT) && defined(__CYGWIN__)) satisfies Cygwin64. llvm-svn: 189437	2013-08-28 03:04:09 +00:00
NAKAMURA Takumi	9ea7c6d463	X86Subtarget.h: Recognize x86_64-cygwin. In the LLVM side, x86_64-cygwin is almost as same as x86_64-mingw32. llvm-svn: 189436	2013-08-28 03:04:02 +00:00
David Majnemer	aa34d79ab5	[ms-inline asm] Support offsets after segment registers Summary: MASM let's you do stuff like 'MOV FS:20, EAX' and 'MOV EAX, FS:20' Reviewers: craig.topper, rnk Reviewed By: rnk CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D1470 llvm-svn: 189407	2013-08-27 21:56:17 +00:00
Elena Demikhovsky	93eeb47d49	AVX-512: added conversion instructions. llvm-svn: 189349	2013-08-27 13:54:04 +00:00
Elena Demikhovsky	12f24673e0	AVX-512: Added FMA instructions. llvm-svn: 189326	2013-08-27 08:39:25 +00:00
Charles Davis	1827bd8a6c	Revert "Fix the build broken by r189315." and "Move everything depending on Object/MachOFormat.h over to Support/MachO.h." This reverts commits r189319 and r189315. r189315 broke some tests on what I believe are big-endian platforms. llvm-svn: 189321	2013-08-27 05:38:30 +00:00
Charles Davis	0c6f71b40d	Move everything depending on Object/MachOFormat.h over to Support/MachO.h. llvm-svn: 189315	2013-08-27 05:00:43 +00:00
Elena Demikhovsky	0a2b6290f1	AVX-512: Added shuffle instructions - VPSHUFD, VPERMILPS, VMOVDDUP, VMOVLHPS, VMOVHLPS, VSHUFPS, VALIGN single and double forms. llvm-svn: 189215	2013-08-26 12:45:35 +00:00
Craig Topper	6269f49505	Make sure x86 instructions using ssmem/sdmem operand types are only able to parse memory operands of the proper size in Intel syntax. Primarily affects some of sse cvt instructions. llvm-svn: 189206	2013-08-26 00:39:04 +00:00
Craig Topper	5500d83793	Remove some unnecessary PredicateMethod overrides. Add RenderMethod overrides to remove forwarding in the X86AsmParser code itself. No functional change. llvm-svn: 189205	2013-08-26 00:13:09 +00:00
Craig Topper	8c26c424c6	Put some of the AVX-512 parsing stuff in a more consistent place with the existing functions. llvm-svn: 189204	2013-08-25 23:18:05 +00:00
Craig Topper	1885417372	First round of fixes for the x86 fixes for the x86 move accumulator from/to memory offset instructions. -Assembly parser now properly check the size of the memory operation specified in intel syntax. So 'mov word ptr [5], al' is no longer accepted. -x86-32 disassembly of these instructions no longer sign extends the 32-bit address immediate based on size. -Intel syntax printing prints the ptr size and places brackets around the address immediate. Known remaining issues with these instructions: -Segment override prefix is not supported. PR16962 and PR16961. -Immediate size should be changed by address size prefix. llvm-svn: 189201	2013-08-25 22:23:38 +00:00
Elena Demikhovsky	f8f478b19d	AVX-512: added UNPACK instructions and tests for all-zero/all-ones vectors llvm-svn: 189189	2013-08-25 12:54:30 +00:00
Craig Topper	0714d51cba	Add hasSideEffects/mayLoad/mayStore flags to the X86 moffs8/moffs16/moffs32/moffs64 versions of move. llvm-svn: 189182	2013-08-24 20:31:14 +00:00
Craig Topper	092e2fe426	Remove trailing whitespace. llvm-svn: 189178	2013-08-24 19:50:11 +00:00
Rafael Espindola	94a2c5642d	Rename features to match what gcc and clang use. There is no advantage in being different and using the same names simplifies clang a bit. llvm-svn: 189141	2013-08-23 20:21:34 +00:00
Jim Cownie	b09bb1ce19	Checking commit access; added one space llvm-svn: 189111	2013-08-23 15:51:37 +00:00
Elena Demikhovsky	c35219e3ee	AVX-512: Added masked SHIFT commands, more encoding tests llvm-svn: 189005	2013-08-22 12:18:28 +00:00
Elena Demikhovsky	33d447a2d6	AVX-512: Added SHIFT instructions. llvm-svn: 188899	2013-08-21 09:36:02 +00:00
Craig Topper	77df9cdd0b	Synchronize VEX JIT encoding code with the MCJIT version. Fix a bug in the MCJIT code where CurOp was being incremented even if the operand it was pointing at wasn't used. Maybe only matters if there are any EVEX_K instructions that aren't VEX_4V. llvm-svn: 188868	2013-08-21 05:57:45 +00:00
Nadav Rotem	7efc04cb40	In LLVM FMA3 operands are dst, src1, src2, src3, however dst is not encoded as it is always src1. This was causing the encoding of the operands to be off by one. Patch by Chris Bieneman. llvm-svn: 188866	2013-08-21 05:03:10 +00:00
Craig Topper	5c94bb8551	Rename mattr names for AVX-512 to from avx-512 -> avx512f, avx-512-pfi -> av512pf, avx-512-cdi -> avx512cd, avx-512-eri->avx512er. This matches better with official docs and what gcc patches appearto be using. I didn't touch the has* functions or the feature flag names to avoid change the td and lowering file while commits are still happening. llvm-svn: 188859	2013-08-21 03:57:57 +00:00
NAKAMURA Takumi	de8880a23d	X86TargetMachine.cpp: Clarify to emit GOT in i686-{cygming\|win32}-elf for mcjit. I suppose all "lli -use-mcjit i686-*" should require GOT, (and to fail.) llvm-svn: 188856	2013-08-21 02:37:25 +00:00
Elena Demikhovsky	540d582594	AVX-512: Added more patterns for VMOVSS, VMOVSD, VMOVD, VMOVQ llvm-svn: 188786	2013-08-20 11:00:29 +00:00
Craig Topper	7a8cf01090	Fix formatting. No functional change. llvm-svn: 188746	2013-08-20 05:23:59 +00:00
Craig Topper	e13a066c94	Add AVX-512 and related features to the CPUID detection code. llvm-svn: 188745	2013-08-20 05:22:42 +00:00
Craig Topper	fd2b389263	Move AVX and non-AVX replication inside a couple multiclasses to avoid repeating each instruction for both individually. llvm-svn: 188743	2013-08-20 04:24:14 +00:00
Elena Demikhovsky	1490c5eb5b	AVX-512: added arithmetic and logical operations. ADD, SUB, MUL integer and FP types. OR, AND, XOR. Added embeded broadcast form for these instructions. llvm-svn: 188673	2013-08-19 13:26:14 +00:00
Elena Demikhovsky	3ce8dbbac2	AVX-512: Added VMOVD, VMOVQ, VMOVSS, VMOVSD instructions. llvm-svn: 188637	2013-08-18 13:08:57 +00:00
Craig Topper	e6861c9ce5	Make more of the lowering helpers static. Also use MVT instead of EVT in a couple places. llvm-svn: 188629	2013-08-18 08:53:01 +00:00
Craig Topper	8c929627d9	Don't use v16i32 for load pattern matching. All 512-bit loads are cated to v8i64. llvm-svn: 188534	2013-08-16 06:07:34 +00:00
Bill Wendling	2851907cdb	Constify the function parameters. llvm-svn: 188469	2013-08-15 18:46:14 +00:00
Craig Topper	8dbc7e9d35	Revert r188449 as it turns out we're just missing the instructions that need the v16i32/v16f32 matching. llvm-svn: 188454	2013-08-15 08:38:25 +00:00
Craig Topper	2ffd06528d	Don't let isPermImmMask handle v16i32 since VPERMI doesn't match on that type. Remove 128-bit vector handling from isPermImmMask too, it's covered by isPSHUFDMask. llvm-svn: 188449	2013-08-15 07:30:51 +00:00
Craig Topper	83e042a21b	Use MVT instead of EVT in X86ISelDAGToDAG since all the types should be legal. llvm-svn: 188446	2013-08-15 05:57:07 +00:00
Craig Topper	6f4dd2dacf	Use MVT in place of EVT in more X86 operation lowering functions. llvm-svn: 188445	2013-08-15 05:33:45 +00:00
Craig Topper	5671010cbb	Replace getValueType().getSimpleVT() with getSimpleValueType(). Also remove one weird cast from MVT->EVT just to call getSimpleVT(). llvm-svn: 188441	2013-08-15 02:33:50 +00:00
Craig Topper	d03748cf5e	Make more helper methods into static functions. llvm-svn: 188366	2013-08-14 07:53:41 +00:00
Craig Topper	7b7b159574	Remove tab characters. llvm-svn: 188365	2013-08-14 07:35:18 +00:00
Craig Topper	d905fded68	Make some helper methods static. llvm-svn: 188364	2013-08-14 07:34:43 +00:00
Craig Topper	60769e050d	Use MVT in more lowering code. llvm-svn: 188363	2013-08-14 07:04:42 +00:00
Craig Topper	52b00359b1	Replace EVT with MVT in isVectorShift. Keeps compiler from generating unneeded checks and handling for extended types. llvm-svn: 188362	2013-08-14 06:21:10 +00:00
Craig Topper	67476d7485	Replace EVT with MVT in many of the shuffle lowering functions. Keeps compiler from generating unneeded checks and handling for extended types. llvm-svn: 188361	2013-08-14 05:58:39 +00:00
Evgeniy Stepanov	7dee697faa	Fix compiler warnings. ../lib/Target/X86/X86ISelLowering.cpp:9715:7: error: unused variable 'OpVT' [-Werror,-Wunused-variable] EVT OpVT = Op0.getValueType(); ^ ../lib/Target/X86/X86ISelLowering.cpp:9763:14: error: unused variable 'NumElems' [-Werror,-Wunused-variable] unsigned NumElems = VT.getVectorNumElements(); llvm-svn: 188269	2013-08-13 14:04:20 +00:00
Elena Demikhovsky	60b1f289f2	AVX-512: Added CMP and BLEND instructions. Lowering for SETCC. llvm-svn: 188265	2013-08-13 13:24:07 +00:00
Kevin Enderby	b03f3fe4e8	Fix a crash with X86 Mach-O and a subtraction expression where both symbols are undefined and produce an error message instead as this is a non-relocatable expression with X86 Mach-O. rdar://8920876 llvm-svn: 188218	2013-08-12 22:45:44 +00:00
Elena Demikhovsky	5fed3b95db	AVX-512: Added more tests for BROADCAST llvm-svn: 188148	2013-08-11 12:29:16 +00:00
Elena Demikhovsky	cf5b1458e6	AVX-512: Added VPERM* instructons and MOV* zmm-to-zmm instructions. Added a test for shuffles using VPERM. llvm-svn: 188147	2013-08-11 07:55:09 +00:00
Benjamin Kramer	21585fd9c1	Add a overload to CostTable which allows it to infer the size of the table. Use it to avoid repeating ourselves too often. Also store MVT::SimpleValueType in the TTI tables so they can be statically initialized, MVT's constructors create bloated initialization code otherwise. llvm-svn: 188095	2013-08-09 19:33:32 +00:00
Michael J. Spencer	126973ba93	[Object] Split the ELF interface into 3 parts. * ELFTypes.h contains template magic for defining types based on endianess, size, and alignment. * ELFFile.h defines the ELFFile class which provides low level ELF specific access. * ELFObjectFile.h contains ELFObjectFile which uses ELFFile to implement the ObjectFile interface. llvm-svn: 188022	2013-08-08 22:27:13 +00:00
Jakub Staszak	9c34922ff2	Use pop_back() instead of pop_back_val() when the returned value is not used. llvm-svn: 187986	2013-08-08 15:48:46 +00:00
Jakub Staszak	b5ab81d5d0	Fix the comment. llvm-svn: 187984	2013-08-08 15:19:25 +00:00
Elena Demikhovsky	45c54ad8dc	AVX-512 set: Added BROADCAST instructions with lowering logic and a test. llvm-svn: 187884	2013-08-07 12:34:55 +00:00
Craig Topper	c5b0ad27ab	Simplify code. No functional change intended. llvm-svn: 187870	2013-08-07 08:16:07 +00:00
Tim Northover	a4415854db	Refactor isInTailCallPosition handling This change came about primarily because of two issues in the existing code. Niether of: define i64 @test1(i64 %val) { %in = trunc i64 %val to i32 tail call i32 @ret32(i32 returned %in) ret i64 %val } define i64 @test2(i64 %val) { tail call i32 @ret32(i32 returned undef) ret i32 42 } should be tail calls, and the function sameNoopInput is responsible. The main problem is that it is completely symmetric in the "tail call" and "ret" value, but in reality different things are allowed on each side. For these cases: 1. Any truncation should lead to a larger value being generated by "tail call" than needed by "ret". 2. Undef should only be allowed as a source for ret, not as a result of the call. Along the way I noticed that a mismatch between what this function treats as a valid truncation and what the backends see can lead to invalid calls as well (see x86-32 test case). This patch refactors the code so that instead of being based primarily on values which it recurses into when necessary, it starts by inspecting the type and considers each fundamental slot that the backend will see in turn. For example, given a pathological function that returned {{}, {{}, i32, {}}, i32} we would consider each "real" i32 in turn, and ask if it passes through unchanged. This is much closer to what the backend sees as a result of ComputeValueVTs. Aside from the bug fixes, this eliminates the recursion that's going on and, I believe, makes the bulk of the code significantly easier to understand. The trade-off is the nasty iterators needed to find the real types inside a returned value. llvm-svn: 187787	2013-08-06 09:12:35 +00:00
Craig Topper	cf969eadaf	Simplify vector lane handling math a bit. No functional change intended. llvm-svn: 187783	2013-08-06 07:23:12 +00:00
Craig Topper	7418ff460c	Simplify math a little bit. llvm-svn: 187781	2013-08-06 06:54:25 +00:00
NAKAMURA Takumi	aaf66c7357	Target//CMakeLists.txt: Add the dependency to CommonTableGen explicitly for each corresponding CodeGen. Without explicit dependencies, both per-file action and in-CommonTableGen action could run in parallel. It races to emit .inc files simultaneously. llvm-svn: 187780	2013-08-06 06:38:37 +00:00
Craig Topper	9bc00b65b6	Replace EVT with MVT in isHorizontalBinOp as it is only called with legal types. llvm-svn: 187779	2013-08-06 06:05:05 +00:00
Craig Topper	47d7c5c8fe	Simplify code slightly. No functional change. llvm-svn: 187771	2013-08-06 04:12:40 +00:00
Aaron Ballman	5b4634576e	Silencing an MSVC11 type conversion warning. llvm-svn: 187727	2013-08-05 13:47:03 +00:00
Elena Demikhovsky	40864b690b	AVX-512 set: added mask operations, lowering BUILD_VECTOR for i1 vector types. Added intrinsics and tests. llvm-svn: 187717	2013-08-05 08:52:21 +00:00
Benjamin Kramer	5bc180c14f	X86: Turn fp selects into mask operations. double test(double a, double b, double c, double d) { return a<b ? c : d; } before: _test: ucomisd %xmm0, %xmm1 ja LBB0_2 movaps %xmm3, %xmm2 LBB0_2: movaps %xmm2, %xmm0 after: _test: cmpltsd %xmm1, %xmm0 andpd %xmm0, %xmm2 andnpd %xmm3, %xmm0 orpd %xmm2, %xmm0 Small speedup on Benchmarks/SmallPT llvm-svn: 187706	2013-08-04 12:05:16 +00:00
Elena Demikhovsky	cd46691728	AVX-512 set: added VEXTRACTPS instruction llvm-svn: 187705	2013-08-04 10:46:07 +00:00
Tim Northover	ecc018c7b7	X86: correct tail return address calculation Due to the weird and wondeful usual arithmetic conversions, some calculations involving negative values were getting performed in uint32_t and then promoted to int64_t, which is really not a good idea. Patch by Katsuhiro Ueno. llvm-svn: 187703	2013-08-04 09:35:57 +00:00
Bill Wendling	a5c536e1ee	Use function attributes to indicate that we don't want to realign the stack. Function attributes are the future! So just query whether we want to realign the stack directly from the function instead of through a random target options structure. llvm-svn: 187618	2013-08-01 21:42:05 +00:00
Daniel Malea	a3d4245a72	Fixed the Intel-syntax X86 disassembler to respect the (existing) option for hexadecimal immediates, to match AT&T syntax. This also brings a new option for C-vs-MASM-style hex. Patch by Richard Mitton Reviewed: http://llvm-reviews.chandlerc.com/D1243 llvm-svn: 187614	2013-08-01 21:18:16 +00:00
Elena Demikhovsky	b1266b5447	EVEX and compressed displacement encoding for AVX512 llvm-svn: 187576	2013-08-01 13:34:06 +00:00
Elena Demikhovsky	b0a75431ad	Fixed assertion in Extract128BitVector() llvm-svn: 187493	2013-07-31 12:03:08 +00:00
Elena Demikhovsky	67b05fc0b3	Added INSERT and EXTRACT intructions from AVX-512 ISA. All insertf/extractf functions replaced with insert/extract since we have insertf and inserti forms. Added lowering for INSERT_VECTOR_ELT / EXTRACT_VECTOR_ELT for 512-bit vectors. Added lowering for EXTRACT/INSERT subvector for 512-bit vectors. Added a test. llvm-svn: 187491	2013-07-31 11:35:14 +00:00
Craig Topper	efd67d4612	Changed register names (and pointer keywords) to be lower case when using Intel X86 assembler syntax. Patch by Richard Mitton. llvm-svn: 187476	2013-07-31 02:47:52 +00:00
Craig Topper	75a5ba7ed0	Remove trailing whitespace and some tab characters. llvm-svn: 187472	2013-07-31 02:00:15 +00:00
Craig Topper	6e8cd80def	Fixed incorrect disassembly for MOV16o16a when using Intel syntax. Patch by Richard Mitton. llvm-svn: 187471	2013-07-31 01:50:26 +00:00
Nico Rieck	06d17c80cc	Proper va_arg/va_copy lowering on win64 Win64 uses CharPtrBuiltinVaList instead of X86_64ABIBuiltinVaList like other 64-bit targets. llvm-svn: 187355	2013-07-29 13:07:06 +00:00
Elena Demikhovsky	003e7d73b9	Added encoding prefixes for KNL instructions (EVEX). Added 512-bit operands printing. Added instruction formats for KNL instructions. llvm-svn: 187324	2013-07-28 08:28:38 +00:00
Justin Holewinski	d3f2035a3c	Add a target legalize hook for SplitVectorOperand (again) CustomLowerNode was not being called during SplitVectorOperand, meaning custom legalization could not be used by targets. This also adds a test case for NVPTX that depends on this custom legalization. Differential Revision: http://llvm-reviews.chandlerc.com/D1195 Attempt to fix the buildbots by making the X86 test I just added platform independent llvm-svn: 187202	2013-07-26 13:28:29 +00:00
Rafael Espindola	1d812728cc	Revert "Add a target legalize hook for SplitVectorOperand" This reverts commit 187198. It broke the bots. The soft float test probably needs a -triple because of name differences. On the hard float test I am getting a "roundss $1, %xmm0, %xmm0", instead of "vroundss $1, %xmm0, %xmm0, %xmm0". llvm-svn: 187201	2013-07-26 13:18:16 +00:00
Justin Holewinski	f848a24e50	Add a target legalize hook for SplitVectorOperand CustomLowerNode was not being called during SplitVectorOperand, meaning custom legalization could not be used by targets. This also adds a test case for NVPTX that depends on this custom legalization. Differential Revision: http://llvm-reviews.chandlerc.com/D1195 llvm-svn: 187198	2013-07-26 12:46:39 +00:00
Craig Topper	8ae6fb2932	Fix more Intel syntax issues with FP instruction aliases. Test cases coming in a subsequent patch. llvm-svn: 187187	2013-07-26 05:37:46 +00:00
Craig Topper	0f7807375a	Take advantage of the register enums being in order to remove a couple static tables. llvm-svn: 187182	2013-07-26 02:02:47 +00:00
Elena Demikhovsky	8cfb43f73b	I'm starting to commit KNL backend. I'll push patches one-by-one. This patch includes support for the extended register set XMM16-31, YMM16-31, ZMM0-31. The full ISA you can see here: http://software.intel.com/en-us/intel-isa-extensions llvm-svn: 187030	2013-07-24 11:02:47 +00:00
Craig Topper	690d8ea181	Split generated asm mnemonic matching table into a separate table for each asm variant. This removes the need to store the asm variant in each row of the single table that existed before. Shaves ~16K off the size of X86AsmParser.o. llvm-svn: 187026	2013-07-24 07:33:14 +00:00
Craig Topper	593b76de44	Fix aliases for shrd/shld to handle Intel syntax properly. Also suppress them from being used by the asm printer. llvm-svn: 187020	2013-07-24 04:38:13 +00:00
Craig Topper	6030a65039	Remove some errant space charcters in mnemonic strings. llvm-svn: 186932	2013-07-23 06:45:34 +00:00
Craig Topper	db90f65bbe	Don't let x86 asm printer use the no operand movsd alias. It should use the normal movsl instead. llvm-svn: 186924	2013-07-23 01:50:47 +00:00
Craig Topper	bf547cea0e	Revert r186907 to fix bots. llvm-svn: 186910	2013-07-23 01:29:37 +00:00
Craig Topper	80c310056c	Don't let x86 asm printer use the no operand movsd alias. It should use the normal movsl instead. llvm-svn: 186907	2013-07-23 01:21:36 +00:00
Craig Topper	c15a9e4fa4	Add aliases to map 'imm, mem' form of x86 bts/btr/btc without a size suffix to their 32-bit forms. This makes them consistent with 'bt' which already had this handling. gas has the same behavior. There have been discussions on the mailing list about determining size based on the immediate, but my goal here was just to remove the inconsistency. llvm-svn: 186904	2013-07-23 00:56:15 +00:00
Craig Topper	001582833a	Explicitly don't let the asm printer use the clrb/w/l aliases for xor %reg, %reg. It only didn't use it before because it seems InstAlias handling in the asm printer fails to count tied operands so it tried to find an xor with 2 operands instead of the 3 it wfails to count tied. llvm-svn: 186900	2013-07-23 00:15:19 +00:00
Craig Topper	c638f0cd4f	Suppress argumentless aliases for some x86 FP operations from being used by the asm writer. Prefer to use the explicit %st(1) form. llvm-svn: 186897	2013-07-23 00:03:33 +00:00
Kevin Enderby	285da02094	Fix the move to/from accumulator register instructions that use a full 64-bit absolute address encoded in the instruction. rdar://8612627 and rdar://14299221 llvm-svn: 186878	2013-07-22 21:25:31 +00:00
Craig Topper	998bcf9534	Recommit r186813: More Intel syntax alias fixes. With the addition of suppressing some of the aliases from being emitted by the asm printer. llvm-svn: 186869	2013-07-22 20:46:37 +00:00
Tim Northover	1a9dafcd6f	Revert "More Intel syntax alias fixes." This reverts commit r186813, which broke the bots. llvm-svn: 186818	2013-07-22 11:02:32 +00:00
Craig Topper	b095c09330	Fix typo. Change %cl to CL in Intel pattern. llvm-svn: 186815	2013-07-22 10:07:26 +00:00
Craig Topper	61da939a17	More Intel syntax alias fixes. llvm-svn: 186814	2013-07-22 09:58:07 +00:00
Craig Topper	1b9e4e7e9d	More Intel syntax alias fixes. llvm-svn: 186813	2013-07-22 09:42:31 +00:00
Craig Topper	03db790dc6	Change %xmm0 to XMM0 in Intel side of asm strings for PBLENDVB. llvm-svn: 186812	2013-07-22 09:22:49 +00:00
Craig Topper	8f9402a989	Add Intel variants to aliases for some FP instructions. llvm-svn: 186811	2013-07-22 09:18:43 +00:00
Craig Topper	d0ed3a417e	Reverse operands for Intel syntax form of 'bt' alias. llvm-svn: 186809	2013-07-22 07:47:51 +00:00
Craig Topper	8956fe0dbc	Mark that the _ftol2 function used by windows on x86 to handle fptoui modifies ECX. llvm-svn: 186787	2013-07-21 07:28:13 +00:00
Craig Topper	ad1fff9be7	Fix copy and paste bug from r186491 to make v2f64 use MOVAPD/MOVUPD as it should. llvm-svn: 186566	2013-07-18 07:16:44 +00:00
Craig Topper	55475d448b	Teach x86 fast-isel to use AVX opcodes for vector stores when AVX is enabled. llvm-svn: 186496	2013-07-17 06:58:23 +00:00
Craig Topper	4f55b0efd2	Make x86 fast-isel correctly choose between aligned and unaligned operations for vector stores. Fixes PR16640. llvm-svn: 186491	2013-07-17 05:57:45 +00:00
Juergen Ributzka	3d527d80b8	[X86] Use min/max to optimze unsigend vector comparison on X86 Use PMIN/PMAX for UGE/ULE vector comparions to reduce the number of required instructions. This trick also works for UGT/ULT, but there is no advantage in doing so. It wouldn't reduce the number of instructions and it would actually reduce performance. Reviewer: Ben radar:5972691 llvm-svn: 186432	2013-07-16 18:20:45 +00:00
Craig Topper	202fbc2c9b	Add 'static' keyword to some const arrays for consistency. llvm-svn: 186308	2013-07-15 06:54:12 +00:00
Craig Topper	b94011fd28	Use SmallVectorImpl& instead of SmallVector to avoid repeating small vector size. llvm-svn: 186274	2013-07-14 04:42:23 +00:00
Arnold Schwaighofer	6042a261b8	X86 cost model: Add cost for vectorized gather/scather radar://14351991 llvm-svn: 186189	2013-07-12 19:16:07 +00:00
Benjamin Kramer	068a2253e9	X86: Shrink certain forms of movsx. In particular: movsbw %al, %ax --> cbtw movswl %ax, %eax --> cwtl movslq %eax, %rax --> cltq According to Intel's manual those have the same performance characteristics but come with a smaller encoding. llvm-svn: 186174	2013-07-12 18:06:44 +00:00
Stephen Lin	fda967fdea	X86: fold SSE2/AVX2 logical shift by immediate amount into zero vector when possible Patch by Andrea Di Biagio llvm-svn: 186165	2013-07-12 15:31:36 +00:00
Charles Davis	e8f297ca94	Target/X86: Add explicit Win64 and System V/x86-64 calling conventions. Summary: This patch adds explicit calling convention types for the Win64 and System V/x86-64 ABIs. This allows code to override the default, and use the Win64 convention on a target that wants to use SysV (and vice-versa). This is needed to implement the `ms_abi` and `sysv_abi` GNU attributes. Reviewers: CC: llvm-svn: 186144	2013-07-12 06:02:35 +00:00
Stephen Lin	73de7bf5de	AArch64/PowerPC/SystemZ/X86: This patch fixes the interface, usage, and all in-tree implementations of TargetLoweringBase::isFMAFasterThanMulAndAdd in order to resolve the following issues with fmuladd (i.e. optional FMA) intrinsics: 1. On X86(-64) targets, ISD::FMA nodes are formed when lowering fmuladd intrinsics even if the subtarget does not support FMA instructions, leading to laughably bad code generation in some situations. 2. On AArch64 targets, ISD::FMA nodes are formed for operations on fp128, resulting in a call to a software fp128 FMA implementation. 3. On PowerPC targets, FMAs are not generated from fmuladd intrinsics on types like v2f32, v8f32, v4f64, etc., even though they promote, split, scalarize, etc. to types that support hardware FMAs. The function has also been slightly renamed for consistency and to force a merge/build conflict for any out-of-tree target implementing it. To resolve, see comments and fixed in-tree examples. llvm-svn: 185956	2013-07-09 18:16:56 +00:00
Jim Grosbach	340b6da4f2	X86: Add comment. llvm-svn: 185900	2013-07-09 02:07:28 +00:00
Jim Grosbach	c35388f103	X86 fast-isel: Avoid explicit AH subreg reference for [SU]Rem. Explicit references to %AH for an i8 remainder instruction can lead to references to %AH in a REX prefixed instruction, which causes things to blow up. Do the same thing in FastISel as we do for DAG isel and instead shift %AX right by 8 bits and then extract the 8-bit subreg from that result. rdar://14203849 http://llvm.org/bugs/show_bug.cgi?id=16105 llvm-svn: 185899	2013-07-09 02:07:25 +00:00
Nico Rieck	51969be724	Reuse %rax after calling __chkstk on win64 Reapply this as I reverted the wrong commit. llvm-svn: 185807	2013-07-08 11:20:11 +00:00
Nico Rieck	4801303ce1	Revert "Proper va_arg/va_copy lowering on win64" This reverts commit 2b52880592a525cfe04d8f9008a35da8c2ea94c3. Needs review. llvm-svn: 185806	2013-07-08 11:19:44 +00:00
Nico Rieck	43b51056d6	Revert "Reuse %rax after calling __chkstk on win64" This reverts commit 01f8d579f7672872324208ac5bc4ac311e81b22e. llvm-svn: 185781	2013-07-08 01:30:57 +00:00
Nico Rieck	7adf6111a8	Reuse %rax after calling __chkstk on win64 llvm-svn: 185778	2013-07-07 16:48:39 +00:00
Nico Rieck	99ef2890c0	Proper va_arg/va_copy lowering on win64 llvm-svn: 185763	2013-07-06 18:08:19 +00:00
Jakob Stoklund Olesen	db429d9483	Remove the EXCEPTIONADDR, EHSELECTION, and LSDAADDR ISD opcodes. These exception-related opcodes are not used any longer. llvm-svn: 185625	2013-07-04 13:54:20 +00:00
Jakob Stoklund Olesen	a1f5b901a5	Revert r185595-185596 which broke buildbots. Revert "Simplify landing pad lowering." Revert "Remove the EXCEPTIONADDR, EHSELECTION, and LSDAADDR ISD opcodes." llvm-svn: 185600	2013-07-04 00:26:30 +00:00
Jakob Stoklund Olesen	f33ec531fa	Remove the EXCEPTIONADDR, EHSELECTION, and LSDAADDR ISD opcodes. These exception-related opcodes are not used any longer. llvm-svn: 185596	2013-07-03 23:56:31 +00:00
Craig Topper	31ee5866de	Use SmallVectorImpl::iterator/const_iterator instead of SmallVector to avoid specifying the vector size. llvm-svn: 185540	2013-07-03 15:07:05 +00:00
Ulrich Weigand	2b6fc8d613	[DebugInfo] Allow getDebugThreadLocalSymbol to return MCExpr This allows getDebugThreadLocalSymbol to return a generic MCExpr instead of just a MCSymbolRefExpr. This is in preparation for supporting debug info for TLS variables on PowerPC, where we need to describe the variable location using a more complex expression than just MCSymbolRefExpr. llvm-svn: 185460	2013-07-02 18:47:09 +00:00
David Blaikie	1b01ae8648	PR16493: DebugInfo with TLS on PPC crashing due to invalid relocation Restrict the current TLS support to X86 ELF for now. Test that we don't produce it on PPC & we can flesh that test case out with the right thing once someone implements it. llvm-svn: 185389	2013-07-01 21:45:25 +00:00
Ahmed Bougacha	8347352e11	X86: POP*rmm: move address operand to (ins) from (outs). llvm-svn: 185292	2013-06-30 20:44:50 +00:00
Chad Rosier	ee740c4d88	Fix an off-by-one error. Also make the code a little more explicit in what it is trying to do. llvm-svn: 185191	2013-06-28 18:57:01 +00:00
David Blaikie	c3ccdbe2bf	Integrate Assembler: Support X86_64_DTPOFF64 relocations llvm-svn: 185131	2013-06-28 04:24:32 +00:00
Nadav Rotem	02dd93ec1a	Get rid of the unused class member. llvm-svn: 185086	2013-06-27 17:54:10 +00:00
Nadav Rotem	f9ecbcb835	CostModel: improve the cost model for load/store of non power-of-two types such as <3 x float>, which are popular in graphics. llvm-svn: 185085	2013-06-27 17:52:04 +00:00
Benjamin Kramer	02ff1cd015	Don't cast away constness. llvm-svn: 185071	2013-06-27 11:07:42 +00:00
Elena Demikhovsky	6769c50d9e	Optimized integer vector multiplication operation by replacing it with shift/xor/sub when it is possible. Fixed a bug in SDIV, where the const operand is not a splat constant vector. llvm-svn: 184931	2013-06-26 10:55:03 +00:00
Arnold Schwaighofer	a04b9ef1e8	X86 cost model: Vectorizing integer division is a bad idea radar://14057959 llvm-svn: 184872	2013-06-25 19:14:09 +00:00
Andrew Trick	121124acf8	Revert "Temporarily enable MI-Sched on X86." This reverts commit 98a9b72e8c56dc13a2617de84503a3d78352789c. llvm-svn: 184823	2013-06-25 02:48:58 +00:00
Andrew Trick	5a1e0af838	Temporarily enable MI-Sched on X86. Sorry for the unit test churn. I'll try to make the change permanently next time. llvm-svn: 184705	2013-06-24 09:13:20 +00:00
Andrew Trick	47740deb26	Add MI-Sched support for x86 macro fusion. This is an awful implementation of the target hook. But we don't have abstractions yet for common machine ops, and I don't see any quick way to make it table-driven. llvm-svn: 184664	2013-06-23 09:00:28 +00:00
Chad Rosier	295bd43adb	The getRegForInlineAsmConstraint function should only accept MVT value types. llvm-svn: 184642	2013-06-22 18:37:38 +00:00
David Blaikie	97c6c5bd98	DebugInfo: Don't lose unreferenced non-trivial by-value parameters A FastISel optimization was causing us to emit no information for such parameters & when they go missing we end up emitting a different function type. By avoiding that shortcut we not only get types correct (very important) but also location information (handy) - even if it's only live at the start of a function & may be clobbered later. Reviewed/discussion by Evan Cheng & Dan Gohman. llvm-svn: 184604	2013-06-21 22:56:30 +00:00
Andrew Trick	7201f4f7ec	Fix IMULX machine model. Multiple def operands require multiple SchedWrites. llvm-svn: 184566	2013-06-21 18:33:04 +00:00
Kevin Enderby	35fd79237f	Update the X86 disassembler to use xacquire and xrelease when appropriate. This is a bit tricky as the xacquire and xrelease hints use the same bytes, 0xf2 and 0xf3, as the repne and rep prefixes. Fortunately llvm has different llvm MCInst Opcode enums for rep/xrelease and repne/xacquire. So to make this work a boolean was added the InternalInstruction struct as part of the Prefix state which is set with the added logic in readPrefixes() when decoding an instruction to determine if these prefix bytes are to be disassembled as xacquire or xrelease. Then we let the matcher pick the normal prefix instructionID and we change the Opcode after that when it is set into the MCInst being created. rdar://11019859 llvm-svn: 184490	2013-06-20 22:32:18 +00:00
Bill Wendling	a3cd350249	Access the TargetLoweringInfo from the TargetMachine object instead of caching it. The TLI may change between functions. No functionality change. llvm-svn: 184360	2013-06-19 21:36:55 +00:00
Bill Wendling	afc1036f3e	Access the TargetLoweringInfo from the TargetMachine object instead of caching it. The TLI may change between functions. No functionality change. llvm-svn: 184349	2013-06-19 20:51:24 +00:00
Nadav Rotem	7d6c625235	Fix 80 col violation. llvm-svn: 184228	2013-06-18 20:41:52 +00:00
Stefanus Du Toit	8811ad4f81	Add support for encoding the HLE XACQUIRE and XRELEASE prefixes. For decoding, keep the current behavior of always decoding these as their REP versions. In the future, this could be improved to recognize the cases where these behave as XACQUIRE and XRELEASE and decode them as such. llvm-svn: 184207	2013-06-18 17:08:10 +00:00
Bill Wendling	bc07a8900c	Use pointers to the MCAsmInfo and MCRegInfo. Someone may want to do something crazy, like replace these objects if they change or something. No functionality change intended. llvm-svn: 184175	2013-06-18 07:20:20 +00:00
David Blaikie	b735b4d6db	DebugInfo: remove target-specific Frame Index handling for DBG_VALUE MachineInstrs Frame index handling is now target-agnostic, so delete the target hooks for creation & asm printing of target-specific addressing in DBG_VALUEs and any related functions. llvm-svn: 184067	2013-06-16 20:34:27 +00:00
Andrew Trick	40c4f38071	Support BufferSize on ProcResGroup for unified MOp schedulers. And add Sandybridge/Haswell resource buffers. llvm-svn: 184034	2013-06-15 04:50:06 +00:00
Andrew Trick	18dc3da855	Update machine models. Specify buffer sizes for OOO processors. llvm-svn: 184033	2013-06-15 04:50:02 +00:00
Andrew Trick	de2109eb4c	Machine Model: Add MicroOpBufferSize and resource BufferSize. Replace the ill-defined MinLatency and ILPWindow properties with with straightforward buffer sizes: MCSchedMode::MicroOpBufferSize MCProcResourceDesc::BufferSize These can be used to more precisely model instruction execution if desired. Disabled some misched tests temporarily. They'll be reenabled in a few commits. llvm-svn: 184032	2013-06-15 04:49:57 +00:00
Benjamin Kramer	b289319fb8	X86: cvtpi2ps is just an SSE instruction with MMX operands. It has no AVX equivalent. Give it the right register format so we can also emit it when AVX is enabled. llvm-svn: 183971	2013-06-14 09:31:41 +00:00
Benjamin Kramer	af6c3b7002	X86: Make the cmov aliases work with intel syntax too. llvm-svn: 183907	2013-06-13 15:45:24 +00:00

1 2 3 4 5 ...

9537 Commits