regs. This is the only change in this checkin that may affect the
default scheduler. With better register tracking and heuristics, it
doesn't make sense to artificially lower the register limit so much.
Added -sched-high-latency-cycles and X86InstrInfo::isHighLatencyDef to
give the scheduler a way to account for div and sqrt on targets that
don't have an itinerary. It currently defaults to 10 (the actual
number doesn't matter much), but only takes effect on non-default
schedulers: list-hybrid and list-ilp.
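As a rough, hedged illustration (this IR is not from the commit), the kind of
code isHighLatencyDef is meant to flag contains fdiv and sqrt operations:
declare double @llvm.sqrt.f64(double)
define double @norm(double %x, double %y) nounwind readnone {
entry:
  %s = call double @llvm.sqrt.f64(double %y)
  %d = fdiv double %x, %s
  ret double %d
}
Without an itinerary these would look like cheap operations to the scheduler;
the new hook lets list-hybrid and list-ilp assume the fixed (default 10 cycle)
latency for them instead.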
Added several heuristics that can be individually disabled for the
non-default sched=list-ilp mode. This helps us determine how much
better we can do on a given benchmark than the default
scheduler. Certain compute-intensive loops run much faster in this
mode with the right set of heuristics, and it doesn't seem to have
much negative impact elsewhere. Not all of the heuristics are needed,
but we still need to experiment to decide which should be disabled by
default for sched=list-ilp.
llvm-svn: 127067
There was a previous implementation with patterns that would
have matched e.g.
shl <v4i32> <i32>,
but this is not valid LLVM IR, so they were never selected.
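For reference, a well-formed vector shift requires the shift amount to have
the same vector type as the value being shifted, e.g. (illustrative only):
%r = shl <4 x i32> %a, <i32 2, i32 2, i32 2, i32 2>
so a pattern expecting a scalar i32 amount for a <4 x i32> value has nothing
to match.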
llvm-svn: 126998
- Allow i16, i32, i64, float, and double types, using the native .u16,
.u32, .u64, .f32, and .f64 PTX types.
- Allow loading/storing of all primitive types.
- Allow primitive types to be passed as parameters.
- Allow selection of PTX Version and Shader Model as sub-target attributes.
- Merge integer/floating-point test cases for load/store.
- Use .u32 instead of .s32 to conform to output from NVidia nvcc compiler.
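As a hedged sketch (illustrative IR, not taken from the test suite), the kind
of function these additions cover looks like:
define i32 @copy(i32* %src, i32* %dst) {
entry:
  %v = load i32* %src
  store i32 %v, i32* %dst
  ret i32 %v
}
where the i32 parameters, load, and store all map to the PTX .u32 type.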
Patch by Justin Holewinski
llvm-svn: 126824
and 256-bit forms. Because the number of elements in a vector
does not determine the vector type (4 elements could be v4f32 or
v4f64), pass the full type of the vector to decode routines.
llvm-svn: 126664
- Add appropriate TableGen patterns for fadd, fsub, fmul.
- Add .f32 as the PTX type for the LLVM float type.
- Allow parameters, return values, and global variable declarations
to accept the float type.
- Add appropriate test cases.
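A minimal hedged sketch of the kind of IR the new patterns select (the
function name is illustrative):
define float @mul_add(float %a, float %b, float %c) {
entry:
  %m = fmul float %a, %b
  %r = fadd float %m, %c
  ret float %r
}
with float mapping to the PTX .f32 type for the parameters, the return value,
and the fmul/fadd operations.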
Patch by Justin Holewinski
llvm-svn: 126636
1. Inform users of ADDEs with two 0 operands that they never set carry
2. Fold other ADDs or ADDCs into the ADDE if possible
It would be neat if we could do the same thing for SETCC+ADD eventually, but we can't do that in target independent code.
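For context on where ADDE arises (a hedged sketch; ADDC/ADDE are produced by
legalization rather than written directly), an i64 add split on a 32-bit
target is the typical source:
define i64 @add64(i64 %a, i64 %b) nounwind readnone {
entry:
  %sum = add i64 %a, %b
  ret i64 %sum
}
The low halves become an ADDC and the high halves an ADDE consuming its
carry; when both value operands of an ADDE are known to be zero, the result
can never produce a carry, and neighboring ADDs/ADDCs can sometimes be folded
into the ADDE itself.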
llvm-svn: 126557
D registers since the vpush list may not have gaps. Make sure the stack
adjustment instruction isn't moved between them. Ditto for vpop in
epilogues.
Sorry, can't reduce a small test case.
rdar://9043312
llvm-svn: 126457
The previous codegen for the slow path (when values are in VFP / NEON
registers) was incorrect if the source is NaN.
The new codegen uses NEON vbsl instruction to copy the sign bit. e.g.
vmov.i32 d1, #0x80000000
vbsl d1, d2, d0
If NEON is not available, it uses integer instructions to copy the sign bit.
rdar://9034702
llvm-svn: 126295
In other words, do not keep track of the arguments' locations. The debugger (gdb) is not prepared to see line table entries for arguments; for the debugger, the "second" line table entry marks the beginning of the function body.
This requires some coordination with the debugger to get this working.
- The debugger needs to be aware of the prolog_end attribute attached to line table entries.
- The compiler needs to accurately mark prolog_end in line table entries (at -O0 and at -O1+).
llvm-svn: 126155
"dllimport" function must not be GlobalVariable, but Function. It is enough to check with GlobalValue.
test/CodeGen/X86/dll-linkage.ll is updated to check llc -O0.
llvm-svn: 126110
of a constant had a minor typo introduced when copying it from the book, which
caused it to favor negative approximations over positive approximations in many
cases. Positive approximations require fewer operations beyond the multiplication.
In the case of division by 3, we still generate code that is a single instruction
larger than GCC's code.
llvm-svn: 126097
of testing for its presence at cmake time.
This way the build automatically regenerates the makefiles when an svn
update brings in a new sublibrary.
llvm-svn: 126068
query about available library functions. For now this just has
memset_pattern16, which exists on darwin, but it can be extended for a
bunch of other things in the future.
llvm-svn: 125965
(LLVMX86Utils.a) to break cyclic library dependencies between
LLVMX86CodeGen.a and LLVMX86AsmParser.a. Previously this code was in
a header file and marked static but AVX requires some additional
functionality here that won't be used by all clients. Since including
unused static functions causes a gcc compiler warning, keeping it as a
header would break builds that use -Werror. Putting this in its own
library solves both problems at once.
llvm-svn: 125765
No one uses *-mingw64. mingw-w64 is represented as {i686|x86_64}-w64-mingw32. On the LLVM side, i686 and x64 can be treated in a similar way.
llvm-svn: 125747
This is necessary to avoid a crash in certain tangled situations where a kill
flag is first correctly moved to a merged instruction, and then needs to be
moved again:
STR %R0, a...
STR %R0<kill>, b...
First becomes:
STR %R0, b...
STM a, %R0<kill>, ...
and then:
STM a, %R0, ...
STM b, %R0<kill>, ...
We can now remove the kill flag from the merged STM when needed. 8960050.
llvm-svn: 125591
- Add custom operand matching for imod and iflags.
- Rename SplitMnemonicAndCC to SplitMnemonic since it splits more than CC
from mnemonic.
- While adding ".w" as an operand, don't change "Head" to avoid passing the
wrong mnemonic to ParseOperand.
- Add asm parser tests.
- Add disassembler tests just to make sure it can catch all cps versions.
llvm-svn: 125489
have their low bits set to zero. This allows us to optimize
out explicit stack alignment code like in stack-align.ll:test4 when
it is redundant.
Doing this causes the code generator to start turning FI+cst into
FI|cst all over the place, which is general goodness (that is the
canonical form) except that various pieces of the code generator
don't handle OR aggressively. Fix this by introducing a new
SelectionDAG::isBaseWithConstantOffset predicate, and using it
in places that are looking for ADD(X,CST). The ARM backend in
particular was missing a lot of addressing mode folding opportunities
around OR.
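As a worked illustration (hedged; this IR is not from stack-align.ll), an
aligned frame object plus a small constant offset is exactly the FI+cst case
that now becomes FI|cst:
define void @touch() nounwind {
entry:
  %buf = alloca [16 x i8], align 16
  %p = getelementptr [16 x i8]* %buf, i32 0, i32 4
  store i8 0, i8* %p
  ret void
}
Because the frame index is 16-byte aligned, its low four bits are zero, so
FI+4 and FI|4 denote the same address, and the OR form can be folded into
addressing modes once backends look through isBaseWithConstantOffset.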
llvm-svn: 125470
These are just FXSAVE and FXRSTOR with REX.W prefixes. These versions use
64-bit pointer values instead of 32-bit pointer values in the memory map they
dump and restore.
llvm-svn: 125446
Teach the AsmMatcher handling to distinguish between an error custom-parsing
an operand and a failure to match. The former should propagate the error
upwards, while the latter should continue attempting to parse with
alternative matchers.
Update the ARM asm parser accordingly.
llvm-svn: 125426
This:
define float @foo(float %x, float %y) nounwind readnone {
entry:
%0 = tail call float @copysignf(float %x, float %y) nounwind readnone
ret float %0
}
was compiled to:
vmov s0, r1
bic r0, r0, #-2147483648
vmov s1, r0
vcmpe.f32 s0, #0
vmrs apsr_nzcv, fpscr
it lt
vneglt.f32 s1, s1
vmov r0, s1
bx lr
This fails to copy the sign of -0.0f because it's lost during the float to int
conversion. Also, it's sub-optimal when the inputs are in GPR registers.
Now it uses integer AND + OR operations when it's profitable. And it's correct!
lsrs r1, r1, #31
bfi r0, r1, #31, #1
bx lr
rdar://8984306
llvm-svn: 125357
t2LDRpci with t2LDRi12.
There are a couple of problems with this.
1. The encodings for the literal and the immediate constant are different.
Note bit 7 of the literal case is 'U' so it can be negative.
2. t2LDRi12 is now narrowed to tLDRpci before the constant island pass is run.
So we end up never using the Thumb2 instruction, which ends up creating a
lot more constant islands.
llvm-svn: 125074
parsing of operands introduced in r125030. As a small note, besides using a more
generic approach, we can also get more descriptive output when debugging
llvm-mc, for example:
mcr p7, #1, r5, c1, c1, #4
note: parsed instruction:
['mcr', <ARMCC::al>,
<coprocessor number: 7>,
1,
<register 73>,
<coprocessor register: 1>,
<coprocessor register: 1>,
4]
llvm-svn: 125052
The vld1-lane, vld1-dup and vst1-lane instructions do not yet support using
post-increment versions, but all the rest of the NEON load/store instructions
should be handled now.
llvm-svn: 125014
These operations are expanded to pairs of loads or stores, and the first one
uses the address register update to produce the address for the second one.
So far, the second load/store has also updated the address register, just
for convenience, since that output has never been used. In anticipation of
actually supporting post-increment updates for these operations, this changes
the non-updating operations to use a non-updating load/store for the second
instruction.
llvm-svn: 125013
This allows us to easily support 256-bit operations that don't have
native 256-bit support. This applies to integer operations, certain
types of shuffles, and various other things.
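A hedged example of such an operation (illustrative IR): AVX has no native
256-bit integer add, but one can be lowered as two 128-bit halves via
insert/extract:
define <8 x i32> @add256(<8 x i32> %a, <8 x i32> %b) nounwind readnone {
entry:
  %sum = add <8 x i32> %a, %b
  ret <8 x i32> %sum
}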
llvm-svn: 124910
(yes, this is different from R_ARM_CALL)
- Adds a new method getARMBranchTargetOpValue() which handles the
necessary distinction between the conditional and unconditional br/bl
needed for ARM/ELF
At least for ARM mode, the needed fixup for conditional versus unconditional
br/bl is identical, but the ARM docs and existing ARM tools expect this
reloc type...
Added a few FIXME's for future naming fixups in ARMInstrInfo.td
llvm-svn: 124895
matching EXTRACT_SUBVECTOR to VEXTRACTF128 along with support routines
to examine and translate index values. VINSERTF128 comes next. With
these two in place we can begin supporting more AVX operations as
INSERT/EXTRACT can be used as a fallback when 256-bit support is not
available.
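For illustration (a hedged sketch), a shuffle that takes the upper 128-bit
half of a 256-bit vector is the kind of pattern that reaches the DAG as
EXTRACT_SUBVECTOR and can now be matched to VEXTRACTF128:
define <4 x float> @upper_half(<8 x float> %v) nounwind readnone {
entry:
  %hi = shufflevector <8 x float> %v, <8 x float> undef,
                      <4 x i32> <i32 4, i32 5, i32 6, i32 7>
  ret <4 x float> %hi
}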
llvm-svn: 124797
Reversing the operands allows us to fold, but doesn't force us to. Also, at
this point the DAG is still being optimized, so the check for hasOneUse is not
very precise.
llvm-svn: 124773
This makes the job of the later optzn passes easier, allowing the vast number of
icmp transforms to chew on it.
We transform 840 switches in gcc.c, leading to a 16k byte shrink of the resulting
binary on i386-linux.
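A hedged sketch of the rewrite (illustrative IR, not the gcc.c case): a
switch whose cases form a contiguous range collapses to a single compare,
switch i32 %x, label %other [ i32 0, label %hit
                              i32 1, label %hit
                              i32 2, label %hit ]
becomes
%cmp = icmp ult i32 %x, 3
br i1 %cmp, label %hit, label %other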
The testcase from README.txt now compiles into
decl %edi
cmpl $3, %edi
sbbl %eax, %eax
andl $1, %eax
ret
llvm-svn: 124724
the load, then it may be legal to transform the load and store to integer
load and store of the same width.
This is done if the target considers the transformation profitable. For example,
on ARM this can transform:
vldr.32 s0, []
vstr.32 s0, []
to
ldr r12, []
str r12, []
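The kind of source pattern involved is a floating-point value that is loaded
and then simply stored again, e.g. (a hedged IR sketch, names illustrative):
%v = load float* %src
store float %v, float* %dst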
rdar://8944252
llvm-svn: 124708
This happens all the time when a smul is promoted to a larger type.
On x86-64 we now compile "int test(int x) { return x/10; }" into
movslq %edi, %rax
imulq $1717986919, %rax, %rax
movq %rax, %rcx
shrq $63, %rcx
sarq $34, %rax <- used to be "shrq $32, %rax; sarl $2, %eax"
addl %ecx, %eax
This fires 96 times in gcc.c on x86-64.
llvm-svn: 124559
default implementation for x86, going through the stack in a similar
fashion to how the codegen implements BUILD_VECTOR. Eventually this
will get matched to VINSERTF128 if AVX is available.
llvm-svn: 124307
implementation of EXTRACT_SUBVECTOR for x86, going through the stack
in a similar fashion to how the codegen implements BUILD_VECTOR.
Eventually this will get matched to VEXTRACTF128 if AVX is available.
llvm-svn: 124292