llvm-project

Commit Graph

Author	SHA1	Message	Date
Bob Wilson	97919e9c59	Use pseudo instructions for VST3. llvm-svn: 112208	2010-08-26 18:51:29 +00:00
Bill Wendling	a9c03f4fae	Reapply r112176 without removing the other CMN patterns (that was unintentional). llvm-svn: 112206	2010-08-26 18:33:51 +00:00
Bob Wilson	a967c42a3d	Fix comment typos. llvm-svn: 112202	2010-08-26 18:08:11 +00:00
Jim Grosbach	074d22e1ac	Restrict the register to tGPR to make sure the str instruction will be encodable as a 16-bit wide instruction. llvm-svn: 112195	2010-08-26 17:02:47 +00:00
Dan Gohman	10b20b2b81	Revert r112176; it broke test/CodeGen/Thumb2/thumb2-cmn.ll. llvm-svn: 112191	2010-08-26 15:50:25 +00:00
Dan Gohman	ca26f79051	Reapply r112091 and r111922, support for metadata linking, with a fix: add a flag to MapValue and friends which indicates whether any module-level mappings are being made. In the common case of inlining, no module-level mappings are needed, so MapValue doesn't need to examine non-function-local metadata, which can be very expensive in the case of a large module with really deep metadata (e.g. a large C++ program compiled with -g). This flag is a little awkward; perhaps eventually it can be moved into the ClonedCodeInfo class. llvm-svn: 112190	2010-08-26 15:41:53 +00:00
Bill Wendling	a9a0599b39	There seems to be a (potential) hardware bug with the CMN instruction and comparison with 0. These two pieces of code should give identical results: rsbs r1, r1, 0 cmp r0, r1 mov r0, #0 it ls mov r0, #1 and: cmn r0, r1 mov r0, #0 it ls mov r0, #1 However, the CMN gives the opposite result when r1 is 0. This is because the carry flag is set in the CMP case but not in the CMN case. In short, the CMP instruction doesn't perform a truncate of the (logical) NOT of 0 plus the value of r0 and the carry bit (because the "carry bit" parameter to AddWithCarry is defined as 1 in this case, the carry flag will always be set when r0 >= 0). The CMN instruction doesn't perform a NOT of 0 so there is never a "carry" when this AddWithCarry is performed (because the "carry bit" parameter to AddWithCarry is defined as 0). The AddWithCarry in the CMP case seems to be relying upon the identity: ~x + 1 = -x However when x is 0 and unsigned, this doesn't hold: x = 0 ~x = 0xFFFF FFFF ~x + 1 = 0x1 0000 0000 (-x = 0) != (0x1 0000 0000 = ~x + 1) Therefore, we should disable all versions of CMN, especially when comparing against zero, until we can limit when the CMN instruction is used (when we know that the RHS is not 0) or when we have a hardware fix for this. (See the ARM docs for the "AddWithCarry" pseudo-code.) This is related to <rdar://problem/7569620>. llvm-svn: 112176	2010-08-26 09:07:33 +00:00
Chris Lattner	eb2cc0ce0e	implement SplitVecOp_CONCAT_VECTORS, fixing the included testcase with SSE1. llvm-svn: 112171	2010-08-26 05:51:22 +00:00
Bob Wilson	4cec44975e	Use pseudo instructions for VST1d64Q. llvm-svn: 112170	2010-08-26 05:33:30 +00:00
Chris Lattner	cc60609cb4	fix sse1 only codegen in x86-64 mode, which is something we apparently try to support. llvm-svn: 112168	2010-08-26 05:24:29 +00:00
Chris Lattner	2d482bb96b	remove dead proto llvm-svn: 112131	2010-08-26 01:14:37 +00:00
Bruno Cardoso Lopes	184eaea855	Fix PR7748 without using microsoft extensions llvm-svn: 112128	2010-08-26 01:02:53 +00:00
Jim Grosbach	08da771ec3	Enable pre-RA virtual frame base register allocation. rdar://8277890 llvm-svn: 112127	2010-08-26 00:58:06 +00:00
Bob Wilson	4629f423f8	Revert svn 107892 (with changes to work with trunk). It caused a crash if a VLD result was not used (Radar 8355607). It should also fix pr7988, but I haven't verified that yet. llvm-svn: 112118	2010-08-26 00:13:36 +00:00
Chris Lattner	aecf47a5cb	we should pattern match the SSE complex arithmetic ops. llvm-svn: 112109	2010-08-25 23:31:42 +00:00
Bob Wilson	9392b0e960	Start converting NEON load/stores to use pseudo instructions, beginning here with the VST4 instructions. Until after register allocation, we want to represent sets of adjacent registers by a single super-register. These VST4 pseudo instructions have a single QQ or QQQQ source register operand. They get expanded to the real VST4 instructions with 4 separate D register operands. Once this conversion is complete, we'll be able to remove the NEONPreAllocPass and avoid some fragile and hacky code elsewhere. llvm-svn: 112108	2010-08-25 23:27:42 +00:00
Bruno Cardoso Lopes	d4085f6e91	Revert this for now, PUNPCKLDQ dont operate on v4f32 llvm-svn: 112090	2010-08-25 21:26:37 +00:00
Daniel Dunbar	3d148ac089	X86: Fix misencode of RI64mi8. This fixes OpenSSL / x86_64-apple-darwin10 / clang -O3. llvm-svn: 112089	2010-08-25 21:11:02 +00:00
Jim Grosbach	0a84487fa7	Don't override the var from the enclosing scope. When doing copy/paste/modify, it's apparently rather important to remember the 'modify' bit... llvm-svn: 112075	2010-08-25 19:11:34 +00:00
Chris Lattner	bf80d28a74	zap dead code llvm-svn: 112073	2010-08-25 19:00:00 +00:00
Benjamin Kramer	f1f2133ac0	Remove dead recursive function. Yay for clang -Wunused-function. llvm-svn: 112060	2010-08-25 17:27:58 +00:00
Daniel Dunbar	a54a1b0edf	ARM/Thumb2: Fix a misselect in getARMCmp, when attempting to adjust a signed comparison that would overflow. - The other under/overflow cases can't actually happen because the immediates which would trigger them are legal (so we don't enter this code), but adjusted the style to make it clear the transform is always valid. llvm-svn: 112053	2010-08-25 16:58:05 +00:00
Eric Christopher	7a0d8c69cb	Do type checks before we bother to do everything else. llvm-svn: 112039	2010-08-25 08:43:57 +00:00
Anton Korobeynikov	b3b53ecac0	Fix nasty mingw32 bug, which e.g. prevented llvm-gcc bootstrap there. Mark _alloca call as clobbering EFLAGS, otherwise some DCE might remove other flags-clobbering stuff (e.g. cmp instructions) occurring after _alloca call. llvm-svn: 112034	2010-08-25 07:50:11 +00:00
Eric Christopher	761e7fb605	Reorganize load mechanisms. Handle types in a little less fixed way. Fix some todos. No functional change. llvm-svn: 112031	2010-08-25 07:23:49 +00:00
Bruno Cardoso Lopes	0770d25758	PUNPCKLDQ should also be used for v4f32 llvm-svn: 112020	2010-08-25 02:55:40 +00:00
Bruno Cardoso Lopes	2e45d522c1	teach lowering to get target specific nodes for pshufd, emulating the same isel behavior for now, so we can pass all vector shuffle tests llvm-svn: 112017	2010-08-25 02:35:37 +00:00
Eric Christopher	15b182f4d4	Fix predicate and add a comment. llvm-svn: 111981	2010-08-24 22:34:11 +00:00
Eric Christopher	236ec8f3b5	Rework braindead conditionals I put in yesterday. llvm-svn: 111974	2010-08-24 22:07:27 +00:00
Eric Christopher	6c99ebf5b0	Fix thumb2 mode loads to have the correct operand ordering. Add a todo to fix this in the port. llvm-svn: 111973	2010-08-24 22:03:02 +00:00
Jim Grosbach	2eedb7949e	Add ARM heuristic for when to allocate a virtual base register for stack access. rdar://8277890&7352504 llvm-svn: 111968	2010-08-24 21:19:33 +00:00
Daniel Dunbar	1c8d777c93	MC/X86: Tweak imul recognition, previous hack only applies for the imul form taking immediates. llvm-svn: 111950	2010-08-24 19:37:56 +00:00
Daniel Dunbar	09392785b4	MC/X86: Add custom hack for recognizing "imul $12, %eax" and friends. llvm-svn: 111947	2010-08-24 19:24:18 +00:00
Daniel Dunbar	94b84a19b9	MC/X86: Warn on scale factors > 1 without index register, instead of erroring, for 'as' compatibility. llvm-svn: 111945	2010-08-24 19:13:38 +00:00
Jim Grosbach	b77d67f318	Move enabling the local stack allocation pass into the target where it belongs. For now it's still a command line option, but the interface to the generic code doesn't need to know that. llvm-svn: 111942	2010-08-24 19:05:43 +00:00
Jim Grosbach	35b7c033d4	add ARM cmd line option to force always using virtual base regs when possible. Intended to help ease reproducing problems by increasing base register usage after heuristics for only using them when needed are in place. llvm-svn: 111930	2010-08-24 18:04:52 +00:00
Dan Gohman	c88fda477a	Fix X86's isLegalAddressingMode to recognize that static addresses need not be RIP-relative in small mode. llvm-svn: 111917	2010-08-24 15:55:12 +00:00
Kalle Raiskila	7e25bc4145	Fix SPU BE to use all the available return registers. llc used to assert on the added testcase. llvm-svn: 111911	2010-08-24 11:50:48 +00:00
Kalle Raiskila	8f3e3ba5ff	Remove some dead code from SPU BE that remained from 64bit vector support. llvm-svn: 111910	2010-08-24 11:05:51 +00:00
Bruno Cardoso Lopes	758d7b1f5c	Use pshufhw and pshuflw in more cases and fix getTargetShuffleNode number of arguments llvm-svn: 111890	2010-08-24 01:16:15 +00:00
Bill Wendling	2c64ba63a1	Add comments for what the condition code symbols mean. llvm-svn: 111889	2010-08-24 01:11:30 +00:00
Eric Christopher	46d3a56e5d	Update comment. llvm-svn: 111887	2010-08-24 01:10:52 +00:00
Eric Christopher	c0c00ca33f	Fix the opcode and the operands for the load instruction. llvm-svn: 111885	2010-08-24 01:10:04 +00:00
Eric Christopher	eb47692c22	Add register class hack that needs to go away, but makes it more obvious that it needs to go away. Use loadRegFromStackSlot where possible. Also, remember to update the value map. llvm-svn: 111883	2010-08-24 00:50:47 +00:00
Eric Christopher	9d4e471cc2	Add some more debugging code, make it more obvious that RegOffset is getting an address for an object and select some default values. llvm-svn: 111871	2010-08-24 00:07:24 +00:00
Eric Christopher	e3107d6283	Don't need the extra register here. llvm-svn: 111864	2010-08-23 23:28:04 +00:00
Eric Christopher	414501c511	Add some more "get address into register" code and more TODOs/FIXMEs. llvm-svn: 111860	2010-08-23 23:14:31 +00:00
Eric Christopher	8d03b8a8ce	Add an ARMFunctionInfo member and use it. llvm-svn: 111854	2010-08-23 22:32:45 +00:00
Eric Christopher	00202ee329	Start getting ARM loads/address computation going. llvm-svn: 111850	2010-08-23 21:44:12 +00:00
Bruno Cardoso Lopes	264d90fff7	Start using target specific nodes for shuffles: pshufhw and pshuflw llvm-svn: 111837	2010-08-23 20:41:02 +00:00
Gabor Greif	21fed6616c	tyops llvm-svn: 111835	2010-08-23 20:30:51 +00:00
Chris Lattner	58bd73a5a7	Add a new llvm.x86.int intrinsic, allowing access to the x86 int and int3 instructions. Patch by Peter Housel! llvm-svn: 111831	2010-08-23 19:39:25 +00:00
Chris Lattner	a42202e0e4	random improvement for variable shift codegen. llvm-svn: 111813	2010-08-23 17:30:29 +00:00
Anton Korobeynikov	cbbe4501df	Revert invalid r111792. Jump tables are not broken on x86-64 / coff, it's COFF emitter which does not support differences of two symbols (and needs to be fixed). GAS is pretty fine with code produced. llvm-svn: 111801	2010-08-23 07:38:51 +00:00
Michael J. Spencer	e87231232a	Workaround broken jump tables on x86-64 COFF. llvm-svn: 111792	2010-08-23 04:45:37 +00:00
Anton Korobeynikov	db9820ecaa	Use rip-rel addressing on win64 by default. For this we just default to small pic code model. llvm-svn: 111741	2010-08-21 17:21:11 +00:00
Michael J. Spencer	377aa20e6e	MC: Add partial x86-64 support to COFF. llvm-svn: 111728	2010-08-21 05:58:13 +00:00
Dan Gohman	42ef669d81	Fix x86 fast-isel's cmp+branch folding to avoid folding when the comparison is in a different basic block from the branch. In such cases, the comparison's operands may not have initialized virtual registers available. llvm-svn: 111709	2010-08-21 02:32:36 +00:00
Bruno Cardoso Lopes	9f20e7a1bf	Prepare LowerVECTOR_SHUFFLEv8i16 to use x86 target specific nodes directly llvm-svn: 111704	2010-08-21 01:32:18 +00:00
Bruno Cardoso Lopes	6f3b38a851	This is the first step towards refactoring the x86 vector shuffle code. The general idea here is to have a group of x86 target specific nodes which are going to be selected during lowering and then directly matched in isel. The commit includes the addition of those specific nodes and a bunch of patterns, and incrementally we're going to switch between them and what we have right now. Both the patterns and target specific nodes can change as we move forward with this work. llvm-svn: 111691	2010-08-20 22:55:05 +00:00
Bill Wendling	578ee4070c	Create the new linker type "linker_private_weak_def_auto". It's similar to "linker_private_weak", but it's known that the address of the object is not taken. For instance, functions that had an inline definition, but the compiler decided not to inline it. Note, unlike linker_private and linker_private_weak, linker_private_weak_def_auto may have only default visibility. The symbols are removed by the linker from the final linked image (executable or dynamic library). llvm-svn: 111684	2010-08-20 22:05:50 +00:00
Bob Wilson	9a511c07e4	Replace the arm.neon.vmovls and vmovlu intrinsics with vector sign-extend and zero-extend operations. llvm-svn: 111614	2010-08-20 04:54:02 +00:00
Eric Christopher	985d9e4ea8	Fix loop conditionals (MO.isDef() asserts that it's a reg) and move some constraints around. llvm-svn: 111594	2010-08-20 00:36:24 +00:00
Eric Christopher	d8e8a2945e	Add a couple of random comments. llvm-svn: 111592	2010-08-20 00:20:31 +00:00
Jim Grosbach	56e56323c8	Better handling of offsets on frame index references. rdar://8277890 llvm-svn: 111585	2010-08-19 23:52:25 +00:00
Jim Grosbach	8c58bd30dc	Add Thumb1 support for virtual frame indices. rdar://8277890 llvm-svn: 111533	2010-08-19 17:52:13 +00:00
Eric Christopher	a5d60c62b1	Silence warning. llvm-svn: 111518	2010-08-19 15:35:27 +00:00
Chris Lattner	f547740d3f	fix PR7465, mishandling of lcall and ljmp: intersegment long call and jumps. llvm-svn: 111496	2010-08-19 01:18:43 +00:00
Chris Lattner	beb506eeed	minor progress towards fixing PR7465 llvm-svn: 111494	2010-08-19 01:00:34 +00:00
Eric Christopher	0d274a0258	Add an AddOptionalDefs method and use it. llvm-svn: 111489	2010-08-19 00:37:05 +00:00
Bill Wendling	768d3b510c	Add the "isCompare" attribute to the defm instead of each individual instr. llvm-svn: 111481	2010-08-19 00:05:48 +00:00
Jakob Stoklund Olesen	92d57cee61	Don't call Predicate_* in Mips. llvm-svn: 111468	2010-08-18 23:56:46 +00:00
Eric Christopher	8a70781cac	Remove extra header. llvm-svn: 111456	2010-08-18 23:38:16 +00:00
Jim Grosbach	dbfc2ce95d	Enable ARM base register reuse to local stack slot allocation. Whenever a new frame index reference to an object in the local block is seen, check if it's near enough to any previously allocated base register to re-use. rdar://8277890 llvm-svn: 111443	2010-08-18 22:44:49 +00:00
Bill Wendling	ad2aa57774	Minor simplification. Gets rid of a needless temporary. llvm-svn: 111430	2010-08-18 21:32:07 +00:00
Bill Wendling	817e857b13	Marked with ATTRIBUTE_USED so that clang doesn't complain. llvm-svn: 111383	2010-08-18 18:40:57 +00:00
Jim Grosbach	e0e9b3013f	Add hook for re-using virtual base registers for local stack slot access. Nothing fancy, just ask the target if any currently available base reg is in range for the instruction under consideration and use the first one that is. Placeholder ARM implementation simply returns false for now. ongoing saga of rdar://8277890 llvm-svn: 111374	2010-08-18 17:57:37 +00:00
Kalle Raiskila	e60b5161d1	Fix a bug with insertelement on SPU. The previous algorithm in LowerVECTOR_SHUFFLE didn't check all requirements for "monotonic" shuffles. llvm-svn: 111361	2010-08-18 10:20:29 +00:00
Kalle Raiskila	ab49360f59	Remove all traces of v2[i,f]32 on SPU. The "half vectors" are now widened to full size by the legalizer. The only exception is in parameter passing, where half vectors are expanded. This causes changes to some dejagnu tests. llvm-svn: 111360	2010-08-18 10:04:39 +00:00
Kalle Raiskila	f3984d1ef6	Change SPU C calling convention to match that described in "SPU Application Binary Interface Specification, v1.9" by IBM. Specifically: use r3-r74 to pass parameters and the return value. llvm-svn: 111358	2010-08-18 09:50:30 +00:00
Chris Lattner	2edfdd289f	remove some dead code. llvm-svn: 111345	2010-08-18 02:42:11 +00:00
Chris Lattner	3e3e63efe1	remove some code that is dead now that lea's are modeled with segment registers. llvm-svn: 111343	2010-08-18 02:40:44 +00:00
Bob Wilson	fb7eaff759	Expand ZERO_EXTEND operations for NEON vector types. Testcase from Nick Lewycky. llvm-svn: 111341	2010-08-18 01:45:52 +00:00
Jim Grosbach	3cf08661f4	Add materialization of virtual base registers for frame indices allocated into the local block. Resolve references to those indices to a new base register. For simplification and testing purposes, a new virtual base register is allocated for each frame index being resolved. The result is truly horrible, but correct, code that's good for exercising the new code paths. Next up is adding thumb1 support, which should be very simple. Following that will be adding base register re-use and implementing a reasonable ARM heuristic for when a virtual base register should be generated at all. llvm-svn: 111315	2010-08-17 22:41:55 +00:00
Anton Korobeynikov	88c09879c7	Revert part of one of the prev. patches - tailjmp will follow later. llvm-svn: 111291	2010-08-17 21:08:28 +00:00
Anton Korobeynikov	231ab847ca	More fixes for win64: - Do not clobber al during variadic calls, this is AMD64 ABI-only feature - Emit wincall64, where necessary Patch by Cameron Esfahani! llvm-svn: 111289	2010-08-17 21:06:07 +00:00
Anton Korobeynikov	cd78af6e3c	Enable more win64 calls folding opportunities. Patch by Cameron Esfahani! llvm-svn: 111288	2010-08-17 21:06:01 +00:00
Jakob Stoklund Olesen	e2cbaf6ed7	Don't call tablegen'ed Predicate_* functions in the ARM target. llvm-svn: 111277	2010-08-17 20:39:04 +00:00
Jim Grosbach	62800a990b	80 column cleanup. llvm-svn: 111266	2010-08-17 18:39:16 +00:00
Jakob Stoklund Olesen	f02b4a686a	Don't call Predicate_* methods directly from Sparc target. Modernize predicates a bit. The Predicate_* methods are not used by TableGen any longer. They are only emitted for the sake of legacy code. llvm-svn: 111263	2010-08-17 18:17:12 +00:00
Jim Grosbach	c252ee2375	Add hook to examine an instruction referencing a frame index to determine whether to allocate a virtual frame base register to resolve the frame index reference in it. Implement a simple version for ARM to aid debugging. In LocalStackSlotAllocation, scan the function for frame index references to local frame indices and ask the target whether to allocate virtual frame base registers for any it encounters. Purely infrastructural for debug output. Next step is to actually allocate base registers, then add intelligent re-use of them. rdar://8277890 llvm-svn: 111262	2010-08-17 18:13:53 +00:00
Jim Grosbach	8995a1018c	explicitly handle no-op cases for clarity. Fixes clang warning. llvm-svn: 111260	2010-08-17 18:00:41 +00:00
Bob Wilson	942b10f511	Change ARM PKHTB and PKHBT instructions to use a shift_imm operand to avoid printing "lsl #0". This fixes the remaining parts of pr7792. Make corresponding changes for encoding/decoding these instructions. llvm-svn: 111251	2010-08-17 17:23:19 +00:00
Chris Lattner	72a364c107	fix emacs language spec's, patch by Edmund Grimley-Evans! llvm-svn: 111241	2010-08-17 16:20:04 +00:00
Bob Wilson	411dfad981	Allow more cases of undef shuffle indices and add tests for them. llvm-svn: 111226	2010-08-17 05:54:34 +00:00
Eric Christopher	09f757d4bc	Copy over some overridden MI wrappers for ARM fast-isel. This is where we're adding predicates and optional defs to the MachineInstrs. llvm-svn: 111222	2010-08-17 01:25:29 +00:00
Eric Christopher	663f49900d	Make arm fast-isel possible to enable via command line. llvm-svn: 111219	2010-08-17 00:46:57 +00:00
Bob Wilson	c350e7a509	Ignore undef shuffle indices when checking for a VTRN shuffle. Radar 8290937. llvm-svn: 111208	2010-08-16 23:37:17 +00:00
Bob Wilson	804f6159f1	Generalize a pattern for PKHTB: an SRL of 16-31 bits will guarantee that the high halfword is zero. The shift need not be exactly 16 bits. llvm-svn: 111196	2010-08-16 22:26:55 +00:00
Eli Friedman	2444da0652	Comment out some broken/unused/useless instructions which mess up disassembly. llvm-svn: 111185	2010-08-16 21:18:51 +00:00
Eli Friedman	51ec745509	Don't attempt to SimplifyShortMoveForm in 64-bit mode. llvm-svn: 111182	2010-08-16 21:03:32 +00:00
Matt Fleming	f751d856f0	Hookup ELF support for X86. llvm-svn: 111173	2010-08-16 18:36:14 +00:00
Bob Wilson	481d7a9ab4	Rename sat_shift operand to shift_imm, in preparation for using it for other instructions besides saturate instructions. No functional changes. llvm-svn: 111168	2010-08-16 18:27:34 +00:00
Jakob Stoklund Olesen	2cd00737c0	Partially revert r111155. It looks like MSVC is calling an operator<() that clang says is unused. llvm-svn: 111167	2010-08-16 18:24:54 +00:00
Jakob Stoklund Olesen	b7f872197a	Remove unused functions. llvm-svn: 111155	2010-08-16 17:18:18 +00:00
Bob Wilson	8303fbbcf9	Remove unused code. llvm-svn: 111154	2010-08-16 17:06:03 +00:00
Argyrios Kyrtzidis	d0fcc9a818	Revert r111082. No warnings for this common pattern. llvm-svn: 111102	2010-08-15 10:27:23 +00:00
Eric Christopher	54194bd127	Rework how the non-sse2 memory barrier is lowered so that the encoding is correct for the built-in assembler. Based on a patch from Chris. llvm-svn: 111083	2010-08-14 21:51:50 +00:00
Argyrios Kyrtzidis	7c09ddf0ae	Add ATTRIBUTE_UNUSED to methods that are not supposed to be used. llvm-svn: 111082	2010-08-14 21:35:10 +00:00
Chris Lattner	2f6c3434ac	improve indentation llvm-svn: 111073	2010-08-14 17:26:09 +00:00
Bob Wilson	bffc757df7	T2I_rbin_irs rr variant is for disassembly only, so don't provide a pattern. llvm-svn: 111068	2010-08-14 03:18:29 +00:00
Bob Wilson	4577f37d49	Add a Thumb2 t2RSBrr instruction for disassembly only. This fixes another part of PR7792. llvm-svn: 111057	2010-08-13 23:24:25 +00:00
Bob Wilson	3c9ed76ba5	Temporarily disable tail calls on ARM to work around some linker problems. llvm-svn: 111050	2010-08-13 22:43:33 +00:00
Bob Wilson	15b3c3d0ac	Move the Thumb2 SSAT and USAT optional shift operator out of the instruction opcode. This fixes part of PR7792. llvm-svn: 111047	2010-08-13 21:48:10 +00:00
Bruno Cardoso Lopes	160be2936b	Add comments to some pattern fragments in x86 llvm-svn: 111041	2010-08-13 20:39:01 +00:00
Bob Wilson	d3a828ce68	Refactor the code for disassembling Thumb2 saturate instructions along the same lines as the change I made for ARM saturate instructions. llvm-svn: 111029	2010-08-13 19:04:21 +00:00
Dale Johannesen	8d3c89e765	Revert 110491. While not wrong, it was based on a misanalysis and is undesirable. llvm-svn: 111028	2010-08-13 18:43:45 +00:00
Bruno Cardoso Lopes	081861b6b7	Fix comment to reflect code, and remove an unused argument llvm-svn: 111022	2010-08-13 17:50:47 +00:00
Bruno Cardoso Lopes	1187e3f09b	Improve comment to make explicit why not to touch this code before JIT goes MC llvm-svn: 111021	2010-08-13 17:44:10 +00:00
Eric Christopher	6e5b67ccc4	Revert last patch and r110954 as I meant to. llvm-svn: 111001	2010-08-13 02:37:50 +00:00
Eric Christopher	5e027fe113	Revert r110954 for now, pseudo instructions can't make it through to the JIT. llvm-svn: 111000	2010-08-13 02:30:00 +00:00
Bruno Cardoso Lopes	cc20fe5937	Some small clean-up: use of pseudo instructions llvm-svn: 110954	2010-08-12 20:55:18 +00:00
Johnny Chen	8e8f1c133a	Cleaned up the for-disassembly-only entries in the arm instruction table so that the memory barrier variants (other than 'SY' full system domain read and write) are treated as one instruction with option operand. llvm-svn: 110951	2010-08-12 20:46:17 +00:00
Evan Cheng	44a320dafa	Make sure ARM constant island pass does not break up an IT block. If the split point is in the middle of an IT block, it should move it up to just above the IT instruction. rdar://8302637 llvm-svn: 110947	2010-08-12 20:30:05 +00:00
Bruno Cardoso Lopes	7f704b31a9	- Teach SSEDomainFix to switch between different levels of AVX instructions. Here we guess that AVX will have domain issues, so just implement them for consistency and in the future we remove if it's unnecessary. - Make foldMemoryOperandImpl aware of 256-bit zero vectors folding and support the 128-bit counterparts of AVX too. - Make sure MOV[AU]PS instructions are only selected when SSE1 is enabled, and duplicate the patterns to match AVX. - Add a testcase for a simple 128-bit zero vector creation. llvm-svn: 110946	2010-08-12 20:20:53 +00:00
Bruno Cardoso Lopes	7e1a30c0d3	Define AVX 128-bit pattern versions of SET0PS/PD. llvm-svn: 110937	2010-08-12 18:20:59 +00:00
Bruno Cardoso Lopes	1401e040eb	Fix comment order llvm-svn: 110898	2010-08-12 02:08:52 +00:00
Bruno Cardoso Lopes	7306c86886	Begin to support some vector operations for AVX 256-bit instructions. The long term goal here is to be able to match enough of vector_shuffle and build_vector so all avx intrinsics which aren't mapped to their own built-ins but to shufflevector calls can be codegen'd. This is the first (baby) step, support building zeroed vectors. llvm-svn: 110897	2010-08-12 02:06:36 +00:00
Johnny Chen	74491bb52c	The autogened decoder was confusing the ARM STRBT for ARM USAT, because the .td entry for ARM STRBT is actually a super-instruction for A8.6.199 STRBT A1 & A2. Recover by looking for ARM:USAT encoding pattern before delegating to the auto- gened decoder. Added a "usat" test case to arm-tests.txt. llvm-svn: 110894	2010-08-12 01:40:54 +00:00
Daniel Dunbar	7d7b4d1b0f	MC/X86/AsmParser: Give an explicit error message when we reject an instruction because it could have an ambiguous suffix. llvm-svn: 110890	2010-08-12 00:55:42 +00:00
Daniel Dunbar	2ecc3bb4f7	MC/AsmParser: Push the burden of emitting diagnostics about unmatched instructions onto the target specific parser, which can do a better job. llvm-svn: 110889	2010-08-12 00:55:38 +00:00
Daniel Dunbar	167b9d7f30	tblgen/AsmMatcher: Always emit the match function as 'MatchInstructionImpl', target specific parsers can adapt the TargetAsmParser to this. llvm-svn: 110888	2010-08-12 00:55:32 +00:00
Johnny Chen	d59c73f998	Changed the format of DMBsy, DSBsy, and friends from Pseudo to MiscFrm. Added two test cases to arm-tests.txt. llvm-svn: 110880	2010-08-11 23:35:12 +00:00
Bob Wilson	add513112a	Move the ARM SSAT and USAT optional shift amount operand out of the instruction opcode. This also fixes part of PR7792. llvm-svn: 110875	2010-08-11 23:10:46 +00:00
Jakob Stoklund Olesen	9c473e46f3	Fix <rdar://problem/8282498> even if it doesn't reproduce on trunk. When a register is defined by a partial load: %reg1234:sub_32 = MOV32mr <fi#-1>; GR64:%reg1234 That load cannot be folded into an instruction using the full 64-bit register. It would become a 64-bit load. This is related to the recent change to have isLoadFromStackSlot return false on a sub-register load. llvm-svn: 110874	2010-08-11 23:08:22 +00:00
Dan Gohman	a5a25036bb	Don't use unsigned char for alignments in TargetData. There aren't that many of these things, so the memory savings isn't significant, and there are now situations where there can be alignments greater than 128. llvm-svn: 110836	2010-08-11 18:15:01 +00:00
Dan Gohman	5531aa4de1	Use ISD::ADD instead of ISD::SUB with a negated constant. This avoids trouble if the return type of TD->getPointerSize() is changed to something which doesn't promote to a signed type, and is simpler anyway. Also, use getCopyFromReg instead of getRegister to read a physical register's value. llvm-svn: 110835	2010-08-11 18:14:00 +00:00
Jim Grosbach	4d5dc3e7e5	cortex m4 has floating point support, but only single precision. llvm-svn: 110810	2010-08-11 15:44:15 +00:00
Bill Wendling	6a98131468	Consider this code snippet: float t1(int argc) { return (argc == 1123) ? 1.234f : 2.38213f; } We would generate truly awful code on ARM (those with a weak stomach should look away): _t1: movw r1, #1123 movs r2, #1 movs r3, #0 cmp r0, r1 mov.w r0, #0 it eq moveq r0, r2 movs r1, #4 cmp r0, #0 it ne movne r3, r1 adr r0, #LCPI1_0 ldr r0, [r0, r3] bx lr The problem was that legalization was creating a cascade of SELECT_CC nodes, for for the comparison of "argc == 1123" which was fed into a SELECT node for the ?: statement which was itself converted to a SELECT_CC node. This is because the ARM back-end doesn't have custom lowering for SELECT nodes, so it used the default "Expand". I added a fairly simple "LowerSELECT" to the ARM back-end. It takes care of this testcase, but can obviously be expanded to include more cases. Now we generate this, which looks optimal to me: _t1: movw r1, #1123 movs r2, #0 cmp r0, r1 adr r0, #LCPI0_0 it eq moveq r2, #4 ldr r0, [r0, r2] bx lr .align 2 LCPI0_0: .long 1075344593 @ float 2.382130e+00 .long 1067316150 @ float 1.234000e+00 llvm-svn: 110799	2010-08-11 08:43:16 +00:00
Evan Cheng	5190f09291	Report error if codegen tries to instantiate an ARM target when the cpu does not support it. e.g. cortex-m* processors. llvm-svn: 110798	2010-08-11 07:17:46 +00:00
Evan Cheng	163b624b4e	ArchV7M implies HW division instructions. llvm-svn: 110797	2010-08-11 07:00:16 +00:00
Evan Cheng	1c3c0009bd	ArchV6T2, V7A, and V7M implies Thumb2; Archv7A implies NEON. llvm-svn: 110796	2010-08-11 06:57:53 +00:00
Evan Cheng	40921a4e62	Add ARM Archv6M and let it imply FeatureDB (having dmb, etc.) llvm-svn: 110795	2010-08-11 06:51:54 +00:00
Daniel Dunbar	188b47b214	MC/ARM: Add basic support for handling predication by parsing it out of the mnemonic into a separate operand form. llvm-svn: 110794	2010-08-11 06:37:20 +00:00
Daniel Dunbar	75d26be81a	MC/ARM: Split mnemonic on '.' characters. llvm-svn: 110793	2010-08-11 06:37:16 +00:00
Daniel Dunbar	4a863e6cf7	MC/ARM: Fill in ARMOperand::dump a bit. llvm-svn: 110792	2010-08-11 06:37:12 +00:00
Daniel Dunbar	ebace2248f	MCAsmParser: Add dump() hook to MCParsedAsmOperand. llvm-svn: 110790	2010-08-11 06:37:04 +00:00
Daniel Dunbar	d8042b7bd7	MC/ARM: Add an ARMOperand class for condition codes. llvm-svn: 110788	2010-08-11 06:36:53 +00:00
Evan Cheng	91033bed94	Really control isel of barrier instructions with cpu feature. llvm-svn: 110787	2010-08-11 06:36:31 +00:00
Evan Cheng	49e02fc414	Add Cortex-M0 support. It's an ARMv6m device (no ARM mode) with some 32-bit instructions: dmb, dsb, isb, msr, and mrs. llvm-svn: 110786	2010-08-11 06:30:38 +00:00
Evan Cheng	6e809de90c	- Add subtarget feature -mattr=+db which determine whether an ARM cpu has the memory and synchronization barrier dmb and dsb instructions. - Change instruction names to something more sensible (matching name of actual instructions). - Added tests for memory barrier codegen. llvm-svn: 110785	2010-08-11 06:22:01 +00:00
Daniel Dunbar	5cd4d0f9ac	MC/ARM: Switch to using the generated match functions instead of stub implementations. llvm-svn: 110783	2010-08-11 05:24:50 +00:00
Daniel Dunbar	56e77c409b	MC/ARM: Enable generation of the ARM asm matcher, not that it can do much. llvm-svn: 110782	2010-08-11 05:09:20 +00:00
Daniel Dunbar	07cc87438f	ARM: Mark some disassembler only instructions as not available for matching -- for some reason they have a very odd MCInst form where the operands overlap, but I haven't dug in to find out why yet. llvm-svn: 110781	2010-08-11 04:46:13 +00:00
Daniel Dunbar	740c50385c	ARM: Quote $p in an asm string. llvm-svn: 110780	2010-08-11 04:46:10 +00:00
Bill Wendling	79553bad50	Handle ARM compares as well as converting for ARM adds, subs, and thumb2's adds. llvm-svn: 110762	2010-08-11 00:23:00 +00:00
Bill Wendling	920f74aaab	Mark ARM compare instructions as isCompare. llvm-svn: 110761	2010-08-11 00:22:27 +00:00
Bob Wilson	9664984be8	Add a separate ARM instruction format for Saturate instructions. (I discovered 2 more copies of the ARM instruction format list, bringing the total to 4!! Two of them were already out of sync. I haven't yet gotten into the disassembler enough to know the best way to fix this, but something needs to be done.) Add support for encoding these instructions. llvm-svn: 110754	2010-08-11 00:01:18 +00:00
Evan Cheng	5415713d9a	CBZ and CBNZ are implemented. llvm-svn: 110745	2010-08-10 23:27:11 +00:00
Bruno Cardoso Lopes	91d61df3eb	Add AVX matching patterns to Packed Bit Test intrinsics. Apply the same approach of SSE4.1 ptest intrinsics but create a new x86 node "testp" since AVX introduces vtest{ps}{pd} instructions which set ZF and CF depending on sign bit AND and ANDN of packed floating-point sources. This is slightly different from what the "ptest" does. Tests coming with the other 256 intrinsics tests. llvm-svn: 110744	2010-08-10 23:25:42 +00:00
Bill Wendling	0757820f8f	Turn optimize compares back on with fix. We needed to test that a machine op was a register before checking if it was defined. llvm-svn: 110733	2010-08-10 21:38:11 +00:00
Evan Cheng	fa16acae44	Delete some unused instructions. llvm-svn: 110710	2010-08-10 19:36:22 +00:00
Evan Cheng	3f251fb26e	Re-apply r110655 with fixes. Epilogue must restore sp from fp if the function stack frame has a var-sized object. Also added a test case to check for the added benefit of this patch: it's optimizing away the unnecessary restore of sp from fp for some non-leaf functions. llvm-svn: 110707	2010-08-10 19:30:19 +00:00
Daniel Dunbar	0dd47bfca3	Revert r110655, "Fix ARM hasFP() semantics. It should return true whenever FP register is", it breaks a couple test-suite tests. llvm-svn: 110701	2010-08-10 18:32:02 +00:00
Evan Cheng	8d5d1c1331	Fix ARM hasFP() semantics. It should return true whenever FP register is reserved, not available for general allocation. This eliminates all the extra checks for Darwin. This change also fixes the use of FP to access frame indices in leaf functions and cleaned up some confusing code in epilogue emission. llvm-svn: 110655	2010-08-10 06:26:49 +00:00
Bruno Cardoso Lopes	39f215bd33	Add AVX movnt{pd,ps,dq} 256-bit intrinsics llvm-svn: 110650	2010-08-10 02:49:24 +00:00
Bruno Cardoso Lopes	cedf23dfe5	Add AVX movmsk 256-bit intrinsics llvm-svn: 110648	2010-08-10 02:34:56 +00:00
Bruno Cardoso Lopes	85da72a88f	Support AVX 256-bit load and store intrinsics llvm-svn: 110645	2010-08-10 01:43:16 +00:00
Bruno Cardoso Lopes	b2b6b65b86	Patterns to match AVX cmp instructions llvm-svn: 110633	2010-08-10 00:13:20 +00:00
Bruno Cardoso Lopes	001d6fa174	Add matching patterns for vblend AVX intrinsics llvm-svn: 110630	2010-08-10 00:02:05 +00:00
Eric Christopher	b9627ee79b	Wording. llvm-svn: 110618	2010-08-09 22:52:47 +00:00
Evan Cheng	9113832571	ARMBaseRegisterInfo::hasFP() has been broken for a while now. :-( This will always be false before PEI: (DisableFramePointerElim(MF) && MFI->adjustsStack()) Which means it's going to make r11 available as a general purpose register even if -disable-fp-elim is specified. It's working on Darwin only because r7 is always reserved. But it's obviously broken for other targets. llvm-svn: 110614	2010-08-09 22:32:45 +00:00
Bruno Cardoso Lopes	685cb32d2b	Add VCVTPD2PS, VCVTPS2DQ, VCVTPS2PDY, VCVTTPD2DQY, VCVTTPS2DQ and VCVTPD2DQ 256-bit conversion intrinsics llvm-svn: 110608	2010-08-09 21:51:56 +00:00
Bruno Cardoso Lopes	3e9b567643	Add patterns to AVX conversion instructions. Do that instead of declaring more instructions whenever possible; more coming llvm-svn: 110605	2010-08-09 21:24:59 +00:00
Oscar Fuentes	212cfde6ec	CMake: eliminated unnecessary target_link_libraries. Next time the build is broken due to wrong library dependencies, just try building again (if you are on some Unix and are building all LLVM targets) or ask someone to commit the regenerated LLVMLibDeps.cmake. llvm-svn: 110593	2010-08-09 20:33:08 +00:00
Evan Cheng	891f831963	Explicitly initialize SlowFPBrcc and Pref32BitThumb to false. llvm-svn: 110587	2010-08-09 19:19:36 +00:00
Evan Cheng	ce8fb68078	Change -prefer-32bit-thumb to attribute -mattr=+32bit instead to disable more 32-bit to 16-bit optimizations. llvm-svn: 110584	2010-08-09 18:35:19 +00:00
Bruno Cardoso Lopes	c33940b3aa	Memory version of vcvtdq2pd intrinsic llvm-svn: 110582	2010-08-09 18:20:14 +00:00
Bruno Cardoso Lopes	828f6aeced	Patterns to match vinsert, vbroadcast, vmovmask and vcvtdq2pd AVX intrinsics llvm-svn: 110580	2010-08-09 18:03:43 +00:00
Evan Cheng	7d8d9a5dd5	Add an option to disable 32 -> 16-bit Thumb2 size reduction pass for experimentation. llvm-svn: 110579	2010-08-09 17:16:10 +00:00
Kalle Raiskila	999da1f3a0	Have SPU handle halfvec stores aligned by 8 bytes. llvm-svn: 110576	2010-08-09 16:33:00 +00:00
Nick Lewycky	bb10e90487	Add optimization to Target/README.txt. llvm-svn: 110543	2010-08-08 07:04:25 +00:00
Bill Wendling	798617b1ab	Use the "isCompare" machine instruction attribute instead of calling the relatively expensive comparison analyzer on each instruction. Also rename the comparison analyzer method to something more in line with what it actually does. This pass will eventually be folded into the Machine CSE pass. llvm-svn: 110539	2010-08-08 05:04:59 +00:00
Dale Johannesen	a3bd31a923	Use sdmem and sse_load_f64 (etc.) for the vector form of CMPSD (etc.) Matching a 128-bit memory operand is wrong, the instruction uses only 64 bits (same as ADDSD etc.) 8193553. llvm-svn: 110491	2010-08-07 00:33:42 +00:00
Bruno Cardoso Lopes	93cc666a58	Patterns to match AVX 256-bit vzero intrinsics llvm-svn: 110480	2010-08-06 22:10:01 +00:00
Bruno Cardoso Lopes	3d6a3a0ede	Patterns to match AVX 256-bit permutation intrinsics llvm-svn: 110468	2010-08-06 20:03:27 +00:00
Jim Grosbach	4603d09660	Remove empty processFunctionBeforeFrameFinalized(). The default implementation of the function is equivalent, so no need to provide the target-specific version until/unless it needs to do something. llvm-svn: 110465	2010-08-06 18:57:24 +00:00
Owen Anderson	a7aed18624	Reapply r110396, with fixes to appease the Linux buildbot gods. llvm-svn: 110460	2010-08-06 18:33:48 +00:00
Rafael Espindola	027d5bcf89	Fix eabi calling convention when a 64 bit value shadows r3. Without this what was happening was: * R3 is not marked as "used" * ARM backend thinks it has to save it to the stack because of vaarg * Offset computation correctly ignores it * Offsets are wrong llvm-svn: 110446	2010-08-06 15:35:32 +00:00
Bruno Cardoso Lopes	1cf067cb3d	Patterns to match AVX 256-bit horizontal arithmetic intrinsics llvm-svn: 110427	2010-08-06 02:10:30 +00:00
Bruno Cardoso Lopes	b9ad94fbf7	Patterns to match AVX 256-bit arithmetic intrinsics llvm-svn: 110425	2010-08-06 01:52:29 +00:00
Bill Wendling	7de9d52c13	Add the Optimize Compares pass (disabled by default). This pass tries to remove comparison instructions when possible. For instance, if you have this code: sub r1, 1 cmp r1, 0 bz L1 and "sub" either sets the same flag as the "cmp" instruction or could be converted to set the same flag, then we can eliminate the "cmp" instruction altogether. This is important for ARM where the ALU instructions could set the CPSR flag, but need a special suffix ('s') to do so. llvm-svn: 110423	2010-08-06 01:32:48 +00:00
Owen Anderson	bda59bd247	Revert r110396 to fix buildbots. llvm-svn: 110410	2010-08-06 00:23:35 +00:00
Eric Christopher	e1fb772aa5	Add an option to always emit realignment code for a particular module. llvm-svn: 110404	2010-08-05 23:57:43 +00:00
Owen Anderson	755aceb5d0	Don't use PassInfo* as a type identifier for passes. Instead, use the address of the static ID member as the sole unique type identifier. Clean up APIs related to this change. llvm-svn: 110396	2010-08-05 23:42:04 +00:00
Dan Gohman	ddb2d65c50	Remove IntrWriteMem, as it's the default. Rename IntrWriteArgMem to IntrReadWriteArgMem, as it's for reading as well as writing. llvm-svn: 110395	2010-08-05 23:36:21 +00:00
Bruno Cardoso Lopes	77954bdf7a	Support very basic (doesn't include ABI support in the front-end, varargs, ...) 256-bit argument passing and return for AVX llvm-svn: 110394	2010-08-05 23:35:51 +00:00
Eric Christopher	4d9c3400f3	Handle the memory barrier pseudo that goes to nothing for the JIT. llvm-svn: 110371	2010-08-05 20:04:36 +00:00
Eric Christopher	7fd06eb8ce	Set hasSideEffects on the 64-bit no-sse memory barrier. llvm-svn: 110369	2010-08-05 19:54:59 +00:00
Jim Grosbach	f50693d1ab	For local variables in functions with a frame pointer, use FP as a base register for local access when it's closer to the stack slot being referenced than the stack pointer. Make sure to take into account any argument frame SP adjustments that are in effect at the time. rdar://8256090 llvm-svn: 110366	2010-08-05 19:27:37 +00:00
Bob Wilson	b1021395b8	Fix indentation. llvm-svn: 110363	2010-08-05 19:00:21 +00:00
Bob Wilson	72de307116	Add an ARM RSCrr instruction for disassembly only. Partial fix for PR7792. llvm-svn: 110361	2010-08-05 18:59:36 +00:00
Eric Christopher	32f5d6b9be	Be a little bit more specific about target for the memory barrier instructions. llvm-svn: 110360	2010-08-05 18:36:20 +00:00
Eric Christopher	4abffad17c	Handle the pseudo in MCInstLower. llvm-svn: 110359	2010-08-05 18:34:30 +00:00
Bob Wilson	adb93e56a3	Add an ARM RSBrr instruction for disassembly only. Partial fix for PR7792. llvm-svn: 110358	2010-08-05 18:23:43 +00:00
Chandler Carruth	e6ca1cfef7	Silence a GCC warning about && and || without explicit parentheses. This preserves the existing behavior, as it seems a conscious choice to allow RS to be null and BigStack marked true. llvm-svn: 110307	2010-08-05 03:04:21 +00:00
Bob Wilson	97886d59d1	ARM "rrx" shift operands do not have an immediate. PR7790. llvm-svn: 110292	2010-08-05 00:34:42 +00:00
Eric Christopher	2db8464282	Make x86-64 membarriers work without sse and clean up some of the uses. llvm-svn: 110274	2010-08-04 23:03:04 +00:00
Jim Grosbach	8aaadea8ef	and back in. false alarm on the tests from another unrelated local change. llvm-svn: 110269	2010-08-04 22:46:09 +00:00
Eli Friedman	39d0f57cab	PR7814: Truncates cannot be ignored for signed comparisons. llvm-svn: 110268	2010-08-04 22:40:58 +00:00
Devang Patel	a52ddc496a	Implement target specific getDebugValueLocation(). llvm-svn: 110267	2010-08-04 22:39:39 +00:00
Jim Grosbach	8732d966e1	oops. revert for a moment to clean up tests first. llvm-svn: 110259	2010-08-04 22:12:43 +00:00
Jim Grosbach	22be317fe4	Reserve a stack slot if the function adjusts the stack but doesn't simplify the call frame pseudo instructions. In that situation, the calculations for estimating the stack size will be way off, leading to not having an emergency spill slot when we need one. It should be possible to be more precise about tracking the adjustment values, but not really necessary for correctness. Upcoming cleanups for PEI in general will render that moot. llvm-svn: 110258	2010-08-04 22:10:15 +00:00
Devang Patel	6e9a979414	Implement target specific getDebugValueLocation(). llvm-svn: 110256	2010-08-04 22:07:50 +00:00
Torok Edwin	31e90d2dd1	Use indirect calls in PowerPC JIT. See PR5201. There is no way to know if direct calls will be within the allowed range for BL. Hence emit all calls as indirect when in JIT mode. Without this long-running applications will fail to JIT on PowerPC with a relocation failure. llvm-svn: 110246	2010-08-04 20:47:44 +00:00
Dale Johannesen	21f13209f8	Remove switch for disabling ARM tail calls. They seem to be working correctly. No functional change. llvm-svn: 110226	2010-08-04 18:07:17 +00:00
Devang Patel	2bf0f3ceff	Add DEBUG message. llvm-svn: 110224	2010-08-04 18:06:05 +00:00
Benjamin Kramer	a53a4eefa6	Enable COFF writer on mingw32 and cygwin. llvm-svn: 110200	2010-08-04 15:32:40 +00:00
Kalle Raiskila	8b2f70125f	Make SPU backend handle insertelement and store for "half vectors" llvm-svn: 110198	2010-08-04 13:59:48 +00:00
Benjamin Kramer	61c8e6dc16	Print an error message when someone tries -integrated-as on an unsupported target. - The COFF backend doesn't support MingW/Cygwin at the moment, it'll report an error, but it's still much better than random assertions from the MachO backend. - We want to make ELF the default eventually, it's what the majority of targets use. llvm-svn: 110197	2010-08-04 13:16:30 +00:00
Gabor Greif	94ab490260	by Alexander Herz: "The CWriter::GetValueName() method does not check if a value has an alias and emits the alias name which will never be defined in the output .c file (so the output file fails to compile). This can happen if you have multiple inheritance with several destructors defined by clang (...D0Ev, ...D1Ev, ...D2Ev)." -- applied with minor tweaks. Thanks! llvm-svn: 110194	2010-08-04 10:00:52 +00:00
Bob Wilson	79daf7e0ae	Combine NEON VABD (absolute difference) intrinsics with ADDs to make VABA (absolute difference with accumulate) intrinsics. Radar 8228576. llvm-svn: 110170	2010-08-04 00:12:08 +00:00
Chris Lattner	53befe7bc1	fix a win64 encoding problem, patch by Cameron Esfahani! llvm-svn: 110164	2010-08-03 22:49:22 +00:00
Nate Begeman	b69b182191	Add support for getting & setting the FPSCR application register on ARM when VFP is enabled. Add support for using the FPSCR in conjunction with the vcvtr instruction, for controlling fp to int rounding. Add support for the FLT_ROUNDS_ node now that the FPSCR is exposed. llvm-svn: 110152	2010-08-03 21:31:55 +00:00
Oscar Fuentes	371b1b91bf	CMake: Change some target library names: XCore->XCoreGen PIC16->PIC16CodeGen After updating your working copy, the first build will fail because it is using the old library dependencies. Start the build again and it will work fine. llvm-svn: 110127	2010-08-03 17:40:31 +00:00
Kalle Raiskila	77558b7d13	More SPU v2f32 stuff added: insertelement and shuffle. llvm-svn: 110038	2010-08-02 11:22:10 +00:00
Kalle Raiskila	68b3886678	Add preliminary v2f32 support for SPU. Like with v2i32, we just duplicate the instructions and operate on half vectors. Also reorder code in SPUInstrInfo.td for better coherency. llvm-svn: 110037	2010-08-02 10:25:47 +00:00
Kalle Raiskila	622f8eb981	Add preliminary v2i32 support for SPU backend. As there are no such registers in SPU, this support boils down to "emulating" them by duplicating instructions on the general purpose registers. This adds the most basic operations on v2i32: passing parameters, addition, subtraction, multiplication and a few others. llvm-svn: 110035	2010-08-02 08:54:39 +00:00
Eli Friedman	7595ce05a2	PR7781: Fix incorrect shifting in PPCTargetLowering::LowerBUILD_VECTOR. llvm-svn: 109998	2010-08-02 00:18:19 +00:00
Eli Friedman	1b2bc1b844	PR7774: Fix undefined shifts in Alpha backend. As a bonus, this actually improves the generated code in some cases. llvm-svn: 109985	2010-08-01 21:13:28 +00:00
Daniel Dunbar	727be43a3d	Silence some -Asserts uninitialized variable warnings. llvm-svn: 109956	2010-07-31 21:08:54 +00:00
Michael J. Spencer	ed80f361b3	MC: Remove HasAbsolutizedSet from WindowsX86AsmBackend. llvm-svn: 109949	2010-07-31 07:21:44 +00:00
Bob Wilson	b128824b60	Move newlines before inline jumptables from the asm strings in .td files to the jtblock_operand print methods. This avoids extra newlines in the disassembler's output. PR7757. llvm-svn: 109948	2010-07-31 06:28:10 +00:00
Michael J. Spencer	6b4925e223	Add relax all support to the COFF object streamer. llvm-svn: 109947	2010-07-31 06:22:29 +00:00
Bob Wilson	cd5fc7bef1	Add support for disassembling VMVN (immediate) instructions. PR7747. llvm-svn: 109946	2010-07-31 05:57:44 +00:00
Evan Cheng	59069ec784	Add -disable-shifter-op to disable isel of shifter ops. On Cortex-a9 the shifts cost extra instructions so it might be better to emit them separately to take advantage of dual-issues. llvm-svn: 109934	2010-07-30 23:33:54 +00:00
Bob Wilson	eb7b21f3eb	Add a check in the ARM disassembler for NEON instructions that would reference registers past the end of the NEON register file, and report them as invalid instead of asserting when trying to print them. PR7746. llvm-svn: 109933	2010-07-30 23:27:59 +00:00
Dale Johannesen	cf0287e56d	PPC doesn't support VLA with large alignment. This was formerly rejected by the FE, so asserted in the BE; now the FE only warns, so we treat it as a legitimate fatal error in PPC BE. This means the test for the feature won't pass, so it's xfail'd. llvm-svn: 109892	2010-07-30 21:09:48 +00:00
Bob Wilson	4320e2d1bb	Add the __TEXT,__StaticInit section to the list of sections emitted at the beginning on ARM Darwin assembly files so that it won't be placed after debug sections. Radar 8252813. llvm-svn: 109879	2010-07-30 19:55:47 +00:00
Bruno Cardoso Lopes	349165b48f	Support all 128-bit AVX vector intrinsics. Most of them I already declared during the addition of the assembler support, the additional changes are: - Add missing intrinsics - Move all SSE conversion instructions in X86InstInfo64.td to the SSE.td file. - Duplicate some patterns to AVX mode. - Step into PCMPEST/PCMPIST custom inserter and add AVX versions. llvm-svn: 109878	2010-07-30 19:54:33 +00:00
Bruno Cardoso Lopes	405405bbfe	Fix typo! llvm-svn: 109877	2010-07-30 19:41:24 +00:00
Jim Grosbach	d343166a0b	Many Thumb2 instructions can reference the full ARM register set (i.e., have 4 bits per register in the operand encoding), but have undefined behavior when the operand value is 13 or 15 (SP and PC, respectively). The trivial coalescer in linear scan sometimes will merge a copy from SP into a subsequent instruction which uses the copy, and if that instruction cannot legally reference SP, we get bad code such as: mls r0,r9,r0,sp instead of: mov r2, sp mls r0, r9, r0, r2 This patch adds a new register class for use by Thumb2 that excludes the problematic registers (SP and PC) and is used instead of GPR for those operands which cannot legally reference PC or SP. The trivial coalescer explicitly requires that the register class of the destination for the COPY instruction contain the source register for the COPY to be considered for coalescing. This prevents errant instructions like that above. PR7499 llvm-svn: 109842	2010-07-30 02:41:01 +00:00
Nate Begeman	c4a96c0e8c	Add builtins for ssat/usat, similar to RealView's __ssat and __usat intrinsics. llvm-svn: 109813	2010-07-29 22:48:09 +00:00
Bob Wilson	728eb292eb	Refactor ARM-specific DAG combining in preparation for adding some more transformations. llvm-svn: 109800	2010-07-29 20:34:14 +00:00
Dale Johannesen	2bff50546c	Implement vector constants which are splat of integers with mov + vdup. 8003375. This is currently disabled by default because LICM will not hoist a VDUP, so it pessimizes the code if the construct occurs inside a loop (8248029). llvm-svn: 109799	2010-07-29 20:10:08 +00:00
Bob Wilson	a9bf1b1493	Don't assert on an unrecognized BrMiscFrm instruction. PR7745. llvm-svn: 109788	2010-07-29 18:29:28 +00:00
Nate Begeman	7010a71ac4	Add intrinsics __builtin_arm_qadd & __builtin_arm_qsub to allow access to the QADD & QSUB instructions. Behave identically to __qadd & __qsub RealView instruction intrinsics. llvm-svn: 109770	2010-07-29 17:56:55 +00:00
Jakob Stoklund Olesen	ba0e124aaf	Revert r109652, and remove the offending assert in loadRegFromStackSlot instead. We do sometimes load from a too small stack slot when dealing with x86 arguments (varargs and smaller-than-32-bit args). It looks like we know what we are doing in those cases, so I am going to remove the assert instead of artificially enlarging stack slot sizes. The assert in storeRegToStackSlot stays in. We don't want to write beyond the bounds of a stack slot. llvm-svn: 109764	2010-07-29 17:42:27 +00:00
Jim Grosbach	c445a7d29b	ARM mode version of r109693. Remove incorrect substitution pattern for UXTB16. It wrongly assumed the input shift was actually a rotate. rdar://8240138 llvm-svn: 109696	2010-07-28 23:25:44 +00:00
Jim Grosbach	716a596cf7	Remove incorrect substitution pattern for UXTB16. It wrongly assumed the input shift was actually a rotate. rdar://8240138 llvm-svn: 109693	2010-07-28 23:17:45 +00:00


15237 Commits