llvm-project

Commit Graph

Author	SHA1	Message	Date
Anton Korobeynikov	7697d37777	Unbreak getOnesVector() / getZeroVector() to use valid ARM extended imm's. llvm-svn: 81262	2009-09-08 22:51:43 +00:00
Chris Lattner	7896c8ba58	filecheckize some tests llvm-svn: 81259	2009-09-08 22:38:46 +00:00
Dan Gohman	72a13d2476	Use opt -S instead of piping bitcode output through llvm-dis. llvm-svn: 81257	2009-09-08 22:34:10 +00:00
Dan Gohman	9737a63ed8	Change these tests to feed the assembly files to opt directly, instead of using llvm-as, now that opt supports this. llvm-svn: 81226	2009-09-08 16:50:01 +00:00
Anton Korobeynikov	59e2b8e894	Add NEON 'laned' operations. This fixes another bunch of gcc testsuite fails and makes the code faster. llvm-svn: 81220	2009-09-08 15:22:32 +00:00
Anton Korobeynikov	758f8c690d	Unbreak llvm-svn: 81205	2009-09-08 07:30:03 +00:00
Evan Cheng	a7afdda65d	When remat'ing and destination virtual register has a sub-register index. Make sure the sub-register class matches the register class of the remat'ed instruction definition register class. llvm-svn: 81204	2009-09-08 06:39:07 +00:00
Chris Lattner	a8cb3dffe9	disable some irrelevant eh emission llvm-svn: 81200	2009-09-08 06:26:40 +00:00
Chris Lattner	b2fcd070e2	fix PR4767, a crash because fp stackifier visited blocks in depth first order, so it wouldn't process unreachable blocks. When compiling at -O0, late dead block elimination isn't done and the bad instructions got to isel. llvm-svn: 81187	2009-09-08 04:55:44 +00:00
Dan Gohman	f4a0f0f033	Fix an abort on a store of an empty struct member. getValue returns null in the case of an empty struct, so don't try to call getNumValues on it. llvm-svn: 81180	2009-09-08 01:44:02 +00:00
Dan Gohman	2512a42548	Fix a thinko: When lowering fneg with xor, bitcast the operands from floating-point to integer first, and bitcast the result back to floating-point. Previously, this test was passing by falling back to SelectionDAG lowering. The resulting code isn't as nice, but it's correct and CodeGen now stays on the fast path. llvm-svn: 81171	2009-09-07 23:47:14 +00:00
Daniel Dunbar	b9a562b7c4	Don't depend on arch specific global prefix. llvm-svn: 81084	2009-09-05 11:53:06 +00:00
Daniel Dunbar	b9ea94c990	Eliminate uses of %prcontext. - I'd appreciate it if someone else eyeballs my changes to make sure I captured the intent of the test. llvm-svn: 81083	2009-09-05 11:35:16 +00:00
Bob Wilson	7f20002993	Stabilize the order of live intervals in the priority_queue used by the linear scan reg alloc. This fixes a problem I ran into where extracting a function from a larger file caused the generated code to change (masking the problem I was trying to debug) because the allocator behaved differently. This changes the results for two X86 regression checks. stack-color-with-reg is improved, with one less instruction, but pr3495 is worse, with one more copy. As far as I can tell, these tests were just getting lucky or unlucky, so I've changed the expected results. llvm-svn: 81060	2009-09-05 01:19:16 +00:00
Evan Cheng	3d2fce01aa	Run branch folding if if-converter make some transformations. llvm-svn: 80994	2009-09-04 07:47:40 +00:00
Daniel Dunbar	30e30587eb	Remove stale greps. llvm-svn: 80986	2009-09-04 05:07:52 +00:00
Bob Wilson	36d8c75eca	Convert tests to FileCheck. llvm-svn: 80983	2009-09-04 04:07:19 +00:00
Bob Wilson	e072f8eedb	Convert a test to FileCheck. llvm-svn: 80975	2009-09-04 00:32:31 +00:00
Dan Gohman	aa92dc1e61	LLVM currently represents floating-point negation as -0.0 - x. Fix FastISel to recognize this pattern and emit a floating-point negation using xor. llvm-svn: 80963	2009-09-03 22:53:57 +00:00
Daniel Dunbar	abf2bb683a	Remove dead greps. llvm-svn: 80946	2009-09-03 20:59:02 +00:00
Dan Gohman	d0d5e685da	Recognize more opportunities to use SSE min and max instructions, swapping the operands if necessary. llvm-svn: 80940	2009-09-03 20:34:31 +00:00
Mon P Wang	eadd21ea3c	Test cases for vector shifts changes r80935 Changed the old vector shift test to use FileCheck llvm-svn: 80936	2009-09-03 19:57:35 +00:00
Evan Cheng	1b38952c99	Reference to hidden symbols do not have to go through non-lazy pointer in non-pic mode. rdar://7187172. llvm-svn: 80904	2009-09-03 07:04:02 +00:00
Chris Lattner	cdb6fd2c7c	merge all the basic linux/32 pic tests together into one test. llvm-svn: 80902	2009-09-03 06:29:23 +00:00
Chris Lattner	4f101f98d1	rename test llvm-svn: 80901	2009-09-03 06:16:49 +00:00
Anton Korobeynikov	f0da41c3e4	More missed vdup patterns llvm-svn: 80838	2009-09-02 21:21:28 +00:00
Bob Wilson	d7797754d4	Add support for generating code for vst{234}lane intrinsics. llvm-svn: 80707	2009-09-01 18:51:56 +00:00
Bob Wilson	39dc89b458	Fix incorrect declarations of intrinsics in this test. llvm-svn: 80705	2009-09-01 18:50:43 +00:00
Bob Wilson	ff69320427	Add test for vld{234}_lane instructions. llvm-svn: 80658	2009-09-01 04:27:10 +00:00
Bob Wilson	33b408a10f	Fix pr4843: When an instruction has multiple destination registers that are tied to different source registers, the TwoAddressInstructionPass needs to be smarter. Change it to check before replacing a source register whether that source register is tied to a different destination register, and if so, defer handling it until a subsequent iteration. llvm-svn: 80654	2009-09-01 04:18:40 +00:00
Jim Grosbach	f09e8d5497	SJLJ is arm/darwin only for now. force the triple for the test llvm-svn: 80651	2009-09-01 02:34:49 +00:00
Jim Grosbach	20eac92d88	Clean up LSDA name generation and use for SJLJ exception handling. This makes an eggregious hack somewhat more palatable. Bringing the LSDA forward and making it a GV available for reference would be even better, but is beyond the scope of what I'm looking to solve at this point. Objective C++ code could generate function names that broke the previous scheme. This fixes that. llvm-svn: 80649	2009-09-01 01:57:56 +00:00
David Goodwin	c8985204d9	Don't mark a register live at an undef use. llvm-svn: 80621	2009-08-31 20:47:02 +00:00
Evan Cheng	4f835f1d7d	Remove .n suffix for some 16-bit opcodes now that Darwin assembler is fixed. llvm-svn: 80615	2009-08-31 20:14:07 +00:00
Chris Lattner	b284f7b1d9	eliminate some uses of prcontext. Any help here would be appreciated :) llvm-svn: 80520	2009-08-30 21:45:23 +00:00
Anton Korobeynikov	3681144bd8	Add missed pattern llvm-svn: 80502	2009-08-30 19:06:39 +00:00
Anton Korobeynikov	eab572a8ff	EXTRACT_VECTOR_ELEMENT can have result type different from element type. Remove the assertion and generalize the code for ARM NEON stuff. llvm-svn: 80498	2009-08-30 17:14:54 +00:00
Dan Gohman	ca73326f56	CMOV_GR8 clobbers EFLAGS when its expansion involves an xor to set a register to 0. This fixes PR4814. llvm-svn: 80445	2009-08-29 22:19:15 +00:00
Anton Korobeynikov	ece642a54c	Do not assert on too wide splats we don't support. llvm-svn: 80409	2009-08-29 00:08:18 +00:00
Anton Korobeynikov	cd41d07f29	Add missed extract_element pattern llvm-svn: 80408	2009-08-28 23:41:26 +00:00
Evan Cheng	43b9ca6f42	Let Darwin linker auto-synthesize stubs and lazy-pointers. This deletes a bunch of nasty code in ARM asm printer. llvm-svn: 80404	2009-08-28 23:18:09 +00:00
Evan Cheng	6da267de23	v4, v5 does not support sxtb / sxth. llvm-svn: 80322	2009-08-28 00:31:43 +00:00
Anton Korobeynikov	205cac837f	scalar_to_vector is fully legal now (implemented as subreg accesses) llvm-svn: 80249	2009-08-27 16:04:47 +00:00
Anton Korobeynikov	d0b0262edf	Ok, sometimes it's profitable to turn scalar_to_vector stuff into subreg access. Add a testcase. llvm-svn: 80246	2009-08-27 14:51:42 +00:00
Evan Cheng	7a37b1a2ca	Fix PR4789. Teach eliminateFrameIndex how to handle VLDRQ and VSTRQ which cannot fold any immediate offset. llvm-svn: 80191	2009-08-27 01:23:50 +00:00
Dan Gohman	7f0ca9a34c	X86FastISel support for loading and storing values of type i1. llvm-svn: 80186	2009-08-27 00:31:47 +00:00
Dan Gohman	f1abb5511b	Expand i8 selects into control flow instead of 16-bit conditional moves. This avoids the need to promote the operands (or implicitly extend them, a partial register update condition), and can reduce i8 register pressure. This substantially speeds up code such as write_hex in lib/Support/raw_ostream.cpp. subclass-coalesce.ll is too trivial and no longer tests what it was originally intended to test. llvm-svn: 80184	2009-08-27 00:14:12 +00:00
Bob Wilson	faebdee4dd	Convert some more Neon tests to FileCheck. llvm-svn: 80120	2009-08-26 18:11:50 +00:00
Dale Johannesen	f582ac7c11	Alter 79292 to produce output that actually assembles. llvm-svn: 80119	2009-08-26 18:10:32 +00:00
Anton Korobeynikov	0f756b27ae	Expand scalar_to_vector - we don't have any isel logic for it now llvm-svn: 80107	2009-08-26 16:26:09 +00:00
Dan Gohman	6c23fa2442	Don't use INSERT_SUBREG to model anyext operations on x86-64, as it leads to partial-register definitions. To help avoid redundant zero-extensions, also teach the h-register matching patterns that use movzbl to match anyext as well as zext. llvm-svn: 80099	2009-08-26 14:59:13 +00:00
Anton Korobeynikov	a0e01bec87	Add dummy inline asm handling for 'r' constraint. This fixes PR4778 llvm-svn: 80085	2009-08-26 13:44:29 +00:00
Scott Michel	c5dd8bd8d2	Updated i128 sext support for CellSPU backend, contributed by Ken Werner (IBM) llvm-svn: 80042	2009-08-25 22:37:34 +00:00
Chris Lattner	3e6c7946df	remove some dead lines. llvm-svn: 80031	2009-08-25 21:01:56 +00:00
Chris Lattner	88093c7594	convert to filecheck style llvm-svn: 80029	2009-08-25 20:57:38 +00:00
Chris Lattner	6d9d5a9c94	convert to filecheck llvm-svn: 80025	2009-08-25 20:49:04 +00:00
Daniel Dunbar	9cc4970ed3	Switch abi-isel.ll to FileCheck; it's not much faster, but it now tests a lot more and is much nicer to the OS. - Dan, please check. If there are parts of the test you think I should strip out so it doesn't cause random failures let me know (there are still some PIC label numbers in it, for example). llvm-svn: 80019	2009-08-25 18:45:03 +00:00
David Goodwin	ae6bc8214a	Fixup register kills after scheduling. llvm-svn: 80002	2009-08-25 17:03:05 +00:00
Anton Korobeynikov	271cdda8e1	Provide dynamic_stackalloc lowering for MSP430. This fixes PR4769 llvm-svn: 80001	2009-08-25 17:00:23 +00:00
Dan Gohman	0d4bbf2c4a	Remove obsolete -f flags. llvm-svn: 79992	2009-08-25 15:38:29 +00:00
Dale Johannesen	f8d37c6b81	Fix PR 4751, another difficulty with %a modifier on x86. llvm-svn: 79961	2009-08-25 00:16:14 +00:00
Scott Michel	ec89f0c41a	- Remove SelectSEXTi128 from SPUISelDAGToDAG.cpp, evidently, this is redundant code, according to Anton (I'm not totally convinced, but we can always resurrect patches if we need to do so.) - Start moving CellSPU's tests to prefer FileCheck. llvm-svn: 79958	2009-08-24 23:57:35 +00:00
Scott Michel	e208c9458d	Prefer 'FileCheck' over 'grep'. llvm-svn: 79953	2009-08-24 22:49:22 +00:00
Scott Michel	8d1602af86	128-bit sign extension and vector shift cleanups, contributed by Ken Werner (IBM). llvm-svn: 79949	2009-08-24 22:28:53 +00:00
Bob Wilson	9054d25808	Fix a typo. Somehow I thought this had passed before, but I guess not. llvm-svn: 79937	2009-08-24 21:17:17 +00:00
Bob Wilson	5fe1d38607	Convert slow test to use FileCheck. llvm-svn: 79935	2009-08-24 20:33:47 +00:00
Daniel Dunbar	6969df0fab	Convert two gratuitous abuses of poor helpless CPU cycles to FileCheck. llvm-svn: 79933	2009-08-24 20:08:27 +00:00
Dale Johannesen	fbc9a2e33b	Split test into 3. llvm-svn: 79926	2009-08-24 17:51:19 +00:00
Dale Johannesen	6bbeda41b9	Make linkerprivate work for ARM and PPC. Testcase covers all Darwin targets; could be split into separate tests for the chip subdirectories, but from Chris' last mail on testing I assume he'd rather have only one test. Generic seems to be the best available, maybe there should be a Darwin subdirectory? llvm-svn: 79877	2009-08-24 01:03:42 +00:00
Daniel Dunbar	a9e1d3f065	Rerevert (r75663 and r76805), seems there is more non-determinism. llvm-svn: 79856	2009-08-23 17:26:24 +00:00
Jakob Stoklund Olesen	972c8fab51	Fix PR4753. When undoing a reuse in ReuseInfo::GetRegForReload, check if it was only a sub-register being used. The MachineOperand::getSubReg() method is only valid for virtual registers, so we have to recover the sub-register index manually. llvm-svn: 79855	2009-08-23 13:01:45 +00:00
Daniel Dunbar	2982196fca	Speculatively revert r76823 (i.e., reapply r75663 and r76805) to see if the real problem is fixed by the TableGen determinism fix. llvm-svn: 79851	2009-08-23 10:44:51 +00:00
Eli Friedman	682d8c1881	Make x86 test actually test x86 code generation. Fix the construct on ARM, which was breaking by coincidence, and add a similar testcase for ARM. llvm-svn: 79719	2009-08-22 03:13:10 +00:00
Chris Lattner	f09250f1b1	rename test, make more specific. llvm-svn: 79712	2009-08-22 00:44:24 +00:00
Anton Korobeynikov	48e4a6c739	Add missing RUN line llvm-svn: 79707	2009-08-22 00:28:50 +00:00
Anton Korobeynikov	5f47ecb918	Reduce the test llvm-svn: 79703	2009-08-22 00:18:11 +00:00
Bob Wilson	616335f6c1	Use CHECK-NEXT to make sure we're only getting one copy of each shuffle instruction. llvm-svn: 79702	2009-08-22 00:13:23 +00:00
Bob Wilson	a70623102e	Match VTRN, VZIP, and VUZP shuffles. Restore the tests for these operations, now using shuffles instead of intrinsics. llvm-svn: 79673	2009-08-21 20:54:19 +00:00
Anton Korobeynikov	f31a44ec01	Add fcopysign instructions llvm-svn: 79664	2009-08-21 20:02:37 +00:00
Anton Korobeynikov	a39f96c6ed	Handle 'r' inline asm constraint llvm-svn: 79648	2009-08-21 18:15:41 +00:00
Bob Wilson	f73af72d30	Add some tests for vext.16 and vext.32. llvm-svn: 79638	2009-08-21 16:35:24 +00:00
Bob Wilson	51c7aa04ec	Remove Neon intrinsics for VZIP, VUZP, and VTRN. We will represent these as vector shuffles. Temporarily remove the tests for these operations until the new implementation is working. llvm-svn: 79579	2009-08-21 00:01:42 +00:00
Dale Johannesen	fa2b97e61a	Use FileCheck even though this means testing for something that has nothing to do with the point of the test, per Chris. llvm-svn: 79569	2009-08-20 22:12:08 +00:00
Dan Gohman	05046085b6	Fix an x86 code size regression: prefer RIP-relative addressing over absolute addressing even in non-PIC mode (unless the address has an index or something else incompatible), because it has a smaller encoding. llvm-svn: 79553	2009-08-20 18:23:44 +00:00
Evan Cheng	01de985ae6	Fix an obvious copy-n-paste bug. llvm-svn: 79535	2009-08-20 17:01:04 +00:00
Dale Johannesen	d16abc09c4	Use FileCheck for the test run where it's appropriate. llvm-svn: 79534	2009-08-20 16:58:04 +00:00
Dale Johannesen	1d764f61ef	Handle 'a' modifier in X86 asms. PR 4742. llvm-svn: 79484	2009-08-19 22:44:41 +00:00
Bill Wendling	6c528bc7ae	Make this test platform neutral. llvm-svn: 79447	2009-08-19 18:51:45 +00:00
Dan Gohman	ac33a9061d	Add an x86 peep that narrows TEST instructions to forms that use a smaller encoding. These kinds of patterns are very frequent in sqlite3, for example. llvm-svn: 79439	2009-08-19 18:16:17 +00:00
Bob Wilson	32cd8550ce	Add support for Neon VEXT (vector extract) shuffles. This is derived from a patch by Anton Korzh. I modified it to recognize the VEXT shuffles during legalization and lower them to a target-specific DAG node. llvm-svn: 79428	2009-08-19 17:03:43 +00:00
Eli Friedman	1e008c173a	PR4737: Fix a nasty bug in load narrowing with non-power-of-two types. llvm-svn: 79415	2009-08-19 08:46:10 +00:00
Dan Gohman	4906f73a9f	Legalize the shift amount operand of SRL_PARTS, SHL_PARTS, and SRA_PARTS, as is done for SRL, SHL, and SRA. llvm-svn: 79380	2009-08-18 23:36:17 +00:00
Richard Osborne	2349fb4d45	Add support for mergeable sections back into the XCore backend. llvm-svn: 79368	2009-08-18 21:14:31 +00:00
Richard Osborne	934d61648b	Put data with relocations in the same sections as data without relocations. llvm-svn: 79351	2009-08-18 17:58:17 +00:00
Dan Gohman	b85ad63727	Make this test less sensitive to assembler differences. llvm-svn: 79348	2009-08-18 17:19:46 +00:00
Chris Lattner	8b0e164aa6	force a triple so this passes on darwin llvm-svn: 79345	2009-08-18 16:55:45 +00:00
Dan Gohman	a41fa35992	Make tail merging handle blocks with repeated predecessors correctly, and remove RemoveDuplicateSuccessor, as it is no longer necessary, and because it breaks assumptions made in MachineBasicBlock::isOnlyReachableByFallthrough. Convert test/CodeGen/X86/omit-label.ll to FileCheck and add a testcase for PR4732. test/CodeGen/Thumb2/thumb2-ifcvt2.ll sees a diff with this commit due to it being bugpoint-reduced to the point where it doesn't matter what the condition for the branch is. Add some more interesting code to test/CodeGen/X86/2009-08-06-branchfolder-crash.ll, which is the testcase that originally motivated the RemoveDuplicateSuccessor code, to help verify that the original problem isn't being re-broken. llvm-svn: 79338	2009-08-18 15:18:18 +00:00
Evan Cheng	dd406177de	Fix revsh pattern. llvm-svn: 79318	2009-08-18 05:43:23 +00:00
Dale Johannesen	4a50e68b65	PowerPC inline asm was emitting two output operands for a single "m" constraint; this is wrong because the opcode of a load or store would have to change in parallel. This patch makes it always compute addresses into a register, which is correct but not as efficient as possible. 7144566. llvm-svn: 79292	2009-08-18 00:18:39 +00:00
Richard Osborne	94a2c1acae	Update getSectionForConstant() to to allow mergable sections to be nulled out if not supported by the ELF subtarget. llvm-svn: 79249	2009-08-17 16:37:11 +00:00
Eli Friedman	b4d7312249	Fix test on Linux. llvm-svn: 79140	2009-08-15 21:28:17 +00:00
Bill Wendling	bae6b2cca3	Reapply r79127. It was fixed by d0k. llvm-svn: 79136	2009-08-15 21:21:19 +00:00
Bill Wendling	d3fade656f	Revert r79127. It was causing compilation errors. llvm-svn: 79135	2009-08-15 21:14:01 +00:00
Evan Cheng	52d4e64711	Change allowsUnalignedMemoryAccesses to take type argument since some targets support unaligned mem access only for certain types. (Should it be size instead?) ARM v7 supports unaligned access for i16 and i32, some v6 variants support it as well. llvm-svn: 79127	2009-08-15 19:23:44 +00:00
Chris Lattner	da108b4ed4	implement support for CHECK-NEXT: in filecheck. llvm-svn: 79123	2009-08-15 18:32:21 +00:00
Jakob Stoklund Olesen	ffa73acfec	Refine EarlyClobber assert in register scavenger. It is legal for an inline asm operand to use an earlyclobber register if the use operand is tied to the earlyclobber operand. The issue is discussed here: http://gcc.gnu.org/ml/gcc/1999-04n/msg00431.html We should perhaps let only the machine code verifier worry about these finer details. EarlyClobber operands are not really interesting to the scavenger. This fixes PR4528 for the third time. llvm-svn: 79122	2009-08-15 18:16:58 +00:00
Chris Lattner	c6a803be7c	specify a target triple so global variable manglings are consistent etc. llvm-svn: 79118	2009-08-15 17:35:05 +00:00
Chris Lattner	3838c2dabf	convert to filecheck. llvm-svn: 79117	2009-08-15 17:28:09 +00:00
Chris Lattner	bb193ecec8	rename this test to sse2.ll llvm-svn: 79116	2009-08-15 17:24:09 +00:00
Chris Lattner	d3954e2790	merge a bunch more sse3 tests into sse3.ll llvm-svn: 79115	2009-08-15 17:21:44 +00:00
Chris Lattner	9bae01ec47	convert test to filecheck format. llvm-svn: 79114	2009-08-15 17:05:03 +00:00
Chris Lattner	912aa19c25	rename test llvm-svn: 79113	2009-08-15 17:01:44 +00:00
Chris Lattner	e5b9130efe	this is a test for sse3, simplify it. llvm-svn: 79112	2009-08-15 17:01:19 +00:00
Jakob Stoklund Olesen	4af3c864bc	Don't setCalleeSavedInfoValid() until spills are interted. In a naked function, the flag is never set and getPristineRegs() returns an empty list. That means naked functions are able to clobber callee saved registers, but that is the whole point of naked functions. This fixes PR4716. llvm-svn: 79096	2009-08-15 13:10:46 +00:00
Jakob Stoklund Olesen	e31228e636	Add XFAIL testcase for setcc undef. llvm-svn: 79093	2009-08-15 12:10:22 +00:00
Jakob Stoklund Olesen	46da62b174	Add XFAIL test case for a scavenger assert. llvm-svn: 79092	2009-08-15 12:09:56 +00:00
Jakob Stoklund Olesen	abff8fcb1c	Update LocalRewriter::DistanceMap when inserting stack loads. In the included test case, a stack load was not included in DistanceMap. That caused TransferDeadness to ignore the instruction, leading to a scavenger assert. llvm-svn: 79090	2009-08-15 11:03:03 +00:00
Evan Cheng	d7e1a79eea	Fix tests. llvm-svn: 79086	2009-08-15 08:23:11 +00:00
Evan Cheng	6ddd7bcdd1	Turn on if-conversion for thumb2. llvm-svn: 79084	2009-08-15 07:59:10 +00:00
Chris Lattner	93980d68e4	use XCore-specific section with xcore specific cp/dp flags to restore support for globals going into the appropriate sections with the flags. This hopefully finishes unbreaking the previous behavior that I broke before. llvm-svn: 79079	2009-08-15 06:09:35 +00:00
Dan Gohman	0700a56860	On x86-64, for a varargs function, don't store the xmm registers to the register save area if %al is 0. This avoids touching xmm regsiters when they aren't actually used. llvm-svn: 79061	2009-08-15 01:38:56 +00:00
Evan Cheng	7dae88d2c9	Leaf functions which do not save CSRs can be frameless even with -disable-fp-elim. llvm-svn: 79039	2009-08-14 20:48:13 +00:00
Evan Cheng	e41903b10d	Also shrink immediate branches; also more assembler workarounds. llvm-svn: 79014	2009-08-14 18:31:44 +00:00
Anton Korobeynikov	75821f7c69	Properly handle indirect win64 args when they're passed in memory llvm-svn: 79009	2009-08-14 18:19:10 +00:00
Evan Cheng	db73d68cbe	Shrink ADR and LDR from constantpool late during constantpool island pass. llvm-svn: 78970	2009-08-14 00:32:16 +00:00
Bruno Cardoso Lopes	62e6a8bbe6	Remove HasCrazyBSS and add a flag in TAI to indicate that '.section' must be emitted for PowerPC-Linux '.bss' section llvm-svn: 78958	2009-08-13 23:30:21 +00:00
Daniel Dunbar	86c065dd68	Revert 78892 and 78895, these break generating working executables on x86_64-apple-darwin10. --- Reverse-merging r78895 into '.': U test/CodeGen/PowerPC/2008-12-12-EH.ll U lib/Target/DarwinTargetAsmInfo.cpp --- Reverse-merging r78892 into '.': U include/llvm/Target/DarwinTargetAsmInfo.h U lib/Target/X86/X86TargetAsmInfo.cpp U lib/Target/X86/X86TargetAsmInfo.h U lib/Target/ARM/ARMTargetAsmInfo.h U lib/Target/ARM/ARMTargetMachine.cpp U lib/Target/ARM/ARMTargetAsmInfo.cpp U lib/Target/PowerPC/PPCTargetAsmInfo.cpp U lib/Target/PowerPC/PPCTargetAsmInfo.h U lib/Target/PowerPC/PPCTargetMachine.cpp G lib/Target/DarwinTargetAsmInfo.cpp llvm-svn: 78919	2009-08-13 17:03:38 +00:00
Chris Lattner	68535f7603	reintroduce support for Mips "small" section handling. This is implemented somewhat differently than before, but it should have the same functionality and the previous testcase passes again. llvm-svn: 78900	2009-08-13 06:28:06 +00:00
Evan Cheng	f59e9f4288	tPOP_RET now has predicate operands. llvm-svn: 78898	2009-08-13 06:05:07 +00:00
Chris Lattner	00a8de054c	fix typo, add 10.6 version of test for my previous patch. llvm-svn: 78895	2009-08-13 05:43:33 +00:00
Evan Cheng	e5801bd220	It's ok to spill a tGPR register as long as it's still allocated a low register. llvm-svn: 78893	2009-08-13 05:40:51 +00:00
Bruno Cardoso Lopes	607cd3b63a	Change MCSectionELF to represent a section semantically instead of syntactically as a string, very similiar to what Chris did with MachO. The parsing support and validation is not introduced yet. llvm-svn: 78890	2009-08-13 05:07:35 +00:00
Dan Gohman	ef3d457126	Various AsmWriter output cleanups. Use WriteAsOperand instead of PrintUnmangledNameSafely. llvm-svn: 78878	2009-08-13 01:36:44 +00:00
Dan Gohman	1c0e13fe66	Use WriteAsOperand to print BasicBlock names. llvm-svn: 78838	2009-08-12 20:56:56 +00:00
Bob Wilson	4b35448360	Generate Neon VTBL and VTBX instructions from the corresponding intrinsics. llvm-svn: 78835	2009-08-12 20:51:55 +00:00
Dale Johannesen	58043874ce	Test for 78821, sort of. While that bug is nondeterministic, this test failed consistently on a Darwin build. llvm-svn: 78822	2009-08-12 17:43:47 +00:00
Chris Lattner	1235f20744	one last (?) bad x86 triple test. llvm-svn: 78801	2009-08-12 06:49:44 +00:00
Chris Lattner	6628e17344	fix some pastos in triple lines. llvm-svn: 78800	2009-08-12 06:49:12 +00:00
Chris Lattner	0ea6e4cc7f	another bogus triple llvm-svn: 78798	2009-08-12 06:36:52 +00:00
Chris Lattner	8c2d846a42	fix another broken target triple. llvm-svn: 78796	2009-08-12 06:29:18 +00:00
Chris Lattner	d4a70aedb0	fix an incorrect target triple. llvm-svn: 78795	2009-08-12 06:28:51 +00:00
Chris Lattner	774a88aa77	add nounwind llvm-svn: 78791	2009-08-12 05:44:03 +00:00
Evan Cheng	608d92c943	Remove an Darwin assembler workaround. llvm-svn: 78777	2009-08-12 01:56:42 +00:00
Evan Cheng	1e6c2a1c17	Shrink ADDS, ADC, RSB, and SUBS. llvm-svn: 78776	2009-08-12 01:49:45 +00:00
Evan Cheng	f6a9d06241	Shrinkify Thumb2 r = add sp, imm. llvm-svn: 78745	2009-08-11 23:00:31 +00:00
Evan Cheng	cc9ca3500d	Shrinkify Thumb2 load / store multiple instructions. llvm-svn: 78717	2009-08-11 21:11:32 +00:00
Chris Lattner	0c533d909a	now that these are in file-check format, we can merge them together into one bigger test (which runs faster) llvm-svn: 78672	2009-08-11 15:54:17 +00:00
Evan Cheng	806845daec	Fix the previous accidental commit. Now shrinking common Thumb2 load / store instructions. llvm-svn: 78659	2009-08-11 09:37:40 +00:00
Jakob Stoklund Olesen	b39a5aa794	Rebuild RegScavenger::DistanceMap each time it is needed. The register scavenger maintains a DistanceMap that maps MI pointers to their distance from the top of the current MBB. The DistanceMap is built incrementally in forward() and in bulk in findFirstUse(). It is used by scavengeRegister() to determine which candidate register has the longest unused interval. Unfortunately the DistanceMap contents can become outdated. The first time scavengeRegister() is called, the DistanceMap is filled to cover the MBB. If then instructions are inserted in the MBB (as they always are following scavengeRegister()), the recorded distances are too short. This causes bad behaviour in the included test case where a register use /after/ the current position is ignored because findFirstUse() thinks is is /before/ the current position. A "using an undefined register" assertion follows promptly. The fix is to build a fresh DistanceMap at the top of scavengeRegister(), and discard it after use. This means that DistanceMap is no longer needed as a RegScavenger member variable, and forward() doesn't need to update it. The fix then discloses issue number two in the same test case: The candidate search in scavengeRegister() finds a CSR that has been saved in the prologue, but is currently unused. It would be both inefficient and wrong to spill such a register in the emergency spill slot. In the present case, the emergency slot restore is placed immediately before the normal epilogue restore, leading to a "Redefining a live register" assertion. Fix number two: When scavengerRegister() stumbles upon an unused register that is overwritten later in the MBB, return that register early. It is important to verify that the register is defined later in the MBB, otherwise it might be an unspilled CSR. llvm-svn: 78650	2009-08-11 06:25:12 +00:00
Bob Wilson	8f5c447bfa	Convert more Neon tests to use FileCheck. llvm-svn: 78648	2009-08-11 05:51:19 +00:00
Bob Wilson	12842f9865	Use vAny type to get rid of Neon intrinsics that differed only in whether the overloaded vector types allowed floating-point or integer vector elements. Most of these operations actually depend on the element type, so bitcasting was not an option. If you include the vpadd intrinsics that I updated earlier, this gets rid of 20 intrinsics. llvm-svn: 78646	2009-08-11 05:39:44 +00:00
Bob Wilson	741a9c7bf6	Use new EVT::vAny type to combine Neon intrinsics for VPADD. llvm-svn: 78632	2009-08-11 01:15:26 +00:00
Evan Cheng	475f8a4fa2	Enable Thumb2 instruction shrinking (32-bit to 16-bit) pass. Convert a bunch of thumb2 tests to FileCheck. llvm-svn: 78622	2009-08-10 23:56:04 +00:00
Dan Gohman	9d26c85bdc	Fix a bug in the DAGCombiner's handling of multiple linked MERGE_VALUES nodes. Replacing the result values with the operands in one MERGE_VALUES node may cause another MERGE_VALUES node be CSE'd with the first one, and bring its uses along, so that the first one isn't dead, as this code expects. Fix this by iterating until the node is really dead. This fixes PR4699. llvm-svn: 78619	2009-08-10 23:43:19 +00:00
David Goodwin	d9aedcae23	Use FileCheck. llvm-svn: 78614	2009-08-10 23:14:14 +00:00
David Goodwin	bdf1a1d1a2	Use FileCheck... its good for you... llvm-svn: 78613	2009-08-10 23:06:57 +00:00
David Goodwin	9e7c7e748f	Fix test. llvm-svn: 78611	2009-08-10 22:58:08 +00:00
David Goodwin	108b522912	Fix test. llvm-svn: 78606	2009-08-10 22:31:04 +00:00
David Goodwin	85b5b027f7	Use NEON for single-precision int<->FP conversions. llvm-svn: 78604	2009-08-10 22:17:39 +00:00
Evan Cheng	f72c13bdf5	Handle the constantfp created during post-legalization dag combiner phase. llvm-svn: 78594	2009-08-10 20:25:59 +00:00
Dan Gohman	676d115ce5	Add nounwind keywords. llvm-svn: 78568	2009-08-10 16:48:40 +00:00
Chris Lattner	cb307a27bf	Make the big switch: Change MCSectionMachO to represent a section semantically instead of syntactically as a string. This means that it keeps track of the segment, section, flags, etc directly and asmprints them in the right format. This also includes parsing and validation support for llvm-mc and "attribute(section)", so we should now start getting errors about invalid section attributes from the compiler instead of the assembler on darwin. Still todo: 1) Uniquing of darwin mcsections 2) Move all the Darwin stuff out to MCSectionMachO.[cpp\|h] 3) there are a few FIXMEs, for example what is the syntax to get the S_GB_ZEROFILL segment type? llvm-svn: 78547	2009-08-10 01:39:42 +00:00
Bob Wilson	7fc63417d7	Add tests for Neon VZIP and VUZP instructions. llvm-svn: 78529	2009-08-09 06:48:29 +00:00
Bob Wilson	f60c8807e8	Add a test for Neon VTRN instructions. llvm-svn: 78528	2009-08-09 06:30:46 +00:00
Eric Christopher	7dfa9f2e56	Add crc32 instruction and intrinsics. Add a new class of prefix bytes for F2 0F 38 and propagate. Add a FIXME for a set of possibilities which correspond to intrinsics already used. New test. llvm-svn: 78508	2009-08-08 21:55:08 +00:00
Jakob Stoklund Olesen	e2dc8a46e9	Add support for READCYCLECOUNTER in Blackfin back-end. llvm-svn: 78506	2009-08-08 21:42:22 +00:00
Jakob Stoklund Olesen	dc6bccbaa6	Don't build illegal ops in DAGCombiner::SimplifyBinOpWithSameOpcodeHands(). Blackfin supports and/or/xor on i32 but not on i16. Teach DAGCombiner::SimplifyBinOpWithSameOpcodeHands to not produce illegal nodes after legalize ops. llvm-svn: 78497	2009-08-08 20:42:17 +00:00
Jakob Stoklund Olesen	ac51533b8a	Simplify RegScavenger::forward a bit more. Verify that early clobber registers and their aliases are not used. All changes to RegsAvailable are now done as a transaction so the order of operands makes no difference. The included test case is from PR4686. It has behaviour that was dependent on the order of operands. llvm-svn: 78465	2009-08-08 13:18:47 +00:00
Anton Korobeynikov	674ffc1e59	Do not generate 32-bit call on win64 when imm does not fit llvm-svn: 78443	2009-08-07 23:59:21 +00:00
Chris Lattner	6eceb7c85d	rename test llvm-svn: 78441	2009-08-07 23:57:30 +00:00
Chris Lattner	ff8d04e815	merge a bunch of tests together into one, convert to filecheck which is more tolerant of whitespace differences. llvm-svn: 78439	2009-08-07 23:56:42 +00:00
Bob Wilson	97262e01d5	Convert more Neon tests to use FileCheck. llvm-svn: 78433	2009-08-07 23:45:02 +00:00
David Goodwin	742db6a6d4	Make NEON single-precision FP support the default for cortex-a8 (again). llvm-svn: 78430	2009-08-07 23:32:33 +00:00
Anton Korobeynikov	23b28cb824	2 more vdup.32 cases llvm-svn: 78419	2009-08-07 22:36:50 +00:00
Evan Cheng	6e130db3b7	Thumb2 32-bit ldm / stm needs .w suffix if submode is ia. llvm-svn: 78410	2009-08-07 21:19:10 +00:00
Evan Cheng	4c3b1ca5a0	Fix support to use NEON for single precision fp math. llvm-svn: 78397	2009-08-07 19:30:41 +00:00
Evan Cheng	b1aeeed03e	Another coalescer bug. When a dead copy is eliminated, transfer the kill to a def of the exact register rather than a super-register. llvm-svn: 78376	2009-08-07 07:14:14 +00:00
Evan Cheng	b972e5633f	It turns out most of the thumb2 instructions are not allowed to touch SP. The semantics of such instructions are unpredictable. We have just been lucky that tests have been passing. This patch takes pain to ensure all the PEI lowering code does the right thing when lowering frame indices, insert code to manipulate stack pointers, etc. It's also custom lowering dynamic stack alloc into pseudo instructions so we can insert the right instructions at scheduling time. This fixes PR4659 and PR4682. llvm-svn: 78361	2009-08-07 00:34:42 +00:00
Dale Johannesen	3a127ce1d8	Add the testcase from PR 4668. This works at the moment, but it's a fragile area. llvm-svn: 78358	2009-08-07 00:04:42 +00:00
Dale Johannesen	15a5fad94b	Fix PR 4626, a crash in branch folding after OptimizeBlock produced a CFG it wasn't prepared for. llvm-svn: 78351	2009-08-06 22:56:40 +00:00
Bob Wilson	0127031c20	Implement Neon VST[234] operations. llvm-svn: 78330	2009-08-06 18:47:44 +00:00
Bob Wilson	e3ec5b6d76	Fix incorrect intrinsic declarations. llvm-svn: 78329	2009-08-06 18:46:26 +00:00
Dan Gohman	b4764e5b7f	Tidy up this testcase. llvm-svn: 78322	2009-08-06 17:11:55 +00:00
Chris Lattner	a7e2662770	reduce testcase. llvm-svn: 78315	2009-08-06 16:14:33 +00:00
Dan Gohman	ee902509a8	Remove an over-aggressive assert. Functions with empty struct return types don't have any return values, from CodeGen's perspective. This fixes PR4688. llvm-svn: 78311	2009-08-06 15:07:58 +00:00
Anton Korobeynikov	644caa0cdb	Add tests for X86-64 code model handling. Small and kernel for now. llvm-svn: 78300	2009-08-06 12:25:20 +00:00
Dan Gohman	130e2c7aed	Fix a bug in x86's PreprocessForRMW logic that was exposed by aggressive chain operand optimization. UpdateNodeOperands does not modify the node in place if it would result in a node identical to an existing node. llvm-svn: 78297	2009-08-06 09:22:57 +00:00
Dan Gohman	5758e1e92a	Fix a few places in DAGCombiner that were creating all-ones-bits and high-bits values in ways that weren't correct for integer types wider than 64 bits. This fixes a miscompile in PPMacroExpansion.cpp in clang on x86-64. llvm-svn: 78295	2009-08-06 09:18:59 +00:00
Chris Lattner	87754848ab	this passes. llvm-svn: 78281	2009-08-06 03:55:49 +00:00
Sanjiv Gupta	aee88e46c1	XFAIL it while it is being worked on. llvm-svn: 78275	2009-08-06 02:19:20 +00:00
Bob Wilson	3389c2f7d0	Add tests for new NEON vld instructions. llvm-svn: 78264	2009-08-06 00:38:31 +00:00
Bob Wilson	dd611f44cb	Convert more Neon tests to FileCheck. llvm-svn: 78261	2009-08-05 23:51:20 +00:00
Anton Korobeynikov	22ef75155e	Missed pieces for ARM HardFP ABI. Patch by Sandeep Patel! llvm-svn: 78225	2009-08-05 19:04:42 +00:00
Dan Gohman	df7ea32af7	Enable the new no-SP register classes by default. This is to address PR4572. A few tests have some minor code regressions due to different coalescing. llvm-svn: 78217	2009-08-05 17:40:24 +00:00
Anton Korobeynikov	f6e25b3039	Add testcases for reg-mem arithemtics added recently llvm-svn: 78214	2009-08-05 17:04:32 +00:00
Anton Korobeynikov	be47ccffef	Convert bswap test to filecheck, add more test entries & convert stuff to filecheck llvm-svn: 78212	2009-08-05 16:50:53 +00:00
Dan Gohman	477fd55c9a	Fix a bug in the PIC16 backend. llvm-svn: 78211	2009-08-05 16:46:43 +00:00
Dan Gohman	2bebfc38af	Change these tests to use function attributes rather than special llc command-line options. llvm-svn: 78204	2009-08-05 16:37:27 +00:00
Chris Lattner	abde7f9d27	checking in broken testcases is not such a good idea. llvm-svn: 78201	2009-08-05 16:04:18 +00:00
Sanjiv Gupta	63c5ede173	Quite a few tests crashed in llc after 78142. This is just one of them. I hope to add a few more. llvm-svn: 78198	2009-08-05 15:52:14 +00:00
Evan Cheng	ea2b82b8fc	Disable stack coloring with register for now. It's not able to set kill markers. llvm-svn: 78179	2009-08-05 07:26:17 +00:00
Evan Cheng	a2ce665f60	Another nasty coalescer bug (is there another kind): After coalescing reg1027's def and kill are both at the same point: %reg1027,0.000000e+00 = [56,814:0) 0@70-(814) bb5: 60 %reg1027<def> = t2MOVr %reg1027, 14, %reg0, %reg0 68 %reg1027<def> = t2LDRi12 %reg1027<kill>, 8, 14, %reg0 76 t2CMPzri %reg1038<kill,undef>, 0, 14, %reg0, %CPSR<imp-def> 84 %reg1027<def> = t2MOVr %reg1027, 14, %reg0, %reg0 96 t2Bcc mbb<bb5,0x2030910>, 1, %CPSR<kill> Do not remove the kill marker on t2LDRi12. llvm-svn: 78178	2009-08-05 07:05:41 +00:00
Dan Gohman	e32c0fe584	Revert changes accidentally committed along with r78163. llvm-svn: 78165	2009-08-05 05:38:13 +00:00
Dan Gohman	8c79569853	Teach X86FastISel how to handle CCValAssign::BCvt, which is used for MMX arguments. This fixes PR4684. llvm-svn: 78163	2009-08-05 05:33:42 +00:00
Evan Cheng	379429200e	Turn some insert_subreg, extract_subreg, subreg_to_reg into implicit_defs. llvm-svn: 78151	2009-08-05 03:53:14 +00:00
Evan Cheng	1f7b549c79	One more. Transfer kill of the larger register when lowering an EXTRACT_SUBREG. llvm-svn: 78145	2009-08-05 02:25:11 +00:00
Evan Cheng	6376367356	One more place where subreg lowering forgot to transfer undefness. llvm-svn: 78144	2009-08-05 01:57:22 +00:00
Dan Gohman	f9bbcd1afd	Major calling convention code refactoring. Instead of awkwardly encoding calling-convention information with ISD::CALL, ISD::FORMAL_ARGUMENTS, ISD::RET, and ISD::ARG_FLAGS nodes, TargetLowering provides three virtual functions for targets to override: LowerFormalArguments, LowerCall, and LowerRet, which replace the custom lowering done on the special nodes. They provide the same information, but in a more immediately usable format. This also reworks much of the target-independent tail call logic. The decision of whether or not to perform a tail call is now cleanly split between target-independent portions, and the target dependent portion in IsEligibleForTailCallOptimization. This also synchronizes all in-tree targets, to help enable future refactoring and feature work. llvm-svn: 78142	2009-08-05 01:29:28 +00:00
Evan Cheng	cdb125ce66	If the insert_subreg source is <undef>, insert an implicit_def instead of a copy. llvm-svn: 78141	2009-08-05 01:29:24 +00:00
Evan Cheng	7cc6aca1e6	Fix part 1 of pr4682. PICADD is a 16-bit instruction even in thumb2 mode. llvm-svn: 78126	2009-08-04 23:47:55 +00:00
Evan Cheng	28c2d9809d	Fix test. llvm-svn: 78113	2009-08-04 22:22:58 +00:00
Bob Wilson	6092c8d231	Convert more Neon tests to use FileCheck. llvm-svn: 78111	2009-08-04 22:01:41 +00:00
Bob Wilson	2b60721464	Convert a few Neon tests to use FileCheck. llvm-svn: 78108	2009-08-04 21:33:22 +00:00
Jakob Stoklund Olesen	0e0b5405f5	Clean up the handling of two-address operands in RegScavenger. This fixes PR4528. llvm-svn: 78107	2009-08-04 21:30:30 +00:00
Evan Cheng	783b65b546	Enable load / store multiple pass for Thumb2. It's not using ldrd / strd yet. llvm-svn: 78104	2009-08-04 21:12:13 +00:00
David Goodwin	30bf625ac2	Add NEON single-precision FP support for fabs and fneg. llvm-svn: 78101	2009-08-04 20:39:05 +00:00
Jakob Stoklund Olesen	6304369c4e	LowerSubregsInstructionPass::LowerExtract should not extend the live range of registers. When LowerExtract eliminates an EXTRACT_SUBREG with a kill flag, it moves the kill flag to the place where the sub-register is killed. This can accidentally overlap with the use of a sibling sub-register, and we have trouble. In the test case we have this code: Live Ins: %R0 %R1 %R2 %R2L<def> = EXTRACT_SUBREG %R2<kill>, 1 %R2H<def> = LOAD16fi <fi#-1>, 0, Mem:LD(2,4) [FixedStack-1 + 0] %R1L<def> = EXTRACT_SUBREG %R1<kill>, 1 %R0L<def> = EXTRACT_SUBREG %R0<kill>, 1 %R0H<def> = ADD16 %R2H<kill>, %R2L<kill>, %AZ<imp-def>, %AN<imp-def>, %AC0<imp-def>, %V<imp-def>, %VS<imp-def> subreg: CONVERTING: %R2L<def> = EXTRACT_SUBREG %R2<kill>, 1 subreg: eliminated! subreg: killed here: %R0H<def> = ADD16 %R2H, %R2L, %R2<imp-use,kill>, %AZ<imp-def>, %AN<imp-def>, %AC0<imp-def>, %V<imp-def>, %VS<imp-def> The kill flag on %R2 is moved to the last instruction, and the live range overlaps with the definition of %R2H: * Bad machine code: Redefining a live physical register * - function: f - basic block: 0x18358c0 (#0) - instruction: %R2H<def> = LOAD16fi <fi#-1>, 0, Mem:LD(2,4) [FixedStack-1 + 0] Register R2H was defined but already live. The fix is to replace EXTRACT_SUBREG with IMPLICIT_DEF instead of eliminating it completely: subreg: CONVERTING: %R2L<def> = EXTRACT_SUBREG %R2<kill>, 1 subreg: replace by: %R2L<def> = IMPLICIT_DEF %R2<kill> Note that these IMPLICIT_DEF instructions survive to the asm output. It is necessary to fix the stack-color-with-reg test case because of that. llvm-svn: 78093	2009-08-04 20:01:11 +00:00
Evan Cheng	a3abe2a7ce	In thumb mode, r7 is used as frame register. This fixes pr4681. llvm-svn: 78086	2009-08-04 18:46:17 +00:00
David Goodwin	a3839bc6c0	Match common pattern for FNMAC. Add NEON SP support. llvm-svn: 78085	2009-08-04 18:44:29 +00:00
David Goodwin	a2824d5700	Improve tests. llvm-svn: 78083	2009-08-04 18:11:59 +00:00
David Goodwin	3b9c52c5c1	Initial support for single-precision FP using NEON. Added "neonfp" attribute to enable. Added patterns for some binary FP operations. llvm-svn: 78081	2009-08-04 17:53:06 +00:00
Evan Cheng	206ee96bd6	Fix PR4528. This scavenger assertion is too strict. The two-address value is killed by another operand. There is probably a better fix. Either 1) scavenger can look at other operands, or 2) livevariables can be smarter about kill markers. Patches welcome. llvm-svn: 78072	2009-08-04 16:52:44 +00:00
Chris Lattner	f222054df7	enhance codegen to put 16-bit character strings into the __TEXT,__ustring section on darwin. llvm-svn: 78068	2009-08-04 16:27:13 +00:00
Chris Lattner	81bbf443fe	Add support emiting for 2/4 byte mergable strings to the ".rodata.str*" section on ELF targets. llvm-svn: 78066	2009-08-04 16:13:09 +00:00
Evan Cheng	03eb0e3c33	Emit sub r, #c instead of transforming it to add r, #-c if c fits in 8-bit. This is a bit of pre-mature optimization. 8-bit variant makes it likely it will be narrowed to a 16-bit instruction. llvm-svn: 78030	2009-08-04 01:41:15 +00:00
Bob Wilson	f307e0bd6d	Lower CONCAT_VECTOR during legalization instead of matching it during isel. Add a testcase. llvm-svn: 77992	2009-08-03 20:36:38 +00:00
Jakob Stoklund Olesen	5d8ace0902	Fix Bug 4657: register scavenger asserts with subreg lowering When LowerSubregsInstructionPass::LowerInsert eliminates an INSERT_SUBREG instriction because it is an identity copy, make sure that the same registers are alive before and after the elimination. When the super-register is marked <undef> this requires inserting an IMPLICIT_DEF instruction to make sure the super register is live. Fix a related bug where a kill flag on the inserted sub-register was not transferred properly. Finally, clear the undef flag in MachineInstr::addRegisterKilled. Undef implies dead and kill implies live, so they cant both be valid. llvm-svn: 77989	2009-08-03 20:08:18 +00:00
Evan Cheng	093e124256	Fix a coaelescer bug. If a copy val# is extended to eliminate a non-trivially coalesced copy, and the copy kills its source register. Trim the source register's live range to the last use if possible. This fixes up kill marker to make the scavenger happy. llvm-svn: 77967	2009-08-03 08:41:59 +00:00
Anton Korobeynikov	71386e08fe	Unbreak Win64 CC. Step one: honour register save area, fix some alignment and provide a different set of call-clobberred registers. llvm-svn: 77962	2009-08-03 08:12:53 +00:00
Rafael Espindola	70e9816624	Use movd instead of movq llvm-svn: 77956	2009-08-03 05:21:05 +00:00
Daniel Dunbar	0f16ea5c30	Pass target triple string in to TargetMachine constructor. This is not just a matter of passing in the target triple from the module; currently backends are making decisions based on the build and host architecture. The goal is to migrate to making these decisions based off of the triple (in conjunction with the feature string). Thus most clients pass in the target triple, or the host triple if that is empty. This has one important change in the way behavior of the JIT and llc. For the JIT, it was previously selecting the Target based on the host (naturally), but it was setting the target machine features based on the triple from the module. Now it is setting the target machine features based on the triple of the host. For LLC, -march was previously only used to select the target, the target machine features were initialized from the module's triple (which may have been empty). Now the target triple is taken from the module, or the host's triple is used if that is empty. Then the triple is adjusted to match -march. The take away is that -march for llc is now used in conjunction with the host triple to initialize the subtarget. If users want more deterministic behavior from llc, they should use -mtriple, or set the triple in the input module. llvm-svn: 77946	2009-08-03 04:03:51 +00:00
Rafael Espindola	18ba271a79	Use movq to move 64 bits in and out of mmx registers. Fixes PR4669 llvm-svn: 77940	2009-08-03 02:45:34 +00:00
Evan Cheng	8b9deebba3	Use the i12 variant of load / store opcodes if offset is zero. Now we pass all of multisource as well. llvm-svn: 77939	2009-08-03 02:38:06 +00:00
Richard Osborne	bbb772ace9	Add extra SEXT pattern. llvm-svn: 77920	2009-08-02 22:45:24 +00:00
Jakob Stoklund Olesen	7dc3b72685	Remove unneeded intrinsics from Blackfin backend. __builtin_bfin_ones does the same as ctpop, so it can be implemented in the front-end. __builtin_bfin_loadbytes loads from an unaligned pointer with the disalignexcpt instruction. It does the same as loading from a pointer with the low bits masked. It is better if the front-end creates a masked load. We can always instruction select the masked to disalignexcpt+load. We keep csync/ssync/idle. These intrinsics represent instructions that need workarounds for some silicon revisions. We may even want to convert inline assembler to intrinsics to enable the workarounds. llvm-svn: 77917	2009-08-02 21:49:05 +00:00
Jakob Stoklund Olesen	185eb035e9	Fix issue in regscavenger when scavenging a callee-saved register that has not been spilled. llvm-svn: 77912	2009-08-02 20:29:41 +00:00
Jakob Stoklund Olesen	c59cd9bcd0	Never add a kill flag to a constrained physical register in a two-addr instruction. llvm-svn: 77906	2009-08-02 19:13:03 +00:00
Jakob Stoklund Olesen	5d52bfbbc9	Scavenger asserts. Allow imp-def and imp-use of anything in the scavenger asserts, just like the machine code verifier. Allow redefinition of a sub-register of a live register. llvm-svn: 77904	2009-08-02 18:28:41 +00:00
Jakob Stoklund Olesen	2a21149b20	Add some basic blackfin intrinsics. llvm-svn: 77903	2009-08-02 18:28:11 +00:00
Jakob Stoklund Olesen	b052972a58	Inline assembly support for Blackfin. We use the same constraints as GCC, including those that are slightly insane for inline assembler. llvm-svn: 77899	2009-08-02 17:39:17 +00:00
Jakob Stoklund Olesen	552d8d6618	Analog Devices Blackfin back-end. Generate code for the Blackfin family of DSPs from Analog Devices: http://www.analog.com/en/embedded-processing-dsp/blackfin/processors/index.html We aim to be compatible with the exsisting GNU toolchain found at: http://blackfin.uclinux.org/gf/project/toolchain The back-end is experimental. llvm-svn: 77897	2009-08-02 17:32:10 +00:00
Evan Cheng	8e3889f12e	Test both darwin and linux. llvm-svn: 77852	2009-08-02 02:54:34 +00:00
Chris Lattner	c4d6f83f20	switch to filecheck format llvm-svn: 77841	2009-08-02 00:32:26 +00:00
Chris Lattner	b4b1012d29	fix a problem Eli noticed where we would compile the attached ptrtoint to: .quad X even on a 32-bit system, where X is not 64-bits. There isn't much that we can do here, so we just print: .quad ((X) & 4294967295) instead. llvm-svn: 77818	2009-08-01 22:25:12 +00:00
Dan Gohman	9023fd2b2a	Add nounwind to this test. llvm-svn: 77792	2009-08-01 19:11:04 +00:00
Eli Friedman	f165160724	Hack to make this test work on platforms which aren't Macs. Fixing this myself because I'm getting tired of seeing the red buildbots, which have been red since 5:30PM PDT last night. Proposed supplement to developer policy: committers should make sure to be around to watch for buildbot failures after committing. llvm-svn: 77785	2009-08-01 16:37:18 +00:00
Evan Cheng	e64f48ba8b	Workaround a couple of Darwin assembler bugs. llvm-svn: 77781	2009-08-01 06:13:52 +00:00
Evan Cheng	e6e8289d72	Split t2MOVCCs since some assemblers do not recognize mov shifted register alias with predicate. llvm-svn: 77764	2009-08-01 01:43:45 +00:00
Evan Cheng	6ab54fdb0a	Fix Thumb2 function call isel. Thumb1 and Thumb2 should share the same instructions for calls since BL and BLX are always 32-bit long and BX is always 16-bit long. Also, we should be using BLX to call external function stubs. llvm-svn: 77756	2009-08-01 00:16:10 +00:00
David Greene	81bcae5fda	Simplify operand padding by keying off tabs in the asm stream. If padding is disabled, tabs get replaced by spaces except in the case of the first operand, where the tab is output to line up the operands after the mnemonics. Add some better comments and eliminate redundant code. Fix some testcases to not assume tabs. llvm-svn: 77740	2009-07-31 21:57:10 +00:00
Chris Lattner	4d2c0f9008	switch off of 'Section' onto MCSection. We're not properly using MCSection subclasses yet, but this is a step in the right direction. llvm-svn: 77708	2009-07-31 18:48:30 +00:00
Evan Cheng	be8422e8e0	Until we have a "ALIGN" pseudo instruction, have asm printer emitted a .align to ensure the instruction that follows a TBB (when the number of table entries is odd) is 2-byte aligned. Patch by Sandeep Patel. llvm-svn: 77705	2009-07-31 18:35:56 +00:00
Chris Lattner	fc0264a38e	fix PR4650: we only track sizes for certain objects, so only put something into the mergable section if it is one of our special cases. This could obviously be improved, but this is the minimal fix and restores us to the previous behavior. llvm-svn: 77679	2009-07-31 16:17:13 +00:00
Evan Cheng	5811ab5cf3	When fp is not eliminated, instructions with T2_i12 modes will be changed to T2_i8 ones. Take that into consideration when determining stack size limit for reserving register scavenging slot. llvm-svn: 77642	2009-07-30 23:29:25 +00:00
David Goodwin	0bfc8312c2	Darwin assembler now recognizes "orn", so remove workaround. llvm-svn: 77627	2009-07-30 21:51:41 +00:00
David Goodwin	ce774e2383	Darwin assembler now supports "rrx", so remove workaround. llvm-svn: 77625	2009-07-30 21:38:40 +00:00
David Goodwin	79c079b478	Cleanup and include code selection for some frame index cases. llvm-svn: 77622	2009-07-30 18:56:48 +00:00
Evan Cheng	e62288fdd4	Optimize some common usage patterns of atomic built-ins __sync_add_and_fetch() and __sync_sub_and_fetch. When the return value is not used (i.e. only care about the value in the memory), x86 does not have to use add to implement these. Instead, it can use add, sub, inc, dec instructions with the "lock" prefix. This is currently implemented using a bit of instruction selection trick. The issue is the target independent pattern produces one output and a chain and we want to map it into one that just output a chain. The current trick is to select it into a merge_values with the first definition being an implicit_def. The proper solution is to add new ISD opcodes for the no-output variant. DAG combiner can then transform the node before it gets to target node selection. Problem #2 is we are adding a whole bunch of x86 atomic instructions when in fact these instructions are identical to the non-lock versions. We need a way to add target specific information to target nodes and have this information carried over to machine instructions. Asm printer (or JIT) can use this information to add the "lock" prefix. llvm-svn: 77582	2009-07-30 08:33:02 +00:00
Dan Gohman	49a6f16b7c	Add a new register class to describe operands that can't be SP, due to x86 encoding restrictions. This is currently off by default because it may cause code quality regressions. This is for PR4572. llvm-svn: 77565	2009-07-30 01:56:29 +00:00
Evan Cheng	e3493a91cc	tbb / tbh instructions only branch forward, not backwards. llvm-svn: 77522	2009-07-29 23:20:20 +00:00
Evan Cheng	1f58eed638	Add VFP3 D registers to the DPR register class. llvm-svn: 77521	2009-07-29 23:03:41 +00:00
Bob Wilson	cf19885a32	Change Neon VLDn intrinsics to return multiple values instead of really wide vectors. Likewise, change VSTn intrinsics to take separate arguments for each vector in a multi-vector struct. Adjust tests accordingly. llvm-svn: 77468	2009-07-29 16:39:22 +00:00
Chris Lattner	c5397abb52	fix PR4584 with a trivial patch now that the pieces are in place. llvm-svn: 77434	2009-07-29 05:20:33 +00:00
Evan Cheng	c6d70ae063	Optimize Thumb2 jumptable to use tbb / tbh when all the offsets fit in byte / halfword. llvm-svn: 77422	2009-07-29 02:18:14 +00:00
Eric Christopher	dce1e4949e	Add a couple more tests for the ptest intrinsics to make sure we're grabbing them all correctly. llvm-svn: 77413	2009-07-29 00:51:15 +00:00
Eric Christopher	f7802a33ce	Add support for gcc __builtin_ia32_ptest{z,c,nzc} intrinsics. Lower to ptest instruction plus setcc. Revamp ptest instruction. Add test. llvm-svn: 77407	2009-07-29 00:28:05 +00:00
Evan Cheng	c8bed03349	In thumb2 mode, add pc is unpredictable. Use add + mov pc instead (that is until more optimization goes in). llvm-svn: 77364	2009-07-28 20:53:24 +00:00
David Goodwin	68bb69d6e3	Remove support for ORN to workaround <rdar://problem/7096522>. llvm-svn: 77363	2009-07-28 20:51:25 +00:00
David Goodwin	865c6298d7	Add workaround for <rdar://problem/7098328>. llvm-svn: 77340	2009-07-28 18:15:38 +00:00
Chris Lattner	ebbbf451c9	fix testcase for previous patch. llvm-svn: 77338	2009-07-28 18:04:18 +00:00
Chris Lattner	513a36b63d	Fix PR4639, a ELF-TLS regression from some of my refactoring. llvm-svn: 77336	2009-07-28 17:57:51 +00:00
David Goodwin	e82862e24e	Add Thumb-2 patterns for ARMsrl_flag and ARMsra_flag. llvm-svn: 77329	2009-07-28 17:06:49 +00:00
Evan Cheng	12da273f90	tADDrSPI doesn't have a predicate operand, but tADDhirr and tADDi3 have. llvm-svn: 77305	2009-07-28 07:38:35 +00:00
Evan Cheng	780748d565	- More refactoring. This gets rid of all of the getOpcode calls. - This change also makes it possible to switch between ARM / Thumb on a per-function basis. - Fixed thumb2 routine which expand reg + arbitrary immediate. It was using using ARM so_imm logic. - Use movw and movt to do reg + imm when profitable. - Other code clean ups and minor optimizations. llvm-svn: 77300	2009-07-28 05:48:47 +00:00
David Goodwin	57b51d9f82	ORN does not require (and can not have) the ".w" suffix. "Orthogonality" is a dirty word at ARM. llvm-svn: 77275	2009-07-27 23:34:12 +00:00
David Goodwin	782f242fd7	Add ".w" suffix for wide thumb-2 instructions. llvm-svn: 77199	2009-07-27 16:31:55 +00:00
Sanjiv Gupta	a77a182b04	Test case to check that separate section is created for a global variable specified with section attribute. llvm-svn: 77195	2009-07-27 16:20:41 +00:00
Chris Lattner	57af4ece60	update testcase. llvm-svn: 77192	2009-07-27 15:52:58 +00:00
Chris Lattner	8e58bc9ed4	put normal data into .data instead of .data.rel on elf systems. llvm-svn: 77116	2009-07-26 03:06:11 +00:00
Chris Lattner	397792d981	finish simplifying DarwinTargetAsmInfo::SelectSectionForGlobal for now. Make the section switching directives more consistent by not including \n and including \t for them all. llvm-svn: 77107	2009-07-26 01:24:18 +00:00
Chris Lattner	5b42b45fb9	simplify DarwinTargetAsmInfo::SelectSectionForGlobal a bit and make it more aggressive, we now put: const int G2 __attribute__((weak)) = 42; into the text (readonly) segment like gcc, previously we put it into the data (readwrite) segment. llvm-svn: 77104	2009-07-26 00:51:36 +00:00
Bob Wilson	8a37bbebfd	Add support for ARM Neon VREV instructions. Patch by Anton Korzh, with some modifications from me. llvm-svn: 77101	2009-07-26 00:39:34 +00:00
Chris Lattner	2de9510572	add the most expedient hack to fix PR4619, along with a testcase. Thanks to Rafael for the great example. llvm-svn: 77083	2009-07-25 17:57:37 +00:00
Evan Cheng	3b5791f982	I've lost my mind. PR4572 has not been fixed. llvm-svn: 77031	2009-07-25 01:11:46 +00:00
Evan Cheng	f3a1fce8ae	Change Thumb2 jumptable codegen to one that uses two level jumps: Before: adr r12, #LJTI3_0_0 ldr pc, [r12, +r0, lsl #2] LJTI3_0_0: .long LBB3_24 .long LBB3_30 .long LBB3_31 .long LBB3_32 After: adr r12, #LJTI3_0_0 add pc, r12, +r0, lsl #2 LJTI3_0_0: b.w LBB3_24 b.w LBB3_30 b.w LBB3_31 b.w LBB3_32 This has several advantages. 1. This will make it easier to optimize this to a TBB / TBH instruction + (smaller) table. 2. This eliminate the need for ugly asm printer hack to force the address into thumb addresses (bit 0 is one). 3. Same codegen for pic and non-pic. 4. This eliminate the need to align the table so constantpool island pass won't have to over-estimate the size. Based on my calculation, the later is probably slightly faster as well since ldr pc with shifter address is very slow. That is, it should be a win as long as the HW implementation can do a reasonable job of branch predict the second branch. llvm-svn: 77024	2009-07-25 00:33:29 +00:00
Evan Cheng	8c8e88bd39	Remove a duplicated test. llvm-svn: 77020	2009-07-25 00:24:40 +00:00
Evan Cheng	01740ab57b	Forgot this test earlier. llvm-svn: 77007	2009-07-24 22:42:45 +00:00
Evan Cheng	aee0e1f48c	Fix these tests. llvm-svn: 77006	2009-07-24 22:42:22 +00:00
Eric Christopher	fae639c9ad	Move insertps tests to sse41 combo test file, convert to filecheck format and add an extract/insert test. llvm-svn: 76994	2009-07-24 19:24:26 +00:00
Evan Cheng	3990850a7d	Convert a test to FileCheck. llvm-svn: 76954	2009-07-24 06:01:46 +00:00
Chris Lattner	26aff56462	Remove SectionKind::Small*. This was only used on mips, and is apparently a sad mistake that is regretted. :) llvm-svn: 76935	2009-07-24 03:11:51 +00:00
Richard Osborne	fc39e417a8	Add tests for handling of globals and tls on the XCore. These currently fail but pass when run against r76652. llvm-svn: 76923	2009-07-24 00:38:20 +00:00
Dan Gohman	17151155ed	Remove the IA-64 backend. llvm-svn: 76920	2009-07-24 00:30:09 +00:00
Evan Cheng	dc99f07113	Thumb2 does not allow the use of "pc" register as part of the load / store address. llvm-svn: 76909	2009-07-23 23:09:51 +00:00
Evan Cheng	d2919a1773	Fix up ARM constant island pass for Thumb2. Also fixed up code to fully use the SoImm field for ADR on ARM mode. llvm-svn: 76890	2009-07-23 18:27:47 +00:00
Chris Lattner	dc13b7c637	merge one more sse41 test into sse41.ll llvm-svn: 76853	2009-07-23 04:49:39 +00:00
Chris Lattner	70d5783535	merge another sse41 test into sse41.ll llvm-svn: 76852	2009-07-23 04:43:48 +00:00
Chris Lattner	08fc6e6e40	merge sse41-pmovx.ll into sse41.ll llvm-svn: 76850	2009-07-23 04:39:09 +00:00
Chris Lattner	b9cdd3153c	change a test to run in filecheck style. Rename it to be a general dumping ground of various SSE4.1 tests, since filecheck can reasonably handle them all in one file. Generalize it to check x86-64 stuff as well since it has a different ABI (a convenient way to test both the reg and mem forms of these instructions). llvm-svn: 76848	2009-07-23 04:33:02 +00:00
Eric Christopher	b1b77ca862	Support insertps via the intrinsic and add a couple of simple testcases to make sure it's being generated. llvm-svn: 76843	2009-07-23 02:22:41 +00:00
Eric Christopher	327cb795a1	Add test for pinsrd and pinsrb instructions. llvm-svn: 76840	2009-07-23 01:58:04 +00:00
Dan Gohman	b215100c7c	Revert r75663 (and r76805), as it is causing regressions on powerpc. llvm-svn: 76823	2009-07-23 00:09:46 +00:00
Dan Gohman	824ab40381	x86 isel tweak: use lea (%reg,%reg) instead of lea (,%reg,2). llvm-svn: 76817	2009-07-22 23:26:55 +00:00
Dan Gohman	cdbef5f2c0	Add -march=ppc32 lines so that this test doesn't ever default to ppc64. llvm-svn: 76805	2009-07-22 22:08:31 +00:00
Evan Cheng	e270d4a4dd	Use getTargetConstant instead of getConstant since it's meant as an constant operand. llvm-svn: 76803	2009-07-22 22:03:29 +00:00
Dan Gohman	c510293251	Make the grep line in this test more specific, to avoid unintended matches. llvm-svn: 76802	2009-07-22 22:02:42 +00:00
Evan Cheng	d2d52d1906	Ignore undef uses. llvm-svn: 76799	2009-07-22 21:51:42 +00:00
Duncan Sands	0cf7f5d6d2	Revert commit 76707, it was breaking the llvm-gcc build on linux platforms. The binutils assembler does not recognize the "s" flag, see for example http://sourceware.org/binutils/docs/as/Section.html llvm-svn: 76733	2009-07-22 10:35:05 +00:00
Chris Lattner	8ebaec6b27	set the ELF "small" flag on objects that end up in .rodata.cst4 consistently, updating a mips testcase to expect it. llvm-svn: 76707	2009-07-22 00:41:56 +00:00
Evan Cheng	332a6590ae	Remove a big test case. llvm-svn: 76669	2009-07-21 22:52:04 +00:00
Evan Cheng	38e88cb53f	Do not select tSXTB / tSXTH in thumb2 mode. llvm-svn: 76600	2009-07-21 18:15:26 +00:00
Chris Lattner	8e55200089	convert this test to filecheck format, which is faster and avoids false matches of "st" -> "stdin" llvm-svn: 76591	2009-07-21 17:36:24 +00:00
Chris Lattner	b61f9c8c8d	add a testcase for the pic16 section handling stuff. llvm-svn: 76579	2009-07-21 16:48:20 +00:00
Evan Cheng	07a6ac6b29	Another rewriter bug exposed by recent coalescer changes. ReuseInfo::GetRegForReload() should make sure the "switched" register is in the desired register class. I'm surprised this hasn't caused more failures in the past. llvm-svn: 76558	2009-07-21 09:15:00 +00:00
Chris Lattner	83423aa276	remove a very large testcase for now. llvm-svn: 76537	2009-07-21 06:28:36 +00:00
Evan Cheng	a7bb55ebb6	Fix a dagga combiner bug: avoid creating illegal constant. Is this really a winning transformation? fold (shl (srl x, c1), c2) -> (shl (and x, (shl -1, c1)), (sub c2, c1)) or (srl (and x, (shl -1, c1)), (sub c1, c2)) llvm-svn: 76535	2009-07-21 05:40:15 +00:00
Evan Cheng	0d8b0cf3b8	Fix ARM isle code that optimize multiply by constants which are power-of-2 +/- 1. llvm-svn: 76520	2009-07-21 00:31:12 +00:00
Evan Cheng	9a47392f2e	Cross RC coalescing is now on by default. llvm-svn: 76519	2009-07-21 00:22:59 +00:00
David Greene	40c68ad3bb	Re-apply 75490, 75806 and 76177 with fixes and tests. Efficiency comes next. llvm-svn: 76486	2009-07-20 22:02:59 +00:00
Evan Cheng	a2b8c3f98f	Forgot this test earlier. llvm-svn: 76485	2009-07-20 21:46:42 +00:00
Evan Cheng	57106d6dc0	Use TII->findCommutedOpIndices to find the commute operands (rather than guessing). llvm-svn: 76472	2009-07-20 21:16:08 +00:00
Evan Cheng	027d9f93ea	Fix some sub-reg coalescing bugs where the coalescer wasn't updating the resulting interval's register class. llvm-svn: 76458	2009-07-20 19:47:55 +00:00
Dan Gohman	33a3fd0b9c	Revert the addition of hasNoPointerOverflow to GEPOperator. Getelementptrs that are defined to wrap are virtually useless to optimization, and getelementptrs that are undefined on any kind of overflow are too restrictive -- it's difficult to ensure that all intermediate addresses are within bounds. I'm going to take a different approach. Remove a few optimizations that depended on this flag. llvm-svn: 76437	2009-07-20 17:43:30 +00:00
Chris Lattner	58f9bb2ccd	implement a new magic global "llvm.compiler.used" which is like llvm.used, but doesn't cause ".no_dead_strip" to be emitted on darwin. llvm-svn: 76399	2009-07-20 06:14:25 +00:00
Evan Cheng	4e4eb0b00c	Restore AsmWriterEmitter.cpp back to 74742. The recent changes broke Thumb. llvm-svn: 76398	2009-07-20 06:10:07 +00:00
Jakob Stoklund Olesen	aba695c7d0	Fix http://llvm.org/bugs/show_bug.cgi?id=4583 Inline asm instructions may have additional <imp-def,kill> register operands. These operands are not marked with a flag like the normal asm operands, so we must not assert that there is a flag. llvm-svn: 76373	2009-07-19 19:09:59 +00:00
Evan Cheng	090db9b7a9	Catch more coalescing opportunities. llvm-svn: 76282	2009-07-18 04:52:23 +00:00
Evan Cheng	e20cbf3068	Enable cross register class coalescing. llvm-svn: 76281	2009-07-18 02:10:10 +00:00
Evan Cheng	a776067d3f	Fix pr4552. Stack slot coloring with register must take care not to generate illegal ams. llvm-svn: 76258	2009-07-17 22:42:51 +00:00
Evan Cheng	18fe458103	Fix x86 inline ams 'q' constraint support. In 32-bit mode, it's just like 'Q', i.e. EAX, EDX, ECX, EBX. In 64-bit mode, it just means all the i64r registers. Yeah, that makes sense. llvm-svn: 76248	2009-07-17 22:13:25 +00:00
Chris Lattner	52d436e98b	rename test. llvm-svn: 76197	2009-07-17 18:05:55 +00:00
Eli Friedman	97f3f965eb	Make promotion in operation legalization for SETCC work correctly. llvm-svn: 76153	2009-07-17 05:16:04 +00:00
Anton Korobeynikov	c5df7e2dc1	Emit cross regclass register moves for thumb2. Minor code duplication cleanup. llvm-svn: 76124	2009-07-16 23:26:06 +00:00
Dale Johannesen	c4148c4ec7	Assume an inline asm might be a call, so we get stack alignment right when it is. This is not ideal but conservatively correct. Adjust a test to compensate for changed stack offset value. gcc.apple/asm-block-57.c llvm-svn: 76120	2009-07-16 22:34:45 +00:00
Jakob Stoklund Olesen	070fab8a1f	Teach MachineInstr::isRegTiedToDefOperand() to correctly parse inline asm operands. The inline asm operands must be parsed from the first flag, you cannot assume that an immediate operand preceeding a register use operand is the flag. PowerPC "m" operands are represented as (flag, imm, reg) triples. isRegTiedToDefOperand() would incorrectly interpret the imm as the flag. llvm-svn: 76101	2009-07-16 20:58:34 +00:00
Evan Cheng	357645efad	Changed my mind. We now allow remat of instructions whose defs have subreg indices. llvm-svn: 76100	2009-07-16 20:15:00 +00:00
Evan Cheng	fdd0eb4011	With recent MC changes, RIP base register is explicitly modeled. Make sure we add it when x86 V_SET0 / V_SETALLONES (by transforming it into a constpool load) into the use instruction. llvm-svn: 76094	2009-07-16 18:44:05 +00:00
Anton Korobeynikov	77a50bd3a8	Make xfail proper llvm-svn: 76065	2009-07-16 14:53:47 +00:00
Anton Korobeynikov	73fcd3d962	Temporary disable 16 bit bswap llvm-svn: 76063	2009-07-16 14:35:57 +00:00
Anton Korobeynikov	902facfe96	Add bswap patterns llvm-svn: 76061	2009-07-16 14:34:52 +00:00
Anton Korobeynikov	3ae30e08ef	Fix logic inversion for RI-mode address selection llvm-svn: 76052	2009-07-16 14:31:14 +00:00
Anton Korobeynikov	6c2c47ecb2	Unbreak the test llvm-svn: 76051	2009-07-16 14:30:49 +00:00
Anton Korobeynikov	4121039bef	Expand 32-bit bitconverts via memory llvm-svn: 76050	2009-07-16 14:30:29 +00:00
Anton Korobeynikov	bc2ead6ea3	Fix incomin arg stack frame offset in case we need to generate stack frame llvm-svn: 76049	2009-07-16 14:29:57 +00:00
Anton Korobeynikov	bd41c83ab0	Revert the commit, it just hides the real bug llvm-svn: 76045	2009-07-16 14:28:26 +00:00
Anton Korobeynikov	2acdac0f8e	Lower anyext to zext, 32-bit stuff does not have any implicit zero-extension side effects llvm-svn: 76035	2009-07-16 14:24:41 +00:00
Anton Korobeynikov	b25949b0f5	Provide consistent subreg idx scheme. This (hopefully) fixes remaining divide problems llvm-svn: 76011	2009-07-16 14:18:17 +00:00
Anton Korobeynikov	091872cb37	Implement 'large' PIC model llvm-svn: 76006	2009-07-16 14:16:05 +00:00
Anton Korobeynikov	569a94c4d0	Implement shifts properly (hopefilly - finally!) llvm-svn: 76005	2009-07-16 14:15:24 +00:00
Anton Korobeynikov	fe8df8ff61	Properly handle divides. As a bonus - implement memory versions of them. llvm-svn: 76003	2009-07-16 14:14:33 +00:00
Anton Korobeynikov	34ad780d0d	32 bit shifts have only 12 bit displacements llvm-svn: 76000	2009-07-16 14:13:24 +00:00
Anton Korobeynikov	1eb6262b4b	Consolidate reg-imm / reg-reg-imm address mode selection logic in one place. llvm-svn: 75990	2009-07-16 14:10:17 +00:00
Anton Korobeynikov	62f8515b1c	Add support for 12 bit displacements llvm-svn: 75988	2009-07-16 14:09:35 +00:00
Anton Korobeynikov	43d33bd6d2	Emit proper lowering of load from arg stack slot llvm-svn: 75986	2009-07-16 14:08:42 +00:00
Anton Korobeynikov	a8197bb651	Implement dynamic allocas llvm-svn: 75985	2009-07-16 14:08:15 +00:00
Anton Korobeynikov	7193e2670e	Add jump tables llvm-svn: 75984	2009-07-16 14:07:50 +00:00
Anton Korobeynikov	2ff298fad0	Add rotates llvm-svn: 75981	2009-07-16 14:06:49 +00:00
Anton Korobeynikov	9362d9aa76	Add patterns for integer negate llvm-svn: 75980	2009-07-16 14:06:27 +00:00
Anton Korobeynikov	f07c7941f0	Provide proper patterns for and with imm instructions. Tune the tests accordingly. llvm-svn: 75979	2009-07-16 14:06:00 +00:00
Anton Korobeynikov	59049d9176	Add 32 bit and reg-imm and disable invalid patterns for now llvm-svn: 75978	2009-07-16 14:05:32 +00:00
Anton Korobeynikov	2d218394c6	Add z9 and z10 target processors. Mark z10-only instructions as such. llvm-svn: 75977	2009-07-16 14:05:00 +00:00
Anton Korobeynikov	d568f6dce2	Proper lower 'small' results llvm-svn: 75962	2009-07-16 13:58:24 +00:00
Anton Korobeynikov	f1bf3176c6	Completel forgot about unconditional branches llvm-svn: 75961	2009-07-16 13:57:52 +00:00
Anton Korobeynikov	15d6e8785b	Lower addresses of globals llvm-svn: 75960	2009-07-16 13:57:27 +00:00
Anton Korobeynikov	a442cdfb04	Test (incomplete) for easy muls llvm-svn: 75959	2009-07-16 13:57:03 +00:00
Anton Korobeynikov	f0d7d6ce65	Provide "wide" muls and divs/rems llvm-svn: 75958	2009-07-16 13:56:42 +00:00
Anton Korobeynikov	b04a4fa5c1	Tests for cmp / br_cc / select_cc llvm-svn: 75949	2009-07-16 13:53:15 +00:00
Anton Korobeynikov	8695a30066	Emit callee-saved regs spills / restores llvm-svn: 75943	2009-07-16 13:51:12 +00:00
Anton Korobeynikov	d694b9ff8b	Some preliminary call lowering llvm-svn: 75941	2009-07-16 13:50:21 +00:00
Anton Korobeynikov	018599fc0b	Prologue / epilogue emission llvm-svn: 75940	2009-07-16 13:49:49 +00:00
Anton Korobeynikov	09890bd434	Add simple frame index elimination llvm-svn: 75939	2009-07-16 13:49:25 +00:00
Anton Korobeynikov	5dc5629100	Provide proper test :) llvm-svn: 75938	2009-07-16 13:48:59 +00:00
Anton Korobeynikov	405833dfb6	Add address computation stuff llvm-svn: 75935	2009-07-16 13:47:59 +00:00
Anton Korobeynikov	df99232d27	Add mem-imm stores llvm-svn: 75933	2009-07-16 13:47:14 +00:00
Anton Korobeynikov	44f8bbfb3f	Add stores and truncstores llvm-svn: 75931	2009-07-16 13:45:00 +00:00
Anton Korobeynikov	11b91b4e2e	Add patterns for various extloads llvm-svn: 75930	2009-07-16 13:44:30 +00:00
Anton Korobeynikov	04be818918	Add shifts and reg-imm address matching llvm-svn: 75927	2009-07-16 13:43:18 +00:00
Anton Korobeynikov	cf7ea6a94f	Add bunch of 32-bit patterns... Uffff :) llvm-svn: 75926	2009-07-16 13:42:31 +00:00
Anton Korobeynikov	ebe2de0e14	Add bunch of reg-imm movs llvm-svn: 75921	2009-07-16 13:34:50 +00:00
Anton Korobeynikov	28234bcde2	Provide masked reg-imm 'or' and 'and' llvm-svn: 75919	2009-07-16 13:33:57 +00:00
Anton Korobeynikov	1c4c7823ae	Fix test running lines llvm-svn: 75918	2009-07-16 13:33:21 +00:00
Anton Korobeynikov	0d76b17a78	Add reg-reg and pattern llvm-svn: 75917	2009-07-16 13:32:49 +00:00
Anton Korobeynikov	f9fe4036f2	Add sub reg-reg pattern llvm-svn: 75916	2009-07-16 13:32:16 +00:00
Anton Korobeynikov	a083d7af53	Add xor reg-reg pattern llvm-svn: 75915	2009-07-16 13:31:28 +00:00
Anton Korobeynikov	65096d6a60	Add or reg-reg pattern. llvm-svn: 75914	2009-07-16 13:30:53 +00:00
Anton Korobeynikov	18172d786f	Add add reg-reg and reg-imm patterns llvm-svn: 75913	2009-07-16 13:30:15 +00:00
Anton Korobeynikov	09082fa01a	Add simple reg-reg and reg-imm moves llvm-svn: 75912	2009-07-16 13:29:38 +00:00
Anton Korobeynikov	cf4ba97dba	Minimal lowering for formal_arguments / ret llvm-svn: 75911	2009-07-16 13:28:59 +00:00
Anton Korobeynikov	a3ceeaeda5	Add testsuite dir for systemz stuff llvm-svn: 75910	2009-07-16 13:28:22 +00:00
Richard Osborne	0cceec520c	Combine an unaligned store of unaligned load into a memmove. llvm-svn: 75908	2009-07-16 12:50:48 +00:00
Richard Osborne	bfdc557c8a	Expand unaligned 32 bit loads from an address which is a constant offset from a 32 bit aligned base as follows: ldw low, base[offset >> 2] ldw high, base[(offset >> 2) + 1] shr low_shifted, low, (offset & 0x3) * 8 shl high_shifted, high, 32 - (offset & 0x3) * 8 or result, low_shifted, high_shifted Expand 32 bit loads / stores with 16 bit alignment into two 16 bit loads / stores. llvm-svn: 75902	2009-07-16 10:42:35 +00:00
Richard Osborne	25b33cb035	Custom lower unaligned 32 bit stores and loads into libcalls. This is a big code size win since before they were expanding to upto 16 instructions. llvm-svn: 75901	2009-07-16 10:21:18 +00:00
Evan Cheng	84517443ca	Let callers decide the sub-register index on the def operand of rematerialized instructions. Avoid remat'ing instructions whose def have sub-register indices for now. It's just really really hard to get all the cases right. llvm-svn: 75900	2009-07-16 09:20:10 +00:00
Evan Cheng	43229fb489	ShortenDeadCopySrcLiveRange needs to be more conservative in multi-kill situations. llvm-svn: 75838	2009-07-15 21:39:50 +00:00
Richard Osborne	a8edd048c2	Fix pattern for LD16S_3r, add basic tests to check load / store instructions are being properly selected. llvm-svn: 75797	2009-07-15 17:06:59 +00:00
Richard Osborne	57489b0658	Fix XCoreTargetLowering::isLegalAddressingMode to handle non simple VTs. llvm-svn: 75788	2009-07-15 15:46:56 +00:00
Chris Lattner	55452c2bea	fix an arm codegen bug (the same as PR4482 on ppc) where available_externally symbols were not getting stubs. While I'm at it, add a big testcase for stub generation to make sure I don't break anything. llvm-svn: 75737	2009-07-15 04:12:33 +00:00
Chris Lattner	7d1f9542c2	get the PPC stub temporary label from the mangler instead of using horrible string hacking. This gives us a different label, but it's just an assembler temporary, so the name doesn't matter. llvm-svn: 75733	2009-07-15 02:56:53 +00:00
Chris Lattner	dab248ac95	convert this to filecheck style and make it a test of darwin/PPC's extremely elaborate pic/nopic stubs. llvm-svn: 75726	2009-07-15 01:43:31 +00:00
Chris Lattner	815337abd6	simplify this test to test the esentials. llvm-svn: 75725	2009-07-15 01:32:33 +00:00
Chris Lattner	d7fec20cba	convert to filecheck style, simplify RUN line, and add comment. llvm-svn: 75667	2009-07-14 19:49:11 +00:00
Chris Lattner	109866bf21	convert this test to filecheck style llvm-svn: 75663	2009-07-14 18:57:40 +00:00
Chris Lattner	8c9a96b966	Reapply my previous asmprinter changes now with more testing and two additional bug fixes: 1. The bug that everyone hit was a problem in the asmprinter where it would remove $stub but keep the L prefix on a name when emitting the indirect symbol. This is easy to fix by keeping the name of the stub and the name of the symbol in a StringMap instead of just keeping a StringSet and trying to reconstruct it late. 2. There was a problem printing the personality function. The current logic to print out the personality function from the DWARF information is a bit of a cesspool right now that duplicates a bunch of other logic in the asm printer. The short version of it is that it depends on emitting both the L and _ prefix for symbols (at least on darwin) and until I can untangle it, it is best to switch the mangler back to emitting both prefixes. llvm-svn: 75646	2009-07-14 18:17:16 +00:00
Daniel Dunbar	966932ccb7	Revert r75610 (and r75620, which was blocking the revert), in the hopes of unbreaking llvm-gcc (on Darwin). --- Reverse-merging r75620 into '.': U include/llvm/Support/Mangler.h --- Reverse-merging r75610 into '.': U test/CodeGen/X86/loop-hoist.ll G include/llvm/Support/Mangler.h U lib/Target/X86/AsmPrinter/X86ATTAsmPrinter.cpp U lib/VMCore/Mangler.cpp llvm-svn: 75636	2009-07-14 15:57:55 +00:00
Chris Lattner	774f2a2d51	Change the X86 asmprinter to use the mangler to apply suffixes like "$non_lazy_ptr" to symbols instead of doing it with "printSuffixedName". This gets us to the point where there is a real separation between computing a symbol name and printing it, something I need for MC printer stuff. This patch also fixes a corner case bug where unnamed private globals wouldn't get the private label prefix. Next up, rename all uses of getValueName -> getMangledName for better greppability, and then tackle the ppc/arm backends to eliminate "printSuffixedName". llvm-svn: 75610	2009-07-14 06:04:35 +00:00
Chris Lattner	f34815b32f	Change the internal interface to makeNameProper to take a bool that indicates whether the label is private or not, instead of taking prefix stuff. One effect of this is that symbols will be generated with just the private prefix, instead of both the private prefix and the user-label-prefix, but this doesn't matter as long as it is consistent. For example we'll now get "Lfoo" instead of "L_foo". These are just assembler temporary labels anyway, so they never even make it into the .o file. llvm-svn: 75607	2009-07-14 04:50:12 +00:00
David Goodwin	72b80ac9b1	Fix detection of valid BFC immediates. llvm-svn: 75576	2009-07-14 00:57:56 +00:00
Bill Wendling	e604b776a7	Check for the correct unnamed name. llvm-svn: 75573	2009-07-14 00:53:58 +00:00
Dan Gohman	dbaddda21f	Check in a reduced version of this testcase. llvm-svn: 75544	2009-07-13 23:04:44 +00:00
Chris Lattner	ec8efcb44e	Two changes: 1) unique globals with the existing "Count" local in Mangler, not with atomic nonsense. Using atomics will give us nondeterminstic output from the compiler when using multiple threads, which is bad. 2) Do not mangle an unknown global name with a type suffix. We don't need this anymore now that llvm ir doesn't have type planes. llvm-svn: 75541	2009-07-13 22:48:46 +00:00
Dan Gohman	054d2a7837	Add testcases for PR4538, PR4537, and PR4534. llvm-svn: 75533	2009-07-13 22:30:31 +00:00
Chris Lattner	92ce8381f5	remove tests for removed intrinsics. llvm-svn: 75433	2009-07-12 21:30:06 +00:00
Chris Lattner	f39f55d46c	add nounwind llvm-svn: 75407	2009-07-12 00:46:16 +00:00
Nick Lewycky	d57fb023e0	Darwin prepends an _ to internal globals, Linux doesn't. llvm-svn: 75405	2009-07-11 23:48:59 +00:00
Chris Lattner	38df005e12	fix x86-64 static codegen to materialize the address of a global with movl instead of lea. It is better for code size (and presumably efficiency) to use: movl $foo, %eax rather than: leal foo, eax Both give a nice zero extending "move immediate" instruction, the former is just smaller. Note that global addresses should be handled different by the x86 backend, but I chose to follow the style already in place and add more fixme's. llvm-svn: 75403	2009-07-11 23:17:29 +00:00
Chris Lattner	056dfc6f90	this test was incorrect for x86-64 static. It passed on darwin, because darwin doesn't have static x86-64 mode. llvm-svn: 75392	2009-07-11 22:30:05 +00:00
Chris Lattner	e91900097e	Fix PR4533, which is about buggy codegen in x86-64 -static mode. Basically, using: lea symbol(%rip), %rax is not valid in -static mode, because the current RIP may not be within 32-bits of "symbol" when an app is built partially pic and partially static. The fix for this is to compile it to: lea symbol, %rax It would be better to codegen this as: movq $symbol, %rax but that will come next. The hard part of fixing this bug was fixing abi-isel, which was actively testing for the wrong behavior. Also, the RUN lines are completely impossible to understand what they are testing. To help with this, convert the -static x86-64 codegen tests to use filecheck. This is much more stable and makes it more clear what the codegen is expected to be. llvm-svn: 75382	2009-07-11 20:29:19 +00:00
Chris Lattner	20adc670b2	We get the P modifier wrong in a lot of cases, just add some more rigorous testing. In addition to fixing this, I still need to do some more testing on darwin. llvm-svn: 75362	2009-07-11 08:30:22 +00:00
Evan Cheng	017288a4fc	Don't put IT instruction before conditional branches. llvm-svn: 75361	2009-07-11 07:26:20 +00:00
Evan Cheng	0794c6a083	Smarter isel of ldrsb / ldrsh. Only make use of these when [r,r] address is feasible. llvm-svn: 75360	2009-07-11 07:08:13 +00:00
Evan Cheng	cd4cdd1157	Major changes to Thumb (not Thumb2). Many 16-bit instructions either modifies CPSR when they are outside the IT blocks, or they can predicated when in Thumb2. Move the implicit def of CPSR to an optional def which defaults CPSR. This allows the 's' bit to be toggled dynamically. A side-effect of this change is asm printer is now using unified assembly. There are some minor clean ups and fixes as well. llvm-svn: 75359	2009-07-11 06:43:01 +00:00
Chris Lattner	e3c4765bac	convert test to use FileCheck, which is much more precise and faster than the previous RUN lines. Hopefully this will be an inspiration for future tests :) llvm-svn: 75261	2009-07-10 18:34:47 +00:00
Evan Cheng	0f9cce7951	Add a thumb2 pass to insert IT blocks. llvm-svn: 75218	2009-07-10 01:54:42 +00:00
Evan Cheng	223ac25930	Remove a bogus assertion. llvm-svn: 75206	2009-07-10 00:23:48 +00:00
Bob Wilson	9ce44e2521	Handle 'a' modifier on inline assembly operands. This is part of the fix for pr4521. llvm-svn: 75201	2009-07-09 23:54:51 +00:00
Eli Friedman	2b77eef160	Make EXTRACT_VECTOR_ELT a bit more flexible in terms of the returned value. Adjust other code to deal with that correctly. Make DAGTypeLegalizer::PromoteIntRes_EXTRACT_VECTOR_ELT take advantage of this new flexibility to simplify the code and make it deal with unusual vectors (like <4 x i1>) correctly. Fixes PR3037. llvm-svn: 75176	2009-07-09 22:01:03 +00:00
Evan Cheng	7452c968e4	Targets sometimes assign fixed stack object to spill certain callee-saved registers based on dynamic conditions. For example, X86 EBP/RBP, when used as frame register has to be spilled in the first fixed object. It should inform PEI this so it doesn't get allocated another stack object. Also, it should not be spilled as other callee-saved registers but rather its spilling and restoring are being handled by emitPrologue and emitEpilogue. Avoid spilling it twice. llvm-svn: 75116	2009-07-09 06:53:48 +00:00
Lang Hames	dab7b06de9	Improved tracking of value number kills. VN kills are now represented as an (index,bool) pair. The bool flag records whether the kill is a PHI kill or not. This code will be used to enable splitting of live intervals containing PHI-kills. A slight change to live interval weights introduced an extra spill into lsr-code-insertion (outside the critical sections). The test condition has been updated to reflect this. llvm-svn: 75097	2009-07-09 03:57:02 +00:00
Chris Lattner	44f6bcfa0b	remove eh, convert to FileCheck style llvm-svn: 75087	2009-07-09 01:07:22 +00:00
Chris Lattner	212f44d180	we have no tests for dllimport/export. Add one. llvm-svn: 75085	2009-07-09 00:53:44 +00:00
Chris Lattner	ade55bc8dd	* add some assertions for sanity checking. * remove some old code that was needed when we'd put ESP in the scale instead of the base of some instructions. * Fix a bug with the P modifier in inline asm that caused us to drop it. llvm-svn: 75077	2009-07-09 00:27:29 +00:00
Chris Lattner	da0cf0b134	add a test for dale's recent change. llvm-svn: 75074	2009-07-09 00:00:16 +00:00
Chris Lattner	f65129cb6a	switch test to FileCheck-style and test the P and non-P cases. llvm-svn: 75071	2009-07-08 23:44:06 +00:00
Chris Lattner	334e561e35	rename a test to make it a feature test. llvm-svn: 75070	2009-07-08 23:40:57 +00:00
David Goodwin	22c2fba978	Use common code for both ARM and Thumb-2 instruction and register info. llvm-svn: 75067	2009-07-08 23:10:31 +00:00
Bob Wilson	1d298fd75b	Implement NEON vst1 instruction. llvm-svn: 75037	2009-07-08 20:32:02 +00:00
Chris Lattner	d5ffa8ffb6	add some more check for vector compares. llvm-svn: 75024	2009-07-08 18:51:25 +00:00
Chris Lattner	072198a2a1	convert a test to "FileCheck" style. llvm-svn: 75023	2009-07-08 18:48:24 +00:00
Bob Wilson	f731a2df6b	Implement NEON vld1 instructions. llvm-svn: 75019	2009-07-08 18:11:30 +00:00
David Goodwin	121563c615	Add rev16 test... xfail for now llvm-svn: 75012	2009-07-08 16:15:06 +00:00
David Goodwin	af7451b674	Checkpoint Thumb2 Instr info work. Generalized base code so that it can be shared between ARM and Thumb2. Not yet activated because register information must be generalized first. llvm-svn: 75010	2009-07-08 16:09:28 +00:00
Chris Lattner	04bf64d43c	eliminate the v[if]cmp versions of these tests, now that [if]cmp+sext works. llvm-svn: 74980	2009-07-08 00:49:35 +00:00
Chris Lattner	dc84b31d94	Change these tests to use [fi]cmp+sext instead of v[fi]cmp. No functionality change. llvm-svn: 74979	2009-07-08 00:46:57 +00:00
Chris Lattner	4ac607332d	dag combine sext(setcc) -> vsetcc before legalize. To make this safe, VSETCC must define all bits, which is different than it was documented to before. Since all targets that implement VSETCC already have this behavior, and we don't optimize based on this, just change the documentation. We now get nice code for vec_compare.ll llvm-svn: 74978	2009-07-08 00:31:33 +00:00
Chris Lattner	fc74e8241a	add support for legalizing an icmp where the result is illegal (4xi1) but the input is legal (4 x i32) llvm-svn: 74964	2009-07-07 23:03:54 +00:00
Chris Lattner	cbbf747b7b	add a trivial test that vector compares work. llvm-svn: 74963	2009-07-07 22:51:09 +00:00
Chris Lattner	30220d8f98	implement support for spliting and scalarizing vector setcc's. This finishes off enough support for vector compares to get the icmp/fcmp version of 2008-07-23-VSetCC.ll passing. llvm-svn: 74961	2009-07-07 22:47:46 +00:00
Chris Lattner	87d4f309f5	verify that the fcmp version of this works just as well as the vfcmp version. We actually get better code for this silly testcase. llvm-svn: 74954	2009-07-07 22:07:47 +00:00
Evan Cheng	d0611f9a37	Add Thumb2 movcc instructions. llvm-svn: 74946	2009-07-07 20:39:03 +00:00
Evan Cheng	39d8075edc	Add missing tests. llvm-svn: 74945	2009-07-07 20:38:08 +00:00
Evan Cheng	d0f6324cdc	Add Thumb2 pkhbt / pkhtb. llvm-svn: 74895	2009-07-07 05:35:52 +00:00
Evan Cheng	b24e51e2d9	Add some more Thumb2 multiplication instructions. llvm-svn: 74889	2009-07-07 01:17:28 +00:00
Evan Cheng	40398233b7	Add bfc to armv6t2. llvm-svn: 74868	2009-07-06 22:23:46 +00:00
Evan Cheng	e63b0e6f79	Added ARM::mls for armv6t2. llvm-svn: 74866	2009-07-06 22:05:45 +00:00
Evan Cheng	ba2410b7ca	Avoid adding a duplicate def. This fixes PR4478. llvm-svn: 74857	2009-07-06 21:34:05 +00:00
Evan Cheng	0e8bde5910	Add thumb2 sign / zero extend with rotate instructions. llvm-svn: 74755	2009-07-03 01:43:10 +00:00
Evan Cheng	53cdf022b6	Added indexed stores. llvm-svn: 74740	2009-07-03 00:06:39 +00:00
Evan Cheng	8ecd7eb3f7	Sign extending pre/post indexed loads. llvm-svn: 74736	2009-07-02 23:16:11 +00:00
Evan Cheng	84c6cda2ef	Thumb2 pre/post indexed loads. llvm-svn: 74696	2009-07-02 07:28:31 +00:00
Chris Lattner	87bb642676	@GOTPCREL is also rip-relative. Fix fast-isel to do the right thing. This fixes an llvm-gcc bootstrap problem I introduced. llvm-svn: 74691	2009-07-02 04:22:01 +00:00
Chris Lattner	d1c5951615	Fix yet-another bug I introduced into fastisel, this time handling constant pool references that weren't getting properly rip-relative. llvm-svn: 74689	2009-07-02 03:14:25 +00:00
Chris Lattner	1f50b61329	Fix codegen for references to available_externally symbols. This fixes PR4482. llvm-svn: 74613	2009-07-01 16:53:44 +00:00
Evan Cheng	04f72fc955	CommuteChangesDestination() should check if to-be-commuted instruction defines any register. Also teaches the default commuteInstruction() to commute instruction without definitions (e.g. X86::test / ARM::tsp). llvm-svn: 74602	2009-07-01 08:29:08 +00:00
Evan Cheng	2a5efe14a7	Remove special handling of implicit_def. Fix a couple more bugs in liveintervalanalysis and coalescer handling of implicit_def. Note, isUndef marker must be placed even on implicit_def def operand or else the scavenger will not ignore it. This is necessary because -O0 path does not use liveintervalanalysis, it treats implicit_def just like any other def. llvm-svn: 74601	2009-07-01 08:19:36 +00:00
Chris Lattner	f95fa1b721	Fix some fast-isel problems selecting global variable addressing in pic mode. llvm-svn: 74582	2009-07-01 03:27:19 +00:00
Evan Cheng	d379e896ff	Handle IMPLICIT_DEF with isUndef operand marker, part 2. This patch moves the code to annotate machineoperands to LiveIntervalAnalysis. It also add markers for implicit_def that define physical registers. The rest, is just a lot of details. llvm-svn: 74580	2009-07-01 01:59:31 +00:00
David Goodwin	86c7e20ca6	Add PIC load and store patterns for Thumb-2. llvm-svn: 74577	2009-07-01 00:01:13 +00:00
David Goodwin	d0890a2bad	Add thumb-2 store word, halfword, and byte. llvm-svn: 74555	2009-06-30 22:11:34 +00:00
David Goodwin	28d6d87244	Improve Thumb-2 jump table support. llvm-svn: 74549	2009-06-30 19:50:22 +00:00
Rafael Espindola	317fd045e2	Fix PR4485. Avoid unnecessary duplication of operand 0 of X86::FpSET_ST0_80. This duplication would cause one register to remain on the stack at the function return. llvm-svn: 74534	2009-06-30 16:40:03 +00:00
Rafael Espindola	bd971ffcc6	Fix PR4484. This was caused by me confounding FP0 and ST(0). llvm-svn: 74523	2009-06-30 12:18:16 +00:00
Evan Cheng	dcf1f59305	Temporarily restore the scavenger implicit_def checking code. MachineOperand isUndef mark is not being put on implicit_def of physical registers (created for parameter passing, etc.). llvm-svn: 74519	2009-06-30 09:19:42 +00:00
Evan Cheng	0dc101b897	Add a bit IsUndef to MachineOperand. This indicates the def / use register operand is defined by an implicit_def. That means it can def / use any register and passes (e.g. register scavenger) can feel free to ignore them. The register allocator, when it allocates a register to a virtual register defined by an implicit_def, can allocate any physical register without worrying about overlapping live ranges. It should mark all of operands of the said virtual register so later passes will do the right thing. This is not the best solution. But it should be a lot less fragile to having the scavenger try to track what is defined by implicit_def. llvm-svn: 74518	2009-06-30 08:49:04 +00:00
Evan Cheng	57726817aa	A few more load instructions. llvm-svn: 74500	2009-06-30 02:15:48 +00:00
David Goodwin	17512663f5	Enhance tests to include shifted-register operand testing. llvm-svn: 74490	2009-06-30 01:02:20 +00:00
David Goodwin	76b37950ca	Add Thumb-2 support for TEQ amd TST. llvm-svn: 74468	2009-06-29 22:49:42 +00:00
David Goodwin	911edef65b	Thumb-2 tests llvm-svn: 74464	2009-06-29 22:25:22 +00:00
Rafael Espindola	538064d6b1	FIX PR 4459. Not sure I understand how the temp register gets used, but this fixes a bug and introduces no regressions. llvm-svn: 74446	2009-06-29 20:29:59 +00:00
David Goodwin	dbf11ba800	Rename ARMcmpNZ to ARMcmpZ and use it to represent comparisons that set only the Z flag (i.e. eq and ne). Make ARMcmpZ commutative. llvm-svn: 74423	2009-06-29 15:33:01 +00:00
Evan Cheng	b23b50d54d	Implement Thumb2 ldr. After much back and forth, I decided to deviate from ARM design and split LDR into 4 instructions (r + imm12, r + imm8, r + r << imm12, constantpool). The advantage of this is 1) it follows the latest ARM technical manual, and 2) makes it easier to reduce the width of the instruction later. The down side is this creates more inconsistency between the two sub-targets. We should split ARM LDR instruction in a similar fashion later. I've added a README entry for this. llvm-svn: 74420	2009-06-29 07:51:04 +00:00
Chris Lattner	9876bd8257	factor some logic out into a helper function, allow remat of loads from constant globals. This implements remat-constant.ll even without aggressive-remat. llvm-svn: 74373	2009-06-27 04:38:55 +00:00
Chris Lattner	fea81da433	Reimplement rip-relative addressing in the X86-64 backend. The new implementation primarily differs from the former in that the asmprinter doesn't make a zillion decisions about whether or not something will be RIP relative or not. Instead, those decisions are made by isel lowering and propagated through to the asm printer. To achieve this, we: 1. Represent RIP relative addresses by setting the base of the X86 addr mode to X86::RIP. 2. When ISel Lowering decides that it is safe to use RIP, it lowers to X86ISD::WrapperRIP. When it is unsafe to use RIP, it lowers to X86ISD::Wrapper as before. 3. This removes isRIPRel from X86ISelAddressMode, representing it with a basereg of RIP instead. 4. The addressing mode matching logic in isel is greatly simplified. 5. The asmprinter is greatly simplified, notably the "NotRIPRel" predicate passed through various printoperand routines is gone now. 6. The various symbol printing routines in asmprinter now no longer infer when to emit (%rip), they just print the symbol. I think this is a big improvement over the previous situation. It does have two small caveats though: 1. I implemented a horrible "no-rip" modifier for the inline asm "P" constraint modifier. This is a short term hack, there is a much better, but more involved, solution. 2. I had to xfail an -aggressive-remat testcase because it isn't handling the use of RIP in the constant-pool reading instruction. This specific test is easy to fix without -aggressive-remat, which I intend to do next. llvm-svn: 74372	2009-06-27 04:16:01 +00:00
Chris Lattner	df92e147c9	remove some unneeded eh info. llvm-svn: 74371	2009-06-27 04:07:31 +00:00
Chris Lattner	de36afc1fe	testcase for PR4466 llvm-svn: 74367	2009-06-27 01:33:35 +00:00
David Goodwin	5285817490	When possible, use "mvn ra, rb" instead of "eor ra, rb, -1" because mvn has a narrow version and eor(i) does not. llvm-svn: 74355	2009-06-26 23:13:13 +00:00
Dan Gohman	d3b930d426	Add some testcases for some of the recent ScalarEvolution bug fixes. llvm-svn: 74353	2009-06-26 22:54:11 +00:00
David Goodwin	3aaa751712	Thumb-2 tests llvm-svn: 74345	2009-06-26 22:37:07 +00:00
Chris Lattner	b5c2639f83	remove unwind info, add test for asmprinting of jump table labels with (%rip) llvm-svn: 74337	2009-06-26 22:16:49 +00:00
Evan Cheng	07b016856d	Add x86 support for 'n' inline asm modifier. This will be handled target independently as part of MC work. llvm-svn: 74336	2009-06-26 22:00:19 +00:00
David Goodwin	aa294c5593	Thumb-2 has CLZ. llvm-svn: 74322	2009-06-26 20:47:43 +00:00
David Goodwin	35ee722d42	Use "adcs/sbcs" only when the carry-out is live, otherwise use "adc/sbc". llvm-svn: 74321	2009-06-26 20:45:56 +00:00
Daniel Dunbar	a720af1370	More spelling Count as count. llvm-svn: 74306	2009-06-26 18:35:07 +00:00
Daniel Dunbar	6b1678d5d8	Spell Count as count. llvm-svn: 74298	2009-06-26 18:21:54 +00:00
David Goodwin	3bd42afebe	Add Thumb-2 tests. llvm-svn: 74295	2009-06-26 18:10:30 +00:00
David Goodwin	5960e6d974	ADC used to implement adde should use "adcs" opcode instead of "adc". llvm-svn: 74293	2009-06-26 18:07:25 +00:00
David Goodwin	34f7ede9e7	ORN and BIC tests. llvm-svn: 74289	2009-06-26 16:20:06 +00:00
David Goodwin	0377f737ff	Currently there is a pattern for the thumb-2 MOV 16-bit immediate instruction. That instruction cannot write the flags so it should use T2I instead of T2sI. Also, added a pattern for the thumb-2 MOV of shifted immediate since that can encode immediates not encodable by the 16-bit immediate. llvm-svn: 74288	2009-06-26 16:10:07 +00:00
Evan Cheng	7779156b39	Fix tests: Count -> count. llvm-svn: 74282	2009-06-26 07:05:57 +00:00
Evan Cheng	34c8c7414f	Fix a CodeGenDAGPatterns bug. Check if top level predicates match when it's looking for duplicates. llvm-svn: 74276	2009-06-26 05:59:16 +00:00
Daniel Dunbar	07025e2c02	Fix spelling of 'count' llvm-svn: 74249	2009-06-26 01:33:02 +00:00
Evan Cheng	97727a61f9	Select ADC, SBC, and RSC instead of the ADCS, SBCS, and RSCS when the carry bit def is not used. llvm-svn: 74228	2009-06-25 23:34:10 +00:00
David Goodwin	16f357cccf	Use MVN for ~t2_so_imm immediates. llvm-svn: 74223	2009-06-25 23:11:21 +00:00
Bill Wendling	722c6e1b70	Don't grep the -debug output. This isn't the way to test changes. llvm-svn: 74211	2009-06-25 21:59:32 +00:00
Chris Lattner	a4194b1082	down with unwind info :) llvm-svn: 74206	2009-06-25 21:48:17 +00:00
Evan Cheng	c7ea8df67e	ISD::ADDE / ISD::SUBE updates the carry bit so they should isle to ADCS and SBCS / RSCS. llvm-svn: 74200	2009-06-25 20:59:23 +00:00
Evan Cheng	83f979a48b	Add Thumb2 pc relative add. llvm-svn: 74141	2009-06-24 23:47:58 +00:00
Evan Cheng	ff1a4a7271	We should run these tests as well. llvm-svn: 74121	2009-06-24 21:36:26 +00:00
Chris Lattner	01d5049dc2	unwind info not needed. llvm-svn: 74112	2009-06-24 19:48:04 +00:00
Evan Cheng	d76d0aa68a	Move thumb and thumb2 tests into separate directories. llvm-svn: 74068	2009-06-24 06:36:07 +00:00
Evan Cheng	38f2453817	Fix support for inline asm input / output operand tying when operand spans across multiple registers (e.g. two i64 operands in 32-bit mode). llvm-svn: 74053	2009-06-24 02:05:51 +00:00
Dan Gohman	f19aeec3f5	Extend ScalarEvolution's multiple-exit support to compute exact trip counts in more cases. Generalize ScalarEvolution's isLoopGuardedByCond code to recognize And and Or conditions, splitting the code out into an isNecessaryCond helper function so that it can evaluate Ands and Ors recursively, and make SCEVExpander be much more aggressive about hoisting instructions out of loops. test/CodeGen/X86/pr3495.ll has an additional instruction now, but it appears to be due to an arbitrary register allocation difference. llvm-svn: 74048	2009-06-24 01:18:18 +00:00
Evan Cheng	4983e4550e	Proper patterns for thumb2 shift and rotate instructions. llvm-svn: 73987	2009-06-23 19:39:13 +00:00
Bob Wilson	2e076c4e02	Add support for ARM's Advanced SIMD (NEON) instruction set. This is still a work in progress but most of the NEON instruction set is supported. llvm-svn: 73919	2009-06-22 23:27:02 +00:00
Evan Cheng	16ee19738c	It's coalescer, not coaleser. llvm-svn: 73902	2009-06-22 21:09:17 +00:00
Bob Wilson	4582530a2c	For Darwin on ARMv6 and newer, make register r9 available for use as a caller-saved register. llvm-svn: 73901	2009-06-22 21:01:46 +00:00
Evan Cheng	8cbbc7944d	Fix another register coalescer crash: forgot to check if the instruction being updated has already been coalesced. llvm-svn: 73898	2009-06-22 20:49:32 +00:00
Evan Cheng	3d75d6af57	hasFP should return true if frame address is taken. llvm-svn: 73893	2009-06-22 18:38:48 +00:00
Rafael Espindola	6ead59f8ed	Fix PR4185. Handle FpSET_ST0_80 being used when ST0 is still alive. llvm-svn: 73850	2009-06-21 12:02:51 +00:00
Chris Lattner	7d2b049404	change TLS_ADDR lowering to lower to a real mem operand, instead of matching as a global with that gets printed with the :mem modifier. All operands to lea's should be handled with the lea32mem operand kind, and this allows the TLS stuff to do this. There are several better ways to do this, but I went for the minimal change since I can't really test this (beyond make check). This also makes the use of EBX explicit in the operand list in the 32-bit, instead of implicit in the instruction. llvm-svn: 73834	2009-06-20 20:38:48 +00:00
Chris Lattner	1771a852f0	no need for unwind info llvm-svn: 73832	2009-06-20 19:48:26 +00:00
Chris Lattner	fbc9778a1b	no need for unwind info here. llvm-svn: 73831	2009-06-20 19:43:09 +00:00
Evan Cheng	c6a8d0dbe9	Fix PR4419: handle defs of partial uses. llvm-svn: 73816	2009-06-20 04:34:51 +00:00
Dan Gohman	cc31110b95	Re-apply r73718, now that the fix in r73787 is in, and add a hand-crafted testcase which demonstrates the bug that was exposed in 254.gap. llvm-svn: 73793	2009-06-19 23:23:27 +00:00
Evan Cheng	b4b20bbb7d	Enable arm pre-allocation load / store multiple optimization pass. llvm-svn: 73791	2009-06-19 23:17:27 +00:00
Evan Cheng	86076c9e30	Revert 73718. It's breaking 254.gap. llvm-svn: 73783	2009-06-19 21:15:06 +00:00
Eli Friedman	2fc939c809	Fix for PR2484: add an SSE1 pattern for a shuffle we normally prefer to handle with an SSE2 instruction. llvm-svn: 73760	2009-06-19 07:00:55 +00:00
Eli Friedman	d984158320	Mark a few Thumb instructions commutable; just happened to spot this while experimenting. I'm reasonably sure this is correct, but please tell me if these instructions have some strange property which makes this change unsafe. llvm-svn: 73746	2009-06-19 01:43:08 +00:00
Evan Cheng	de9e36a74e	On Darwin, ams printer should output a second label before a jump table so the linker knows it's a new atom. But this is only needed if the jump table is put in a separate section from the function body. llvm-svn: 73720	2009-06-18 20:37:15 +00:00
Dan Gohman	8c9ac59455	Generalize LSR's OptimizeSMax to handle unsigned max tests as well as signed max tests. Along with r73717, this helps CodeGen avoid emitting code for a maximum operation for this class of loop. llvm-svn: 73718	2009-06-18 20:23:18 +00:00
Dan Gohman	a0348809b6	Remove the code from IVUsers that attempted to handle casted induction variables in cases where the cast isn't foldable. It ended up being a pessimization in many cases. This could be fixed, but it would require a bunch of complicated code in IVUsers' clients. The advantages of this approach aren't visible enough to justify it at this time. llvm-svn: 73706	2009-06-18 16:54:06 +00:00
Anton Korobeynikov	02bb33c58d	Initial support for some Thumb2 instructions. Patch by Viktor Kutuzov and Anton Korzh from Access Softek, Inc. llvm-svn: 73622	2009-06-17 18:13:58 +00:00
Anton Korobeynikov	469e8217d4	Make the test target-neutral llvm-svn: 73547	2009-06-16 20:25:25 +00:00
Anton Korobeynikov	5d28cb204f	GNU as refuses to assemble "pop {}" instruction. Do not emit such (this is the case when we have thumb vararg function with single callee-saved register, which is handled separately). llvm-svn: 73529	2009-06-16 18:49:08 +00:00
Evan Cheng	cc21a5415a	If a val# is defined by an implicit_def and it is being removed, all of the copies off the val# were removed. This causes problem later since the scavenger will see uses of registers without defs. The proper solution is to change the copies into implicit_def's instead. TurnCopyIntoImpDef turns a copy into implicit_def and remove the val# defined by it. This causes an scavenger assertion later if the def reaches other blocks. Disable the transformation if the value live interval extends beyond its def block. llvm-svn: 73478	2009-06-16 07:12:58 +00:00
Eli Friedman	abfad5d61e	Add some generic expansion logic for SMULO and UMULO. Fixes UMULO support for x86, and UMULO/SMULO for many architectures, including PPC (PR4201), ARM, and Cell. The resulting expansion isn't perfect, but it's not bad. llvm-svn: 73477	2009-06-16 06:58:29 +00:00
Dan Gohman	8e85118943	Update this test to use fmul instead of mul. llvm-svn: 73436	2009-06-15 22:49:34 +00:00
Evan Cheng	b9bff5880a	ifcvt should ignore cfg where true and false successors are the same. llvm-svn: 73423	2009-06-15 21:24:34 +00:00
Bill Wendling	20f0adfc0e	This test is failing. Revert for now. llvm-svn: 73404	2009-06-15 19:10:56 +00:00
Bill Wendling	66e104cd11	Add another testcase for r71478. llvm-svn: 73399	2009-06-15 18:36:34 +00:00
Arnold Schwaighofer	cb9046cfc8	CheckTailCallReturnConstraints is missing a check on the incomming chain of the RETURN node. The incomming chain must be the outgoing chain of the CALL node. This causes the backend to identify tail calls that are not tail calls. This patch fixes this. llvm-svn: 73387	2009-06-15 14:43:36 +00:00
Evan Cheng	1283c6a066	Part 1. - Change register allocation hint to a pair of unsigned integers. The hint type is zero (which means prefer the register specified as second part of the pair) or entirely target dependent. - Allow targets to specify alternative register allocation orders based on allocation hint. Part 2. - Use the register allocation hint system to implement more aggressive load / store multiple formation. - Aggressively form LDRD / STRD. These are formed before register allocation. It has to be done this way to shorten live interval of base and offset registers. e.g. v1025 = LDR v1024, 0 v1026 = LDR v1024, 0 => v1025,v1026 = LDRD v1024, 0 If this transformation isn't done before allocation, v1024 will overlap v1025 which means it more difficult to allocate a register pair. - Even with the register allocation hint, it may not be possible to get the desired allocation. In that case, the post-allocation load / store multiple pass must fix the ldrd / strd instructions. They can either become ldm / stm instructions or back to a pair of ldr / str instructions. This is work in progress, not yet enabled. llvm-svn: 73381	2009-06-15 08:28:29 +00:00
Evan Cheng	185c9ef0a2	Add a ARM specific pre-allocation pass that re-schedule loads / stores from consecutive addresses togther. This makes it easier for the post-allocation pass to form ldm / stm. This is step 1. We are still missing a lot of ldm / stm opportunities because of register allocation are not done in the desired order. More enhancements coming. llvm-svn: 73291	2009-06-13 09:12:55 +00:00
Evan Cheng	b6cf8dbb96	If killed register is defined by implicit_def, do not clear it since it's live range may overlap another def of same register. llvm-svn: 73255	2009-06-12 21:34:26 +00:00
Evan Cheng	d93b5b672f	Mark some pattern-less instructions as neverHasSideEffects. llvm-svn: 73252	2009-06-12 20:46:18 +00:00
Arnold Schwaighofer	e3a018d707	Fix Bug 4278: X86-64 with -tailcallopt calling convention out of sync with regular cc. The only difference between the tail call cc and the normal cc was that one parameter register - R9 - was reserved for calling functions through a function pointer. After time the tail call cc has gotten out of sync with the regular cc. We can use R11 which is also caller saved but not used as parameter register for potential function pointers and remove the special tail call cc on x86-64. llvm-svn: 73233	2009-06-12 16:26:57 +00:00
Anton Korobeynikov	c745132865	Add testcase for register scanveger assertion fix in r72755 (double def due to livevars) llvm-svn: 73096	2009-06-08 22:54:15 +00:00
Eli Friedman	0b387fbd1b	Fix the run-line for this test to work correctly outside of x86. llvm-svn: 73025	2009-06-07 09:44:19 +00:00
Eli Friedman	516479d6e7	Tweak the expansion code for BIT_CONVERT to generate better code converting from an MMX vector to an i64. llvm-svn: 73024	2009-06-07 09:41:57 +00:00
Eli Friedman	3234587213	Slightly generalize the code that handles shuffles of consecutive loads on x86 to handle more cases. Fix a bug in said code that would cause it to read past the end of an object. Rewrite the code in SelectionDAGLegalize::ExpandBUILD_VECTOR to be a bit more general. Remove PerformBuildVectorCombine, which is no longer necessary with these changes. In addition to simplifying the code, with this change, we can now catch a few more cases of consecutive loads. llvm-svn: 73012	2009-06-07 06:52:44 +00:00
Eli Friedman	be1bb0f8b1	PR3628: Add patterns to match SHL/SRL/SRA to the corresponding Altivec instructions. llvm-svn: 73009	2009-06-07 01:07:55 +00:00
Eli Friedman	c61e357aa6	Fix the expansion for CONCAT_VECTORS so that it doesn't create illegal types. llvm-svn: 72993	2009-06-06 07:08:26 +00:00
Eli Friedman	75c496f920	Avoid crashing on a variable-index insertelement with element type i16. llvm-svn: 72991	2009-06-06 06:32:50 +00:00
Eli Friedman	1b1844ad1f	Get rid of some bogus patterns for X86vzmovl. Don't create VZEXT_MOVL nodes for vectors with an i16 element type. Add an optimization for building a vector which is all zeros/undef except for the bottom element, where the bottom element is an i8 or i16. llvm-svn: 72988	2009-06-06 06:05:10 +00:00
Eli Friedman	868bd6ab52	Fix an obvious typo. llvm-svn: 72987	2009-06-06 05:55:37 +00:00
Eli Friedman	6c101ebfa8	Get rid of a bogus pattern that interferes with optimization. llvm-svn: 72985	2009-06-06 04:17:04 +00:00
Eli Friedman	b45e8ce69a	PR2598: make sure to expand illegal forms of integer/floating-point conversions for x86, like <2 x i32> -> <2 x float> and <4 x i16> -> <4 x float>. llvm-svn: 72983	2009-06-06 03:57:58 +00:00
Nate Begeman	624690c6b2	Adapt the x86 build_vector dagcombine to the current state of the legalizer. build vectors with i64 elements will only appear on 32b x86 before legalize. Since vector widening occurs during legalize, and produces i64 build_vector elements, the dag combiner is never run on these before legalize splits them into 32b elements. Teach the build_vector dag combine in x86 back end to recognize consecutive loads producing the low part of the vector. Convert the two uses of TLI's consecutive load recognizer to pass LoadSDNodes since that was required implicitly. Add a testcase for the transform. Old: subl $28, %esp movl 32(%esp), %eax movl 4(%eax), %ecx movl %ecx, 4(%esp) movl (%eax), %eax movl %eax, (%esp) movaps (%esp), %xmm0 pmovzxwd %xmm0, %xmm0 movl 36(%esp), %eax movaps %xmm0, (%eax) addl $28, %esp ret New: movl 4(%esp), %eax pmovzxwd (%eax), %xmm0 movl 8(%esp), %eax movaps %xmm0, (%eax) ret llvm-svn: 72957	2009-06-05 21:37:30 +00:00
Evan Cheng	3158790e32	Changing allocation ordering from r3 ... r0 back to r0 ... r3. The order change no longer make sense after the coalescing changes we have made since then. llvm-svn: 72955	2009-06-05 19:08:58 +00:00
Dan Gohman	5c36f4f40c	Fix an erroneous check for isFNeg; the FNeg case is handled a few lines later on. llvm-svn: 72904	2009-06-04 23:43:29 +00:00
Dan Gohman	a5b9645c4b	Split the Add, Sub, and Mul instruction opcodes into separate integer and floating-point opcodes, introducing FAdd, FSub, and FMul. For now, the AsmParser, BitcodeReader, and IRBuilder all preserve backwards compatability, and the Core LLVM APIs preserve backwards compatibility for IR producers. Most front-ends won't need to change immediately. This implements the first step of the plan outlined here: http://nondot.org/sabre/LLVMNotes/IntegerOverflow.txt llvm-svn: 72897	2009-06-04 22:49:04 +00:00
Devang Patel	72a4d2fec1	Add new function attribute - noredzone. Update code generator to use this attribute and remove DisableRedZone target option. Update llc to set this attribute when -disable-red-zone command line option is used. llvm-svn: 72894	2009-06-04 22:05:33 +00:00
Evan Cheng	fa0ac19b82	RALinScan::attemptTrivialCoalescing() was returning a virtual register instead of the physical register it is allocated to. This resulted in virtual register(s) being added the live-in sets. llvm-svn: 72890	2009-06-04 20:53:36 +00:00
Evan Cheng	60fdf787a7	A value defined by an implicit_def can be liven to a use BB. This is unfortunate. But register allocator still has to add it to the live-in set of the use BB. llvm-svn: 72888	2009-06-04 20:25:48 +00:00
Dan Gohman	1aa86203f6	Check in test changes that I accidentally left out of r72872. llvm-svn: 72875	2009-06-04 18:22:31 +00:00
Eli Friedman	63488f1fbf	PR3739, part 2: Use an explicit store to spill XMM registers. (Previously, the code tried to use "push", which doesn't exist for XMM registers.) llvm-svn: 72836	2009-06-04 02:32:04 +00:00
Eli Friedman	0cb0c78a26	PR3739, part 1: Disable the red zone on Win64. llvm-svn: 72830	2009-06-04 02:02:01 +00:00
Evan Cheng	7f5976e11b	Re-apply 72756 with fixes. One of those was introduced by we changed MachineInstrBuilder::addReg() interface. llvm-svn: 72826	2009-06-04 01:15:28 +00:00
Eli Friedman	ee06b752f0	PR4317: Handle splits where the new block is unreachable correctly in DominatorTreeBase::Split. llvm-svn: 72810	2009-06-03 21:42:06 +00:00
Evan Cheng	ad6f3ff2c0	For Darwin / x86_64, override -relocation-model=static to pic if the output is assembly since Darwin assembler does not really support -static codeine. I view this as a temporary workaround until the assembler / linker changes. llvm-svn: 72806	2009-06-03 21:13:54 +00:00
Evan Cheng	b39a6be77a	Fix for PR4225: When rewriter reuse a value in a physical register , it clear the register kill operand marker and its kill ops information. However, the cleared operand may be a def of a super-register. Clear the kill ops info for the super-register's sub-registers as well. llvm-svn: 72758	2009-06-03 09:00:27 +00:00
Evan Cheng	ab0c710fae	Temporarily revert 72756 for now. llvm-svn: 72757	2009-06-03 07:40:47 +00:00
Evan Cheng	dfe6e689fd	Fold preceding / trailing base inc / dec into the single load / store as well. llvm-svn: 72756	2009-06-03 06:14:58 +00:00
Dan Gohman	fc262babc3	Revert r72734. The Darwin assembler doesn't support the static relocation model on x86-64. Higher level logic should override the relocation model to PIC on x86_64-apple-darwin. llvm-svn: 72746	2009-06-03 00:37:20 +00:00
Dan Gohman	760377effc	Fix CodeGenPrepare's address-mode sinking to handle unusual addresses, involving Base values which do not have Pointer type. This fixes PR4297. llvm-svn: 72739	2009-06-02 21:29:13 +00:00
Evan Cheng	448641d87c	On Darwin x86_64 small code model doesn't guarantee code address fits in 32-bit. llvm-svn: 72734	2009-06-02 20:09:31 +00:00
Evan Cheng	7142ad75a1	(i64 (zext (srl GR32 8))) -> movzbl AH is not safe since srl 8 only clear the top 8 bits. llvm-svn: 72618	2009-05-30 08:43:27 +00:00
Evan Cheng	8e97b85cde	Remove an accidental commit. llvm-svn: 72560	2009-05-29 05:28:52 +00:00
Evan Cheng	716e688fca	More h-registers tricks: folding zext nodes. llvm-svn: 72558	2009-05-29 01:44:43 +00:00
Evan Cheng	86cdb4b345	Do not try to create a MVT type of width 0. llvm-svn: 72557	2009-05-28 23:52:18 +00:00
Eli Friedman	e4b43e60b3	Add explicit test for PR4280. llvm-svn: 72539	2009-05-28 21:04:35 +00:00
Eli Friedman	1f906448bc	Add a testcase which got fixed by recent legalization work. llvm-svn: 72517	2009-05-28 05:10:20 +00:00
Evan Cheng	a9cda8abf2	Added optimization that narrow load / op / store and the 'op' is a bit twiddling instruction and its second operand is an immediate. If bits that are touched by 'op' can be done with a narrower instruction, reduce the width of the load and store as well. This happens a lot with bitfield manipulation code. e.g. orl $65536, 8(%rax) => orb $1, 10(%rax) Since narrowing is not always a win, e.g. i32 -> i16 is a loss on x86, dag combiner consults with the target before performing the optimization. llvm-svn: 72507	2009-05-28 00:35:15 +00:00
Bill Wendling	0a9ea80013	This looks like it passes now. llvm-svn: 72485	2009-05-27 17:43:21 +00:00
Torok Edwin	be6a9a151a	Fix PR4254. The DAGCombiner created a negative shiftamount, stored in an unsigned variable. Later the optimizer eliminated the shift entirely as being undefined. Example: (srl (shl X, 56) 48). ShiftAmt is 4294967288. Fix it by checking that the shiftamount is positive, and storing in a signed variable. llvm-svn: 72331	2009-05-23 17:29:48 +00:00
Torok Edwin	7996339dd8	available_externall linkage is not local, this was confusing the codegenerator, and it wasn't generating calls through @PLT for these functions. hasLocalLinkage() is now false for available_externally, I attempted to fix the inliner and dce to handle available_externally properly. It passed make check. llvm-svn: 72328	2009-05-23 14:06:57 +00:00
Eli Friedman	53a71147ba	Fix test to account for legalization changes; I think this ends up running an extra DAGCombine pass which improves the code a bit. llvm-svn: 72326	2009-05-23 13:15:11 +00:00
Duncan Sands	d6fb6501e3	Add a new codegen pass that normalizes dwarf exception handling code in preparation for code generation. The main thing it does is handle the case when eh.exception calls (and, in a future patch, eh.selector calls) are far away from landing pads. Right now in practice you only find eh.exception calls close to landing pads: either in a landing pad (the common case) or in a landing pad successor, due to loop passes shifting them about. However future exception handling improvements will result in calls far from landing pads: (1) Inlining of rewinds. Consider the following case: In function @f: ... invoke @g to label %normal unwind label %unwinds ... unwinds: %ex = call i8* @llvm.eh.exception() ... In function @g: ... invoke @something to label %continue unwind label %handler ... handler: %ex = call i8* @llvm.eh.exception() ... perform cleanups ... "rethrow exception" Now inline @g into @f. Currently this is turned into: In function @f: ... invoke @something to label %continue unwind label %handler ... handler: %ex = call i8* @llvm.eh.exception() ... perform cleanups ... invoke "rethrow exception" to label %normal unwind label %unwinds unwinds: %ex = call i8* @llvm.eh.exception() ... However we would like to simplify invoke of "rethrow exception" into a branch to the %unwinds label. Then %unwinds is no longer a landing pad, and the eh.exception call there is then far away from any landing pads. (2) Using the unwind instruction for cleanups. It would be nice to have codegen handle the following case: invoke @something to label %continue unwind label %run_cleanups ... handler: ... perform cleanups ... unwind This requires turning "unwind" into a library call, which necessarily takes a pointer to the exception as an argument (this patch also does this unwind lowering). But that means you are using eh.exception again far from a landing pad. (3) Bugpoint simplifications. When bugpoint is simplifying exception handling code it often generates eh.exception calls far from a landing pad, which then causes codegen to assert. Bugpoint then latches on to this assertion and loses sight of the original problem. Note that it is currently rare for this pass to actually do anything. And in fact it normally shouldn't do anything at all given the code coming out of llvm-gcc! But it does fire a few times in the testsuite. As far as I can see this is almost always due to the LoopStrengthReduce codegen pass introducing pointless loop preheader blocks which are landing pads and only contain a branch to another block. This other block contains an eh.exception call. So probably by tweaking LoopStrengthReduce a bit this can be avoided. llvm-svn: 72276	2009-05-22 20:36:31 +00:00
Eli Friedman	9030c35eb4	Fix for PR4235: to build a floating-point value from integer parts, build an integer and cast that to a float. This fixes a crash caused by trying to split an f32 into two f16's. This changes the behavior in test/CodeGen/XCore/fneg.ll because that testcase now triggers a DAGCombine which converts the fneg into an integer operation. If someone is interested, it's probably possible to tweak the test to generate an actual fneg. llvm-svn: 72162	2009-05-20 06:02:09 +00:00
Evan Cheng	1fbc2a4754	Fix test on non-darwin hosts. llvm-svn: 72161	2009-05-20 05:45:36 +00:00
Evan Cheng	960983371c	Try again. Allow call to immediate address for ELF or when in static relocation mode. llvm-svn: 72160	2009-05-20 04:53:57 +00:00
Evan Cheng	61da18645b	Cannot use immediate as call absolute target in PIC mode. llvm-svn: 72154	2009-05-20 01:11:00 +00:00
Bob Wilson	e666cc5206	Fix pr4058 and pr4059. Do not split i64 or double arguments between r3 and the stack. Patch by Sandeep Patel. llvm-svn: 72106	2009-05-19 10:02:36 +00:00
Bob Wilson	a2c462bbe9	Fix pr4091: Add support for "m" constraint in ARM inline assembly. llvm-svn: 72105	2009-05-19 05:53:42 +00:00
Dan Gohman	b81dd48fd2	Add nounwind to a few tests. llvm-svn: 72002	2009-05-18 15:16:49 +00:00
Anton Korobeynikov	6de08cd093	Mark rotl/rotr as expand. This generates pretty ugly code, but this is better than nothing. llvm-svn: 71976	2009-05-17 10:16:28 +00:00
Anton Korobeynikov	6b5523aec2	Typo llvm-svn: 71975	2009-05-17 10:15:22 +00:00
Jakob Stoklund Olesen	9d7fb58581	Help DejaGnu avoid pipe-jam by producing less output from certain test cases. When a test fails with more than a pipeful of output on stdout AND stderr, one of the DejaGnu programs blocks. The problem can be avoided by redirecting stdout to a file. llvm-svn: 71919	2009-05-16 00:34:42 +00:00
Dan Gohman	6b43dd7664	Add nounwind to this test. llvm-svn: 71734	2009-05-13 22:29:12 +00:00
Evan Cheng	85cca64dd7	If header of inner loop is aligned, do not align the outer loop header. We don't want to add nops in the outer loop for the sake of aligning the inner loop. llvm-svn: 71609	2009-05-12 23:58:14 +00:00
Evan Cheng	df1aeeeb90	Teach TransferDeadness to delete truly dead instructions if they do not produce side effects. llvm-svn: 71606	2009-05-12 23:07:00 +00:00

... 10 11 12 13 14 ...

2862 Commits