llvm-project

Commit Graph

Author	SHA1	Message	Date
Dan Gohman	69499b13fd	Add support for rematerializing FsFLD0SS and FsFLD0SD as constant-pool loads in order to reduce register pressure. llvm-svn: 82470	2009-09-21 18:30:38 +00:00
Dan Gohman	48ade83e60	Recognize SSE min and max opportunities in even more cases. And fix a bug with the behavior of min/max instructions formed from fcmp uge comparisons. Also, use FiniteOnlyFPMath() for this code instead of UnsafeFPMath, as it is more specific. llvm-svn: 82466	2009-09-21 18:03:22 +00:00
Dale Johannesen	a894053a9b	When computing live intervals for earlyclobber operands, we pushed the beginning of the interval back 1, so the interval would overlap with inputs that die. We were also pushing the end of the interval back 1, though, which means the earlyclobber didn't overlap with other output operands. Don't do this. PR 4964. llvm-svn: 82342	2009-09-20 00:36:41 +00:00
Evan Cheng	9827ad39a7	Fix PR4926. When target hook EmitInstrWithCustomInserter() insert new basic blocks and update CFG, it should also inform sdisel of the changes so the phi source operands will come from the right basic blocks. llvm-svn: 82311	2009-09-19 09:51:03 +00:00
Dan Gohman	51f5442403	Delete the label names from this test to make it less fragile. llvm-svn: 82276	2009-09-18 21:23:12 +00:00
Chris Lattner	3a78ce3a56	Make a new X8632_MachoTargetObjectFile TLOF implementation whose getSymbolForDwarfGlobalReference is smart enough to know that it needs to register the stub it references with MachineModuleInfoMachO, so that it gets emitted at the end of the file. Move stub emission from X86ATTAsmPrinter::doFinalization to the new X86ATTAsmPrinter::EmitEndOfAsmFile asmprinter hook. The important thing here is that EmitEndOfAsmFile is called after the ehframes are emitted, so we get all the stubs. This allows us to remove a gross hack from the asmprinter where it would "just know" that it needed to output stubs for personality functions. Now this is all driven from a consistent interface. The testcase change is just reordering the expected output now that the stubs come out after the ehframe instead of before. This also unblocks other changes that Bill wants to make. llvm-svn: 82269	2009-09-18 20:22:52 +00:00
Dan Gohman	722b1eefdb	Add support for using the FLAGS result of or, xor, and and instructions on x86, to avoid explicit test instructions. A few existing tests changed due to arbitrary register allocation differences. llvm-svn: 82263	2009-09-18 19:59:53 +00:00
Chris Lattner	7fc4ad6a63	make this testcase check darwin32 also llvm-svn: 82182	2009-09-17 23:56:41 +00:00
Chris Lattner	2ba262b7e8	rename test llvm-svn: 82181	2009-09-17 23:55:12 +00:00
Chris Lattner	bf2fd768f9	convert to filecheck llvm-svn: 82179	2009-09-17 23:54:26 +00:00
Chris Lattner	4a1c8fc061	rename file llvm-svn: 82178	2009-09-17 23:42:06 +00:00
Daniel Dunbar	94cb6144d2	Remove test cases using -regalloc=simple. llvm-svn: 82130	2009-09-17 06:37:07 +00:00
Evan Cheng	f56b0482c4	Fix PR4910: Broken logic in coalescer means when a physical register liveness is being shortened, the sub-registers were not. The symptom is the register allocator could not find a free register for this particular test. llvm-svn: 82108	2009-09-17 00:57:15 +00:00
Chris Lattner	db4916a123	fix PR4984 by ensuring that fastisel adds properly sign extended GEP displacement values to machineinstrs. llvm-svn: 81886	2009-09-15 18:27:02 +00:00
Chris Lattner	c25359e1a3	rename test llvm-svn: 81884	2009-09-15 18:23:37 +00:00
Chris Lattner	2503b50e4d	convert to filecheck llvm-svn: 81882	2009-09-15 18:23:23 +00:00
Dan Gohman	d29cafc083	Restore a comment that was lost in the merge. llvm-svn: 81857	2009-09-15 15:09:54 +00:00
Chris Lattner	30fceaba05	this is failing on linux hosts, force a triple. llvm-svn: 81833	2009-09-15 04:27:29 +00:00
Chris Lattner	170c116aa7	merge one more in. llvm-svn: 81824	2009-09-15 02:27:23 +00:00
Chris Lattner	42aaa6e443	merge some more cmov tests into cmov.ll llvm-svn: 81823	2009-09-15 02:25:21 +00:00
Chris Lattner	baf78e5b52	merge two cmov tests into one. llvm-svn: 81822	2009-09-15 02:22:47 +00:00
Dan Gohman	520a6856ba	Don't pull a load through a callseq_start if the load's chain has multiple uses, as one of the other uses may be on a path to a different node above the callseq_start, because that leads to a cyclic graph. This problem is exposed when -combiner-global-alias-analysis is used. This fixes PR4880. llvm-svn: 81821	2009-09-15 01:22:01 +00:00
Dan Gohman	65829a4ccb	On x86-64, the 32-bit cmov doesn't actually clear the high 32-bit of its result if the condition is false. llvm-svn: 81814	2009-09-15 00:14:11 +00:00
Chris Lattner	033d31165d	merge the linux cpool/jtbl pic tests into pic.ll and convert to filecheck. Change the picbase symbol on non-darwin systems from ".Lllvm$4.$piclabel" to ".L4$pb". The actual name doesn't matter and the darwin name is shorter. llvm-svn: 81688	2009-09-13 18:46:37 +00:00
Dan Gohman	f437e68058	Add -mattr=+sse2 to the -march=x86 version of this test. Without sse, this code falls back to SelectionDAG isel which uses an x87 instruction, which is fine, but not what this test is testing for. llvm-svn: 81656	2009-09-12 23:45:47 +00:00
Dan Gohman	a080159a7c	Convert more tests to avoid llvm-as. llvm-svn: 81545	2009-09-11 18:36:27 +00:00
Dan Gohman	1880092722	Change tests from "opt %s" to "opt < %s" so that opt doesn't see the input filename so that opt doesn't print the input filename in the output so that grep lines in the tests don't unintentionally match strings in the input filename. llvm-svn: 81537	2009-09-11 18:01:28 +00:00
Chris Lattner	992e42b606	turn on -experimental-asm-printer for x86 / AT&T by default. llvm-svn: 81532	2009-09-11 17:07:27 +00:00
Evan Cheng	74a3231de4	Follow up to 81494. When the folded reload is narrowed to a 32-bit load then change the destination register to a 32-bit one or add a sub-register index. llvm-svn: 81496	2009-09-11 01:01:31 +00:00
Evan Cheng	3cad6283b8	It's not legal to fold a load from a narrower stack slot into a wider instruction. If done, the instruction does a 64-bit load and that's not safe. This can happen we a subreg_to_reg 0 has been coalesced. One exception is when the instruction that folds the load is a move, then we can simply turn it into a 32-bit load from the stack slot. rdar://7170444 llvm-svn: 81494	2009-09-11 00:39:26 +00:00
Dan Gohman	89b090e51e	Reapply r81171 with a fix: don't try to use i64 when it isn't legal. llvm-svn: 81492	2009-09-11 00:34:46 +00:00
Bob Wilson	59e4c84c6f	Revert r81171 which was causing pr4927. llvm-svn: 81415	2009-09-10 00:49:22 +00:00
Dan Gohman	16ad903fcf	When widening a vector load, use the correct chain. This fixes PR4891. llvm-svn: 81343	2009-09-09 14:22:57 +00:00
Torok Edwin	a40184aa77	Add testcase for r81322 (PR4933). llvm-svn: 81327	2009-09-09 09:34:43 +00:00
Chris Lattner	d5f2b3f543	add a testacse for the objc problem that required required r81305 to be temporarily disabled. llvm-svn: 81320	2009-09-09 06:19:34 +00:00
Chris Lattner	afa12db8a6	disable the new asmprinter by default. Both the Mangler and MCSymbol printing stuff are quoting symbols now, breaking objc testcases. llvm-svn: 81319	2009-09-09 06:11:14 +00:00
Chris Lattner	ba0d9f538f	turn the mcinst asmprinter on by default for x86, tweaking two tests to expect the slight syntax differences in the generated code. llvm-svn: 81305	2009-09-09 00:41:36 +00:00
Chris Lattner	5fdb699f13	this got merged into lea.ll llvm-svn: 81298	2009-09-09 00:22:31 +00:00
Chris Lattner	68d3d61ec1	filecheckize llvm-svn: 81297	2009-09-09 00:19:46 +00:00
Dan Gohman	40503396da	Eliminate more uses of llvm-as and llvm-dis. llvm-svn: 81290	2009-09-08 23:54:48 +00:00
Chris Lattner	9e5674ae3f	tweak this to pass on linux. llvm-svn: 81273	2009-09-08 23:32:40 +00:00
Chris Lattner	dae3e56cb7	convert to filecheck syntax llvm-svn: 81267	2009-09-08 23:16:26 +00:00
Chris Lattner	e819cfbc71	change selectiondag to add the sign extended versions of immediate operands to instructions instead of zero extended ones. This makes the asmprinter print signed values more consistently. This apparently only really affects the X86 backend. llvm-svn: 81265	2009-09-08 23:05:44 +00:00
Chris Lattner	7896c8ba58	filecheckize some tests llvm-svn: 81259	2009-09-08 22:38:46 +00:00
Dan Gohman	72a13d2476	Use opt -S instead of piping bitcode output through llvm-dis. llvm-svn: 81257	2009-09-08 22:34:10 +00:00
Dan Gohman	9737a63ed8	Change these tests to feed the assembly files to opt directly, instead of using llvm-as, now that opt supports this. llvm-svn: 81226	2009-09-08 16:50:01 +00:00
Anton Korobeynikov	758f8c690d	Unbreak llvm-svn: 81205	2009-09-08 07:30:03 +00:00
Evan Cheng	a7afdda65d	When remat'ing and destination virtual register has a sub-register index. Make sure the sub-register class matches the register class of the remat'ed instruction definition register class. llvm-svn: 81204	2009-09-08 06:39:07 +00:00
Chris Lattner	a8cb3dffe9	disable some irrelevant eh emission llvm-svn: 81200	2009-09-08 06:26:40 +00:00
Chris Lattner	b2fcd070e2	fix PR4767, a crash because fp stackifier visited blocks in depth first order, so it wouldn't process unreachable blocks. When compiling at -O0, late dead block elimination isn't done and the bad instructions got to isel. llvm-svn: 81187	2009-09-08 04:55:44 +00:00
Dan Gohman	f4a0f0f033	Fix an abort on a store of an empty struct member. getValue returns null in the case of an empty struct, so don't try to call getNumValues on it. llvm-svn: 81180	2009-09-08 01:44:02 +00:00
Dan Gohman	2512a42548	Fix a thinko: When lowering fneg with xor, bitcast the operands from floating-point to integer first, and bitcast the result back to floating-point. Previously, this test was passing by falling back to SelectionDAG lowering. The resulting code isn't as nice, but it's correct and CodeGen now stays on the fast path. llvm-svn: 81171	2009-09-07 23:47:14 +00:00
Daniel Dunbar	b9a562b7c4	Don't depend on arch specific global prefix. llvm-svn: 81084	2009-09-05 11:53:06 +00:00
Daniel Dunbar	b9ea94c990	Eliminate uses of %prcontext. - I'd appreciate it if someone else eyeballs my changes to make sure I captured the intent of the test. llvm-svn: 81083	2009-09-05 11:35:16 +00:00
Bob Wilson	7f20002993	Stabilize the order of live intervals in the priority_queue used by the linear scan reg alloc. This fixes a problem I ran into where extracting a function from a larger file caused the generated code to change (masking the problem I was trying to debug) because the allocator behaved differently. This changes the results for two X86 regression checks. stack-color-with-reg is improved, with one less instruction, but pr3495 is worse, with one more copy. As far as I can tell, these tests were just getting lucky or unlucky, so I've changed the expected results. llvm-svn: 81060	2009-09-05 01:19:16 +00:00
Dan Gohman	aa92dc1e61	LLVM currently represents floating-point negation as -0.0 - x. Fix FastISel to recognize this pattern and emit a floating-point negation using xor. llvm-svn: 80963	2009-09-03 22:53:57 +00:00
Daniel Dunbar	abf2bb683a	Remove dead greps. llvm-svn: 80946	2009-09-03 20:59:02 +00:00
Dan Gohman	d0d5e685da	Recognize more opportunities to use SSE min and max instructions, swapping the operands if necessary. llvm-svn: 80940	2009-09-03 20:34:31 +00:00
Mon P Wang	eadd21ea3c	Test cases for vector shifts changes r80935 Changed the old vector shift test to use FileCheck llvm-svn: 80936	2009-09-03 19:57:35 +00:00
Evan Cheng	1b38952c99	Reference to hidden symbols do not have to go through non-lazy pointer in non-pic mode. rdar://7187172. llvm-svn: 80904	2009-09-03 07:04:02 +00:00
Chris Lattner	cdb6fd2c7c	merge all the basic linux/32 pic tests together into one test. llvm-svn: 80902	2009-09-03 06:29:23 +00:00
Chris Lattner	4f101f98d1	rename test llvm-svn: 80901	2009-09-03 06:16:49 +00:00
Chris Lattner	b284f7b1d9	eliminate some uses of prcontext. Any help here would be appreciated :) llvm-svn: 80520	2009-08-30 21:45:23 +00:00
Dan Gohman	ca73326f56	CMOV_GR8 clobbers EFLAGS when its expansion involves an xor to set a register to 0. This fixes PR4814. llvm-svn: 80445	2009-08-29 22:19:15 +00:00
Dan Gohman	7f0ca9a34c	X86FastISel support for loading and storing values of type i1. llvm-svn: 80186	2009-08-27 00:31:47 +00:00
Dan Gohman	f1abb5511b	Expand i8 selects into control flow instead of 16-bit conditional moves. This avoids the need to promote the operands (or implicitly extend them, a partial register update condition), and can reduce i8 register pressure. This substantially speeds up code such as write_hex in lib/Support/raw_ostream.cpp. subclass-coalesce.ll is too trivial and no longer tests what it was originally intended to test. llvm-svn: 80184	2009-08-27 00:14:12 +00:00
Dan Gohman	6c23fa2442	Don't use INSERT_SUBREG to model anyext operations on x86-64, as it leads to partial-register definitions. To help avoid redundant zero-extensions, also teach the h-register matching patterns that use movzbl to match anyext as well as zext. llvm-svn: 80099	2009-08-26 14:59:13 +00:00
Chris Lattner	6d9d5a9c94	convert to filecheck llvm-svn: 80025	2009-08-25 20:49:04 +00:00
Daniel Dunbar	9cc4970ed3	Switch abi-isel.ll to FileCheck; it's not much faster, but it now tests a lot more and is much nicer to the OS. - Dan, please check. If there are parts of the test you think I should strip out so it doesn't cause random failures let me know (there are still some PIC label numbers in it, for example). llvm-svn: 80019	2009-08-25 18:45:03 +00:00
Dan Gohman	0d4bbf2c4a	Remove obsolete -f flags. llvm-svn: 79992	2009-08-25 15:38:29 +00:00
Dale Johannesen	f8d37c6b81	Fix PR 4751, another difficulty with %a modifier on x86. llvm-svn: 79961	2009-08-25 00:16:14 +00:00
Dale Johannesen	fbc9a2e33b	Split test into 3. llvm-svn: 79926	2009-08-24 17:51:19 +00:00
Jakob Stoklund Olesen	972c8fab51	Fix PR4753. When undoing a reuse in ReuseInfo::GetRegForReload, check if it was only a sub-register being used. The MachineOperand::getSubReg() method is only valid for virtual registers, so we have to recover the sub-register index manually. llvm-svn: 79855	2009-08-23 13:01:45 +00:00
Eli Friedman	682d8c1881	Make x86 test actually test x86 code generation. Fix the construct on ARM, which was breaking by coincidence, and add a similar testcase for ARM. llvm-svn: 79719	2009-08-22 03:13:10 +00:00
Chris Lattner	f09250f1b1	rename test, make more specific. llvm-svn: 79712	2009-08-22 00:44:24 +00:00
Dale Johannesen	fa2b97e61a	Use FileCheck even though this means testing for something that has nothing to do with the point of the test, per Chris. llvm-svn: 79569	2009-08-20 22:12:08 +00:00
Dan Gohman	05046085b6	Fix an x86 code size regression: prefer RIP-relative addressing over absolute addressing even in non-PIC mode (unless the address has an index or something else incompatible), because it has a smaller encoding. llvm-svn: 79553	2009-08-20 18:23:44 +00:00
Dale Johannesen	d16abc09c4	Use FileCheck for the test run where it's appropriate. llvm-svn: 79534	2009-08-20 16:58:04 +00:00
Dale Johannesen	1d764f61ef	Handle 'a' modifier in X86 asms. PR 4742. llvm-svn: 79484	2009-08-19 22:44:41 +00:00
Bill Wendling	6c528bc7ae	Make this test platform neutral. llvm-svn: 79447	2009-08-19 18:51:45 +00:00
Dan Gohman	ac33a9061d	Add an x86 peep that narrows TEST instructions to forms that use a smaller encoding. These kinds of patterns are very frequent in sqlite3, for example. llvm-svn: 79439	2009-08-19 18:16:17 +00:00
Eli Friedman	1e008c173a	PR4737: Fix a nasty bug in load narrowing with non-power-of-two types. llvm-svn: 79415	2009-08-19 08:46:10 +00:00
Dan Gohman	4906f73a9f	Legalize the shift amount operand of SRL_PARTS, SHL_PARTS, and SRA_PARTS, as is done for SRL, SHL, and SRA. llvm-svn: 79380	2009-08-18 23:36:17 +00:00
Dan Gohman	b85ad63727	Make this test less sensitive to assembler differences. llvm-svn: 79348	2009-08-18 17:19:46 +00:00
Chris Lattner	8b0e164aa6	force a triple so this passes on darwin llvm-svn: 79345	2009-08-18 16:55:45 +00:00
Dan Gohman	a41fa35992	Make tail merging handle blocks with repeated predecessors correctly, and remove RemoveDuplicateSuccessor, as it is no longer necessary, and because it breaks assumptions made in MachineBasicBlock::isOnlyReachableByFallthrough. Convert test/CodeGen/X86/omit-label.ll to FileCheck and add a testcase for PR4732. test/CodeGen/Thumb2/thumb2-ifcvt2.ll sees a diff with this commit due to it being bugpoint-reduced to the point where it doesn't matter what the condition for the branch is. Add some more interesting code to test/CodeGen/X86/2009-08-06-branchfolder-crash.ll, which is the testcase that originally motivated the RemoveDuplicateSuccessor code, to help verify that the original problem isn't being re-broken. llvm-svn: 79338	2009-08-18 15:18:18 +00:00
Eli Friedman	b4d7312249	Fix test on Linux. llvm-svn: 79140	2009-08-15 21:28:17 +00:00
Chris Lattner	da108b4ed4	implement support for CHECK-NEXT: in filecheck. llvm-svn: 79123	2009-08-15 18:32:21 +00:00
Chris Lattner	c6a803be7c	specify a target triple so global variable manglings are consistent etc. llvm-svn: 79118	2009-08-15 17:35:05 +00:00
Chris Lattner	3838c2dabf	convert to filecheck. llvm-svn: 79117	2009-08-15 17:28:09 +00:00
Chris Lattner	bb193ecec8	rename this test to sse2.ll llvm-svn: 79116	2009-08-15 17:24:09 +00:00
Chris Lattner	d3954e2790	merge a bunch more sse3 tests into sse3.ll llvm-svn: 79115	2009-08-15 17:21:44 +00:00
Chris Lattner	9bae01ec47	convert test to filecheck format. llvm-svn: 79114	2009-08-15 17:05:03 +00:00
Chris Lattner	912aa19c25	rename test llvm-svn: 79113	2009-08-15 17:01:44 +00:00
Chris Lattner	e5b9130efe	this is a test for sse3, simplify it. llvm-svn: 79112	2009-08-15 17:01:19 +00:00
Dan Gohman	0700a56860	On x86-64, for a varargs function, don't store the xmm registers to the register save area if %al is 0. This avoids touching xmm regsiters when they aren't actually used. llvm-svn: 79061	2009-08-15 01:38:56 +00:00
Anton Korobeynikov	75821f7c69	Properly handle indirect win64 args when they're passed in memory llvm-svn: 79009	2009-08-14 18:19:10 +00:00
Bruno Cardoso Lopes	607cd3b63a	Change MCSectionELF to represent a section semantically instead of syntactically as a string, very similiar to what Chris did with MachO. The parsing support and validation is not introduced yet. llvm-svn: 78890	2009-08-13 05:07:35 +00:00
Dan Gohman	ef3d457126	Various AsmWriter output cleanups. Use WriteAsOperand instead of PrintUnmangledNameSafely. llvm-svn: 78878	2009-08-13 01:36:44 +00:00
Dan Gohman	1c0e13fe66	Use WriteAsOperand to print BasicBlock names. llvm-svn: 78838	2009-08-12 20:56:56 +00:00
Dale Johannesen	58043874ce	Test for 78821, sort of. While that bug is nondeterministic, this test failed consistently on a Darwin build. llvm-svn: 78822	2009-08-12 17:43:47 +00:00
Chris Lattner	1235f20744	one last (?) bad x86 triple test. llvm-svn: 78801	2009-08-12 06:49:44 +00:00
Chris Lattner	6628e17344	fix some pastos in triple lines. llvm-svn: 78800	2009-08-12 06:49:12 +00:00
Chris Lattner	0ea6e4cc7f	another bogus triple llvm-svn: 78798	2009-08-12 06:36:52 +00:00
Chris Lattner	8c2d846a42	fix another broken target triple. llvm-svn: 78796	2009-08-12 06:29:18 +00:00
Chris Lattner	d4a70aedb0	fix an incorrect target triple. llvm-svn: 78795	2009-08-12 06:28:51 +00:00
Dan Gohman	9d26c85bdc	Fix a bug in the DAGCombiner's handling of multiple linked MERGE_VALUES nodes. Replacing the result values with the operands in one MERGE_VALUES node may cause another MERGE_VALUES node be CSE'd with the first one, and bring its uses along, so that the first one isn't dead, as this code expects. Fix this by iterating until the node is really dead. This fixes PR4699. llvm-svn: 78619	2009-08-10 23:43:19 +00:00
Chris Lattner	cb307a27bf	Make the big switch: Change MCSectionMachO to represent a section semantically instead of syntactically as a string. This means that it keeps track of the segment, section, flags, etc directly and asmprints them in the right format. This also includes parsing and validation support for llvm-mc and "attribute(section)", so we should now start getting errors about invalid section attributes from the compiler instead of the assembler on darwin. Still todo: 1) Uniquing of darwin mcsections 2) Move all the Darwin stuff out to MCSectionMachO.[cpp\|h] 3) there are a few FIXMEs, for example what is the syntax to get the S_GB_ZEROFILL segment type? llvm-svn: 78547	2009-08-10 01:39:42 +00:00
Eric Christopher	7dfa9f2e56	Add crc32 instruction and intrinsics. Add a new class of prefix bytes for F2 0F 38 and propagate. Add a FIXME for a set of possibilities which correspond to intrinsics already used. New test. llvm-svn: 78508	2009-08-08 21:55:08 +00:00
Anton Korobeynikov	674ffc1e59	Do not generate 32-bit call on win64 when imm does not fit llvm-svn: 78443	2009-08-07 23:59:21 +00:00
Chris Lattner	6eceb7c85d	rename test llvm-svn: 78441	2009-08-07 23:57:30 +00:00
Chris Lattner	ff8d04e815	merge a bunch of tests together into one, convert to filecheck which is more tolerant of whitespace differences. llvm-svn: 78439	2009-08-07 23:56:42 +00:00
Dale Johannesen	3a127ce1d8	Add the testcase from PR 4668. This works at the moment, but it's a fragile area. llvm-svn: 78358	2009-08-07 00:04:42 +00:00
Dale Johannesen	15a5fad94b	Fix PR 4626, a crash in branch folding after OptimizeBlock produced a CFG it wasn't prepared for. llvm-svn: 78351	2009-08-06 22:56:40 +00:00
Dan Gohman	b4764e5b7f	Tidy up this testcase. llvm-svn: 78322	2009-08-06 17:11:55 +00:00
Chris Lattner	a7e2662770	reduce testcase. llvm-svn: 78315	2009-08-06 16:14:33 +00:00
Dan Gohman	ee902509a8	Remove an over-aggressive assert. Functions with empty struct return types don't have any return values, from CodeGen's perspective. This fixes PR4688. llvm-svn: 78311	2009-08-06 15:07:58 +00:00
Anton Korobeynikov	644caa0cdb	Add tests for X86-64 code model handling. Small and kernel for now. llvm-svn: 78300	2009-08-06 12:25:20 +00:00
Dan Gohman	130e2c7aed	Fix a bug in x86's PreprocessForRMW logic that was exposed by aggressive chain operand optimization. UpdateNodeOperands does not modify the node in place if it would result in a node identical to an existing node. llvm-svn: 78297	2009-08-06 09:22:57 +00:00
Dan Gohman	5758e1e92a	Fix a few places in DAGCombiner that were creating all-ones-bits and high-bits values in ways that weren't correct for integer types wider than 64 bits. This fixes a miscompile in PPMacroExpansion.cpp in clang on x86-64. llvm-svn: 78295	2009-08-06 09:18:59 +00:00
Dan Gohman	df7ea32af7	Enable the new no-SP register classes by default. This is to address PR4572. A few tests have some minor code regressions due to different coalescing. llvm-svn: 78217	2009-08-05 17:40:24 +00:00
Dan Gohman	2bebfc38af	Change these tests to use function attributes rather than special llc command-line options. llvm-svn: 78204	2009-08-05 16:37:27 +00:00
Dan Gohman	e32c0fe584	Revert changes accidentally committed along with r78163. llvm-svn: 78165	2009-08-05 05:38:13 +00:00
Dan Gohman	8c79569853	Teach X86FastISel how to handle CCValAssign::BCvt, which is used for MMX arguments. This fixes PR4684. llvm-svn: 78163	2009-08-05 05:33:42 +00:00
Dan Gohman	f9bbcd1afd	Major calling convention code refactoring. Instead of awkwardly encoding calling-convention information with ISD::CALL, ISD::FORMAL_ARGUMENTS, ISD::RET, and ISD::ARG_FLAGS nodes, TargetLowering provides three virtual functions for targets to override: LowerFormalArguments, LowerCall, and LowerRet, which replace the custom lowering done on the special nodes. They provide the same information, but in a more immediately usable format. This also reworks much of the target-independent tail call logic. The decision of whether or not to perform a tail call is now cleanly split between target-independent portions, and the target dependent portion in IsEligibleForTailCallOptimization. This also synchronizes all in-tree targets, to help enable future refactoring and feature work. llvm-svn: 78142	2009-08-05 01:29:28 +00:00
Jakob Stoklund Olesen	6304369c4e	LowerSubregsInstructionPass::LowerExtract should not extend the live range of registers. When LowerExtract eliminates an EXTRACT_SUBREG with a kill flag, it moves the kill flag to the place where the sub-register is killed. This can accidentally overlap with the use of a sibling sub-register, and we have trouble. In the test case we have this code: Live Ins: %R0 %R1 %R2 %R2L<def> = EXTRACT_SUBREG %R2<kill>, 1 %R2H<def> = LOAD16fi <fi#-1>, 0, Mem:LD(2,4) [FixedStack-1 + 0] %R1L<def> = EXTRACT_SUBREG %R1<kill>, 1 %R0L<def> = EXTRACT_SUBREG %R0<kill>, 1 %R0H<def> = ADD16 %R2H<kill>, %R2L<kill>, %AZ<imp-def>, %AN<imp-def>, %AC0<imp-def>, %V<imp-def>, %VS<imp-def> subreg: CONVERTING: %R2L<def> = EXTRACT_SUBREG %R2<kill>, 1 subreg: eliminated! subreg: killed here: %R0H<def> = ADD16 %R2H, %R2L, %R2<imp-use,kill>, %AZ<imp-def>, %AN<imp-def>, %AC0<imp-def>, %V<imp-def>, %VS<imp-def> The kill flag on %R2 is moved to the last instruction, and the live range overlaps with the definition of %R2H: * Bad machine code: Redefining a live physical register * - function: f - basic block: 0x18358c0 (#0) - instruction: %R2H<def> = LOAD16fi <fi#-1>, 0, Mem:LD(2,4) [FixedStack-1 + 0] Register R2H was defined but already live. The fix is to replace EXTRACT_SUBREG with IMPLICIT_DEF instead of eliminating it completely: subreg: CONVERTING: %R2L<def> = EXTRACT_SUBREG %R2<kill>, 1 subreg: replace by: %R2L<def> = IMPLICIT_DEF %R2<kill> Note that these IMPLICIT_DEF instructions survive to the asm output. It is necessary to fix the stack-color-with-reg test case because of that. llvm-svn: 78093	2009-08-04 20:01:11 +00:00
Chris Lattner	f222054df7	enhance codegen to put 16-bit character strings into the __TEXT,__ustring section on darwin. llvm-svn: 78068	2009-08-04 16:27:13 +00:00
Chris Lattner	81bbf443fe	Add support emiting for 2/4 byte mergable strings to the ".rodata.str*" section on ELF targets. llvm-svn: 78066	2009-08-04 16:13:09 +00:00
Anton Korobeynikov	71386e08fe	Unbreak Win64 CC. Step one: honour register save area, fix some alignment and provide a different set of call-clobberred registers. llvm-svn: 77962	2009-08-03 08:12:53 +00:00
Rafael Espindola	70e9816624	Use movd instead of movq llvm-svn: 77956	2009-08-03 05:21:05 +00:00
Daniel Dunbar	0f16ea5c30	Pass target triple string in to TargetMachine constructor. This is not just a matter of passing in the target triple from the module; currently backends are making decisions based on the build and host architecture. The goal is to migrate to making these decisions based off of the triple (in conjunction with the feature string). Thus most clients pass in the target triple, or the host triple if that is empty. This has one important change in the way behavior of the JIT and llc. For the JIT, it was previously selecting the Target based on the host (naturally), but it was setting the target machine features based on the triple from the module. Now it is setting the target machine features based on the triple of the host. For LLC, -march was previously only used to select the target, the target machine features were initialized from the module's triple (which may have been empty). Now the target triple is taken from the module, or the host's triple is used if that is empty. Then the triple is adjusted to match -march. The take away is that -march for llc is now used in conjunction with the host triple to initialize the subtarget. If users want more deterministic behavior from llc, they should use -mtriple, or set the triple in the input module. llvm-svn: 77946	2009-08-03 04:03:51 +00:00
Rafael Espindola	18ba271a79	Use movq to move 64 bits in and out of mmx registers. Fixes PR4669 llvm-svn: 77940	2009-08-03 02:45:34 +00:00
Chris Lattner	b4b1012d29	fix a problem Eli noticed where we would compile the attached ptrtoint to: .quad X even on a 32-bit system, where X is not 64-bits. There isn't much that we can do here, so we just print: .quad ((X) & 4294967295) instead. llvm-svn: 77818	2009-08-01 22:25:12 +00:00
Dan Gohman	9023fd2b2a	Add nounwind to this test. llvm-svn: 77792	2009-08-01 19:11:04 +00:00
David Greene	81bcae5fda	Simplify operand padding by keying off tabs in the asm stream. If padding is disabled, tabs get replaced by spaces except in the case of the first operand, where the tab is output to line up the operands after the mnemonics. Add some better comments and eliminate redundant code. Fix some testcases to not assume tabs. llvm-svn: 77740	2009-07-31 21:57:10 +00:00
Chris Lattner	fc0264a38e	fix PR4650: we only track sizes for certain objects, so only put something into the mergable section if it is one of our special cases. This could obviously be improved, but this is the minimal fix and restores us to the previous behavior. llvm-svn: 77679	2009-07-31 16:17:13 +00:00
Evan Cheng	e62288fdd4	Optimize some common usage patterns of atomic built-ins __sync_add_and_fetch() and __sync_sub_and_fetch. When the return value is not used (i.e. only care about the value in the memory), x86 does not have to use add to implement these. Instead, it can use add, sub, inc, dec instructions with the "lock" prefix. This is currently implemented using a bit of instruction selection trick. The issue is the target independent pattern produces one output and a chain and we want to map it into one that just output a chain. The current trick is to select it into a merge_values with the first definition being an implicit_def. The proper solution is to add new ISD opcodes for the no-output variant. DAG combiner can then transform the node before it gets to target node selection. Problem #2 is we are adding a whole bunch of x86 atomic instructions when in fact these instructions are identical to the non-lock versions. We need a way to add target specific information to target nodes and have this information carried over to machine instructions. Asm printer (or JIT) can use this information to add the "lock" prefix. llvm-svn: 77582	2009-07-30 08:33:02 +00:00
Dan Gohman	49a6f16b7c	Add a new register class to describe operands that can't be SP, due to x86 encoding restrictions. This is currently off by default because it may cause code quality regressions. This is for PR4572. llvm-svn: 77565	2009-07-30 01:56:29 +00:00
Chris Lattner	c5397abb52	fix PR4584 with a trivial patch now that the pieces are in place. llvm-svn: 77434	2009-07-29 05:20:33 +00:00
Eric Christopher	dce1e4949e	Add a couple more tests for the ptest intrinsics to make sure we're grabbing them all correctly. llvm-svn: 77413	2009-07-29 00:51:15 +00:00
Eric Christopher	f7802a33ce	Add support for gcc __builtin_ia32_ptest{z,c,nzc} intrinsics. Lower to ptest instruction plus setcc. Revamp ptest instruction. Add test. llvm-svn: 77407	2009-07-29 00:28:05 +00:00
Chris Lattner	ebbbf451c9	fix testcase for previous patch. llvm-svn: 77338	2009-07-28 18:04:18 +00:00
Chris Lattner	513a36b63d	Fix PR4639, a ELF-TLS regression from some of my refactoring. llvm-svn: 77336	2009-07-28 17:57:51 +00:00
Chris Lattner	57af4ece60	update testcase. llvm-svn: 77192	2009-07-27 15:52:58 +00:00
Chris Lattner	8e58bc9ed4	put normal data into .data instead of .data.rel on elf systems. llvm-svn: 77116	2009-07-26 03:06:11 +00:00
Chris Lattner	397792d981	finish simplifying DarwinTargetAsmInfo::SelectSectionForGlobal for now. Make the section switching directives more consistent by not including \n and including \t for them all. llvm-svn: 77107	2009-07-26 01:24:18 +00:00
Chris Lattner	5b42b45fb9	simplify DarwinTargetAsmInfo::SelectSectionForGlobal a bit and make it more aggressive, we now put: const int G2 __attribute__((weak)) = 42; into the text (readonly) segment like gcc, previously we put it into the data (readwrite) segment. llvm-svn: 77104	2009-07-26 00:51:36 +00:00
Chris Lattner	2de9510572	add the most expedient hack to fix PR4619, along with a testcase. Thanks to Rafael for the great example. llvm-svn: 77083	2009-07-25 17:57:37 +00:00
Evan Cheng	3b5791f982	I've lost my mind. PR4572 has not been fixed. llvm-svn: 77031	2009-07-25 01:11:46 +00:00
Evan Cheng	01740ab57b	Forgot this test earlier. llvm-svn: 77007	2009-07-24 22:42:45 +00:00
Eric Christopher	fae639c9ad	Move insertps tests to sse41 combo test file, convert to filecheck format and add an extract/insert test. llvm-svn: 76994	2009-07-24 19:24:26 +00:00
Chris Lattner	dc13b7c637	merge one more sse41 test into sse41.ll llvm-svn: 76853	2009-07-23 04:49:39 +00:00
Chris Lattner	70d5783535	merge another sse41 test into sse41.ll llvm-svn: 76852	2009-07-23 04:43:48 +00:00
Chris Lattner	08fc6e6e40	merge sse41-pmovx.ll into sse41.ll llvm-svn: 76850	2009-07-23 04:39:09 +00:00
Chris Lattner	b9cdd3153c	change a test to run in filecheck style. Rename it to be a general dumping ground of various SSE4.1 tests, since filecheck can reasonably handle them all in one file. Generalize it to check x86-64 stuff as well since it has a different ABI (a convenient way to test both the reg and mem forms of these instructions). llvm-svn: 76848	2009-07-23 04:33:02 +00:00
Eric Christopher	b1b77ca862	Support insertps via the intrinsic and add a couple of simple testcases to make sure it's being generated. llvm-svn: 76843	2009-07-23 02:22:41 +00:00
Eric Christopher	327cb795a1	Add test for pinsrd and pinsrb instructions. llvm-svn: 76840	2009-07-23 01:58:04 +00:00
Dan Gohman	824ab40381	x86 isel tweak: use lea (%reg,%reg) instead of lea (,%reg,2). llvm-svn: 76817	2009-07-22 23:26:55 +00:00
Dan Gohman	c510293251	Make the grep line in this test more specific, to avoid unintended matches. llvm-svn: 76802	2009-07-22 22:02:42 +00:00
Evan Cheng	332a6590ae	Remove a big test case. llvm-svn: 76669	2009-07-21 22:52:04 +00:00
Evan Cheng	07a6ac6b29	Another rewriter bug exposed by recent coalescer changes. ReuseInfo::GetRegForReload() should make sure the "switched" register is in the desired register class. I'm surprised this hasn't caused more failures in the past. llvm-svn: 76558	2009-07-21 09:15:00 +00:00
Evan Cheng	a7bb55ebb6	Fix a dagga combiner bug: avoid creating illegal constant. Is this really a winning transformation? fold (shl (srl x, c1), c2) -> (shl (and x, (shl -1, c1)), (sub c2, c1)) or (srl (and x, (shl -1, c1)), (sub c1, c2)) llvm-svn: 76535	2009-07-21 05:40:15 +00:00
Evan Cheng	9a47392f2e	Cross RC coalescing is now on by default. llvm-svn: 76519	2009-07-21 00:22:59 +00:00
Evan Cheng	027d9f93ea	Fix some sub-reg coalescing bugs where the coalescer wasn't updating the resulting interval's register class. llvm-svn: 76458	2009-07-20 19:47:55 +00:00
Dan Gohman	33a3fd0b9c	Revert the addition of hasNoPointerOverflow to GEPOperator. Getelementptrs that are defined to wrap are virtually useless to optimization, and getelementptrs that are undefined on any kind of overflow are too restrictive -- it's difficult to ensure that all intermediate addresses are within bounds. I'm going to take a different approach. Remove a few optimizations that depended on this flag. llvm-svn: 76437	2009-07-20 17:43:30 +00:00
Chris Lattner	58f9bb2ccd	implement a new magic global "llvm.compiler.used" which is like llvm.used, but doesn't cause ".no_dead_strip" to be emitted on darwin. llvm-svn: 76399	2009-07-20 06:14:25 +00:00
Jakob Stoklund Olesen	aba695c7d0	Fix http://llvm.org/bugs/show_bug.cgi?id=4583 Inline asm instructions may have additional <imp-def,kill> register operands. These operands are not marked with a flag like the normal asm operands, so we must not assert that there is a flag. llvm-svn: 76373	2009-07-19 19:09:59 +00:00
Evan Cheng	090db9b7a9	Catch more coalescing opportunities. llvm-svn: 76282	2009-07-18 04:52:23 +00:00
Evan Cheng	e20cbf3068	Enable cross register class coalescing. llvm-svn: 76281	2009-07-18 02:10:10 +00:00
Evan Cheng	a776067d3f	Fix pr4552. Stack slot coloring with register must take care not to generate illegal ams. llvm-svn: 76258	2009-07-17 22:42:51 +00:00
Evan Cheng	18fe458103	Fix x86 inline ams 'q' constraint support. In 32-bit mode, it's just like 'Q', i.e. EAX, EDX, ECX, EBX. In 64-bit mode, it just means all the i64r registers. Yeah, that makes sense. llvm-svn: 76248	2009-07-17 22:13:25 +00:00
Chris Lattner	52d436e98b	rename test. llvm-svn: 76197	2009-07-17 18:05:55 +00:00
Dale Johannesen	c4148c4ec7	Assume an inline asm might be a call, so we get stack alignment right when it is. This is not ideal but conservatively correct. Adjust a test to compensate for changed stack offset value. gcc.apple/asm-block-57.c llvm-svn: 76120	2009-07-16 22:34:45 +00:00
Evan Cheng	357645efad	Changed my mind. We now allow remat of instructions whose defs have subreg indices. llvm-svn: 76100	2009-07-16 20:15:00 +00:00
Evan Cheng	fdd0eb4011	With recent MC changes, RIP base register is explicitly modeled. Make sure we add it when x86 V_SET0 / V_SETALLONES (by transforming it into a constpool load) into the use instruction. llvm-svn: 76094	2009-07-16 18:44:05 +00:00
Evan Cheng	84517443ca	Let callers decide the sub-register index on the def operand of rematerialized instructions. Avoid remat'ing instructions whose def have sub-register indices for now. It's just really really hard to get all the cases right. llvm-svn: 75900	2009-07-16 09:20:10 +00:00
Evan Cheng	43229fb489	ShortenDeadCopySrcLiveRange needs to be more conservative in multi-kill situations. llvm-svn: 75838	2009-07-15 21:39:50 +00:00
Chris Lattner	d7fec20cba	convert to filecheck style, simplify RUN line, and add comment. llvm-svn: 75667	2009-07-14 19:49:11 +00:00
Chris Lattner	8c9a96b966	Reapply my previous asmprinter changes now with more testing and two additional bug fixes: 1. The bug that everyone hit was a problem in the asmprinter where it would remove $stub but keep the L prefix on a name when emitting the indirect symbol. This is easy to fix by keeping the name of the stub and the name of the symbol in a StringMap instead of just keeping a StringSet and trying to reconstruct it late. 2. There was a problem printing the personality function. The current logic to print out the personality function from the DWARF information is a bit of a cesspool right now that duplicates a bunch of other logic in the asm printer. The short version of it is that it depends on emitting both the L and _ prefix for symbols (at least on darwin) and until I can untangle it, it is best to switch the mangler back to emitting both prefixes. llvm-svn: 75646	2009-07-14 18:17:16 +00:00
Daniel Dunbar	966932ccb7	Revert r75610 (and r75620, which was blocking the revert), in the hopes of unbreaking llvm-gcc (on Darwin). --- Reverse-merging r75620 into '.': U include/llvm/Support/Mangler.h --- Reverse-merging r75610 into '.': U test/CodeGen/X86/loop-hoist.ll G include/llvm/Support/Mangler.h U lib/Target/X86/AsmPrinter/X86ATTAsmPrinter.cpp U lib/VMCore/Mangler.cpp llvm-svn: 75636	2009-07-14 15:57:55 +00:00
Chris Lattner	774f2a2d51	Change the X86 asmprinter to use the mangler to apply suffixes like "$non_lazy_ptr" to symbols instead of doing it with "printSuffixedName". This gets us to the point where there is a real separation between computing a symbol name and printing it, something I need for MC printer stuff. This patch also fixes a corner case bug where unnamed private globals wouldn't get the private label prefix. Next up, rename all uses of getValueName -> getMangledName for better greppability, and then tackle the ppc/arm backends to eliminate "printSuffixedName". llvm-svn: 75610	2009-07-14 06:04:35 +00:00
Chris Lattner	f34815b32f	Change the internal interface to makeNameProper to take a bool that indicates whether the label is private or not, instead of taking prefix stuff. One effect of this is that symbols will be generated with just the private prefix, instead of both the private prefix and the user-label-prefix, but this doesn't matter as long as it is consistent. For example we'll now get "Lfoo" instead of "L_foo". These are just assembler temporary labels anyway, so they never even make it into the .o file. llvm-svn: 75607	2009-07-14 04:50:12 +00:00
Bill Wendling	e604b776a7	Check for the correct unnamed name. llvm-svn: 75573	2009-07-14 00:53:58 +00:00
Chris Lattner	ec8efcb44e	Two changes: 1) unique globals with the existing "Count" local in Mangler, not with atomic nonsense. Using atomics will give us nondeterminstic output from the compiler when using multiple threads, which is bad. 2) Do not mangle an unknown global name with a type suffix. We don't need this anymore now that llvm ir doesn't have type planes. llvm-svn: 75541	2009-07-13 22:48:46 +00:00
Chris Lattner	f39f55d46c	add nounwind llvm-svn: 75407	2009-07-12 00:46:16 +00:00
Nick Lewycky	d57fb023e0	Darwin prepends an _ to internal globals, Linux doesn't. llvm-svn: 75405	2009-07-11 23:48:59 +00:00
Chris Lattner	38df005e12	fix x86-64 static codegen to materialize the address of a global with movl instead of lea. It is better for code size (and presumably efficiency) to use: movl $foo, %eax rather than: leal foo, eax Both give a nice zero extending "move immediate" instruction, the former is just smaller. Note that global addresses should be handled different by the x86 backend, but I chose to follow the style already in place and add more fixme's. llvm-svn: 75403	2009-07-11 23:17:29 +00:00
Chris Lattner	056dfc6f90	this test was incorrect for x86-64 static. It passed on darwin, because darwin doesn't have static x86-64 mode. llvm-svn: 75392	2009-07-11 22:30:05 +00:00
Chris Lattner	e91900097e	Fix PR4533, which is about buggy codegen in x86-64 -static mode. Basically, using: lea symbol(%rip), %rax is not valid in -static mode, because the current RIP may not be within 32-bits of "symbol" when an app is built partially pic and partially static. The fix for this is to compile it to: lea symbol, %rax It would be better to codegen this as: movq $symbol, %rax but that will come next. The hard part of fixing this bug was fixing abi-isel, which was actively testing for the wrong behavior. Also, the RUN lines are completely impossible to understand what they are testing. To help with this, convert the -static x86-64 codegen tests to use filecheck. This is much more stable and makes it more clear what the codegen is expected to be. llvm-svn: 75382	2009-07-11 20:29:19 +00:00
Chris Lattner	20adc670b2	We get the P modifier wrong in a lot of cases, just add some more rigorous testing. In addition to fixing this, I still need to do some more testing on darwin. llvm-svn: 75362	2009-07-11 08:30:22 +00:00
Eli Friedman	2b77eef160	Make EXTRACT_VECTOR_ELT a bit more flexible in terms of the returned value. Adjust other code to deal with that correctly. Make DAGTypeLegalizer::PromoteIntRes_EXTRACT_VECTOR_ELT take advantage of this new flexibility to simplify the code and make it deal with unusual vectors (like <4 x i1>) correctly. Fixes PR3037. llvm-svn: 75176	2009-07-09 22:01:03 +00:00
Evan Cheng	7452c968e4	Targets sometimes assign fixed stack object to spill certain callee-saved registers based on dynamic conditions. For example, X86 EBP/RBP, when used as frame register has to be spilled in the first fixed object. It should inform PEI this so it doesn't get allocated another stack object. Also, it should not be spilled as other callee-saved registers but rather its spilling and restoring are being handled by emitPrologue and emitEpilogue. Avoid spilling it twice. llvm-svn: 75116	2009-07-09 06:53:48 +00:00
Chris Lattner	44f6bcfa0b	remove eh, convert to FileCheck style llvm-svn: 75087	2009-07-09 01:07:22 +00:00
Chris Lattner	212f44d180	we have no tests for dllimport/export. Add one. llvm-svn: 75085	2009-07-09 00:53:44 +00:00
Chris Lattner	ade55bc8dd	* add some assertions for sanity checking. * remove some old code that was needed when we'd put ESP in the scale instead of the base of some instructions. * Fix a bug with the P modifier in inline asm that caused us to drop it. llvm-svn: 75077	2009-07-09 00:27:29 +00:00
Chris Lattner	da0cf0b134	add a test for dale's recent change. llvm-svn: 75074	2009-07-09 00:00:16 +00:00
Chris Lattner	f65129cb6a	switch test to FileCheck-style and test the P and non-P cases. llvm-svn: 75071	2009-07-08 23:44:06 +00:00
Chris Lattner	334e561e35	rename a test to make it a feature test. llvm-svn: 75070	2009-07-08 23:40:57 +00:00
Chris Lattner	d5ffa8ffb6	add some more check for vector compares. llvm-svn: 75024	2009-07-08 18:51:25 +00:00
Chris Lattner	072198a2a1	convert a test to "FileCheck" style. llvm-svn: 75023	2009-07-08 18:48:24 +00:00
Chris Lattner	04bf64d43c	eliminate the v[if]cmp versions of these tests, now that [if]cmp+sext works. llvm-svn: 74980	2009-07-08 00:49:35 +00:00
Chris Lattner	4ac607332d	dag combine sext(setcc) -> vsetcc before legalize. To make this safe, VSETCC must define all bits, which is different than it was documented to before. Since all targets that implement VSETCC already have this behavior, and we don't optimize based on this, just change the documentation. We now get nice code for vec_compare.ll llvm-svn: 74978	2009-07-08 00:31:33 +00:00
Chris Lattner	fc74e8241a	add support for legalizing an icmp where the result is illegal (4xi1) but the input is legal (4 x i32) llvm-svn: 74964	2009-07-07 23:03:54 +00:00
Chris Lattner	cbbf747b7b	add a trivial test that vector compares work. llvm-svn: 74963	2009-07-07 22:51:09 +00:00
Chris Lattner	30220d8f98	implement support for spliting and scalarizing vector setcc's. This finishes off enough support for vector compares to get the icmp/fcmp version of 2008-07-23-VSetCC.ll passing. llvm-svn: 74961	2009-07-07 22:47:46 +00:00
Chris Lattner	87d4f309f5	verify that the fcmp version of this works just as well as the vfcmp version. We actually get better code for this silly testcase. llvm-svn: 74954	2009-07-07 22:07:47 +00:00
Evan Cheng	ba2410b7ca	Avoid adding a duplicate def. This fixes PR4478. llvm-svn: 74857	2009-07-06 21:34:05 +00:00
Chris Lattner	87bb642676	@GOTPCREL is also rip-relative. Fix fast-isel to do the right thing. This fixes an llvm-gcc bootstrap problem I introduced. llvm-svn: 74691	2009-07-02 04:22:01 +00:00
Chris Lattner	d1c5951615	Fix yet-another bug I introduced into fastisel, this time handling constant pool references that weren't getting properly rip-relative. llvm-svn: 74689	2009-07-02 03:14:25 +00:00
Chris Lattner	f95fa1b721	Fix some fast-isel problems selecting global variable addressing in pic mode. llvm-svn: 74582	2009-07-01 03:27:19 +00:00
Rafael Espindola	317fd045e2	Fix PR4485. Avoid unnecessary duplication of operand 0 of X86::FpSET_ST0_80. This duplication would cause one register to remain on the stack at the function return. llvm-svn: 74534	2009-06-30 16:40:03 +00:00
Rafael Espindola	bd971ffcc6	Fix PR4484. This was caused by me confounding FP0 and ST(0). llvm-svn: 74523	2009-06-30 12:18:16 +00:00
Rafael Espindola	538064d6b1	FIX PR 4459. Not sure I understand how the temp register gets used, but this fixes a bug and introduces no regressions. llvm-svn: 74446	2009-06-29 20:29:59 +00:00
Chris Lattner	9876bd8257	factor some logic out into a helper function, allow remat of loads from constant globals. This implements remat-constant.ll even without aggressive-remat. llvm-svn: 74373	2009-06-27 04:38:55 +00:00
Chris Lattner	fea81da433	Reimplement rip-relative addressing in the X86-64 backend. The new implementation primarily differs from the former in that the asmprinter doesn't make a zillion decisions about whether or not something will be RIP relative or not. Instead, those decisions are made by isel lowering and propagated through to the asm printer. To achieve this, we: 1. Represent RIP relative addresses by setting the base of the X86 addr mode to X86::RIP. 2. When ISel Lowering decides that it is safe to use RIP, it lowers to X86ISD::WrapperRIP. When it is unsafe to use RIP, it lowers to X86ISD::Wrapper as before. 3. This removes isRIPRel from X86ISelAddressMode, representing it with a basereg of RIP instead. 4. The addressing mode matching logic in isel is greatly simplified. 5. The asmprinter is greatly simplified, notably the "NotRIPRel" predicate passed through various printoperand routines is gone now. 6. The various symbol printing routines in asmprinter now no longer infer when to emit (%rip), they just print the symbol. I think this is a big improvement over the previous situation. It does have two small caveats though: 1. I implemented a horrible "no-rip" modifier for the inline asm "P" constraint modifier. This is a short term hack, there is a much better, but more involved, solution. 2. I had to xfail an -aggressive-remat testcase because it isn't handling the use of RIP in the constant-pool reading instruction. This specific test is easy to fix without -aggressive-remat, which I intend to do next. llvm-svn: 74372	2009-06-27 04:16:01 +00:00
Chris Lattner	df92e147c9	remove some unneeded eh info. llvm-svn: 74371	2009-06-27 04:07:31 +00:00
Chris Lattner	de36afc1fe	testcase for PR4466 llvm-svn: 74367	2009-06-27 01:33:35 +00:00
Dan Gohman	d3b930d426	Add some testcases for some of the recent ScalarEvolution bug fixes. llvm-svn: 74353	2009-06-26 22:54:11 +00:00
Chris Lattner	b5c2639f83	remove unwind info, add test for asmprinting of jump table labels with (%rip) llvm-svn: 74337	2009-06-26 22:16:49 +00:00
Evan Cheng	07b016856d	Add x86 support for 'n' inline asm modifier. This will be handled target independently as part of MC work. llvm-svn: 74336	2009-06-26 22:00:19 +00:00
Chris Lattner	a4194b1082	down with unwind info :) llvm-svn: 74206	2009-06-25 21:48:17 +00:00
Chris Lattner	01d5049dc2	unwind info not needed. llvm-svn: 74112	2009-06-24 19:48:04 +00:00
Evan Cheng	38f2453817	Fix support for inline asm input / output operand tying when operand spans across multiple registers (e.g. two i64 operands in 32-bit mode). llvm-svn: 74053	2009-06-24 02:05:51 +00:00
Dan Gohman	f19aeec3f5	Extend ScalarEvolution's multiple-exit support to compute exact trip counts in more cases. Generalize ScalarEvolution's isLoopGuardedByCond code to recognize And and Or conditions, splitting the code out into an isNecessaryCond helper function so that it can evaluate Ands and Ors recursively, and make SCEVExpander be much more aggressive about hoisting instructions out of loops. test/CodeGen/X86/pr3495.ll has an additional instruction now, but it appears to be due to an arbitrary register allocation difference. llvm-svn: 74048	2009-06-24 01:18:18 +00:00
Rafael Espindola	6ead59f8ed	Fix PR4185. Handle FpSET_ST0_80 being used when ST0 is still alive. llvm-svn: 73850	2009-06-21 12:02:51 +00:00
Chris Lattner	7d2b049404	change TLS_ADDR lowering to lower to a real mem operand, instead of matching as a global with that gets printed with the :mem modifier. All operands to lea's should be handled with the lea32mem operand kind, and this allows the TLS stuff to do this. There are several better ways to do this, but I went for the minimal change since I can't really test this (beyond make check). This also makes the use of EBX explicit in the operand list in the 32-bit, instead of implicit in the instruction. llvm-svn: 73834	2009-06-20 20:38:48 +00:00
Chris Lattner	1771a852f0	no need for unwind info llvm-svn: 73832	2009-06-20 19:48:26 +00:00
Chris Lattner	fbc9778a1b	no need for unwind info here. llvm-svn: 73831	2009-06-20 19:43:09 +00:00
Dan Gohman	cc31110b95	Re-apply r73718, now that the fix in r73787 is in, and add a hand-crafted testcase which demonstrates the bug that was exposed in 254.gap. llvm-svn: 73793	2009-06-19 23:23:27 +00:00
Evan Cheng	86076c9e30	Revert 73718. It's breaking 254.gap. llvm-svn: 73783	2009-06-19 21:15:06 +00:00
Eli Friedman	2fc939c809	Fix for PR2484: add an SSE1 pattern for a shuffle we normally prefer to handle with an SSE2 instruction. llvm-svn: 73760	2009-06-19 07:00:55 +00:00
Evan Cheng	de9e36a74e	On Darwin, ams printer should output a second label before a jump table so the linker knows it's a new atom. But this is only needed if the jump table is put in a separate section from the function body. llvm-svn: 73720	2009-06-18 20:37:15 +00:00
Dan Gohman	8c9ac59455	Generalize LSR's OptimizeSMax to handle unsigned max tests as well as signed max tests. Along with r73717, this helps CodeGen avoid emitting code for a maximum operation for this class of loop. llvm-svn: 73718	2009-06-18 20:23:18 +00:00
Dan Gohman	a0348809b6	Remove the code from IVUsers that attempted to handle casted induction variables in cases where the cast isn't foldable. It ended up being a pessimization in many cases. This could be fixed, but it would require a bunch of complicated code in IVUsers' clients. The advantages of this approach aren't visible enough to justify it at this time. llvm-svn: 73706	2009-06-18 16:54:06 +00:00
Eli Friedman	abfad5d61e	Add some generic expansion logic for SMULO and UMULO. Fixes UMULO support for x86, and UMULO/SMULO for many architectures, including PPC (PR4201), ARM, and Cell. The resulting expansion isn't perfect, but it's not bad. llvm-svn: 73477	2009-06-16 06:58:29 +00:00
Dan Gohman	8e85118943	Update this test to use fmul instead of mul. llvm-svn: 73436	2009-06-15 22:49:34 +00:00
Bill Wendling	20f0adfc0e	This test is failing. Revert for now. llvm-svn: 73404	2009-06-15 19:10:56 +00:00
Bill Wendling	66e104cd11	Add another testcase for r71478. llvm-svn: 73399	2009-06-15 18:36:34 +00:00
Arnold Schwaighofer	cb9046cfc8	CheckTailCallReturnConstraints is missing a check on the incomming chain of the RETURN node. The incomming chain must be the outgoing chain of the CALL node. This causes the backend to identify tail calls that are not tail calls. This patch fixes this. llvm-svn: 73387	2009-06-15 14:43:36 +00:00
Arnold Schwaighofer	e3a018d707	Fix Bug 4278: X86-64 with -tailcallopt calling convention out of sync with regular cc. The only difference between the tail call cc and the normal cc was that one parameter register - R9 - was reserved for calling functions through a function pointer. After time the tail call cc has gotten out of sync with the regular cc. We can use R11 which is also caller saved but not used as parameter register for potential function pointers and remove the special tail call cc on x86-64. llvm-svn: 73233	2009-06-12 16:26:57 +00:00
Eli Friedman	0b387fbd1b	Fix the run-line for this test to work correctly outside of x86. llvm-svn: 73025	2009-06-07 09:44:19 +00:00
Eli Friedman	516479d6e7	Tweak the expansion code for BIT_CONVERT to generate better code converting from an MMX vector to an i64. llvm-svn: 73024	2009-06-07 09:41:57 +00:00
Eli Friedman	3234587213	Slightly generalize the code that handles shuffles of consecutive loads on x86 to handle more cases. Fix a bug in said code that would cause it to read past the end of an object. Rewrite the code in SelectionDAGLegalize::ExpandBUILD_VECTOR to be a bit more general. Remove PerformBuildVectorCombine, which is no longer necessary with these changes. In addition to simplifying the code, with this change, we can now catch a few more cases of consecutive loads. llvm-svn: 73012	2009-06-07 06:52:44 +00:00
Eli Friedman	c61e357aa6	Fix the expansion for CONCAT_VECTORS so that it doesn't create illegal types. llvm-svn: 72993	2009-06-06 07:08:26 +00:00
Eli Friedman	75c496f920	Avoid crashing on a variable-index insertelement with element type i16. llvm-svn: 72991	2009-06-06 06:32:50 +00:00
Eli Friedman	1b1844ad1f	Get rid of some bogus patterns for X86vzmovl. Don't create VZEXT_MOVL nodes for vectors with an i16 element type. Add an optimization for building a vector which is all zeros/undef except for the bottom element, where the bottom element is an i8 or i16. llvm-svn: 72988	2009-06-06 06:05:10 +00:00
Eli Friedman	868bd6ab52	Fix an obvious typo. llvm-svn: 72987	2009-06-06 05:55:37 +00:00
Eli Friedman	6c101ebfa8	Get rid of a bogus pattern that interferes with optimization. llvm-svn: 72985	2009-06-06 04:17:04 +00:00
Eli Friedman	b45e8ce69a	PR2598: make sure to expand illegal forms of integer/floating-point conversions for x86, like <2 x i32> -> <2 x float> and <4 x i16> -> <4 x float>. llvm-svn: 72983	2009-06-06 03:57:58 +00:00
Nate Begeman	624690c6b2	Adapt the x86 build_vector dagcombine to the current state of the legalizer. build vectors with i64 elements will only appear on 32b x86 before legalize. Since vector widening occurs during legalize, and produces i64 build_vector elements, the dag combiner is never run on these before legalize splits them into 32b elements. Teach the build_vector dag combine in x86 back end to recognize consecutive loads producing the low part of the vector. Convert the two uses of TLI's consecutive load recognizer to pass LoadSDNodes since that was required implicitly. Add a testcase for the transform. Old: subl $28, %esp movl 32(%esp), %eax movl 4(%eax), %ecx movl %ecx, 4(%esp) movl (%eax), %eax movl %eax, (%esp) movaps (%esp), %xmm0 pmovzxwd %xmm0, %xmm0 movl 36(%esp), %eax movaps %xmm0, (%eax) addl $28, %esp ret New: movl 4(%esp), %eax pmovzxwd (%eax), %xmm0 movl 8(%esp), %eax movaps %xmm0, (%eax) ret llvm-svn: 72957	2009-06-05 21:37:30 +00:00

... 3 4 5 6 7 ...

1594 Commits