llvm-project

Commit Graph

Author	SHA1	Message	Date
Chris Lattner	9c31b0c695	remove MAI::ZeroDirectiveSuffix, which is only used by MASM, which we don't support anymore. llvm-svn: 93886	2010-01-19 18:37:01 +00:00
Jim Grosbach	04770f2aa1	For aligned load/store instructions, it's only required to know whether a function can support dynamic stack realignment. That's a much easier question to answer at instruction selection stage than whether the function actually will have dynamic alignment prologue. This allows the removal of the stack alignment heuristic pass, and improves code quality for cases where the heuristic would result in dynamic alignment code being generated when it was not strictly necessary. llvm-svn: 93885	2010-01-19 18:31:11 +00:00
Devang Patel	1083b5fc3f	Avoid including DebugInfo.h in AsmPrinter.h llvm-svn: 93864	2010-01-19 06:09:04 +00:00
Chris Lattner	c7a062d187	Now that we have everything nicely factored (e.g. asmprinter is not doing global variable classification anymore) and hookized, sink almost all target targets global variable emission code into AsmPrinter and out of each target. Some notes: 1. PIC16 does completely custom and crazy stuff, so it is not changed. 2. XCore has some custom handling for extra directives. I'll look at it next. 3. This switches linux/ppc to use .globl instead of .global. If .globl is actually wrong, let me know and I'll fix it. 4. This makes linux/ppc get a lot of random cases right which were obviously wrong before, it is probably now a bit healthier. 5. Blackfin will probably start getting .comm and other things that it didn't before. If this is undesirable, it should explicitly opt out of these things by clearing the relevant fields of MCAsmInfo. This leads to a nice diffstat: 14 files changed, 127 insertions(+), 830 deletions(-) llvm-svn: 93858	2010-01-19 05:38:33 +00:00
Chris Lattner	b1f2934fc7	hookize the cygwin ".linkonce" directive. llvm-svn: 93855	2010-01-19 05:08:13 +00:00
Chris Lattner	826d77fb07	more cleanups. Emit the .local directive even on cygwin/mingw. I'm not sure that this is correct, but it causes no test failures, and just emitting a .comm without protecting its linkage somehow is surely not right. llvm-svn: 93854	2010-01-19 04:59:55 +00:00
Chris Lattner	f8a128a1a8	some cleanups llvm-svn: 93853	2010-01-19 04:53:18 +00:00
Chris Lattner	dccbb28bca	add a bool for whether .lcomm takes an alignment instead of basing this on "isdarwin". llvm-svn: 93852	2010-01-19 04:48:20 +00:00
Chris Lattner	6a160517a0	hoist handling of external globals and special globals up to common code. This makes a similar code dead in all the other targets, I'll clean it up in a bit. This also moves handling of lcomm up before acquisition of a section, since lcomm never needs a section. llvm-svn: 93851	2010-01-19 04:39:15 +00:00
Chris Lattner	e9d28b19cf	move production of .reference directives for static ctor/dtor list on darwin into common code. llvm-svn: 93849	2010-01-19 04:34:02 +00:00
Chris Lattner	35474ca4c4	use BSSLocal classifier to identify 'lcomm' data instead of duplicating the logic (differently) in lots of different targets. llvm-svn: 93847	2010-01-19 04:21:20 +00:00
Chris Lattner	70f0c33ec8	now that elf weak bss symbols are handled correctly, simplify a bunch of code. llvm-svn: 93845	2010-01-19 03:13:44 +00:00
Chris Lattner	a986aa33eb	fix a significant difference between llvm and gcc on ELF systems: GCC would put weak zero initialized mutable data in the .bss section, we would put it into a crazy '.gnu.linkonce.b.test,"aw",@nobits' section. Fixing this will allow simplifications next up. llvm-svn: 93844	2010-01-19 03:06:01 +00:00
Chris Lattner	5b585f8b1a	introduce a section kind for common linkage. Use this to slightly simplify and commonize some of the asmprinter logic for globals. This also avoids printing the MCSection for .zerofill, which broke the llvm-gcc build. llvm-svn: 93843	2010-01-19 02:48:26 +00:00
Bill Wendling	220c29465e	Even more explanation. llvm-svn: 93841	2010-01-19 02:44:01 +00:00
Chris Lattner	aa1f4fd0ed	change an accessor to a predicate. llvm-svn: 93839	2010-01-19 02:13:06 +00:00
Chris Lattner	1d371882b6	Cleanup handling of .zerofill on darwin: 1. TargetLoweringObjectFileMachO should decide if something goes in zerofill instead of having every target do it. 2. TargetLoweringObjectFileMachO should assign said symbols to the right MCSection, the asmprinters should just emit to the right section. 3. Since all zerofill stuff goes through mcstreamer anymore, MAI can have a bool "haszerofill" instead of having the textual directive to emit. llvm-svn: 93838	2010-01-19 02:09:44 +00:00
Dale Johannesen	a3db6ef9a2	Revert 93811 per request. llvm-svn: 93818	2010-01-19 00:10:52 +00:00
Dale Johannesen	0c90d43b70	Enable code to emit dbg.declare as DEBUG_VALUE comments (fast isel, X86). This doesn't seem to break any functionality, but will introduce cases where -g affects the generated code. I'll be fixing that. llvm-svn: 93811	2010-01-18 23:34:55 +00:00
Bill Wendling	c592725fbb	- Add getLSDAEncoding to the PowerPC backend. - Greatly improve the comments to the getLSDAEncoding method. llvm-svn: 93796	2010-01-18 22:36:35 +00:00
Eric Christopher	7eb6e0ffd6	Have FastISel handle llvm.trap(). llvm-svn: 93781	2010-01-18 22:11:29 +00:00
Bill Wendling	748ceca695	Add FIXME comment. llvm-svn: 93755	2010-01-18 19:47:53 +00:00
Bill Wendling	a73e471c62	- Add a comment to the callback indicating that it's extremely not a good idea, but unfortunately necessary. - Default to using 4-bytes for the LSDA pointer encoding to agree with the encoded value in the CIE. llvm-svn: 93753	2010-01-18 19:36:27 +00:00
Chris Lattner	1d8b954b43	switch x86 zerofill emission over to use MCStreamer. llvm-svn: 93702	2010-01-18 01:21:08 +00:00
Chris Lattner	8c21ffdcc6	Change CurrentFnSym to be a non-const pointer since asmprinter mutates it as it emits code. Switch .globl directives to use OutStreamer instead of doing it textually (in x86) llvm-svn: 93700	2010-01-18 00:59:24 +00:00
Chris Lattner	c8f7717808	remove the MAI argument to MCExpr::print and switch everything to use << when printing them. llvm-svn: 93699	2010-01-18 00:37:40 +00:00
Chris Lattner	fae53f0c61	unbreak x86 jump tables with my previous patch. llvm-svn: 93698	2010-01-18 00:21:06 +00:00
Chris Lattner	8b5d55ed06	now that MCSymbol::print doesn't use its MAI argument, we can remove it and change all the code that prints MCSymbols to use << instead, which is much simpler and cleaner. llvm-svn: 93695	2010-01-17 21:43:43 +00:00
Chris Lattner	f62e3ee8c5	move the mangler into libtarget from vmcore. llvm-svn: 93664	2010-01-16 21:57:06 +00:00
Chris Lattner	555ceabe64	rename GetPrivateGlobalValueSymbolStub -> GetSymbolWithGlobalValueBase, and add an explicit ForcePrivate argument. Switch FunctionEHFrameInfo to be MCSymbol based instead of string based. llvm-svn: 93646	2010-01-16 18:37:32 +00:00
Bill Wendling	bf5cfa1a41	Retrying r91337: The CIE says that the LSDA point in the FDE section is an "sdata4". That's fine, but we need it to actually be 4-bytes in the FDE for some platforms. Allow individual platforms to decide for themselves. llvm-svn: 93616	2010-01-16 01:40:55 +00:00
Chris Lattner	94d91a5b30	eliminate uses of mangler and simplify code. llvm-svn: 93615	2010-01-16 01:40:07 +00:00
Chris Lattner	8a38fc33b0	eliminate uses of deprecated mangler apis llvm-svn: 93605	2010-01-16 01:00:27 +00:00
Chris Lattner	e6b1bef33a	switch X86 target off CurFunctionName and MCIze more. Note that the code wasn't calling DecorateCygMingName when emitting the ".ascii -export" stuff at the end of file for DLLExported functions. I don't know if it should or not, but I'm preserving behavior. llvm-svn: 93603	2010-01-16 00:51:39 +00:00
Chris Lattner	274c0c0db3	MCize tis, and make it keep CurrentFnSym up to date with CurrentFnName. llvm-svn: 93598	2010-01-16 00:32:38 +00:00
Chris Lattner	c6d3d82798	revert the x86 part of my last patch, cygwin is mutating CurrentFnName! llvm-svn: 93595	2010-01-16 00:24:20 +00:00
Chris Lattner	719e908e7c	MCize a bunch more stuff, eliminating a lot of uses of the mangler and CurrentFnName. llvm-svn: 93594	2010-01-16 00:21:18 +00:00
Dale Johannesen	cb7554a3ab	Adjust some comments per review. llvm-svn: 93580	2010-01-15 23:29:29 +00:00
David Greene	b0c0e6433f	Fix PR6019. A load has more than one use if it feeds a bitconvert that has more than one use. llvm-svn: 93576	2010-01-15 23:23:41 +00:00
Dale Johannesen	188fa96cf9	DEBUG_VALUE is now variable sized, as it has a target-dependent memory address representation in it. Restore X86 printing of DEBUG_VALUE; lowering is done in X86RegisterInfo using the normal algorithm. llvm-svn: 93565	2010-01-15 22:22:35 +00:00
Dan Gohman	d2968c4c12	Fix a typo that Anton noticed. llvm-svn: 93563	2010-01-15 22:18:15 +00:00
Chris Lattner	e17df0b7f0	fix a bug in range information for $42, eliminate an unneeded argument from ParseExpression. llvm-svn: 93536	2010-01-15 19:39:23 +00:00
Chris Lattner	015cfb1577	add range information for mem X86Operand's, now all X86Operand's have range info. llvm-svn: 93535	2010-01-15 19:33:43 +00:00
Chris Lattner	528d00b913	extend MCAsmParser::ParseExpression and ParseParenExpression to return range information for subexpressions. Use this to provide range info for several new X86Operands. llvm-svn: 93534	2010-01-15 19:28:38 +00:00
Chris Lattner	86e6153382	give X86Operand a ctor and start passing SMLoc's into it. llvm-svn: 93532	2010-01-15 19:06:59 +00:00
Dale Johannesen	fb85dddba0	Revert 93499. After discussion with Chris we agreed FrameIndexes should be lowered, but the same way as everything else (target dependent) rather than in a special hacked way. The lowering needs to be done for eventual purposes of Dwarf generation. llvm-svn: 93530	2010-01-15 18:58:14 +00:00
Chris Lattner	0c2538fee2	add range location info for registers, change X86Operand::Create* implementations to avoid copy ctor use. llvm-svn: 93528	2010-01-15 18:51:29 +00:00
Chris Lattner	a2bbb7cbc6	clean up the memory management of the operands. llvm-svn: 93526	2010-01-15 18:44:13 +00:00
Chris Lattner	cc2ad08a11	refactor ParseRegister to avoid using X86Operand as a temporary datastructure when parsing a mem operand. llvm-svn: 93521	2010-01-15 18:27:19 +00:00
Dale Johannesen	0e7e55da1d	Lower FrameIndex operand of DEBUG_VALUE (specially) and print it as a comment on X86. llvm-svn: 93499	2010-01-15 01:54:55 +00:00
Chris Lattner	f29c0b6880	Split the TargetAsmParser "ParseInstruction" interface in half: the new ParseInstruction method just parses and returns a list of target operands. A new MatchInstruction interface is used to turn the operand list into an MCInst. This requires new/deleting all the operands, but it also gives targets the ability to use polymorphic operands if they want to. llvm-svn: 93469	2010-01-14 22:21:20 +00:00
Chris Lattner	77fd677111	prune #includes in TargetAsmParser.h Pass in SMLoc of instr opcode into ParseInstruction. Make AsmToken be a class, not a struct. llvm-svn: 93457	2010-01-14 21:32:45 +00:00
Chris Lattner	872501b6e0	introduce the MCParsedAsmOperand class. llvm-svn: 93454	2010-01-14 21:20:55 +00:00
Chris Lattner	3eb76c23dd	this is an SSE-specific issue. llvm-svn: 93373	2010-01-13 23:29:11 +00:00
Chris Lattner	fb534d97b5	X86 if conversion + tail merging issues from PR6032. llvm-svn: 93372	2010-01-13 23:28:40 +00:00
Evan Cheng	ceb5a4e8f6	For now, avoid issuing extract_subreg to reuse lower 8-bit, it's not safe in 32-bit. llvm-svn: 93307	2010-01-13 08:01:32 +00:00
Chris Lattner	f0a401fcf0	eliminate some uses of Mangler::makeNameProper. llvm-svn: 93305	2010-01-13 07:56:59 +00:00
Chris Lattner	209aecad0c	change Mangler::makeNameProper to return its result in a SmallVector instead of returning it in an std::string. Based on this change: 1. Change TargetLoweringObjectFileCOFF::getCOFFSection to take a StringRef 2. Change a bunch of targets to call makeNameProper with a smallstring, making several of them much more efficient. 3. Rewrite Mangler::makeNameProper to not build names and then prepend prefixes, not use temporary std::strings, and to avoid other crimes. llvm-svn: 93298	2010-01-13 06:38:18 +00:00
Evan Cheng	30bebff456	Add a quick pass to optimize sign / zero extension instructions. For targets where the pre-extension values are available in the subreg of the result of the extension, replace the uses of the pre-extension value with the result + extract_subreg. For now, this pass is fairly conservative. It only performs the replacement when both the pre- and post- extension values are used in the block. It will miss cases where the post-extension values are live, but not used. llvm-svn: 93278	2010-01-13 00:30:23 +00:00
Evan Cheng	08557ef5f4	Eliminate or_not_add and just use AddedComplexity so isel tries or_is_add patterns first. llvm-svn: 93245	2010-01-12 18:31:19 +00:00
Duncan Sands	b7168c270e	Revert commit 93204, since it causes the assembler to barf on x86-64 linux with messages like this: Error: Incorrect register `%r14' used with `l' suffix llvm-svn: 93242	2010-01-12 17:46:16 +00:00
Duncan Sands	0067d6bbbe	Fix typo. llvm-svn: 93235	2010-01-12 08:30:46 +00:00
Duncan Sands	fd75e12954	Tweak commit 91745, which changed target data for both Mingw and Cygwin, to not touch Cygwin: the change caused llvm-gcc build failures due to long double getting the wrong size. Patch by Aaron Gray. llvm-svn: 93234	2010-01-12 08:21:07 +00:00
Dan Gohman	c119580307	Reapply the MOV64r0 patch, with a fix: MOV64r0 clobbers EFLAGS. llvm-svn: 93229	2010-01-12 04:42:54 +00:00
Evan Cheng	4216615f99	Add TargetInstrInfo::isCoalescableInstr. It returns true if the specified instruction is copy like where the source and destination registers can overlap. This is to be used by the coalescer to coalesce the source and destination registers of instructions like X86::MOVSX64rr32. Apparently some crazy people believe the coalescer is too simple. llvm-svn: 93210	2010-01-12 00:09:37 +00:00
Evan Cheng	42b07e9600	Add manual ISD::OR fastisel selection routines. TableGen is no longer autogen them after 93152 and 93191. llvm-svn: 93204	2010-01-11 22:59:27 +00:00
Evan Cheng	99789a7a76	Extend r93152 to work on OR r, r. If the source set bits are known not to overlap, then select as an ADD instead. llvm-svn: 93191	2010-01-11 22:03:29 +00:00
Evan Cheng	7bdf339602	Revert 93158. It's breaking quite a few x86_64 tests. llvm-svn: 93185	2010-01-11 21:13:41 +00:00
Evan Cheng	c5f8184eec	Do not turn 8-bit OR to ADD since ADD8ri is not 3-addressfiable. llvm-svn: 93182	2010-01-11 20:18:04 +00:00
Benjamin Kramer	c6fe3c3273	Reimplement getToken and SplitString as "StringRef helper functions" - getToken is modeled after StringRef::split but it can split on multiple separator chars and skips leading separators. - SplitString is a StringRef::split variant for more than 2 elements with the same behaviour as getToken. llvm-svn: 93161	2010-01-11 18:03:24 +00:00
Dan Gohman	e99a3c191e	Use a 32-bit and with implicit zero-extension instead of a 64-bit and if it has an immediate with at least 32 bits of leading zeros, to avoid needing to materialize that immediate in a register first. FileCheckize, tidy, and extend a testcase to cover this case. This fixes rdar://7527390. llvm-svn: 93160	2010-01-11 17:58:34 +00:00
Dan Gohman	3a55686345	Re-instate MOV64r0 and MOV16r0, with adjustments to work with the new AsmPrinter. This is perhaps less elegant than describing them in terms of MOV32r0 and subreg operations, but it allows the current register to rematerialize them. llvm-svn: 93158	2010-01-11 17:37:57 +00:00
Dan Gohman	f6e8369a5b	Pattern top-level operators don't need to be restricted to a single user. The _su forms are intended for non-top-level nodes. llvm-svn: 93155	2010-01-11 17:21:05 +00:00
Dan Gohman	40ea3e5ce2	Reword this comment to reference a more fundamental issue. llvm-svn: 93154	2010-01-11 17:14:46 +00:00
Evan Cheng	64d9f40557	Select an OR with immediate as an ADD if the input bits are known zero. This allow the instruction to be 3address-fied if needed. llvm-svn: 93152	2010-01-11 17:03:47 +00:00
David Greene	206351a1ff	Implement a feature (-vector-unaligned-mem) to allow targets to ignore alignment requirements for SIMD memory operands. This is useful on architectures like the AMD 10h that do not trap on unaligned references if a status bit is twiddled at startup time. llvm-svn: 93151	2010-01-11 16:29:42 +00:00
Jeffrey Yasskin	bb857e5d68	Fix http://llvm.org/PR5729 : x86-64 tail calls were putting their targets into R11, and then asserting that the target was in R9. Since R9 isn't reserved for the target anymore, and is used as an argument, this patch changes the assertion. llvm-svn: 93065	2010-01-09 18:56:43 +00:00
Evan Cheng	cc6d56bd3b	Fix a critical bug in 64-bit atomic operation lowering for 32-bit. The results of the cmpxchg8b instructions are being thrown away when it branches back to the top of the checking loop. This means the loop always compares against the old value and this can result in a dead lock. llvm-svn: 93028	2010-01-08 23:41:50 +00:00
Evan Cheng	4bb448c41b	Fix comment. llvm-svn: 93020	2010-01-08 19:14:57 +00:00
Eric Christopher	7482ad7272	After further thought revert the patch to make fast-isel avoid putting relocations into the constant pool - this isn't needed for correctness and in the rare occasion it happens would pull us out of fast isel for the block. If fast-isel application startup time ever becomes an issue we can add better support for these addresses instead of bailing. llvm-svn: 92995	2010-01-08 08:24:49 +00:00
Evan Cheng	b92f263ceb	Fix what looks to me obvious instruction definition bugs. 1. CMPXCHG8B and CMPXCHG16B did not specify implicit physical register defs and uses. 2. LCMPXCHG8B is loading 64 bit memory, not 32 bit. llvm-svn: 92985	2010-01-08 01:29:19 +00:00
Eric Christopher	e0297b9206	Remove extraneous include. llvm-svn: 92972	2010-01-08 00:05:33 +00:00
Eric Christopher	9f569bdf38	If the data requires a relocation then don't attempt to add it to the constant pool for fast-isel. We already don't add it for the normal case. llvm-svn: 92934	2010-01-07 19:45:14 +00:00
Evan Cheng	90dc43fcf5	Fix a minor regression from my dag combiner changes. One more place which needs to look past truncates. llvm-svn: 92885	2010-01-07 00:54:06 +00:00
Evan Cheng	166a4e6caa	Teach dag combine to fold the following transformation more aggressively: (OP (trunc x), (trunc y)) -> (trunc (OP x, y)) Unfortunately this simple change causes dag combine to loop infinitely. The problem is that the shrink demanded ops optimization tends to canonicalize expressions in the opposite manner. That is badness. This patch disables those optimizations in dag combine; instead they are done as a late pass in sdisel. This also exposes some deficiencies in dag combine and x86 setcc / brcond lowering. Teach them to look past ISD::TRUNCATE in various places. llvm-svn: 92849	2010-01-06 19:38:29 +00:00
Benjamin Kramer	d2564e3afb	Move remaining stuff to the isInteger predicate. llvm-svn: 92771	2010-01-05 21:05:54 +00:00
Benjamin Kramer	a81a6dff0d	Convert a ton of simple integer type equality tests to the new predicate. llvm-svn: 92760	2010-01-05 20:07:06 +00:00
Evan Cheng	4facc6116a	Code refactoring. llvm-svn: 92694	2010-01-05 06:52:31 +00:00
David Greene	d85fd0061d	Change errs() to dbgs(). llvm-svn: 92654	2010-01-05 01:29:34 +00:00
David Greene	d589dafba6	Change errs() to dbgs(). llvm-svn: 92653	2010-01-05 01:29:29 +00:00
David Greene	0688a242a5	Change errs() to dbgs(). llvm-svn: 92651	2010-01-05 01:29:23 +00:00
David Greene	0041181684	Change errs() to dbgs(). llvm-svn: 92648	2010-01-05 01:29:13 +00:00
David Greene	dbdb1b28b8	Change errs() to dbgs(). llvm-svn: 92647	2010-01-05 01:29:08 +00:00
David Greene	a8000359a6	Change errs() to dbgs(). llvm-svn: 92644	2010-01-05 01:28:53 +00:00
Dan Gohman	ea6f91ff64	Change SelectCode's argument from SDValue to SDNode *, to make it more clear what information these functions are actually using. This is also a micro-optimization, as passing a SDNode * around is simpler than passing a { SDNode *, int } by value or reference. llvm-svn: 92564	2010-01-05 01:24:18 +00:00
Dan Gohman	43324d0b29	Remove the SDNPAssociative properties for the flags-producing operators. Eli pointed out that it's not obvious what that would mean. llvm-svn: 92555	2010-01-05 00:44:20 +00:00
Evan Cheng	7844a99d60	Perform this folding as a target specific dag combine: (or (x << c) | (y >> (64 - c))) ==> (shld64 x, y, c) The isel patterns may not catch all the cases if general dag combine has reduced width of source operands. llvm-svn: 92513	2010-01-04 21:22:48 +00:00
Dan Gohman	5d1987f9a0	Remove some README.txt entries which are now implemented. llvm-svn: 92511	2010-01-04 20:55:05 +00:00
Dan Gohman	0f960aed68	A use by operand 1 or 2 of a SELECT is not a FLAGS use. This lets the test-elimination work in more conditional-move cases. llvm-svn: 92508	2010-01-04 20:52:50 +00:00
Dan Gohman	85d4fdfe37	Flags-producing add, and, or, etc. have the same profibility rules as normal add, and, or, etc. llvm-svn: 92507	2010-01-04 20:51:50 +00:00
Dan Gohman	71671131c3	Add SDNPCommutative and SDNPAssociative to several X86 target nodes. This lets isel fold loads into them in more cases. llvm-svn: 92506	2010-01-04 20:51:05 +00:00
Benjamin Kramer	6b37c9e6fc	Replace a few more SmallVectors with arrays. llvm-svn: 92265	2009-12-29 16:57:26 +00:00
Bill Wendling	3179a89067	Remove dead variable. llvm-svn: 92184	2009-12-28 01:36:02 +00:00
Eli Friedman	ac6216d84c	PR5886: Make sure IMUL32m is marked as setting EFLAGS, so scheduling doesn't do illegal stuff around it. No testcase because the issue is very fragile. llvm-svn: 92167	2009-12-26 20:08:30 +00:00
Chris Lattner	4e26c0e52b	really remove the instruction, don't just comment it out llvm-svn: 91976	2009-12-23 01:46:40 +00:00
Chris Lattner	518b037620	completely eliminate the MOV16r0 'instruction'. The only interesting part of this is the divrem changes, which are already tested by CodeGen/X86/divrem.ll. llvm-svn: 91975	2009-12-23 01:45:04 +00:00
Sean Callanan	417c8a43d6	More fixes for Visual C++. Replaced several very small static inline functions with macros. llvm-svn: 91973	2009-12-23 01:32:29 +00:00
Chris Lattner	698def0868	stop pattern matching 16-bit zero's of a register to MOV16r0, instead use the appropriate subreggy thing. This generates identical code on some large apps (thanks to Evan's cross class coalescing stuff he did back in july). This means that MOV16r0 can go away completely in the future soon. llvm-svn: 91972	2009-12-23 01:30:26 +00:00
Sean Callanan	588785c781	Removed the "inline" keyword from the disassembler decoder, because the Visual C++ build does not build .c files as C99 llvm-svn: 91935	2009-12-22 22:51:40 +00:00
Sean Callanan	36eab80875	Fixes to the X86 disassembler: Made LEA memory operands emit only 4 MCInst operands. Made the scale operand equal 1 for instructions that have no SIB byte. llvm-svn: 91919	2009-12-22 21:12:55 +00:00
Evan Cheng	71d7eaa87e	Remove target attribute break-sse-dep. Instead, do not fold load into sse partial update instructions unless optimizing for size. llvm-svn: 91910	2009-12-22 17:47:23 +00:00
Douglas Gregor	8b858396d4	Include based on the current path, since we already -I the X86 target's path. Fixes CMake build llvm-svn: 91908	2009-12-22 17:25:11 +00:00
Bill Wendling	919b7aab2e	Add more plumbing. This time in the LowerArguments and "get" functions which return partial registers. This affected the back-end lowering code some. Also patch up some places I missed before in the "get" functions. llvm-svn: 91880	2009-12-22 02:10:19 +00:00
Sean Callanan	2f9443f422	Changed REG_* to MODRM_REG_* to avoid conflicts with symbols in AuroraUX's global namespace. llvm-svn: 91879	2009-12-22 02:07:42 +00:00
Daniel Dunbar	8b532de418	Fix some may-be-uninitialized var warnings. llvm-svn: 91878	2009-12-22 01:41:37 +00:00
Sean Callanan	5c8f4cd396	Fixed library dependencies between the X86 disassembler and X86 codegen that were causing circular symbol dependencies. llvm-svn: 91871	2009-12-22 01:11:26 +00:00
Chris Lattner	6211d7ba4b	print pcrel immediates as signed values instead of unsigned so that we get things like this out of the disassembler: 0x100000ecb: callq -96 instead of: 0x100000ecb: callq 4294967200 rdar://7491123 llvm-svn: 91864	2009-12-22 00:44:05 +00:00
Eric Christopher	a91c0f48e6	Fix setting and default setting of code model for jit. Do this by allowing backends to override routines that will default the JIT and Static code generation to an appropriate code model for the architecture. Should fix PR 5773. llvm-svn: 91824	2009-12-21 08:15:29 +00:00
Eli Friedman	dbe2aa91b9	A couple minor README updates. llvm-svn: 91823	2009-12-21 08:03:16 +00:00
Daniel Dunbar	4750efc28a	#if 0 out X86 disassembler for now, it is breaking the build in multiple places. llvm-svn: 91778	2009-12-19 17:11:53 +00:00
Nuno Lopes	3ed6d6003c	rename dprintf to dbgprintf, in order to fix build with glibc (which already defines dprintf in stdio.h) llvm-svn: 91775	2009-12-19 12:07:00 +00:00
Daniel Dunbar	c745a620a2	Use memset instead of bzero, it's more portable. llvm-svn: 91754	2009-12-19 03:31:50 +00:00
Sean Callanan	04cc307edd	Table-driven disassembler for the X86 architecture (16-, 32-, and 64-bit incarnations), integrated into the MC framework. The disassembler is table-driven, using a custom TableGen backend to generate hierarchical tables optimized for fast decode. The disassembler consumes MemoryObjects and produces arrays of MCInsts, adhering to the abstract base class MCDisassembler (llvm/MC/MCDisassembler.h). The disassembler is documented in detail in - lib/Target/X86/Disassembler/X86Disassembler.cpp (disassembler runtime) - utils/TableGen/DisassemblerEmitter.cpp (table emitter) You can test the disassembler by running llvm-mc -disassemble for i386 or x86_64 targets. Please let me know if you encounter any problems with it. llvm-svn: 91749	2009-12-19 02:59:52 +00:00
Anton Korobeynikov	148d87b0b0	Bump alignment requirements for windows targets to achieve compatibility with vcpp. Based on patch by Michael Beck! llvm-svn: 91745	2009-12-19 02:04:23 +00:00
Evan Cheng	4cf30b72bf	On recent Intel u-arch's, folding loads into some unary SSE instructions can be non-optimal. To be precise, we should avoid folding loads if the instructions only update part of the destination register, and the non-updated part is not needed. e.g. cvtss2sd, sqrtss. Unfolding the load from these instructions breaks the partial register dependency and it can improve performance. e.g. movss (%rdi), %xmm0 cvtss2sd %xmm0, %xmm0 instead of cvtss2sd (%rdi), %xmm0 An alternative method to break dependency is to clear the register first. e.g. xorps %xmm0, %xmm0 cvtss2sd (%rdi), %xmm0 llvm-svn: 91672	2009-12-18 07:40:29 +00:00
Evan Cheng	3dfd04e2b7	Re-apply 91623 now that I actually know what I was trying to do. llvm-svn: 91655	2009-12-18 01:59:21 +00:00
Sean Callanan	04d8cb74f3	Instruction fixes, added instructions, and AsmString changes in the X86 instruction tables. Also (while I was at it) cleaned up the X86 tables, removing tabs and 80-line violations. This patch was reviewed by Chris Lattner, but please let me know if there are any problems. * X86.td Removed tabs and fixed 80-line violations X86Instr64bit.td (IRET, POPCNT, BT_, LSL, SWPGS, PUSH_S, POP_S, L_S, SMSW) Added (CALL, CMOV) Added qualifiers (JMP) Added PC-relative jump instruction (POPFQ/PUSHFQ) Added qualifiers; renamed PUSHFQ to indicate that it is 64-bit only (ambiguous since it has no REX prefix) (MOV) Added rr form going the other way, which is encoded differently (MOV) Changed immediates to offsets, which is more correct; also fixed MOV64o64a to have to a 64-bit offset (MOV) Fixed qualifiers (MOV) Added debug-register and condition-register moves (MOVZX) Added more forms (ADC, SUB, SBB, AND, OR, XOR) Added reverse forms, which (as with MOV) are encoded differently (ROL) Made REX.W required (BT) Uncommented mr form for disassembly only (CVT__2__) Added several missing non-intrinsic forms (LXADD, XCHG) Reordered operands to make more sense for MRMSrcMem (XCHG) Added register-to-register forms (XADD, CMPXCHG, XCHG) Added non-locked forms * X86InstrSSE.td (CVTSS2SI, COMISS, CVTTPS2DQ, CVTPS2PD, CVTPD2PS, MOVQ) Added * X86InstrFPStack.td (COM_FST0, COMP_FST0, COM_FI, COM_FIP, FFREE, FNCLEX, FNOP, FXAM, FLDL2T, FLDL2E, FLDPI, FLDLG2, FLDLN2, F2XM1, FYL2X, FPTAN, FPATAN, FXTRACT, FPREM1, FDECSTP, FINCSTP, FPREM, FYL2XP1, FSINCOS, FRNDINT, FSCALE, FCOMPP, FXSAVE, FXRSTOR) Added (FCOM, FCOMP) Added qualifiers (FSTENV, FSAVE, FSTSW) Fixed opcode names (FNSTSW) Added implicit register operand * X86InstrInfo.td (opaque512mem) Added for FXSAVE/FXRSTOR (offset8, offset16, offset32, offset64) Added for MOV (NOOPW, IRET, POPCNT, IN, BTC, BTR, BTS, LSL, INVLPG, STR, LTR, PUSHFS, PUSHGS, POPFS, POPGS, LDS, LSS, LES, LFS, LGS, VERR, VERW, SGDT, SIDT, SLDT, LGDT, LIDT, LLDT, LODSD, OUTSB, OUTSW, OUTSD, HLT, RSM, FNINIT, CLC, STC, CLI, STI, CLD, STD, CMC, CLTS, XLAT, WRMSR, RDMSR, RDPMC, SMSW, LMSW, CPUID, INVD, WBINVD, INVEPT, INVVPID, VMCALL, VMCLEAR, VMLAUNCH, VMRESUME, VMPTRLD, VMPTRST, VMREAD, VMWRITE, VMXOFF, VMXON) Added (NOOPL, POPF, POPFD, PUSHF, PUSHFD) Added qualifier (JO, JNO, JB, JAE, JE, JNE, JBE, JA, JS, JNS, JP, JNP, JL, JGE, JLE, JG, JCXZ) Added 32-bit forms (MOV) Changed some immediate forms to offset forms (MOV) Added reversed reg-reg forms, which are encoded differently (MOV) Added debug-register and condition-register moves (CMOV) Added qualifiers (AND, OR, XOR, ADC, SUB, SBB) Added reverse forms, like MOV (BT) Uncommented memory-register forms for disassembler (MOVSX, MOVZX) Added forms (XCHG, LXADD) Made operand order make sense for MRMSrcMem (XCHG) Added register-register forms (XADD, CMPXCHG) Added unlocked forms * X86InstrMMX.td (MMX_MOVD, MMV_MOVQ) Added forms * X86InstrInfo.cpp: Changed PUSHFQ to PUSHFQ64 to reflect table change * X86RegisterInfo.td: Added debug and condition register sets * x86-64-pic-3.ll: Fixed testcase to reflect call qualifier * peep-test-3.ll: Fixed testcase to reflect test qualifier * cmov.ll: Fixed testcase to reflect cmov qualifier * loop-blocks.ll: Fixed testcase to reflect call qualifier * x86-64-pic-11.ll: Fixed testcase to reflect call qualifier * 2009-11-04-SubregCoalescingBug.ll: Fixed testcase to reflect call qualifier * x86-64-pic-2.ll: Fixed testcase to reflect call qualifier * live-out-reg-info.ll: Fixed testcase to reflect test qualifier * tail-opts.ll: Fixed testcase to reflect call qualifiers * x86-64-pic-10.ll: Fixed testcase to reflect call qualifier * bss-pagealigned.ll: Fixed testcase to reflect call qualifier * x86-64-pic-1.ll: Fixed testcase to reflect call qualifier * widen_load-1.ll: Fixed testcase to reflect call qualifier llvm-svn: 91638	2009-12-18 00:01:26 +00:00
Jeffrey Yasskin	0de0ce11d8	Revert r91623 to unbreak the buildbots. llvm-svn: 91632	2009-12-17 22:44:34 +00:00
Evan Cheng	e43b403c87	Remove an unused option. llvm-svn: 91623	2009-12-17 21:23:58 +00:00
Ken Dyck	798493285c	In LowerEXTRACT_VECTOR_ELT, force an i32 value type for PEXTWR instead of incrementing the simple value type of the 16-bit type, which would give the wrong type if an intermediate MVT (such as i24) were introduced. llvm-svn: 91602	2009-12-17 15:31:52 +00:00
Evan Cheng	1be6286028	Re-enable 91381 with fixes. llvm-svn: 91489	2009-12-16 00:53:11 +00:00
Jeffrey Yasskin	e0d8e14e11	Change indirect-globals to use a dedicated allocIndirectGV. This lets us remove start/finishGVStub and the BufferState helper class from the MachineCodeEmitter interface. It has the side-effect of not setting the indirect global writable and then executable on ARM, but that shouldn't be necessary. llvm-svn: 91464	2009-12-15 22:42:46 +00:00
Evan Cheng	b3032962ef	Fix an encoding bug. llvm-svn: 91417	2009-12-15 06:49:02 +00:00
Kenneth Uildriks	792f0913ee	For fastcc on x86, let ECX be used as a return register after EAX and EDX llvm-svn: 91410	2009-12-15 03:27:52 +00:00
Evan Cheng	fcb5453dc7	Disable 91381 for now. It's miscompiling ARMISelDAG2DAG.cpp. llvm-svn: 91405	2009-12-15 03:07:11 +00:00
Evan Cheng	0e8b9e32d1	Use sbb x, x to materialize carry bit in a GPR. The result is all one's or all zero's. llvm-svn: 91381	2009-12-15 00:53:42 +00:00
Dan Gohman	cecad35728	Fix integer cast code to handle vector types. llvm-svn: 91362	2009-12-14 23:40:38 +00:00
Bill Wendling	277381f69a	Whitespace changes, comment clarification. No functional changes. llvm-svn: 91274	2009-12-14 06:51:19 +00:00
Evan Cheng	26fdd7265b	Disable r91104 for x86. It causes partial register stalls, which pessimize code in 32-bit. llvm-svn: 91223	2009-12-12 20:03:14 +00:00
Evan Cheng	3974c8de51	Add comment about potential partial register stall. llvm-svn: 91220	2009-12-12 18:55:26 +00:00
Evan Cheng	6d6eaafa8c	Fix an obvious bug. No test case since LEA16r is not being used. llvm-svn: 91219	2009-12-12 18:51:56 +00:00
Dan Gohman	1d459e4937	Implement vector widening, splitting, and scalarizing for SIGN_EXTEND_INREG. llvm-svn: 91158	2009-12-11 21:31:27 +00:00
Anton Korobeynikov	fc51282cbe	Honour setHasCalls() set from isel. This is used in some weird cases like general dynamic TLS model. This fixes PR5723 llvm-svn: 91144	2009-12-11 19:39:55 +00:00
Evan Cheng	766a73fb04	Add support to 3-addressify 16-bit instructions. llvm-svn: 91104	2009-12-11 06:01:48 +00:00
Evan Cheng	493b882f80	Optimize splat of a scalar load into a shuffle of a vector load when it's legal. e.g. vector_shuffle (scalar_to_vector (i32 load (ptr + 4))), undef, <0, 0, 0, 0> => vector_shuffle (v4i32 load ptr), undef, <1, 1, 1, 1> iff ptr is 16-byte aligned (or can be made into 16-byte aligned). llvm-svn: 90984	2009-12-09 21:00:30 +00:00
Evan Cheng	d938faff4b	Teach InferPtrAlignment to infer GV+cst alignment and use it to simplify x86 isel lowering code. llvm-svn: 90925	2009-12-09 01:53:58 +00:00
Evan Cheng	f5938d5d27	Move isConsecutiveLoad to SelectionDAG. It's not target dependent and it's primarily used by selectdag passes. llvm-svn: 90922	2009-12-09 01:36:00 +00:00
Dan Gohman	9528ccdd77	Don't enable the post-RA scheduler on x86 except at -O3. In its current form, it is too expensive in compile time. llvm-svn: 90781	2009-12-07 19:04:31 +00:00
Dan Gohman	047a767d74	Remove the target hook TargetInstrInfo::BlockHasNoFallThrough in favor of MachineBasicBlock::canFallThrough(), which is target-independent and more thorough. llvm-svn: 90634	2009-12-05 00:44:40 +00:00
David Greene	86bafa29a3	Remove an unneeded include. llvm-svn: 90625	2009-12-04 23:55:07 +00:00
David Greene	0508e435c3	Have hasLoad/StoreFrom/ToStackSlot return the relevant MachineMemOperand. llvm-svn: 90608	2009-12-04 22:38:46 +00:00
Chris Lattner	765ac33a1a	yay for case insensitive file systems (?) llvm-svn: 90370	2009-12-03 01:10:05 +00:00
Chris Lattner	a48f44d9ee	improve portability to avoid conflicting with std::next in c++'0x. Patch by Howard Hinnant! llvm-svn: 90365	2009-12-03 00:50:42 +00:00
Jim Grosbach	2c3a6c6589	Factor the stack alignment calculations out into a target independent pass. No functionality change. llvm-svn: 90336	2009-12-02 19:30:24 +00:00
Dan Gohman	3ee8bc9b35	Minor whitespace fixes. llvm-svn: 90166	2009-11-30 23:33:53 +00:00
Dan Gohman	6f51309021	Fix a minor inconsistency. llvm-svn: 90165	2009-11-30 23:33:37 +00:00
Bob Wilson	505ddaa4dc	Remove isProfitableToDuplicateIndirectBranch target hook. It is profitable for all the processors where I have tried it, and even when it might not help performance, the cost is quite low. The opportunities for duplicating indirect branches are limited by other factors so code size does not change much due to tail duplicating indirect branches aggressively. llvm-svn: 90144	2009-11-30 18:35:03 +00:00
Mon P Wang	32f8bb9ed4	Added support to allow clients to custom widen. For X86, custom widen vectors for divide/remainder, since these operations can trap, by unrolling them and adding undefs for the resulting vector. llvm-svn: 90108	2009-11-30 02:42:02 +00:00
Bob Wilson	120f729eca	Based on the testcase for pr3120, running on my MacPro with Xeon processors, it is definitely profitable to tail duplicate indirect branches for x86. This is likely to be true to various degrees for all modern x86 processors. llvm-svn: 89865	2009-11-25 17:27:53 +00:00
Daniel Dunbar	900f2ce31c	Sketch structure for X86 disassembler. llvm-svn: 89850	2009-11-25 06:53:08 +00:00
Jeffrey Yasskin	f2ad571443	* Move stub allocation inside the JITEmitter, instead of exposing a way for each TargetJITInfo subclass to allocate its own stubs. This means stubs aren't as exactly-sized anymore, but it lets us get rid of TargetJITInfo::emitFunctionStubAtAddr(), which lets ARM and PPC support the eager JIT, fixing http://llvm.org/PR4816. * Rename the JITEmitter's stub creation functions to describe the kind of stub they create. So far, all of them create lazy-compilation stubs, but they sometimes get used when far-call stubs are needed. Fixing http://llvm.org/PR5201 will involve fixing this. llvm-svn: 89715	2009-11-23 23:35:19 +00:00
Dan Gohman	de5dea869f	Remove ISD::DEBUG_LOC and ISD::DBG_LABEL, which are no longer used. Note that "hasDotLocAndDotFile"-style debug info was already broken; people wanting this functionality should implement it in the AsmPrinter/DwarfWriter code. llvm-svn: 89711	2009-11-23 23:20:51 +00:00
Jeffrey Yasskin	19b48370fb	Allow more than one stub to be being generated at the same time. It's probably better in the long run to replace the indirect-GlobalVariable system. That'll be done after a subsequent patch. llvm-svn: 89708	2009-11-23 22:49:00 +00:00
Devang Patel	ed85e12da6	We are not using DBG_STOPPOINT anymore. llvm-svn: 89536	2009-11-21 02:46:55 +00:00
Dan Gohman	312971513f	Fix a thinko that caused spurious @GOTOFFs. llvm-svn: 89509	2009-11-20 23:30:32 +00:00
Dan Gohman	7a6611793f	Target-independent support for TargetFlags on BlockAddress operands, and support for blockaddresses in x86-32 PIC mode. llvm-svn: 89506	2009-11-20 23:18:13 +00:00
Sean Callanan	c1f532e930	Recommitting PALIGNR shift width fixes. Thanks to Daniel Dunbar for fixing clang intrinsics: http://llvm.org/viewvc/llvm-project?view=rev&revision=89499 llvm-svn: 89500	2009-11-20 22:28:42 +00:00
Sean Callanan	19d92728d0	Reverting PALIGNR fix until I figure out how this broke the Clang testsuite. llvm-svn: 89495	2009-11-20 22:09:28 +00:00
Sean Callanan	fbed130173	Fixed PALIGNR to take 8-bit rotations in all cases. Also fixed the corresponding testcase, and the PALIGNR intrinsic (tested for correctness with llvm-gcc). llvm-svn: 89491	2009-11-20 21:40:28 +00:00
Evan Cheng	5392cc9d14	Re-apply 89011. It's not to be blamed. llvm-svn: 89081	2009-11-17 09:51:18 +00:00
Evan Cheng	05938e819b	Revert 89011. Buildbot thinks it might be breaking stuff. llvm-svn: 89076	2009-11-17 09:20:28 +00:00
Evan Cheng	d33400e636	MOV64rm should be marked isReMaterializable. llvm-svn: 89019	2009-11-17 00:55:55 +00:00
Evan Cheng	ce28f6f478	A few more instructions that should be marked re-materializable. llvm-svn: 89011	2009-11-17 00:23:22 +00:00
Jeffrey Yasskin	10d3604a9e	Make X86-64 in the Large model always emit 64-bit calls. The large code model is documented at http://www.x86-64.org/documentation/abi.pdf and says that calls should assume their target doesn't live within the 32-bit pc-relative offset that fits in the call instruction. To do this, we turn off the global-address->target-global-address conversion in X86TargetLowering::LowerCall(). The first attempt at this broke the lazy JIT because it can separate the movabs(imm->reg) from the actual call instruction. The lazy JIT receives the address of the movabs as a relocation and needs to record the return address from the call; and then when that call happens, it needs to patch the movabs with the newly-compiled target. We could thread the call instruction into the relocation and record the movabs<->call mapping explicitly, but that seems to require at least as much new complication in the code generator as this change. To fix this, we make lazy functions _always_ go through a call stub. You'd think we'd only have to force lazy calls through a stub on difficult platforms, but that turns out to break indirect calls through a function pointer. The right fix for that is to distinguish between calls and address-of operations on uncompiled functions, but that's complex enough to leave for someone else to do. Another attempt at this defined a new CALL64i pseudo-instruction, which expanded to a 2-instruction sequence in the assembly output and was special-cased in the X86CodeEmitter's emitInstruction() function. That broke indirect calls in the same way as above. This patch also removes a hack forcing Darwin to the small code model. Without far-call-stubs, the small code model requires things of the JITMemoryManager that the DefaultJITMemoryManager can't provide. Thanks to echristo for lots of testing! llvm-svn: 88984	2009-11-16 22:41:33 +00:00
Evan Cheng	f25ef4ffb0	- Check memoperand alignment instead of checking stack alignment. Most load / store folding instructions are not referencing spill stack slots. - Mark MOVUPSrm re-materializable. llvm-svn: 88974	2009-11-16 21:56:03 +00:00
Anton Korobeynikov	fd0c7bae2a	Temporarily disable the error - it seems to be too conservative. llvm-svn: 88800	2009-11-14 18:01:41 +00:00
Daniel Dunbar	241d01b590	Add llvm::sys::getHostCPUName, for detecting the LLVM name for the host CPU. - This is an initial step towards -march=native support in Clang, and towards eliminating host dependencies in the targets. See PR5389. - Patch by Roman Divacky! llvm-svn: 88768	2009-11-14 10:09:12 +00:00
Evan Cheng	6ad7da96fe	- Change TargetInstrInfo::reMaterialize to pass in TargetRegisterInfo. - If destination is a physical register and it has a subreg index, use the sub-register instead. This fixes PR5423. llvm-svn: 88745	2009-11-14 02:55:43 +00:00
Jakob Stoklund Olesen	ff0302489b	The instruction pointer %RIP is a reserved register on x86_64. llvm-svn: 88705	2009-11-13 21:56:01 +00:00
David Greene	659c1a9d78	Move DebugInfo checks into EmitComments and remove them from target-specific AsmPrinters. Not all comments need DebugInfo. Re-enable the line numbers comment test. llvm-svn: 88697	2009-11-13 21:34:57 +00:00
David Goodwin	b9fe5d5d02	Allow target to specify regclass for which antideps will only be broken along the critical path. llvm-svn: 88682	2009-11-13 19:52:48 +00:00
David Greene	2f4c37425b	Fix a bootstrap failure. Provide special isLoadFromStackSlotPostFE and isStoreToStackSlotPostFE interfaces to explicitly request checking for post-frame ptr elimination operands. This uses a heuristic so it isn't reliable for correctness. llvm-svn: 87047	2009-11-13 00:29:53 +00:00
David Greene	be851acfb0	Make the MachineFunction argument of getFrameRegister const. This also fixes a build error. llvm-svn: 87027	2009-11-12 21:00:03 +00:00
David Greene	70fdd57dc1	Add hasLoadFromStackSlot and hasStoreToStackSlot to return whether a machine instruction loads or stores from/to a stack slot. Unlike isLoadFromStackSlot and isStoreFromStackSlot, the instruction may be something other than a pure load/store (e.g. it may be an arithmetic operation with a memory operand). This helps AsmPrinter determine when to print a spill/reload comment. This is only a hint since we may not be able to figure this out in all cases. As such, it should not be relied upon for correctness. Implement for X86. Return false by default for other architectures. llvm-svn: 87026	2009-11-12 20:55:29 +00:00
David Greene	1fbe054450	Add a bool flag to StackObjects telling whether they reference spill slots. The AsmPrinter will use this information to determine whether to print a spill/reload comment. Remove default argument values. It's too easy to pass a wrong argument value when multiple arguments have default values. Make everything explicit to trap bugs early. Update all targets to adhere to the new interfaces. llvm-svn: 87022	2009-11-12 20:49:22 +00:00
Benjamin Kramer	68e4945c03	Add compare_lower and equals_lower methods to StringRef. Switch all users of StringsEqualNoCase (from StringExtras.h) to it. llvm-svn: 87020	2009-11-12 20:36:59 +00:00
Dan Gohman	d2a0f80ede	Use a tab in INT3's asm string, for consistency. llvm-svn: 86850	2009-11-11 18:07:16 +00:00
Daniel Dunbar	bc299f0092	llvm-gcc/clang don't (won't?) need this hack. llvm-svn: 86769	2009-11-11 00:28:38 +00:00
Daniel Dunbar	b9415c7d9a	Add a monstrous hack to improve X86ISelDAGToDAG compile time. - Force NDEBUG on in any Release build. This drops the compile time to ~100s from ~600s, in Release mode. - This may just be a temporary workaround, I don't know the true nature of the gcc-4.2 compile time performance problem. llvm-svn: 86695	2009-11-10 18:24:37 +00:00
Jeffrey Yasskin	b40d3f76a0	Fix DenseMap iterator constness. This patch forbids implicit conversion of DenseMap::const_iterator to DenseMap::iterator which was possible because DenseMapIterator inherited (publicly) from DenseMapConstIterator. Conversion the other way around is now allowed as one may expect. The template DenseMapConstIterator is removed and the template parameter IsConst which specifies whether the iterator is constant is added to DenseMapIterator. Actually IsConst parameter is not necessary since the constness can be determined from KeyT but this is not relevant to the fix and can be addressed later. Patch by Victor Zverovich! llvm-svn: 86636	2009-11-10 01:02:17 +00:00
David Goodwin	0d412c2528	Fixed to address code review. No functional changes. llvm-svn: 86634	2009-11-10 00:48:55 +00:00
David Goodwin	cf89db135e	Allow targets to specify register classes whose member registers should not be renamed to break anti-dependencies. llvm-svn: 86628	2009-11-10 00:15:47 +00:00
Anton Korobeynikov	eb8692cff9	Throw an error when stack realignment stuff fails instead of silent code miscompilation llvm-svn: 86463	2009-11-08 12:58:40 +00:00
Nate Begeman	3a313df69b	x86 vector shuffle cleanup/fixes: 1. rename the movhp patfrag to movlhps, since that's what it actually matches 2. eliminate the bogus movhps load and store patterns, they were incorrect. The load transforms are already handled (correctly) by shufps/unpack. 3. revert a recent test change to its correct form. llvm-svn: 86415	2009-11-07 23:17:15 +00:00
Chris Lattner	8714348afd	indicate what the native integer types for the target are. Please verify. llvm-svn: 86397	2009-11-07 19:07:32 +00:00
Chris Lattner	d82510e109	add some missing #includes llvm-svn: 86367	2009-11-07 09:20:54 +00:00
Jeffrey Yasskin	db5f24ce77	Make the need-stub variables accurate and consistent. In the case of MachineRelocations, "stub" always refers to a far-call stub or a load-a-faraway-global stub, so this patch adds "Far" to the term. (Other stubs are used for lazy compilation and dlsym address replacement.) The variable was also inconsistent between the positive and negative sense, and the positive sense ("NeedStub") was more demanding than is accurate (since a nearby-enough function can be called directly even if the platform often requires a stub). Since the negative sense causes double-negatives, I switched to "MayNeedFarStub" globally. llvm-svn: 86363	2009-11-07 08:51:52 +00:00
Eric Christopher	bd05185ef1	Fix a couple of shuffle patterns to use movhlps instead of movhps as the constraint. Changes optimizations so update testcases as appropriate as well. llvm-svn: 86360	2009-11-07 08:45:53 +00:00
Kenneth Uildriks	07119737aa	Add code to check at SelectionDAGISel::LowerArguments time to see if return values can be lowered to registers. Coming soon, code to perform sret-demotion if return values cannot be lowered to registers llvm-svn: 86324	2009-11-07 02:11:54 +00:00
Daniel Dunbar	ad36e8aceb	Pass StringRef by value. llvm-svn: 86251	2009-11-06 10:58:06 +00:00
Dan Gohman	ee8afcc59d	Factor out the printing of the leading tab into printInlineAsm. llvm-svn: 86199	2009-11-06 00:04:54 +00:00
Dan Gohman	006f9353e1	Use SUBREG_TO_REG instead of INSERT_SUBREG to model x86-64's implicit zero-extend. llvm-svn: 86196	2009-11-05 23:53:08 +00:00
Dan Gohman	b15f4a1cbd	Remove uninteresting and confusing debug output. llvm-svn: 86149	2009-11-05 18:47:09 +00:00
Jakob Stoklund Olesen	c7cfc94bcc	Print out an informative comment for KILL instructions. The KILL pseudo-instruction may survive to the asm printer pass, just like the IMPLICIT_DEF. Print the KILL as a comment instead of just leaving a blank line in the output. With -asm-verbose=0, a blank line is printed, like IMPLICIT_DEF. llvm-svn: 86041	2009-11-04 19:24:37 +00:00
Anton Korobeynikov	0f38d989bd	Do not infer the target type for COPY_TO_REGCLASS from dest regclass, this won't work if it can contain several types. Require explicit result type for the node for now. This fixes PR5364. PS: It seems that blackfin usage of copy_to_regclass is completely bogus! llvm-svn: 85766	2009-11-02 00:11:39 +00:00
Chris Lattner	50ba5c3dc2	improve x86 codegen support for blockaddress. We now compile the testcase into: _test1: ## @test1 ## BB#0: ## %entry leaq L_test1_bb6(%rip), %rax jmpq *%rax L_test1_bb: ## Address Taken LBB1_1: ## %bb movb $1, %al ret L_test1_bb6: ## Address Taken LBB1_2: ## %bb6 movb $2, %al ret Note, it is very very strange that BlockAddressSDNode doesn't carry around TargetFlags. Dan, please fix this. llvm-svn: 85703	2009-11-01 03:25:03 +00:00
Dan Gohman	49fa51d936	Fix MachineLICM to use the correct virtual register class when unfolding loads for hoisting. getOpcodeAfterMemoryUnfold returns the opcode of the original operation without the load, not the load itself, MachineLICM needs to know the operand index in order to get the correct register class. Extend getOpcodeAfterMemoryUnfold to return this information. llvm-svn: 85622	2009-10-30 22:18:41 +00:00
Dan Gohman	f7c4299312	Initial x86 support for BlockAddresses. llvm-svn: 85557	2009-10-30 01:28:02 +00:00
Dan Gohman	453d64c9f5	Rename usesCustomDAGSchedInserter to usesCustomInserter, and update a bunch of associated comments, because it doesn't have anything to do with DAGs or scheduling. This is another step in decoupling MachineInstr emitting from scheduling. llvm-svn: 85517	2009-10-29 18:10:34 +00:00
Evan Cheng	f64db3e1d0	X86 palignr intrinsics immediate field is in bits. ISel must transform it into bytes. llvm-svn: 85379	2009-10-28 06:30:34 +00:00
Evan Cheng	83896a59e1	Add a second ValueType argument to isFPImmLegal. llvm-svn: 85361	2009-10-28 01:43:28 +00:00
Bill Wendling	fd2730ee8c	Move and clarify note. llvm-svn: 85334	2009-10-27 22:48:31 +00:00
Bill Wendling	2974f63cb5	Note corrected. llvm-svn: 85332	2009-10-27 22:43:24 +00:00
Bill Wendling	cd4d148040	Modify note. llvm-svn: 85331	2009-10-27 22:40:45 +00:00
Bill Wendling	a205402c16	Add a note. llvm-svn: 85329	2009-10-27 22:34:43 +00:00
Evan Cheng	16993aa30b	Do away with addLegalFPImmediate. Add a target hook isFPImmLegal which returns true if the fp immediate can be natively codegened by target. llvm-svn: 85281	2009-10-27 19:56:55 +00:00
Chris Lattner	fb22a85baf	apparently the X86 JIT isn't fully contextized, it is still using getGlobalContext() :( llvm-svn: 85252	2009-10-27 17:01:03 +00:00
Nick Lewycky	974e12b2d3	Remove includes of Support/Compiler.h that are no longer needed after the VISIBILITY_HIDDEN removal. llvm-svn: 85043	2009-10-25 06:57:41 +00:00
Nick Lewycky	02d5f77d26	Remove VISIBILITY_HIDDEN from class/struct found inside anonymous namespaces. Chris claims we should never have visibility_hidden inside any .cpp file but that's still not true even after this commit. llvm-svn: 85042	2009-10-25 06:33:48 +00:00
Evan Cheng	8b86efefec	X86 needs critical path anti-dependency breaking. llvm-svn: 84931	2009-10-23 05:57:35 +00:00
David Goodwin	02ad4cb32e	Allow the target to select the level of anti-dependence breaking that should be performed by the post-RA scheduler. The default is none. llvm-svn: 84911	2009-10-22 23:19:17 +00:00
Dan Gohman	3d9d78463c	Following r84485, add Defs = [EFLAGS] to the 32-bit lock instructions too. llvm-svn: 84652	2009-10-20 18:14:49 +00:00
Dan Gohman	4a43e3068d	Make TranslateX86CC return COND_INVALID instead of aborting when it encounters an OEQ or UNE comparison, and update its callers to check for this return status and recover. This fixes a problem resulting from the LowerOperation hooks being called from LegalizeVectorOps, because LegalizeVectorOps only lowers vectors, so OEQ and UNE comparisons may still be at large. This fixes PR5092. llvm-svn: 84640	2009-10-20 16:22:37 +00:00
Chris Lattner	0b4a59fc07	X86 should ignore implicit regs when lowering to MCInst also, no functionality change. llvm-svn: 84567	2009-10-19 23:35:57 +00:00
Chris Lattner	d99b6974b9	simplify by using the twine form of GetOrCreateSymbol llvm-svn: 84565	2009-10-19 23:05:23 +00:00
Chris Lattner	86dfd73c38	add a twine version of MCContext::GetOrCreateSymbol. llvm-svn: 84561	2009-10-19 22:49:00 +00:00
Chris Lattner	f264f8a21c	revert r84540, fixing build breakage I didn't see because of broken makefile deps :( llvm-svn: 84544	2009-10-19 21:59:25 +00:00
Chris Lattner	e3796a0fee	pass mangler in as a reference instead of a pointer. llvm-svn: 84540	2009-10-19 21:45:31 +00:00
Chris Lattner	5db7b6a5d4	remove strings from instructions that are never asmprinted. All of these "subreg32" modifier instructions are handled explicitly by the MCInst lowering phase. If they got to the asmprinter, they would explode. They should eventually be replaced with correct use of subregs. llvm-svn: 84526	2009-10-19 19:51:42 +00:00
Chris Lattner	66ebfab3ab	remove accidental comment. llvm-svn: 84510	2009-10-19 18:03:41 +00:00
Chris Lattner	e6da1826e8	emit .subsections_via_symbols through MCStreamer instead of textually. llvm-svn: 84509	2009-10-19 18:03:08 +00:00
Nate Begeman	5ca7b345b9	PR 5245 - The immediate size target flag was not set on 3A-prefixed SSSE3 instructions. llvm-svn: 84506	2009-10-19 17:31:16 +00:00
Torok Edwin	033f01c922	Fix PR5247, "lock addq" pattern (and other atomics), it DOES modify EFLAGS. LLC was scheduling compares before the adds causing wrong branches to be taken in programs, resulting in misoptimized code wherever atomic adds were used. llvm-svn: 84485	2009-10-19 11:00:58 +00:00
Nate Begeman	18df82a20c	Add support for matching shuffle patterns with palignr. llvm-svn: 84459	2009-10-19 02:17:23 +00:00
Evan Cheng	c436631a9c	Turn on post-alloc scheduling for x86. llvm-svn: 84431	2009-10-18 19:57:27 +00:00
Evan Cheng	936d87b39d	Oops. I forgot to change the tests first. Disable post-alloc scheduling. llvm-svn: 84425	2009-10-18 18:31:31 +00:00
Evan Cheng	0e9d9ca855	- Revert parts of 84326 and 84411. Distinguishing between fixed and non-fixed stack slots and giving them different PseudoSourceValue's did not fix the problem of post-alloc scheduling miscompiling llvm itself. - Apply Dan's conservative workaround by assuming any non fixed stack slots can alias other memory locations. This means a load from spill slot #1 cannot move above a store of spill slot #2. - Enable post-alloc scheduling for x86 at optimization level Default and above. llvm-svn: 84424	2009-10-18 18:16:27 +00:00
Evan Cheng	0b8db2dab7	Only fixed stack objects and spill slots should get FixedStack PseudoSourceValue. llvm-svn: 84411	2009-10-18 06:27:36 +00:00
Evan Cheng	4729191bb2	Distinguish stack slots from other stack objects. They (and fixed objects) get FixedStack PseudoSourceValues. llvm-svn: 84326	2009-10-17 09:20:14 +00:00
Evan Cheng	8759585aba	Revert 84315 for now. Re-thinking the patch. llvm-svn: 84321	2009-10-17 07:53:04 +00:00
Evan Cheng	0818d87ed1	Rename getFixedStack to getStackObject. The stack objects represented are not necessarily fixed. Only those with negative frame indices are "fixed." llvm-svn: 84315	2009-10-17 06:22:26 +00:00
Evan Cheng	007ceb4603	Change createPostRAScheduler so it can be turned off at llc -O1. llvm-svn: 84273	2009-10-16 21:06:15 +00:00
Anton Korobeynikov	49e417c52c	Dllexport stuff cleanup: 1. Emit external function type information for all COFF targets since it's a feature of object format 2. Emit linker directives only for cygming (since this is ld-specific stuff) llvm-svn: 84214	2009-10-15 22:36:18 +00:00
Evan Cheng	e4a2117161	Remove X86Subtarget::IsLinux. It's no longer being used. llvm-svn: 84200	2009-10-15 20:23:21 +00:00
Dan Gohman	0be8c2b0e3	Make isSafeToClobberEFLAGS more aggressive. Teach it to scan backwards (for uses marked kill and defs marked dead) a few instructions in addition to forwards. Also, increase the maximum number of instructions to scan, as it appears to help in a fair number of cases. llvm-svn: 84061	2009-10-14 00:08:59 +00:00
Ted Kremenek	f34311779c	Update CMake file (lexically order files). llvm-svn: 84008	2009-10-13 18:57:27 +00:00
Dan Gohman	a698d7ac3c	Don't forget to mark RAX as live-out of the function when arranging for it to hold the address of an sret return value, for x86-64 ABI purposes. Also, fix the test that was originally intended to test this to actually test it, using FileCheck. llvm-svn: 83853	2009-10-12 16:36:12 +00:00
Chris Lattner	0840c823e4	Fix PR5087, patch by Jakub Staszak! llvm-svn: 83822	2009-10-12 04:22:44 +00:00
Dan Gohman	1faa11521e	Remove a no-longer-necessary #include. llvm-svn: 83697	2009-10-10 00:36:09 +00:00
Dan Gohman	e919de5acf	Replace X86's CanRematLoadWithDispOperand by calling the target-independent MachineInstr::isInvariantLoad instead, which has the benefit of being more complete. llvm-svn: 83696	2009-10-10 00:34:18 +00:00

5416 Commits