llvm-project

Author	SHA1	Message	Date
Evan Cheng	ed69b382ea	- Move TargetLowering::EmitTargetCodeForFrameDebugValue to TargetInstrInfo and rename it to emitFrameIndexDebugValue. - Teach spiller to modify DBG_VALUE instructions to reference spill slots. llvm-svn: 102323	2010-04-26 07:38:55 +00:00
Jakob Stoklund Olesen	dbff4e8103	Renumber SSE execution domains for better code size. SSEDomainFix will collapse to the domain with the lower number when it has a choice. The SSEPackedSingle domain often has smaller instructions, so prefer that. llvm-svn: 99952	2010-03-30 22:46:53 +00:00
Jakob Stoklund Olesen	b551aa4da5	Basic implementation of SSEDomainFix pass. Cross-block inference is primitive and wrong, but the pass is working otherwise. llvm-svn: 99848	2010-03-29 23:24:21 +00:00
Jakob Stoklund Olesen	49e121d5e4	Add a late SSEDomainFix pass that twiddles SSE instructions to avoid domain crossings. On Nehalem and newer CPUs there is a 2 cycle latency penalty on using a register in a different domain than where it was defined. Some instructions have equivalents for different domains, like por/orps/orpd. The SSEDomainFix pass tries to minimize the number of domain crossings by changing between equivalent opcodes where possible. This is a work in progress, in particular the pass doesn't do anything yet. SSE instructions are tagged with their execution domain in TableGen using the last two bits of TSFlags. Note that not all instructions are tagged correctly. Life just isn't that simple. The SSE execution domain issue is very similar to the ARM NEON/VFP pipeline issue handled by NEONMoveFixPass. This pass may become target independent to handle both. llvm-svn: 99524	2010-03-25 17:25:00 +00:00
Jakob Stoklund Olesen	a86ccbfe88	Revert "Add a late SSEDomainFix pass that twiddles SSE instructions to avoid domain crossings." This reverts commit 99345. It was breaking buildbots. llvm-svn: 99352	2010-03-23 23:48:51 +00:00
Jakob Stoklund Olesen	31da45b7af	Add a late SSEDomainFix pass that twiddles SSE instructions to avoid domain crossings. This is work in progress. So far, SSE execution domain tables are added to X86InstrInfo, and a skeleton pass is enabled with -sse-domain-fix. llvm-svn: 99345	2010-03-23 23:14:44 +00:00
Chris Lattner	f83726f6ba	add encoder support and tests for rdtscp llvm-svn: 96076	2010-02-13 03:42:24 +00:00
Chris Lattner	140caa7240	remove special cases for vmlaunch, vmresume, vmxoff, and swapgs; fix swapgs to be spelled right. llvm-svn: 96058	2010-02-13 00:41:14 +00:00
Chris Lattner	4ad96055fb	implement infrastructure to support fixups for rip-rel addressing. This isn't complete because I need an MCContext to generate new MCExprs. llvm-svn: 96036	2010-02-12 23:00:36 +00:00
Chris Lattner	12455ca03d	enhance the immediate field encoding to know whether the immediate is pc relative or not, mark call and branches as pcrel. llvm-svn: 96026	2010-02-12 22:27:07 +00:00
Chris Lattner	f7477e599f	add a bunch of mod/rm encoding types for fixed mod/rm bytes. This will work better for the disassembler for modeling things like lfence/monitor/vmcall etc. llvm-svn: 95960	2010-02-12 02:06:33 +00:00
Chris Lattner	44ac89f517	revert r95949, it turns out that adding new prefixes is not a great solution for the disassembler, we'll go with "plan b". llvm-svn: 95957	2010-02-12 01:55:31 +00:00
Chris Lattner	336f9abb45	add another bit of space for new kinds of instruction prefixes. llvm-svn: 95949	2010-02-12 01:15:16 +00:00
Chris Lattner	58827ff98e	port X86InstrInfo::determineREX over to the new encoder. llvm-svn: 95440	2010-02-05 22:10:22 +00:00
Chris Lattner	503243559a	move functions for decoding X86II values into the X86II namespace. llvm-svn: 95410	2010-02-05 19:24:13 +00:00
Chris Lattner	342762fdba	constant propagate a method away. llvm-svn: 95408	2010-02-05 19:20:30 +00:00
Chris Lattner	b8d375fd21	change getSizeOfImm and getBaseOpcodeFor to just take TSFlags directly instead of a TargetInstrDesc. llvm-svn: 95405	2010-02-05 19:16:26 +00:00
Chris Lattner	223084d3ac	enhance new encoder to support prefixes + RawFrm instructions with no operands. It can now handle define void @test2() nounwind { ret void } llvm-svn: 95261	2010-02-03 21:57:59 +00:00
Evan Cheng	4f026f3750	Add two target hooks to determine whether two loads are near and should be scheduled together. llvm-svn: 94147	2010-01-22 03:34:51 +00:00
Evan Cheng	30bebff456	Add a quick pass to optimize sign / zero extension instructions. For targets where the pre-extension values are available in the subreg of the result of the extension, replace the uses of the pre-extension value with the result + extract_subreg. For now, this pass is fairly conservative. It only performs the replacement when both the pre- and post- extension values are used in the block. It will miss cases where the post-extension values are live, but not used. llvm-svn: 93278	2010-01-13 00:30:23 +00:00
Evan Cheng	4216615f99	Add TargetInstrInfo::isCoalescableInstr. It returns true if the specified instruction is copy like where the source and destination registers can overlap. This is to be used by the coalescer to coalesce the source and destination registers of instructions like X86::MOVSX64rr32. Apparently some crazy people believe the coalescer is too simple. llvm-svn: 93210	2010-01-12 00:09:37 +00:00
Evan Cheng	766a73fb04	Add support to 3-addressify 16-bit instructions. llvm-svn: 91104	2009-12-11 06:01:48 +00:00
Dan Gohman	047a767d74	Remove the target hook TargetInstrInfo::BlockHasNoFallThrough in favor of MachineBasicBlock::canFallThrough(), which is target-independent and more thorough. llvm-svn: 90634	2009-12-05 00:44:40 +00:00
David Greene	0508e435c3	Have hasLoad/StoreFrom/ToStackSlot return the relevant MachineMemOperand. llvm-svn: 90608	2009-12-04 22:38:46 +00:00
Bob Wilson	505ddaa4dc	Remove isProfitableToDuplicateIndirectBranch target hook. It is profitable for all the processors where I have tried it, and even when it might not help performance, the cost is quite low. The opportunities for duplicating indirect branches are limited by other factors so code size does not change much due to tail duplicating indirect branches aggressively. llvm-svn: 90144	2009-11-30 18:35:03 +00:00
Bob Wilson	120f729eca	Based on the testcase for pr3120, running on my MacPro with Xeon processors, it is definitely profitable to tail duplicate indirect branches for x86. This is likely to be true to various degrees for all modern x86 processors. llvm-svn: 89865	2009-11-25 17:27:53 +00:00
Evan Cheng	6ad7da96fe	- Change TargetInstrInfo::reMaterialize to pass in TargetRegisterInfo. - If destination is a physical register and it has a subreg index, use the sub-register instead. This fixes PR5423. llvm-svn: 88745	2009-11-14 02:55:43 +00:00
David Greene	2f4c37425b	Fix a bootstrap failure. Provide special isLoadFromStackSlotPostFE and isStoreToStackSlotPostFE interfaces to explicitly request checking for post-frame ptr elimination operands. This uses a heuristic so it isn't reliable for correctness. llvm-svn: 87047	2009-11-13 00:29:53 +00:00
David Greene	70fdd57dc1	Add hasLoadFromStackSlot and hasStoreToStackSlot to return whether a machine instruction loads or stores from/to a stack slot. Unlike isLoadFromStackSlot and isStoreToStackSlot, the instruction may be something other than a pure load/store (e.g. it may be an arithmetic operation with a memory operand). This helps AsmPrinter determine when to print a spill/reload comment. This is only a hint since we may not be able to figure this out in all cases. As such, it should not be relied upon for correctness. Implement for X86. Return false by default for other architectures. llvm-svn: 87026	2009-11-12 20:55:29 +00:00
Dan Gohman	49fa51d936	Fix MachineLICM to use the correct virtual register class when unfolding loads for hoisting. getOpcodeAfterMemoryUnfold returns the opcode of the original operation without the load, not the load itself, MachineLICM needs to know the operand index in order to get the correct register class. Extend getOpcodeAfterMemoryUnfold to return this information. llvm-svn: 85622	2009-10-30 22:18:41 +00:00
Dan Gohman	e919de5acf	Replace X86's CanRematLoadWithDispOperand by calling the target-independent MachineInstr::isInvariantLoad instead, which has the benefit of being more complete. llvm-svn: 83696	2009-10-10 00:34:18 +00:00
Dan Gohman	dd76bb23d1	Add basic infrastructure and x86 support for preserving MachineMemOperand information when unfolding memory references. llvm-svn: 83656	2009-10-09 18:10:05 +00:00
Dan Gohman	be8137b0b4	Replace TargetInstrInfo::isInvariantLoad and its target-specific implementations with a new MachineInstr::isInvariantLoad, which uses MachineMemOperands and is target-independent. This brings MachineLICM and other functionality to targets which previously lacked an isInvariantLoad implementation. llvm-svn: 83475	2009-10-07 17:38:06 +00:00
Dan Gohman	2728569a38	Remove explicit enum integer values. They don't appear to be needed, and they make it less convenient to add new entries. llvm-svn: 83308	2009-10-05 15:52:08 +00:00
Evan Cheng	3cad6283b8	It's not legal to fold a load from a narrower stack slot into a wider instruction. If done, the instruction does a 64-bit load and that's not safe. This can happen when a subreg_to_reg 0 has been coalesced. One exception is when the instruction that folds the load is a move, then we can simply turn it into a 32-bit load from the stack slot. rdar://7170444 llvm-svn: 81494	2009-09-11 00:39:26 +00:00
Evan Cheng	1b38952c99	References to hidden symbols do not have to go through non-lazy pointer in non-pic mode. rdar://7187172. llvm-svn: 80904	2009-09-03 07:04:02 +00:00
Eric Christopher	7dfa9f2e56	Add crc32 instruction and intrinsics. Add a new class of prefix bytes for F2 0F 38 and propagate. Add a FIXME for a set of possibilities which correspond to intrinsics already used. New test. llvm-svn: 78508	2009-08-08 21:55:08 +00:00
Evan Cheng	84517443ca	Let callers decide the sub-register index on the def operand of rematerialized instructions. Avoid remat'ing instructions whose def have sub-register indices for now. It's just really really hard to get all the cases right. llvm-svn: 75900	2009-07-16 09:20:10 +00:00
Evan Cheng	9e0c7f2c5e	Move load / store folding alignment requirement into the table(s). llvm-svn: 75749	2009-07-15 06:10:07 +00:00
Evan Cheng	7997cbf2d5	Undo my brain cramp. llvm-svn: 75290	2009-07-10 21:31:42 +00:00
Evan Cheng	bb00fe0dc6	CMOVxx doesn't swap operands when it's commuted. llvm-svn: 75266	2009-07-10 19:26:57 +00:00
Chris Lattner	d3f32c725b	add a predicate to determine if a global var reference requires a PIC-base to be added in. llvm-svn: 75238	2009-07-10 07:33:30 +00:00
Chris Lattner	ca9d784bf1	change isGlobalStubReference to take target flags instead of a MachineOperand. llvm-svn: 75236	2009-07-10 06:29:59 +00:00
Chris Lattner	377f1d5373	add a new predicate method that says whether a GlobalValue MachineOperand is a reference to a stub, not a reference to the global variable itself. Look no context needed! llvm-svn: 75233	2009-07-10 06:06:17 +00:00
Chris Lattner	72e3deca47	move reasoning about darwin $non_lazy_ptr stubs from asmprinter into isel. llvm-svn: 75117	2009-07-09 06:59:17 +00:00
Chris Lattner	d047d06358	make isel decide whether to emit $stub's on darwin instead of asmprinter. llvm-svn: 75107	2009-07-09 05:27:35 +00:00
Chris Lattner	47f64ea174	move handling of dllimport linkage in isel, not in asmprinter. llvm-svn: 75086	2009-07-09 00:58:53 +00:00
Chris Lattner	49ed726e46	Move all the TLS processing logic into isel, don't do it in asmprinter at all. llvm-svn: 74327	2009-06-26 21:20:29 +00:00
Chris Lattner	2aaad91bbe	start adding logic in isel to determine asm printer semantics, step N of M. llvm-svn: 74246	2009-06-26 00:43:52 +00:00
Chris Lattner	852739b46f	Use target-specific machine operand flags to eliminate a gross hack from the asmprinter. llvm-svn: 74184	2009-06-25 17:38:33 +00:00

174 Commits