llvm-project

Commit Graph

Author	SHA1	Message	Date
Evan Cheng	194c3dc01f	Move CallFrameSetupOpcode and CallFrameDestroyOpcode to TargetInstrInfo. llvm-svn: 134030	2011-06-28 21:14:33 +00:00
Evan Cheng	1e210d08d8	Merge XXXGenRegisterNames.inc into XXXGenRegisterInfo.inc llvm-svn: 134024	2011-06-28 20:07:07 +00:00
Evan Cheng	6cc775f905	- Rename TargetInstrDesc, TargetOperandInfo to MCInstrDesc and MCOperandInfo and sink them into MC layer. - Added MCInstrInfo, which captures the tablegen generated static data. Chang TargetInstrInfo so it's based off MCInstrInfo. llvm-svn: 134021	2011-06-28 19:10:37 +00:00
Evan Cheng	8d71a75777	More refactoring. Move getRegClass from TargetOperandInfo to TargetInstrInfo. llvm-svn: 133944	2011-06-27 21:26:13 +00:00
Evan Cheng	ee9b90a727	Get rid of one getStackAlignment(). RegisterInfo shouldn't need to know about stack alignment. llvm-svn: 133679	2011-06-23 01:53:43 +00:00
Rafael Espindola	defd4b0875	AnalyzeBranch doesn't change which successors a bb has, just the order we try to branch to them. Before we were creating successor lists with duplicated entries. Fixing that found a bug in isBlockOnlyReachableByFallthrough that would causes it to return the wrong answer for ----------- ... jne foo jmp bar foo: ---------- llvm-svn: 132882	2011-06-12 03:20:32 +00:00
Eli Friedman	87ef38784e	PR10092 (second try): Don't crash on a load without a momoperand; fast-isel creates loads like this. llvm-svn: 132826	2011-06-10 01:13:01 +00:00
Eli Friedman	9008377c2d	Revert 132789; it breaks tests. My mistake. llvm-svn: 132795	2011-06-09 19:33:30 +00:00
Eli Friedman	c095116710	Add a check to make sure we don't crash with strange configurations where we do fast-isel, then try to fold instructions. PR10092. llvm-svn: 132789	2011-06-09 18:55:00 +00:00
Jakob Stoklund Olesen	56ce3a0f01	Fix PR10059 and future variations by handling all register subclasses. Add TargetRegisterInfo::hasSubClassEq and use it to check for compatible register classes instead of trying to list all register classes in X86's getLoadStoreRegOpcode. llvm-svn: 132398	2011-06-01 15:32:10 +00:00
Jakob Stoklund Olesen	2348cdd67f	X86AsmPrinter doesn't know how to handle the X86II::MO_GOT_ABSOLUTE_ADDRESS flag after folding ADD32ri to ADD32mi, so don't do that. This only happens when the greedy register allocator gets itself in trouble and spills %vreg9 here: 16L %vreg9<def> = MOVPC32r 0, %ESP<imp-use>; GR32:%vreg9 48L %vreg9<def> = ADD32ri %vreg9, <es:_GLOBAL_OFFSET_TABLE_>[TF=1], %EFLAGS<imp-def,dead>; GR32:%vreg9 That should never happen, the live range should be split instead. llvm-svn: 130625	2011-04-30 23:00:05 +00:00
Chris Lattner	0ab5e2cded	Fix a ton of comment typos found by codespell. Patch by Luis Felipe Strano Moraes! llvm-svn: 129558	2011-04-15 05:18:47 +00:00
Bill Wendling	b902f1dd88	Reapply r129401 with patch for clang. llvm-svn: 129419	2011-04-13 00:36:11 +00:00
Bill Wendling	dbfde42468	Revert r129401 for now. Clang is using the old way of doing things. llvm-svn: 129403	2011-04-12 22:59:27 +00:00
Bill Wendling	47c24875a1	Remove the unaligned load intrinsics in favor of using native unaligned loads. Now that we have a first-class way to represent unaligned loads, the unaligned load intrinsics are superfluous. First part of <rdar://problem/8460511>. llvm-svn: 129401	2011-04-12 22:46:31 +00:00
Andrew Trick	641e2d4f8c	Increased the register pressure limit on x86_64 from 8 to 12 regs. This is the only change in this checkin that may affects the default scheduler. With better register tracking and heuristics, it doesn't make sense to artificially lower the register limit so much. Added -sched-high-latency-cycles and X86InstrInfo::isHighLatencyDef to give the scheduler a way to account for div and sqrt on targets that don't have an itinerary. It is currently defaults to 10 (the actual number doesn't matter much), but only takes effect on non-default schedulers: list-hybrid and list-ilp. Added several heuristics that can be individually disabled for the non-default sched=list-ilp mode. This helps us determine how much better we can do on a given benchmark than the default scheduler. Certain compute intensive loops run much faster in this mode with the right set of heuristics, and it doesn't seem to have much negative impact elsewhere. Not all of the heuristics are needed, but we still need to experiment to decide which should be disabled by default for sched=list-ilp. llvm-svn: 127067	2011-03-05 08:00:22 +00:00
Evan Cheng	3923466e82	Fix bug in X86 folding / unfolding table. Int_CMPSDrm and Int_CMPSSrm memory operands starts at index 2, not 1. rdar://9045024 PR9305 llvm-svn: 126359	2011-02-24 02:36:52 +00:00
NAKAMURA Takumi	0cfdac078e	Target/X86: Tweak win64's tailcall. llvm-svn: 124272	2011-01-26 02:04:09 +00:00
NAKAMURA Takumi	9d29eff198	Fix whitespace. llvm-svn: 124270	2011-01-26 02:03:37 +00:00
Nate Begeman	073901c836	Add support for AVX to materialize +0.0 when doing scalar FP. llvm-svn: 121415	2010-12-09 21:43:51 +00:00
Anton Korobeynikov	d08fbd19f5	Move callee-saved regs spills / reloads to TFI llvm-svn: 120228	2010-11-27 23:05:03 +00:00
Evan Cheng	63c7608c34	Re-enable register pressure aware machine licm with fixes. Hoist() may have erased the instruction during LICM so UpdateRegPressureAfter() should not reference it afterwards. llvm-svn: 116845	2010-10-19 18:58:51 +00:00
Daniel Dunbar	418204e523	Revert r116781 "- Add a hook for target to determine whether an instruction def is", which breaks some nightly tests. llvm-svn: 116816	2010-10-19 17:14:24 +00:00
Evan Cheng	8249dfe6ce	- Add a hook for target to determine whether an instruction def is "long latency" enough to hoist even if it may increase spilling. Reloading a value from spill slot is often cheaper than performing an expensive computation in the loop. For X86, that means machine LICM will hoist SQRT, DIV, etc. ARM will be somewhat aggressive with VFP and NEON instructions. - Enable register pressure aware machine LICM by default. llvm-svn: 116781	2010-10-19 00:55:07 +00:00
Jakob Stoklund Olesen	aec745326a	Remove the x86 MOV{32,64}{rr,rm,mr}_TC instructions. The reg-reg copies were no longer being generated since copyPhysReg copies physical registers only. The loads and stores are not necessary - The TC constraint is imposed by the TAILJMP and TCRETURN instructions, there should be no need for constrained loads and stores. llvm-svn: 116314	2010-10-12 17:15:00 +00:00
Chris Lattner	dd77477690	reapply: Use the new TB_NOT_REVERSABLE flag instead of special reapply: reimplement the second half of the or/add optimization. We should now with no changes. Turns out that one missing "Defs = [EFLAGS]" can upset things a bit. llvm-svn: 116040	2010-10-08 03:57:25 +00:00
Chris Lattner	626656a562	reapply the patch reverted in r116033: "Reimplement (part of) the or -> add optimization. Matching 'or' into 'add'" With a critical fix: the add pseudos clobber EFLAGS. llvm-svn: 116039	2010-10-08 03:54:52 +00:00
Daniel Dunbar	8f21f9c1fb	Revert "Reimplement (part of) the or -> add optimization. Matching 'or' into 'add'", which seems to have broken just about everything. llvm-svn: 116033	2010-10-08 02:07:32 +00:00
Daniel Dunbar	5b2a411c77	Revert "Use the new TB_NOT_REVERSABLE flag instead of special ", which depends on r116007, which I am about to revert. llvm-svn: 116032	2010-10-08 02:07:29 +00:00
Daniel Dunbar	efdf08b5b8	Revert "reimplement the second half of the or/add optimization. We should now", which depends on r116007, which I am about to revert. llvm-svn: 116031	2010-10-08 02:07:26 +00:00
Chris Lattner	134f415bf8	reimplement the second half of the or/add optimization. We should now only end up emitting LEA instead of OR. If we aren't able to promote something into an LEA, we should never be emitting it as an ADD. Add some testcases that we emit "or" in cases where we used to produce an "add". llvm-svn: 116026	2010-10-08 01:05:10 +00:00
Chris Lattner	e2245542ce	Use the new TB_NOT_REVERSABLE flag instead of special casing FsMOVAPDrr/FsMOVAPSrr. llvm-svn: 116016	2010-10-08 00:03:02 +00:00
Chris Lattner	0921bfdf36	simplify some map operations. llvm-svn: 116014	2010-10-07 23:57:02 +00:00
Chris Lattner	4fb38d3cd3	Reimplement (part of) the or -> add optimization. Matching 'or' into 'add' is general goodness because it allows ORs to be converted to LEA to avoid inserting copies. However, this is bad because it makes the generated .s file less obvious and gives valgrind heartburn (tons of false positives in bitfield code). While the general fix should be in valgrind, we can at least try to avoid emitting ADD instructions that don't get promoted to LEA. This is more work because it requires introducing pseudo instructions to represents "add that knows the bits are disjoint", but hey, people really love valgrind. This fixes this testcase: https://bugs.kde.org/show_bug.cgi?id=242137#c20 the add r/i cases are coming next. llvm-svn: 116007	2010-10-07 23:36:18 +00:00
Chris Lattner	1c090c00bc	Reduce casting in various tables by defining the table with the right types. llvm-svn: 116001	2010-10-07 23:08:41 +00:00
Chris Lattner	70a7b54f97	simplify code: don't build up vector only to assert it is empty. llvm-svn: 115997	2010-10-07 22:26:19 +00:00
Jakob Stoklund Olesen	b19bae4e3e	Constrain the offset register to a *_NOSP register class when inserting LEA instructions. This unbreaks the machine code verifier and fixes PR8317. llvm-svn: 115879	2010-10-07 00:07:26 +00:00
Chris Lattner	1a1c600110	Use #NAME# to have the CMOV multiclass define things with the same names as before (e.g. CMOVBE16rr instead of CMOVBErr16). llvm-svn: 115705	2010-10-05 23:00:14 +00:00
Chris Lattner	0067ee02f9	switch CMOVBE to the multipattern: 21 insertions(+), 53 deletions(-) Moar change coming before I switch the rest. llvm-svn: 115697	2010-10-05 22:23:58 +00:00
Chris Lattner	f60062fd55	add basic avx support to the disassembler, also teach it about ssmem/sdmem operands. With this done, we can remove the _Int suffixes from the round instructions without the disassembler blowing up. This allows the assembler to support them, implementing rdar://8456376 - llvm-mc rejects 'roundss' llvm-svn: 115019	2010-09-29 02:57:56 +00:00
Chris Lattner	ff3a3930a0	add asmparser support for cvttpd2dq by removing some Int_ prefixes. Clean up cvttps2dq by removing some redundant implementations of the same instruction. rdar://8456382 llvm-svn: 115018	2010-09-29 02:36:32 +00:00
Chris Lattner	ef1c2fc305	implement rdar://8456382 - cvtsd2si support, by removing some Int_ prefixes. llvm-svn: 115017	2010-09-29 02:24:57 +00:00
Chris Lattner	37fc469f88	fix rdar://8456412 - llvm-mc crash in encoder on "mov %rdx, %cr8" Teaching the code generator about CR8-15, how to rex them up, etc. llvm-svn: 114533	2010-09-22 05:29:50 +00:00
Dan Gohman	534db8a5c8	Avoid emitting a PIC base register if no PIC addresses are needed. This fixes rdar://8396318. llvm-svn: 114201	2010-09-17 20:24:24 +00:00
Anton Korobeynikov	c0b36921c2	Properly handle passing of FP stuff to varargs function on Win64: value should be copied to the corresponding shadow reg as well. Patch by Cameron Esfahani! llvm-svn: 112262	2010-08-27 14:43:06 +00:00
Anton Korobeynikov	88c09879c7	Revert part of one of the prev. patches - tailjmp will follow later. llvm-svn: 111291	2010-08-17 21:08:28 +00:00
Anton Korobeynikov	cd78af6e3c	Enable more win64 calls folding opportunities. Patch by Cameron Esfahani! llvm-svn: 111288	2010-08-17 21:06:01 +00:00
Bruno Cardoso Lopes	7f704b31a9	- Teach SSEDomainFix to switch between different levels of AVX instructions. Here we guess that AVX will have domain issues, so just implement them for consistency and in the future we remove if it's unnecessary. - Make foldMemoryOperandImpl aware of 256-bit zero vectors folding and support the 128-bit counterparts of AVX too. - Make sure MOV[AU]PS instructions are only selected when SSE1 is enabled, and duplicate the patterns to match AVX. - Add a testcase for a simple 128-bit zero vector creation. llvm-svn: 110946	2010-08-12 20:20:53 +00:00
Bruno Cardoso Lopes	1401e040eb	Fix comment order llvm-svn: 110898	2010-08-12 02:08:52 +00:00
Jakob Stoklund Olesen	9c473e46f3	Fix <rdar://problem/8282498> even if it doesn't reproduce on trunk. When a register is defined by a partial load: %reg1234:sub_32 = MOV32mr <fi#-1>; GR64:%reg1234 That load cannot be folded into an instruction using the full 64-bit register. It would become a 64-bit load. This is related to the recent change to have isLoadFromStackSlot return false on a sub-register load. llvm-svn: 110874	2010-08-11 23:08:22 +00:00

1 2 3 4 5 ...

509 Commits