llvm-project

Craig Topper c7a0b2684f	2020-08-14 15:31:50 -07:00

[X86][MC][Target] Initial backend support a tune CPU to support -mtune

This patch implements initial backend support for a -mtune CPU controlled by a "tune-cpu" function attribute. If the attribute is not present X86 will use the resolved CPU from target-cpu attribute or command line.

This patch adds MC layer support a tune CPU. Each CPU now has two sets of features stored in their GenSubtargetInfo.inc tables. These features lists are passed separately to the Processor and ProcessorModel classes in tablegen. The tune list defaults to an empty list to avoid changes to non-X86. This annoyingly increases the size of static tables on all target as we now store 24 more bytes per CPU. I haven't quantified the overall impact, but I can if we're concerned.

One new test is added to X86 to show a few tuning features with mismatched tune-cpu and target-cpu/target-feature attributes to demonstrate independent control. Another new test is added to demonstrate that the scheduler model follows the tune CPU.

I have not added a -mtune to llc/opt or MC layer command line yet. With no attributes we'll just use the -mcpu for both. MC layer tools will always follow the normal CPU for tuning.

Differential Revision: https://reviews.llvm.org/D85165
AsmParser	…
Disassembler	…
MCTargetDesc	[X86][MC][Target] Initial backend support a tune CPU to support -mtune	2020-08-14 15:31:50 -07:00
TargetInfo	…
CMakeLists.txt	…
DelaySlotFiller.cpp	…
LLVMBuild.txt	…
LeonFeatures.td	…
LeonPasses.cpp	…
LeonPasses.h	…
README.txt	…
Sparc.h	[Sparc] Remove unused forward declarations. NFC.	2020-04-23 16:30:44 +01:00
Sparc.td	…
SparcAsmPrinter.cpp	…
SparcCallingConv.td	[llvm] NFC: Fix trivial typo in rst and td files	2020-04-23 14:26:32 +09:00
SparcFrameLowering.cpp	CodeGen: Use Register in TargetFrameLowering	2020-04-07 17:07:44 -04:00
SparcFrameLowering.h	CodeGen: Use Register in TargetFrameLowering	2020-04-07 17:07:44 -04:00
SparcISelDAGToDAG.cpp	…
SparcISelLowering.cpp	[Alignment][NFC] Use proper getter to retrieve alignment from ConstantInt and ConstantSDNode	2020-07-03 08:06:43 +00:00
SparcISelLowering.h	CodeGen: Use Register in TargetLowering	2020-04-08 12:10:58 -04:00
SparcInstr64Bit.td	…
SparcInstrAliases.td	…
SparcInstrFormats.td	…
SparcInstrInfo.cpp	Sparc: Use Register	2020-06-30 16:14:23 -04:00
SparcInstrInfo.h	Sparc: Use Register	2020-06-30 16:14:23 -04:00
SparcInstrInfo.td	[llvm] NFC: Fix trivial typo in rst and td files	2020-04-23 14:26:32 +09:00
SparcInstrVIS.td	…
SparcMCInstLower.cpp	…
SparcMachineFunctionInfo.cpp	…
SparcMachineFunctionInfo.h	Sparc: Use Register	2020-06-30 16:14:23 -04:00
SparcRegisterInfo.cpp	CodeGen: Use Register in TargetFrameLowering	2020-04-07 17:07:44 -04:00
SparcRegisterInfo.h	…
SparcRegisterInfo.td	…
SparcSchedule.td	[llvm] NFC: Fix trivial typo in rst and td files	2020-04-23 14:26:32 +09:00
SparcSubtarget.cpp	[X86][MC][Target] Initial backend support a tune CPU to support -mtune	2020-08-14 15:31:50 -07:00
SparcSubtarget.h	[X86][MC][Target] Initial backend support a tune CPU to support -mtune	2020-08-14 15:31:50 -07:00
SparcTargetMachine.cpp	…
SparcTargetMachine.h	…
SparcTargetObjectFile.cpp	TargetLowering.h - remove unnecessary TargetMachine.h include. NFC	2020-05-23 19:49:38 +01:00
SparcTargetObjectFile.h	…

README.txt

To-do
-----

* Keep the address of the constant pool in a register instead of forming its
  address all of the time.
* We can fold small constant offsets into the %hi/%lo references to constant
  pool addresses as well.
* When in V9 mode, register allocate %fcc[0-3].
* Add support for isel'ing UMUL_LOHI instead of marking it as Expand.
* Emit the 'Branch on Integer Register with Prediction' instructions.  It's
  not clear how to write a pattern for this though:

define float @t1(i32 %a, i32* %p) {
        %C = icmp eq i32 %a, 0
        br i1 %C, label %T, label %F
T:
        store i32 123, i32* %p
        br label %F
F:
        ret float undef
}

codegens to this:

t1:
        save -96, %o6, %o6
1)      subcc %i0, 0, %l0
1)      bne .LBBt1_2    ! F
        nop
.LBBt1_1:       ! T
        or %g0, 123, %l0
        st %l0, [%i1]
.LBBt1_2:       ! F
        restore %g0, %g0, %g0
        retl
        nop

1) should be replaced with a brz in V9 mode.

* Same as above, but emit conditional move on register zero (p192) in V9
  mode.  Testcase:

define i32 @t1(i32 %a, i32 %b) {
        %C = icmp eq i32 %a, 0
        %D = select i1 %C, i32 %a, i32 %b
        ret i32 %D
}

* Emit MULX/[SU]DIVX instructions in V9 mode instead of fiddling
  with the Y register, if they are faster.
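
To illustrate what "fiddling with the Y register" involves, here is a
minimal C++ model; it is a sketch, not backend code. It assumes the V8
behaviour in which SDIV divides the 64-bit value formed by Y and rs1 by
rs2, so a 32-bit signed divide first has to write the sign extension of
the dividend into Y, whereas a V9 SDIVX takes the whole dividend from one
64-bit register. All function names are hypothetical, and the model
ignores SDIV's overflow handling and division by zero.

```cpp
#include <cstdint>

// Hypothetical model of V8 SDIV: the dividend is the 64-bit value Y:rs1.
int32_t sdiv_v8_model(uint32_t y, uint32_t rs1, int32_t rs2) {
    int64_t dividend =
        static_cast<int64_t>((static_cast<uint64_t>(y) << 32) | rs1);
    return static_cast<int32_t>(dividend / rs2);
}

// Hypothetical model of V9 SDIVX: one 64-bit register holds the dividend.
int64_t sdivx_model(int64_t rs1, int64_t rs2) { return rs1 / rs2; }

// 32-bit signed divide the V8 way: Y must hold the sign extension of a.
int32_t div32_via_y(int32_t a, int32_t b) {
    uint32_t y = a < 0 ? 0xFFFFFFFFu : 0u;  // the extra write to %y
    return sdiv_v8_model(y, static_cast<uint32_t>(a), b);
}

// 32-bit signed divide the V9 way: sign-extend the operands, no Y involved.
int32_t div32_via_sdivx(int32_t a, int32_t b) {
    return static_cast<int32_t>(sdivx_model(a, b));
}
```

Both paths give the same quotient; the open question in the item above is
only whether the single 64-bit instruction is faster on a given CPU.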

* Codegen bswap(load)/store(bswap) -> load/store with a little-endian ASI
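
The equivalence this item relies on can be sketched in portable C++. The
model below assumes a big-endian native load (as on SPARC) and treats a
load through a little-endian ASI (for example ASI_PRIMARY_LITTLE in V9)
as reading the same four bytes in little-endian order. The function names
are hypothetical and only illustrate the identity; they are not part of
the backend.

```cpp
#include <cstdint>

// Models a native SPARC (big-endian) 32-bit load from p.
uint32_t load_be32(const unsigned char *p) {
    return (uint32_t(p[0]) << 24) | (uint32_t(p[1]) << 16) |
           (uint32_t(p[2]) << 8) | uint32_t(p[3]);
}

// Plain 32-bit byte swap, i.e. what bswap does to an i32.
uint32_t bswap32(uint32_t v) {
    return (v >> 24) | ((v >> 8) & 0x0000FF00u) |
           ((v << 8) & 0x00FF0000u) | (v << 24);
}

// Models a load through a little-endian ASI: bytes are read in LE order.
uint32_t load_le32(const unsigned char *p) {
    return uint32_t(p[0]) | (uint32_t(p[1]) << 8) |
           (uint32_t(p[2]) << 16) | (uint32_t(p[3]) << 24);
}
```

Since bswap32(load_be32(p)) equals load_le32(p) for any four bytes, the
load and the swap could be selected as a single instruction.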

* Implement frame pointer elimination, e.g. eliminate save/restore for
  leaf functions.
* Fill delay slots

* Use %g0 directly to materialize 0. No instruction is required.
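
Returning to the first two items in this list: the %hi/%lo split they
refer to can be sketched as plain arithmetic. This assumes the usual
32-bit SPARC convention that %hi(x) is the upper 22 bits (placed by
sethi) and %lo(x) is the low 10 bits (supplied as an immediate). The
helper names are hypothetical.

```cpp
#include <cstdint>

// %hi(x): the upper 22 bits, which sethi places in bits 31..10.
uint32_t hi22(uint32_t x) { return x >> 10; }

// %lo(x): the low 10 bits, used as an immediate in the following or/ld.
uint32_t lo10(uint32_t x) { return x & 0x3FFu; }

// Models: sethi %hi(x), r ; or r, %lo(x), r
uint32_t materialize(uint32_t x) { return (hi22(x) << 10) | lo10(x); }

// Folding a small offset: take %hi/%lo of (addr + off) directly instead
// of materializing addr and then adding off with a separate instruction.
uint32_t materialize_folded(uint32_t addr, uint32_t off) {
    return materialize(addr + off);
}
```

Because the split is applied to the final address, folding the offset
into the %hi/%lo references costs no extra instruction.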