llvm-project/llvm/lib/Target/Sparc
Craig Topper 7ec8f43659 [SPARC] Fix fp128 load/stores
The generated code for the split fp128 load/stores was missing a small yet important adjustment to the pointer metadata being fed into `getStore` and `getLoad`, making it out of sync with the effective memory address.
This problem often resulted in instructions being scheduled in the wrong order.

I also took this chance to clean up some "wrong" uses of `getAlignment` as done in D77687.

Thanks @jrtc27 for finding the problem and providing a patch.

Patch by LemonBoy and Jessica Clarke (jrtc27)

Reviewed By: craig.topper

Differential Revision: https://reviews.llvm.org/D94345
2021-01-13 14:59:50 -08:00
AsmParser [Sparc] Fixes for the internal assembler 2021-01-04 13:25:37 +01:00
Disassembler llvmbuildectomy - replace llvm-build by plain cmake 2020-11-13 10:35:24 +01:00
MCTargetDesc [Sparc] SparcMCExpr::printVariantKind - fix Wcovered-switch-default gcc warning. NFCI. 2021-01-04 14:08:44 +00:00
TargetInfo llvmbuildectomy - replace llvm-build by plain cmake 2020-11-13 10:35:24 +01:00
CMakeLists.txt llvmbuildectomy - replace llvm-build by plain cmake 2020-11-13 10:35:24 +01:00
DelaySlotFiller.cpp
LeonFeatures.td
LeonPasses.cpp LeonPasses.h - remove unnecessary includes. NFCI. 2020-09-07 17:51:12 +01:00
LeonPasses.h LeonPasses.h - remove unnecessary includes. NFCI. 2020-09-07 17:51:12 +01:00
README.txt
Sparc.h
Sparc.td
SparcAsmPrinter.cpp [Sparc] Fixes for the internal assembler 2021-01-04 13:25:37 +01:00
SparcCallingConv.td
SparcFrameLowering.cpp [SVE] Return StackOffset for TargetFrameLowering::getFrameIndexReference. 2020-11-05 11:02:18 +00:00
SparcFrameLowering.h [SVE] Return StackOffset for TargetFrameLowering::getFrameIndexReference. 2020-11-05 11:02:18 +00:00
SparcISelDAGToDAG.cpp
SparcISelLowering.cpp [SPARC] Fix fp128 load/stores 2021-01-13 14:59:50 -08:00
SparcISelLowering.h
SparcInstr64Bit.td [Sparc] Fixes for the internal assembler 2021-01-04 13:25:37 +01:00
SparcInstrAliases.td
SparcInstrFormats.td [Sparc] Fixes for the internal assembler 2021-01-04 13:25:37 +01:00
SparcInstrInfo.cpp Sparc: Use Register 2020-06-30 16:14:23 -04:00
SparcInstrInfo.h Sparc: Use Register 2020-06-30 16:14:23 -04:00
SparcInstrInfo.td [Sparc] Fixes for the internal assembler 2021-01-04 13:25:37 +01:00
SparcInstrVIS.td
SparcMCInstLower.cpp
SparcMachineFunctionInfo.cpp
SparcMachineFunctionInfo.h Sparc: Use Register 2020-06-30 16:14:23 -04:00
SparcRegisterInfo.cpp [SVE] Return StackOffset for TargetFrameLowering::getFrameIndexReference. 2020-11-05 11:02:18 +00:00
SparcRegisterInfo.h
SparcRegisterInfo.td
SparcSchedule.td
SparcSubtarget.cpp [X86][MC][Target] Initial backend support a tune CPU to support -mtune 2020-08-14 15:31:50 -07:00
SparcSubtarget.h SparcSubtarget.h - cleanup include dependencies. NFCI. 2020-09-29 16:41:58 +01:00
SparcTargetMachine.cpp [llvm] Use Optional::getValueOr (NFC) 2021-01-12 21:43:50 -08:00
SparcTargetMachine.h
SparcTargetObjectFile.cpp
SparcTargetObjectFile.h

README.txt

To-do
-----

* Keep the address of the constant pool in a register instead of forming its
  address all of the time.
* We can fold small constant offsets into the %hi/%lo references to constant
  pool addresses as well.
* When in V9 mode, register allocate %icc[0-3].
* Add support for isel'ing UMUL_LOHI instead of marking it as Expand.
* Emit the 'Branch on Integer Register with Prediction' instructions.  It's
  not clear how to write a pattern for this though:

define float @t1(i32 %a, i32* %p) {
        %C = icmp eq i32 %a, 0
        br i1 %C, label %T, label %F
T:
        store i32 123, i32* %p
        br label %F
F:
        ret float undef
}

codegens to this:

t1:
        save -96, %o6, %o6
1)      subcc %i0, 0, %l0
1)      bne .LBBt1_2    ! F
        nop
.LBBt1_1:       ! T
        or %g0, 123, %l0
        st %l0, [%i1]
.LBBt1_2:       ! F
        restore %g0, %g0, %g0
        retl
        nop

1) should be replaced with a brz in V9 mode.

* Same as above, but emit conditional move on register zero (p192) in V9
  mode.  Testcase:

define i32 @t1(i32 %a, i32 %b) {
        %C = icmp eq i32 %a, 0
        %D = select i1 %C, i32 %a, i32 %b
        ret i32 %D
}

* Emit MULX/[SU]DIVX instructions in V9 mode instead of fiddling
  with the Y register, if they are faster.
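A rough model of what the multiply items are about, assuming the usual
reading that UMUL_LOHI yields the low and high halves of an unsigned 32x32
multiply, and that the 32-bit multiply leaves the high half in %y. The names
LoHi and umul_lohi are made up for this sketch and are not part of the
backend:

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper type: the two 32-bit halves of a 64-bit product.
struct LoHi {
  uint32_t lo; // low 32 bits (what lands in the destination register)
  uint32_t hi; // high 32 bits (what the 32-bit multiply leaves in %y)
};

// Model of an unsigned 32x32 -> 64 multiply, returned as two halves.
inline LoHi umul_lohi(uint32_t a, uint32_t b) {
  const uint64_t product = static_cast<uint64_t>(a) * b;
  const LoHi r{static_cast<uint32_t>(product),
               static_cast<uint32_t>(product >> 32)};
  // Sanity check: the halves recombine into the full product.
  assert(((static_cast<uint64_t>(r.hi) << 32) | r.lo) == product);
  return r;
}
```

With 64-bit registers the whole product fits in one register, which is
presumably why a single MULX is attractive in V9 mode.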

* Codegen bswap(load)/store(bswap) -> load/store ASI
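A sketch of the source-level shape this item targets: a load whose result is
byte-swapped, and a store of a byte-swapped value. The assumption here is
that both could be selected as a single memory access through a
little-endian ASI instead of a load or store plus shift/mask code. The
function names are illustrative only:

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

// Portable 32-bit byte swap (what the bswap node computes).
constexpr uint32_t bswap32(uint32_t v) {
  return (v >> 24) | ((v >> 8) & 0x0000FF00u) |
         ((v << 8) & 0x00FF0000u) | (v << 24);
}

// bswap(load): read a word, then reverse its bytes.
inline uint32_t load_swapped(const void *p) {
  assert(p != nullptr);
  uint32_t v;
  std::memcpy(&v, p, sizeof v);
  return bswap32(v);
}

// store(bswap): reverse the bytes, then write the word.
inline void store_swapped(void *p, uint32_t v) {
  assert(p != nullptr);
  const uint32_t s = bswap32(v);
  std::memcpy(p, &s, sizeof s);
}
```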

* Implement frame pointer elimination, e.g. eliminate save/restore for
  leaf functions.
* Fill delay slots

* Use %g0 directly to materialize 0. No instruction is required.
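
For the %hi/%lo items at the top of this list, a minimal model of how a
32-bit address is split, assuming the standard SPARC convention that %hi is
the upper 22 bits (set by sethi) and %lo is the low 10 bits. The helper names
are hypothetical. Folding a constant offset means applying %hi/%lo to
(address + offset) as a whole, since adding the offset to the %lo part alone
could overflow the 10-bit field:

```cpp
#include <cassert>
#include <cstdint>

// %hi(addr): upper 22 bits, as placed by sethi.
inline uint32_t hi22(uint32_t addr) { return addr >> 10; }

// %lo(addr): low 10 bits, used as an immediate in or/ld/st.
inline uint32_t lo10(uint32_t addr) { return addr & 0x3FFu; }

// Models: sethi %hi(addr), reg ; or reg, %lo(addr), reg
inline uint32_t materialize(uint32_t addr) {
  return (hi22(addr) << 10) | lo10(addr);
}

// Models folding a small constant offset into the reference itself,
// i.e. %hi(base+offset) and %lo(base+offset).
inline uint32_t materialize_folded(uint32_t base, uint32_t offset) {
  const uint32_t addr = base + offset;
  assert(addr >= base); // this sketch does not handle wraparound
  return materialize(addr);
}
```

Note that an offset can carry into the %hi part (0x13FC + 8 crosses a
1 KiB boundary), so both halves have to be recomputed from the sum.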