llvm-project

History

Momchil Velikov 980c3e6dd2 [CodeGen] Async unwind - add a pass to fix CFI information This pass inserts the necessary CFI instructions to compensate for the inconsistency of the call-frame information caused by linear (non-CFG aware) nature of the unwind tables. Unlike the `CFIInstrInserer` pass, this one almost always emits only `.cfi_remember_state`/`.cfi_restore_state`, which results in smaller unwind tables and also transparently handles custom unwind info extensions like CFA offset adjustement and save locations of SVE registers. This pass takes advantage of the constraints that LLVM imposes on the placement of save/restore points (cf. `ShrinkWrap.cpp`): * there is a single basic block, containing the function prologue * possibly multiple epilogue blocks, where each epilogue block is complete and self-contained, i.e. CSR restore instructions (and the corresponding CFI instructions are not split across two or more blocks. * prologue and epilogue blocks are outside of any loops Thus, during execution, at the beginning and at the end of each basic block the function can be in one of two states: - "has a call frame", if the function has executed the prologue, or has not executed any epilogue - "does not have a call frame", if the function has not executed the prologue, or has executed an epilogue These properties can be computed for each basic block by a single RPO traversal. In order to accommodate backends which do not generate unwind info in epilogues we compute an additional property "strong no call frame on entry" which is set for the entry point of the function and for every block reachable from the entry along a path that does not execute the prologue. If this property holds, it takes precedence over the "has a call frame" property. From the point of view of the unwind tables, the "has/does not have call frame" state at beginning of each block is determined by the state at the end of the previous block, in layout order. Where these states differ, we insert compensating CFI instructions, which come in two flavours: - CFI instructions, which reset the unwind table state to the initial one. This is done by a target specific hook and is expected to be trivial to implement, for example it could be: ``` .cfi_def_cfa <sp>, 0 .cfi_same_value <rN> .cfi_same_value <rN-1> ... ``` where `<rN>` are the callee-saved registers. - CFI instructions, which reset the unwind table state to the one created by the function prologue. These are the sequence: ``` .cfi_restore_state .cfi_remember_state ``` In this case we also insert a `.cfi_remember_state` after the last CFI instruction in the function prologue. Reviewed By: MaskRay, danielkiss, chill Differential Revision: https://reviews.llvm.org/D114545		2022-04-04 14:38:22 +01:00
..
AsmPrinter	[CodeGen] Async unwind - add a pass to fix CFI information	2022-04-04 14:38:22 +01:00
GlobalISel	[AMDGPU][GlobalISel] Scalarize add/sub with overflow ops in the legalizer	2022-03-31 21:46:34 +05:30
LiveDebugValues	[CodeGen] Apply clang-tidy fixes for readability-redundant-smartptr-get (NFC)	2022-03-20 23:11:06 -07:00
MIRParser	Cleanup include: codegen second round	2022-03-23 13:54:00 +01:00
SelectionDAG	[DAG] SimplifySetCC - clang-format add/xor/sub with constant handling. NFC.	2022-04-04 13:30:17 +01:00
AggressiveAntiDepBreaker.cpp	[llvm] Remove redundant member initialization (NFC)	2022-01-07 17:45:09 -08:00
AggressiveAntiDepBreaker.h	…
AllocationOrder.cpp	Cleanup includes: final pass	2022-03-29 09:00:21 +02:00
AllocationOrder.h	…
Analysis.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
AtomicExpandPass.cpp	Fix computation of MadeChange bit in AtomicExpandPass.	2022-03-18 13:47:11 -07:00
BasicBlockSections.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
BasicTargetTransformInfo.cpp	…
BranchFolding.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
BranchFolding.h	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
BranchRelaxation.cpp	Cleanup includes: final pass	2022-03-29 09:00:21 +02:00
BreakFalseDeps.cpp	Cleanup includes: final pass	2022-03-29 09:00:21 +02:00
CFGuardLongjmp.cpp	…
CFIFixup.cpp	[CodeGen] Async unwind - add a pass to fix CFI information	2022-04-04 14:38:22 +01:00
CFIInstrInserter.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
CMakeLists.txt	[CodeGen] Async unwind - add a pass to fix CFI information	2022-04-04 14:38:22 +01:00
CalcSpillWeights.cpp	[NFC] Expose isRematerializable and copyHint from CalcSpillWeights	2022-01-04 08:11:49 -08:00
CallingConvLower.cpp	Cleanup includes: final pass	2022-03-29 09:00:21 +02:00
CodeGen.cpp	[CodeGen] Async unwind - add a pass to fix CFI information	2022-04-04 14:38:22 +01:00
CodeGenCommonISel.cpp	Reapply: StackProtector: ignore debug insts when splitting blocks.	2022-02-14 10:58:22 +00:00
CodeGenPassBuilder.cpp	…
CodeGenPrepare.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
CommandFlags.cpp	Reland "Lower `@llvm.global_dtors` using `__cxa_atexit` on MachO"	2022-03-23 18:36:55 -07:00
CriticalAntiDepBreaker.cpp	[llvm] Remove redundant member initialization (NFC)	2022-01-07 17:45:09 -08:00
CriticalAntiDepBreaker.h	…
DFAPacketizer.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
DeadMachineInstructionElim.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
DetectDeadLanes.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
DwarfEHPrepare.cpp	Reland "[ARM] __cxa_end_cleanup should be called instead of _UnwindResume."	2021-10-28 21:45:09 +02:00
EHContGuardCatchret.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
EarlyIfConversion.cpp	[NFC][CodeGen] Rename some functions in MachineInstr.h and remove duplicated comments	2022-03-16 20:25:42 +08:00
EdgeBundles.cpp	…
ExecutionDomainFix.cpp	…
ExpandMemCmp.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
ExpandPostRAPseudos.cpp	[NFC][CodeGen] Rename some functions in MachineInstr.h and remove duplicated comments	2022-03-16 20:25:42 +08:00
ExpandReductions.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
ExpandVectorPredication.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
FEntryInserter.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
FaultMaps.cpp	…
FinalizeISel.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
FixupStatepointCallerSaved.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
FuncletLayout.cpp	…
GCMetadata.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
GCMetadataPrinter.cpp	…
GCRootLowering.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
GlobalMerge.cpp	[llvm] Use range-based for loops (NFC)	2021-11-28 18:14:49 -08:00
HardwareLoops.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
IfConversion.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
ImplicitNullChecks.cpp	[llvm] Use none_of instead of \!any_of (NFC)	2021-12-17 13:48:57 -08:00
IndirectBrExpandPass.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
InlineSpiller.cpp	[NFC][CodeGen] Rename some functions in MachineInstr.h and remove duplicated comments	2022-03-16 20:25:42 +08:00
InterferenceCache.cpp	[CodeGen] Use range-based for loops (NFC)	2021-12-03 20:45:59 -08:00
InterferenceCache.h	[CodeGen] Use = default (NFC)	2022-02-06 10:54:44 -08:00
InterleavedAccessPass.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
InterleavedLoadCombinePass.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
IntrinsicLowering.cpp	…
JMCInstrumenter.cpp	[JMCInstrument] infer proper path style based on debug info	2022-03-10 10:50:44 -08:00
LLVMTargetMachine.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
LatencyPriorityQueue.cpp	[llvm] Use range-based for loops (NFC)	2021-11-28 18:14:49 -08:00
LazyMachineBlockFrequencyInfo.cpp	[CodeGen] Apply clang-tidy fixes for readability-redundant-smartptr-get (NFC)	2022-03-20 23:11:06 -07:00
LexicalScopes.cpp	…
LiveDebugVariables.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
LiveDebugVariables.h	[GlobalISel] Allow DBG_VALUE to use undefined vregs before LiveDebugValues.	2021-12-05 15:55:59 -05:00
LiveInterval.cpp	[CodeGen] Tweak whitespace in LiveInterval printing	2021-11-11 15:19:32 +00:00
LiveIntervalCalc.cpp	Cleanup includes: final pass	2022-03-29 09:00:21 +02:00
LiveIntervalUnion.cpp	[nfc][regalloc] const LiveIntervals within the allocator	2022-02-03 12:35:36 -08:00
LiveIntervals.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
LivePhysRegs.cpp	[llvm] Use llvm::reverse (NFC)	2021-11-06 19:31:18 -07:00
LiveRangeCalc.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
LiveRangeEdit.cpp	[NFC][CodeGen] Rename some functions in MachineInstr.h and remove duplicated comments	2022-03-16 20:25:42 +08:00
LiveRangeShrink.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
LiveRangeUtils.h	[NFC][llvm] Inclusive language: remove instance of master in LiveRangeUtils.h	2021-11-23 13:07:42 -06:00
LiveRegMatrix.cpp	[nfc][regalloc] const LiveIntervals within the allocator	2022-02-03 12:35:36 -08:00
LiveRegUnits.cpp	…
LiveStacks.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
LiveVariables.cpp	[CodeGen] Use range-based for loops (NFC)	2021-12-03 20:45:59 -08:00
LocalStackSlotAllocation.cpp	[CodeGen] Return better Changed status from LocalStackSlotAllocation	2022-02-17 09:31:41 +00:00
LoopTraversal.cpp	…
LowLevelType.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
LowerEmuTLS.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
MBFIWrapper.cpp	…
MIRCanonicalizerPass.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
MIRFSDiscriminator.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
MIRNamerPass.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
MIRPrinter.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
MIRPrintingPass.cpp	…
MIRSampleProfile.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
MIRVRegNamerUtils.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
MIRVRegNamerUtils.h	…
MIRYamlMapping.cpp	…
MLRegallocEvictAdvisor.cpp	[mlgo] Fix build breaks introduced by includes cleanups	2022-03-21 13:49:40 -07:00
MachineBasicBlock.cpp	[NFC][CodeGen] Rename some functions in MachineInstr.h and remove duplicated comments	2022-03-16 20:25:42 +08:00
MachineBlockFrequencyInfo.cpp	…
MachineBlockPlacement.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
MachineBranchProbabilityInfo.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
MachineCSE.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
MachineCheckDebugify.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
MachineCombiner.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
MachineCopyPropagation.cpp	[MachineCopyPropagation] More robust isForwardableRegClassCopy	2022-03-21 16:41:01 +00:00
MachineCycleAnalysis.cpp	Reapply CycleInfo: Introduce cycles as a generalization of loops	2021-12-10 14:36:43 +05:30
MachineDebugify.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
MachineDominanceFrontier.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
MachineDominators.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
MachineFrameInfo.cpp	…
MachineFunction.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
MachineFunctionPass.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
MachineFunctionPrinterPass.cpp	…
MachineFunctionSplitter.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
MachineInstr.cpp	[NFC][CodeGen] Rename some functions in MachineInstr.h and remove duplicated comments	2022-03-16 20:25:42 +08:00
MachineInstrBundle.cpp	[CodeGen] Apply clang-tidy fixes for readability-redundant-smartptr-get (NFC)	2022-03-20 23:11:06 -07:00
MachineLICM.cpp	[MachineLICM] Simplify code and avoid adding nullptr values to ParentMap. NFC	2022-03-15 01:24:01 -07:00
MachineLoopInfo.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
MachineLoopUtils.cpp	[NFC][CodeGen] Rename some functions in MachineInstr.h and remove duplicated comments	2022-03-16 20:25:42 +08:00
MachineModuleInfo.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
MachineModuleInfoImpls.cpp	…
MachineModuleSlotTracker.cpp	[CodeGen] Use default member initialization (NFC)	2022-01-30 12:32:51 -08:00
MachineOperand.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
MachineOptimizationRemarkEmitter.cpp	[llvm] Remove redundant member initialization (NFC)	2022-01-07 17:45:09 -08:00
MachineOutliner.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
MachinePassManager.cpp	…
MachinePipeliner.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
MachinePostDominators.cpp	…
MachineRegionInfo.cpp	Revert "[NFC] Remove LinkAll*.h"	2021-11-02 09:08:09 -07:00
MachineRegisterInfo.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
MachineSSAContext.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
MachineSSAUpdater.cpp	[DebugInfo] Attempt to preserve more information during tail duplication	2021-12-03 15:30:05 +00:00
MachineScheduler.cpp	[CodeGen] Define ABI breaking class members correctly	2022-03-24 12:42:59 +03:00
MachineSink.cpp	[MachineSink] Check block prologue interference	2022-03-22 11:15:37 +09:00
MachineSizeOpts.cpp	[NFC] Use Optional<ProfileCount> to model invalid counts	2021-11-14 19:03:30 -08:00
MachineStableHash.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
MachineStripDebug.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
MachineTraceMetrics.cpp	[llvm] Use llvm::reverse (NFC)	2021-12-12 16:13:49 -08:00
MachineVerifier.cpp	[NFC][CodeGen] Rename some functions in MachineInstr.h and remove duplicated comments	2022-03-16 20:25:42 +08:00
MacroFusion.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
ModuloSchedule.cpp	[NFC][CodeGen] Rename some functions in MachineInstr.h and remove duplicated comments	2022-03-16 20:25:42 +08:00
MultiHazardRecognizer.cpp	…
NonRelocatableStringpool.cpp	Move STLFunctionalExtras out of STLExtras	2022-01-24 14:13:21 +01:00
OptimizePHIs.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
PHIElimination.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
PHIEliminationUtils.cpp	…
PHIEliminationUtils.h	…
ParallelCG.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
PatchableFunction.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
PeepholeOptimizer.cpp	[NFC][CodeGen] Rename some functions in MachineInstr.h and remove duplicated comments	2022-03-16 20:25:42 +08:00
PostRAHazardRecognizer.cpp	[CodeGen] Apply clang-tidy fixes for readability-redundant-smartptr-get (NFC)	2022-03-20 23:11:06 -07:00
PostRASchedulerList.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
PreISelIntrinsicLowering.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
ProcessImplicitDefs.cpp	[NFC][CodeGen] Rename some functions in MachineInstr.h and remove duplicated comments	2022-03-16 20:25:42 +08:00
PrologEpilogInserter.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
PseudoProbeInserter.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
PseudoSourceValue.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
RDFGraph.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
RDFLiveness.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
RDFRegisters.cpp	…
README.txt	…
ReachingDefAnalysis.cpp	Use RegisterInfo::regsOverlaps instead of checking aliases	2022-02-26 20:32:12 +01:00
RegAllocBase.cpp	[nfc][regalloc] const LiveIntervals within the allocator	2022-02-03 12:35:36 -08:00
RegAllocBase.h	[nfc][regalloc] const LiveIntervals within the allocator	2022-02-03 12:35:36 -08:00
RegAllocBasic.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
RegAllocEvictionAdvisor.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
RegAllocEvictionAdvisor.h	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
RegAllocFast.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
RegAllocGreedy.cpp	RegAllocGreedy: Fix typo	2022-03-31 16:30:01 -04:00
RegAllocGreedy.h	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
RegAllocPBQP.cpp	[CodeGen] Use range-based for loops (NFC)	2021-12-04 08:48:05 -08:00
RegAllocScore.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
RegAllocScore.h	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
RegUsageInfoCollector.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
RegUsageInfoPropagate.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
RegisterBank.cpp	[nfc][codegen] Move RegisterBank[Info].h under CodeGen	2022-03-01 21:53:25 -08:00
RegisterBankInfo.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
RegisterClassInfo.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
RegisterCoalescer.cpp	[NFC][CodeGen] Rename some functions in MachineInstr.h and remove duplicated comments	2022-03-16 20:25:42 +08:00
RegisterCoalescer.h	…
RegisterPressure.cpp	…
RegisterScavenging.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
RegisterUsageInfo.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
RemoveRedundantDebugValues.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
RenameIndependentSubregs.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
ReplaceWithVeclib.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
ResetMachineFunctionPass.cpp	…
SafeStack.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
SafeStackLayout.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
SafeStackLayout.h	[SafeStack] Use Align instead of uint64_t	2021-12-15 14:40:56 -08:00
ScheduleDAG.cpp	[llvm] Use llvm::reverse (NFC)	2021-12-12 16:13:49 -08:00
ScheduleDAGInstrs.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
ScheduleDAGPrinter.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
ScoreboardHazardRecognizer.cpp	[llvm] Remove redundant member initialization (NFC)	2022-01-07 17:45:09 -08:00
ShadowStackGCLowering.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
ShrinkWrap.cpp	[ShrinkWrap] check for PPC's non-callee-saved LR	2022-01-11 10:01:34 -08:00
SjLjEHPrepare.cpp	…
SlotIndexes.cpp	[CodeGen] Use default member initialization (NFC)	2022-01-30 12:32:51 -08:00
SpillPlacement.cpp	…
SpillPlacement.h	…
SplitKit.cpp	Simplify mask creation with llvm::seq. NFCI.	2022-02-05 23:35:41 +01:00
SplitKit.h	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
StackColoring.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
StackMapLivenessAnalysis.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
StackMaps.cpp	…
StackProtector.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
StackSlotColoring.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
SwiftErrorValueTracking.cpp	…
SwitchLoweringUtils.cpp	…
TailDuplication.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
TailDuplicator.cpp	[CodeGen] Async unwind - add a pass to fix CFI information	2022-04-04 14:38:22 +01:00
TargetFrameLoweringImpl.cpp	[CodeGen] Async unwind - add a pass to fix CFI information	2022-04-04 14:38:22 +01:00
TargetInstrInfo.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
TargetLoweringBase.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
TargetLoweringObjectFileImpl.cpp	Reland "Lower `@llvm.global_dtors` using `__cxa_atexit` on MachO"	2022-03-23 18:36:55 -07:00
TargetOptionsImpl.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
TargetPassConfig.cpp	[CodeGen] Async unwind - add a pass to fix CFI information	2022-04-04 14:38:22 +01:00
TargetRegisterInfo.cpp	Reduce dependencies on llvm/BinaryFormat/Dwarf.h	2022-02-04 11:44:03 +01:00
TargetSchedule.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
TargetSubtargetInfo.cpp	[regalloc] Remove -consider-local-interval-cost	2022-03-14 10:49:16 -07:00
TwoAddressInstructionPass.cpp	[NFC][CodeGen] Rename some functions in MachineInstr.h and remove duplicated comments	2022-03-16 20:25:42 +08:00
TypePromotion.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
UnreachableBlockElim.cpp	[NFC][CodeGen] Rename some functions in MachineInstr.h and remove duplicated comments	2022-03-16 20:25:42 +08:00
VLIWMachineScheduler.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
ValueTypes.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
VirtRegMap.cpp	[CodeGen] Use MachineInstr::operands (NFC)	2021-11-11 07:10:30 -08:00
WasmEHPrepare.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
WinEHPrepare.cpp	Cleanup codegen includes	2022-03-16 08:43:00 +01:00
XRayInstrumentation.cpp	Reapply [xray] add support for hexagon	2021-12-10 05:32:28 -08:00

README.txt

//===---------------------------------------------------------------------===//

Common register allocation / spilling problem:

        mul lr, r4, lr
        str lr, [sp, #+52]
        ldr lr, [r1, #+32]
        sxth r3, r3
        ldr r4, [sp, #+52]
        mla r4, r3, lr, r4

can be:

        mul lr, r4, lr
        mov r4, lr
        str lr, [sp, #+52]
        ldr lr, [r1, #+32]
        sxth r3, r3
        mla r4, r3, lr, r4

and then "merge" mul and mov:

        mul r4, r4, lr
        str r4, [sp, #+52]
        ldr lr, [r1, #+32]
        sxth r3, r3
        mla r4, r3, lr, r4

It also increase the likelihood the store may become dead.

//===---------------------------------------------------------------------===//

bb27 ...
        ...
        %reg1037 = ADDri %reg1039, 1
        %reg1038 = ADDrs %reg1032, %reg1039, %noreg, 10
    Successors according to CFG: 0x8b03bf0 (#5)

bb76 (0x8b03bf0, LLVM BB @0x8b032d0, ID#5):
    Predecessors according to CFG: 0x8b0c5f0 (#3) 0x8b0a7c0 (#4)
        %reg1039 = PHI %reg1070, mbb<bb76.outer,0x8b0c5f0>, %reg1037, mbb<bb27,0x8b0a7c0>

Note ADDri is not a two-address instruction. However, its result %reg1037 is an
operand of the PHI node in bb76 and its operand %reg1039 is the result of the
PHI node. We should treat it as a two-address code and make sure the ADDri is
scheduled after any node that reads %reg1039.

//===---------------------------------------------------------------------===//

Use local info (i.e. register scavenger) to assign it a free register to allow
reuse:
        ldr r3, [sp, #+4]
        add r3, r3, #3
        ldr r2, [sp, #+8]
        add r2, r2, #2
        ldr r1, [sp, #+4]  <==
        add r1, r1, #1
        ldr r0, [sp, #+4]
        add r0, r0, #2

//===---------------------------------------------------------------------===//

LLVM aggressively lift CSE out of loop. Sometimes this can be negative side-
effects:

R1 = X + 4
R2 = X + 7
R3 = X + 15

loop:
load [i + R1]
...
load [i + R2]
...
load [i + R3]

Suppose there is high register pressure, R1, R2, R3, can be spilled. We need
to implement proper re-materialization to handle this:

R1 = X + 4
R2 = X + 7
R3 = X + 15

loop:
R1 = X + 4  @ re-materialized
load [i + R1]
...
R2 = X + 7 @ re-materialized
load [i + R2]
...
R3 = X + 15 @ re-materialized
load [i + R3]

Furthermore, with re-association, we can enable sharing:

R1 = X + 4
R2 = X + 7
R3 = X + 15

loop:
T = i + X
load [T + 4]
...
load [T + 7]
...
load [T + 15]
//===---------------------------------------------------------------------===//

It's not always a good idea to choose rematerialization over spilling. If all
the load / store instructions would be folded then spilling is cheaper because
it won't require new live intervals / registers. See 2003-05-31-LongShifts for
an example.

//===---------------------------------------------------------------------===//

With a copying garbage collector, derived pointers must not be retained across
collector safe points; the collector could move the objects and invalidate the
derived pointer. This is bad enough in the first place, but safe points can
crop up unpredictably. Consider:

        %array = load { i32, [0 x %obj] }** %array_addr
        %nth_el = getelementptr { i32, [0 x %obj] }* %array, i32 0, i32 %n
        %old = load %obj** %nth_el
        %z = div i64 %x, %y
        store %obj* %new, %obj** %nth_el

If the i64 division is lowered to a libcall, then a safe point will (must)
appear for the call site. If a collection occurs, %array and %nth_el no longer
point into the correct object.

The fix for this is to copy address calculations so that dependent pointers
are never live across safe point boundaries. But the loads cannot be copied
like this if there was an intervening store, so may be hard to get right.

Only a concurrent mutator can trigger a collection at the libcall safe point.
So single-threaded programs do not have this requirement, even with a copying
collector. Still, LLVM optimizations would probably undo a front-end's careful
work.

//===---------------------------------------------------------------------===//

The ocaml frametable structure supports liveness information. It would be good
to support it.

//===---------------------------------------------------------------------===//

The FIXME in ComputeCommonTailLength in BranchFolding.cpp needs to be
revisited. The check is there to work around a misuse of directives in inline
assembly.

//===---------------------------------------------------------------------===//

It would be good to detect collector/target compatibility instead of silently
doing the wrong thing.

//===---------------------------------------------------------------------===//

It would be really nice to be able to write patterns in .td files for copies,
which would eliminate a bunch of explicit predicates on them (e.g. no side
effects).  Once this is in place, it would be even better to have tblgen
synthesize the various copy insertion/inspection methods in TargetInstrInfo.

//===---------------------------------------------------------------------===//

Stack coloring improvements:

1. Do proper LiveStacks analysis on all stack objects including those which are
   not spill slots.
2. Reorder objects to fill in gaps between objects.
   e.g. 4, 1, <gap>, 4, 1, 1, 1, <gap>, 4 => 4, 1, 1, 1, 1, 4, 4

//===---------------------------------------------------------------------===//

The scheduler should be able to sort nearby instructions by their address. For
example, in an expanded memset sequence it's not uncommon to see code like this:

  movl $0, 4(%rdi)
  movl $0, 8(%rdi)
  movl $0, 12(%rdi)
  movl $0, 0(%rdi)

Each of the stores is independent, and the scheduler is currently making an
arbitrary decision about the order.

//===---------------------------------------------------------------------===//

Another opportunitiy in this code is that the $0 could be moved to a register:

  movl $0, 4(%rdi)
  movl $0, 8(%rdi)
  movl $0, 12(%rdi)
  movl $0, 0(%rdi)

This would save substantial code size, especially for longer sequences like
this. It would be easy to have a rule telling isel to avoid matching MOV32mi
if the immediate has more than some fixed number of uses. It's more involved
to teach the register allocator how to do late folding to recover from
excessive register pressure.