llvm-project

Stanislav Mekhanoshin 3197eb6981 [AMDGPU] Optimize SI_IF lowering for simple if regions	2017-07-26 21:29:15 +00:00

Currently SI_IF results in a s_and_saveexec_b64 followed by s_xor_b64. The xor is used to extract only the changed bits. In case of a simple if region where the only use of that value is in the SI_END_CF to restore the old exec mask, we can omit the xor and perform an or of the exec mask with the original exec value saved by the s_and_saveexec_b64.

Differential Revision: https://reviews.llvm.org/D35861
llvm-svn: 309185
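The mask arithmetic described in this commit message can be checked with a small standalone model. The sketch below is not code from SILowerControlFlow.cpp: it treats exec masks as plain 64-bit integers, and the names `IfState`, `lowerIfWithXor`, `lowerIfSimple`, `endCf` and `checkSimpleIf` are hypothetical helpers introduced only for illustration. It assumes a simple if region in which the exec mask is not modified between SI_IF and SI_END_CF.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical model: one bit per lane, masks as 64-bit integers.
struct IfState {
  std::uint64_t exec;   // exec mask inside the "then" region
  std::uint64_t saved;  // value handed to SI_END_CF
};

// Lowering with the xor: s_and_saveexec_b64 followed by s_xor_b64.
inline IfState lowerIfWithXor(std::uint64_t exec, std::uint64_t cond) {
  std::uint64_t saved = exec;            // s_and_saveexec_b64 saves old exec
  std::uint64_t thenExec = exec & cond;  // and narrows exec to cond
  saved ^= thenExec;                     // s_xor_b64 keeps only changed bits
  return IfState{thenExec, saved};
}

// Simple-region lowering: the xor is omitted, old exec is kept as saved.
inline IfState lowerIfSimple(std::uint64_t exec, std::uint64_t cond) {
  return IfState{exec & cond, exec};
}

// SI_END_CF modeled as an or of the current exec with the saved value.
inline std::uint64_t endCf(std::uint64_t exec, std::uint64_t saved) {
  return exec | saved;
}

// Both lowerings run the same lanes and restore the same exec mask.
inline void checkSimpleIf(std::uint64_t oldExec, std::uint64_t cond) {
  IfState a = lowerIfWithXor(oldExec, cond);
  IfState b = lowerIfSimple(oldExec, cond);
  assert(a.exec == b.exec);
  assert(endCf(a.exec, a.saved) == oldExec);
  assert(endCf(b.exec, b.saved) == oldExec);
}
```

In this model the xor form saves `oldExec & ~cond` and the simple form saves `oldExec`. Because the exec mask inside the region is a subset of `oldExec`, the or restores `oldExec` either way, which is why the xor is redundant when SI_END_CF is the only user of the saved value.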
AsmParser	[AMDGPU][MC][GFX9] Added support of VOP3 'op_sel' modifier	2017-07-21 13:54:11 +00:00
Disassembler	AMDGPU: Add instruction definitions for some scratch_* instructions	2017-07-21 15:36:16 +00:00
InstPrinter	AMDGPU: Fix allocating pseudo-registers	2017-07-24 18:06:15 +00:00
MCTargetDesc	Fully fix the movw/movt addend.	2017-07-11 23:18:25 +00:00
TargetInfo	fix trivial typos; NFC	2017-07-02 03:24:54 +00:00
Utils	AMDGPU: Fix using SMRD instructions for argument loads in functions	2017-07-26 20:39:42 +00:00
AMDGPU.h	AMDGPU: Implement memory model	2017-07-21 21:19:23 +00:00
AMDGPU.td	AMDGPU: Add instruction definitions for some scratch_* instructions	2017-07-21 15:36:16 +00:00
AMDGPUAliasAnalysis.cpp	Sort the remaining #include lines in include/... and lib/....	2017-06-06 11:49:48 +00:00
AMDGPUAliasAnalysis.h	AMDGPU/R600: Fix amdgpu alias analysis pass.	2017-03-31 19:26:23 +00:00
AMDGPUAlwaysInlinePass.cpp	[AMDGPU] Testing commit access only, no real change	2017-06-15 23:02:55 +00:00
AMDGPUAnnotateKernelFeatures.cpp	AMDGPU: Annotate necessity of flat-scratch-init	2017-07-18 16:44:58 +00:00
AMDGPUAnnotateUniformValues.cpp	AMDGPU: Fix converting unanalyzable global loads to SMRD	2017-07-12 23:06:18 +00:00
AMDGPUAsmPrinter.cpp	AMDGPU: Remove duplicate print outs from .AMDGPU.csdata	2017-07-16 19:24:08 +00:00
AMDGPUAsmPrinter.h	Sort the remaining #include lines in include/... and lib/....	2017-06-06 11:49:48 +00:00
AMDGPUCallLowering.cpp	Sort the remaining #include lines in include/... and lib/....	2017-06-06 11:49:48 +00:00
AMDGPUCallLowering.h	AMDGPU: Start defining a calling convention	2017-05-17 21:56:25 +00:00
AMDGPUCallingConv.td	AMDGPU: Start defining a calling convention	2017-05-17 21:56:25 +00:00
AMDGPUCodeGenPrepare.cpp	AMDGPU : Widen extending scalar loads to 32-bits.	2017-07-26 21:07:28 +00:00
AMDGPUFrameLowering.cpp	[AMDGPU] Split R600/SI getFrameIndexReference and emit stack object offsets for SI	2017-03-10 19:39:07 +00:00
AMDGPUFrameLowering.h	[AMDGPU] Split R600/SI getFrameIndexReference and emit stack object offsets for SI	2017-03-10 19:39:07 +00:00
AMDGPUGenRegisterBankInfo.def	Re-commit AMDGPU/GlobalISel: Add support for simple shaders	2017-01-30 21:56:46 +00:00
AMDGPUISelDAGToDAG.cpp	[AMDGPU][MC][GFX9] Added support of VOP3 'op_sel' modifier	2017-07-21 13:54:11 +00:00
AMDGPUISelLowering.cpp	fix typos in comments; NFC	2017-07-16 08:11:56 +00:00
AMDGPUISelLowering.h	AMDGPU: Return correct type during argument lowering	2017-07-15 05:52:59 +00:00
AMDGPUInstrInfo.cpp	[AMDGPU] SDWA: merge VI and GFX9 pseudo instructions	2017-06-21 08:53:38 +00:00
AMDGPUInstrInfo.h	Sort the remaining #include lines in include/... and lib/....	2017-06-06 11:49:48 +00:00
AMDGPUInstrInfo.td	[AMDGPU] simplify add x, *ext (setcc) => addc|subb x, 0, setcc	2017-06-21 22:05:06 +00:00
AMDGPUInstructionSelector.cpp	AMDGPU: Start adding offset fields to flat instructions	2017-06-12 15:55:58 +00:00
AMDGPUInstructionSelector.h	Sort the remaining #include lines in include/... and lib/....	2017-06-06 11:49:48 +00:00
AMDGPUInstructions.td	[AMDGPU][MC] Added check for truncation of SOPK imm operand	2017-04-26 15:34:19 +00:00
AMDGPUIntrinsicInfo.cpp	Rename AttributeSet to AttributeList	2017-03-21 16:57:19 +00:00
AMDGPUIntrinsicInfo.h	…
AMDGPUIntrinsics.td	AMDGPU: Remove legacy bfe intrinsics	2017-04-03 18:08:08 +00:00
AMDGPULegalizerInfo.cpp	AMDGPU/GlobalISel: Mark 32-bit G_OR as legal	2017-07-26 20:00:53 +00:00
AMDGPULegalizerInfo.h	Re-commit AMDGPU/GlobalISel: Add support for simple shaders	2017-01-30 21:56:46 +00:00
AMDGPULowerIntrinsics.cpp	Extend memcpy expansion in Transform/Utils to handle wider operand types.	2017-07-07 02:00:06 +00:00
AMDGPUMCInstLower.cpp	Sort the remaining #include lines in include/... and lib/....	2017-06-06 11:49:48 +00:00
AMDGPUMCInstLower.h	…
AMDGPUMachineCFGStructurizer.cpp	fix trivial typos, NFC	2017-06-27 10:35:37 +00:00
AMDGPUMachineFunction.cpp	AMDGPU: Start defining a calling convention	2017-05-17 21:56:25 +00:00
AMDGPUMachineFunction.h	Sort the remaining #include lines in include/... and lib/....	2017-06-06 11:49:48 +00:00
AMDGPUMachineModuleInfo.cpp	AMDGPU: Implement memory model	2017-07-21 21:19:23 +00:00
AMDGPUMachineModuleInfo.h	AMDGPU: Implement memory model	2017-07-21 21:19:23 +00:00
AMDGPUMacroFusion.cpp	AMDGPU: Add macro fusion schedule DAG mutation	2017-07-06 20:57:05 +00:00
AMDGPUMacroFusion.h	AMDGPU: Add macro fusion schedule DAG mutation	2017-07-06 20:57:05 +00:00
AMDGPUOpenCLImageTypeLoweringPass.cpp	…
AMDGPUPTNote.h	[AMDGPU] Restructure code object metadata creation	2017-03-22 22:32:22 +00:00
AMDGPUPromoteAlloca.cpp	[AMDGPU] Fix for issue in alloca to vector promotion pass	2017-06-09 14:16:22 +00:00
AMDGPURegAsmNames.inc.cpp	AMDGPU: Work around build special casing .inc files	2017-06-08 19:25:21 +00:00
AMDGPURegisterBankInfo.cpp	[RegisterBankInfo] Uniquely allocate instruction mapping.	2017-05-05 22:48:22 +00:00
AMDGPURegisterBankInfo.h	Sort the remaining #include lines in include/... and lib/....	2017-06-06 11:49:48 +00:00
AMDGPURegisterBanks.td	Re-commit AMDGPU/GlobalISel: Add support for simple shaders	2017-01-30 21:56:46 +00:00
AMDGPURegisterInfo.cpp	Sort the remaining #include lines in include/... and lib/....	2017-06-06 11:49:48 +00:00
AMDGPURegisterInfo.h	AMDGPU: Start defining a calling convention	2017-05-17 21:56:25 +00:00
AMDGPURegisterInfo.td	…
AMDGPUSubtarget.cpp	AMDGPU: Add encoding for carryless add/sub instructions	2017-07-20 17:42:47 +00:00
AMDGPUSubtarget.h	AMDGPU: Add encoding for carryless add/sub instructions	2017-07-20 17:42:47 +00:00
AMDGPUTargetMachine.cpp	AMDGPU: Implement memory model	2017-07-21 21:19:23 +00:00
AMDGPUTargetMachine.h	TargetMachine: Indicate whether machine verifier passes.	2017-05-31 18:41:23 +00:00
AMDGPUTargetObjectFile.cpp	Move Object format code to lib/BinaryFormat.	2017-06-07 03:48:56 +00:00
AMDGPUTargetObjectFile.h	[AMDGPU] Get address space mapping by target triple environment	2017-03-27 14:04:01 +00:00
AMDGPUTargetTransformInfo.cpp	[LoopUnroll] Pass SCEV to getUnrollingPreferences hook. NFCI.	2017-06-28 15:53:17 +00:00
AMDGPUTargetTransformInfo.h	[LoopUnroll] Pass SCEV to getUnrollingPreferences hook. NFCI.	2017-06-28 15:53:17 +00:00
AMDGPUUnifyDivergentExitNodes.cpp	AMDGPU: Unify divergent function exits.	2017-03-24 19:52:05 +00:00
AMDGPUUnifyMetadata.cpp	[AMDGPU] Turn AMDGPUUnifyMetadata back into module pass	2017-01-27 16:38:10 +00:00
AMDILCFGStructurizer.cpp	Remove unused functions. Remove static qualifier from functions in header files. NFC.	2017-04-11 14:55:32 +00:00
AMDKernelCodeT.h	…
BUFInstructions.td	AMDGPU: Implement memory model	2017-07-21 21:19:23 +00:00
CMakeLists.txt	AMDGPU: Implement memory model	2017-07-21 21:19:23 +00:00
CaymanInstructions.td	…
DSInstructions.td	[AMDGPU][MC] New syntax for ds_swizzle_b32 offset	2017-05-31 16:26:47 +00:00
EvergreenInstructions.td	AMDGPU: Fix unnecessary ands when packing f16 vectors	2017-03-15 19:04:26 +00:00
FLATInstructions.td	AMDGPU: Implement memory model	2017-07-21 21:19:23 +00:00
GCNHazardRecognizer.cpp	Sort the remaining #include lines in include/... and lib/....	2017-06-06 11:49:48 +00:00
GCNHazardRecognizer.h	AMDGPU: Fix broken condition in hazard recognizer	2017-03-17 21:36:28 +00:00
GCNIterativeScheduler.cpp	[CodeGen] Rename DEBUG_TYPE to match passnames	2017-07-11 22:08:28 +00:00
GCNIterativeScheduler.h	[AMDGPU] Iterative scheduling infrastructure + minimal registry scheduler	2017-03-21 13:15:46 +00:00
GCNMinRegStrategy.cpp	[CodeGen] Rename DEBUG_TYPE to match passnames	2017-07-11 22:08:28 +00:00
GCNRegPressure.cpp	Implement LaneBitmask::getNumLanes and LaneBitmask::getHighestLane	2017-07-20 19:43:19 +00:00
GCNRegPressure.h	[AMDGPU] Fix incorrect register usage tracking in GCNUpwardTracker	2017-05-22 13:09:40 +00:00
GCNSchedStrategy.cpp	[CodeGen] Rename DEBUG_TYPE to match passnames	2017-07-11 22:08:28 +00:00
GCNSchedStrategy.h	fix typos in comments and error messges; NFC	2017-07-13 06:48:39 +00:00
LLVMBuild.txt	AMDGPU: Add GlobalISel to required_libraries.	2017-01-28 18:13:08 +00:00
MIMGInstructions.td	[AMDGPU] Fix latency of MIMG instructions	2017-07-04 14:43:38 +00:00
Processors.td	AMDGPU: Whitespace fixes	2017-06-26 03:01:36 +00:00
R600ClauseMergePass.cpp	[LegacyPassManager] Remove TargetMachine constructors	2017-05-18 17:21:13 +00:00
R600ControlFlowFinalizer.cpp	[AMDGPU] Fix -Wimplicit-fallthrough warnings. NFCI.	2017-07-07 10:18:57 +00:00
R600Defines.h	…
R600EmitClauseMarkers.cpp	Sort the remaining #include lines in include/... and lib/....	2017-06-06 11:49:48 +00:00
R600ExpandSpecialInstrs.cpp	Sort the remaining #include lines in include/... and lib/....	2017-06-06 11:49:48 +00:00
R600FrameLowering.cpp	Sort the remaining #include lines in include/... and lib/....	2017-06-06 11:49:48 +00:00
R600FrameLowering.h	[AMDGPU] Split R600/SI getFrameIndexReference and emit stack object offsets for SI	2017-03-10 19:39:07 +00:00
R600ISelLowering.cpp	Add DAG argument to canMergeStoresTo NFC.	2017-07-10 20:25:54 +00:00
R600ISelLowering.h	Add DAG argument to canMergeStoresTo NFC.	2017-07-10 20:25:54 +00:00
R600InstrFormats.td	…
R600InstrInfo.cpp	Sort the remaining #include lines in include/... and lib/....	2017-06-06 11:49:48 +00:00
R600InstrInfo.h	Cyle -> Cycle; NFCI	2017-03-15 15:37:42 +00:00
R600Instructions.td	[AMDGPU] Get address space mapping by target triple environment	2017-03-27 14:04:01 +00:00
R600Intrinsics.td	AMDGPU: Make intrinsics speculatable	2017-05-02 16:57:44 +00:00
R600MachineFunctionInfo.cpp	…
R600MachineFunctionInfo.h	…
R600MachineScheduler.cpp	[CodeGen] Rename DEBUG_TYPE to match passnames	2017-07-11 22:08:28 +00:00
R600MachineScheduler.h	[AMDGPU, PowerPC, TableGen] Fix some Clang-tidy modernize and Include What You Use warnings; other minor fixes (NFC).	2016-12-09 22:06:55 +00:00
R600OptimizeVectorRegisters.cpp	[LegacyPassManager] Remove TargetMachine constructors	2017-05-18 17:21:13 +00:00
R600Packetizer.cpp	Sort the remaining #include lines in include/... and lib/....	2017-06-06 11:49:48 +00:00
R600RegisterInfo.cpp	AMDGPU: Start defining a calling convention	2017-05-17 21:56:25 +00:00
R600RegisterInfo.h	AMDGPU: Start defining a calling convention	2017-05-17 21:56:25 +00:00
R600RegisterInfo.td	[AMDGPU] Add INDIRECT_BASE_ADDR to R600_Reg32 class (PR33045)	2017-05-23 21:27:15 +00:00
R600Schedule.td	…
R700Instructions.td	…
SIAnnotateControlFlow.cpp	Remove now useless trailing nullptr in StructType::get	2017-05-11 08:46:02 +00:00
SIDebuggerInsertNops.cpp	Sort the remaining #include lines in include/... and lib/....	2017-06-06 11:49:48 +00:00
SIDefines.h	AMDGPU: Introduce maybeAtomic instruction flag	2017-07-21 21:05:45 +00:00
SIFixControlFlowLiveIntervals.cpp	…
SIFixSGPRCopies.cpp	[AMDGPU] Eliminate SGPR to VGPR copy when possible	2017-06-20 18:32:42 +00:00
SIFixVGPRCopies.cpp	[AMDGPU] Add VGPR copies post regalloc fix pass	2017-01-24 17:46:17 +00:00
SIFoldOperands.cpp	AMDGPU: Fix crash when folding immediates into multiple uses	2017-07-18 14:54:41 +00:00
SIFrameLowering.cpp	AMDGPU: Annotate necessity of flat-scratch-init	2017-07-18 16:44:58 +00:00
SIFrameLowering.h	AMDGPU: Setup SP/FP in callee function prolog/epilog	2017-06-26 17:53:59 +00:00
SIISelLowering.cpp	TargetLowering: Change isShuffleMaskLegal's mask argument type to ArrayRef<int>. NFCI.	2017-07-26 08:06:58 +00:00
SIISelLowering.h	TargetLowering: Change isShuffleMaskLegal's mask argument type to ArrayRef<int>. NFCI.	2017-07-26 08:06:58 +00:00
SIInsertSkips.cpp	AMDGPU: Rename SI_RETURN	2017-03-21 22:18:10 +00:00
SIInsertWaitcnts.cpp	AMDGPU: Partially fix improper reliance on memoperands	2017-07-21 18:54:54 +00:00
SIInsertWaits.cpp	AMDGPU: Make auto waitcnt before barrier a feature	2017-06-02 17:40:26 +00:00
SIInstrFormats.td	AMDGPU: Introduce maybeAtomic instruction flag	2017-07-21 21:05:45 +00:00
SIInstrInfo.cpp	AMDGPU: Fix getMemOpBaseRegImmOfs for flat with offsets	2017-07-21 18:06:36 +00:00
SIInstrInfo.h	AMDGPU: Don't track lgkmcnt for global_/scratch_ instructions	2017-07-21 18:34:51 +00:00
SIInstrInfo.td	AMDGPU: Remove leftover td file	2017-07-22 00:40:46 +00:00
SIInstructions.td	AMDGPU: Introduce maybeAtomic instruction flag	2017-07-21 21:05:45 +00:00
SIIntrinsics.td	AMDGPU: Remove legacy export intrinsic	2017-04-04 16:34:39 +00:00
SILoadStoreOptimizer.cpp	[LegacyPassManager] Remove TargetMachine constructors	2017-05-18 17:21:13 +00:00
SILowerControlFlow.cpp	[AMDGPU] Optimize SI_IF lowering for simple if regions	2017-07-26 21:29:15 +00:00
SILowerI1Copies.cpp	Sort the remaining #include lines in include/... and lib/....	2017-06-06 11:49:48 +00:00
SIMachineFunctionInfo.cpp	AMDGPU: Annotate necessity of flat-scratch-init	2017-07-18 16:44:58 +00:00
SIMachineFunctionInfo.h	AMDGPU: Figure out private memory regs after lowering	2017-07-18 16:44:56 +00:00
SIMachineScheduler.cpp	AMDGPU/SI: Fix Depth and Height computation for SI scheduler	2017-07-25 20:37:03 +00:00
SIMachineScheduler.h	AMDGPU/SI: Force exports at the end for SI scheduler	2017-07-25 20:36:58 +00:00
SIMemoryLegalizer.cpp	AMDGPU: Implement memory model	2017-07-21 21:19:23 +00:00
SIOptimizeExecMasking.cpp	…
SIPeepholeSDWA.cpp	[AMDGPU] SDWA: several fixes for V_CVT and VOPC instructions	2017-06-27 15:02:23 +00:00
SIRegisterInfo.cpp	AMDGPU: Preserve undef flag in eliminateFrameIndex	2017-07-21 19:31:44 +00:00
SIRegisterInfo.h	AMDGPU: Partially fix implicit.buffer.ptr intrinsic handling	2017-06-26 03:01:31 +00:00
SIRegisterInfo.td	AMDGPU: Fix allocating pseudo-registers	2017-07-24 18:06:15 +00:00
SISchedule.td	AMDGPU: Implement early ifcvt target hooks.	2017-01-25 04:25:02 +00:00
SIShrinkInstructions.cpp	AMDGPU: Allow SIShrinkInstructions to fold FrameIndexes	2017-07-10 20:04:35 +00:00
SIWholeQuadMode.cpp	[AMDGPU, PowerPC, TableGen] Fix some Clang-tidy modernize and Include What You Use warnings; other minor fixes (NFC).	2016-12-09 22:06:55 +00:00
SMInstructions.td	AMDGPUAnnotateUniformValue should always treat volatile loads as divergent	2017-06-02 15:25:52 +00:00
SOPInstructions.td	Resubmit r303859 with test fixed.	2017-05-26 20:38:26 +00:00
VIInstrFormats.td	…
VIInstructions.td	AMDGPU: Add VI i16 support	2016-11-10 16:02:37 +00:00
VOP1Instructions.td	[AMDGPU] SDWA: merge VI and GFX9 pseudo instructions	2017-06-21 08:53:38 +00:00
VOP2Instructions.td	AMDGPU: Add encoding for carryless add/sub instructions	2017-07-20 17:42:47 +00:00
VOP3Instructions.td	[AMDGPU][MC][GFX9] Added support of VOP3 'op_sel' modifier	2017-07-21 13:54:11 +00:00
VOP3PInstructions.td	[AMDGPU][MC] Added missing VOP3P opcodes	2017-07-18 09:24:10 +00:00
VOPCInstructions.td	[AMDGPU] resubmit r308179: CodeGen: check dst operand type to determine if omod is supported for VOP3 instructions	2017-07-18 14:23:26 +00:00
VOPInstructions.td	[AMDGPU][MC][GFX9] Added support of VOP3 'op_sel' modifier	2017-07-21 13:54:11 +00:00