; RUN: llc -mtriple=amdgcn-amd-amdhsa -mcpu=gfx803 -stop-after=si-insert-skips < %s | FileCheck --check-prefix=GCN %s
; GCN-LABEL: name: syncscopes
Separate volatility and atomicity/ordering in SelectionDAG
At the moment, we mark every atomic memory access as being also volatile. This is unnecessarily conservative and prohibits many legal transforms (DCE, folding, etc.).
This patch removes MOVolatile from the MachineMemOperands of atomic, but not volatile, instructions. This should be strictly NFC after a series of previous patches which have gone in to ensure backend code is conservative about handling of isAtomic MMOs. Once it's in and baked for a bit, we'll start working through removing unnecessary bailouts one by one. We applied this same strategy to the middle end a few years ago, with good success.
To make sure this patch itself is NFC, it is built on top of a series of other patches which adjust code to (for the moment) be as conservative for an atomic access as for a volatile access and build up a test corpus (mostly in test/CodeGen/X86/atomics-unordered.ll).
Previously landed
D57593 Fix a bug in the definition of isUnordered on MachineMemOperand
D57596 [CodeGen] Be conservative about atomic accesses as for volatile
D57802 Be conservative about unordered accesses for the moment
rL353959: [Tests] First batch of cornercase tests for unordered atomics.
rL353966: [Tests] RMW folding tests w/unordered atomic operations.
rL353972: [Tests] More unordered atomic lowering tests.
rL353989: [SelectionDAG] Inline a single use helper function, and remove last non-MMO interface
rL354740: [Hexagon, SystemZ] Be super conservative about atomics
rL354800: [Lanai] Be super conservative about atomics
rL354845: [ARM] Be super conservative about atomics
Attention Out of Tree Backend Owners: This patch may break you. If it does, you can use the TLI getMMOFlags hook to restore MOVolatile on any instruction that needs it. (See the llvm-dev thread titled "PSA: Changes to how atomics are handled in backends", started Feb 27, 2019.)
Differential Revision: https://reviews.llvm.org/D57601
llvm-svn: 355025
; GCN: FLAT_STORE_DWORD killed renamable $vgpr1_vgpr2, killed renamable $vgpr0, 0, 0, 0, implicit $exec, implicit $flat_scr :: (store syncscope("agent") seq_cst 4 into %ir.agent_out)
; GCN: FLAT_STORE_DWORD killed renamable $vgpr4_vgpr5, killed renamable $vgpr3, 0, 0, 0, implicit $exec, implicit $flat_scr :: (store syncscope("workgroup") seq_cst 4 into %ir.workgroup_out)
; GCN: FLAT_STORE_DWORD killed renamable $vgpr7_vgpr8, killed renamable $vgpr6, 0, 0, 0, implicit $exec, implicit $flat_scr :: (store syncscope("wavefront") seq_cst 4 into %ir.wavefront_out)
define void @syncscopes(
    i32 %agent,
    i32* %agent_out,
    i32 %workgroup,
    i32* %workgroup_out,
    i32 %wavefront,
    i32* %wavefront_out) {
entry:
  store atomic i32 %agent, i32* %agent_out syncscope("agent") seq_cst, align 4
  store atomic i32 %workgroup, i32* %workgroup_out syncscope("workgroup") seq_cst, align 4
  store atomic i32 %wavefront, i32* %wavefront_out syncscope("wavefront") seq_cst, align 4
  ret void
}