forked from OSchip/llvm-project
5d0ffbeb4d
This patch adds support for explicitly highlighting sub-expressions shared by multiple leaf nodes. For example consider the following code %shared.load = tail call <8 x double> @llvm.matrix.columnwise.load.v8f64.p0f64(double* %arg1, i32 %stride, i32 2, i32 4), !dbg !10, !noalias !10 %trans = tail call <8 x double> @llvm.matrix.transpose.v8f64(<8 x double> %shared.load, i32 2, i32 4), !dbg !10 tail call void @llvm.matrix.columnwise.store.v8f64.p0f64(<8 x double> %trans, double* %arg3, i32 10, i32 4, i32 2), !dbg !10 %load.2 = tail call <30 x double> @llvm.matrix.columnwise.load.v30f64.p0f64(double* %arg3, i32 %stride, i32 2, i32 15), !dbg !10, !noalias !10 %mult = tail call <60 x double> @llvm.matrix.multiply.v60f64.v8f64.v30f64(<8 x double> %trans, <30 x double> %load.2, i32 4, i32 2, i32 15), !dbg !11 tail call void @llvm.matrix.columnwise.store.v60f64.p0f64(<60 x double> %mult, double* %arg2, i32 10, i32 4, i32 15), !dbg !11 We have two leaf nodes (the 2 stores) and the first store stores %trans which is also used by the matrix multiply %mult. We generate separate remarks for each leaf (stores). To denote that parts are shared, the shared expressions are marked as shared (), with a reference to the other remark that shares it. The operation summary also denotes the shared operations separately. Reviewers: anemet, Gerolf, thegameg, hfinkel, andrew.w.kaylor, LuoYuanke Reviewed By: anemet Differential Revision: https://reviews.llvm.org/D72526 |
||
---|---|---|
.. | ||
bigger-expressions-double.ll | ||
multiply-double-contraction-fmf.ll | ||
multiply-double-contraction.ll | ||
multiply-double.ll | ||
multiply-float-contraction-fmf.ll | ||
multiply-float-contraction.ll | ||
multiply-float.ll | ||
multiply-i32.ll | ||
propagate-backward.ll | ||
propagate-backwards-unsupported.ll | ||
propagate-forward.ll | ||
propagate-mixed-users.ll | ||
propagate-multiple-iterations.ll | ||
remarks-shared-subtrees.ll | ||
remarks.ll | ||
strided-load-double.ll | ||
strided-load-float.ll | ||
strided-load-i32.ll | ||
strided-store-double.ll | ||
strided-store-float.ll | ||
strided-store-i32.ll | ||
transpose-double.ll | ||
transpose-float.ll | ||
transpose-i32.ll |