forked from OSchip/llvm-project
f13a59bcff
This patch uses the TileInfo introduced in D77550 to generate a loop nest for tiled matrix multiplication, instead of generating the unrolled code for the whole multiplication. This makes code-generation more scalable for larger matrixes. Initially loops are only used if both the number of rows and columns are divisible by the tile size. Other cases will be added as follow-up. Reviewers: anemet, Gerolf, hfinkel, andrew.w.kaylor, LuoYuanke, nicolasvasilache Reviewed By: anemet Differential Revision: https://reviews.llvm.org/D81308 |
||
---|---|---|
.. | ||
bigger-expressions-double.ll | ||
const-gep.ll | ||
load-align-volatile.ll | ||
multiply-add-sub-double-row-major.ll | ||
multiply-double-contraction-fmf.ll | ||
multiply-double-contraction.ll | ||
multiply-double-row-major.ll | ||
multiply-double.ll | ||
multiply-float-contraction-fmf.ll | ||
multiply-float-contraction.ll | ||
multiply-float.ll | ||
multiply-fused-loops.ll | ||
multiply-fused-multiple-blocks.ll | ||
multiply-fused-volatile.ll | ||
multiply-fused.ll | ||
multiply-i32-row-major.ll | ||
multiply-i32.ll | ||
multiply-minimal.ll | ||
propagate-backward.ll | ||
propagate-backwards-unsupported.ll | ||
propagate-forward.ll | ||
propagate-mixed-users.ll | ||
propagate-multiple-iterations.ll | ||
remarks-inlining.ll | ||
remarks-shared-subtrees.ll | ||
remarks.ll | ||
store-align-volatile.ll | ||
strided-load-double.ll | ||
strided-load-float.ll | ||
strided-load-i32.ll | ||
strided-store-double.ll | ||
strided-store-float.ll | ||
strided-store-i32.ll | ||
transpose-double-row-major.ll | ||
transpose-double.ll | ||
transpose-float-row-major.ll | ||
transpose-float.ll | ||
transpose-i32-row-major.ll | ||
transpose-i32.ll |