forked from OSchip/llvm-project
b3224adfb6
This is the fourth patch applying the BLIS matmul optimization pattern to matmul kernels (http://www.cs.utexas.edu/users/flame/pubs/TOMS-BLIS-Analytical.pdf). BLIS implements gemm as three nested loops around a macro-kernel, plus two packing routines. The macro-kernel is implemented as two additional loops around a micro-kernel. The micro-kernel is a loop around a rank-1 (i.e., outer product) update. This change copies the data into the newly created arrays, which is the last step in implementing the packing transformation. Reviewed-by: Tobias Grosser <tobias@grosser.es>. Differential Revision: https://reviews.llvm.org/D23260. llvm-svn: 281441 |
||
---|---|---|
.. | ||
Canonicalization.cpp | ||
CodePreparation.cpp | ||
DeadCodeElimination.cpp | ||
FlattenAlgo.cpp | ||
FlattenSchedule.cpp | ||
ScheduleOptimizer.cpp |
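
The loop structure described in the commit message above (three loops around a macro-kernel, two packing copies, two macro-kernel loops around a micro-kernel that performs rank-1 updates) can be sketched in plain C++ as follows. This is only an illustration under stated assumptions, not the code that the patch or Polly generates: the function name `blisLikeGemm`, the block sizes `NC`, `KC`, `MC`, `NR`, `MR`, and the simple contiguous packing layout are all hypothetical choices made for readability. A real implementation derives the block sizes from cache and register parameters and packs into micro-panel layouts.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical block sizes chosen only to keep the example small.
constexpr std::size_t NC = 8, KC = 4, MC = 4; // cache-level blocking
constexpr std::size_t NR = 2, MR = 2;         // register-level blocking

// Computes C += A * B for row-major A (M x K), B (K x N), C (M x N).
void blisLikeGemm(std::size_t M, std::size_t N, std::size_t K,
                  const std::vector<double> &A, const std::vector<double> &B,
                  std::vector<double> &C) {
  assert(A.size() == M * K && B.size() == K * N && C.size() == M * N);
  std::vector<double> packedA(MC * KC), packedB(KC * NC);

  for (std::size_t jc = 0; jc < N; jc += NC) { // loop 1: column panels of B, C
    const std::size_t nc = std::min(NC, N - jc);
    for (std::size_t pc = 0; pc < K; pc += KC) { // loop 2: the K dimension
      const std::size_t kc = std::min(KC, K - pc);
      // Packing routine 1: copy a kc x nc panel of B into a contiguous array.
      for (std::size_t p = 0; p < kc; ++p)
        for (std::size_t j = 0; j < nc; ++j)
          packedB[p * nc + j] = B[(pc + p) * N + jc + j];

      for (std::size_t ic = 0; ic < M; ic += MC) { // loop 3: row blocks of A, C
        const std::size_t mc = std::min(MC, M - ic);
        // Packing routine 2: copy an mc x kc block of A into a contiguous array.
        for (std::size_t i = 0; i < mc; ++i)
          for (std::size_t p = 0; p < kc; ++p)
            packedA[i * kc + p] = A[(ic + i) * K + pc + p];

        // Macro-kernel: two loops around the micro-kernel.
        for (std::size_t jr = 0; jr < nc; jr += NR) {
          const std::size_t nr = std::min(NR, nc - jr);
          for (std::size_t ir = 0; ir < mc; ir += MR) {
            const std::size_t mr = std::min(MR, mc - ir);
            // Micro-kernel: a loop over p, each iteration a rank-1 update of
            // an mr x nr tile of C (column of packedA times row of packedB).
            for (std::size_t p = 0; p < kc; ++p)
              for (std::size_t i = 0; i < mr; ++i)
                for (std::size_t j = 0; j < nr; ++j)
                  C[(ic + ir + i) * N + jc + jr + j] +=
                      packedA[(ir + i) * kc + p] * packedB[p * nc + jr + j];
          }
        }
      }
    }
  }
}
```

The two copy loops correspond to the step the commit message calls "copying to created arrays": once the packed buffers exist, the kernels read only from them, so the innermost loops touch contiguous memory instead of strided rows of the original matrices.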