llvm-project

Go to file

Roman Gareev 3a18a931a8 Apply all necessary tilings and interchangings to get a macro-kernel This is the second patch to apply the BLIS matmul optimization pattern on matmul kernels (http://www.cs.utexas.edu/users/flame/pubs/TOMS-BLIS-Analytical.pdf). BLIS implements gemm as three nested loops around a macro-kernel, plus two packing routines. The macro-kernel is implemented in terms of two additional loops around a micro-kernel. The micro-kernel is a loop around a rank-1 (i.e., outer product) update. In this change we create the BLIS macro-kernel by applying a combination of tiling and interchanging. In subsequent changes we will implement the packing transformation. Reviewed-by: Tobias Grosser <tobias@grosser.es> Differential Revision: http://reviews.llvm.org/D21491 llvm-svn: 276627		2016-07-25 09:42:53 +00:00
clang	…
clang-tools-extra	…
compiler-rt	…
debuginfo-tests	…
libclc	…
libcxx	…
libcxxabi	…
libunwind	…
lld	…
lldb	…
llgo	…
llvm	…
openmp	…
parallel-libs	…
polly	…