Research Article

Design and Implementation of Cache Memory with Dual Unit Tile/Line Accessibility

Listing 1

The basic tiled matrix multiplication (MM) version (6-loop tiled code).
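int main()
{
    /* N is the matrix dimension, assumed to be defined elsewhere. */
    double A[N][N], B[N][N], C[N][N];
    int i, j, k, ii, jj, kk;
    /* Outer three loops step over 32 x 32 tiles. */
    for (ii = 0; ii < N; ii += 32)
        for (jj = 0; jj < N; jj += 32)
            for (kk = 0; kk < N; kk += 32)
                /* Inner three loops stay inside one tile, clamped at N. */
                for (i = ii; (i < N && i < ii + 32); i++)
                    for (j = jj; (j < N && j < jj + 32); j++)
                        for (k = kk; (k < N && k < kk + 32); k++)
                            C[i][j] += A[i][k] * B[k][j];
    return 0;
}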