Next: 3 Projects Up: CS 378: High Performance Previous: 1.1 Related Materials

2 Example: Matrix-Matrix Multiplication

Start by considering the FORTRAN implementation of the BLAS routine dgemm.
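As a point of reference, the operation dgemm performs is C := alpha*op(A)*op(B) + beta*C. The following is a minimal sketch in C (not the FORTRAN source discussed here) of the no-transpose case, assuming column-major storage as in FORTRAN. The name dgemm_ref and its argument list are hypothetical and chosen only for illustration.

```c
#include <stddef.h>

/* Hypothetical reference routine, not the BLAS source:
 * computes C := alpha*A*B + beta*C with no transposes.
 * All matrices are column-major, so element (i,j) of A is a[i + j*lda].
 * A is m x k, B is k x n, C is m x n.
 * Unlike the reference BLAS, beta == 0 is not special-cased here,
 * so C must hold finite values on entry. */
void dgemm_ref(size_t m, size_t n, size_t k,
               double alpha, const double *a, size_t lda,
               const double *b, size_t ldb,
               double beta, double *c, size_t ldc)
{
    for (size_t j = 0; j < n; ++j) {
        /* Scale column j of C by beta. */
        for (size_t i = 0; i < m; ++i)
            c[i + j * ldc] *= beta;

        /* Add alpha * A(:,p) * B(p,j) to column j of C for each p. */
        for (size_t p = 0; p < k; ++p) {
            double t = alpha * b[p + j * ldb];
            for (size_t i = 0; i < m; ++i)
                c[i + j * ldc] += t * a[i + p * lda];
        }
    }
}
```

The innermost loop runs down a column, which keeps memory accesses at stride one for column-major data.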

For this routine, I have gone through the following steps:

Notice that typing

yes | rm *.o
make driver

creates a version compiled without an optimizer, whereas typing

yes | rm *.o
make driver3

creates a version compiled with optimization level -O 3.

Some performance numbers (in MFLOPS):

[Figure: performance numbers in MFLOPS (image not available)]
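For readers reproducing such numbers, an MFLOPS rate for matrix-matrix multiplication is commonly derived from the operation count 2*m*n*k (one multiply and one add per inner iteration) divided by the elapsed time. The sketch below assumes that convention; the function name gemm_mflops is hypothetical, and the timing method used for the measurements here is not stated.

```c
/* Hypothetical helper: MFLOPS rate for multiplying an m x k matrix
 * by a k x n matrix, given the measured wall-clock time in seconds.
 * Assumes the usual count of 2*m*n*k floating-point operations. */
double gemm_mflops(double m, double n, double k, double seconds)
{
    /* One multiply and one add for each of the m*n*k inner iterations,
     * scaled from flops per second to millions of flops per second. */
    return 2.0 * m * n * k / (seconds * 1.0e6);
}
```

For example, a 100 x 100 x 100 multiply that takes half a second corresponds to 2,000,000 operations in 0.5 s.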


rvdg@cs.utexas.edu