The GotoBLAS codes are among the fastest implementations
of the Basic Linear Algebra Subprograms (BLAS; math/blas).
They support various architectures.

WWW: http://www.tacc.utexas.edu/resources/software/
