Math Kernel Library Performance

Intel Math Kernel Library Performance Specifications

All Math Kernel Library routines are optimized for high performance on the Pentium® and Pentium Pro processors. The BLAS level 3 routines, which perform matrix-matrix operations, are further optimized for efficient cache usage, which gives them excellent performance on a single processor. These routines are also multithreaded and scale well on up to four processors.
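To illustrate what "efficient cache usage" can mean for a matrix-matrix product, here is a minimal sketch of loop blocking (tiling), a common general technique. It is not the library's implementation, which the text does not describe; the function name `blocked_matmul` and the `block` parameter are hypothetical and chosen only for this example.

```python
def blocked_matmul(a, b, block=2):
    """Multiply row-major nested lists a (m x k) and b (k x n) tile by tile.

    Illustrative only: working on small tiles keeps the operands of the
    inner loops close together in memory, which is the idea behind
    cache blocking. The tile size here is an arbitrary assumption.
    """
    m, k, n = len(a), len(b), len(b[0])
    c = [[0.0] * n for _ in range(m)]
    for i0 in range(0, m, block):          # tile rows of A and C
        for p0 in range(0, k, block):      # tile the shared dimension
            for j0 in range(0, n, block):  # tile columns of B and C
                for i in range(i0, min(i0 + block, m)):
                    for p in range(p0, min(p0 + block, k)):
                        a_ip = a[i][p]
                        for j in range(j0, min(j0 + block, n)):
                            c[i][j] += a_ip * b[p][j]
    return c
```

The result is the same as an unblocked triple loop; only the order in which the partial products are accumulated changes.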

The graphs that follow show double-precision performance for two of the BLAS level 3 routines, DGEMM and DSYMM, running on a 200 MHz Pentium Pro processor. DGEMM computes a double-precision general matrix-matrix product, while DSYMM computes a double-precision matrix-matrix product in which one of the matrices is symmetric. For DGEMM, NN indicates that neither input matrix is transposed. For DSYMM, LL means that the symmetric matrix is on the left and its lower triangle is used. For both functions, the suffixes -1, -2, and -4 denote runs on one, two, and four processors, respectively.
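As a rough guide to what the two benchmarked cases compute, the sketch below gives unoptimized reference semantics in plain Python. It follows the standard BLAS definitions (C = alpha*A*B + beta*C) but is not the library's code or calling interface; the function names `dgemm_nn` and `dsymm_ll` are hypothetical, and matrices are assumed to be row-major nested lists.

```python
def dgemm_nn(alpha, a, b, beta, c):
    """Return alpha*A*B + beta*C with neither A nor B transposed (the NN case)."""
    m, k, n = len(a), len(b), len(b[0])
    return [[alpha * sum(a[i][p] * b[p][j] for p in range(k)) + beta * c[i][j]
             for j in range(n)]
            for i in range(m)]


def dsymm_ll(alpha, a, b, beta, c):
    """Return alpha*A*B + beta*C with symmetric A on the left (the LL case).

    Only the lower triangle of A (column index <= row index) is read;
    entries above the diagonal are ignored and taken from the mirror position.
    """
    m, n = len(a), len(b[0])

    def sym(i, p):
        # Read a[i][p] from the lower triangle, mirroring when p > i.
        return a[i][p] if p <= i else a[p][i]

    return [[alpha * sum(sym(i, p) * b[p][j] for p in range(m)) + beta * c[i][j]
             for j in range(n)]
            for i in range(m)]
```

For example, calling `dsymm_ll` with `a = [[2.0, 99.0], [1.0, 3.0]]` treats A as the symmetric matrix with rows (2, 1) and (1, 3); the 99.0 above the diagonal is never read.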

DGEMM on 200 MHz Pentium® Pro Processor

DSYMM on 200 MHz Pentium® Pro Processor