Conference proceeding
Implementation of Strassen's Algorithm for Matrix Multiplication
Proceedings of the 1996 ACM/IEEE Conference on Supercomputing, v 1996-, pp 32-32
1996
Abstract
In this paper we report on the development of an efficient and portable implementation of Strassen's matrix mulitplication algorithm. Our implementation is designed to be used in place of DGEMM, the Level 3 BLAS matrix mulitplication routine. Efficient performance will be obtained for all matrix sizes and shapes and the additional memory needed fro temporary variables has been minimized. Replacing DGEMM with our routine should provide a significant performance gain for large matrices while providing the same performance for small matrices. We measure performance of our code on the IBM RS/6000, CRAY YMP C90, and CRAY T3D single processor, and offer comparisons to other codes. Our performance data reconfirms that Strassen's algorithm is practical for realistic size matrices. The usefulness of our implementation is demonstrated by replacing DGEMM with our routine in a large application code.
Metrics
5 Record Views
37 citations in Scopus
Details
- Title
- Implementation of Strassen's Algorithm for Matrix Multiplication
- Creators
- S Huss-Lederman - University of Wisconsin-MadisonE.M JacobsonJ.R JohnsonA TsaoT Turnbull
- Publication Details
- Proceedings of the 1996 ACM/IEEE Conference on Supercomputing, v 1996-, pp 32-32
- Conference
- 1996 ACM/IEEE Conference on Supercomputing
- Publisher
- IEEE
- Number of pages
- 1
- Resource Type
- Conference proceeding
- Language
- English
- Academic Unit
- Computer Science (Computing)
- Scopus ID
- 2-s2.0-33947652414
- Other Identifier
- 991019173961904721