A memory-efficient algorithm for large-scale symmetric tridiagonal eigenvalue problem on multi-GPU systems

Paper presented at WorldComp 2014: PDPTA (2014)
Poster presented at the GPU Technology Conference (2014)

Download

Code: https://github.com/hcho3/dstedc_mgpu
GTC poster: [PDF]
WorldComp PDPTA talk: [PPTX] [PDF]
WorldComp PDPTA paper: [Paper]

Synopsis

Divide-and-conquer algorithm is a numerically stable and efficient algorithm that computes the eigenvalues and eigenvectors of a symmetric tridiagonal matrix. We often face the situation where the input matrix fits into the main memory but not into the on-chip memory of a GPU device. We present an out-of-core implementation where only part of the input matrix is resident in GPU memory at any point in time. It works independently of the physical size of GPU memory, handling any size of input as long as it fits into the main memory. Work is dynamically allocated to multiple GPUs and CPU cores, taking account of available workspaces and progress of the algorithm. In addition, it delivers a performance comparable to that of conventional multi-GPU implementations for cases where workspaces fit into the GPU memory.

Publication Details

Conference Paper:
Hyunsu Cho and Peter Yoon. “A Memory-Efficient Algorithm for Large-Scale Symmetric Tridiagonal Eigenvalue Problem on Multi-GPU Systems,” Proceedings of the 2014 International Conference on Parallel and Distributed Processing Techniques and Applications, pp. 568-573, Las Vegas, NV, July 24, 2014.
Poster:
Hyunsu Cho and Peter Yoon. “Symmetric Tridiagonal Eigenvalue Problem on Multi-GPU Systems,” The GPU Technology Conference 2014, San Jose, CA, March 24, 2014.

[← Go back to profile]