Generating optimal CUDA sparse matrix-vector product implementations for evolving GPU hardware

El Zein, Ahmed; Rendell, Alistair

Generating optimal CUDA sparse matrix-vector product implementations for evolving GPU hardware

Date

2012

Authors

El Zein, Ahmed

Rendell, Alistair

Publisher

John Wiley & Sons Inc

Abstract

The CUDA model for graphics processing units (GPUs) presents the programmer with a plethora of different programming options. These includes different memory types, different memory access methods and different data types. Identifying which options to use and when is a non-trivial exercise. This paper explores the effect of these different options on the performance of a routine that evaluates sparse matrix-vector products (SpMV) across three different generations of NVIDIA GPU hardware. A process for analysing performance and selecting the subset of implementations that perform best is proposed. The potential for mapping sparse matrix attributes to optimal CUDA SpMV implementations is discussed.

Keywords

Keywords: CUDA; Fermi; GPU; Matrix-vector; NVIDIA; S2050; sparse; Multicore programming; Optimization; Program processors; Matrix algebra CUDA; Fermi; GPU; matrix-vector; NVIDIA; S2050; sparse

URI

http://hdl.handle.net/1885/63386

Collections

ANU Research Publications

Source

Concurrency and Computation: Practice and Experience

Type

Journal article

DOI

10.1002/cpe.1732

Restricted until

2037-12-31

Downloads

File

Description

01_El Zein_Generating_optimal_CUDA_sparse_2012.pdf (2.32 MB)

Full item page

Generating optimal CUDA sparse matrix-vector product implementations for evolving GPU hardware

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

URI

Collections

Source

Type

Book Title

Entity type

Access Statement

License Rights

DOI

Restricted until

Downloads