El Zein, Ahmed; Rendell, Alistair
The CUDA model for graphics processing units (GPUs) presents the programmer with a plethora of different programming options. These includes different memory types, different memory access methods and different data types. Identifying which options to use and when is a non-trivial exercise. This paper explores the effect of these different options on the performance of a routine that evaluates sparse matrix-vector products (SpMV) across three different generations of NVIDIA GPU hardware. A...[Show more]
Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.