Jaros, Jiri; Treeby, Bradley; Rendell, Alistair
This paper outlines our effort to migrate a compute intensive application of ultrasound propagation being developed in Matlab to a cluster computer where each node has seven GPUs. Our goal is to perform realistic simulations in hours and minutes instead of weeks and days. In order to reach this goal we investigate architecture characteristics of the target system focusing on the PCI-Express subsystem and new features proposed in CUDA version 4.0, especially simultaneous host to device, device...[Show more]
Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.