Implementation of 3D FFTs Across Multiple GPUs in Shared Memory Environments
In this paper, a novel implementation of the distributed 3D Fast Fourier Transform (FFT) on a multi-GPU platform using CUDA is presented. The 3D FFT is the core of many simulation methods, thus its fast calculation is critical. The main bottleneck of the
|Collections||ANU Research Publications|
|01_Nandapalan_Implementation_of_3D_FFTs_2012.pdf||685.49 kB||Adobe PDF||Request a copy|
Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.