FFTW is a comprehensive collection of fast C routines for computing the Discrete Fourier Transform (DFT) and various special cases thereof. It is an open-source implementation of the Fast Fourier transform algorithm. It can compute transforms of real and complex-values arrays of arbitrary size and dimension. AMD FFTW includes selective kernels and routines optimized for the AMD EPYCTM processor family.
Highlights of AMD FFTW 2.2
- Improved performance of in-place MPI FFT by employing a faster in-place MPI transpose routine.
- Improved performance of copy function cpy2d_pair used for rank-0 transform and buffering plans.
- Added DFT kernels of higher radix sizes for q, t and n series FFT codelets.
The package containing AMD FFTW Library binaries which includes optimizations for the AMD EPYCTM processor family and documentation are available in the Downloads section below.
Source code is available on GitHub https://github.com/amd/amd-fftw.
Refer here for older versions.
Note: Below binary packages are generated and validated with following configuration. If user needs to use with other configuration, we recommend to build the required library from source as per instructions provided in user guide.
OS ( 19.04) , GCC ( 9.1), AOCC ( 2.2), OpenMPI(3.1.1), OpenMP(4.5), Glibc(2.29), Python(2.7)