Title
Automatic Generation of High-Performance FFT Kernels on Arm and X86 CPUs
Abstract
This article presents AutoFFT, a template-based code generation framework that can automatically generate high-performance FFT kernels for all natural-number radices. AutoFFT is based on the Cooley-Tukey FFT algorithm, which exploits the symmetric and periodic properties of the DFT matrix, as the outer parallelization framework. Because butterflies are the core operations of the Cooley-Tukey algor...
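As a rough illustration of the idea summarized in the abstract, the sketch below shows a recursive decimation-in-time Cooley-Tukey FFT in which every stage is built from one generic radix-r butterfly. It is not AutoFFT's generated code: the function names (`smallest_factor`, `butterfly`, `fft`) are hypothetical, the butterfly is written as a plain r-point DFT rather than an optimized kernel, and choosing the smallest prime factor as the radix is an assumption made only for simplicity.

```python
import cmath


def smallest_factor(n):
    """Return the smallest prime factor of n (n itself when n is prime)."""
    f = 2
    while f * f <= n:
        if n % f == 0:
            return f
        f += 1
    return n


def butterfly(inputs):
    """Radix-r butterfly, written here as a direct r-point DFT (unoptimized)."""
    r = len(inputs)
    return [
        sum(inputs[j] * cmath.exp(-2j * cmath.pi * j * k / r) for j in range(r))
        for k in range(r)
    ]


def fft(x):
    """Recursive decimation-in-time Cooley-Tukey FFT for any length n >= 1."""
    n = len(x)
    if n <= 1:
        return list(x)
    r = smallest_factor(n)  # radix for this stage (illustrative choice)
    m = n // r
    # Split the input into r interleaved subsequences of length m.
    subs = [fft(x[j::r]) for j in range(r)]
    out = [0j] * n
    for k in range(m):
        # Apply twiddle factors exp(-2*pi*i*j*k/n) to the sub-transform outputs.
        twiddled = [
            subs[j][k] * cmath.exp(-2j * cmath.pi * j * k / n) for j in range(r)
        ]
        # One radix-r butterfly produces outputs k, k + m, ..., k + (r - 1) * m.
        for q, value in enumerate(butterfly(twiddled)):
            out[k + q * m] = value
    return out
```

Calling `fft` on a list of 12 complex samples, for example, runs stages of radix 2, 2, and 3; a prime length falls back to a single butterfly of that radix, which is where an optimized kernel would matter most.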
Year
2020
DOI
10.1109/TPDS.2020.2977629
Venue
IEEE Transactions on Parallel and Distributed Systems
Keywords
Kernel, Libraries, Discrete Fourier transforms, Computer architecture, Optimization, Symmetric matrices, Hardware
DocType
Journal
Volume
31
Issue
8
ISSN
1045-9219
Citations
2
PageRank
0.40
References
0
Authors
6
Name            Order  Citations  PageRank
Zhihao Li       1      17         5.10
Haipeng Jia     2      22         2.20
Yunquan Zhang   3      327        43.92
Tun Chen        4      2          0.74
Liang Yuan      5      45         12.85
Richard Vuduc   6      1343       100.74