W Jang, D Lim, H Park - arXiv preprint arXiv:2305.10823, 2023 - arxiv.org
This paper presents FastFit, a novel neural vocoder architecture that replaces the U-Net
encoder with multiple short-time Fourier transforms (STFTs) to achieve faster generation …