查看文章

Speech enhancement by iterating forward pass through U-net

作者

Tomasz Grzywalski, Szymon Drgas

发表日期

2020/9/23

研讨会论文

2020 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)

页码范围

157-162

出版商

IEEE

简介

In recent years speech enhancement has shown great progress that was driven mostly by using bigger and more sophisticated neural networks. In this work we investigate the possibility to use state-of-the-art speech enhancement neural network and modify it in such a way that will allow it to process the noisy signal multiple times. By doing so we expect, that with each iteration the enhancement will improve. Experiments conducted using the WSJ0, Noisex-92 and DCASE datasets show, that U-net with gated dilated convolutions is able to achieve better SI-SDR, STOI and PESQ after processing the noisy signal two times, with the improvement being consistent across all SNRs and tested noise types. This is achieved without any additional trainable parameters and no additional memory requirements compared to the baseline model.

引用总数

被引用次数：1

20221

学术搜索中的文章

Speech enhancement by iterating forward pass through U-net

T Grzywalski, S Drgas - 2020 Signal Processing: Algorithms, Architectures …, 2020

被引用次数：1 相关文章