作者
Tomasz Grzywalski, Szymon Drgas
发表日期
2020/9/23
研讨会论文
2020 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)
页码范围
157-162
出版商
IEEE
简介
In recent years speech enhancement has shown great progress that was driven mostly by using bigger and more sophisticated neural networks. In this work we investigate the possibility to use state-of-the-art speech enhancement neural network and modify it in such a way that will allow it to process the noisy signal multiple times. By doing so we expect, that with each iteration the enhancement will improve. Experiments conducted using the WSJ0, Noisex-92 and DCASE datasets show, that U-net with gated dilated convolutions is able to achieve better SI-SDR, STOI and PESQ after processing the noisy signal two times, with the improvement being consistent across all SNRs and tested noise types. This is achieved without any additional trainable parameters and no additional memory requirements compared to the baseline model.
引用总数
学术搜索中的文章
T Grzywalski, S Drgas - 2020 Signal Processing: Algorithms, Architectures …, 2020