Using recurrences in time and frequency within U-net architecture for speech enhancement- 学术资源搜索

文章

学术资源搜索

Using recurrences in time and frequency within U-net architecture for speech enhancement

T Grzywalski, S Drgas - ICASSP 2019-2019 IEEE International …, 2019 - ieeexplore.ieee.org

T Grzywalski, S Drgas

ICASSP 2019-2019 IEEE International Conference on Acoustics …, 2019•ieeexplore.ieee.org

When designing fully-convolutional neural network, there is a trade-off between receptive field size, number of parameters and spatial resolution of features in deeper layers of the network. In this work we present a novel network design based on combination of many convolutional and recurrent layers that solves these dilemmas. We compare our solution with U-nets based models known from the literature and other baseline models on speech enhancement task. We test our solution on TIMIT speech utterances combined with noise segments extracted from NOISEX-92 database and show clear advantage of proposed solution in terms of SDR (signal-to-distortion ratio), SIR (signal-to-interference ratio) and STOI (spectro-temporal objective intelligibility) metrics compared to the current state-of-the-art.

ieeexplore.ieee.org

展开收起

被引用次数：22 相关文章所有 5 个版本

以上显示的是最相近的搜索结果。查看全部搜索结果

高级搜索

QQ 群

Using recurrences in time and frequency within U-net architecture for speech enhancement

引用