作者
Shoba Sivapatham, Asutosh Kar, Mads Græsbøll Christensen
发表日期
2022/6/15
期刊
Applied Acoustics
卷号
194
页码范围
108784
出版商
Elsevier
简介
Speech signal enhancement achieves high-level performance in recent years using deep learning techniques. However, the deep learning technique in the speech enhancement algorithm degrades the performance of speech, particularly for unseen noises, unseen speakers and moreover, deep learning models are limited to the small number of speakers. Hence, we propose a Gammatone filterbank (GTFB) – simple deep neural network (SDNN) based speech enhancement algorithm to improve the quality of speech for three different unseen conditions. The use of GTFB gives a finer resolution in low-frequency regions of speech, and the SDNN model extracts a noisy GTFB frame as input and maps it to a clean speech GTFB frame. The experimental results are measured objectively using signal-noise-ratio, perceptual evaluation of speech quality, short time objective intelligibility, and subjectively using mean …
引用总数