作者
Shoba Sivapatham, Rajavel Ramadoss, Asutosh Kar, Banshidhar Majhi
发表日期
2020/3/1
期刊
Applied Acoustics
卷号
160
页码范围
107140
出版商
Elsevier
简介
In this research work, we propose the model based on the Genetic Algorithm (GA) and Deep Neural Network (DNN) to enhance the quality and intelligibility of the noisy speech. In this proposed model, the Voiced Speech (VS) T-F mask is computed using correlogram, frame energy and cross-channel correlogram and Unvoiced Speech (UVS) T-F mask is computed using speech onset/offset. The T-F mask obtained using speech onset and offset represents both voiced and unvoiced segment of the noisy speech signal. The UVS T-F mask is obtained by subtracting the VS from the T-F mask obtained earlier using speech onset/offset. Next, the GA is used to find the optimum weight to combine the T-F mask of VS and UVS to improve speech quality and intelligibility. The weight obtained using GA may not be an optimum one for all sets of speech and noise. This research work focuses on this issue and proposes a DNN …
引用总数
20202021202220231374
学术搜索中的文章
S Sivapatham, R Ramadoss, A Kar, B Majhi - Applied Acoustics, 2020