Improving speech system performance in noisy environments remains a challenging task, and speech enhancement (SE) is one of the effective techniques to solve the problem …
This paper presents an unsupervised segment-based method for robust voice activity detection (rVAD). The method consists of two passes of denoising followed by a voice …
In this paper, we study aspects of single microphone speech enhancement (SE) based on deep neural networks (DNNs). Specifically, we explore the generalizability capabilities of …
X Qin, H Bu, M Li - ICASSP 2020-2020 IEEE International …, 2020 - ieeexplore.ieee.org
This paper presents a far-field text-dependent speaker verification database named HI-MIA. We aim to meet the data requirement for far-field microphone array based speaker …
D Cai, W Cai, M Li - ICASSP 2020-2020 IEEE International …, 2020 - ieeexplore.ieee.org
Despite the significant improvements in speaker recognition enabled by deep neural networks, unsatisfactory performance persists under noisy environments. In this paper, we …
In this paper we propose a Deep Neural Network (D NN) based Speech Enhancement (SE) system that is designed to maximize an approximation of the Short-Time Objective …
X Qin, D Cai, M Li - Interspeech, 2019 - isca-archive.org
In this paper, we focus on the far-field end-to-end textdependent speaker verification task with a small-scale far-field text dependent dataset and a large scale close-talking text …
When speaking in presence of background noise, humans reflexively change their way of speaking in order to improve the intelligibility of their speech. This reflex is known as …
HP Liu, Y Tsao, CS Fuh - Speech Communication, 2018 - Elsevier
Bone-conduction microphones (BCMs) capture speech signals based on the vibrations of the speaker's skull and exhibit better noise-resistance capabilities than normal air …