作者
Shoba Sivapatham, Rajavel Ramadoss
发表日期
2018/9
期刊
IET Signal Processing
卷号
12
期号
7
页码范围
896-906
出版商
The Institution of Engineering and Technology
简介
This research work proposes an image analysis‐based algorithm to enhance the time–frequency (TF) mask obtained in the initial segmentation of CASA‐based monaural speech separation system to improve speech quality and intelligibility. It consists of labelling the initial segmentation mask, boundary extraction, active pixel detection and eliminating the non‐active pixels related to noise. In labelling, the TF mask obtained is labelled as periodicity pixel (P) matrix and non‐periodicity pixel (NP) matrix. Next speech boundaries are created by connecting all the possible nearby P and NP matrix. Some speech boundary may include noisy TF units as holes; these holes are treated using the proposed algorithm. The proposed algorithm is evaluated with the quality and intelligibility measures such as signal to noise ratio (SNR), perceptual evaluation of speech quality, , , coherence speech intelligibility index (CSII …
引用总数
20192020202120224213