作者
Devi Sowjanya, Shoba Sivapatham, Asutosh Kar, Vladimir Mladenovic
发表日期
2022/7
期刊
Circuits, Systems, and Signal Processing
卷号
41
期号
7
页码范围
4117-4135
出版商
Springer US
简介
The most commonly used training target is masking-based approach which maps noisy speech to the time–frequency (T–F) unit and has a remarkable impact on the performance in the supervised learning algorithms. Traditional T–F masks like ideal ratio mask (IRM) demonstrate a strong performance but are limited to only the magnitude domain in enhancement. Though bounded IRM with phase constraint (BIRMP) includes phase difference but doesn’t exploit channel correlation, the proposed ratio mask (pRM) considers channel correlation but is computed only in the magnitude domain. This work proposes a new mask, i.e., phase correlation ideal ratio mask (PCIRM), which includes both inter-channel correlation and phase difference between the noisy speech (), noise (N) and clean speech (). Considering these factors increases the percentage of and readily decreases the percentage of unwanted …
引用总数
学术搜索中的文章
D Sowjanya, S Sivapatham, A Kar, V Mladenovic - Circuits, Systems, and Signal Processing, 2022