查看文章

Phase sensitive masking-based single channel speech enhancement using conditional generative adversarial network

作者

Sidheswar Routray, Qirong Mao

发表日期

2022/1/1

期刊

Computer Speech & Language

卷号

页码范围

101270

出版商

Academic Press

简介

We propose PSMGAN, an efficient phase sensitive masking-based single-channel speech enhancement technique using a conditional generative adversarial network (cGAN). The time–frequency (T-F) masking-based speech enhancement approaches through deep neural networks (DNNs) have shown large speech intelligibility improvements. However, these approaches fail to achieve better enhancement results at low signal-to-noise ratio (SNR) conditions since they ignore the phase information during reconstruction. Alternatively, GANs have been introduced effectively for speech enhancement and achieved improved performance due to the adversarial training. Motivated by the recent success of GAN, we introduce the phase sensitive masking (PSM) in a cGAN framework for speech enhancement task. The reason for choosing a conditional generative model is that the data generation process can be …

引用总数

被引用次数：23

20222023202410 8 5

学术搜索中的文章

Phase sensitive masking-based single channel speech enhancement using conditional generative adversarial network

S Routray, Q Mao - Computer Speech & Language, 2022

被引用次数：23 相关文章所有 2 个版本