作者
Shoba Sivapatham, Pankaj Goel, Srikanth Burra, Pitikhate Sooraksa, Asutosh Kar
发表日期
2022/11/23
研讨会论文
2022 20th International Conference on ICT and Knowledge Engineering (ICT&KE)
页码范围
1-6
出版商
IEEE
简介
Monaural speech separation has remained a very challenging problem for a longtime which can be addressed using a supervised learning approach that uses features of the noisy input to predict an accurate time-frequency mask. Effective acoustic phonetic features can help in the accurate mask prediction at low Signal-to-Noise Ratios (SNRs). Individual features capture specific attributes of the audio signal; therefore, it’s essential to employ a set of features. This work examines different combinations of monaural features as input and ideal ratio mask a straining target to the DNN model. Feature combination sets are constructed by examining single features and then combining the most relevant ones. The results are evaluated for different feature combinations under non-stationary noises at low SNR levels. The feature performance is evaluated by using intelligibility and quality measures. A combination of two …
引用总数
学术搜索中的文章
S Sivapatham, P Goel, S Burra, P Sooraksa, A Kar - 2022 20th International Conference on ICT and …, 2022