Neural spectrospatial filtering

K Tan, ZQ Wang, DL Wang - IEEE/ACM Transactions on Audio …, 2022 - ieeexplore.ieee.org
As the most widely-used spatial filtering approach for multi-channel speech separation,
beamforming extracts the target speech signal arriving from a specific direction. An …

Advances in phase-aware signal processing in speech communication

P Mowlaee, R Saeidi, Y Stylianou - Speech communication, 2016 - Elsevier
During the past three decades, the issue of processing spectral phase has been largely
neglected in speech applications. There is no doubt that the interest of speech processing …

Learning affective representations based on magnitude and dynamic relative phase information for speech emotion recognition

L Guo, L Wang, J Dang, ES Chng, S Nakagawa - Speech Communication, 2022 - Elsevier
The complete acoustic features include magnitude and phase information. However,
traditional speech emotion recognition methods only focus on the magnitude information …

A review on Gujarati language based automatic speech recognition (ASR) systems

M Dua, B Bhagat, S Dua, N Chakravarty - International Journal of Speech …, 2024 - Springer
Automatic speech recognition (ASR) plays a crucial role in facilitating natural and efficient
human–computer interaction. This paper offers a comprehensive review of ASR systems …

Harmonic phase estimation in single-channel speech enhancement using phase decomposition and SNR information

P Mowlaee, J Kulmer - IEEE/ACM Transactions on Audio …, 2015 - ieeexplore.ieee.org
In conventional single-channel speech enhancement, typically the noisy spectral amplitude
is modified while the noisy phase is used to reconstruct the enhanced signal. Several recent …

Robustness to noise for speech emotion classification using CNNs and attention mechanisms

L Wijayasingha, JA Stankovic - Smart Health, 2021 - Elsevier
Abstract Speech Emotion Recognition (SER) is an important task since emotion is a primary
dimension in human communication and health. It has a wide variety of practical …

[PDF][PDF] Speech Emotion Recognition by Combining Amplitude and Phase Information Using Convolutional Neural Network.

L Guo, L Wang, J Dang, L Zhang, H Guan, X Li - INTERSPEECH, 2018 - isca-archive.org
Previous studies of speech emotion recognition utilize convolutional neural network (CNN)
directly on amplitude spectrogram to extract features. CNN combines with bidirectional long …

Phase estimation in single channel speech enhancement using phase decomposition

J Kulmer, P Mowlaee - IEEE signal processing letters, 2014 - ieeexplore.ieee.org
Conventional speech enhancement methods typically utilize the noisy phase spectrum for
signal reconstruction. This letter presents a novel method to estimate the clean speech …

Experimental investigation on STFT phase representations for deep learning-based dysarthric speech detection

P Janbakhshi, I Kodrasi - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org
Mainstream deep learning-based dysarthric speech detection approaches typically rely on
processing the magnitude spectrum of the short-time Fourier transform of input signals, while …

Phase-based cepstral features for automatic speech emotion recognition of low resource Indian languages

C Chakraborty, TK Dash*, G Panda… - Transactions on Asian and …, 2022 - dl.acm.org
Automatic speech emotion recognition (SER) is a crucial task in communication-based
systems, where feature extraction plays an important role. Recently, a lot of SER models …