Multi-microphone complex spectral mapping for speech dereverberation

ZQ Wang, DL Wang - ICASSP 2020-2020 IEEE International …, 2020 - ieeexplore.ieee.org
This study proposes a multi-microphone complex spectral mapping approach for speech
dereverberation on a fixed array geometry. In the proposed approach, a deep neural …

Evaluation and comparison of late reverberation power spectral density estimators

S Braun, A Kuklasiński, O Schwartz… - … on Audio, Speech …, 2018 - ieeexplore.ieee.org
Reduction of late reverberation can be achieved using spatio-spectral filters, such as the
multichannel Wiener filter. To compute this filter, an estimate of the late reverberation power …

Non-intrusive speech quality prediction using modulation energies and LSTM-network

B Cauchi, K Siedenburg, JF Santos… - … on Audio, Speech …, 2019 - ieeexplore.ieee.org
Many signal processing algorithms have been proposed to improve the quality of speech
recorded in the presence of noise and reverberation. Perceptual measures, ie, listening …

Neural network based time-frequency masking and steering vector estimation for two-channel MVDR beamforming

Y Liu, A Ganguly, K Kamath… - 2018 IEEE International …, 2018 - ieeexplore.ieee.org
We present a neural network based approach to two-channel beamforming. First, single-and
cross-channel spectral features are extracted to form a feature map for each utterance. A …

Single-channel online enhancement of speech corrupted by reverberation and noise

CSJ Doire, M Brookes, PA Naylor… - … on Audio, Speech …, 2016 - ieeexplore.ieee.org
This paper proposes an online single-channel speech enhancement method designed to
improve the quality of speech degraded by reverberation and noise. Based on an …

Att-TasNet: Attending to Encodings in Time-Domain Audio Speech Separation of Noisy, Reverberant Speech Mixtures

W Ravenscroft, S Goetze, T Hain - Frontiers in Signal Processing, 2022 - frontiersin.org
Separation of speech mixtures in noisy and reverberant environments remains a
challenging task for state-of-the-art speech separation systems. Time-domain audio speech …

Task-specific speech enhancement and data augmentation for improved multimodal emotion recognition under noisy conditions

S Kshirsagar, A Pendyala, TH Falk - Frontiers in Computer Science, 2023 - frontiersin.org
Automatic emotion recognition (AER) systems are burgeoning and systems based on either
audio, video, text, or physiological signals have emerged. Multimodal systems, in turn, have …

Spectral masking and filtering

T Gerkmann, E Vincent - Audio source separation and speech …, 2018 - Wiley Online Library
In this chapter, we consider spectral masking filters for interference reduction in case of a
single‐channel input. The considered techniques are thus relevant when only one …

Speech dereverberation with context-aware recurrent neural networks

JF Santos, TH Falk - IEEE/ACM Transactions on Audio, Speech …, 2018 - ieeexplore.ieee.org
In this paper, we propose a model to perform speech dereverberation by estimating its
spectral magnitude from the reverberant counterpart. Our models are capable of extracting …

[PDF][PDF] Improving emotional TTS with an emotion intensity input from unsupervised extraction

B Schnell, PN Garner - Proc. 11th ISCA Speech Synth …, 2021 - publications.idiap.ch
We aim to provide controls for emotion in synthetic speech. Many emotions are not
displayed continuously in an otherwise emotional utterance; rather, the intensity varies with …