[PDF][PDF] Towards robust speech emotion recognition using deep residual networks for speech enhancement

A Triantafyllopoulos, G Keren, J Wagner, I Steiner… - 2019 - opus.bibliothek.uni-augsburg.de
The use of deep learning (DL) architectures for speech enhancement has recently improved
the robustness of voice applications under diverse noise conditions. These improvements …

Subjective evaluation of a noise-reduced training target for deep neural network-based speech enhancement

FB Gelderblom, TV Tronstad… - IEEE/ACM Transactions …, 2018 - ieeexplore.ieee.org
Speech enhancement systems aim to improve the quality and intelligibility of noisy speech.
In this study, we compare two speech enhancement systems based on deep neural …

Effect of noise suppression losses on speech distortion and ASR performance

S Braun, H Gamper - ICASSP 2022-2022 IEEE International …, 2022 - ieeexplore.ieee.org
Deep learning based speech enhancement has made rapid development towards
improving quality, while models are becoming more compact and usable for real-time on-the …

[PDF][PDF] Dynamic noise aware training for speech enhancement based on deep neural networks.

Y Xu, J Du, LR Dai, CH Lee - Interspeech, 2014 - isca-archive.org
We propose three algorithms to address the mismatch problem in deep neural network
(DNN) based speech enhancement. First, we investigate noise aware training by …

Selective acoustic feature enhancement for speech emotion recognition with noisy speech

SG Leem, D Fulford, JP Onnela… - … /ACM Transactions on …, 2023 - ieeexplore.ieee.org
A speech emotion recognition (SER) system deployed on a real-world application can
encounter speech contaminated with unconstrained background noise. To deal with this …

Data augmentation and loss normalization for deep noise suppression

S Braun, I Tashev - International Conference on Speech and Computer, 2020 - Springer
Speech enhancement using neural networks is recently receiving large attention in research
and being integrated in commercial devices and applications. In this work, we investigate …

[PDF][PDF] Separation of Emotional and Reconstruction Embeddings on Ladder Network to Improve Speech Emotion Recognition Robustness in Noisy Conditions}}

SG Leem, D Fulford, JP Onnela, D Gard… - Proc. Interspeech …, 2021 - drive.google.com
When speech emotion recognition (SER) is applied in an actual application, the system
should be able to cope with audio acquired in a noisy, unconstrained environment. Most …

New Era for Robust Speech Recognition

S Watanabe, M Delcroix, F Metze… - Cham, Switzerland …, 2017 - Springer
The field of automatic speech recognition has evolved greatly since the introduction of deep
learning, which began only about 5 years ago. In particular, as more and more products …

Speech enhancement based on teacher–student deep learning using improved speech presence probability for noise-robust speech recognition

YH Tu, J Du, CH Lee - IEEE/ACM Transactions on Audio …, 2019 - ieeexplore.ieee.org
In this paper, we propose a novel teacher-student learning framework for the preprocessing
of a speech recognizer, leveraging the online noise tracking capabilities of improved minima …

A regression approach to speech enhancement based on deep neural networks

Y Xu, J Du, LR Dai, CH Lee - IEEE/ACM transactions on audio …, 2014 - ieeexplore.ieee.org
In contrast to the conventional minimum mean square error (MMSE)-based noise reduction
techniques, we propose a supervised method to enhance speech by means of finding a …