Domain Adapting Deep Reinforcement Learning for Real-world Speech Emotion Recognition

T Rajapakshe, R Rana, S Khalifa… - arXiv preprint arXiv …, 2022 - arxiv.org
Computers can understand and then engage with people in an emotionally intelligent way
thanks to speech-emotion recognition (SER). However, the performance of SER in cross …

A novel policy for pre-trained deep reinforcement learning for speech emotion recognition

T Rajapakshe, R Rana, S Khalifa, J Liu… - Proceedings of the 2022 …, 2022 - dl.acm.org
Deep Reinforcement Learning (deep RL) has gained tremendous success in gaming but it
has rarely been explored for Speech Emotion Recognition (SER). In the RL literature, policy …

[PDF][PDF] Real-Time End-to-End Speech Emotion Recognition with Cross-Domain Adaptation. Big Data Cogn. Comput. 2022, 6, 79

K Wongpatikaseree, S Singkul, N Hnoohom… - 2022 - academia.edu
Language resources are the main factor in speech-emotion-recognition (SER)-based deep
learning models. Thai is a low-resource language that has a smaller data size than high …

Enhancing Speech Emotion Recognition Through Deep Learning and Handcrafted Feature Fusion

F GÜNEŞ ERİŞ, E AKBAL - Available at SSRN 4753487 - papers.ssrn.com
In this paper we introduce an innovative investigation in Speech Emotion Recognition
(SER), emphasizing combining deep learning-based features and handcrafted audio …

Active Learning Based Fine-Tuning Framework for Speech Emotion Recognition

D Li, Y Wang, K Funakoshi, M Okumura - arXiv preprint arXiv:2310.00283, 2023 - arxiv.org
Speech emotion recognition (SER) has drawn increasing attention for its applications in
human-machine interaction. However, existing SER methods ignore the information gap …

[HTML][HTML] Real-time end-to-end speech emotion recognition with cross-domain adaptation

K Wongpatikaseree, S Singkul, N Hnoohom… - Big Data and Cognitive …, 2022 - mdpi.com
Language resources are the main factor in speech-emotion-recognition (SER)-based deep
learning models. Thai is a low-resource language that has a smaller data size than high …

A cross-corpus study on speech emotion recognition

R Milner, MA Jalal, RWM Ng… - 2019 IEEE Automatic …, 2019 - ieeexplore.ieee.org
For speech emotion datasets, it has been difficult to acquire large quantities of reliable data
and acted emotions may be over the top compared to less expressive emotions displayed in …

Self supervised adversarial domain adaptation for cross-corpus and cross-language speech emotion recognition

S Latif, R Rana, S Khalifa, R Jurdak… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Despite the recent advancement in speech emotion recognition (SER) within a single corpus
setting, the performance of these SER systems degrades significantly for cross-corpus and …

Domain-adversarial autoencoder with attention based feature level fusion for speech emotion recognition

Y Gao, JX Liu, L Wang, J Dang - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org
Over the past two decades, although speech emotion recognition (SER) has garnered
considerable attention, the problem of insufficient training data has been unresolved. A …

Cta-rnn: Channel and temporal-wise attention rnn leveraging pre-trained asr embeddings for speech emotion recognition

C Chen, P Zhang - arXiv preprint arXiv:2203.17023, 2022 - arxiv.org
Previous research has looked into ways to improve speech emotion recognition (SER) by
utilizing both acoustic and linguistic cues of speech. However, the potential association …