An engineering view on emotions and speech: From analysis and predictive models to responsible human-centered applications

CC Lee, T Chaspari, EM Provost… - Proceedings of the …, 2023 - ieeexplore.ieee.org
The substantial growth of Internet-of-Things technology and the ubiquity of smartphone
devices has increased the public and industry focus on speech emotion recognition (SER) …

A Review of the Advancement in Speech Emotion Recognition for Indo‐Aryan and Dravidian Languages

ST Alam Monisha, S Sultana - Advances in Human‐Computer …, 2022 - Wiley Online Library
Speech emotion recognition (SER) has grown to be one of the most trending research topics
in computational linguistics in the last two decades. Speech being the primary …

Multi-lingual multi-task speech emotion recognition using wav2vec 2.0

M Sharma - ICASSP 2022-2022 IEEE International Conference …, 2022 - ieeexplore.ieee.org
Speech Emotion Recognition (SER) has several use cases for Digital Entertainment Content
(DEC) in Over-the-top (OTT) services, emotive Text-to-Speech (TTS) engines and voice …

Bangla speech emotion recognition and cross-lingual study using deep CNN and BLSTM networks

S Sultana, MZ Iqbal, MR Selim, MM Rashid… - IEEE …, 2021 - ieeexplore.ieee.org
In this study, we have presented a deep learning-based implementation for speech emotion
recognition (SER). The system combines a deep convolutional neural network (DCNN) and …

emotion2vec: Self-supervised pre-training for speech emotion representation

Z Ma, Z Zheng, J Ye, J Li, Z Gao, S Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org
We propose emotion2vec, a universal speech emotion representation model. emotion2vec
is pre-trained on open-source unlabeled emotion data through self-supervised online …

Smart reception: An artificial intelligence driven bangla language based receptionist system employing speech, speaker, and face recognition for automating reception …

KA Mamun, RA Nabid, SI Pranto, SM Lamim… - … Applications of Artificial …, 2024 - Elsevier
In recent times, service robots (SR) have become widely accepted in a variety of fields as an
alternative to traditional reception methods. Artificial Intelligence (AI) driven systems are …

Speech Emotion Recognition and Deep Learning: an Extensive Validation using Convolutional Neural Networks

FA Dal Rì, FC Ciardi, N Conci - IEEE Access, 2023 - ieeexplore.ieee.org
The domain of Speech Emotion Recognition (SER) has experienced a tremendous
revolution due to the outbreak of deep learning, which has contributed, as in many other …

[HTML][HTML] BanglaSER: A speech emotion recognition dataset for the Bangla language

RK Das, N Islam, MR Ahmed, S Islam, S Shatabda… - Data in Brief, 2022 - Elsevier
The speech emotion recognition system determines a speaker's emotional state by
analyzing his/her speech audio signal. It is an essential at the same time a challenging task …

An improved mser using grid search based pca and ensemble voting technique

A Tripathi, P Rani - Multimedia Tools and Applications, 2024 - Springer
Recognizing speech emotions is indeed a crucial aspect of human–computer interaction.
However, developing a model that can accurately process multiple languages is one of the …

SER Evals: In-domain and Out-of-domain benchmarking for speech emotion recognition

M Osman, DZ Kaplan, T Nadeem - arXiv preprint arXiv:2408.07851, 2024 - arxiv.org
Speech emotion recognition (SER) has made significant strides with the advent of powerful
self-supervised learning (SSL) models. However, the generalization of these models to …