Attention guided 3D CNN-LSTM model for accurate speech based emotion recognition

O Atila, A Şengür - Applied Acoustics, 2021 - Elsevier
In this paper, a novel approach, which is based on attention guided 3D convolutional neural
networks (CNN)-long short-term memory (LSTM) model, is proposed for speech based …

Speech emotion recognition UsingConvolutional neural network and long-short TermMemory

R Dangol, A Alsadoon, PWC Prasad, I Seher… - Multimedia Tools and …, 2020 - Springer
Human-Robot interactions involve human intentions and human emotion. After the
evolvement of positive psychology, the psychological research has a tremendous …

1D-CNN: Speech Emotion Recognition System Using a Stacked Network with Dilated CNN Features.

S Kwon - Computers, Materials & Continua, 2021 - search.ebscohost.com
Emotion recognition from speech data is an active and emerging area of research that plays
an important role in numerous applications, such as robotics, virtual reality, behavior …

Speech emotion recognition using deep 1D & 2D CNN LSTM networks

J Zhao, X Mao, L Chen - Biomedical signal processing and control, 2019 - Elsevier
We aimed at learning deep emotion features to recognize speech emotion. Two
convolutional neural network and long short-term memory (CNN LSTM) networks, one 1D …

An ensemble 1D-CNN-LSTM-GRU model with data augmentation for speech emotion recognition

MR Ahmed, S Islam, AKMM Islam… - Expert Systems with …, 2023 - Elsevier
Precise recognition of emotion from speech signals aids in enhancing human–computer
interaction (HCI). The performance of a speech emotion recognition (SER) system depends …

Speech emotion recognition using 3d convolutions and attention-based sliding recurrent networks with auditory front-ends

Z Peng, X Li, Z Zhu, M Unoki, J Dang, M Akagi - IEEE Access, 2020 - ieeexplore.ieee.org
Emotion information from speech can effectively help robots understand speaker's intentions
in natural human-robot interaction. The human auditory system can easily track temporal …

CLSTM: Deep feature-based speech emotion recognition using the hierarchical ConvLSTM network

Mustaqeem, S Kwon - Mathematics, 2020 - mdpi.com
Artificial intelligence, deep learning, and machine learning are dominant sources to use in
order to make a system smarter. Nowadays, the smart speech emotion recognition (SER) …

MLT-DNet: Speech emotion recognition using 1D dilated CNN based on multi-learning trick approach

S Kwon - Expert Systems with Applications, 2021 - Elsevier
Speech is the most dominant source of communication among humans, and it is an efficient
way for human–computer interaction (HCI) to exchange information. Nowadays, speech …

Speech emotion recognition based on multiple acoustic features and deep convolutional neural network

K Bhangale, M Kothandaraman - Electronics, 2023 - mdpi.com
Speech emotion recognition (SER) plays a vital role in human–machine interaction. A large
number of SER schemes have been anticipated over the last decade. However, the …

Attention-LSTM-attention model for speech emotion recognition and analysis of IEMOCAP database

Y Yu, YJ Kim - Electronics, 2020 - mdpi.com
We propose a speech-emotion recognition (SER) model with an “attention-long Long Short-
Term Memory (LSTM)-attention” component to combine IS09, a commonly used feature for …