Sentiment analysis: Comprehensive reviews, recent advances, and open challenges

Q Lu, X Sun, Y Long, Z Gao, J Feng… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Sentiment analysis (SA) aims to understand the attitudes and views of opinion holders with
computers. Previous studies have achieved significant breakthroughs and extensive …

Dawn of the transformer era in speech emotion recognition: closing the valence gap

J Wagner, A Triantafyllopoulos… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
Recent advances in transformer-based architectures have shown promise in several
machine learning tasks. In the audio domain, such architectures have been successfully …

Emotion recognition from speech using wav2vec 2.0 embeddings

L Pepino, P Riera, L Ferrer - arXiv preprint arXiv:2104.03502, 2021 - arxiv.org
Emotion recognition datasets are relatively small, making the use of the more sophisticated
deep learning approaches challenging. In this work, we propose a transfer learning method …

Adversarial alignment and graph fusion via information bottleneck for multimodal emotion recognition in conversations

Y Shou, T Meng, W Ai, F Zhang, N Yin, K Li - Information Fusion, 2024 - Elsevier
With the rapid development of social media and human–computer interaction, multimodal
emotion recognition in conversations (MERC) tasks have begun to receive widespread …

Learning multi-scale features for speech emotion recognition with connection attention mechanism

Z Chen, J Li, H Liu, X Wang, H Wang… - Expert Systems with …, 2023 - Elsevier
Speech emotion recognition (SER) has become a crucial topic in the field of human–
computer interactions. Feature representation plays an important role in SER, but there are …

Slue: New benchmark tasks for spoken language understanding evaluation on natural speech

S Shon, A Pasad, F Wu, P Brusco… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
Progress in speech processing has been facilitated by shared datasets and benchmarks.
Historically these have focused on automatic speech recognition (ASR), speaker …

Adversarial representation with intra-modal and inter-modal graph contrastive learning for multimodal emotion recognition

Y Shou, T Meng, W Ai, N Yin, K Li - arXiv preprint arXiv:2312.16778, 2023 - arxiv.org
With the release of increasing open-source emotion recognition datasets on social media
platforms and the rapid development of computing resources, multimodal emotion …

Representation learning through cross-modal conditional teacher-student training for speech emotion recognition

S Srinivasan, Z Huang… - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org
Generic pre-trained speech and text representations promise to reduce the need for large
labeled datasets on specific speech and language tasks. However, it is not clear how to …

Probing speech emotion recognition transformers for linguistic knowledge

A Triantafyllopoulos, J Wagner, H Wierstorf… - arXiv preprint arXiv …, 2022 - arxiv.org
Large, pre-trained neural networks consisting of self-attention layers (transformers) have
recently achieved state-of-the-art results on several speech emotion recognition (SER) …

Alzheimer disease recognition using speech-based embeddings from pre-trained models

ML Gauder, LD Pepino, L Ferrer, P Riera - 2021 - ri.conicet.gov.ar
This paper describes our submission to the ADreSSo Challenge, which focuses on the
problem of automatic recognition of Alzheimer's Disease (AD) from speech. The audio …