Contrastive unsupervised learning for speech emotion recognition

Q Lu, X Sun, Y Long, Z Gao, J Feng… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

Sentiment analysis (SA) aims to understand the attitudes and views of opinion holders with
computers. Previous studies have achieved significant breakthroughs and extensive …

被引用次数：27 相关文章所有 4 个版本

[PDF] arxiv.org

Dawn of the transformer era in speech emotion recognition: closing the valence gap

J Wagner, A Triantafyllopoulos… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org

Recent advances in transformer-based architectures have shown promise in several
machine learning tasks. In the audio domain, such architectures have been successfully …

被引用次数：320 相关文章所有 8 个版本

[PDF] arxiv.org

Emotion recognition from speech using wav2vec 2.0 embeddings

L Pepino, P Riera, L Ferrer - arXiv preprint arXiv:2104.03502, 2021 - arxiv.org

Emotion recognition datasets are relatively small, making the use of the more sophisticated
deep learning approaches challenging. In this work, we propose a transfer learning method …

被引用次数：397 相关文章所有 8 个版本

Adversarial alignment and graph fusion via information bottleneck for multimodal emotion recognition in conversations

Y Shou, T Meng, W Ai, F Zhang, N Yin, K Li - Information Fusion, 2024 - Elsevier

With the rapid development of social media and human–computer interaction, multimodal
emotion recognition in conversations (MERC) tasks have begun to receive widespread …

被引用次数：10 相关文章所有 3 个版本

Learning multi-scale features for speech emotion recognition with connection attention mechanism

Z Chen, J Li, H Liu, X Wang, H Wang… - Expert Systems with …, 2023 - Elsevier

Speech emotion recognition (SER) has become a crucial topic in the field of human–
computer interactions. Feature representation plays an important role in SER, but there are …

被引用次数：56 相关文章所有 2 个版本

[PDF] arxiv.org

Slue: New benchmark tasks for spoken language understanding evaluation on natural speech

S Shon, A Pasad, F Wu, P Brusco… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org

Progress in speech processing has been facilitated by shared datasets and benchmarks.
Historically these have focused on automatic speech recognition (ASR), speaker …

被引用次数：77 相关文章所有 4 个版本

[PDF] arxiv.org

Adversarial representation with intra-modal and inter-modal graph contrastive learning for multimodal emotion recognition

Y Shou, T Meng, W Ai, N Yin, K Li - arXiv preprint arXiv:2312.16778, 2023 - arxiv.org

With the release of increasing open-source emotion recognition datasets on social media
platforms and the rapid development of computing resources, multimodal emotion …

被引用次数：22 相关文章所有 2 个版本

[PDF] arxiv.org

Representation learning through cross-modal conditional teacher-student training for speech emotion recognition

S Srinivasan, Z Huang… - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org

Generic pre-trained speech and text representations promise to reduce the need for large
labeled datasets on specific speech and language tasks. However, it is not clear how to …

被引用次数：44 相关文章所有 6 个版本

[PDF] arxiv.org

Probing speech emotion recognition transformers for linguistic knowledge

A Triantafyllopoulos, J Wagner, H Wierstorf… - arXiv preprint arXiv …, 2022 - arxiv.org

Large, pre-trained neural networks consisting of self-attention layers (transformers) have
recently achieved state-of-the-art results on several speech emotion recognition (SER) …

被引用次数：29 相关文章所有 9 个版本

[PDF] conicet.gov.ar

Alzheimer disease recognition using speech-based embeddings from pre-trained models

ML Gauder, LD Pepino, L Ferrer, P Riera - 2021 - ri.conicet.gov.ar

This paper describes our submission to the ADreSSo Challenge, which focuses on the
problem of automatic recognition of Alzheimer's Disease (AD) from speech. The audio …

被引用次数：46 相关文章所有 6 个版本

高级搜索

QQ 群