The emerging science of interacting minds

T Wheatley, MA Thornton, A Stolk… - Perspectives on …, 2024 - journals.sagepub.com
For over a century, psychology has focused on uncovering mental processes of a single
individual. However, humans rarely navigate the world in isolation. The most important …

An overview of affective speech synthesis and conversion in the deep learning era

A Triantafyllopoulos, BW Schuller… - Proceedings of the …, 2023 - ieeexplore.ieee.org
Speech is the fundamental mode of human communication, and its synthesis has long been
a core priority in human–computer interaction research. In recent years, machines have …

Long dialogue emotion detection based on commonsense knowledge graph guidance

W Nie, Y Bao, Y Zhao, A Liu - IEEE Transactions on Multimedia, 2023 - ieeexplore.ieee.org
Dialogue emotion detection is always challenging due to human subjectivity and the
randomness of dialogue content. In a conversation, the emotion of each person often …

Transformers in speech processing: A survey

S Latif, A Zaidi, H Cuayahuitl, F Shamshad… - arXiv preprint arXiv …, 2023 - arxiv.org
The remarkable success of transformers in the field of natural language processing has
sparked the interest of the speech-processing community, leading to an exploration of their …

The muse 2023 multimodal sentiment analysis challenge: Mimicked emotions, cross-cultural humour, and personalisation

L Christ, S Amiriparian, A Baird, A Kathan… - Proceedings of the 4th …, 2023 - dl.acm.org
The Multimodal Sentiment Analysis Challenge (MuSe) 2023 is a set of shared tasks
addressing three different contemporary multimodal affect and sentiment analysis problems …

The VoicePrivacy 2024 Challenge Evaluation Plan

N Tomashenko, X Miao, P Champion, S Meyer… - arXiv preprint arXiv …, 2024 - arxiv.org
The task of the challenge is to develop a voice anonymization system for speech data which
conceals the speaker's voice identity while protecting linguistic content and emotional states …

[PDF][PDF] A review of speech-centric trustworthy machine learning: Privacy, safety, and fairness

T Feng, R Hebbar, N Mehlman, X Shi… - … on Signal and …, 2023 - nowpublishers.com
Speech-centric machine learning systems have revolutionized a number of leading
industries ranging from transportation and healthcare to education and defense …

[HTML][HTML] Speech emotion recognition using machine learning—A systematic review

S Madanian, T Chen, O Adeleye, JM Templeton… - Intelligent systems with …, 2023 - Elsevier
Speech emotion recognition (SER) as a Machine Learning (ML) problem continues to
garner a significant amount of research interest, especially in the affective computing …

A vector quantized approach for text to speech synthesis on real-world spontaneous speech

LW Chen, S Watanabe, A Rudnicky - Proceedings of the AAAI …, 2023 - ojs.aaai.org
Abstract Recent Text-to-Speech (TTS) systems trained on reading or acted corpora have
achieved near human-level naturalness. The diversity of human speech, however, often …

Speechformer++: A hierarchical efficient framework for paralinguistic speech processing

W Chen, X Xing, X Xu, J Pang… - IEEE/ACM Transactions …, 2023 - ieeexplore.ieee.org
Paralinguistic speech processing is important in addressing many issues, such as sentiment
and neurocognitive disorder analyses. Recently, Transformer has achieved remarkable …