作者
Siddique Latif, Junaid Qadir, Muhammad Bilal
发表日期
2019/9/3
研讨会论文
2019 8th International Conference on Affective Computing and Intelligent Interaction (ACII)
页码范围
732-737
出版商
IEEE
简介
Cross-lingual speech emotion recognition (SER)is a crucial task for many real-world applications. The performance of SER systems is often degraded by the differences in the distributions of training and test data. These differences become more apparent when training and test data belong to different languages, which cause a significant performance gap between the validation and test scores. It is imperative to build more robust models that can fit in practical applications of SER systems. Therefore, in this paper, we propose a Generative Adversarial Network (GAN)-based model for multilingual SER. Our choice of using GAN is motivated by their great success in learning the underlying data distribution. The proposed model is designed in such a way that the language invariant representations can be learned without requiring target-language data labels. We evaluate our proposed model on four different language …
引用总数
2020202120222023202410918195
学术搜索中的文章