Audio self-supervised learning: A survey S Liu, A Mallol-Ragolta, E Parada-Cabaleiro, K Qian, X Jing, A Kathan, ... Patterns 3 (12), 2022 | 90 | 2022 |
CovNet: A transfer learning framework for automatic COVID-19 detection from crowd-sourced cough sounds Y Chang, X Jing, Z Ren, BW Schuller Frontiers in Digital Health 3, 799067, 2022 | 21 | 2022 |
An overview & analysis of sequence-to-sequence emotional voice conversion Z Yang, X Jing, A Triantafyllopoulos, M Song, I Aslan, BW Schuller arXiv preprint arXiv:2203.15873, 2022 | 10 | 2022 |
Redundancy reduction twins network: A training framework for multi-output emotion regression X Jing, M Song, A Triantafyllopoulos, Z Yang, BW Schuller The ICML Expressive Vocalizations (ExVo) Workshop and Competition 2022, 2022 | 8 | 2022 |
Daily mental health monitoring from speech: A real-world Japanese dataset and multitask learning analysis M Song, A Triantafyllopoulos, Z Yang, H Takeuchi, T Nakamura, A Kishi, ... ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 7 | 2023 |
Exploring speaker enrolment for few-shot personalisation in emotional vocalisation prediction A Triantafyllopoulos, M Song, Z Yang, X Jing, BW Schuller arXiv preprint arXiv:2206.06680, 2022 | 6 | 2022 |
Dynamic Restrained Uncertainty Weighting Loss for Multitask Learning of Vocal Expression M Song, Z Yang, A Triantafyllopoulos, X Jing, V Karas, X Jiangjian, ... The ICML Expressive Vocalizations (ExVo) Workshop and Competition 2022, 2022 | 4 | 2022 |
U-DiT TTS: U-Diffusion Vision Transformer for Text-to-Speech X Jing, Y Chang, Z Yang, J Xie, A Triantafyllopoulos, BW Schuller Speech Communication; 15th ITG Conference, 56-60, 2023 | 2 | 2023 |
HEAR4Health: a blueprint for making computer audition a staple of modern healthcare A Triantafyllopoulos, A Kathan, A Baird, L Christ, A Gebhard, M Gerczuk, ... Frontiers in Digital Health 5, 1196079, 2023 | 2 | 2023 |
A Temporal-oriented Broadcast ResNet for COVID-19 Detection X Jing, S Liu, E Parada-Cabaleiro, A Triantafyllopoulos, M Song, Z Yang, ... 2022 IEEE-EMBS International Conference on Biomedical and Health Informatics …, 2022 | 1 | 2022 |
Parallelising 2D-CNNs and transformers: A Cognitive-based approach for Automatic Recognition of Learners’ English Proficiency M Song, E Parada-Cabaleiro, Z Yang, X Jing, K Togami, K Qian, ... Intelligent Human Systems Integration (IHSI 2022): Integrating People and …, 2022 | 1 | 2022 |
ParaCLAP--Towards a general language-audio model for computational paralinguistic tasks X Jing, A Triantafyllopoulos, B Schuller arXiv preprint arXiv:2406.07203, 2024 | | 2024 |
STAA-Net: A Sparse and Transferable Adversarial Attack for Speech Emotion Recognition Y Chang, Z Ren, Z Zhang, X Jing, K Qian, X Shao, B Hu, T Schultz, ... arXiv preprint arXiv:2402.01227, 2024 | | 2024 |
Identifying languages in a novel dataset: ASMR-whispered speech M Song, Z Yang, E Parada-Cabaleiro, X Jing, Y Yamamoto, B Schuller Frontiers in Neuroscience 17, 1120311, 2023 | | 2023 |