NHSS: A speech and singing parallel database B Sharma, X Gao, K Vijayan, X Tian, H Li Speech Communication 133, 9-22, 2021 | 36 | 2021 |
Automatic lyrics transcription of polyphonic music with lyrics-chord multi-task learning X Gao, C Gupta, H Li IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 2280-2294, 2022 | 27 | 2022 |
Genre-conditioned Acoustic Models for Automatic Lyrics Transcription of Polyphonic Music X Gao, C Gupta, H Li in Proc. ICASSP, 791-795, 2022 | 20 | 2022 |
Analysis of Speech and Singing Signals for Temporal Alignment K Vijayan, X Gao, H Li Asia-Pacific Signal and Information Processing Association Annual Summit and …, 2018 | 18 | 2018 |
Speaker-independent spectral mapping for speech-to-singing conversion X Gao, X Tian, RK Das, Y Zhou, H Li 2019 Asia-Pacific Signal and Information Processing Association Annual …, 2019 | 14 | 2019 |
Personalized Singing Voice Generation Using WaveRNN X Gao, X Tian, Y Zhou, RK Das, H Li Proc. Odyssey 2020 The Speaker and Language Recognition Workshop, 252-258, 2020 | 11 | 2020 |
LYRICS TRANSCRIPTION AND LYRICS-TO-AUDIO ALIGNMENT WITH MUSIC-INFORMED ACOUSTIC MODELS X Gao, C Gupta, H Li MIREX 2020, 2020 | 10 | 2020 |
NUS-HLT Spoken Lyrics and Singing (SLS) Corpus X Gao, B Sisman, RK Das, K Vijayan International Conference on Orange Technologies (ICOT 2018), 2018 | 9 | 2018 |
PoLyScriber: Integrated Fine-tuning of Extractor and Lyrics Transcriber for Polyphonic Music X Gao, C Gupta, H Li IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023 | 7* | 2023 |
token2vec: A Joint Self-Supervised Pre-training Framework Using Unpaired Speech and Text X Yue, J Ao, X Gao, H Li IEEE ICASSP 2023, 2022 | 7 | 2022 |
Music-robust Automatic Lyrics Transcription of Polyphonic Music X Gao, C Gupta, H Li in Proc. SMC 2022, 325-332, 2022 | 6 | 2022 |
Self-Transriber: Few-shot Lyrics Transcription with Self-training X Gao, X Yue, H Li IEEE ICASSP 2023, 2022 | 4 | 2022 |
NUS Speak-to-Sing: A Web Platform for Personalized Speech-to-Singing Conversion. C Gupta, K Vijayan, B Sharma, X Gao, H Li INTERSPEECH, 2376-2377, 2019 | 2 | 2019 |
Automatic lyrics transcription of polyphonic music X Gao PQDT-Global, 2022 | 1 | 2022 |
TTSlow: Slow Down Text-to-Speech with Efficiency Robustness Evaluations X Gao, Y Chen, X Yue, Y Tsao, NF Chen arXiv preprint arXiv:2407.01927, 2024 | | 2024 |
Enhancing Real-World Active Speaker Detection with Multi-Modal Extraction Pre-Training R Tao, X Qian, RK Das, X Gao, J Wang, H Li arXiv preprint arXiv:2404.00861, 2024 | | 2024 |
Text-guided HuBERT: Self-Supervised Speech Pre-training via Generative Adversarial Networks D Ma, X Yue, J Ao, X Gao, H Li arXiv preprint arXiv:2402.15725, 2024 | | 2024 |
Adapting Pre-Trained Self-Supervised Learning Model for Speech Recognition with Light-Weight Adapters X Yue, X Gao, X Qian, H Li Electronics 13 (1), 190, 2024 | | 2024 |
AUTOMATIC LYRICS TRANSCRIPTION OF POLYPHONIC MUSIC GAO XIAOXUE Doctoral Dissertation, National University of Singapore, Singapore, 2022 | | 2022 |