A Deep Investigation of RNN and Self-attention for the Cyrillic-Traditional Mongolian Bidirectional Conversion M Na, R Liu, F Bao, G Gao International Conference on Neural Information Processing, 612-625, 2022 | | 2022 |
A lstm approach with sub-word embeddings for mongolian phrase break prediction R Liu, F Bao, G Gao, H Zhang, Y Wang Proceedings of the 27th International Conference on Computational …, 2018 | 11 | 2018 |
Accurate emotion strength assessment for seen and unseen speech based on data-driven deep learning R Liu, B Sisman, B Schuller, G Gao, H Li arXiv preprint arXiv:2206.07229, 2022 | 9 | 2022 |
Alignment-learning based single-step decoding for accurate and fast non-autoregressive speech recognition Y Wang, R Liu, F Bao, H Zhang, G Gao ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 3 | 2022 |
Betray Oneself: A Novel Audio DeepFake Detection Model via Mono-to-Stereo Conversion R Liu, J Zhang, G Gao, H Li arXiv preprint arXiv:2305.16353, 2023 | 1 | 2023 |
Building mongolian tts front-end with encoder-decoder model by using bridge method and multi-view features R Liu, F Bao, G Gao Neural Information Processing: 26th International Conference, ICONIP 2019 …, 2019 | 7 | 2019 |
Check for updates MnTTS2: An Open-Source Multi-speaker Mongolian Text-to-Speech Synthesis Dataset K Liang, B Liu, Y Hu, R Liu, F Bao, G Gao Man-Machine Speech Communication: 17th National Conference, NCMMSC 2022 …, 2023 | | 2023 |
Comparative Study for Multi-Speaker Mongolian TTS with a New Corpus K Liang, B Liu, Y Hu, R Liu, F Bao, G Gao Applied Sciences 13 (7), 4237, 2023 | 1 | 2023 |
Contrastive Learning based Modality-Invariant Feature Acquisition for Robust Multimodal Emotion Recognition with Missing Modalities R Liu, H Zuo, Z Lian, BW Schuller, H Li IEEE Transactions on Affective Computing, 2024 | 1 | 2024 |
Controllable accented text-to-speech synthesis R Liu, B Sisman, G Gao, H Li arXiv preprint arXiv:2209.10804, 2022 | 5 | 2022 |
Controllable Accented Text-to-Speech Synthesis With Fine and Coarse-Grained Intensity Rendering R Liu, B Sisman, G Gao, H Li IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024 | 2 | 2024 |
Decoding knowledge transfer for neural text-to-speech training R Liu, B Sisman, G Gao, H Li IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 1789-1802, 2022 | 13 | 2022 |
Decoupling speaker-independent emotions for voice conversion via source-filter networks Z Luo, S Lin, R Liu, J Baba, Y Yoshikawa, H Ishiguro IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 11-24, 2022 | 8 | 2022 |
Distributed sensor selection for speech enhancement with acoustic sensor networks D Hu, Q Si, R Liu, F Bao IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 985-999, 2023 | 4 | 2023 |
Emotion rendering for conversational speech synthesis with heterogeneous graph-based context modeling R Liu, Y Hu, Y Ren, X Yin, H Li Proceedings of the AAAI Conference on Artificial Intelligence 38 (17), 18698 …, 2024 | 2 | 2024 |
Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech R Liu, B Liu, H Li arXiv preprint arXiv:2309.11724, 2023 | 2 | 2023 |
Emotion-Aware Speech Self-Supervised Representation Learning with Intensity Knowledge R Liu, Z Ma arXiv preprint arXiv:2406.06646, 2024 | | 2024 |
Emotional voice conversion: Theory, databases and ESD K Zhou, B Sisman, R Liu, H Li Speech Communication 137, 1-18, 2022 | 125 | 2022 |
End-to-end mongolian text-to-speech system J Li, H Zhang, R Liu, X Zhang, F Bao 2018 11th international symposium on chinese spoken language processing …, 2018 | 11 | 2018 |
Explicit intensity control for accented text-to-speech R Liu, H Zuo, D Hu, G Gao, H Li arXiv preprint arXiv:2210.15364, 2022 | 2 | 2022 |