Non-autoregressive transformer with unified bidirectional decoder for automatic speech recognition CF Zhang, Y Liu, TH Zhang, SL Chen, F Chen, XC Yin ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 9 | 2022 |
Stable Speech Emotion Recognition with Head-k-Pooling Loss C Ding, J Li, D Zong, B Li, TH Zhang, Q Zhou INTERSPEECH 2023, 2023 | 4 | 2023 |
M3TTS: Multi-modal text-to-speech of multi-scale style control for dubbing Y Liu, LF Wei, X Qian, TH Zhang, SL Chen, XC Yin Pattern Recognition Letters 179, 158-164, 2024 | 2 | 2024 |
Self-supervised contrastive speaker verification with nearest neighbor positive instances Y Liu, LF Wei, CF Zhang, TH Zhang, SL Chen, XC Yin Pattern Recognition Letters 173, 17-22, 2023 | 2 | 2023 |
An Efficient Temporal Model for Small-Footprint Keyword Spotting S Zhang, T Zhang, S Chen, F Chen, X Yin 2021 7th IEEE International Conference on Network Intelligence and Digital …, 2021 | 2 | 2021 |
Improving Zero-Shot Chinese-English Code-Switching ASR with kNN-CTC and Gated Monolingual Datastores J Zhou, S Zhao, H Wang, TH Zhang, H Sun, X Wang, Y Qin arXiv preprint arXiv:2406.03814, 2024 | 1 | 2024 |
CIF-T: A Novel CIF-Based Transducer Architecture for Automatic Speech Recognition TH Zhang, D Zhou, G Zhong, J Zhou, B Li ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 1 | 2024 |
InterFormer: Interactive Local and Global Features Fusion for Automatic Speech Recognition ZH Lai, TH Zhang, Q Liu, X Qian, LF Wei, SL Chen, F Chen, XC Yin INTERSPEECH 2023, 2023 | 1 | 2023 |
Improving Multi-Type License Plate Recognition via Learning Globally and Contrastively Q Liu, Y Liu, SL Chen, TH Zhang, F Chen, XC Yin IEEE Transactions on Intelligent Transportation Systems, 2024 | | 2024 |
Self-Convolution for Automatic Speech Recognition TH Zhang, Q Liu, X Qian, SL Chen, F Chen, XC Yin ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | | 2023 |
Rethinking Speech Recognition with A Multimodal Perspective via Acoustic and Semantic Cooperative Decoding TH Zhang, HB Qin, ZH Lai, SL Chen, Q Liu, F Chen, X Qian, XC Yin INTERSPEECH 2023, 2023 | | 2023 |
Transmitted and Aggregated Self-Attention for Automatic Speech Recognition TH Zhang, X Qian, F Chen, XC Yin INTERSPEECH 2024, 0 | | |