M2MeT: The ICASSP 2022 multi-channel multi-party meeting transcription challenge F Yu, S Zhang, Y Fu, L Xie, S Zheng, Z Du, W Huang, P Guo, Z Yan, B Ma, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 70 | 2022 |
Summary on the ICASSP 2022 multi-channel multi-party meeting transcription grand challenge F Yu, S Zhang, P Guo, Y Fu, Z Du, S Zheng, W Huang, L Xie, ZH Tan, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 50* | 2022 |
CAM++: A fast and efficient network for speaker verification using context-aware masking H Wang, S Zheng, Y Chen, L Cheng, Q Chen arXiv preprint arXiv:2303.00332, 2023 | 32 | 2023 |
Autoencoder-based semi-supervised curriculum learning for out-of-domain speaker verification S Zheng, G Liu, H Suo, Y Lei System 3, 98, 2019 | 24 | 2019 |
A real-time speaker diarization system based on spatial spectrum S Zheng, W Huang, X Wang, H Suo, J Feng, Z Yan ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 23 | 2021 |
CAM: Context-aware masking for robust speaker verification YQ Yu, S Zheng, H Suo, Y Lei, WJ Li ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 20 | 2021 |
Pushing the limits of self-supervised speaker verification using regularized distillation framework Y Chen, S Zheng, H Wang, L Cheng, Q Chen ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 18 | 2023 |
FunCodec: A fundamental, reproducible and integrable open-source toolkit for neural speech codec Z Du, S Zhang, K Hu, S Zheng ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 17 | 2024 |
LauraGPT: Listen, attend, understand, and regenerate audio with GPT Q Chen, Y Chu, Z Gao, Z Li, K Hu, X Zhou, J Xu, Z Ma, W Wang, S Zheng, ... arXiv preprint arXiv:2310.04673, 2023 | 17 | 2023 |
A new method for fuzzy formal concept analysis S Zheng, Y Zhou, T Martin 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and …, 2009 | 17 | 2009 |
An enhanced Res2Net with local and global feature fusion for speaker verification Y Chen, S Zheng, H Wang, L Cheng, Q Chen, J Qi arXiv preprint arXiv:2305.12838, 2023 | 15 | 2023 |
Phonetically-Aware Coupled Network For Short Duration Text-Independent Speaker Verification. S Zheng, Y Lei, H Suo INTERSPEECH, 926-930, 2020 | 15 | 2020 |
Speaker overlap-aware neural diarization for multi-party meeting analysis Z Du, S Zhang, S Zheng, Z Yan arXiv preprint arXiv:2211.10243, 2022 | 10 | 2022 |
Crop weed identification system based on convolutional neural network F Miao, S Zheng, B Tao 2019 IEEE 2nd International Conference on Electronic Information and …, 2019 | 10 | 2019 |
LauraGPT: Listen, attend, understand, and regenerate audio with GPT J Wang, Z Du, Q Chen, Y Chu, Z Gao, Z Li, K Hu, X Zhou, J Xu, Z Ma, ... | 9 | 2023 |
PoNet: Pooling network for efficient token mixing in long sequences CH Tan, Q Chen, W Wang, Q Zhang, S Zheng, ZH Ling arXiv preprint arXiv:2110.02442, 2021 | 8 | 2021 |
Towards a Fault-Tolerant Speaker Verification System: A Regularization Approach to Reduce the Condition Number. S Zheng, G Liu, H Suo, Y Lei INTERSPEECH, 4065-4069, 2019 | 8 | 2019 |
3D-Speaker: A large-scale multi-device, multi-distance, and multi-dialect corpus for speech representation disentanglement S Zheng, L Cheng, Y Chen, H Wang, Q Chen arXiv preprint arXiv:2306.15354, 2023 | 7 | 2023 |
BeamTransformer: Microphone array-based overlapping speech detection S Zheng, S Zhang, W Huang, Q Chen, H Suo, M Lei, J Feng, Z Yan arXiv preprint arXiv:2109.04049, 2021 | 7 | 2021 |
An Embarrassingly Simple Approach for LLM with Strong ASR Capacity Z Ma, G Yang, Y Yang, Z Gao, J Wang, Z Du, F Yu, Q Chen, S Zheng, ... arXiv preprint arXiv:2402.08846, 2024 | 5 | 2024 |