关注
Siqi Zheng
Siqi Zheng
DAMO Academy, Alibaba Group
在 mail.harvard.edu 的电子邮件经过验证
标题
引用次数
引用次数
年份
M2MeT: The ICASSP 2022 multi-channel multi-party meeting transcription challenge
F Yu, S Zhang, Y Fu, L Xie, S Zheng, Z Du, W Huang, P Guo, Z Yan, B Ma, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
702022
Summary on the ICASSP 2022 multi-channel multi-party meeting transcription grand challenge
F Yu, S Zhang, P Guo, Y Fu, Z Du, S Zheng, W Huang, L Xie, ZH Tan, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
50*2022
Cam++: A fast and efficient network for speaker verification using context-aware masking
H Wang, S Zheng, Y Chen, L Cheng, Q Chen
arXiv preprint arXiv:2303.00332, 2023
322023
Autoencoder-based semi-supervised curriculum learning for out-of-domain speaker verification
S Zheng, G Liu, H Suo, Y Lei
System 3, 98, 2019
242019
A real-time speaker diarization system based on spatial spectrum
S Zheng, W Huang, X Wang, H Suo, J Feng, Z Yan
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
232021
Cam: Context-aware masking for robust speaker verification
YQ Yu, S Zheng, H Suo, Y Lei, WJ Li
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
202021
Pushing the limits of self-supervised speaker verification using regularized distillation framework
Y Chen, S Zheng, H Wang, L Cheng, Q Chen
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
182023
Funcodec: A fundamental, reproducible and integrable open-source toolkit for neural speech codec
Z Du, S Zhang, K Hu, S Zheng
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
172024
Lauragpt: Listen, attend, understand, and regenerate audio with gpt
Q Chen, Y Chu, Z Gao, Z Li, K Hu, X Zhou, J Xu, Z Ma, W Wang, S Zheng, ...
arXiv preprint arXiv:2310.04673, 2023
172023
A new method for fuzzy formal concept analysis
S Zheng, Y Zhou, T Martin
2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and …, 2009
172009
An enhanced res2net with local and global feature fusion for speaker verification
Y Chen, S Zheng, H Wang, L Cheng, Q Chen, J Qi
arXiv preprint arXiv:2305.12838, 2023
152023
Phonetically-Aware Coupled Network For Short Duration Text-Independent Speaker Verification.
S Zheng, Y Lei, H Suo
INTERSPEECH, 926-930, 2020
152020
Speaker overlap-aware neural diarization for multi-party meeting analysis
Z Du, S Zhang, S Zheng, Z Yan
arXiv preprint arXiv:2211.10243, 2022
102022
Crop weed identification system based on convolutional neural network
F Miao, S Zheng, B Tao
2019 IEEE 2nd International Conference on Electronic Information and …, 2019
102019
LauraGPT: Listen, attend, understand, and regenerate audio with GPT
J Wang, Z Du, Q Chen, Y Chu, Z Gao, Z Li, K Hu, X Zhou, J Xu, Z Ma, ...
92023
Ponet: Pooling network for efficient token mixing in long sequences
CH Tan, Q Chen, W Wang, Q Zhang, S Zheng, ZH Ling
arXiv preprint arXiv:2110.02442, 2021
82021
Towards a Fault-Tolerant Speaker Verification System: A Regularization Approach to Reduce the Condition Number.
S Zheng, G Liu, H Suo, Y Lei
INTERSPEECH, 4065-4069, 2019
82019
3d-speaker: A large-scale multi-device, multi-distance, and multi-dialect corpus for speech representation disentanglement
S Zheng, L Cheng, Y Chen, H Wang, Q Chen
arXiv preprint arXiv:2306.15354, 2023
72023
Beamtransformer: Microphone array-based overlapping speech detection
S Zheng, S Zhang, W Huang, Q Chen, H Suo, M Lei, J Feng, Z Yan
arXiv preprint arXiv:2109.04049, 2021
72021
An Embarrassingly Simple Approach for LLM with Strong ASR Capacity
Z Ma, G Yang, Y Yang, Z Gao, J Wang, Z Du, F Yu, Q Chen, S Zheng, ...
arXiv preprint arXiv:2402.08846, 2024
52024
系统目前无法执行此操作,请稍后再试。
文章 1–20