Target-speaker voice activity detection via sequence-to-sequence prediction | M Cheng, W Wang, Y Zhang, X Qin, M Li | ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 34 | 2023 |
Low-latency online speaker diarization with graph-based label generation | Y Zhang, Q Lin, W Wang, L Yang, X Wang, J Wang, M Li | arXiv preprint arXiv:2111.13803, 2021 | 15 | 2021 |
The DKU-DukeECE diarization system for the VoxCeleb Speaker Recognition Challenge 2022 | W Wang, X Qin, M Cheng, Y Zhang, K Wang, M Li | arXiv preprint arXiv:2210.01677, 2022 | 9 | 2022 |
The DKU-SMIIP diarization system for the VoxCeleb Speaker Recognition Challenge 2022 | W Wang, X Qin, M Cheng, Y Zhang, K Wang, M Li | VoxSRC Workshop, 2022 | 9 | 2022 |
A Dual-Path Framework with Frequency-and-Time Excited Network for Anomalous Sound Detection | Y Zhang, J Liu, Y Tian, H Liu, M Li | ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 4 | 2024 |
Outlier-aware inlier modeling and multi-scale scoring for anomalous sound detection via multitask learning | Y Zhang, H Suo, Y Wan, M Li | arXiv preprint arXiv:2309.07500, 2023 | 4 | 2023 |
Multimodal Laryngoscopic Video Analysis for Assisted Diagnosis of Vocal Cord Paralysis | Y Zhang, X Zou, J Yang, W Chen, F Liang, M Li | arXiv preprint arXiv:2409.03597, 2024 | 1 | 2024 |
An Automatic Laryngoscopic Image Segmentation System Based on SAM Prompt Engineering: From Glottis Annotation to Vocal Fold Segmentation | Y Song, Y Zhang, M Li | Authorea Preprints, 2024 | | 2024 |
Data Augmentation by Finite Element Analysis for Enhanced Machine Anomalous Sound Detection | Z Zhang, Y Zhang, M Li | National Conference on Man-Machine Speech Communication, 102-110, 2023 | | 2023 |