Block-based high performance CNN architectures for frame-level overlapping speech detection M Yousefi, JHL Hansen IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 28-40, 2020 | 51 | 2020 |
Audio-based toxic language classification using self-attentive convolutional neural network M Yousefi, D Emmanouilidou 2021 29th European Signal Processing Conference (EUSIPCO), 11-15, 2021 | 30 | 2021 |
Probabilistic permutation invariant training for speech separation M Yousefi, S Khorram, JHL Hansen arXiv preprint arXiv:1908.01768, 2019 | 29 | 2019 |
Frame-based overlapping speech detection using convolutional neural networks M Yousefi, JHL Hansen ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 22 | 2020 |
Assessing speaker engagement in 2-person debates: Overlap detection in United States Presidential debates. M Yousefi, N Shokouhi, JHL Hansen Interspeech, 2117-2121, 2018 | 18 | 2018 |
Real-time speaker counting in a cocktail party scenario using attention-guided convolutional neural network M Yousefi, JHL Hansen arXiv preprint arXiv:2111.00316, 2021 | 15 | 2021 |
Supervised speech enhancement using online group-sparse convolutive nmf M Yousefi, MH Savoji 2016 8th International Symposium on Telecommunications (IST), 494-499, 2016 | 10 | 2016 |
Speaker conditioning of acoustic models using affine transformation for multi-speaker speech recognition M Yousefi, JHL Hansen 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021 | 8 | 2021 |
Profile-Error-Tolerant Target-Speaker Voice Activity Detection D Wang, X Xiao, N Kanda, M Yousefi, T Yoshioka, J Wu ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 5 | 2024 |
Single-channel speech separation using soft-minimum permutation invariant training M Yousefi, JHL Hansen Speech Communication 151, 76-85, 2023 | 4 | 2023 |
System for end-to-end speech separation using squeeze and excitation dilated convolutional neural networks M Yousefi, P Angkititrakul US Patent App. 16/805,716, 2021 | 4 | 2021 |
Investigating neural audio codecs for speech language model-based speech generation J Li, D Wang, X Wang, Y Qian, L Zhou, S Liu, M Yousefi, C Li, CH Tsai, ... 2024 IEEE Spoken Language Technology Workshop (SLT), 554-561, 2024 | 2 | 2024 |
TransVIP: Speech to Speech Translation System with Voice and Isochrony Preservation C Le, Y Qian, D Wang, L Zhou, S Liu, X Wang, M Yousefi, Y Qian, J Li, ... arXiv preprint arXiv:2405.17809, 2024 | 2 | 2024 |
Speaker Diarization for ASR Output with T-vectors: A Sequence Classification Approach M Yousefi, N Kanda, D Wang, Z Chen, X Wang, T Yoshioka | 2 | 2023 |
CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations L Zhang, Y Qian, L Zhou, S Liu, D Wang, X Wang, M Yousefi, Y Qian, J Li, ... arXiv preprint arXiv:2404.06690, 2024 | 1 | 2024 |
Deep Learning Based Methods for Detection, Separation, and Recognition of Overlapping Speech M Yousefi The University of Texas at Dallas, 2021 | 1 | 2021 |
Isochrony-Controlled Speech-to-Text Translation: A study on translating from Sino-Tibetan to Indo-European Languages M Yousefi, Y Qian, J Chen, G Wang, Y Liu, D Wang, X Wang, J Xue arXiv preprint arXiv:2411.07387, 2024 | | 2024 |
Domain mismatch and data augmentation in speech emotion recognition D Emmanouilidou, H Gamper, M Yousefi Proc. SMM 2024, 21-25, 2024 | | 2024 |
FEARLESS STEPS: ADVANCEMENTS IN SPEECH TECHNOLOGY AND CORPUS DEVELOPMENT FOR NATURALISTIC AUDIO A Joglekar, JHL Hansen, M Yousefi, M Chandra Shekar, SJ Chen, ... NASA Human Research Program Investigators Conference, 2023 | | 2023 |