关注
Midia Yousefi
Midia Yousefi
Senior Research Scientist at Microsoft
在 microsoft.com 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
Block-based high performance CNN architectures for frame-level overlapping speech detection
M Yousefi, JHL Hansen
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 28-40, 2020
512020
Audio-based toxic language classification using self-attentive convolutional neural network
M Yousefi, D Emmanouilidou
2021 29th European Signal Processing Conference (EUSIPCO), 11-15, 2021
302021
Probabilistic permutation invariant training for speech separation
M Yousefi, S Khorram, JHL Hansen
arXiv preprint arXiv:1908.01768, 2019
292019
Frame-based overlapping speech detection using convolutional neural networks
M Yousefi, JHL Hansen
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
222020
Assessing speaker engagement in 2-person debates: Overlap detection in United States Presidential debates.
M Yousefi, N Shokouhi, JHL Hansen
Interspeech, 2117-2121, 2018
182018
Real-time speaker counting in a cocktail party scenario using attention-guided convolutional neural network
M Yousefi, JHL Hansen
arXiv preprint arXiv:2111.00316, 2021
152021
Supervised speech enhancement using online group-sparse convolutive nmf
M Yousefi, MH Savoji
2016 8th International Symposium on Telecommunications (IST), 494-499, 2016
102016
Speaker conditioning of acoustic models using affine transformation for multi-speaker speech recognition
M Yousefi, JHL Hansen
2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021
82021
Profile-Error-Tolerant Target-Speaker Voice Activity Detection
D Wang, X Xiao, N Kanda, M Yousefi, T Yoshioka, J Wu
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
52024
Single-channel speech separation using soft-minimum permutation invariant training
M Yousefi, JHL Hansen
Speech Communication 151, 76-85, 2023
42023
System for end-to-end speech separation using squeeze and excitation dilated convolutional neural networks
M Yousefi, P Angkititrakul
US Patent App. 16/805,716, 2021
42021
Investigating neural audio codecs for speech language model-based speech generation
J Li, D Wang, X Wang, Y Qian, L Zhou, S Liu, M Yousefi, C Li, CH Tsai, ...
2024 IEEE Spoken Language Technology Workshop (SLT), 554-561, 2024
22024
TransVIP: Speech to Speech Translation System with Voice and Isochrony Preservation
C Le, Y Qian, D Wang, L Zhou, S Liu, X Wang, M Yousefi, Y Qian, J Li, ...
arXiv preprint arXiv:2405.17809, 2024
22024
Speaker Diarization for ASR Output with T-vectors: A Sequence Classification Approach
M Yousefi, N Kanda, D Wang, Z Chen, X Wang, T Yoshioka
22023
CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations
L Zhang, Y Qian, L Zhou, S Liu, D Wang, X Wang, M Yousefi, Y Qian, J Li, ...
arXiv preprint arXiv:2404.06690, 2024
12024
Deep Learning Based Methods for Detection, Separation, and Recognition of Overlapping Speech
M Yousefi
The University of Texas at Dallas, 2021
12021
Isochrony-Controlled Speech-to-Text Translation: A study on translating from Sino-Tibetan to Indo-European Languages
M Yousefi, Y Qian, J Chen, G Wang, Y Liu, D Wang, X Wang, J Xue
arXiv preprint arXiv:2411.07387, 2024
2024
Domain mismatch and data augmentation in speech emotion recognition
D Emmanouilidou, H Gamper, M Yousefi
Proc. SMM 2024, 21-25, 2024
2024
FEARLESS STEPS: ADVANCEMENTS IN SPEECH TECHNOLOGY AND CORPUS DEVELOPMENT FOR NATURALISTIC AUDIO
A Joglekar, JHL Hansen, M Yousefi, M Chandra Shekar, SJ Chen, ...
NASA Human Research Program Investigators Conference, 2023
2023
系统目前无法执行此操作,请稍后再试。
文章 1–19