Midia Yousefi: Academic Profile

Citations

	Total	Since 2020
Citations	204	193
h-index	8	7
i10-index	7	6

Citations per year

Year	2018	2019	2020	2021	2022	2023	2024	2025
Citations	3	8	13	42	42	48	45	2

Public access

Based on funders' open-access mandates:
2 articles available
0 articles not available

Co-authors

John Hansen, Associate Dean for Research; Erik Jonsson School of Engineering, University of Texas at Dallas (verified email at utdallas.edu)
Dongmei Wang, Microsoft (verified email at microsoft.com)
Xiaofei Wang, Microsoft (verified email at jhu.edu)
Dimitra Emmanouilidou, Researcher - Microsoft Research (verified email at microsoft.com)
Takuya Yoshioka, AssemblyAI (verified email at assemblyai.com)
Naoyuki Kanda, Meta (verified email at meta.com)
Xiong Xiao, Principal Applied Scientist, Microsoft (verified email at microsoft.com)
Pongtep Angkititrakul, Nagoya University (verified email at g.sp.m.is.nagoya-u.ac.jp)
Zhuo Chen, Bytedance (formerly Microsoft, Columbia University) (verified email at columbia.edu)

Midia Yousefi

Senior Research Scientist at Microsoft

Verified email at microsoft.com

Machine Learning, Speech Translation, Speech Recognition, Speech and Language Processing


Title	Cited by	Year
Block-based high performance CNN architectures for frame-level overlapping speech detection. M Yousefi, JHL Hansen. IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 28-40, 2020	51	2020
Audio-based toxic language classification using self-attentive convolutional neural network. M Yousefi, D Emmanouilidou. 2021 29th European Signal Processing Conference (EUSIPCO), 11-15, 2021	30	2021
Probabilistic permutation invariant training for speech separation. M Yousefi, S Khorram, JHL Hansen. arXiv preprint arXiv:1908.01768, 2019	29	2019
Frame-based overlapping speech detection using convolutional neural networks. M Yousefi, JHL Hansen. ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	22	2020
Assessing speaker engagement in 2-person debates: Overlap detection in United States Presidential debates. M Yousefi, N Shokouhi, JHL Hansen. Interspeech, 2117-2121, 2018	18	2018
Real-time speaker counting in a cocktail party scenario using attention-guided convolutional neural network. M Yousefi, JHL Hansen. arXiv preprint arXiv:2111.00316, 2021	15	2021
Supervised speech enhancement using online group-sparse convolutive NMF. M Yousefi, MH Savoji. 2016 8th International Symposium on Telecommunications (IST), 494-499, 2016	10	2016
Speaker conditioning of acoustic models using affine transformation for multi-speaker speech recognition. M Yousefi, JHL Hansen. 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021	8	2021
Profile-Error-Tolerant Target-Speaker Voice Activity Detection. D Wang, X Xiao, N Kanda, M Yousefi, T Yoshioka, J Wu. ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	5	2024
Single-channel speech separation using soft-minimum permutation invariant training. M Yousefi, JHL Hansen. Speech Communication 151, 76-85, 2023	4	2023
System for end-to-end speech separation using squeeze and excitation dilated convolutional neural networks. M Yousefi, P Angkititrakul. US Patent App. 16/805,716, 2021	4	2021
Investigating neural audio codecs for speech language model-based speech generation. J Li, D Wang, X Wang, Y Qian, L Zhou, S Liu, M Yousefi, C Li, CH Tsai, ... 2024 IEEE Spoken Language Technology Workshop (SLT), 554-561, 2024	2	2024
TransVIP: Speech to Speech Translation System with Voice and Isochrony Preservation. C Le, Y Qian, D Wang, L Zhou, S Liu, X Wang, M Yousefi, Y Qian, J Li, ... arXiv preprint arXiv:2405.17809, 2024	2	2024
Speaker Diarization for ASR Output with T-vectors: A Sequence Classification Approach. M Yousefi, N Kanda, D Wang, Z Chen, X Wang, T Yoshioka	2	2023
CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations. L Zhang, Y Qian, L Zhou, S Liu, D Wang, X Wang, M Yousefi, Y Qian, J Li, ... arXiv preprint arXiv:2404.06690, 2024	1	2024
Deep Learning Based Methods for Detection, Separation, and Recognition of Overlapping Speech. M Yousefi. The University of Texas at Dallas, 2021	1	2021
Isochrony-Controlled Speech-to-Text Translation: A study on translating from Sino-Tibetan to Indo-European Languages. M Yousefi, Y Qian, J Chen, G Wang, Y Liu, D Wang, X Wang, J Xue. arXiv preprint arXiv:2411.07387, 2024		2024
Domain mismatch and data augmentation in speech emotion recognition. D Emmanouilidou, H Gamper, M Yousefi. Proc. SMM 2024, 21-25, 2024		2024
FEARLESS STEPS: ADVANCEMENTS IN SPEECH TECHNOLOGY AND CORPUS DEVELOPMENT FOR NATURALISTIC AUDIO. A Joglekar, JHL Hansen, M Yousefi, M Chandra Shekar, SJ Chen, ... NASA Human Research Program Investigators Conference, 2023		2023
