Viet Anh Trinh
Title
Cited by
Year
Unsupervised Speech Enhancement with speech recognition embedding and disentanglement losses
VA Trinh, S Braun
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by 23 · 2022
New Dataset and Strong Baselines for the Grammatical Error Correction of Russian
VA Trinh, A Rozovskaya
Findings of ACL, 2021
Cited by 13 · 2021
Importantaug: a data augmentation agent for speech
VA Trinh, HS Kavaki, MI Mandel
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by 12 · 2022
Directly Comparing the Listening Strategies of Humans and Machines
VA Trinh, M Mandel
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 312 - 323, 2020
Cited by 11 · 2020
Bubble Cooperative Networks for identifying important speech cues
VA Trinh, B McFee, MI Mandel
Proceeding of the Interspeech 2018, 2018
Cited by 8 · 2018
Concatenative Resynthesis with Improved Training Signals for Speech Enhancement.
AR Syed, VA Trinh, MI Mandel
Proceeding of the Interspeech 2018
Cited by 6*
Large scale evaluation of importance maps in automatic speech recognition
VA Trinh, M Mandel
Proceeding of the Interspeech 2020, 2020
Cited by 5 · 2020
Reducing Geographic Disparities in Automatic Speech Recognition via Elastic Weight Consolidation
VA Trinh, P Ghahremani, B King, J Droppo, A Stolcke, R Maas
Proceeding of the Interspeech 2022, 2022
Cited by 4 · 2022
Combining spatial clustering with LSTM speech models for multichannel speech enhancement
F Grezes, Z Ni, VA Trinh, M Mandel
arXiv preprint arXiv:2012.03388, 2020
Cited by 4 · 2020
AUTOMATIC SPEECH RECOGNITION TUNED FOR CHILD SPEECH IN THE CLASSROOM
R Southwell, W Ward, VA Trinh, C Clevenger, C Clevenger, E Watts, ...
ICASSP, 2024
Cited by 2 · 2024
Enhancement of Spatial Clustering-Based Time-Frequency Masks using LSTM Neural Networks
F Grezes, Z Ni, VA Trinh, M Mandel
arXiv preprint arXiv:2012.01576, 2020
Cited by 2 · 2020
Discrete Multimodal Transformers with a Pretrained Large Language Model for Mixed-Supervision Speech Processing
VA Trinh, R Southwell, Y Guan, X He, Z Wang, J Whitehill
arXiv preprint arXiv:2406.06582, 2024
Cited by 1 · 2024
Towards Accurate and Real-Time End-of-Speech Estimation
Y Fan, C Vaz, D He, J Heymann, VA Trinh, Z Zhang, V Ravichandran
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by 1 · 2023
Identifying, Evaluating and Applying Importance Maps for Speech
VA Trinh
City University of New York, 2022
Cited by 1 · 2022
Improved MVDR Beamforming Using LSTM Speech Models to Clean Spatial Clustering Masks
Z Ni, F Grezes, VA Trinh, MI Mandel
arXiv preprint arXiv:2012.02191, 2020
Cited by 1 · 2020
Multi-modal Speech Transformer Decoders: When Do Multiple Modalities Improve Accuracy?
Y Guan, VA Trinh, V Voleti, J Whitehill
arXiv preprint arXiv:2409.09221, 2024
2024
Tracking Classroom Movement Patterns with Person Re-ID
X He, J Wang, VA Trinh, A McReynolds, J Whitehill
Educational Data Mining, 2024
2024
Two-Pass Endpoint Detection for Speech Recognition
A Raju, A Khare, D He, I Sklyar, L Chen, S Alptekin, VA Trinh, Z Zhang, ...
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
2023
Adaptive Endpointing with Deep Contextual Multi-Armed Bandits
A Stolcke, A Raju, C Vaz, D He, V Ravichandran, VA Trinh
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
2023
Articles 1–19