Unsupervised Speech Enhancement with speech recognition embedding and disentanglement losses VA Trinh, S Braun ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 23 | 2022 |
New Dataset and Strong Baselines for the Grammatical Error Correction of Russian VA Trinh, A Rozovskaya Findings of ACL, 2021 | 13 | 2021 |
Importantaug: a data augmentation agent for speech VA Trinh, HS Kavaki, MI Mandel ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 12 | 2022 |
Directly Comparing the Listening Strategies of Humans and Machines VA Trinh, M Mandel IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 312 - 323, 2020 | 11 | 2020 |
Bubble Cooperative Networks for identifying important speech cues VA Trinh, B McFee, MI Mandel Proceeding of the Interspeech 2018, 2018 | 8 | 2018 |
Concatenative Resynthesis with Improved Training Signals for Speech Enhancement. AR Syed, VA Trinh, MI Mandel Proceeding of the Interspeech 2018, 0 | 6* | |
Large scale evaluation of importance maps in automatic speech recognition VA Trinh, M Mandel Proceeding of the Interspeech 2020, 2020 | 5 | 2020 |
Reducing Geographic Disparities in Automatic Speech Recognition via Elastic Weight Consolidation VA Trinh, P Ghahremani, B King, J Droppo, A Stolcke, R Maas Proceeding of the Interspeech 2022, 2022 | 4 | 2022 |
Combining spatial clustering with LSTM speech models for multichannel speech enhancement F Grezes, Z Ni, VA Trinh, M Mandel arXiv preprint arXiv:2012.03388, 2020 | 4 | 2020 |
AUTOMATIC SPEECH RECOGNITION TUNED FOR CHILD SPEECH IN THE CLASSROOM R Southwell, W Ward, VA Trinh, C Clevenger, C Clevenger, E Watts, ... ICASSP, 2024 | 2 | 2024 |
Enhancement of Spatial Clustering-Based Time-Frequency Masks using LSTM Neural Networks F Grezes, Z Ni, VA Trinh, M Mandel arXiv preprint arXiv:2012.01576, 2020 | 2 | 2020 |
Discrete Multimodal Transformers with a Pretrained Large Language Model for Mixed-Supervision Speech Processing VA Trinh, R Southwell, Y Guan, X He, Z Wang, J Whitehill arXiv preprint arXiv:2406.06582, 2024 | 1 | 2024 |
Towards Accurate and Real-Time End-of-Speech Estimation Y Fan, C Vaz, D He, J Heymann, VA Trinh, Z Zhang, V Ravichandran ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 1 | 2023 |
Identifying, Evaluating and Applying Importance Maps for Speech VA Trinh City University of New York, 2022 | 1 | 2022 |
Improved MVDR Beamforming Using LSTM Speech Models to Clean Spatial Clustering Masks Z Ni, F Grezes, VA Trinh, MI Mandel arXiv preprint arXiv:2012.02191, 2020 | 1 | 2020 |
Multi-modal Speech Transformer Decoders: When Do Multiple Modalities Improve Accuracy? Y Guan, VA Trinh, V Voleti, J Whitehill arXiv preprint arXiv:2409.09221, 2024 | | 2024 |
Tracking Classroom Movement Patterns with Person Re-ID X He, J Wang, VA Trinh, A McReynolds, J Whitehill Educational Data Mining, 2024 | | 2024 |
Two-Pass Endpoint Detection for Speech Recognition A Raju, A Khare, D He, I Sklyar, L Chen, S Alptekin, VA Trinh, Z Zhang, ... 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023 | | 2023 |
Adaptive Endpointing with Deep Contextual Multi-Armed Bandits A Stolcke, A Raju, C Vaz, D He, V Ravichandran, VA Trinh ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | | 2023 |