Complex spectral mapping for single- and multi-channel speech enhancement and robust ASR ZQ Wang, P Wang, DL Wang IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 1778-1787, 2020 | 199 | 2020 |
Multi-microphone complex spectral mapping for utterance-wise and continuous speech separation ZQ Wang, P Wang, DL Wang IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 2001-2014, 2021 | 78 | 2021 |
Bridging the gap between monaural speech enhancement and recognition with distortion-independent acoustic modeling P Wang, K Tan, DL Wang IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 39-48, 2019 | 63 | 2019 |
Why does self-supervised learning for speech recognition benefit speaker recognition? S Chen, Y Wu, C Wang, S Liu, Z Chen, P Wang, G Liu, J Li, J Wu, X Yu, ... arXiv preprint arXiv:2204.12765, 2022 | 37 | 2022 |
Speech separation using speaker inventory P Wang, Z Chen, X Xiao, Z Meng, T Yoshioka, T Zhou, L Lu, J Li 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019 | 35 | 2019 |
Multitask training with text data for end-to-end speech recognition P Wang, TN Sainath, RJ Weiss Proc. of Interspeech, 2566-2570, 2021 | 30 | 2021 |
Large-scale streaming end-to-end speech translation with neural transducers J Xue, P Wang, J Li, M Post, Y Gaur arXiv preprint arXiv:2204.05352, 2022 | 21 | 2022 |
A conformer based acoustic model for robust automatic speech recognition Y Yang, P Wang, DL Wang arXiv preprint arXiv:2203.00725, 2022 | 16 | 2022 |
Improving Attention-Based End-to-End ASR Systems with Sequence-Based Loss Functions J Cui, C Weng, G Wang, J Wang, P Wang, C Yu, D Su, D Yu 2018 IEEE Spoken Language Technology Workshop (SLT), 353-360, 2018 | 15 | 2018 |
LAMASSU: A Streaming Language-Agnostic Multilingual Speech Recognition and Translation Model Using Neural Transducers P Wang, E Sun, J Xue, Y Wu, L Zhou, Y Gaur, S Liu, J Li Proc. of INTERSPEECH, 57-61, 2023 | 14 | 2023 |
Enhanced Spectral Features for Distortion-Independent Acoustic Modeling P Wang, DL Wang Proc. of INTERSPEECH 2019, 476-480, 2019 | 14 | 2019 |
Utterance-Wise Recurrent Dropout and Iterative Speaker Adaptation for Robust Monaural Speech Recognition P Wang, DL Wang Acoustics, Speech and Signal Processing (ICASSP), 2018 IEEE International …, 2018 | 13 | 2018 |
Speaker separation using speaker inventories and estimated speech P Wang, Z Chen, DL Wang, J Li, Y Gong IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 537-546, 2020 | 12 | 2020 |
Continuous speech separation with recurrent selective attention network Y Zhang, Z Chen, J Wu, T Yoshioka, P Wang, Z Meng, J Li ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 10 | 2022 |
Filter-and-Convolve: A CNN Based Multichannel Complex Concatenation Acoustic Model P Wang, DL Wang Acoustics, Speech and Signal Processing (ICASSP), 2018 IEEE International …, 2018 | 10 | 2018 |
Improving Speech Recognition Error Prediction for Modern and Off-the-Shelf Speech Recognizers P Serai, P Wang, E Fosler-Lussier Acoustics, Speech and Signal Processing (ICASSP), 2019 IEEE International …, 2019 | 9 | 2019 |
Large Margin Training for Attention Based End-to-End Speech Recognition P Wang, J Cui, C Weng, D Yu Proc. of INTERSPEECH 2019, 246-250, 2019 | 8 | 2019 |
A weakly-supervised streaming multilingual speech model with truly zero-shot capability J Xue, P Wang, J Li, E Sun 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-7, 2023 | 7 | 2023 |
Token-Level Serialized Output Training for Joint Streaming ASR and ST Leveraging Textual Alignments S Papi, P Wang, J Chen, J Xue, J Li, Y Gaur arXiv preprint arXiv:2307.03354, 2023 | 6 | 2023 |
Efficient end-to-end speech recognition using performers in conformers P Wang, DL Wang arXiv preprint arXiv:2011.04196, 2020 | 6 | 2020 |