L2-ARCTIC: A Non-Native English Speech Corpus G Zhao, S Sonsaat, A Silpachai, I Lucic, E Chukharev-Hudilainen, J Levis, ... Interspeech, 2018 | 182 | 2018 |
Accent conversion using phonetic posteriorgrams G Zhao, S Sonsaat, J Levis, E Chukharev-Hudilainen, R Gutierrez-Osuna Acoustics, Speech, and Signal Processing (ICASSP), 2018 IEEE International …, 2018 | 64 | 2018 |
Golden speaker builder – An interactive tool for pronunciation training S Ding, C Liberatore, S Sonsaat, I Lučić, A Silpachai, G Zhao, ... Speech Communication 115, 51-66, 2019 | 57 | 2019 |
Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams G Zhao, S Ding, R Gutierrez-Osuna Interspeech, 2019 | 56 | 2019 |
Improved techniques for learning to dehaze and beyond: A collective study Y Liu, G Zhao, B Gong, Y Li, R Raj, N Goel, S Kesav, S Gottimukkala, ... arXiv preprint arXiv:1807.00202, 2018 | 45 | 2018 |
Using phonetic posteriorgram based frame pairing for segmental accent conversion G Zhao, R Gutierrez-Osuna IEEE/ACM Transactions on Audio, Speech, and Language Processing 27 (10 …, 2019 | 27 | 2019 |
Accentron: Foreign accent conversion to arbitrary non-native speakers using zero-shot learning S Ding, G Zhao, R Gutierrez-Osuna Computer Speech & Language 72, 101302, 2022 | 24 | 2022 |
Converting foreign accent speech without a reference G Zhao, S Ding, R Gutierrez-Osuna IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 2367-2381, 2021 | 22 | 2021 |
The English pronunciation of Arabic speakers: A data-driven approach to segmental error identification I Rehman, A Silpachai, J Levis, G Zhao, R Gutierrez-Osuna Language Teaching Research 26 (6), 1055-1081, 2022 | 18 | 2022 |
Improving the Speaker Identity of Non-Parallel Many-to-Many Voice Conversion with Adversarial Speaker Recognition S Ding, G Zhao, R Gutierrez-Osuna Interspeech, 2020 | 18 | 2020 |
Exemplar selection methods in voice conversion G Zhao, R Gutierrez-Osuna Acoustics, Speech, and Signal Processing (ICASSP), 2017 IEEE International …, 2017 | 13 | 2017 |
Understanding the Effect of Voice Quality and Accent on Talker Similarity A Das, G Zhao, J Levis, E Chukharev-Hudilainen, R Gutierrez-Osuna Interspeech, 2020 | 12 | 2020 |
Learning structured sparse representations for voice conversion S Ding, G Zhao, C Liberatore, R Gutierrez-Osuna IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 343-354, 2019 | 11 | 2019 |
Augmenting transformer-transducer based speaker change detection with token-level training loss G Zhao, Q Wang, H Lu, Y Huang, IL Moreno ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 10 | 2023 |
Highly efficient real-time streaming and fully on-device speaker diarization with multi-stage clustering Q Wang, Y Huang, H Lu, G Zhao, IL Moreno arXiv preprint arXiv:2210.13690, 2022 | 9 | 2022 |
Voice conversion through residual warping in a Sparse, Anchor-Based Representation of speech C Liberatore, G Zhao, R Gutierrez-Osuna Acoustics, Speech, and Signal Processing (ICASSP), 2018 IEEE International …, 2018 | 9 | 2018 |
DiarizationLM: Speaker diarization post-processing with large language models Q Wang, Y Huang, G Zhao, E Clark, W Xia, H Liao arXiv preprint arXiv:2401.03506, 2024 | 6 | 2024 |
PAD-Net: A perception-aided single image dehazing network Y Liu, G Zhao arXiv preprint arXiv:1805.03146, 2018 | 6 | 2018 |
Golden Speaker Builder: an interactive online tool for L2 learners to build pronunciation models S Ding, C Liberatore, G Zhao, S Sonsaat, E Chukharev-Hudilainen, ... Pronunciation in Second Language Learning & Teaching (PSLLT) 9th Annual …, 2017 | 6 | 2017 |
Towards word-level end-to-end neural speaker diarization with auxiliary network Y Huang, W Wang, G Zhao, H Liao, W Xia, Q Wang arXiv preprint arXiv:2309.08489, 2023 | 3 | 2023 |