Improving Multi-Scale Aggregation Using Feature Pyramid Module for Robust Speaker Verification of Variable-Duration Utterances | Y Jung, SM Kye, Y Choi, M Jung, H Kim | | 45* | |
Joint Learning Using Denoising Variational Autoencoders for Voice Activity Detection | Y Jung, Y Kim, Y Choi, H Kim | INTERSPEECH, 1210-1214, 2018 | 36 | 2018 |
Spatial pyramid encoding with convex length normalization for text-independent speaker verification | Y Jung, Y Kim, H Lim, Y Choi, H Kim | arXiv preprint arXiv:1906.08333, 2019 | 32 | 2019 |
Neural MOS Prediction for Synthesized Speech Using Multi-Task Learning With Spoofing Detection and Spoofing Type Classification | Y Choi, Y Jung, H Kim | arXiv preprint arXiv:2007.08267, 2020 | 29 | 2020 |
Deep MOS Predictor for Synthetic Speech Using Cluster-Based Modeling | Y Choi, Y Jung, H Kim | arXiv preprint arXiv:2008.03710, 2020 | 26 | 2020 |
A Unified Deep Learning Framework for Short-Duration Speaker Verification in Adverse Environments | Y Jung, Y Choi, H Lim, H Kim | IEEE Access 8, 175448-175466, 2020 | 23 | 2020 |
Self-adaptive soft voice activity detection using deep neural networks for robust speaker verification | Y Jung, Y Choi, H Kim | 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019 | 20 | 2019 |
An end-to-end synthesis method for Korean text-to-speech systems | Y Choi, Y Jung, Y Kim, Y Suh, H Kim | Phonetics and Speech Sciences 10 (1), 39-48, 2018 | 8 | 2018 |
Multi-Scale Aggregation Using Feature Pyramid Module for Text-Independent Speaker Verification | Y Jung, S Kye, Y Choi, M Jung, H Kim | arXiv preprint arXiv:2004.03194, 2020 | 2 | 2020 |
Perceptually Guided End-to-End Text-to-Speech | Y Choi, Y Jung, Y Suh, H Kim | arXiv preprint arXiv:2011.01174, 2020 | | 2020 |
Deep Least Squares Regression-Based Speaker-Dependent Layer Initialization for DNN Acoustic Model Adaptation | Y Kim, Y Jung, Y Choi, HR Kim | ICCE-Asia, 361-364, 2018 | | 2018 |