Order-free RNN with visual attention for multi-label classification SF Chen, YC Chen, CK Yeh, YCF Wang Thirty-Second AAAI Conference on Artificial Intelligence, 2018 | 170 | 2018 |
Meta-TTS: Meta-learning for few-shot speaker adaptive text-to-speech SF Huang, CJ Lin, DR Liu, YC Chen, H Lee IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 1558-1571, 2022 | 51 | 2022 |
Audio Word2vec: Sequence-to-Sequence Autoencoding for Unsupervised Learning of Audio Segmentation and Representation YC Chen, SF Huang, H Lee, YH Wang, CH Shen IEEE/ACM Transactions on Audio, Speech, and Language Processing 27 (9), 1481 …, 2019 | 41 | 2019 |
Phonetic-and-semantic embedding of spoken words with applications in spoken content retrieval YC Chen, SF Huang, CH Shen, H Lee, L Lee 2018 IEEE Spoken Language Technology Workshop (SLT), 941-948, 2018 | 39 | 2018 |
DARTS-ASR: Differentiable architecture search for multilingual speech recognition and adaptation YC Chen, JY Hsu, CK Lee, H Lee Proc. Interspeech 2020, 1803-1807, 2020 | 35 | 2020 |
AIPNet: Generative Adversarial Pre-Training of Accent-Invariant Networks for End-To-End Speech Recognition YC Chen, Z Yang, CF Yeh, M Jain, ML Seltzer ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 34 | 2020 |
Stabilizing Label Assignment for Speech Separation by Self-supervised Pre-training SF Huang, SP Chuang, DR Liu, YC Chen, GP Yang, H Lee Proc. Interspeech 2021, 3056-3060, 2020 | 23* | 2020 |
Towards Unsupervised Automatic Speech Recognition Trained by Unaligned Speech and Text only YC Chen, CH Shen, SF Huang, H Lee arXiv preprint arXiv:1803.10952, 2018 | 18 | 2018 |
Almost-unsupervised Speech Recognition with Close-to-zero Resource Based on Phonetic Structures Learned from Very Small Unpaired Speech and Text Data YC Chen, CH Shen, SF Huang, H Lee, L Lee arXiv preprint arXiv:1810.12566, 2018 | 13 | 2018 |
Speech Representation Learning Through Self-supervised Pretraining And Multi-task Finetuning YC Chen, S Yang, CK Lee, S See, H Lee AAAI 2022 Workshop on Self-supervised Learning for Audio and Speech Processing, 2021 | 11 | 2021 |
SpeechNet: A Universal Modularized Model for Speech Processing Tasks YC Chen, PH Chi, S Yang, KW Chang, J Lin, SF Huang, DR Liu, CL Liu, ... arXiv preprint arXiv:2105.03070, 2021 | 11 | 2021 |
Learning Phone Recognition From Unpaired Audio and Phone Sequences Based on Generative Adversarial Network D Liu, P Hsu, Y Chen, S Huang, S Chuang, D Wu, H Lee IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 230-243, 2021 | 7 | 2021 |
Improved Audio Embeddings by Adjacency-Based Clustering with Applications in Spoken Term Detection SF Huang, YC Chen, H Lee, L Lee arXiv preprint arXiv:1811.02775, 2018 | 7 | 2018 |
From Semi-supervised to Almost-unsupervised Speech Recognition with Very-low Resource by Jointly Learning Phonetic Structures from Audio and Text Embeddings YC Chen, SF Huang, H Lee, L Lee arXiv preprint arXiv:1904.05078, 2019 | 2 | 2019 |