On generative spoken language modeling from raw audio K Lakhotia, E Kharitonov, WN Hsu, Y Adi, A Polyak, B Bolte, TA Nguyen, ... Transactions of the Association for Computational Linguistics 9, 1336-1354, 2021 | 281 | 2021 |
The Zero Resource Speech Benchmark 2021: Metrics and baselines for unsupervised spoken language modeling TA Nguyen, M de Seyssel, P Rozé, M Rivière, E Kharitonov, A Baevski, ... arXiv preprint arXiv:2011.11588, 2020 | 96 | 2020 |
Text-free prosody-aware generative spoken language modeling E Kharitonov, A Lee, A Polyak, Y Adi, J Copet, K Lakhotia, TA Nguyen, ... arXiv preprint arXiv:2109.03264, 2021 | 95 | 2021 |
Generative spoken dialogue language modeling TA Nguyen, E Kharitonov, J Copet, Y Adi, WN Hsu, A Elkahky, ... Transactions of the Association for Computational Linguistics 11, 250-266, 2023 | 73 | 2023 |
Textless speech emotion conversion using discrete and decomposed representations F Kreuk, A Polyak, J Copet, E Kharitonov, TA Nguyen, M Rivière, WN Hsu, ... arXiv preprint arXiv:2111.07402, 2021 | 54 | 2021 |
The zero resource speech challenge 2021: Spoken language modelling E Dunbar, M Bernard, N Hamilakis, TA Nguyen, M De Seyssel, P Rozé, ... arXiv preprint arXiv:2104.14700, 2021 | 45 | 2021 |
Textually pretrained speech language models M Hassid, T Remez, TA Nguyen, I Gat, A Conneau, F Kreuk, J Copet, ... Advances in Neural Information Processing Systems 36, 2024 | 33 | 2024 |
Are discrete units necessary for spoken language modeling? TA Nguyen, B Sagot, E Dupoux IEEE Journal of Selected Topics in Signal Processing 16 (6), 1415-1423, 2022 | 23 | 2022 |
Expresso: A benchmark and analysis of discrete expressive speech resynthesis TA Nguyen, WN Hsu, A d'Avirro, B Shi, I Gat, M Fazel-Zarani, T Remez, ... arXiv preprint arXiv:2308.05725, 2023 | 21 | 2023 |
textless-lib: A library for textless spoken language processing E Kharitonov, J Copet, K Lakhotia, TA Nguyen, P Tomasello, A Lee, ... arXiv preprint arXiv:2202.07359, 2022 | 11 | 2022 |
Are word boundaries useful for unsupervised language learning? TA Nguyen, M De Seyssel, R Algayres, P Roze, E Dunbar, E Dupoux arXiv preprint arXiv:2210.02956, 2022 | 7 | 2022 |
Augmentation invariant discrete representation for generative spoken language modeling I Gat, F Kreuk, TA Nguyen, A Lee, J Copet, G Synnaeve, E Dupoux, Y Adi arXiv preprint arXiv:2209.15483, 2022 | 7 | 2022 |
Spirit-lm: Interleaved spoken and written language model TA Nguyen, B Muller, B Yu, MR Costa-Jussa, M Elbayad, S Popuri, ... arXiv preprint arXiv:2402.05755, 2024 | 4 | 2024 |
Generative Spoken Language Model based on continuous word-sized audio tokens R Algayres, Y Adi, TA Nguyen, J Copet, G Synnaeve, B Sagot, E Dupoux arXiv preprint arXiv:2310.05224, 2023 | 3 | 2023 |
Do coarser units benefit cluster prediction-based speech pre-training? A Elkahky, WN Hsu, P Tomasello, TA Nguyen, R Algayres, Y Adi, J Copet, ... ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 3 | 2023 |
Spoken Language Modeling from Raw Audio TA Nguyen Sorbonne Université, 2024 | | 2024 |
Low-Resource Self-Supervised Learning with SSL-Enhanced TTS P Hsu, A Elkahky, WN Hsu, Y Adi, TA Nguyen, J Copet, E Dupoux, H Lee, ... arXiv preprint arXiv:2309.17020, 2023 | | 2023 |
SYSTRAN@ WAT 2019: Russian-Japanese News Commentary task J Xu, TA Nguyen, MQ Pham, JM Crego, J Senellart Proceedings of the 6th Workshop on Asian Translation, 189-194, 2019 | | 2019 |