CTC-based Non-autoregressive Textless Speech-to-Speech Translation

Q Fang, Z Ma, Y Zhou, M Zhang, Y Feng - arXiv preprint arXiv:2406.07330, 2024 - arxiv.org
Direct speech-to-speech translation (S2ST) has achieved impressive translation quality, but
it often faces the challenge of slow decoding due to the considerable length of speech …