A Radford, JW Kim, T Xu, G Brockman… - Proceedings of the 40th …, 2023 - dl.acm.org
We study the capabilities of speech processing systems trained simply to predict large
amounts of transcripts of audio on the internet. When scaled to 680,000 hours of multilingual …