… Recently, there has been a strong push to transition from hybrid models to end-to-end (E2E) models for automatic speechrecognition. Currently, there are three promising E2E methods: …
V Pratap, Q Xu, J Kahn, G Avidov… - arXiv preprint arXiv …, 2020 - arxiv.org
… We design an online end-to-endspeechrecognition system based on Time-Depth Separable (TDS) convolutions and Connectionist Temporal Classification (CTC). We improve the …
… To the best of our knowledge, all end-to-end neural speechrecognition systems employ recurrent neural networks in at least some part of the processing pipeline. The most successful …
… In this work, we present a novel, all-neural, end-to-end (E2E) ASR system that utilizes such context. Our approach, which we refer to as Contextual Listen, Attend and Spell (CLAS) jointly…
… Deepspeech 2: End-to-endspeechrecognition in english and mandarin. In Proceedings of the International Conference on Machine Learning, New York, NY, USA, 19–24 June 2016; …
… , and we show that a competitive end-to-end ASR model can be achieved solely using … end-to-end ASR models with the Transformer. Second, in order to facilitate training of very deep …
… Although their implementation was followed as closely as possible, training end-toend quickly exceeded the memory limitations of modern GPUs. To work around these problems, the …
… on challenging tasks such as voice search. In … end-to-end (E2E) models [10, 11, 12, 13, 14]. Such models replace the traditional components of an ASR system with a single, end-to-end …
J Li - APSIPA Transactions on Signal and Information …, 2022 - nowpublishers.com
… Recently, the speech community is seeing a significant trend of moving from deep neural network based hybrid modeling to end-to-end (E2E) modeling for automatic speechrecognition …