Improving Streaming End-to-End ASR on Transformer-Based Causal Models With Encoder States Revision Strategies

Z Li, H Miao, K Deng, G Cheng, S Tian, T Li… - arXiv preprint arXiv …, 2022 - arxiv.org
There is often a trade-off between performance and latency in streaming automatic speech
recognition (ASR). Traditional methods such as look-ahead and chunk-based methods …

Flexi-Transducer: Optimizing Latency, Accuracy and Compute for Multi-Domain On-Device Scenarios

J Mahadeokar, Y Shi, Y Shangguan, C Wu, A Xiao… - Interspeech, 2021 - isca-archive.org
Often, the storage and computational constraints of embedded devices demand that a single
on-device ASR model serve multiple use-cases/domains. In this paper, we propose a …

Flexi-Transducer: Optimizing Latency, Accuracy and Compute for Multi-Domain On-Device Scenarios

J Mahadeokar, Y Shi, Y Shangguan, C Wu… - arXiv preprint arXiv …, 2021 - arxiv.org
Often, the storage and computational constraints of embedded devices demand that a single
on-device ASR model serve multiple use-cases/domains. In this paper, we propose …