相关文章- 学术资源搜索

State-of-the-art speech recognition with sequence-to-sequence models

CC Chiu, TN Sainath, Y Wu… - … on acoustics, speech …, 2018 - ieeexplore.ieee.org

Attention-based encoder-decoder architectures such as Listen, Attend, and Spell (LAS),
subsume the acoustic, pronunciation and language model components of a traditional …

被引用次数：1424 相关文章所有 10 个版本

[PDF] arxiv.org

Streaming automatic speech recognition with the transformer model

N Moritz, T Hori, J Le - ICASSP 2020-2020 IEEE International …, 2020 - ieeexplore.ieee.org

Encoder-decoder based sequence-to-sequence models have demonstrated state-of-the-art
results in end-to-end automatic speech recognition (ASR). Recently, the transformer …

被引用次数：204 相关文章所有 11 个版本

[PDF] arxiv.org

An online attention-based model for speech recognition

R Fan, P Zhou, W Chen, J Jia, G Liu - arXiv preprint arXiv:1811.05247, 2018 - arxiv.org

Attention-based end-to-end models such as Listen, Attend and Spell (LAS), simplify the
whole pipeline of traditional automatic speech recognition (ASR) systems and become …

被引用次数：58 相关文章所有 8 个版本

[PDF] arxiv.org

Multi-dialect speech recognition with a single sequence-to-sequence model

B Li, TN Sainath, KC Sim, M Bacchiani… - … on acoustics, speech …, 2018 - ieeexplore.ieee.org

Sequence-to-sequence models provide a simple and elegant solution for building speech
recognition systems by folding separate components of a typical system, namely acoustic …

被引用次数：136 相关文章所有 4 个版本

[PDF] arxiv.org

A comparison of techniques for language model integration in encoder-decoder speech recognition

S Toshniwal, A Kannan, CC Chiu, Y Wu… - 2018 IEEE spoken …, 2018 - ieeexplore.ieee.org

Attention-based recurrent neural encoder-decoder models present an elegant solution to the
automatic speech recognition problem. This approach folds the acoustic model …

被引用次数：182 相关文章所有 5 个版本

[PDF] arxiv.org

Attention-based end-to-end speech recognition on voice search

C Shan, J Zhang, Y Wang, L Xie - 2018 IEEE International …, 2018 - ieeexplore.ieee.org

Recently, there has been a growing interest in end-to-end speech recognition that directly
transcribes speech to text without any predefined alignments. In this paper, we explore the …

被引用次数：72 相关文章所有 6 个版本

[PDF] arxiv.org

Building competitive direct acoustics-to-word models for english conversational speech recognition

K Audhkhasi, B Kingsbury… - … on acoustics, speech …, 2018 - ieeexplore.ieee.org

Direct acoustics-to-word (A2W) models in the end-to-end paradigm have received
increasing attention compared to conventional subword based automatic speech …

被引用次数：140 相关文章所有 6 个版本

[PDF] arxiv.org

Transformer transducer: A streamable speech recognition model with transformer encoders and rnn-t loss

Q Zhang, H Lu, H Sak, A Tripathi… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org

In this paper we present an end-to-end speech recognition model with Transformer
encoders that can be used in a streaming speech recognition system. Transformer …

被引用次数：486 相关文章所有 6 个版本

[PDF] arxiv.org

Minimum latency training strategies for streaming sequence-to-sequence ASR

H Inaguma, Y Gaur, L Lu, J Li… - ICASSP 2020-2020 IEEE …, 2020 - ieeexplore.ieee.org

Recently, a few novel streaming attention-based sequence-to-sequence (S2S) models have
been proposed to perform online speech recognition with linear-time decoding complexity …

被引用次数：55 相关文章所有 7 个版本

[PDF] arxiv.org

A spelling correction model for end-to-end speech recognition

J Guo, TN Sainath, RJ Weiss - ICASSP 2019-2019 IEEE …, 2019 - ieeexplore.ieee.org

Attention-based sequence-to-sequence models for speech recognition jointly train an
acoustic model, language model (LM), and alignment mechanism using a single neural …

被引用次数：151 相关文章所有 8 个版本

高级搜索

QQ 群

State-of-the-art speech recognition with sequence-to-sequence models

Streaming automatic speech recognition with the transformer model

An online attention-based model for speech recognition

Multi-dialect speech recognition with a single sequence-to-sequence model

A comparison of techniques for language model integration in encoder-decoder speech recognition

Attention-based end-to-end speech recognition on voice search

Building competitive direct acoustics-to-word models for english conversational speech recognition

Transformer transducer: A streamable speech recognition model with transformer encoders and rnn-t loss

Minimum latency training strategies for streaming sequence-to-sequence ASR

A spelling correction model for end-to-end speech recognition

相关搜索

引用