LSTM for punctuation restoration in speech transcripts.- 学术资源搜索

文章

学术资源搜索

[PDF][PDF] LSTM for punctuation restoration in speech transcripts.

O Tilk, T Alumäe - Interspeech, 2015 - isca-archive.org

Interspeech, 2015•isca-archive.org

The output of automatic speech recognition systems is generally an unpunctuated stream of
words which is hard to process for both humans and machines. We present a two-stage
recurrent neural network based model using long short-term memory units to restore
punctuation in speech transcripts. In the first stage, textual features are learned on a large
text corpus. The second stage combines textual features with pause durations and adapts
the model to speech domain. Our approach reduces the number of punctuation errors by up …

Abstract

The output of automatic speech recognition systems is generally an unpunctuated stream of words which is hard to process for both humans and machines. We present a two-stage recurrent neural network based model using long short-term memory units to restore punctuation in speech transcripts. In the first stage, textual features are learned on a large text corpus. The second stage combines textual features with pause durations and adapts the model to speech domain. Our approach reduces the number of punctuation errors by up to 16.9% when compared to a decision tree that combines hidden-event language model posteriors with inter-word pause information, having largest improvements in period restoration.

isca-archive.org

展开收起

被引用次数：150 相关文章所有 6 个版本

以上显示的是最相近的搜索结果。查看全部搜索结果

高级搜索

QQ 群

[PDF][PDF] LSTM for punctuation restoration in speech transcripts.

引用