Live streaming speech recognition using deep bidirectional LSTM acoustic models and interpolated language models

J Jorge, A Giménez, JA Silvestre-Cerdà… - … on Audio, Speech …, 2021 - ieeexplore.ieee.org
Although Long-Short Term Memory (LSTM) networks and deep Transformers are now
extensively used in offline ASR, it is unclear how best offline systems can be adapted to …

Spoken term detection ALBAYZIN 2014 evaluation: overview, systems, results, and discussion

J Tejedor, DT Toledano, P Lopez-Otero… - EURASIP Journal on …, 2015 - Springer
Spoken term detection (STD) aims at retrieving data from a speech repository given a textual
representation of the search term. Nowadays, it is receiving much interest due to the large …

Annotation of heterogeneous multimedia content using automatic speech recognition

M Huijbregts, R Ordelman, F de Jong - International Conference on …, 2007 - Springer
This paper reports on the setup and evaluation of robust speech recognition system parts,
geared towards transcript generation for heterogeneous, real-life media collections. The …

Recursive n-gram hashing is pairwise independent, at best

D Lemire, O Kaser - Computer Speech & Language, 2010 - Elsevier
Many applications use sequences of n consecutive symbols (n-grams). Hashing these n-
grams can be a performance bottleneck. For more speed, recursive hash families compute …

[PDF][PDF] Transcrigal: A Bilingual System for Automatic Indexing of Broadcast News.

C Garcia-Mateo, J Dieguez-Tirado, LD Fernández… - LREC, 2004 - lrec-conf.org
This paper describes a Broadcast News (BN) database called Transcrigal-DB. The news
shows are mainly in Galician language, although around 11% of data is in Spanish. This …

[PDF][PDF] 基于扩展N 元文法模型的快速语言模型预测算法

单煜翔, 陈谐, 史永哲, 刘加 - 自动化学报, 2012 - Citeseer
摘要针对基于动态解码网络的大词汇量连续语音识别器, 本文提出了一种采用扩展N
元文法模型进行快速语言模型(Language model, LM) 预测的方法. 扩展N …

A fast and memory-efficient N-gram language model lookup method for large vocabulary continuous speech recognition

X Li, Y Zhao - Computer Speech & Language, 2007 - Elsevier
Recently, minimum perfect hashing (MPH)-based language model (LM) lookup methods
have been proposed for fast access of N-gram LM scores in lexical-tree based LVCSR …

Soft decoding strategies for distributed speech recognition over IP networks

A Cardenal-Lopez, L Docio-Fernandez… - … , Speech, and Signal …, 2004 - ieeexplore.ieee.org
In distributed speech recognition, speech feature vectors are obtained at the client side, and
transmitted to the remote server for recognition. In this paper, we investigate the robustness …

Adaptation strategies for the acoustic and language models in bilingual speech transcription

J Dieguez-Tirado, C Garcia-Mateo… - … .(ICASSP'05). IEEE …, 2005 - ieeexplore.ieee.org
The paper describes our current work on speech-to-text transcription for recordings in two
languages. The experimental framework consists of television news shows in Galician and …

Efficient language model look-ahead probabilities generation using lower order LM look-ahead information

L Chen, KK Chin - … on Acoustics, Speech and Signal Processing, 2008 - ieeexplore.ieee.org
In this paper, an efficient method for language model look-ahead probability generation is
presented. Traditional methods generate language model look-ahead (LMLA) probabilities …