Speech recognition via phonetically-featured syllables

S King, J Frankel, K Livescu, E McDermott… - The Journal of the …, 2007 - pubs.aip.org

Although much is known about how speech is produced, and research into speech
production has resulted in measured articulatory data, feature systems of different kinds, and …

被引用次数：251 相关文章所有 19 个版本

Interacting with computers by voice: automatic speech recognition and synthesis

D O'shaughnessy - Proceedings of the IEEE, 2003 - ieeexplore.ieee.org

This paper examines how people communicate with computers using speech. Automatic
speech recognition (ASR) transforms speech into text, while automatic speech synthesis [or …

被引用次数：265 相关文章所有 4 个版本

Subword modeling for automatic speech recognition: Past, present, and emerging approaches

K Livescu, E Fosler-Lussier… - IEEE Signal Processing …, 2012 - ieeexplore.ieee.org

Modern automatic speech recognition systems handle large vocabularies of words, making
it infeasible to collect enough repetitions of each word to train individual word models …

被引用次数：59 相关文章所有 3 个版本

[PDF] psu.edu

[PDF][PDF] Robust speech recognition using articulatory information

K Kirchho - PhD esis, University of Bielefeld, Bielefeld, Germany, 1999 - Citeseer

Whereas most state-of-the-art speech recognition systems use spectral or cepstral
representations of the speech signal, there have also been some promising attempts at …

被引用次数：273 相关文章所有 8 个版本

[PDF] psu.edu

[PDF][PDF] Moving beyond the 'beads-on-a-string'model of speech

M Ostendorf - Proc. IEEE ASRU Workshop, 1999 - Citeseer

The notion that a word is composed of a sequence of phone segments, sometimes referred
to as 'beads on a string', has formed the basis of most speech recognition work for over 15 …

被引用次数：194 相关文章所有 4 个版本

[PDF] ttic.edu

Visual speech recognition with loosely synchronized feature streams

K Saenko, K Livescu, M Siracusa… - … on Computer Vision …, 2005 - ieeexplore.ieee.org

We present an approach to detecting and recognizing spoken isolated phrases based solely
on visual input. We adopt an architecture that first employs discriminative detection of visual …

被引用次数：119 相关文章所有 13 个版本

[PDF] psu.edu

[PDF][PDF] Fundamental technologies in modern speech recognition

T OCKPH - IEEE Signal Processing Magazine, 2012 - Citeseer

There is a vast body of literature on LVCSR research and some limitation is necessary in the
scope of this article. We will focus primarily on the techniques that have been successful in …

被引用次数：69 相关文章所有 14 个版本

[PDF] mit.edu

Articulatory features for robust visual speech recognition

K Saenko, T Darrell, JR Glass - … of the 6th international conference on …, 2004 - dl.acm.org

Visual information has been shown to improve the performance of speech recognition
systems in noisy acoustic environments. However, most audio-visual speech recognizers …

被引用次数：59 相关文章所有 12 个版本

[PDF] psu.edu

Discriminative articulatory models for spoken term detection in low-resource conversational settings

R Prabhavalkar, K Livescu… - … , Speech and Signal …, 2013 - ieeexplore.ieee.org

We study spoken term detection (STD)-the task of determining whether and where a given
word or phrase appears in a given segment of speech-using articulatory feature-based …

被引用次数：29 相关文章所有 11 个版本

[PDF] kit.edu

[PDF][PDF] Articulatory features for conversational speech recognition

F Metze - 2005 - isl.anthropomatik.kit.edu

While the overall performance of speech recognition systems continues to improve, they still
show a dramatic increase in word error rate when tested on different speaking styles, ie …

被引用次数：44 相关文章所有 5 个版本

高级搜索

QQ 群