Using relative duration in large vocabulary speech recognition.

R Zabih, V Kolmogorov - … of the 2004 IEEE Computer Society …, 2004 - ieeexplore.ieee.org

Feature space clustering is a popular approach to image segmentation, in which a feature
vector of local properties (such as intensity, texture or motion) is computed at each pixel. The …

Cited by 226 Related articles All 16 versions

Duration modeling in large vocabulary speech recognition

A Anastasakos, R Schwartz… - … Conference on Acoustics …, 1995 - ieeexplore.ieee.org

This paper presents a study of different methods for phoneme duration modeling in large
vocabulary speech recognition. We investigate the employment of phoneme duration and …

Cited by 139 Related articles All 4 versions

[PDF] isca-archive.org

A fast and reliable rate of speech detector

JP Verhasselt, JP Martens - Proceeding of Fourth International …, 1996 - ieeexplore.ieee.org

In this paper, we present a new rate-of-speech (ROS) detector that operates independently
from the recognition process. This detector is evaluated on the TIMIT corpus and positioned …

Cited by 72 Related articles All 11 versions

[PDF] uni-muenchen.de

[PDF] Phonetische analyse der sprechgeschwindigkeit

HR Pfitzinger - 2001 - bas.uni-muenchen.de

A model for deriving perceptual local speech rate directly from the speech signal is
developed based on experimental analysis of the relationship between speech acoustics …

Cited by 56 Related articles All 4 versions

[PDF] psu.edu

[PDF] Two approaches to speech rate estimation

HR Pfitzinger - Proc. SST, 1996 - Citeseer

This paper introduces two approaches to speech rate estimation: one is based on automatic
syllable detection and the other on automatic phone segmentation. For evaluation of both …

Cited by 40 Related articles All 3 versions

[PDF] hacettepe.edu.tr

[PDF] Bir Türkçe fonem kümeleme sistemi tasarımı ve gerçekleştirimi

H Artuner - Unpublished doctoral thesis, 1994 - yunus.hacettepe.edu.tr

ABSTRACT Today, interaction between computers and humans takes place mostly in a
text-based form that requires using the hands and eyes together. Yet people, in their own …

Cited by 20 Related articles All 2 versions

[PDF] uva.nl

[BOOK] Incorporating knowledge on segmental duration in HMM-based continuous speech recognition

X Wang - 1997 - fon.hum.uva.nl

Spoken language research, even though originating from the very need of everyday-life, has
traditionally been mainly theoretical and descriptive. It is modern technology and present …

Cited by 37 Related articles All 5 versions

[PDF] spbu.ru

[PDF] Взаимодействие сегментных и просодических факторов, влияющих на степень и локализацию предпаузального удлинения в русском языке

ТВ Качковская - … государственная библиотека. https://dlib. rsl. ru …, 2015 - disser.spbu.ru

The segmentation of the speech stream into large meaningful units (syntagms and phrases)
is achieved through a complex interaction of melodic, temporal …

Cited by 10 Related articles All 5 versions

[PDF] academia.edu

Pre-recognition measures of speaking rate

K Samudravijaya, SK Singh, PVS Rao - Speech Communication, 1998 - Elsevier

The accuracy of speech recognition systems is known to be affected by fast speech. If fast
speech can be detected by means of a measure of speaking rate, the acoustic as well as …

Cited by 24 Related articles All 9 versions

[PDF] mit.edu

[PDF] Hierarchical duration modeling for a speech recognition system

GYC Chung - 1997 - dspace.mit.edu

Durational patterns of phonetic segments and pauses convey information about the
linguistic content of an utterance. Most speech recognition systems grossly underutilize the …

Cited by 15 Related articles All 7 versions
