Spatially coherent clustering using graph cuts

R Zabih, V Kolmogorov - … of the 2004 IEEE Computer Society …, 2004 - ieeexplore.ieee.org
Feature space clustering is a popular approach to image segmentation, in which a feature
vector of local properties (such as intensity, texture or motion) is computed at each pixel. The …

Duration modeling in large vocabulary speech recognition

A Anastasakos, R Schwartz… - … Conference on Acoustics …, 1995 - ieeexplore.ieee.org
This paper presents a study of different methods for phoneme duration modeling in large
vocabulary speech recognition. We investigate the employment of phoneme duration and …

A fast and reliable rate of speech detector

JP Verhasselt, JP Martens - Proceeding of Fourth International …, 1996 - ieeexplore.ieee.org
In this paper, we present a new rate-of-speech (ROS) detector that operates independently
from the recognition process. This detector is evaluated on the TIMIT corpus and positioned …

[PDF][PDF] Phonetische analyse der sprechgeschwindigkeit

HR Pfitzinger - 2001 - bas.uni-muenchen.de
A model for deriving perceptual local speech rate directly from the speech signal is
developed based on experimental analysis of the relationship between speech acoustics …

[PDF][PDF] Two approaches to speech rate estimation

HR Pfitzinger - Proc. SST, 1996 - Citeseer
This paper introduces two approaches to speech rate estimation: one is based on automatic
syllable detection and the other on automatic phone segmentation. For evaluation of both …

[PDF][PDF] Bir Türkçe fonem kümeleme sistemi tasarımı ve gerçekleştirimi

H Artuner - Yayımlanmamış Doktora Tezi, 1994 - yunus.hacettepe.edu.tr
ÖZET Günümüzde bilgisayar ve insan arasındaki etkileşim, el ve gözü birlikte kullanmayı
gerektiren, daha çok yazıya dayalı biçimde gerçekleşmektedir. Halbuki insanlar kendi …

[图书][B] Incorporating knowledge on segmental duration in HMM-based continuous speech recognition

X Wang - 1997 - fon.hum.uva.nl
Spoken language research, even though originating from the very need of everyday-life, has
traditionally been mainly theoretical and descriptive. It is modern technology and present …

[PDF][PDF] Взаимодействие сегментных и просодических факторов, влияющих на степень и локализацию предпаузального удлинения в русском языке

ТВ Качковская - … государственная библиотека. https://dlib. rsl. ru …, 2015 - disser.spbu.ru
Членение речевого потока на крупные смысловые единицы—синтагмы и фразы—
осуществляется за счет комплексного взаимодействия мелодических, темпоральных …

Pre-recognition measures of speaking rate

K Samudravijaya, SK Singh, PVS Rao - Speech Communication, 1998 - Elsevier
The accuracy of speech recognition systems is known to be affected by fast speech. If fast
speech can be detected by means of a measure of speaking rate, the acoustic as well as …

[PDF][PDF] Hierarchical duration modeling for a speech recognition system

GYC Chung - 1997 - dspace.mit.edu
Durational patterns of phonetic segments and pauses convey information about the
linguistic content of an utterance. Most speech recognition systems grossly underutilize the …