Voice activity detection system and method

Z Valsan - US Patent 8,311,813, 2012 - Google Patents
Discrimination between at least two classes of events in an input signal is carried out in the
following way. A set of frames containing an input signal is received, and at least two …

Voice activity detection

Z Valsan - US Patent 8,554,560, 2013 - Google Patents
This application is a continuation of US Pat. No. 8.311, 813, entitled VOICE ACTIVITY
DETECTION SYSTEM AND METHOD, filed May 15, 2009, which was a S371 of …

Multilingual and crosslingual acoustic modelling for automatic speech recognition

F Diehl - 2007 - dialnet.unirioja.es
This thesis studies the definition, implementation and validation of multilingual and
crosslingual acoustic models for automatic speech recognition (ASR), The acoustic model …

Multilingual text-to-phoneme mapping

SK Riis, KJ Jensen - Proc. Eurospeech 2001, 2001 - isca-archive.org
This paper introduces a novel approach for generating multilingual text-to-phoneme
mappings for use in multilingual speech recognition systems. The multilingual mappings are …

Cross-lingual audio-to-text alignment for multimedia content management

DC Lyu, RY Lyu, YC Chiang, CN Hsu - Decision support systems, 2008 - Elsevier
This paper addresses a content management problem in situations where we have a
collection of spoken documents in audio stream format in one language and a collection of …

Different approaches to build multilingual conversational systems

M Mast, T Roß, H Schulz, H Harrikari - International Conference on Text …, 2002 - Springer
The paper describes developments and results of the work being carried out during the
European research project CATCH-2004 (Converse in AThens Cologne and Helsinki) 3 …

[PDF][PDF] Flavoured acoustic model and combined spelling to sound for asymmetrical bilingual environment.

R Lejeune, J Baude, C Tchong, H Crepy… - …, 2005 - isca-archive.org
The most common target of multilingual ASR aims at covering various speakers from various
languages. The problem we address in this article is more specifically an asymmetrical …

[PDF][PDF] M ix e dL ing ua l Spo ken Wo rd Reco g nit io n by Us ing VQ Co debo ok Seque nces of Variable Length Segments

H Kojima, K Tanaka - 2003 - isca-archive.org
We are investigating unsupervised phone modeling. This paper describes a derivation
method of VQ codebook sequences of variable length segments from spoken word samples …

[引用][C] Different Approaches to Build Multilingual Conversational Systems

TR Marion Mast, H Schulz¹, H Harrikari - Text, Speech and Dialogue, 2002 - Springer,.