Voice activity detection system and method

Z Valsan - US Patent 8,311,813, 2012 - Google Patents
Discrimination between at least two classes of events in an input signal is carried out in the
following way. A set of frames containing an input signal is received, and at least two …

Voice activity detection

Z Valsan - US Patent 8,554,560, 2013 - Google Patents
This application is a continuation of US Pat. No. 8,311,813, entitled VOICE ACTIVITY
DETECTION SYSTEM AND METHOD, filed May 15, 2009, which was a §371 of …

[PDF] A transformation-based learning approach to language identification for mixed-lingual text-to-speech synthesis.

JC Marcadet, V Fischer, C Waast-Richard - INTERSPEECH, 2005 - isca-archive.org
Recent progress in corpus-based concatenative text-to-speech synthesis has generated
some interest in systems that are capable of synthesizing text from more than one language …

Using artificially reverberated training data in distant-talking ASR

T Haderlein, E Nöth, W Herbordt, W Kellermann… - … Conference on Text …, 2005 - Springer
Automatic Speech Recognition (ASR) in reverberant rooms can be improved by
choosing training data from the same acoustical environment as the test data. In a real-world …

Dealing with cross-lingual aspects in spoken name recognition

F Stouten, JP Martens - 2007 IEEE Workshop on Automatic …, 2007 - ieeexplore.ieee.org
The development of an automatic speech recognizer (ASR) that can accurately recognize
spoken names belonging to a large lexicon, is still a big challenge. One of the bottlenecks is …

[PDF] Feature extraction and event detection for automatic speech recognition

F Stouten - 2008 - biblio.ugent.be
Feature extraction and event detection for Automatic Speech Recognition
Kenmerkenextractie en eventdetectie voor Automatische Sp …

[PDF] Speech Recognition APIs in the Context of Using English as a Second Language

K Czyż, M Derkacz, J Smołka, E Łukasik… - Computational …, 2019 - researchgate.net
Speech recognition systems are applied in many different solutions (e.g. web and mobile
applications for language learning or voice assistants). They are frequently used by non …

French–German bilingual acoustic modeling for embedded voice driven applications

J Ivanecký, V Fischer, S Kunzmann - International Conference on Text …, 2005 - Springer
Multilingual access to information and services is a key requirement in any pervasive or
ubiquitous computing environment. In this paper we describe our efforts towards multilingual …

[PDF] Multilingual models in the IBM bilingual text-to-speech systems.

JB Ordinas, V Fischer, C Waast-Richard - INTERSPEECH, 2005 - researchgate.net
In this paper we describe the role of multilingual models in the creation and deployment of
unit selection based bilingual speech synthesizers. We first review the definition of a …

Speech recognition techniques for languages with limited linguistic resources

M Gerber - 2011 - research-collection.ethz.ch
There are several thousand languages in the world and each language has a multitude of
dialects. State-of-the-art speech recognition techniques, which are usually based on …