Towards a common phone alphabet for multilingual speech recognition

Z Valsan - US Patent 8,311,813, 2012 - Google Patents

Discrimination between at least two classes of events in an input signal is carried out in the
following way. A set of frames containing an input signal is received, and at least two …

被引用次数：47 相关文章所有 4 个版本

[PDF] googleapis.com

Voice activity detection

Z Valsan - US Patent 8,554,560, 2013 - Google Patents

This application is a continuation of US Pat. No. 8.311, 813, entitled VOICE ACTIVITY
DETECTION SYSTEM AND METHOD, filed May 15, 2009, which was a S371 of …

被引用次数：18 相关文章所有 4 个版本

Multilingual and crosslingual acoustic modelling for automatic speech recognition

F Diehl - 2007 - dialnet.unirioja.es

This thesis studies the definition, implementation and validation of multilingual and
crosslingual acoustic models for automatic speech recognition (ASR), The acoustic model …

被引用次数：8 相关文章所有 2 个版本

Multilingual text-to-phoneme mapping

SK Riis, KJ Jensen - Proc. Eurospeech 2001, 2001 - isca-archive.org

This paper introduces a novel approach for generating multilingual text-to-phoneme
mappings for use in multilingual speech recognition systems. The multilingual mappings are …

被引用次数：8 相关文章

[PDF] academia.edu

Cross-lingual audio-to-text alignment for multimedia content management

DC Lyu, RY Lyu, YC Chiang, CN Hsu - Decision support systems, 2008 - Elsevier

This paper addresses a content management problem in situations where we have a
collection of spoken documents in audio stream format in one language and a collection of …

被引用次数：4 相关文章所有 12 个版本

[PDF] researchgate.net

Different approaches to build multilingual conversational systems

M Mast, T Roß, H Schulz, H Harrikari - International Conference on Text …, 2002 - Springer

The paper describes developments and results of the work being carried out during the
European research project CATCH-2004 (Converse in AThens Cologne and Helsinki) 3 …

被引用次数：2 相关文章所有 9 个版本

[PDF] isca-archive.org

[PDF][PDF] Flavoured acoustic model and combined spelling to sound for asymmetrical bilingual environment.

R Lejeune, J Baude, C Tchong, H Crepy… - …, 2005 - isca-archive.org

The most common target of multilingual ASR aims at covering various speakers from various
languages. The problem we address in this article is more specifically an asymmetrical …

被引用次数：1 相关文章所有 3 个版本

[PDF] isca-archive.org

[PDF][PDF] M ix e dL ing ua l Spo ken Wo rd Reco g nit io n by Us ing VQ Co debo ok Seque nces of Variable Length Segments

H Kojima, K Tanaka - 2003 - isca-archive.org

We are investigating unsupervised phone modeling. This paper describes a derivation
method of VQ codebook sequences of variable length segments from spoken word samples …

[引用][C] Different Approaches to Build Multilingual Conversational Systems

TR Marion Mast, H Schulz¹, H Harrikari - Text, Speech and Dialogue, 2002 - Springer,.

高级搜索

QQ 群