Mean and variance adaptation within the MLLR framework

T Hueber, L Girin, X Alameda-Pineda… - … /ACM Transactions on …, 2015 - ieeexplore.ieee.org

This paper addresses the adaptation of an acoustic-articulatory model of a reference
speaker to the voice of another speaker, using a limited amount of audio-only data. In the …

被引用次数：32 相关文章所有 8 个版本

[PDF] researchgate.net

Semi-automatic video annotation based on active learning with multiple complementary predictors

Y Song, XS Hua, LR Dai, M Wang - Proceedings of the 7th ACM SIGMM …, 2005 - dl.acm.org

In this paper, we will propose a novel semi-automatic annotation scheme for video semantic
classification. It is well known that the large gap between high-level semantics and low-level …

被引用次数：70 相关文章所有 7 个版本

[PDF] nist.gov

[PDF][PDF] Spoken Document Retrieval for TREC-9 at Cambridge University.

SE Johnson, P Jourlin, KS Jones, PC Woodland - TREC, 2000 - trec.nist.gov

This paper presents work done at Cambridge University for the TREC-9 Spoken Document
Retrieval (SDR) track. The CUHTK transcriptions from TREC-8 with Word Error Rate (WER) …

被引用次数：83 相关文章所有 13 个版本

[PDF] navoiy-uni.uz

22 Question Answering

B Webber, N Webb - The handbook of computational linguistics …, 2010 - Wiley Online Library

Questions are asked and answered every day. Question answering (QA) technology aims to
deliver the same facility online. It goes further than the more familiar search based on …

被引用次数：63 相关文章所有 13 个版本

[PDF] googleapis.com

Method and system for cross-lingual voice conversion

I Agiomyrgiannakis - US Patent 9,177,549, 2015 - Google Patents

US 2015/O127349 A1 May 7, 2015 (57) ABSTRACT (51) Int. Cl. A method and system for is
disclosed for cross-lingual Voice GOL 5/00(2013.01) conversion. A speech-to-speech …

被引用次数：28 相关文章所有 4 个版本

[PDF] hal.science

Comparison of speaker adaptation methods as feature extraction for SVM-based speaker recognition

M Ferras, CC Leung, C Barras… - IEEE Transactions on …, 2009 - ieeexplore.ieee.org

In the last years the speaker recognition field has made extensive use of speaker adaptation
techniques. Adaptation allows speaker model parameters to be estimated using less speech …

被引用次数：53 相关文章所有 10 个版本

[PDF] danielpovey.com

A basis representation of constrained MLLR transforms for robust adaptation

D Povey, K Yao - Computer Speech & Language, 2012 - Elsevier

Abstract Constrained Maximum Likelihood Linear Regression (CMLLR) is a speaker
adaptation method for speech recognition that can be realized as a feature-space …

被引用次数：48 相关文章所有 11 个版本

[PDF] aclanthology.org

Lightly supervised and data-driven approaches to mandarin broadcast news transcription

B Chen, JW Kuo, WH Tsai - 2004 IEEE International …, 2004 - ieeexplore.ieee.org

This paper investigates the use of several lightly supervised and data-driven approaches to
Mandarin broadcast news transcription. First, with a consideration of the special structural …

被引用次数：72 相关文章所有 16 个版本

[PDF] ugent.be

A novel channel estimate for noise robust speech recognition

G Vanderreydt, K Demuynck - Computer Speech & Language, 2024 - Elsevier

We propose a novel technique to estimate the channel characteristics for robust speech
recognition. The method focuses on reliable time–frequency speech patches which are …

被引用次数：2 相关文章所有 4 个版本

[PDF] academia.edu

Frequency warping for VTLN and speaker adaptation by linear transformation of standard MFCC

S Panchapagesan, A Alwan - Computer speech & language, 2009 - Elsevier

Vocal tract length normalization (VTLN) for standard filterbank-based Mel frequency cepstral
coefficient (MFCC) features is usually implemented by warping the center frequencies of the …

被引用次数：55 相关文章所有 11 个版本

高级搜索

QQ 群