查看文章

hal.science 中的 [PDF]

An analysis of environment, microphone and data simulation mismatches in robust speech recognition

作者

Emmanuel Vincent, Shinji Watanabe, Aditya Arie Nugraha, Jon Barker, Ricard Marxer

发表日期

2017/11/1

期刊

Computer Speech & Language

卷号

页码范围

535-557

出版商

Academic Press

简介

Speech enhancement and automatic speech recognition (ASR) are most often evaluated in matched (or multi-condition) settings where the acoustic conditions of the training data match (or cover) those of the test data. Few studies have systematically assessed the impact of acoustic mismatches between training and test data, especially concerning recent speech enhancement and state-of-the-art ASR techniques. In this article, we study this issue in the context of the CHiME-3 dataset, which consists of sentences spoken by talkers situated in challenging noisy environments recorded using a 6-channel tablet based microphone array. We provide a critical analysis of the results published on this dataset for various signal enhancement, feature extraction, and ASR backend techniques and perform a number of new experiments in order to separately assess the impact of different noise environments, different numbers …

引用总数

被引用次数：409

20162017201820192020202120222023202417 38 76 65 49 51 56 36 20

学术搜索中的文章

An analysis of environment, microphone and data simulation mismatches in robust speech recognition

E Vincent, S Watanabe, AA Nugraha, J Barker… - Computer Speech & Language, 2017

被引用次数：409 相关文章所有 16 个版本