作者
S Uma Maheswari, N Radha, A Shahina, P Prabha, BT Preethi Sri, A Nayeemulla Khan
发表日期
2022/1/1
期刊
Materials Today: Proceedings
卷号
62
页码范围
5034-5041
出版商
Elsevier
简介
Research work on the design of robust multimodal speech recognition systems making use of acoustic and visual cues, extracted using the relatively noise robust alternate speech sensors is gaining interest in recent times among the speech processing research fraternity. The primary objective of this work is to study the exclusive influence of Lombard effect on Automatic Speech Recognition (ASR) systems towards building robust multimodal ASR systems in adverse environments in the context of Indian languages which are syllabic in nature. The dataset for this work comprises the confusable 145 Consonant-Vowel (CV) syllabic units of Hindi language recorded simultaneously using three modalities that capture the acoustic and visual speech cues, namely Normal acoustic Microphone (NM), Throat Microphone (TM) and a camera that captures the associated lip movements. The Lombard effect is induced by …
引用总数
学术搜索中的文章