查看文章

A new multi-stream approach using acoustic and visual features for robust speech recognition system

作者

N Radha, A Shahina, A Nayeemulla Khan, Jansi Rani Sella Velusami

发表日期

2022/1/1

期刊

Materials Today: Proceedings

卷号

页码范围

4916-4924

出版商

Elsevier

简介

Abstract Building a robust Automatic Speech Recognition (ASR) system and improving recognition accuracy in adverse conditions is still a challenging task. One way to improve the robustness of an ASR system is combining information from multiple sources (streams). A multi-stream approach which handles the multiple inputs at the model level is the key contribution of our work. Standard mic (S m), Throat mic (T m), and Lip reading (L r) are the various source streams that have been used. This work explores a static weighted two stream HMM (TSH) and multi-stream HMM (MSH) model for the bimodal and multimodal systems. A syllabic units of the Hindi language database categorized into three categories–Vowel, Place of Articulation (POA), and Manner of Articulation (MOA) are used for training and testing. In this study, four types of TSH are proposed for the combination of bimodal ((S m+ T m),(T m+ L r),(S m+ L r …

引用总数

被引用次数：1

20231

学术搜索中的文章

A new multi-stream approach using acoustic and visual features for robust speech recognition system

N Radha, A Shahina, AN Khan, JRS Velusami - Materials Today: Proceedings, 2022

被引用次数：1 相关文章