查看文章

sciencedirect.com 中的 [PDF]

Visual Speech Recognition using Fusion of Motion and Geometric Features

作者

N Radha, A Shahina, A Nayeemulla Khan

发表日期

2020

期刊

Procedia Computer Science

卷号

171

页码范围

924-933

出版商

Elsevier

简介

The Visual Speech Recognition (VSR) system performance is highly influenced by the selection of visual features. These features are categorized into static and dynamic features. This work proposes to exploit both lip shape (static-geometric features) as well as the temporal sequence of lip movements (dynamic-motion features) to build a combined VSR system with fusion both at feature level and model level. The digit dataset for VSR system is evaluated on the benchmark (using Discrete Wavelet Transform (DWT), Discrete Cosine Transform (DCT), and Zernike Moments (ZM)) systems. First, the Motion History Image (MHI) is calculated from all visemes from which wavelet and Zernike coefficients are extracted and modeled using a simple GMM L-R HMM. This proposed method shows a significant improvement in performance of 85% for MHI-DWT based features, 74% for MHI-DCT and 80% for MHI-ZM features …

引用总数

被引用次数：7

20212022202320241 3 2 1

学术搜索中的文章

Visual speech recognition using fusion of motion and geometric features

N Radha, A Shahina, N Khan - Procedia Computer Science, 2020

被引用次数：7 相关文章