A comprehensive survey on the biometric recognition systems based on physiological and behavioral modalities

S Dargan, M Kumar - Expert Systems with Applications, 2020 - Elsevier
Biometrics is the branch of science that deals with the identification and verification of an
individual based on the physiological and behavioral traits. These traits or identifiers are …

CATNet: Cross-modal fusion for audio–visual speech recognition

X Wang, J Mi, B Li, Y Zhao, J Meng - Pattern Recognition Letters, 2024 - Elsevier
Automatic speech recognition (ASR) is a typical pattern recognition technology that converts
human speeches into texts. With the aid of advanced deep learning models, the …

A survey on visual speech recognition approaches

N Radha, A Shahina - 2021 International Conference on …, 2021 - ieeexplore.ieee.org
The robustness of automatic speech recognition (ASR) systems degrade due to the factors
such as environmental noises, speaker variability, and channel distortion, among others …

Binary neural networks for classification of voice commands from throat microphone

FC Ribeiro, RTS Carvalho, PC Cortez… - IEEE …, 2018 - ieeexplore.ieee.org
Multi-class pattern classification has many applications including speech recognition, and it
is not easy to extend from two-class neural networks (NNs). This paper presents a study …

Visual speech recognition using fusion of motion and geometric features

N Radha, A Shahina, N Khan - Procedia Computer Science, 2020 - Elsevier
Abstract The Visual Speech Recognition (VSR) system performance is highly influenced by
the selection of visual features. These features are categorized into static and dynamic …

Improving recognition of speech system using multimodal approach

N Radha, A Shahina, A Nayeemulla Khan - International Conference on …, 2019 - Springer
Building an ASR system in adverse conditions is a challenging task. The performance of the
ASR system is high in clean environments. However, the variabilities such as speaker effect …

A new multi-stream approach using acoustic and visual features for robust speech recognition system

N Radha, A Shahina, AN Khan… - Materials Today …, 2022 - Elsevier
Abstract Building a robust Automatic Speech Recognition (ASR) system and improving
recognition accuracy in adverse conditions is still a challenging task. One way to improve …

A multimodal Lombard speech recognition system for the confusable Hindi syllabic units

SU Maheswari, N Radha, A Shahina, P Prabha… - Materials Today …, 2022 - Elsevier
Research work on the design of robust multimodal speech recognition systems making use
of acoustic and visual cues, extracted using the relatively noise robust alternate speech …

A Study on Alternative Speech Sensor

N Radha, A Shahina, AN Khan - … International Conference on …, 2018 - ieeexplore.ieee.org
This paper presents a study on alternative speech sensor for speech processing
applications. Noise robustness is one of the major considerations in speech processing …

Multimodal fusion for pattern recognition

Z Khan, S Kumar, EBG Reyes, P Mahanti - Pattern Recognition Letters, 2018 - Elsevier
This guest editorial introduces the special issue on “multimodal fusion for pattern
recognition”. The goal of this special issue is to consolidate and to strengthen the …