Audiovisual synchronization and fusion using canonical correlation analysis

ME Sargin, Y Yemez, E Erzin… - IEEE transactions on …, 2007 - ieeexplore.ieee.org
It is well-known that early integration (also called data fusion) is effective when the
modalities are correlated, and late integration (also called decision or opinion fusion) is …

Discriminative analysis of lip motion features for speaker identification and speech-reading

HE Cetingul, Y Yemez, E Erzin… - IEEE Transactions on …, 2006 - ieeexplore.ieee.org
There have been several studies that jointly use audio, lip intensity, and lip geometry
information for speaker identification and speech-reading applications. This paper proposes …

Multimodal speaker/speech recognition using lip motion, lip texture and audio

HE Çetingül, E Erzin, Y Yemez, AM Tekalp - Signal processing, 2006 - Elsevier
We present a new multimodal speaker/speech recognition system that integrates audio, lip
texture and lip motion modalities. Fusion of audio and face texture modalities has been …

Human Voice Sensing through Radio-Frequency Technologies: A Comprehensive Review

Y Wu, J Han, Z Jian, W Xu - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Voice print is one of the promising solutions for biometric applications by distinctive patterns
of certain voice characteristics of an individual. Due to the nonintrusive and high-accuracy …

Multimodal speaker identification using canonical correlation analysis

ME Sargin, E Erzin, Y Yemez… - 2006 IEEE International …, 2006 - ieeexplore.ieee.org
In this work, we explore the use of canonical correlation analysis to improve the performance
of multimodal recognition systems that involve multiple correlated modalities. More …

Robust lip-motion features for speaker identification

HE Cetingul, Y Yemez, E Erzin… - … .(ICASSP'05). IEEE …, 2005 - ieeexplore.ieee.org
The paper addresses the selection of robust lip-motion features for an audio-visual open-set
speaker identification problem. We consider two alternatives for initial lip motion …

An analysis of the effect of combining standard and alternate sensor signals on recognition of syllabic units for multimodal speech recognition

N Radha, A Shahina, P Prabha, PS BT, N Khan - Pattern recognition letters, 2018 - Elsevier
This paper studies the effect of combining evidences from multiple modes of speech on the
recognition of different categories of sounds. Multimodal speech recognition systems are …

A built-in redundancy-analysis scheme for RAMs with 2D redundancy using 1D local bitmap

TW Tseng, JF Li, DM Chang - … of the Design Automation & Test …, 2006 - ieeexplore.ieee.org
Built-in self-repair (BISR) technique is gaining popular for repairing embedded memory
cores in system-on-chips (SOCs). To increase the utilization of memory redundancy, the …

Biometric identification using motion history images of a speaker's lip movements

AG de la Cuesta, J Zhang… - 2008 International Machine …, 2008 - ieeexplore.ieee.org
This paper describes a new simple, but effective, approach to speaker verification using
video sequences of lip movements. We use motion history images (MHI) to provide a …

Visual speaker identification with spatiotemporal directional features

G Zhao, M Pietikäinen - … 10th International Conference, ICIAR 2013, Póvoa …, 2013 - Springer
In this paper, a novel local spatiotemporal directional descriptor is proposed for speaker
identification by analyzing mouth movements. For this new descriptor, the directional local …