[PDF][PDF] Deep siamese architecture based replay detection for secure voice biometric.

K Sriskandaraja, V Sethu, E Ambikairajah - Interspeech, 2018 - isca-archive.org
Replay attacks are the simplest and the most easily accessible form of spoofing attacks on
voice biometric systems and can be hard to detect by systems designed to identify spoofing …

Audio and visual modality combination in speech processing applications

G Potamianos, E Marcheret, Y Mroueh, V Goel… - The Handbook of …, 2017 - dl.acm.org
Chances are that most of us have experienced difficulty in listening to our interlocutor during
face-to-face conversation while in highly noisy environments, such as next to heavy traffic or …

Using audio-visual information to understand speaker activity: Tracking active speakers on and off screen

K Hoover, S Chaudhuri, C Pantofaru… - … , Speech and Signal …, 2018 - ieeexplore.ieee.org
We present a system that associates faces with voices in a video by fusing information from
the audio and visual signals. The thesis underlying our work is that an extreme simple …

Multimodal Transformer Distillation for Audio-Visual Synchronization

X Chen, H Wu, CC Wang, H Lee… - ICASSP 2024-2024 …, 2024 - ieeexplore.ieee.org
Audio-visual synchronization aims to determine whether the mouth movements and speech
in the video are synchronized. VocaLiST reaches state-of-the-art performance by …

基于特定韵母发音事件分析的语音唇动一致性判决方法.

朱铮宇, 邱华愉, 杨春玲, 王泳 - Journal of South China …, 2020 - search.ebscohost.com
针对现有一致性判决方法主要对整句(段) 话进行分析, 并无对分析内容加以筛选,
存在运算繁琐及结果易受静音等弱关联片段影响等不足, 以唇型变化显著的韵母发音单元为研究 …

Audiovisual synchrony detection with optimized audio features

S Sieranoja, M Sahidullah, T Kinnunen… - 2018 IEEE 3rd …, 2018 - ieeexplore.ieee.org
Audiovisual speech synchrony detection is an important part of talking-face verification
systems. Prior work has primarily focused on visual features and joint-space models, while …

Robust audiovisual liveness detection for biometric authentication using deep joint embedding and dynamic time warping

A Aides, DOV David, H Aronowitz - 2018 IEEE International …, 2018 - ieeexplore.ieee.org
We address the problem of liveness detection in audiovisual recordings for preventing
spoofing attacks in biometric authentication systems. We assume that liveness is detected …

[PDF][PDF] Adversarially robust speech and speaker recognition

L Schönherr - 2021 - scholar.archive.org
Speech and speaker recognition systems are integrated into our everyday life; voice
assistants answer questions, set timers, or play music, but also send personal messages …

[PDF][PDF] Audiovisual Synchrony Detection with Optimized Audio

S Sieranoja, M Sahidullah, T Kinnunen, J Komulainen… - core.ac.uk
Audiovisual speech synchrony detection is an important part of talking-face verification
systems. Prior work has primarily focused on visual features and joint-space models, while …

Spoofing countermeasures for secure and robust voice authentication system: Feature extraction and modelling

K Sriskandaraja - 2018 - unsworks.unsw.edu.au
The ability to employ automatic speaker verification systems without face-to-face contact
makes them more prone to malicious spoofing attacks compared to most other biometric …