Akvsr: Audio knowledge empowered visual speech recognition by compressing audio knowledge of a pretrained model

JH Yeo, M Kim, J Choi, DH Kim… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Visual Speech Recognition (VSR) is the task of predicting spoken words from silent lip
movements. VSR is regarded as a challenging task because of the insufficient information …

Accurate and resource-efficient lipreading with efficientnetv2 and transformers

A Koumparoulis, G Potamianos - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org
We present a novel resource-efficient end-to-end architecture for lipreading that achieves
state-of-the-art results on a popular and challenging benchmark. In particular, we make the …

Review of Deep Learning Methods for Lip Recognition.

MA Jinlin, ZHU Yanbin, MA Ziping… - Journal of …, 2021 - search.ebscohost.com
With the continuous development of deep learning, significant progress has been made in
the field of lip recognition, and many deep learning algorithms for lip recognition have …