Spatio-temporal attention mechanism and knowledge distillation for lip reading

3 citing articles found.

[HTML] arxiv.org

Akvsr: Audio knowledge empowered visual speech recognition by compressing audio knowledge of a pretrained model

JH Yeo, M Kim, J Choi, DH Kim… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

Visual Speech Recognition (VSR) is the task of predicting spoken words from silent lip
movements. VSR is regarded as a challenging task because of the insufficient information …

Cited by: 6 (3 versions)

[PDF] researchgate.net

Accurate and resource-efficient lipreading with efficientnetv2 and transformers

A Koumparoulis, G Potamianos - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org

We present a novel resource-efficient end-to-end architecture for lipreading that achieves
state-of-the-art results on a popular and challenging benchmark. In particular, we make the …

Cited by: 20 (3 versions)

Review of Deep Learning Methods for Lip Recognition.

MA Jinlin, ZHU Yanbin, MA Ziping… - Journal of …, 2021 - search.ebscohost.com

With the continuous development of deep learning, significant progress has been made in
the field of lip recognition, and many deep learning algorithms for lip recognition have …

Cited by: 1
