Lip-listening: Mixing senses to understand lips using cross modality knowledge distillation...

文章

学术资源搜索

获得 2 条结果（用时0.01秒）

我的图书馆

Lip-listening: Mixing senses to understand lips using cross modality knowledge distillation...

在引用文章中搜索

[HTML] arxiv.org

Akvsr: Audio knowledge empowered visual speech recognition by compressing audio knowledge of a pretrained model

JH Yeo, M Kim, J Choi, DH Kim… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

Visual Speech Recognition (VSR) is the task of predicting spoken words from silent lip
movements. VSR is regarded as a challenging task because of the insufficient information …

被引用次数：6 相关文章所有 3 个版本

[PDF] arxiv.org

Spatio-temporal attention mechanism and knowledge distillation for lip reading

S Elashmawy, M Ramsis, HM Eraqi… - arXiv preprint arXiv …, 2021 - arxiv.org

Despite the advancement in the domain of audio and audio-visual speech recognition,
visual speech recognition systems are still quite under-explored due to the visual ambiguity …

被引用次数：3 相关文章所有 4 个版本

高级搜索

QQ 群

Lip-listening: Mixing senses to understand lips using cross modality knowledge distillation...

Akvsr: Audio knowledge empowered visual speech recognition by compressing audio knowledge of a pretrained model

Spatio-temporal attention mechanism and knowledge distillation for lip reading

引用