Deep learning-based automated lip-reading: A survey

S Fenghour, D Chen, K Guo, B Li, P Xiao - IEEE Access, 2021 - ieeexplore.ieee.org
A survey on automated lip-reading approaches is presented in this paper with the main
focus being on deep learning related methodologies which have proven to be more fruitful …

Lipreading using temporal convolutional networks

B Martinez, P Ma, S Petridis… - ICASSP 2020-2020 IEEE …, 2020 - ieeexplore.ieee.org
Lip-reading has attracted a lot of research attention lately thanks to advances in deep
learning. The current state-of-the-art model for recognition of isolated words in-the-wild …

A survey of research on lipreading technology

M Hao, M Mamut, N Yadikar, A Aysa, K Ubul - IEEE Access, 2020 - ieeexplore.ieee.org
Although automatic speech recognition (ASR) technology is mature, there are still some
unsolved problems, such as how to accurately identify what the speaker is saying in a noisy …

LRW-1000: A naturally-distributed large-scale benchmark for lip reading in the wild

S Yang, Y Zhang, D Feng, M Yang… - 2019 14th IEEE …, 2019 - ieeexplore.ieee.org
Large-scale datasets have successively proven their fundamental importance in several
research fields, especially for early progress in some emerging topics. In this paper, we …

End-to-end visual speech recognition with LSTMs

S Petridis, Z Li, M Pantic - 2017 IEEE international conference …, 2017 - ieeexplore.ieee.org
Traditional visual speech recognition systems consist of two stages, feature extraction and
classification. Recently, several deep learning approaches have been presented which …

Lip reading using convolutional neural networks with and without pre-trained models

T Ozcan, A Basturk - Balkan journal of electrical and computer …, 2019 - dergipark.org.tr
Lip reading has become a popular topic recently. There are widespread literature studies on
lip reading in human action recognition. Deep learning methods are frequently used in this …

End-to-end visual speech recognition for small-scale datasets

S Petridis, Y Wang, P Ma, Z Li, M Pantic - Pattern Recognition Letters, 2020 - Elsevier
Visual speech recognition models traditionally consist of two stages, feature extraction and
classification. Several deep learning approaches have been recently presented aiming to …

How to use time information effectively? Combining with time shift module for lipreading

M Hao, M Mamut, N Yadikar, A Aysa… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
Lipreading refers to recognizing the speaker's speech content through the image sequence
of lip movement without the speech signal. Currently, most models use a spatiotemporal …

HLR-net: a hybrid lip-reading model based on deep convolutional neural networks

AM Sarhan, NM Elshennawy… - … , Materials and Continua, 2021 - cdn.techscience.cn
Lip reading is typically regarded as visually interpreting the speaker's lip movements during
speech. This is a task of decoding the text from the speaker's mouth movement. This …

Exploring ROI size in deep learning based lipreading

A Koumparoulis, G Potamianos, Y Mroueh, SJ Rennie - AVSP, 2017 - avsp2017.loria.fr
Automatic speechreading systems have increasingly exploited deep learning advances,
resulting in dramatic gains over traditional methods. State-of-the-art systems typically …