Combining audio and visual speech recognition using LSTM and deep convolutional neural network

R Shashidhar, S Patilkulkarni, SB Puneeth - International Journal of …, 2022 - Springer
Human speech is bimodal, whereas audio speech relates to the speaker's acoustic
waveform. Lip motions are referred to as visual speech. Audiovisual Speech Recognition is …

A multiple-input deep residual convolutional neural network for reservoir permeability prediction

M Masroor, ME Niri, MH Sharifinasab - Geoenergy Science and …, 2023 - Elsevier
Permeability plays an essential role in reservoir-related studies, including fluid flow
characterization, reservoir modeling/simulation, and management. However, operational …

Aspect-based sentiment analysis of customer speech data using deep convolutional neural network and bilstm

S Murugaiyan, SR Uyyala - Cognitive Computation, 2023 - Springer
The process of detecting sentiments of particular context from human speech emotions is
naturally in-built for humans unlike computers, where it is not possible to process human …

Visual speech recognition for kannada language using vgg16 convolutional neural network

S Rudregowda, S Patil Kulkarni, G HL, V Ravi… - Acoustics, 2023 - mdpi.com
Visual speech recognition (VSR) is a method of reading speech by noticing the lip actions of
the narrators. Visual speech significantly depends on the visual features derived from the …

Automatic guava disease detection using different deep learning approaches

V Tewari, NA Azeem, S Sharma - Multimedia Tools and Applications, 2024 - Springer
In many countries, agriculture plays a major role in the economy. The health of the crop is
therefore very important, but there are many plant diseases that are difficult to diagnose. A …

Online diagnosis for rolling bearings based on multi-channel convolution and transfer learning

Z Meng, Z Zhao, B Zhu, F Fan - Measurement Science and …, 2022 - iopscience.iop.org
In recent years, the fault diagnosis methods based on deep learning have been widely
applied. In practical engineering, there are great distribution differences between the …

Deep delay rectified neural networks

C Shan, A Li, X Chen - The Journal of Supercomputing, 2023 - Springer
An activation function is one of the key factors for the success in deep learning. According to
the neurobiology research, biological neurons don't respond to external stimuli in the initial …

[HTML][HTML] Audiovisual speech recognition based on a deep convolutional neural network

S Rudregowda, S Patilkulkarni, V Ravi… - Data Science and …, 2024 - Elsevier
Audiovisual speech recognition is an emerging research topic. Lipreading is the recognition
of what someone is saying using visual information, primarily lip movements. In this study …

Enhancing visual speech recognition for deaf individuals: a hybrid LSTM and CNN 3D model for improved accuracy

R Shashidhar, MP Shashank, B Sahana - Arabian Journal for Science and …, 2023 - Springer
The ability of a person to communicate with other people and engage with the outside
environment makes it crucial to a person's existence. It can be challenging for all the people …

Study on the CNN model optimization for household garbage classification based on machine learning

W Xie, S Li, W Xu, H Deng, W Liao… - Journal of Ambient …, 2022 - content.iospress.com
In order to solve the problem of household garbage classification accurately and efficiently,
convolutional neural network classifier is an effective method. In this study, a garbage …