An overview of deep-learning-based audio-visual speech enhancement and separation

D Michelsanti, ZH Tan, SX Zhang, Y Xu… - … on Audio, Speech …, 2021 - ieeexplore.ieee.org
Speech enhancement and speech separation are two related tasks, whose purpose is to
extract either one or more target speech signals, respectively, from a mixture of sounds …

CochleaNet: A robust language-independent audio-visual model for real-time speech enhancement

M Gogate, K Dashtipour, A Adeel, A Hussain - Information Fusion, 2020 - Elsevier
Noisy situations cause huge problems for the hearing-impaired, as hearing aids often make
speech more audible but do not always restore intelligibility. In noisy settings, humans …

Watmif: Multimodal medical image fusion-based watermarking for telehealth applications

KN Singh, OP Singh, AK Singh, AK Agrawal - Cognitive Computation, 2024 - Springer
Over recent years, the volume of big data has drastically increased for medical applications.
Such data are shared by cloud providers for storage and further processing. Medical images …

Contextual deep learning-based audio-visual switching for speech enhancement in real-world environments

A Adeel, M Gogate, A Hussain - Information Fusion, 2020 - Elsevier
Human speech processing is inherently multi-modal, where visual cues (eg lip movements)
can help better understand speech in noise. Our recent work [1] has shown that lip-reading …

Multimodal audio-visual information fusion using canonical-correlated graph neural network for energy-efficient speech enhancement

LA Passos, JP Papa, J Del Ser, A Hussain, A Adeel - Information Fusion, 2023 - Elsevier
This paper proposes a novel multimodal self-supervised architecture for energy-efficient
audio-visual (AV) speech enhancement that integrates Graph Neural Networks with …

Rating of Modern Color Image Cryptography: A Next‐Generation Computing Perspective

M Samiullah, W Aslam, MA Khan… - Wireless …, 2022 - Wiley Online Library
Issues such as inefficient encryption architectures, nonstandard formats of image datasets,
weak randomness of chaos‐based Pseudorandom Number Generators (PRNGs), omitted S …

Unlocking the potential of two-point cells for energy-efficient and resilient training of deep nets

A Adeel, A Adetomi, K Ahmed… - … on Emerging Topics …, 2023 - ieeexplore.ieee.org
Context-sensitive two-point layer 5 pyramidal cells (L5PCs) were discovered as long ago as
1999. However, the potential of this discovery to provide useful neural computation has yet …

Speech reconstruction with reminiscent sound via visual voice memory

J Hong, M Kim, SJ Park, YM Ro - IEEE/ACM Transactions on …, 2021 - ieeexplore.ieee.org
The goal of this work is to reconstruct speech from silent video, in both speaker dependent
and independent ways. Unlike previous works that have been mostly restricted to a speaker …

Design of secure cryptosystem based on chaotic components and AES S-Box

Z Qiao, S El Assad, I Taralova - AEU-International Journal of Electronics …, 2020 - Elsevier
In this paper, we design, realize and evaluate a new secure cryptosystem based on a
Pseudo-Chaotic Number Generator (PCNG), a global diffusion and a block cipher in Cipher …

Mobility prediction-based optimisation and encryption of passenger traffic-flows using machine learning

SM Asad, J Ahmad, S Hussain, A Zoha, QH Abbasi… - Sensors, 2020 - mdpi.com
Information and Communication Technology (ICT) enabled optimisation of train's passenger
traffic flows is a key consideration of transportation under Smart City planning (SCP) …