Deep learning for instance retrieval: A survey

W Chen, Y Liu, W Wang, EM Bakker… - … on Pattern Analysis …, 2022 - ieeexplore.ieee.org
In recent years a vast amount of visual content has been generated and shared from many
fields, such as social media platforms, medical imaging, and robotics. This abundance of …

SDBAD-Net: A spatial dual-branch attention dehazing network based on meta-former paradigm

G Zhang, W Fang, Y Zheng… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Image dehazing is an emblematical low-level vision task that aims at restoring haze-free
images from haze images. Recently, some methods adopts deep learning techniques to …

Exploring optical-flow-guided motion and detection-based appearance for temporal sentence grounding

D Liu, X Fang, W Hu, P Zhou - IEEE Transactions on Multimedia, 2023 - ieeexplore.ieee.org
Temporal sentence grounding aims to localize a target segment in an untrimmed video
semantically according to a given sentence query. Most previous works focus on learning …

CNDesc: Cross normalization for local descriptors learning

C Wang, R Xu, S Xu, W Meng… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
For a long time, the local descriptors learning benefited from the use of L2 normalization,
which projects the descriptor space onto the hypersphere. However, there is no free lunch in …

Graph convolutional network discrete hashing for cross-modal retrieval

C Bai, C Zeng, Q Ma, J Zhang - IEEE Transactions on Neural …, 2022 - ieeexplore.ieee.org
With the rapid development of deep neural networks, cross-modal hashing has made great
progress. However, the information of different types of data is asymmetrical, that is to say, if …

OMCBIR: Offline mobile content-based image retrieval with lightweight CNN optimization

X Zhang, C Bai, K Kpalma - Displays, 2023 - Elsevier
Abstract Convolutional Neural Networks (CNNs) have achieved great success in computer
vision applications. However, due to the high requirements for computation power and …

Advanced dropout: A model-free methodology for bayesian dropout optimization

J Xie, Z Ma, J Lei, G Zhang, JH Xue… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Due to lack of data, overfitting ubiquitously exists in real-world applications of deep neural
networks (DNNs). We propose advanced dropout, a model-free methodology, to mitigate …

Label semantic knowledge distillation for unbiased scene graph generation

L Li, J Xiao, H Shi, W Wang, J Shao… - … on Circuits and …, 2023 - ieeexplore.ieee.org
The Scene Graph Generation (SGG) task aims to detect all the objects and their pairwise
visual relationships in a given image. Although SGG has achieved remarkable progress …

Lipformer: learning to lipread unseen speakers based on visual-landmark transformers

F Xue, Y Li, D Liu, Y Xie, L Wu… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Lipreading refers to understanding and further translating the speech of a video speaker into
textual outputs. State-of-the-art lipreading methods excel in interpreting overlap speakers, ie …

Forward propagation dropout in deep neural networks using Jensen–Shannon and random forest feature importance ranking

M Heidari, MH Moattar, H Ghaffari - Neural Networks, 2023 - Elsevier
Dropout is a mechanism to prevent deep neural networks from overfitting and improving
their generalization. Random dropout is the simplest method, where nodes are randomly …