Deep learning for approximate nearest neighbour search: A survey and future directions

M Li, YG Wang, P Zhang, H Wang, L Fan… - … on Knowledge and …, 2022 - ieeexplore.ieee.org
Approximate nearest neighbour search (ANNS) in high-dimensional space is an essential
and fundamental operation in many applications from many domains such as multimedia …

Video moment localization via deep cross-modal hashing

Y Hu, M Liu, X Su, Z Gao, L Nie - IEEE Transactions on Image …, 2021 - ieeexplore.ieee.org
Due to the continuous booming of surveillance and Web videos, video moment localization,
as an important branch of video content analysis, has attracted wide attention from both …

Targeted attack for deep hashing based retrieval

J Bai, B Chen, Y Li, D Wu, W Guo, S Xia… - Computer Vision–ECCV …, 2020 - Springer
The deep hashing based retrieval method is widely adopted in large-scale image and video
retrieval. However, there is little investigation on its security. In this paper, we propose a …

Self-supervised video hashing via bidirectional transformers

S Li, X Li, J Lu, J Zhou - … of the IEEE/CVF conference on …, 2021 - openaccess.thecvf.com
Most existing unsupervised video hashing methods are built on unidirectional models with
less reliable training objectives, which underuse the correlations among frames and the …

Semantics-aware spatial-temporal binaries for cross-modal video retrieval

M Qi, J Qin, Y Yang, Y Wang… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
With the current exponential growth of video-based social networks, video retrieval using
natural language is receiving ever-increasing attention. Most existing approaches tackle this …

Neighborhood-adaptive structure augmented metric learning

P Li, Y Li, H Xie, L Zhang - Proceedings of the AAAI Conference on …, 2022 - ojs.aaai.org
Most metric learning techniques typically focus on sample embedding learning, while
implicitly assume a homogeneous local neighborhood around each sample, based on the …

Dual-stream knowledge-preserving hashing for unsupervised video retrieval

P Li, H Xie, J Ge, L Zhang, S Min, Y Zhang - European Conference on …, 2022 - Springer
Unsupervised video hashing usually optimizes binary codes by learning to reconstruct input
videos. Such reconstruction constraint spends much effort on frame-level temporal context …

Deep neighborhood structure-preserving hashing for large-scale image retrieval

Q Qin, K Xie, W Zhang, C Wang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Deep hashing integrates the advantages of deep learning and hashing technology, and has
become the mainstream of the large-scale image retrieval field. However, when training the …

Contrastive masked autoencoders for self-supervised video hashing

Y Wang, J Wang, B Chen, Z Zeng, ST Xia - Proceedings of the AAAI …, 2023 - ojs.aaai.org
Self-Supervised Video Hashing (SSVH) models learn to generate short binary
representations for videos without ground-truth supervision, facilitating large-scale video …

Self-attention binary neural tree for video summarization

H Fu, H Wang - Pattern Recognition Letters, 2021 - Elsevier
In this paper, we address the problem of shot-level video summarization, which aims at
selecting a subset of video shots as a summary to represent the original video contents …