Recent advance in content-based image retrieval: A literature survey

W Zhou, H Li, Q Tian - arXiv preprint arXiv:1706.06064, 2017 - arxiv.org
The explosive increase and ubiquitous accessibility of visual data on the Web have led to
the prosperity of research activity in image search or retrieval. With the ignorance of visual …

Deep learning on multi sensor data for counter UAV applications—A systematic review

S Samaras, E Diamantidou, D Ataloglou, N Sakellariou… - Sensors, 2019 - mdpi.com
Usage of Unmanned Aerial Vehicles (UAVs) is growing rapidly in a wide range of consumer
applications, as they prove to be both autonomous and flexible in a variety of environments …

MMTM: Multimodal transfer module for CNN fusion

HRV Joze, A Shaban, ML Iuzzolino… - Proceedings of the …, 2020 - openaccess.thecvf.com
In late fusion, each modality is processed in a separate unimodal Convolutional Neural
Network (CNN) stream and the scores of each modality are fused at the end. Due to its …

Risk prediction with electronic health records: A deep learning approach

Y Cheng, F Wang, P Zhang, J Hu - … of the 2016 SIAM international conference …, 2016 - SIAM
The recent years have witnessed a surge of interests in data analytics with patient Electronic
Health Records (EHR). Data-driven healthcare, which aims at effective utilization of big …

Robust multi-view spectral clustering via low-rank and sparse decomposition

R Xia, Y Pan, L Du, J Yin - Proceedings of the AAAI conference on …, 2014 - ojs.aaai.org
Multi-view clustering, which seeks a partition of the data inmultiple views that often provide
complementary information to eachother, has received considerable attention in recent …

Modeling spatial-temporal clues in a hybrid deep learning framework for video classification

Z Wu, X Wang, YG Jiang, H Ye, X Xue - Proceedings of the 23rd ACM …, 2015 - dl.acm.org
Classifying videos according to content semantics is an important problem with a wide range
of applications. In this paper, we propose a hybrid deep learning framework for video …

Exploiting feature and class relationships in video categorization with regularized deep neural networks

YG Jiang, Z Wu, J Wang, X Xue… - IEEE transactions on …, 2017 - ieeexplore.ieee.org
In this paper, we study the challenging problem of categorizing videos according to high-
level semantics such as the existence of a particular human action or a complex event …

Moddrop: adaptive multi-modal gesture recognition

N Neverova, C Wolf, G Taylor… - IEEE Transactions on …, 2015 - ieeexplore.ieee.org
We present a method for gesture detection and localisation based on multi-scale and multi-
modal deep learning. Each visual modality captures spatial information at a particular spatial …

Sparse representation based fisher discrimination dictionary learning for image classification

M Yang, L Zhang, X Feng, D Zhang - International Journal of Computer …, 2014 - Springer
The employed dictionary plays an important role in sparse representation or sparse coding
based image reconstruction and classification, while learning dictionaries from the training …

Multi-stream multi-class fusion of deep networks for video classification

Z Wu, YG Jiang, X Wang, H Ye, X Xue - Proceedings of the 24th ACM …, 2016 - dl.acm.org
This paper studies deep network architectures to address the problem of video classification.
A multi-stream framework is proposed to fully utilize the rich multimodal information in …