Multi-stream deep neural networks for rgb-d egocentric action recognition

Y Tang, Z Wang, J Lu, J Feng… - IEEE Transactions on …, 2018 - ieeexplore.ieee.org
In this paper, we investigate the problem of RGB-D egocentric action recognition. Unlike
conventional human action videos that are passively recorded by static cameras, egocentric …

Complex events detection using data-driven concepts

Y Yang, M Shah - Computer Vision–ECCV 2012: 12th European …, 2012 - Springer
Automatic event detection in a large collection of unconstrained videos is a challenging and
important task. The key issue is to describe long complex video with high level semantic …

Concept-based patent image retrieval

S Vrochidis, A Moumtzidou, I Kompatsiaris - World Patent Information, 2012 - Elsevier
Recently, the intellectual property and information retrieval communities have shown
increasing interest in patent image retrieval, which could further enhance the current …

Harvesting social images for bi-concept search

X Li, CGM Snoek, M Worring… - IEEE Transactions on …, 2012 - ieeexplore.ieee.org
Searching for the co-occurrence of two visual concepts in unlabeled images is an important
step towards answering complex user queries. Traditional visual search methods use …

Circular reranking for visual search

T Yao, CW Ngo, T Mei - IEEE Transactions on Image …, 2012 - ieeexplore.ieee.org
Search reranking is regarded as a common way to boost retrieval precision. The problem
nevertheless is not trivial especially when there are multiple features or modalities to be …

Superpixel-based causal multisensor video fusion

VN Gangapure, S Nanda… - IEEE Transactions on …, 2017 - ieeexplore.ieee.org
Video surveillance systems have become extremely important recently. It has been
observed that information extracted from a single spectrum video is often insufficient in …

A cross-modal approach for extracting semantic relationships between concepts using tagged images

M Katsurai, T Ogawa… - IEEE transactions on …, 2014 - ieeexplore.ieee.org
This paper presents a cross-modal approach for extracting semantic relationships between
concepts using tagged images. In the proposed method, we first project both text and visual …

Learning hierarchical video representation for action recognition

Q Li, Z Qiu, T Yao, T Mei, Y Rui, J Luo - International Journal of Multimedia …, 2017 - Springer
Video analysis is an important branch of computer vision due to its wide applications,
ranging from video surveillance, video indexing, and retrieval to human computer …

A video indexing and retrieval computational prototype based on transcribed speech

N Spolaôr, HD Lee, WSR Takaki, LA Ensina… - Multimedia Tools and …, 2021 - Springer
Using the voice to interact with systems is attractive in medicine and other areas due to its
friendliness and flexibility. Video indexing and retrieval have benefited from this resource …

Towards large-scale multimedia retrieval enriched by knowledge about human interpretation: retrospective survey

K Shirahama, M Grzegorzek - Multimedia Tools and Applications, 2016 - Springer
Abstract Recent Large-Scale Multimedia Retrieval (LSMR) methods seem to heavily rely on
analysing a large amount of data using high-performance machines. This paper aims to …