Learning where to attend with deep architectures for image tracking

M Denil, L Bazzani, H Larochelle, N de Freitas - Neural computation, 2012 - direct.mit.edu
We discuss an attentional model for simultaneous object tracking and recognition that is
driven by gaze data. Motivated by theories of perception, the model consists of two …

Tracking as online decision-making: Learning a policy from streaming videos with reinforcement learning

J Supancic III, D Ramanan - Proceedings of the IEEE …, 2017 - openaccess.thecvf.com
We formulate tracking as an online decision-making process, where a tracking agent must
follow an object despite ambiguous image frames and a limited computational budget …

Unsupervised foveal vision neural architecture with top-down attention

R Burt, NN Thigpen, A Keil, JC Principe - Neural Networks, 2021 - Elsevier
Deep learning architectures are an extremely powerful tool for recognizing and classifying
images. However, they require supervised learning and normally work on vectors of the size …

Unsupervised foveal vision neural networks with top-down attention

R Burt, NN Thigpen, A Keil, JC Principe - arXiv preprint arXiv:2010.09103, 2020 - arxiv.org
Deep learning architectures are an extremely powerful tool for recognizing and classifying
images. However, they require supervised learning and normally work on vectors the size of …

[PDF][PDF] Beyond Multi-target Tracking

L Bazzani - 2012 - Citeseer
Every day millions and millions of surveillance cameras monitor the world, recording and
collecting huge amount of data. The collected data can be extremely useful: from the …

Beyond Multi-target tracking: statistical pattern analysis of people and groups

L Bazzani - 2012 - iris.univr.it
Every day millions and millions of surveillance cameras monitor the world, recording and
collecting huge amount of data. The collected data can be extremely useful: from the …

[图书][B] Long-Term Tracking by Decision Making

J Supancic III - 2017 - search.proquest.com
Cameras can naturally capture sequences of images, or videos, and for computers to
understand videos, they must track to connect the past with the present. We focus on two …

Finding Objects in Complex Scenes with Top-down and Bottom-up Information

RM Burt - 2017 - search.proquest.com
Humans have the ability to view a scene and form an overall representation in a remarkably
short length of time. However, due to the complexity of visual search, it is reasonable to …

Online Multi-Stage Deep Architectures for Feature Extraction and Object Recognition

DC Rose - 2013 - trace.tennessee.edu
Multi-stage visual architectures have recently found success in achieving high classification
accuracies over image datasets with large variations in pose, lighting, and scale. Inspired by …

[PDF][PDF] Bayesian Optimization for Gaze Selection CPSC 540 Course Project

M Denil - mdenil.com
This project extends the attentional tracking model developed by Bazzani et al.[1] to include
gaze selection strategies which operate in the presence of partial information and on a …