Learning attentional mechanisms for simultaneous object tracking and recognition with deep networks

M Denil, L Bazzani, H Larochelle, N de Freitas - Neural computation, 2012 - direct.mit.edu

We discuss an attentional model for simultaneous object tracking and recognition that is
driven by gaze data. Motivated by theories of perception, the model consists of two …

被引用次数：266 相关文章所有 17 个版本

[PDF] thecvf.com

Tracking as online decision-making: Learning a policy from streaming videos with reinforcement learning

J Supancic III, D Ramanan - Proceedings of the IEEE …, 2017 - openaccess.thecvf.com

We formulate tracking as an online decision-making process, where a tracking agent must
follow an object despite ambiguous image frames and a limited computational budget …

被引用次数：135 相关文章所有 8 个版本

[PDF] sciencedirect.com

Unsupervised foveal vision neural architecture with top-down attention

R Burt, NN Thigpen, A Keil, JC Principe - Neural Networks, 2021 - Elsevier

Deep learning architectures are an extremely powerful tool for recognizing and classifying
images. However, they require supervised learning and normally work on vectors of the size …

被引用次数：6 相关文章所有 5 个版本

[PDF] arxiv.org

Unsupervised foveal vision neural networks with top-down attention

R Burt, NN Thigpen, A Keil, JC Principe - arXiv preprint arXiv:2010.09103, 2020 - arxiv.org

Deep learning architectures are an extremely powerful tool for recognizing and classifying
images. However, they require supervised learning and normally work on vectors the size of …

被引用次数：4 相关文章所有 3 个版本

[PDF] psu.edu

[PDF][PDF] Beyond Multi-target Tracking

L Bazzani - 2012 - Citeseer

Every day millions and millions of surveillance cameras monitor the world, recording and
collecting huge amount of data. The collected data can be extremely useful: from the …

被引用次数：3 相关文章

[PDF] univr.it

Beyond Multi-target tracking: statistical pattern analysis of people and groups

L Bazzani - 2012 - iris.univr.it

Every day millions and millions of surveillance cameras monitor the world, recording and
collecting huge amount of data. The collected data can be extremely useful: from the …

被引用次数：2 相关文章所有 5 个版本

[HTML] proquest.com

[图书][B] Long-Term Tracking by Decision Making

J Supancic III - 2017 - search.proquest.com

Cameras can naturally capture sequences of images, or videos, and for computers to
understand videos, they must track to connect the past with the present. We focus on two …

Finding Objects in Complex Scenes with Top-down and Bottom-up Information

RM Burt - 2017 - search.proquest.com

Humans have the ability to view a scene and form an overall representation in a remarkably
short length of time. However, due to the complexity of visual search, it is reasonable to …

[PDF] tennessee.edu

Online Multi-Stage Deep Architectures for Feature Extraction and Object Recognition

DC Rose - 2013 - trace.tennessee.edu

Multi-stage visual architectures have recently found success in achieving high classification
accuracies over image datasets with large variations in pose, lighting, and scale. Inspired by …

[PDF][PDF] Bayesian Optimization for Gaze Selection CPSC 540 Course Project

M Denil - mdenil.com

This project extends the attentional tracking model developed by Bazzani et al.[1] to include
gaze selection strategies which operate in the presence of partial information and on a …

高级搜索

QQ 群