查看文章

semanticscholar.org 中的 [PDF]

Multi-scale deep learning for gesture detection and localization

作者

Natalia Neverova, Christian Wolf, Graham W Taylor, Florian Nebout

发表日期

2015

研讨会论文

Computer Vision-ECCV 2014 Workshops: Zurich, Switzerland, September 6-7 and 12, 2014, Proceedings, Part I 13

页码范围

474-490

出版商

Springer International Publishing

简介

We present a method for gesture detection and localization based on multi-scale and multi-modal deep learning. Each visual modality captures spatial information at a particular spatial scale (such as motion of the upper body or a hand), and the whole system operates at two temporal scales. Key to our technique is a training strategy which exploits i) careful initialization of individual modalities; and ii) gradual fusion of modalities from strongest to weakest cross-modality structure. We present experiments on the ChaLearn 2014 Looking at People Challenge gesture recognition track, in which we placed first out of 17 teams.

引用总数

被引用次数：293

201420152016201720182019202020212022202320245 16 18 54 44 39 35 25 23 15 12

学术搜索中的文章

Multi-scale deep learning for gesture detection and localization

N Neverova, C Wolf, GW Taylor, F Nebout - Computer Vision-ECCV 2014 Workshops: Zurich …, 2015

被引用次数：293 相关文章所有 3 个版本