human actions in videos. In this paper, we present a hand tracking method using proposal
selection based on temporal information, hand detection and human pose estimation. We
use color and motion features for hand tracker, and obtain hand detection proposals using
Faster Region-based Convolutional Neural Network (RCNN). Stacked-hour-glasses network
is used for human pose estimation to provide possible hand regions based on the spatial …