作者
Guijin Wang, Xinghao Chen, Hengkai Guo, Cairong Zhang
发表日期
2018/8/1
期刊
Journal of Visual Communication and Image Representation
卷号
55
页码范围
404-414
出版商
Academic Press
简介
Abstract 3D hand pose estimation is an important and challenging problem for human-computer interaction. Recently convolutional networks (ConvNet) with sophisticated design have been employed to address it, but the improvement is not so significant. To exploit good practice and promote the performance for hand pose estimation, we propose a Region Ensemble Network (REN) for directly 3D coordinate regression. It first partitions the last convolutional outputs of ConvNet into several grid regions. Results from separate fully-connected (FC) regressors on each regions are integrated by another FC layer to perform estimation. By exploitation of several training strategies including data augmentation and smooth L 1 loss, REN significantly improves the performance of ConvNet for hand pose estimation. Experiments demonstrate that our approach achieves strong performance on par or better than state-of-the-art …
引用总数
20172018201920202021202220232024122252723202312
学术搜索中的文章
G Wang, X Chen, H Guo, C Zhang - Journal of Visual Communication and Image …, 2018
H Guo, G Wang, X Chen, C Zhang - arXiv preprint arXiv:1707.07248, 2017