Graph convolutional networks in language and vision: A survey

H Ren, W Lu, Y Xiao, X Chang, X Wang, Z Dong… - Knowledge-Based …, 2022 - Elsevier
Graph convolutional networks (GCNs) have a strong ability to learn graph representation
and have achieved good performance in a range of applications, including social …

Survey of graph neural networks and applications

F Liang, C Qian, W Yu, D Griffith… - … and Mobile Computing, 2022 - Wiley Online Library
The advance of deep learning has shown great potential in applications (speech, image,
and video classification). In these applications, deep learning models are trained by …

Ppt: token-pruned pose transformer for monocular and multi-view human pose estimation

H Ma, Z Wang, Y Chen, D Kong, L Chen, X Liu… - … on Computer Vision, 2022 - Springer
Recently, the vision transformer and its variants have played an increasingly important role
in both monocular and multi-view human pose estimation. Considering image patches as …

Transfusion: Cross-view fusion with transformer for 3d human pose estimation

H Ma, L Chen, D Kong, Z Wang, X Liu, H Tang… - arXiv preprint arXiv …, 2021 - arxiv.org
Estimating the 2D human poses in each view is typically the first step in calibrated multi-view
3D pose estimation. But the performance of 2D pose detectors suffers from challenging …

High fidelity 3d hand shape reconstruction via scalable graph frequency decomposition

T Luan, Y Zhai, J Meng, Z Li, Z Chen… - Proceedings of the …, 2023 - openaccess.thecvf.com
Despite the impressive performance obtained by recent single-image hand modeling
techniques, they lack the capability to capture sufficient details of the 3D hand mesh. This …

Identity-aware hand mesh estimation and personalization from rgb images

D Kong, L Zhang, L Chen, H Ma, X Yan, S Sun… - … on Computer Vision, 2022 - Springer
Reconstructing 3D hand meshes from monocular RGB images has attracted increasing
amount of attention due to its enormous potential applications in the field of AR/VR. Most …

Mvhm: A large-scale multi-view hand mesh benchmark for accurate 3d hand pose estimation

L Chen, SY Lin, Y Xie, YY Lin… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Estimating 3D hand poses from a single RGB image is challenging because depth
ambiguity leads the problem ill-posed. Training hand pose estimators with 3D hand mesh …

Temporal-aware self-supervised learning for 3d hand pose and mesh estimation in videos

L Chen, SY Lin, Y Xie, YY Lin… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Estimating 3D hand pose directly from RGB images is challenging but has gained steady
progress recently by training deep models with annotated 3D poses. However annotating …

Megan: memory enhanced graph attention network for space-time video super-resolution

C You, L Han, A Feng, R Zhao… - Proceedings of the …, 2022 - openaccess.thecvf.com
Abstract Space-time video super-resolution (STVSR) aims to construct a high space-time
resolution video sequence from the corresponding low-frame-rate, low-resolution video …

PD-Net: Quantitative Motor Function Evaluation for Parkinson's Disease via Automated Hand Gesture Analysis

Y Chen, H Ma, J Wang, J Wu, X Wu, X Xie - Proceedings of the 27th ACM …, 2021 - dl.acm.org
Parkinson's Disease (PD) is a commonly diagnosed movement disorder with more than 10
million patients worldwide. Its clinical evaluation relies on a rating system called MDS …