Survey on videos data augmentation for deep learning models

N Cauli, D Reforgiato Recupero - Future Internet, 2022 - mdpi.com
In most Computer Vision applications, Deep Learning models achieve state-of-the-art
performances. One drawback of Deep Learning is the large amount of data needed to train …

DCFace: Synthetic face generation with dual condition diffusion model

M Kim, F Liu, A Jain, X Liu - … of the IEEE/CVF conference on …, 2023 - openaccess.thecvf.com
Generating synthetic datasets for training face recognition models is challenging because
dataset generation entails more than creating high fidelity images. It involves generating …

Depth Pro: Sharp monocular metric depth in less than a second

A Bochkovskii, A Delaunoy, H Germain… - arXiv preprint arXiv …, 2024 - arxiv.org
We present a foundation model for zero-shot metric monocular depth estimation. Our model,
Depth Pro, synthesizes high-resolution depth maps with unparalleled sharpness and high …

CASA: Category-agnostic skeletal animal reconstruction

Y Wu, Z Chen, S Liu, Z Ren… - Advances in Neural …, 2022 - proceedings.neurips.cc
Recovering a skeletal shape from a monocular video is a longstanding challenge. Prevailing
nonrigid animal reconstruction methods often adopt a control-point driven animation model …

4D panoptic scene graph generation

J Yang, J Cen, W Peng, S Liu, F Hong… - Advances in …, 2024 - proceedings.neurips.cc
We are living in a three-dimensional space while moving forward through a fourth
dimension: time. To allow artificial intelligence to develop a comprehensive understanding …

Rethinking inductive biases for surface normal estimation

G Bae, AJ Davison - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
Despite the growing demand for accurate surface normal estimation models, existing
methods use general-purpose dense prediction models adopting the same inductive biases …

Playing for 3D human recovery

Z Cai, M Zhang, J Ren, C Wei, D Ren… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org
Image- and video-based 3D human recovery (i.e., pose and shape estimation) have achieved
substantial progress. However, due to the prohibitive cost of motion capture, existing …

Deformation and correspondence aware unsupervised synthetic-to-real scene flow estimation for point clouds

Z Jin, Y Lei, N Akhtar, H Li… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Point cloud scene flow estimation is of practical importance for dynamic scene navigation in
autonomous driving. Since scene flow labels are hard to obtain, current methods train their …

Nothing stands still: A spatiotemporal benchmark on 3D point cloud registration under large geometric and temporal change

T Sun, Y Hao, S Huang, S Savarese, K Schindler… - ISPRS Journal of …, 2025 - Elsevier
Building 3D geometric maps of man-made spaces is a well-established and active field that
is fundamental to numerous computer vision and robotics applications. However …

MUVA: A new large-scale benchmark for multi-view amodal instance segmentation in the shopping scenario

Z Li, W Ye, J Terven, Z Bennett… - Proceedings of the …, 2023 - openaccess.thecvf.com
Amodal Instance Segmentation (AIS) endeavors to accurately deduce complete
object shapes that are partially or fully occluded. However, the inherent ill-posed nature of …