Transformerfusion: Monocular rgb scene reconstruction using transformers

A Bozic, P Palafox, J Thies, A Dai… - Advances in Neural …, 2021 - proceedings.neurips.cc
We introduce TransformerFusion, a transformer-based 3D scene reconstruction approach.
From an input monocular RGB video, the video frames are processed by a transformer …

Conerf: Controllable neural radiance fields

K Kania, KM Yi, M Kowalski… - Proceedings of the …, 2022 - openaccess.thecvf.com
We extend neural 3D representations to allow for intuitive and interpretable user control
beyond novel view rendering (ie camera control). We allow the user to annotate which part …

Doublefield: Bridging the neural surface and radiance fields for high-fidelity human reconstruction and rendering

R Shao, H Zhang, H Zhang, M Chen… - Proceedings of the …, 2022 - openaccess.thecvf.com
We introduce DoubleField, a novel framework combining the merits of both surface field and
radiance field for high-fidelity human reconstruction and rendering. Within DoubleField, the …

Nsf: Neural surface fields for human modeling from monocular depth

Y Xue, BL Bhatnagar, R Marin… - Proceedings of the …, 2023 - openaccess.thecvf.com
Obtaining personalized 3D animatable avatars from a monocular camera has several real
world applications in gaming, virtual try-on, animation, and VR/XR, etc. However, it is very …

Generalizable neural performer: Learning robust radiance fields for human novel view synthesis

W Cheng, S Xu, J Piao, C Qian, W Wu, KY Lin… - arXiv preprint arXiv …, 2022 - arxiv.org
This work targets at using a general deep learning framework to synthesize free-viewpoint
images of arbitrary human performers, only requiring a sparse number of camera views as …

Geometry-guided progressive nerf for generalizable and efficient neural human rendering

M Chen, J Zhang, X Xu, L Liu, Y Cai, J Feng… - European Conference on …, 2022 - Springer
In this work we develop a generalizable and efficient Neural Radiance Field (NeRF) pipeline
for high-fidelity free-viewpoint human body synthesis under settings with sparse camera …

Mononhr: Monocular neural human renderer

H Choi, G Moon, M Armando, V Leroy… - … Conference on 3D …, 2022 - ieeexplore.ieee.org
Existing neural human rendering methods struggle with a single image input due to the lack
of information in in-visible areas and the depth ambiguity of pixels in visible areas. In this …

Bodymap: Learning full-body dense correspondence map

A Ianina, N Sarafianos, Y Xu… - Proceedings of the …, 2022 - openaccess.thecvf.com
Dense correspondence between humans carries powerful semantic information that can be
utilized to solve fundamental problems for full-body understanding such as in-the-wild …

FLEX: extrinsic parameters-free multi-view 3D human motion reconstruction

B Gordon, S Raab, G Azov, R Giryes… - European Conference on …, 2022 - Springer
The increasing availability of video recordings made by multiple cameras has offered new
means for mitigating occlusion and depth ambiguities in pose and motion reconstruction …

Super-resolution 3D human shape from a single low-resolution image

M Pesavento, M Volino, A Hilton - European Conference on Computer …, 2022 - Springer
We propose a novel framework to reconstruct super-resolution human shape from a single
low-resolution input image. The approach overcomes limitations of existing approaches that …