[HTML][HTML] Factor graphs: Exploiting structure in robotics

F Dellaert - Annual Review of Control, Robotics, and …, 2021 - annualreviews.org
Many estimation, planning, and optimal control problems in robotics have an optimization
problem at their core. In most of these optimization problems, the objective to be maximized …

Raft: Recurrent all-pairs field transforms for optical flow

Z Teed, J Deng - Computer Vision–ECCV 2020: 16th European …, 2020 - Springer
Abstract We introduce Recurrent All-Pairs Field Transforms (RAFT), a new deep network
architecture for optical flow. RAFT extracts per-pixel features, builds multi-scale 4D …

Theseus: A library for differentiable nonlinear optimization

L Pineda, T Fan, M Monge… - Advances in …, 2022 - proceedings.neurips.cc
We present Theseus, an efficient application-agnostic open source library for differentiable
nonlinear least squares (DNLS) optimization built on PyTorch, providing a common …

Back to the feature: Learning robust camera localization from pixels to pose

PE Sarlin, A Unagar, M Larsson… - Proceedings of the …, 2021 - openaccess.thecvf.com
Camera pose estimation in known scenes is a 3D geometry task recently tackled by multiple
learning algorithms. Many regress precise geometric quantities, like poses or 3D points …

Pixel-perfect structure-from-motion with featuremetric refinement

P Lindenberger, PE Sarlin… - Proceedings of the …, 2021 - openaccess.thecvf.com
Finding local features that are repeatable across multiple views is a cornerstone of sparse
3D reconstruction. The classical image matching paradigm detects keypoints per-image …

Feature-metric registration: A fast semi-supervised approach for robust point cloud registration without correspondences

X Huang, G Mei, J Zhang - … of the IEEE/CVF conference on …, 2020 - openaccess.thecvf.com
We present a fast feature-metric point cloud registration framework, which enforces the
optimisation of registration by minimising a feature-metric projection error without …

Overview of deep learning application on visual SLAM

S Li, D Zhang, Y Xian, B Li, T Zhang, C Zhong - Displays, 2022 - Elsevier
Virtual reality, augmented reality, robotics, and autonomous driving have attracted much
attention from the academic and industrial communities, in which visual SLAM (VSLAM) is a …

Fast-mvsnet: Sparse-to-dense multi-view stereo with learned propagation and gauss-newton refinement

Z Yu, S Gao - Proceedings of the IEEE/CVF conference on …, 2020 - openaccess.thecvf.com
Almost all previous deep learning-based multi-view stereo (MVS) approaches focus on
improving reconstruction quality. Besides quality, efficiency is also a desirable feature for …

Refit: Recurrent fitting network for 3d human recovery

Y Wang, K Daniilidis - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
Abstract We present Recurrent Fitting (ReFit), a neural network architecture for single-image,
parametric 3D human reconstruction. ReFit learns a feedback-update loop that mirrors the …

Pointnetlk revisited

X Li, JK Pontes, S Lucey - … of the IEEE/CVF conference on …, 2021 - openaccess.thecvf.com
We address the generalization ability of recent learning-based point cloud registration
methods. Despite their success, these approaches tend to have poor performance when …