On the synergies between machine learning and binocular stereo for depth estimation from images: a survey

M Poggi, F Tosi, K Batsos, P Mordohai… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Stereo matching is one of the longest-standing problems in computer vision with close to 40
years of studies and research. Throughout the years the paradigm has shifted from local …

Resolution-robust large mask inpainting with Fourier convolutions

R Suvorov, E Logacheva, A Mashikhin… - Proceedings of the …, 2022 - openaccess.thecvf.com
Modern image inpainting systems, despite the significant progress, often struggle with large
missing areas, complex geometric structures, and high-resolution images. We find that one …

Patch-NetVLAD: Multi-scale fusion of locally-global descriptors for place recognition

S Hausler, S Garg, M Xu, M Milford… - Proceedings of the …, 2021 - openaccess.thecvf.com
Visual Place Recognition is a challenging task for robotics and autonomous
systems, which must deal with the twin problems of appearance and viewpoint change in an …

The application of deep learning in stereo matching and disparity estimation: A bibliometric review

C Wang, X Cui, S Zhao, K Guo, Y Wang… - Expert Systems with …, 2024 - Elsevier
Estimating the depth of the 3D world from 2D images is a classic and important issue in
computer vision, which has been widely studied for decades. With the remarkable effect of …

TransVPR: Transformer-based place recognition with multi-level attention aggregation

R Wang, Y Shen, W Zuo, S Zhou… - Proceedings of the …, 2022 - openaccess.thecvf.com
Visual place recognition is a challenging task for applications such as autonomous driving
navigation and mobile robot localization. Distracting elements presenting in complex scenes …

Efficient regional memory network for video object segmentation

H Xie, H Yao, S Zhou, S Zhang… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Recently, several Space-Time Memory based networks have shown that the object
cues (e.g., video frames as well as the segmented object masks) from the past frames are …

Pri3D: Can 3D priors help 2D representation learning?

J Hou, S Xie, B Graham, A Dai… - Proceedings of the …, 2021 - openaccess.thecvf.com
Recent advances in 3D perception have shown impressive progress in understanding
geometric structures of 3D shapes and even scenes. Inspired by these advances in …

Dense dilated convolutions' merging network for land cover classification

Q Liu, M Kampffmeyer, R Jenssen… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Land cover classification of remote sensing images is a challenging task due to limited
amounts of annotated data, highly imbalanced classes, frequent incorrect pixel-level …

Densely connected multi-dilated convolutional networks for dense prediction tasks

N Takahashi, Y Mitsufuji - … of the IEEE/CVF conference on …, 2021 - openaccess.thecvf.com
Tasks that involve high-resolution dense prediction require a modeling of both local and
global patterns in a large input field. Although the local and global structures often depend …

DeFeat-Net: General monocular depth via simultaneous unsupervised representation learning

J Spencer, R Bowden… - Proceedings of the IEEE …, 2020 - openaccess.thecvf.com
In the current monocular depth research, the dominant approach is to employ unsupervised
training on large datasets, driven by warped photometric consistency. Such approaches lack …