A review of multimodal image matching: Methods and applications

X Jiang, J Ma, G Xiao, Z Shao, X Guo - Information Fusion, 2021 - Elsevier
Multimodal image matching, which refers to identifying and then corresponding the same or
similar structure/content from two or more images that are of significant modalities or …

Few-shot object detection: A survey

S Antonelli, D Avola, L Cinque, D Crisostomi… - ACM Computing …, 2022 - dl.acm.org
Deep learning approaches have recently raised the bar in many fields, from Natural
Language Processing to Computer Vision, by leveraging large amounts of data. However …

Emergent correspondence from image diffusion

L Tang, M Jia, Q Wang, CP Phoo… - Advances in Neural …, 2023 - proceedings.neurips.cc
Finding correspondences between images is a fundamental problem in computer vision. In
this paper, we show that correspondence emerges in image diffusion models without any …

A tale of two features: Stable diffusion complements dino for zero-shot semantic correspondence

J Zhang, C Herrmann, J Hur… - Advances in …, 2024 - proceedings.neurips.cc
Text-to-image diffusion models have made significant advances in generating and editing
high-quality images. As a result, numerous approaches have explored the ability of diffusion …

Cost aggregation with 4d convolutional swin transformer for few-shot segmentation

S Hong, S Cho, J Nam, S Lin, S Kim - European Conference on Computer …, 2022 - Springer
This paper presents a novel cost aggregation network, called Volumetric Aggregation with
Transformers (VAT), for few-shot segmentation. The use of transformers can benefit …

[HTML][HTML] Image matching from handcrafted to deep features: A survey

J Ma, X Jiang, A Fan, J Jiang, J Yan - International Journal of Computer …, 2021 - Springer
As a fundamental and critical task in various visual applications, image matching can identify
then correspond the same or similar structure/content from two or more images. Over the …

Hypercorrelation squeeze for few-shot segmentation

J Min, D Kang, M Cho - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com
Few-shot semantic segmentation aims at learning to segment a target object from a query
image using only a few annotated support images of the target class. This challenging task …

Self-supervised learning of pretext-invariant representations

I Misra, L Maaten - … of the IEEE/CVF conference on …, 2020 - openaccess.thecvf.com
The goal of self-supervised learning from images is to construct image representations that
are semantically meaningful via pretext tasks that do not require semantic annotations. Many …

D2-net: A trainable cnn for joint description and detection of local features

M Dusmanu, I Rocco, T Pajdla… - Proceedings of the …, 2019 - openaccess.thecvf.com
In this work we address the problem of finding reliable pixel-level correspondences under
difficult imaging conditions. We propose an approach where a single convolutional neural …

Nerf-supervision: Learning dense object descriptors from neural radiance fields

L Yen-Chen, P Florence, JT Barron… - … on robotics and …, 2022 - ieeexplore.ieee.org
Thin, reflective objects such as forks and whisks are common in our daily lives, but they are
particularly chal-lenging for robot perception because it is hard to reconstruct them using …