Convolutional neural network architecture for geometric matching

X Jiang, J Ma, G Xiao, Z Shao, X Guo - Information Fusion, 2021 - Elsevier

Multimodal image matching, which refers to identifying and then corresponding the same or
similar structure/content from two or more images that are of significant modalities or …

被引用次数：321 相关文章所有 2 个版本

[PDF] acm.org

Few-shot object detection: A survey

S Antonelli, D Avola, L Cinque, D Crisostomi… - ACM Computing …, 2022 - dl.acm.org

Deep learning approaches have recently raised the bar in many fields, from Natural
Language Processing to Computer Vision, by leveraging large amounts of data. However …

被引用次数：71 相关文章所有 5 个版本

[PDF] neurips.cc

Emergent correspondence from image diffusion

L Tang, M Jia, Q Wang, CP Phoo… - Advances in Neural …, 2023 - proceedings.neurips.cc

Finding correspondences between images is a fundamental problem in computer vision. In
this paper, we show that correspondence emerges in image diffusion models without any …

被引用次数：166 相关文章所有 12 个版本

[PDF] neurips.cc

A tale of two features: Stable diffusion complements dino for zero-shot semantic correspondence

J Zhang, C Herrmann, J Hur… - Advances in …, 2024 - proceedings.neurips.cc

Text-to-image diffusion models have made significant advances in generating and editing
high-quality images. As a result, numerous approaches have explored the ability of diffusion …

被引用次数：63 相关文章所有 8 个版本

[PDF] arxiv.org

Cost aggregation with 4d convolutional swin transformer for few-shot segmentation

S Hong, S Cho, J Nam, S Lin, S Kim - European Conference on Computer …, 2022 - Springer

This paper presents a novel cost aggregation network, called Volumetric Aggregation with
Transformers (VAT), for few-shot segmentation. The use of transformers can benefit …

被引用次数：103 相关文章所有 5 个版本

[HTML] springer.com

[HTML][HTML] Image matching from handcrafted to deep features: A survey

J Ma, X Jiang, A Fan, J Jiang, J Yan - International Journal of Computer …, 2021 - Springer

As a fundamental and critical task in various visual applications, image matching can identify
then correspond the same or similar structure/content from two or more images. Over the …

被引用次数：747 相关文章所有 9 个版本

[PDF] thecvf.com

Hypercorrelation squeeze for few-shot segmentation

J Min, D Kang, M Cho - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com

Few-shot semantic segmentation aims at learning to segment a target object from a query
image using only a few annotated support images of the target class. This challenging task …

被引用次数：289 相关文章所有 6 个版本

[PDF] thecvf.com

Self-supervised learning of pretext-invariant representations

I Misra, L Maaten - … of the IEEE/CVF conference on …, 2020 - openaccess.thecvf.com

The goal of self-supervised learning from images is to construct image representations that
are semantically meaningful via pretext tasks that do not require semantic annotations. Many …

被引用次数：1565 相关文章所有 7 个版本

[PDF] thecvf.com

D2-net: A trainable cnn for joint description and detection of local features

M Dusmanu, I Rocco, T Pajdla… - Proceedings of the …, 2019 - openaccess.thecvf.com

In this work we address the problem of finding reliable pixel-level correspondences under
difficult imaging conditions. We propose an approach where a single convolutional neural …

被引用次数：714 相关文章所有 10 个版本

[PDF] arxiv.org

Nerf-supervision: Learning dense object descriptors from neural radiance fields

L Yen-Chen, P Florence, JT Barron… - … on robotics and …, 2022 - ieeexplore.ieee.org

Thin, reflective objects such as forks and whisks are common in our daily lives, but they are
particularly chal-lenging for robot perception because it is hard to reconstruct them using …

被引用次数：124 相关文章所有 5 个版本

高级搜索

QQ 群