Fine-grained car detection for visual census estimation

M Harradon, J Druce, B Ruttenberg - arXiv preprint arXiv:1802.00541, 2018 - arxiv.org

Deep neural networks are complex and opaque. As they enter application in a variety of
important and safety critical domains, users seek methods to explain their output predictions …

被引用次数：95 相关文章所有 3 个版本

Res-tuning: A flexible and efficient tuning paradigm via unbinding tuner from backbone

Z Jiang, C Mao, Z Huang, A Ma, Y Lv… - Advances in …, 2024 - proceedings.neurips.cc

Parameter-efficient tuning has become a trend in transferring large-scale foundation models
to downstream applications. Existing methods typically embed some light-weight tuners into …

被引用次数：8 相关文章所有 6 个版本

[PDF] aaai.org

Lion: Implicit vision prompt tuning

H Wang, J Chang, Y Zhai, X Luo, J Sun, Z Lin… - Proceedings of the …, 2024 - ojs.aaai.org

Despite recent promising performances across a range of vision tasks, vision Transformers
still have an issue of high computational costs. Recently, vision prompt learning has …

被引用次数：10 相关文章所有 4 个版本

[PDF] arxiv.org

A hierarchical grocery store image dataset with visual and semantic labels

M Klasson, C Zhang, H Kjellström - 2019 IEEE winter …, 2019 - ieeexplore.ieee.org

Image classification models built into visual support systems and other assistive devices
need to provide accurate predictions about their environment. We focus on an application of …

被引用次数：66 相关文章所有 8 个版本

[PDF] arxiv.org

Incidents1M: a large-scale dataset of images with natural disasters, damage, and incidents

E Weber, DP Papadopoulos… - IEEE transactions on …, 2022 - ieeexplore.ieee.org

Natural disasters, such as floods, tornadoes, or wildfires, are increasingly pervasive as the
Earth undergoes global warming. It is difficult to predict when and where an incident will …

被引用次数：22 相关文章所有 11 个版本

[PDF] mdpi.com

Prototyping a social media flooding photo screening system based on deep learning

H Ning, Z Li, ME Hodgson, C Wang - ISPRS international journal of geo …, 2020 - mdpi.com

This article aims to implement a prototype screening system to identify flooding-related
photos from social media. These photos, associated with their geographic locations, can …

被引用次数：47 相关文章所有 10 个版本

[PDF] arxiv.org

Exploring fine-grained audiovisual categorization with the ssw60 dataset

G Van Horn, R Qian, K Wilber, H Adam… - … on Computer Vision, 2022 - Springer

We present a new benchmark dataset, Sapsucker Woods 60 (SSW60), for advancing
research on audiovisual fine-grained categorization. While our community has made great …

被引用次数：10 相关文章所有 7 个版本

Deep listwise triplet hashing for fine-grained image retrieval

Y Liang, Y Pan, H Lai, W Liu… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org

Hashing is a practical approach for the approximate nearest neighbor search. Deep hashing
methods, which train deep networks to generate compact and similarity-preserving binary …

被引用次数：21 相关文章所有 5 个版本

[PDF] arxiv.org

Facing the Elephant in the Room: Visual Prompt Tuning or Full Finetuning?

C Han, Q Wang, Y Cui, W Wang, L Huang, S Qi… - arXiv preprint arXiv …, 2024 - arxiv.org

As the scale of vision models continues to grow, the emergence of Visual Prompt Tuning
(VPT) as a parameter-efficient transfer learning technique has gained attention due to its …

被引用次数：6 相关文章所有 3 个版本

[PDF] arxiv.org

Dynamic tuning towards parameter and inference efficiency for vit adaptation

W Zhao, J Tang, Y Han, Y Song, K Wang… - arXiv preprint arXiv …, 2024 - arxiv.org

Existing parameter-efficient fine-tuning (PEFT) methods have achieved significant success
on vision transformers (ViTs) adaptation by improving parameter efficiency. However, the …

被引用次数：3 相关文章所有 2 个版本

高级搜索

QQ 群