E^2VPT: An Effective and Efficient Approach for Visual Prompt Tuning

Z Qin, C Han, Q Wang, X Nie, Y Yin… - Advances in Neural …, 2023 - proceedings.neurips.cc

The task of point cloud segmentation, comprising semantic, instance, and panoptic
segmentation, has been mainly tackled by designing task-specific network architectures …

Cited by 14 Related articles All 3 versions

[PDF] thecvf.com

Promptkd: Unsupervised prompt distillation for vision-language models

Z Li, X Li, X Fu, X Zhang, W Wang… - Proceedings of the …, 2024 - openaccess.thecvf.com

Prompt learning has emerged as a valuable technique in enhancing vision-language
models (VLMs) such as CLIP for downstream tasks in specific domains. Existing work mainly …

Cited by 4 Related articles All 4 versions

[PDF] aclanthology.org

Aprompt: Attention prompt tuning for efficient adaptation of pre-trained language models

Q Wang, Y Mao, J Wang, H Yu, S Nie… - Proceedings of the …, 2023 - aclanthology.org

With the continuous growth of large language models, the process of fine-tuning these
models for new tasks has become increasingly parameter-intensive. Prompt tuning, a …

Cited by 7 Related articles All 2 versions

[PDF] arxiv.org

Image translation as diffusion visual programmers

C Han, JC Liang, Q Wang, M Rabbani, S Dianat… - arXiv preprint arXiv …, 2024 - arxiv.org

We introduce the novel Diffusion Visual Programmer (DVP), a neuro-symbolic image
translation framework. Our proposed DVP seamlessly embeds a condition-flexible diffusion …

Cited by 6 Related articles All 3 versions

[HTML] sciencedirect.com

[HTML] Ov-vg: A benchmark for open-vocabulary visual grounding

C Wang, W Feng, X Li, G Cheng, S Lyu, B Liu, L Chen… - Neurocomputing, 2024 - Elsevier

Open-vocabulary learning has emerged as a cutting-edge research area, particularly in light
of the widespread adoption of vision-based foundational models. Its primary objective is to …

Cited by 4 Related articles All 3 versions

[PDF] arxiv.org

Facing the Elephant in the Room: Visual Prompt Tuning or Full Finetuning?

C Han, Q Wang, Y Cui, W Wang, L Huang, S Qi… - arXiv preprint arXiv …, 2024 - arxiv.org

As the scale of vision models continues to grow, the emergence of Visual Prompt Tuning
(VPT) as a parameter-efficient transfer learning technique has gained attention due to its …

Cited by 7 Related articles All 3 versions

The improved YOLOv8 algorithm based on EMSPConv and SPE-head modules

G Wen, M Li, Y Luo, C Shi, Y Tan - Multimedia Tools and Applications, 2024 - Springer

Addressing the challenges of high model complexity, low generalization capability, and
suboptimal detection performance in most algorithms for crop leaf disease detection, the …

Cited by 10 Related articles

[PDF] arxiv.org

Efficient multimodal semantic segmentation via dual-prompt learning

S Dong, Y Feng, Q Yang, Y Huang, D Liu… - arXiv preprint arXiv …, 2023 - arxiv.org

Multimodal (e.g., RGB-Depth/RGB-Thermal) fusion has shown great potential for improving
semantic segmentation in complex scenes (e.g., indoor/low-light conditions). Existing …

Cited by 5 Related articles All 2 versions

[PDF] thecvf.com

ViLa-MIL: Dual-scale Vision-Language Multiple Instance Learning for Whole Slide Image Classification

J Shi, C Li, T Gong, Y Zheng… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com

Multiple instance learning (MIL)-based framework has become the mainstream for
processing the whole slide image (WSI) with giga-pixel size and hierarchical image context …

Cited by 1 Related articles

[PDF] arxiv.org

Unsupervised Domain Adaption Harnessing Vision-Language Pre-training

W Zhou, Z Zhou - IEEE Transactions on Circuits and Systems …, 2024 - ieeexplore.ieee.org

This paper addresses two vital challenges in Unsupervised Domain Adaptation (UDA) with a
focus on harnessing the power of Vision-Language Pre-training (VLP) models. Firstly, UDA …

Cited by 3 Related articles
