Visual tuning

Gla-gcn: Global-local adaptive graph convolutional network for 3d human pose estimation from monocular video

BXB Yu, Z Zhang, Y Liu, S Zhong… - Proceedings of the …, 2023 - openaccess.thecvf.com

Abstract 3D human pose estimation has been researched for decades with promising fruits.
3D human pose lifting is one of the promising research directions toward the task where …

被引用次数：41 相关文章所有 5 个版本

[PDF] neurips.cc

Improving diffusion-based image synthesis with context prediction

L Yang, J Liu, S Hong, Z Zhang… - Advances in …, 2024 - proceedings.neurips.cc

Diffusion models are a new class of generative models, and have dramatically promoted
image generation with unprecedented quality and diversity. Existing diffusion models mainly …

被引用次数：17 相关文章所有 7 个版本

Parameter-efficient tuning of large-scale multimodal foundation model

H Wang, X Yang, J Chang, D Jin… - Advances in …, 2023 - proceedings.neurips.cc

Driven by the progress of large-scale pre-training, parameter-efficient transfer learning has
gained immense popularity across different subfields of Artificial Intelligence. The core is to …

被引用次数：20 相关文章所有 6 个版本

[PDF] arxiv.org

A new learning paradigm for foundation model-based remote-sensing change detection

K Li, X Cao, D Meng - IEEE Transactions on Geoscience and …, 2024 - ieeexplore.ieee.org

Change detection (CD) is a critical task to observe and analyze dynamic processes of land
cover. Although numerous deep-learning (DL)-based CD models have performed …

被引用次数：24 相关文章所有 4 个版本

[PDF] arxiv.org

Dualcoop++: Fast and effective adaptation to multi-label recognition with limited annotations

P Hu, X Sun, S Sclaroff… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

Multi-label image recognition in the low-label regime is a task of great challenge and
practical significance. Previous works have focused on learning the alignment between …

被引用次数：13 相关文章所有 5 个版本

[PDF] arxiv.org

SAMUS: Adapting segment anything model for clinically-friendly and generalizable ultrasound image segmentation

X Lin, Y Xiang, L Zhang, X Yang, Z Yan… - arXiv preprint arXiv …, 2023 - arxiv.org

Segment anything model (SAM), an eminent universal image segmentation model, has
recently gathered considerable attention within the domain of medical image segmentation …

被引用次数：40 相关文章所有 2 个版本

[PDF] aaai.org

Lion: Implicit vision prompt tuning

H Wang, J Chang, Y Zhai, X Luo, J Sun, Z Lin… - Proceedings of the …, 2024 - ojs.aaai.org

Despite recent promising performances across a range of vision tasks, vision Transformers
still have an issue of high computational costs. Recently, vision prompt learning has …

被引用次数：14 相关文章所有 4 个版本

[PDF] mdpi.com

Deep Learning for Abnormal Human Behavior Detection in Surveillance Videos—A Survey

LM Wastupranata, SG Kong, L Wang - Electronics, 2024 - mdpi.com

Detecting abnormal human behaviors in surveillance videos is crucial for various domains,
including security and public safety. Many successful detection techniques based on deep …

被引用次数：1 相关文章

[PDF] arxiv.org

Dynamic tuning towards parameter and inference efficiency for vit adaptation

W Zhao, J Tang, Y Han, Y Song, K Wang… - arXiv preprint arXiv …, 2024 - arxiv.org

Existing parameter-efficient fine-tuning (PEFT) methods have achieved significant success
on vision transformers (ViTs) adaptation by improving parameter efficiency. However, the …

被引用次数：6 相关文章所有 2 个版本

[PDF] arxiv.org

Parameter-efficient long-tailed recognition

JX Shi, T Wei, Z Zhou, XY Han, JJ Shao… - arXiv preprint arXiv …, 2023 - arxiv.org

The" pre-training and fine-tuning" paradigm in addressing long-tailed recognition tasks has
sparked significant interest since the emergence of large vision-language models like the …

被引用次数：11 相关文章所有 4 个版本

高级搜索

QQ 群

Gla-gcn: Global-local adaptive graph convolutional network for 3d human pose estimation from monocular video

Improving diffusion-based image synthesis with context prediction

Parameter-efficient tuning of large-scale multimodal foundation model

A new learning paradigm for foundation model-based remote-sensing change detection

Dualcoop++: Fast and effective adaptation to multi-label recognition with limited annotations

SAMUS: Adapting segment anything model for clinically-friendly and generalizable ultrasound image segmentation

Lion: Implicit vision prompt tuning

Deep Learning for Abnormal Human Behavior Detection in Surveillance Videos—A Survey

Dynamic tuning towards parameter and inference efficiency for vit adaptation

Parameter-efficient long-tailed recognition

引用