Visual tuning

BXB Yu, J Chang, H Wang, L Liu, S Wang… - ACM Computing …, 2024 - dl.acm.org
Fine-tuning visual models has been widely shown promising performance on many
downstream visual tasks. With the surprising development of pre-trained visual foundation …

Being comes from not-being: Open-vocabulary text-to-motion generation with wordless training

J Lin, J Chang, L Liu, G Li, L Lin… - Proceedings of the …, 2023 - openaccess.thecvf.com
Text-to-motion generation is an emerging and challenging problem, which aims to
synthesize motion with the same semantics as the input text. However, due to the lack of …

Open-set fine-grained retrieval via prompting vision-language evaluator

S Wang, J Chang, H Li, Z Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Open-set fine-grained retrieval is an emerging challenge that requires an extra capability to
retrieve unknown subcategories during evaluation. However, current works are rooted in the …

Sct: A simple baseline for parameter-efficient fine-tuning via salient channels

HH Zhao, P Wang, Y Zhao, H Luo, F Wang… - International Journal of …, 2024 - Springer
Pre-trained vision transformers have strong representations benefit to various downstream
tasks. Recently many parameter-efficient fine-tuning (PEFT) methods have been proposed …

Learning to parameterize visual attributes for open-set fine-grained retrieval

S Wang, J Chang, H Li, Z Wang… - Advances in Neural …, 2023 - proceedings.neurips.cc
Open-set fine-grained retrieval is an emerging challenging task that allows to retrieve
unknown categories beyond the training set. The best solution for handling unknown …

Lambo: Large language model empowered edge intelligence

L Dong, F Jiang, Y Peng, K Wang, K Yang… - arXiv preprint arXiv …, 2023 - arxiv.org
Next-generation edge intelligence is anticipated to bring huge benefits to various
applications, eg, offloading systems. However, traditional deep offloading architectures face …

Upetu: A unified parameter-efficient fine-tuning framework for remote sensing foundation model

Z Dong, Y Gu, T Liu - IEEE Transactions on Geoscience and …, 2024 - ieeexplore.ieee.org
Recent advancements in remote sensing foundation models have unveiled their
tremendous potential in addressing Earth observation tasks. Presently, when large-scale …

Compound text-guided prompt tuning via image-adaptive cues

H Tan, J Li, Y Zhou, J Wan, Z Lei… - Proceedings of the AAAI …, 2024 - ojs.aaai.org
Vision-Language Models (VLMs) such as CLIP have demonstrated remarkable
generalization capabilities to downstream tasks. However, existing prompt tuning based …

Fine-Grained Self-Supervised Learning with Jigsaw puzzles for medical image classification

W Park, J Ryu - Computers in Biology and Medicine, 2024 - Elsevier
Classifying fine-grained lesions is challenging due to minor and subtle differences in
medical images. This is because learning features of fine-grained lesions with highly minor …

Revisit parameter-efficient transfer learning: A two-stage paradigm

H Zhao, H Luo, Y Zhao, P Wang, F Wang… - arXiv preprint arXiv …, 2023 - arxiv.org
Parameter-Efficient Transfer Learning (PETL) aims at efficiently adapting large models pre-
trained on massive data to downstream tasks with limited task-specific data. In view of the …