Gla-gcn: Global-local adaptive graph convolutional network for 3d human pose estimation from monocular video

BXB Yu, Z Zhang, Y Liu, S Zhong… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract 3D human pose estimation has been researched for decades with promising fruits.
3D human pose lifting is one of the promising research directions toward the task where …

Improving diffusion-based image synthesis with context prediction

L Yang, J Liu, S Hong, Z Zhang… - Advances in …, 2024 - proceedings.neurips.cc
Diffusion models are a new class of generative models, and have dramatically promoted
image generation with unprecedented quality and diversity. Existing diffusion models mainly …

Parameter-efficient tuning of large-scale multimodal foundation model

H Wang, X Yang, J Chang, D Jin… - Advances in …, 2023 - proceedings.neurips.cc
Driven by the progress of large-scale pre-training, parameter-efficient transfer learning has
gained immense popularity across different subfields of Artificial Intelligence. The core is to …

A new learning paradigm for foundation model-based remote-sensing change detection

K Li, X Cao, D Meng - IEEE Transactions on Geoscience and …, 2024 - ieeexplore.ieee.org
Change detection (CD) is a critical task to observe and analyze dynamic processes of land
cover. Although numerous deep-learning (DL)-based CD models have performed …

Dualcoop++: Fast and effective adaptation to multi-label recognition with limited annotations

P Hu, X Sun, S Sclaroff… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Multi-label image recognition in the low-label regime is a task of great challenge and
practical significance. Previous works have focused on learning the alignment between …

SAMUS: Adapting segment anything model for clinically-friendly and generalizable ultrasound image segmentation

X Lin, Y Xiang, L Zhang, X Yang, Z Yan… - arXiv preprint arXiv …, 2023 - arxiv.org
Segment anything model (SAM), an eminent universal image segmentation model, has
recently gathered considerable attention within the domain of medical image segmentation …

Lion: Implicit vision prompt tuning

H Wang, J Chang, Y Zhai, X Luo, J Sun, Z Lin… - Proceedings of the …, 2024 - ojs.aaai.org
Despite recent promising performances across a range of vision tasks, vision Transformers
still have an issue of high computational costs. Recently, vision prompt learning has …

Deep Learning for Abnormal Human Behavior Detection in Surveillance Videos—A Survey

LM Wastupranata, SG Kong, L Wang - Electronics, 2024 - mdpi.com
Detecting abnormal human behaviors in surveillance videos is crucial for various domains,
including security and public safety. Many successful detection techniques based on deep …

Dynamic tuning towards parameter and inference efficiency for vit adaptation

W Zhao, J Tang, Y Han, Y Song, K Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
Existing parameter-efficient fine-tuning (PEFT) methods have achieved significant success
on vision transformers (ViTs) adaptation by improving parameter efficiency. However, the …

Parameter-efficient long-tailed recognition

JX Shi, T Wei, Z Zhou, XY Han, JJ Shao… - arXiv preprint arXiv …, 2023 - arxiv.org
The" pre-training and fine-tuning" paradigm in addressing long-tailed recognition tasks has
sparked significant interest since the emergence of large vision-language models like the …