Visual segmentation seeks to partition images, video frames, or point clouds into multiple segments or groups. This technique has numerous real-world applications, such as …
In the field of visual scene understanding, deep neural networks have made impressive advancements in various core tasks like segmentation, tracking, and detection. However …
Human-centric perceptions (eg, pose estimation, human parsing, pedestrian detection, person re-identification, etc.) play a key role in industrial applications of visual models. While …
Detection Transformer (DETR) and Deformable DETR have been proposed to eliminate the need for many hand-designed components in object detection while demonstrating good …
In this work, we focus on open vocabulary instance segmentation to expand a segmentation model to classify and segment instance-level novel categories. Previous approaches have …
X Wang, Z Fang, X Li, X Li… - Proceedings of the …, 2024 - openaccess.thecvf.com
In-context learning provides a new perspective for multi-task modeling for vision and NLP. Under this setting the model can perceive tasks from prompts and accomplish them without …
Referring Image Segmentation (RIS) is a fundamental vision-language task that outputs object masks based on text descriptions. Many works have achieved considerable progress …
Y Xu, X Li, H Yuan, Y Yang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Previous multi-task dense prediction studies developed complex pipelines such as multi- modal distillations in multiple stages or searching for task relational contexts for each task …
M Li, C Wang, W Feng, S Lyu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Visual Grounding (VG) aims at localizing target objects from an image based on given expressions and has made significant progress with the development of detection and vision …