Super-clevr: A virtual benchmark to diagnose domain robustness in visual reasoning

Z Li, X Wang, E Stengel-Eskin… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract Visual Question Answering (VQA) models often perform poorly on out-of-distribution
data and struggle on domain generalization. Due to the multi-modal nature of this task …

3d-aware visual question answering about parts, poses and occlusions

X Wang, W Ma, Z Li, A Kortylewski… - Advances in Neural …, 2024 - proceedings.neurips.cc
Despite rapid progress in Visual question answering (\textit {VQA}), existing datasets and
models mainly focus on testing reasoning in 2D. However, it is important that VQA models …

EventDance: Unsupervised Source-free Cross-modal Adaptation for Event-based Object Recognition

X Zheng, L Wang - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
In this paper we make the first attempt at achieving the cross-modal (ie image-to-events)
adaptation for event-based object recognition without accessing any labeled source image …

Towards open-world segmentation of parts

TY Pan, Q Liu, WL Chao… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Segmenting object parts such as cup handles and animal bodies is important in many real-
world applications but requires more annotation effort. The largest dataset nowadays …

Task-aligned Part-aware Panoptic Segmentation through Joint Object-Part Representations

D de Geus, G Dubbelman - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
Part-aware panoptic segmentation (PPS) requires (a) that each foreground object and
background region in an image is segmented and classified and (b) that all parts within …

Progressive transformation learning for leveraging virtual images in training

YT Shen, H Lee, H Kwon… - Proceedings of the …, 2023 - openaccess.thecvf.com
To effectively interrogate UAV-based images for detecting objects of interest, such as
humans, it is essential to acquire large-scale UAV-based datasets that include human …

A Bayesian Approach to OOD Robustness in Image Classification

P Kaushik, A Kortylewski… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
An important and unsolved problem in computer vision is to ensure that the algorithms are
robust to changes in image domains. We address this problem in the scenario where we …

Parsing Objects at a Finer Granularity: A Survey

Y Zhao, J Li, Y Tian - Machine Intelligence Research, 2024 - Springer
Fine-grained visual parsing, including fine-grained part segmentation and fine-grained
object recognition, has attracted considerable critical attention due to its importance in many …

[HTML][HTML] Part2Point: A Part-Oriented Point Cloud Reconstruction Framework

YC Feng, SY Zeng, TY Liang - Sensors, 2023 - mdpi.com
Three-dimensional object modeling is necessary for developing virtual and augmented
reality applications. Traditionally, application engineers must manually use art software to …

Learning from Synthetic Human Group Activities

CJ Chang, D Li, D Patel, P Goel… - Proceedings of the …, 2024 - openaccess.thecvf.com
The study of complex human interactions and group activities has become a focal point in
human-centric computer vision. However progress in related tasks is often hindered by the …