Rlipv2: Fast scaling of relational language-image pre-training

H Yuan, S Zhang, X Wang, S Albanie… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract Relational Language-Image Pre-training (RLIP) aims to align vision representations
with relational texts, thereby advancing the capability of relational reasoning in computer …

Re-mine, learn and reason: Exploring the cross-modal semantic correlations for language-guided hoi detection

Y Cao, Q Tang, F Yang, X Su, S You… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract Human-Object Interaction (HOI) detection is a challenging computer vision task that
requires visual models to address the complex interactive relationship between humans and …

From detection to understanding: A survey on representation learning for human-object interaction

T Luo, S Guan, R Yang, J Smith - Neurocomputing, 2023 - Elsevier
Abstract Human-Object Interaction (HOI) detection is a critical topic in the visual
understanding field. With the development of deep learning models, the research of HOI …

Mining cross-person cues for body-part interactiveness learning in hoi detection

X Wu, YL Li, X Liu, J Zhang, Y Wu, C Lu - European Conference on …, 2022 - Springer
Abstract Human-Object Interaction (HOI) detection plays a crucial role in activity
understanding. Though significant progress has been made, interactiveness learning …

Viplo: Vision transformer based pose-conditioned self-loop graph for human-object interaction detection

J Park, JW Park, JS Lee - … of the IEEE/CVF Conference on …, 2023 - openaccess.thecvf.com
Abstract Human-Object Interaction (HOI) detection, which localizes and infers relationships
between human and objects, plays an important role in scene understanding. Although two …

Agglomerative transformer for human-object interaction detection

D Tu, W Sun, G Zhai, W Shen - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
We propose an agglomerative Transformer (AGER) that enables Transformer-based human-
object interaction (HOI) detectors to flexibly exploit extra instance-level cues in a single …

Category query learning for human-object interaction classification

C Xie, F Zeng, Y Hu, S Liang… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Unlike most previous HOI methods that focus on learning better human-object features, we
propose a novel and complementary approach called category query learning. Such queries …

Unified visual relationship detection with vision and language models

L Zhao, L Yuan, B Gong, Y Cui… - Proceedings of the …, 2023 - openaccess.thecvf.com
This work focuses on training a single visual relationship detector predicting over the union
of label spaces from multiple datasets. Merging labels spanning different datasets could be …

Detecting any human-object interaction relationship: Universal hoi detector with spatial prompt learning on foundation models

Y Cao, Q Tang, X Su, S Chen, S You… - Advances in Neural …, 2024 - proceedings.neurips.cc
Human-object interaction (HOI) detection aims to comprehend the intricate relationships
between humans and objects, predicting triplets, and serving as the foundation for …

Neural-logic human-object interaction detection

L Li, J Wei, W Wang, Y Yang - Advances in Neural …, 2024 - proceedings.neurips.cc
The interaction decoder utilized in prevalent Transformer-based HOI detectors typically
accepts pre-composed human-object pairs as inputs. Though achieving remarkable …