G Zhu, L Zhang, Y Jiang, Y Dang, H Hou… - arXiv preprint arXiv …, 2022 - arxiv.org
Deep learning techniques have led to remarkable breakthroughs in the field of generic object detection and have spawned a lot of scene-understanding tasks in recent years …
Abstract We present the All-Seeing Project V2: a new model and dataset designed for understanding object relations in images. Specifically, we propose the All-Seeing Model V2 …
We propose to compose dynamic tree structures that place the objects in an image into a visual context, helping visual reasoning tasks such as scene graph generation and visual …
X Lin, C Ding, J Zeng, D Tao - Proceedings of the IEEE/CVF …, 2020 - openaccess.thecvf.com
Scene graph generation (SGG) aims to detect objects in an image along with their pairwise relationships. There are three key properties of scene graph that have been underexplored …
In compositional zero-shot learning, the goal is to recognize unseen compositions (eg old dog) of observed visual primitives states (eg old, cute) and objects (eg car, dog) in the …
Scene graphs---objects as nodes and visual relationships as edges---describe the whereabouts and interactions of objects in an image for comprehensive scene …
M Qi, W Li, Z Yang, Y Wang… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
Scene graph generation refers to the task of automatically mapping an image into a semantic structural graph, which requires correctly labeling each extracted object and their …
H Dhamo, A Farshad, I Laina… - Proceedings of the …, 2020 - openaccess.thecvf.com
Image manipulation can be considered a special case of image generation where the image to be produced is a modification of an existing image. Image generation and manipulation …
We seek to detect visual relations in images of the form of triplets t=(subject, predicate, object), such as" person riding dog", where training examples of the individual entities are …