Tensorize, factorize and regularize: Robust visual relationship learning

X Chang, P Ren, P Xu, Z Li, X Chen… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org

Scene graph is a structured representation of a scene that can clearly express the objects,
attributes, and relationships between objects in the scene. As computer vision technology …

被引用次数：351 相关文章所有 15 个版本

[PDF] arxiv.org

Scene graph generation: A comprehensive survey

G Zhu, L Zhang, Y Jiang, Y Dang, H Hou… - arXiv preprint arXiv …, 2022 - arxiv.org

Deep learning techniques have led to remarkable breakthroughs in the field of generic
object detection and have spawned a lot of scene-understanding tasks in recent years …

被引用次数：73 相关文章所有 2 个版本

[PDF] arxiv.org

The all-seeing project v2: Towards general relation comprehension of the open world

W Wang, Y Ren, H Luo, T Li, C Yan, Z Chen… - … on Computer Vision, 2025 - Springer

Abstract We present the All-Seeing Project V2: a new model and dataset designed for
understanding object relations in images. Specifically, we propose the All-Seeing Model V2 …

被引用次数：31 相关文章所有 3 个版本

[PDF] thecvf.com

Learning to compose dynamic tree structures for visual contexts

K Tang, H Zhang, B Wu, W Luo… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com

We propose to compose dynamic tree structures that place the objects in an image into a
visual context, helping visual reasoning tasks such as scene graph generation and visual …

被引用次数：563 相关文章所有 7 个版本

[PDF] thecvf.com

Gps-net: Graph property sensing network for scene graph generation

X Lin, C Ding, J Zeng, D Tao - Proceedings of the IEEE/CVF …, 2020 - openaccess.thecvf.com

Scene graph generation (SGG) aims to detect objects in an image along with their pairwise
relationships. There are three key properties of scene graph that have been underexplored …

被引用次数：290 相关文章所有 7 个版本

[PDF] thecvf.com

Learning graph embeddings for compositional zero-shot learning

MF Naeem, Y Xian, F Tombari… - Proceedings of the …, 2021 - openaccess.thecvf.com

In compositional zero-shot learning, the goal is to recognize unseen compositions (eg old
dog) of observed visual primitives states (eg old, cute) and objects (eg car, dog) in the …

被引用次数：176 相关文章所有 8 个版本

[PDF] thecvf.com

Counterfactual critic multi-agent training for scene graph generation

L Chen, H Zhang, J Xiao, X He… - Proceedings of the …, 2019 - openaccess.thecvf.com

Scene graphs---objects as nodes and visual relationships as edges---describe the
whereabouts and interactions of objects in an image for comprehensive scene …

被引用次数：195 相关文章所有 6 个版本

[PDF] thecvf.com

Attentive relational networks for mapping images to scene graphs

M Qi, W Li, Z Yang, Y Wang… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com

Scene graph generation refers to the task of automatically mapping an image into a
semantic structural graph, which requires correctly labeling each extracted object and their …

被引用次数：184 相关文章所有 6 个版本

[PDF] thecvf.com

Semantic image manipulation using scene graphs

H Dhamo, A Farshad, I Laina… - Proceedings of the …, 2020 - openaccess.thecvf.com

Image manipulation can be considered a special case of image generation where the image
to be produced is a modification of an existing image. Image generation and manipulation …

被引用次数：133 相关文章所有 13 个版本

[PDF] thecvf.com

Detecting unseen visual relations using analogies

J Peyre, I Laptev, C Schmid… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com

We seek to detect visual relations in images of the form of triplets t=(subject, predicate,
object), such as" person riding dog", where training examples of the individual entities are …

被引用次数：162 相关文章所有 12 个版本

高级搜索

QQ 群