A survey on graph neural networks and graph transformers in computer vision: A task-oriented perspective

C Chen, Y Wu, Q Dai, HY Zhou, M Xu… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org
Graph Neural Networks (GNNs) have gained momentum in graph representation learning
and boosted the state of the art in a variety of areas, such as data mining (eg, social network …

Graph representation learning meets computer vision: A survey

L Jiao, J Chen, F Liu, S Yang, C You… - IEEE Transactions …, 2022 - ieeexplore.ieee.org
A graph structure is a powerful mathematical abstraction, which can not only represent
information about individuals but also capture the interactions between individuals for …

Graph neural networks: foundation, frontiers and applications

L Wu, P Cui, J Pei, L Zhao, X Guo - … of the 28th ACM SIGKDD Conference …, 2022 - dl.acm.org
The field of graph neural networks (GNNs) has seen rapid and incredible strides over the
recent years. Graph neural networks, also known as deep learning on graphs, graph …

Modulated graph convolutional network for 3D human pose estimation

Z Zou, W Tang - Proceedings of the IEEE/CVF international …, 2021 - openaccess.thecvf.com
The graph convolutional network (GCN) has recently achieved promising performance of 3D
human pose estimation (HPE) by modeling the relationship among body parts. However …

Derivation of prognostic contextual histopathological features from whole-slide images of tumours via graph deep learning

Y Lee, JH Park, S Oh, K Shin, J Sun, M Jung… - Nature Biomedical …, 2022 - nature.com
Methods of computational pathology applied to the analysis of whole-slide images (WSIs) do
not typically consider histopathological features from the tumour microenvironment. Here …

Bridging knowledge graphs to generate scene graphs

A Zareian, S Karaman, SF Chang - … , Glasgow, UK, August 23–28, 2020 …, 2020 - Springer
Scene graphs are powerful representations that parse images into their abstract semantic
elements, ie, objects and their interactions, which facilitates visual comprehension and …

Meta-learning approaches for learning-to-learn in deep learning: A survey

Y Tian, X Zhao, W Huang - Neurocomputing, 2022 - Elsevier
Compared to traditional machine learning, deep learning can learn deeper abstract data
representation and understand scattered data properties. It has gained considerable …

IDRNet: Intervention-driven relation network for semantic segmentation

Z Jin, X Hu, L Zhu, L Song… - Advances in Neural …, 2024 - proceedings.neurips.cc
Co-occurrent visual patterns suggest that pixel relation modeling facilitates dense prediction
tasks, which inspires the development of numerous context modeling paradigms,\emph {eg} …

Modeling image composition for complex scene generation

Z Yang, D Liu, C Wang, J Yang… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
We present a method that achieves state-of-the-art results on challenging (few-shot) layout-
to-image generation tasks by accurately modeling textures, structures and relationships …

Graph convolutional module for temporal action localization in videos

R Zeng, W Huang, M Tan, Y Rong… - … on Pattern Analysis …, 2021 - ieeexplore.ieee.org
Temporal action localization, which requires a machine to recognize the location as well as
the category of action instances in videos, has long been researched in computer vision …