Hypergraph attention networks for multimodal learning

A survey on graph neural networks and graph transformers in computer vision: A task-oriented perspective

C Chen, Y Wu, Q Dai, HY Zhou, M Xu… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org

Graph Neural Networks (GNNs) have gained momentum in graph representation learning
and boosted the state of the art in a variety of areas, such as data mining (eg, social network …

被引用次数：71 相关文章所有 3 个版本

[PDF] researchgate.net

An analysis of graph convolutional networks and recent datasets for visual question answering

AA Yusuf, F Chong, M Xianling - Artificial Intelligence Review, 2022 - Springer

Graph neural network is a deep learning approach widely applied on structural and non-
structural scenarios due to its substantial performance and interpretability recently. In a non …

被引用次数：46 相关文章所有 5 个版本

[PDF] arxiv.org

Superhypergraph neural networks and plithogenic graph neural networks: Theoretical foundations

T Fujita - arXiv preprint arXiv:2412.01176, 2024 - arxiv.org

Hypergraphs extend traditional graphs by allowing edges to connect multiple nodes, while
superhypergraphs further generalize this concept to represent even more complex …

被引用次数：57 相关文章所有 5 个版本

[PDF] researchgate.net

Attention, please! A survey of neural attention models in deep learning

A de Santana Correia, EL Colombini - Artificial Intelligence Review, 2022 - Springer

In humans, Attention is a core property of all perceptual and cognitive operations. Given our
limited ability to process competing sources, attention mechanisms select, modulate, and …

被引用次数：225 相关文章所有 8 个版本

Multimodal remote sensing image segmentation with intuition-inspired hypergraph modeling

Q He, X Sun, W Diao, Z Yan, F Yao… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

Multimodal remote sensing (RS) image segmentation aims to comprehensively utilize
multiple RS modalities to assign pixel-level semantics to the studied scenes, which can …

被引用次数：53 相关文章所有 6 个版本

Multimodal federated learning: Concept, methods, applications and future directions

W Huang, D Wang, X Ouyang, J Wan, J Liu, T Li - Information Fusion, 2024 - Elsevier

Multimodal learning mines and analyzes multimodal data in reality to better understand and
appreciate the world around people. However, how to exploit this rich multimodal data …

被引用次数：8 相关文章所有 2 个版本

[PDF] arxiv.org

Hypergraph transformer for skeleton-based action recognition

Y Zhou, ZQ Cheng, C Li, Y Fang, Y Geng, X Xie… - arXiv preprint arXiv …, 2022 - arxiv.org

Skeleton-based action recognition aims to recognize human actions given human joint
coordinates with skeletal interconnections. By defining a graph with joints as vertices and …

被引用次数：74 相关文章所有 5 个版本

[PDF] google.com

Continual image deraining with hypergraph convolutional networks

X Fu, J Xiao, Y Zhu, A Liu, F Wu… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

Image deraining is a challenging task since rain streaks have the characteristics of a
spatially long structure and have a complex diversity. Existing deep learning-based methods …

被引用次数：43 相关文章所有 6 个版本

[PDF] thecvf.com

Coarse-to-fine reasoning for visual question answering

BX Nguyen, T Do, H Tran, E Tjiputra… - Proceedings of the …, 2022 - openaccess.thecvf.com

Bridging the semantic gap between image and question is an important step to improve the
accuracy of the Visual Question Answering (VQA) task. However, most of the existing VQA …

被引用次数：63 相关文章所有 10 个版本

Sigma++: Improved semantic-complete graph matching for domain adaptive object detection

W Li, X Liu, Y Yuan - IEEE Transactions on Pattern Analysis …, 2023 - ieeexplore.ieee.org

Domain Adaptive Object Detection (DAOD) generalizes the object detector from an
annotated domain to a label-free novel one. Recent works estimate prototypes (class …

被引用次数：46 相关文章所有 7 个版本

高级搜索

QQ 群