Diagram perception networks for textbook question answering via joint optimization

J Ma, J Liu, Q Chai, P Wang, J Tao - International Journal of Computer …, 2024 - Springer
Textbook question answering requires a system to answer questions with or without
diagrams accurately, given multimodal contexts that include rich paragraphs and diagrams …

A survey of deep learning techniques for machine reading comprehension

S Kazi, S Khoja, A Daud - Artificial Intelligence Review, 2023 - Springer
Reading comprehension involves the process of reading and understanding textual
information in order to answer questions related to it. It finds practical applications in various …

Visual question answering: from early developments to recent advances--a survey

ND Huynh, MR Bouadjenek, S Aryal, I Razzak… - arXiv preprint arXiv …, 2025 - arxiv.org
Visual Question Answering (VQA) is an evolving research field aimed at enabling machines
to answer questions about visual content by integrating image and language processing …

Weakly supervised learning for textbook question answering

J Ma, Q Chai, J Huang, J Liu, Y You… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Textbook Question Answering (TQA) is the task of answering diagram and non-diagram
questions given large multi-modal contexts consisting of abundant text and diagrams. Deep …

A human-like traffic scene understanding system: A survey

ZX Xia, WC Lai, LW Tsao, LF Hsu… - IEEE Industrial …, 2020 - ieeexplore.ieee.org
Autonomous vehicles, also known as self-driving cars, have the capability to perceive the
environment, locate its position, and safely drive to the destination without any human …

Spatial-semantic collaborative graph network for textbook question answering

Y Wang, B Wei, J Liu, Q Lin, L Zhang… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Textbook Question Answering (TQA) task requires answering questions by reasoning based
on both the given diagrams and text context. There are mainly two challenges for the task …

Xtqa: Span-level explanations for textbook question answering

J Ma, Q Chai, J Liu, Q Yin, P Wang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Textbook question answering (TQA) is the task of correctly answering diagram or
nondiagram (ND) questions given large multimodal contexts consisting of abundant essays …

Relation-Aware Heterogeneous Graph Network for Learning Intermodal Semantics in Textbook Question Answering

S Zhang, Y Wu, X Zhang, Z Feng… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Textbook question answering (TQA) task aims to infer answers for given questions from a
multimodal context, including text and diagrams. The existing studies have aggregated …