Explainable deep learning methods in medical image classification: A survey

C Patrício, JC Neves, LF Teixeira - ACM Computing Surveys, 2023 - dl.acm.org
The remarkable success of deep learning has prompted interest in its application to medical
imaging diagnosis. Even though state-of-the-art deep learning models have achieved …

Causal reasoning meets visual representation learning: A prospective study

Y Liu, YS Wei, H Yan, GB Li, L Lin - Machine Intelligence Research, 2022 - Springer
Visual representation learning is ubiquitous in various real-world applications, including
visual comprehension, video understanding, multi-modal analysis, human-computer …

Cross-modal causal relational reasoning for event-level visual question answering

Y Liu, G Li, L Lin - IEEE Transactions on Pattern Analysis and …, 2023 - ieeexplore.ieee.org
Existing visual question answering methods often suffer from cross-modal spurious
correlations and oversimplified event-level reasoning processes that fail to capture event …

Chatcad+: Towards a universal and reliable interactive cad using llms

Z Zhao, S Wang, J Gu, Y Zhu, L Mei… - … on Medical Imaging, 2024 - ieeexplore.ieee.org
The integration of Computer-Aided Diagnosis (CAD) with Large Language Models (LLMs)
presents a promising frontier in clinical applications, notably in automating diagnostic …

Visual causal scene refinement for video question answering

Y Wei, Y Liu, H Yan, G Li, L Lin - Proceedings of the 31st ACM …, 2023 - dl.acm.org
Existing methods for video question answering (VideoQA) often suffer from spurious
correlations between different modalities, leading to a failure in identifying the dominant …

Textual inversion and self-supervised refinement for radiology report generation

Y Luo, H Li, X Wu, M Cao, X Huang, Z Zhu… - … Conference on Medical …, 2024 - Springer
Existing mainstream approaches follow the encoder-decoder paradigm for generating
radiology reports. They focus on improving the network structure of encoders and decoders …

Causal inference meets deep learning: A comprehensive survey

L Jiao, Y Wang, X Liu, L Li, F Liu, W Ma, Y Guo, P Chen… - Research, 2024 - spj.science.org
Deep learning relies on learning from extensive data to generate prediction results. This
approach may inadvertently capture spurious correlations within the data, leading to models …

[PDF][PDF] Causality-aware visual scene discovery for cross-modal question reasoning

Y Liu, G Li, L Lin - arXiv preprint arXiv:2304.08083, 2023 - sysu-hcp.net
Existing visual question reasoning methods usually fail to explicitly discover the inherent
causal mechanism and ignore the complex event-level understanding that requires jointly …

Causalvlr: A toolbox and benchmark for visual-linguistic causal reasoning

Y Liu, W Chen, G Li, L Lin - arXiv preprint arXiv:2306.17462, 2023 - arxiv.org
We present CausalVLR (Causal Visual-Linguistic Reasoning), an open-source toolbox
containing a rich set of state-of-the-art causal relation discovery and causal inference …

Multimodal Embodied Interactive Agent for Cafe Scene

Y Liu, X Song, K Jiang, W Chen, J Luo, G Li… - arXiv preprint arXiv …, 2024 - arxiv.org
With the surge in the development of large language models, embodied intelligence has
attracted increasing attention. Nevertheless, prior works on embodied intelligence typically …