Visual-linguistic causal intervention for radiology report generation

C Patrício, JC Neves, LF Teixeira - ACM Computing Surveys, 2023 - dl.acm.org

The remarkable success of deep learning has prompted interest in its application to medical
imaging diagnosis. Even though state-of-the-art deep learning models have achieved …

被引用次数：48 相关文章所有 3 个版本

[PDF] springer.com

Causal reasoning meets visual representation learning: A prospective study

Y Liu, YS Wei, H Yan, GB Li, L Lin - Machine Intelligence Research, 2022 - Springer

Visual representation learning is ubiquitous in various real-world applications, including
visual comprehension, video understanding, multi-modal analysis, human-computer …

被引用次数：48 相关文章所有 7 个版本

[PDF] arxiv.org

Cross-modal causal relational reasoning for event-level visual question answering

Y Liu, G Li, L Lin - IEEE Transactions on Pattern Analysis and …, 2023 - ieeexplore.ieee.org

Existing visual question answering methods often suffer from cross-modal spurious
correlations and oversimplified event-level reasoning processes that fail to capture event …

被引用次数：119 相关文章所有 7 个版本

[PDF] arxiv.org

Chatcad+: Towards a universal and reliable interactive cad using llms

Z Zhao, S Wang, J Gu, Y Zhu, L Mei… - … on Medical Imaging, 2024 - ieeexplore.ieee.org

The integration of Computer-Aided Diagnosis (CAD) with Large Language Models (LLMs)
presents a promising frontier in clinical applications, notably in automating diagnostic …

被引用次数：41 相关文章所有 6 个版本

[PDF] arxiv.org

Visual causal scene refinement for video question answering

Y Wei, Y Liu, H Yan, G Li, L Lin - Proceedings of the 31st ACM …, 2023 - dl.acm.org

Existing methods for video question answering (VideoQA) often suffer from spurious
correlations between different modalities, leading to a failure in identifying the dominant …

被引用次数：23 相关文章所有 3 个版本

[PDF] arxiv.org

Textual inversion and self-supervised refinement for radiology report generation

Y Luo, H Li, X Wu, M Cao, X Huang, Z Zhu… - … Conference on Medical …, 2024 - Springer

Existing mainstream approaches follow the encoder-decoder paradigm for generating
radiology reports. They focus on improving the network structure of encoders and decoders …

被引用次数：6 相关文章所有 2 个版本

[PDF] science.org

Causal inference meets deep learning: A comprehensive survey

L Jiao, Y Wang, X Liu, L Li, F Liu, W Ma, Y Guo, P Chen… - Research, 2024 - spj.science.org

Deep learning relies on learning from extensive data to generate prediction results. This
approach may inadvertently capture spurious correlations within the data, leading to models …

被引用次数：2 相关文章所有 5 个版本

[PDF] sysu-hcp.net

[PDF][PDF] Causality-aware visual scene discovery for cross-modal question reasoning

Y Liu, G Li, L Lin - arXiv preprint arXiv:2304.08083, 2023 - sysu-hcp.net

Existing visual question reasoning methods usually fail to explicitly discover the inherent
causal mechanism and ignore the complex event-level understanding that requires jointly …

被引用次数：5 相关文章所有 2 个版本

[PDF] arxiv.org

Causalvlr: A toolbox and benchmark for visual-linguistic causal reasoning

Y Liu, W Chen, G Li, L Lin - arXiv preprint arXiv:2306.17462, 2023 - arxiv.org

We present CausalVLR (Causal Visual-Linguistic Reasoning), an open-source toolbox
containing a rich set of state-of-the-art causal relation discovery and causal inference …

被引用次数：5 相关文章所有 2 个版本

[PDF] arxiv.org

Multimodal Embodied Interactive Agent for Cafe Scene

Y Liu, X Song, K Jiang, W Chen, J Luo, G Li… - arXiv preprint arXiv …, 2024 - arxiv.org

With the surge in the development of large language models, embodied intelligence has
attracted increasing attention. Nevertheless, prior works on embodied intelligence typically …

被引用次数：3 相关文章所有 2 个版本

高级搜索

QQ 群