Causal interpretation of self-attention in pre-trained transformers

RY Rohekar, Y Gurwicz… - Advances in Neural …, 2024 - proceedings.neurips.cc
We propose a causal interpretation of self-attention in the Transformer neural network
architecture. We interpret self-attention as a mechanism that estimates a structural equation …

A survey of methods, challenges and perspectives in causality

G Gendron, M Witbrock, G Dobbie - arXiv preprint arXiv:2302.00293, 2023 - arxiv.org
Deep Learning models have shown success in a large variety of tasks by extracting
correlation patterns from high-dimensional data but still struggle when generalizing out of …

LVLM-Intrepret: an interpretability tool for large vision-language models

G Ben Melech Stan, E Aflalo… - Proceedings of the …, 2024 - openaccess.thecvf.com
In the rapidly evolving landscape of artificial intelligence multi-modal large language models
are emerging as a significant area of interest. These models which combine various forms of …

From temporal to contemporaneous iterative causal discovery in the presence of latent confounders

RY Rohekar, S Nisimov, Y Gurwicz… - … on Machine Learning, 2023 - proceedings.mlr.press
We present a constraint-based algorithm for learning causal structures from observational
time-series data, in the presence of latent confounders. We assume a discrete-time …

Out-of-distribution generalization with causal feature separation

H Wang, K Kuang, L Lan, Z Wang… - … on Knowledge and …, 2023 - ieeexplore.ieee.org
Driven by empirical risk minimization, machine learning algorithm tends to exploit subtle
statistical correlations existing in the training environment for prediction, while the spurious …

LVLM-Intrepret: An Interpretability Tool for Large Vision-Language Models

GBM Stan, RY Rohekar, Y Gurwicz, ML Olson… - arXiv preprint arXiv …, 2024 - arxiv.org
In the rapidly evolving landscape of artificial intelligence, multi-modal large language
models are emerging as a significant area of interest. These models, which combine various …

Interpretability and causal discovery of the machine learning models to predict the production of CBM wells after hydraulic fracturing

C Min, G Wen, L Gou, X Li, Z Yang - Energy, 2023 - Elsevier
Abstract Machine learning approaches are widely studied in the production prediction of
CBM wells after hydraulic fracturing, but rarely used in practice due to the low generalization …

Causality compensated attention for contextual biased visual recognition

R Liu, J Huang, TH Li, G Li - The eleventh international conference …, 2023 - openreview.net
Visual attention does not always capture the essential object representation desired for
robust predictions. Attention modules tend to underline not only the target object but also the …

Demystifying deep reinforcement learning-based autonomous vehicle decision-making

H Wan, P Li, A Kusari - arXiv preprint arXiv:2403.11432, 2024 - arxiv.org
With the advent of universal function approximators in the domain of reinforcement learning,
the number of practical applications leveraging deep reinforcement learning (DRL) has …

A KNN-Based Non-Parametric Conditional Independence Test for Mixed Data and Application in Causal Discovery

J Huegle, C Hagedorn, R Schlosser - Joint European Conference on …, 2023 - Springer
Abstract Testing for Conditional Independence (CI) is a fundamental task for causal
discovery but is particularly challenging in mixed discrete-continuous data. In this context …