A blood-based metabolomic signature predictive of risk for pancreatic cancer

E Irajizad, A Kenney, T Tang, J Vykoukal, R Wu… - Cell Reports …, 2023 - cell.com
Emerging evidence implicates microbiome involvement in the development of pancreatic
cancer (PaCa). Here, we investigate whether increases in circulating microbial-related …

MDI+: A flexible random forest-based feature importance framework

A Agarwal, AM Kenney, YS Tan, TM Tang… - arXiv preprint arXiv …, 2023 - arxiv.org
Mean decrease in impurity (MDI) is a popular feature importance measure for random
forests (RFs). We show that the MDI for a feature $ X_k $ in each tree in an RF is equivalent …

Patterns of Fitness and Gene Expression Epistasis Generated by Beneficial Mutations in the rho and rpoB Genes of Escherichia coli during High-Temperature …

A González-González, TN Batarseh… - Molecular Biology …, 2024 - academic.oup.com
Epistasis is caused by genetic interactions among mutations that affect fitness. To
characterize properties and potential mechanisms of epistasis, we engineered eight double …

Learning from learning machines: a new generation of AI technology to meet the needs of science

L Pion-Tonachini, K Bouchard, HG Martin… - arXiv preprint arXiv …, 2021 - arxiv.org
We outline emerging opportunities and challenges to enhance the utility of AI for scientific
discovery. The distinct goals of AI for industry versus the goals of AI for science create …

Detecting gene–gene interactions from GWAS using diffusion kernel principal components

A Walakira, J Ocira, D Duroux, R Fouladi, M Moškon… - BMC …, 2022 - Springer
Genes and gene products do not function in isolation but as components of complex
networks of macromolecules through physical or biochemical interactions. Dependencies of …

A generative framework to bridge data-driven models and scientific theories in language neuroscience

R Antonello, C Singh, S Jain, A Hsu, J Gao… - arXiv preprint arXiv …, 2024 - arxiv.org
Representations from large language models are highly effective at predicting BOLD fMRI
responses to language stimuli. However, these representations are largely opaque: it is …

[PDF][PDF] Learning from learning machines: a new generation of AI technology to meet the needs of science

BJ Webb-Robertsonzn, R Stevenszo… - arXiv preprint arXiv …, 2021 - academia.edu
We outline emerging opportunities and challenges to enhance the utility of AI for scientific
discovery. The distinct goals of AI for industry versus the goals of AI for science create …

Interpretable and efficient statistical approaches for biomedical data

X Li - 2021 - escholarship.org
Statistics and machine learning have achieved remarkable successes in solving data
problems including driving new biomedical discoveries. In particular, prediction and …

[引用][C] What is uncertainty in today's practice of data science?

B Yu - Journal of Econometrics, 2023 - ideas.repec.org
What is uncertainty in today’s practice of data science? IDEAS home Advanced search
Economic literature: papers, articles, software, chapters, books. Authors Institutions …