Comparison of the effectiveness of different normalization methods for metagenomic cross-study phenotype prediction under heterogeneity

B Wang, F Sun, Y Luan - Scientific Reports, 2024 - nature.com
The human microbiome, comprising microorganisms residing within and on the human
body, plays a crucial role in various physiological processes and has been linked to …

Evaluation of normalization methods for predicting quantitative phenotypes in metagenomic data analysis

B Wang, Y Luan - Frontiers in Genetics, 2024 - frontiersin.org
Genotype-to-phenotype mapping is an essential problem in the current genomic era. While
qualitative case-control predictions have received significant attention, less emphasis has …

In search of the ratio of miRNA expression as robust biomarkers for constructing stable diagnostic models among multi-center data

C Ma, Y Zhang, R Ding, H Chen, X Wu, L Xu… - Frontiers in …, 2024 - frontiersin.org
MicroRNAs (miRNAs) are promising biomarkers for the early detection of disease, and many
miRNA-based diagnostic models have been constructed to distinguish patients and healthy …

AITeQ: a machine learning framework for Alzheimer's prediction using a distinctive five-gene signature

I Ahammad, AB Lamisa, A Bhattacharjee… - Briefings in …, 2024 - academic.oup.com
Neurodegenerative diseases, such as Alzheimer's disease, pose a significant global health
challenge with their complex etiology and elusive biomarkers. In this study, we developed …

Optimal ensemble construction for multistudy prediction with applications to mortality estimation

G Loewinger, RA Nunez, R Mazumder… - Statistics in …, 2024 - Wiley Online Library
It is increasingly common to encounter prediction tasks in the biomedical sciences for which
multiple datasets are available for model training. Common approaches such as pooling …

Batch normalization followed by merging is powerful for phenotype prediction integrating multiple heterogeneous studies

Y Gao, F Sun - PLOS Computational Biology, 2023 - journals.plos.org
Heterogeneity in different genomic studies compromises the performance of machine
learning models in cross-study phenotype predictions. Overcoming heterogeneity when …

[HTML][HTML] Hierarchical resampling for bagging in multistudy prediction with applications to human neurochemical sensing

G Loewinger, P Patil, KT Kishida… - The annals of applied …, 2022 - ncbi.nlm.nih.gov
We propose the “study strap ensemble”, which combines advantages of two common
approaches to fitting prediction models when multiple training datasets (“studies”) are …

The ratio of interacting miRNAs' expressions is a robust biomarker for disease classification in multi-center data

Y Zhang, C Ma, R Ding, H Chen, L Xu, C Yu - medRxiv, 2023 - medrxiv.org
Background Many miRNA-based diagnostic models have been constructed to distinguish
diseased individuals. However, due to the inherent differences across different platforms or …

[HTML][HTML] Kokiri: Random-forest-based comparison and characterization of cohorts

K Eckelt, P Adelberger, MJ Bauer, T Zichner, M Streit - 2022 - europepmc.org
A bstract We propose an interactive visual analytics approach to characterizing and
comparing patient subgroups (ie, cohorts). Despite having the same disease and similar …

Defining Replicability of Prediction Rules

G Parmigiani - Statistical Science, 2023 - projecteuclid.org
In this article, I propose an approach for defining replicability for prediction rules. Motivated
by a recent report by the USA National Academy of Sciences, I start from the perspective that …