Toxicity testing in the 21st century: progress in the past decade and future perspectives

D Krewski, ME Andersen, MG Tyshenko… - Archives of …, 2020 - Springer
Advances in the biological sciences have led to an ongoing paradigm shift in toxicity testing
based on expanded application of high-throughput in vitro screening and in silico methods …

Computational deconvolution of transcriptomics data from mixed cell populations

F Avila Cobos, J Vandesompele, P Mestdagh… - …, 2018 - academic.oup.com
Gene expression analyses of bulk tissues often ignore cell type composition as an important
confounding factor, resulting in a loss of signal from lowly abundant cell types. In this review …

The Matthews correlation coefficient (MCC) should replace the ROC AUC as the standard metric for assessing binary classification

D Chicco, G Jurman - BioData Mining, 2023 - Springer
Binary classification is a common task for which machine learning and computational
statistics are used, and the area under the receiver operating characteristic curve (ROC …

Development of a clinical polygenic risk score assay and reporting workflow

L Hao, P Kraft, GF Berriz, ED Hynes, C Koch… - Nature medicine, 2022 - nature.com
Implementation of polygenic risk scores (PRS) may improve disease prevention and
management but poses several challenges: the construction of clinically valid assays …

Optimal classifier for imbalanced data using Matthews Correlation Coefficient metric

S Boughorbel, F Jarray, M El-Anbari - PloS one, 2017 - journals.plos.org
Data imbalance is frequently encountered in biomedical applications. Resampling
techniques can be used in binary classification to tackle this issue. However such solutions …

Removing batch effects in analysis of expression microarray data: an evaluation of six batch adjustment methods

C Chen, K Grennan, J Badner, D Zhang, E Gershon… - PloS one, 2011 - journals.plos.org
The expression microarray is a frequently used approach to study gene expression on a
genome-wide scale. However, the data produced by the thousands of microarray studies …

Inflation of test accuracy due to data leakage in deep learning-based classification of OCT images

IE Tampu, A Eklund, N Haj-Hosseini - Scientific Data, 2022 - nature.com
In the application of deep learning on optical coherence tomography (OCT) data, it is
common to train classification networks using 2D images originating from volumetric data …

Essential guidelines for computational method benchmarking

LM Weber, W Saelens, R Cannoodt, C Soneson… - Genome biology, 2019 - Springer
In computational biology and other sciences, researchers are frequently faced with a choice
between several computational methods for performing data analyses. Benchmarking …

Integrating feature selection and feature extraction methods with deep learning to predict clinical outcome of breast cancer

D Zhang, L Zou, X Zhou, F He - Ieee Access, 2018 - ieeexplore.ieee.org
In many microarray studies, classifiers have been constructed based on gene signatures to
predict clinical outcomes for various cancer sufferers. However, signatures originating from …

Comparison of normalization and differential expression analyses using RNA-Seq data from 726 individual Drosophila melanogaster

Y Lin, K Golovnina, ZX Chen, HN Lee, YLS Negron… - BMC genomics, 2016 - Springer
Background A generally accepted approach to the analysis of RNA-Seq read count data
does not yet exist. We sequenced the mRNA of 726 individuals from the Drosophila Genetic …