Benchmarking variable selection in QSAR

M Eklund, U Norinder, S Boyer… - Molecular …, 2012 - Wiley Online Library
Variable selection is important in QSAR modeling since it can improve model performance
and transparency, as well as reduce the computational cost of model fitting and predictions …

Choosing feature selection and learning algorithms in QSAR

M Eklund, U Norinder, S Boyer… - Journal of Chemical …, 2014 - ACS Publications
Feature selection is an important part of contemporary QSAR analysis. In a recently
published paper, we investigated the performance of different feature selection methods in a …

Three useful dimensions for domain applicability in QSAR models using random forest

RP Sheridan - Journal of chemical information and modeling, 2012 - ACS Publications
One popular metric for estimating the accuracy of prospective quantitative structure–activity
relationship (QSAR) predictions is based on the similarity of the compound being predicted …

Recursive random forests enable better predictive performance and model interpretation than variable selection by LASSO

XW Zhu, YJ Xin, HL Ge - Journal of chemical information and …, 2015 - ACS Publications
Variable selection is of crucial significance in QSAR modeling since it increases the model
predictive ability and reduces noise. The selection of the right variables is far more …

MASSA Algorithm: an automated rational sampling of training and test subsets for QSAR modeling

GC Veríssimo, SQ Pantaleão, PO Fernandes… - Journal of Computer …, 2023 - Springer
QSAR models capable of predicting biological, toxicity, and pharmacokinetic properties
were widely used to search lead bioactive molecules in chemical databases. The dataset's …

An automated framework for QSAR model building

S Kausar, AO Falcao - Journal of cheminformatics, 2018 - Springer
Background In-silico quantitative structure–activity relationship (QSAR) models based tools
are widely used to screen huge databases of compounds in order to determine the …

The relative importance of domain applicability metrics for estimating prediction errors in QSAR varies with training set diversity

RP Sheridan - Journal of Chemical Information and Modeling, 2015 - ACS Publications
In QSAR, a statistical model is generated from a training set of molecules (represented by
chemical descriptors) and their biological activities (an “activity model”). The aim of the field …

Consistency of QSAR models: Correct split of training and test sets, ranking of models and performance parameters

A Rácz, D Bajusz, K Héberger - SAR and QSAR in Environmental …, 2015 - Taylor & Francis
Recent implementations of QSAR modelling software provide the user with numerous
models and a wealth of information. In this work, we provide some guidance on how one …

How to judge predictive quality of classification and regression based QSAR models?

K Roy, S Kar - Frontiers in computational chemistry, 2015 - Elsevier
Quantitative structure-activity relationship (QSAR) is a statistical modelling approach that
can be used in drug discovery, environmental fate modeling, property and activity prediction …

Developing collaborative QSAR models without sharing structures

P Gedeck, S Skolnik, S Rodde - Journal of Chemical Information …, 2017 - ACS Publications
It is widely understood that QSAR models greatly improve if more data are used. However,
irrespective of model quality, once chemical structures diverge too far from the initial data …