Ensemble feature selection: consistent descriptor subsets for multiple QSAR models

D Dutta, R Guha, D Wild, T Chen - Journal of Chemical …, 2007 - ACS Publications
Selecting a small subset of descriptors from a large pool to build a predictive quantitative
structure− activity relationship (QSAR) model is an important step in the QSAR modeling …

Interpreting computational neural network QSAR models: a measure of descriptor importance

R Guha, PC Jurs - Journal of chemical information and modeling, 2005 - ACS Publications
We present a method to measure the relative importance of the descriptors present in a
QSAR model developed with a computational neural network (CNN). The approach is based …

Exploiting multiple descriptor sets in QSAR studies

JH Tomal, WJ Welch, RH Zamar - Journal of Chemical Information …, 2016 - ACS Publications
A quantitative structure–activity relationship (QSAR) is a model relating a specific biological
response to the chemical structures of compounds. There are many descriptor sets available …

A method for quantifying and visualizing the diversity of QSAR models

S Izrailev, DK Agrafiotis - Journal of Molecular Graphics and Modelling, 2004 - Elsevier
Feature selection is one of the most commonly used and reliable methods for deriving
predictive quantitative structure–activity relationships (QSAR). Many feature selection …

Similarity to molecules in the training set is a good discriminator for prediction accuracy in QSAR

RP Sheridan, BP Feuston, VN Maiorov… - Journal of chemical …, 2004 - ACS Publications
How well can a QSAR model predict the activity of a molecule not in the training set used to
create the model? A set of retrospective cross-validation experiments using 20 diverse in …

Stochastic versus Stepwise Strategies for Quantitative Structure− Activity Relationship Generation How Much Effort May the Mining for Successful QSAR Models Take …

D Horvath, F Bonachera, V Solov'Ev… - Journal of chemical …, 2007 - ACS Publications
Descriptor selection in QSAR typically relies on a set of upfront working hypotheses in order
to boil down the initial descriptor set to a tractable size. Stepwise regression …

QSAR− how good is it in practice? Comparison of descriptor sets on an unbiased cross section of corporate data sets

P Gedeck, B Rohde, C Bartels - Journal of chemical information …, 2006 - ACS Publications
The quality of QSAR (Quantitative Structure− Activity Relationships) predictions depends on
a large number of factors including the descriptor set, the statistical method, and the data …

Three useful dimensions for domain applicability in QSAR models using random forest

RP Sheridan - Journal of chemical information and modeling, 2012 - ACS Publications
One popular metric for estimating the accuracy of prospective quantitative structure–activity
relationship (QSAR) predictions is based on the similarity of the compound being predicted …

Predictive quantitative structure–activity relationships modeling: development and validation of QSAR models

A Tropsha, A Golbraikh - Handbook of chemoinformatics …, 2010 - taylorfrancis.com
Validation Criteria of QSAR Models.......................................... 214 7.3 Validation of QSAR
Models: Y-Randomization.............................. 220 7.4 Validation of QSAR Models: Training …

Toward an optimal procedure for variable selection and QSAR model building

A Yasri, D Hartsough - Journal of chemical information and …, 2001 - ACS Publications
In this work, we report the development of a novel QSAR technique combining genetic
algorithms and neural networks for selecting a subset of relevant descriptors and building …