Toward an optimal procedure for variable selection and QSAR model building

A Yasri, D Hartsough - Journal of chemical information and …, 2001 - ACS Publications
In this work, we report the development of a novel QSAR technique combining genetic
algorithms and neural networks for selecting a subset of relevant descriptors and building …

Choosing feature selection and learning algorithms in QSAR

M Eklund, U Norinder, S Boyer… - Journal of Chemical …, 2014 - ACS Publications
Feature selection is an important part of contemporary QSAR analysis. In a recently
published paper, we investigated the performance of different feature selection methods in a …

Interpreting computational neural network QSAR models: a measure of descriptor importance

R Guha, PC Jurs - Journal of chemical information and modeling, 2005 - ACS Publications
We present a method to measure the relative importance of the descriptors present in a
QSAR model developed with a computational neural network (CNN). The approach is based …

Automated descriptor selection for quantitative structure-activity relationships using generalized simulated annealing

JM Sutter, SL Dixon, PC Jurs - Journal of chemical information and …, 1995 - ACS Publications
The central steps in developing QSARs are generation and selection of molecular structure
descriptors and development of the model. Recently, computational neural networks have …

Ensemble feature selection: consistent descriptor subsets for multiple QSAR models

D Dutta, R Guha, D Wild, T Chen - Journal of Chemical …, 2007 - ACS Publications
Selecting a small subset of descriptors from a large pool to build a predictive quantitative
structure− activity relationship (QSAR) model is an important step in the QSAR modeling …

Multivariate regression outperforms several robust architectures of neural networks in QSAR modeling

B Lučić, N Trinajstić - Journal of chemical information and …, 1999 - ACS Publications
In the past decade, many authors replaced multivariate regression (MR) by the neural
networks (NNs) algorithm because they believed the latter to be superior. To verify this, we …

k Nearest Neighbors QSAR Modeling as a Variational Problem:  Theory and Applications

P Itskowitz, A Tropsha - Journal of chemical information and …, 2005 - ACS Publications
Variable selection k Nearest Neighbor (kNN) QSAR is a popular nonlinear methodology for
building correlation models between chemical descriptors of compounds and biological …

Unsupervised forward selection: a method for eliminating redundant variables

DC Whitley, MG Ford… - Journal of chemical …, 2000 - ACS Publications
An unsupervised learning method is proposed for variable selection and its performance
assessed using three typical QSAR data sets. The aims of this procedure are to generate a …

Optimal sparse descriptor selection for QSAR using Bayesian methods

FR Burden, DA Winkler - QSAR & Combinatorial Science, 2009 - Wiley Online Library
Choosing a set of molecular descriptors (features) that is most relevant to a given biological
response variable is a very important problem in QSAR that has not be solved in an optimal …

Modified and enhanced replacement method for the selection of molecular descriptors in QSAR and QSPR theories

AG Mercader, PR Duchowicz, FM Fernández… - Chemometrics and …, 2008 - Elsevier
We improve a recently developed Replacement Method (RM) for the selection of an optimal
set of molecular descriptors from a much greater pool of such regression variables. Our …