On some aspects of validation of predictive QSAR models

Roy, K; Roy, PP; Leonard, JT

doi:10.1186/1752-153X-2-S1-P9

Volume 2 Supplement 1

3rd German Conference on Chemoinformatics: 21. CIC-Workshop

Poster presentation
Open access
Published: 26 March 2008

On some aspects of validation of predictive QSAR models

K Roy¹,
PP Roy¹ &
JT Leonard¹

Chemistry Central Journal volume 2, Article number: P9 (2008) Cite this article

3210 Accesses
4 Citations
Metrics details

Quantitative structure-activity relationships (QSARs) represent predictive models derived from application of statistical tools correlating biological activity (including therapeutic and toxic) of chemicals (drugs/toxicants/environmental pollutants) with descriptors representative of molecular structure and/or property. The success of any QSAR model depends on accuracy of the input data, selection of appropriate descriptors and statistical tools, and most importantly validation of the developed model. Validation is the process by which the reliability and relevance of a procedure are established for a specific purpose. Leave one-out cross-validation generally leads to an overestimation of predictive capacity, and even with external validation, no one can be sure whether the selection of training and test sets was manipulated to maximize the predictive capacity of the model being published. In this paper, we present some representative examples of validation of QSAR models in order to explore possible importance of the method of selection of training set compounds, setting training set size and impact of variable selection for training set models for determining the quality of prediction. The major conclusions from the study are: (1) K-means cluster based division of training and prediction sets can be used as a reliable method of division of data set into training and test sets for developing predictive QSAR models; (2) the training set size should be set at an optimal level so that the model is developed with proper training (learning) process and the developed model is able to satisfactorily predict the activity values of the test set compounds; (3) choice of variables for regression based only on Q² value may not be optimum. Furthermore, predictive R² value may not be considered as the only criterion to indicate external predictability of a model.

Author information

Authors and Affiliations

Drug Theoretics and Cheminformatics Lab, Division of Medicinal and Pharmaceutical Chemistry, Department of Pharmaceutical Technology, Jadavpur University, Kolkata, 700 032, India
K Roy, PP Roy & JT Leonard

Authors

K Roy
View author publications
You can also search for this author in PubMed Google Scholar
PP Roy
View author publications
You can also search for this author in PubMed Google Scholar
JT Leonard
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to K Roy.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Roy, K., Roy, P. & Leonard, J. On some aspects of validation of predictive QSAR models. Chemistry Central Journal 2 (Suppl 1), P9 (2008). https://doi.org/10.1186/1752-153X-2-S1-P9

Download citation

Published: 26 March 2008
DOI: https://doi.org/10.1186/1752-153X-2-S1-P9

3rd German Conference on Chemoinformatics: 21. CIC-Workshop

On some aspects of validation of predictive QSAR models

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

BMC Chemistry

Contact us

3rd German Conference on Chemoinformatics: 21. CIC-Workshop

On some aspects of validation of predictive QSAR models

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Chemistry

Contact us