A benchmark data set for in silico prediction of Ames mutagenicity

Hansen, K; Mika, S; Schroeter, T; Sutter, A; Ter Laak, A; Steger-Hartmann, T; Heinrich, N; Müller, K-R

doi:10.1186/1752-153X-3-S1-P31

Volume 3 Supplement 1

4th German Conference on Chemoinformatics: 22. CIC-Workshop

Poster presentation
Open access
Published: 05 June 2009

Chemistry Central Journal volume 3, Article number: P31 (2009)

In silico prediction tools for Ames mutagenicity (Salmonella typhimurium reverse mutation assay) offer a cost-effective, high-throughput way to prioritize compounds before they are submitted to experimental testing. Various modeling approaches have been pursued in this field during the last few years. However, the publicly available data sets used for modeling are mostly very limited in size and chemical coverage. Hence, as for most QSAR problems, a reasonable comparison of the different modeling methodologies has so far been impossible.

In this work we describe a collection of about 6000 non-confidential compounds together with their biological activity in the Ames mutagenicity test. This very large and unique data set, built from public sources, is made available in machine-readable form (SMILES strings) to be used as a benchmark by other researchers. Using these data, we built three statistical prediction models for Ames mutagenicity based on CORINA and DRAGON descriptors. The methods used are a support vector machine, a random forest and Gaussian processes. All three approaches are evaluated within the same cross-validation setting. To facilitate benchmarking, the exact validation protocol, including the exact random splits, will be made publicly available. The results show that all three methods yield satisfactory results, reaching sensitivity values above 70% and specificity values above 80%. Gaussian processes, not previously applied to Ames mutagenicity prediction, prove slightly superior to the other two methods.
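The evaluation idea described above (fixed random splits shared by all models, scored by sensitivity and specificity) can be illustrated with a minimal Python sketch. This is not the authors' protocol: the function names `fixed_folds` and `sensitivity_specificity`, the seed, the number of folds and the toy labels are all assumptions chosen for illustration, and the label convention 1 = mutagenic, 0 = non-mutagenic is likewise assumed.

```python
import random
from typing import List, Sequence, Tuple


def fixed_folds(n_items: int, n_folds: int, seed: int) -> List[List[int]]:
    """Shuffle the indices once with a fixed seed and deal them into folds.

    Reusing the same seed reproduces the same splits, so every model
    can be evaluated on identical test folds (hypothetical helper).
    """
    indices = list(range(n_items))
    random.Random(seed).shuffle(indices)
    return [indices[i::n_folds] for i in range(n_folds)]


def sensitivity_specificity(
    y_true: Sequence[int], y_pred: Sequence[int]
) -> Tuple[float, float]:
    """Return (sensitivity, specificity) for binary labels.

    Assumed convention: 1 = mutagenic (positive), 0 = non-mutagenic.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    sensitivity = tp / (tp + fn) if (tp + fn) else float("nan")
    specificity = tn / (tn + fp) if (tn + fp) else float("nan")
    return sensitivity, specificity


# Toy labels, invented purely for illustration (not data from the paper).
toy_true = [1, 1, 1, 0, 0, 0, 0, 1]
toy_pred = [1, 0, 1, 0, 0, 1, 0, 1]
print(sensitivity_specificity(toy_true, toy_pred))  # (0.75, 0.75)

# Five folds over ten items; the same seed always yields the same folds.
folds = fixed_folds(n_items=10, n_folds=5, seed=0)
```

Publishing the fold assignments themselves, rather than only a seed, would avoid any dependence on a particular random number generator; the seed-based version here is only the simplest way to show the principle.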

Author information

Authors and Affiliations

Technical University of Berlin, Franklinstr. 28/29, 10587, Berlin, Germany
K Hansen, S Mika, T Schroeter, A Sutter, A Ter Laak, T Steger-Hartmann, N Heinrich & K-R Müller

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

About this article

Cite this article

Hansen, K., Mika, S., Schroeter, T. et al. A benchmark data set for in silico prediction of Ames mutagenicity. Chemistry Central Journal 3 (Suppl 1), P31 (2009). https://doi.org/10.1186/1752-153X-3-S1-P31
