Skip Navigation

Nucleic Acids Research 2006 34(Database Issue):D678-D683; doi:10.1093/nar/gkj132
This Article
Right arrow Abstract Freely available
Right arrow Print PDF (874K) Freely available
Right arrow Screen PDF (261K) Freely available
Right arrow Supplementary Material
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Dunkel, M.
Right arrow Articles by Preissner, R.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Dunkel, M.
Right arrow Articles by Preissner, R.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Nucleic Acids Research, 2006, Vol. 34, Database issue D678-D683
© The Author 2006. Published by Oxford University Press. All rights reserved
The online version of this article has been published under an open access model. Users are entitled to use, reproduce, disseminate, or display the open access version of this article for non-commercial purposes provided that: the original authorship is properly and fully attributed; the Journal and Oxford University Press are attributed as the original place of publication with the correct citation details given; if an article is subsequently reproduced or disseminated not in its entirety but only in part or as a derivative work this must be clearly indicated. For commercial re-use, please contact journals.permissions{at}oxfordjournals.org


Article

SuperNatural: a searchable database of available natural compounds

Mathias Dunkel*, Melanie Fullbeck, Stefanie Neumann and Robert Preissner

Berlin Center of Genome Based Bioinformatics, 3D Datamining Group, Institute of Biochemistry, Charité–University Medicine Berlin Monbijoustrasse 2, 10117 Berlin, Germany

*To whom correspondence should be addressed. Tel: +49 30 450 528375; Fax: +49 30 450 528942; Email: mathias.dunkel{at}charite.de

Received August 15, 2005. Revised October 24, 2005. Accepted October 24, 2005.


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 THE DATABASE
 PRACTICAL APPROACHES USING THE...
 AVAILABILITY
 CONCLUSIONS AND FUTURE...
 SUPPLEMENTARY DATA
 REFERENCES
 
Although tremendous effort has been put into synthetic libraries, most drugs on the market are still natural compounds or derivatives thereof. There are encyclopaedias of natural compounds, but the availability of these compounds is often unclear and catalogues from numerous suppliers have to be checked. To overcome these problems we have compiled a database of ~50 000 natural compounds from different suppliers. To enable efficient identification of the desired compounds, we have implemented substructure searches with typical templates. Starting points for in silico screenings are about 2500 well-known and classified natural compounds from a compendium that we have added. Possible medical applications can be ascertained via automatic searches for similar drugs in a free conformational drug database containing WHO indications. Furthermore, we have computed about three million conformers, which are deployed to account for the flexibilities of the compounds when the 3D superposition algorithm that we have developed is used. The SuperNatural Database is publicly available at http://bioinformatics.charite.de/supernatural. Viewing requires the free Chime-plugin from MDL (Chime) or Java2 Runtime Environment (MView), which is also necessary for using Marvin application for chemical drawing.


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 THE DATABASE
 PRACTICAL APPROACHES USING THE...
 AVAILABILITY
 CONCLUSIONS AND FUTURE...
 SUPPLEMENTARY DATA
 REFERENCES
 
The world is full of natural products, but only a few existing natural products are known and our understanding of the metabolome is fragmentary. Nature invented a universe of secondary metabolites as ‘defense compounds’ against enemies in predator–prey relationships. Concomitantly, strategies for handling xenobiotics evolved, such as the multidrug resistance efflux pump and the cytochrome P450 monooxygenases (1,2). Tulp and Bohlin (3) hypothesize that when a natural compound occurs in unrelated species, it must have an important biological function, e.g. addressing a specific target, because fortuitous production of a particular compound by totally unrelated species is extremely improbable (3). About 200 000 natural compounds are currently known and many more will prove to be more than just ‘secondary metabolites’ (3). Even though combinatorial synthesis is now producing molecules that are drug-like in terms of size and property, these molecules, in contrast to natural products, have not evolved to interact with biomolecules (4). Natural compounds such as brefelidin A, camptothecin, forskolin and immunophilins often interfere with protein–protein interaction sites (5). Analysis of the properties of synthetic and natural compounds compared to drugs revealed the distinctiveness of natural compounds, especially concerning the diversity of scaffolds and the large number of chiral centers (6). This may be one reason why ~50% of the drugs introduced to the market during the last 20 years are derived directly or indirectly from natural compounds (7). Although most drugs on the market have a natural origin, their availability often remains unclear (8). The percentage of new non-synthetic chemical entities in the area of cancer remained at a yearly average of 62% over the period of 1981–2002 (9). Some marine natural products are either in or approaching Phase II/III clinical trials in cancer, analgesia, allergy and cognitive diseases (10). The chemical diversity of these compounds is tremendous and may offer inspiration for innovations in the fields of medicine, nutrition, agrochemical and life sciences (11).


    THE DATABASE
 TOP
 ABSTRACT
 INTRODUCTION
 THE DATABASE
 PRACTICAL APPROACHES USING THE...
 AVAILABILITY
 CONCLUSIONS AND FUTURE...
 SUPPLEMENTARY DATA
 REFERENCES
 
Several commercial databases and databases of rare compounds exist (1214), but the SuperNatural Database is the first public resource containing 3D structures and conformers of 45917 natural compounds, derivatives and analogues purchasable from different suppliers. Currently, data from eight suppliers are available, but we plan to add further suppliers, compounds from which will be added on request (see ‘List of Suppliers’ on the SuperNatural Database website). The 2D structure of each compound, provided by the suppliers, was used to generate 3D structures (Discovery Studio, Accelrys Inc., http://www.accelrys.com/dstudio). Using a chemistry development kit (http://almost.cubic.uni-koeln.de/cdk/), fingerprints (966 bits, MACCS Keys) were calculated; each bit of a fingerprint represents functional groups (structural fingerprint). As a measure of 2D similarity we used the Tanimoto coefficient (15), which compares the bits of the structural fingerprints of two compounds. A Tanimoto coefficient of ≥0.85 indicates that a molecule has activities similar to a lead compound (16). For better coverage of the compounds and to ensure their flexibility during usage of the 3D-superposition algorithm, about three million conformers were evaluated (MedChem Explorer, Accelrys Inc., http://www.accelrys.com/dstudio/ds_medchem). As a threshold for conformer generation, 20 kcal/mol as a relative maximum energy was set. This spacious threshold allows the user to find the best 3D superposition of two compounds even if they contain several rotatable bonds. The pre-computed fingerprints are stored in a MySQL-database on a web server, which is accessible via browser (see FAQ on the website for the database schema).

Owing to the immense structural diversity of natural compounds compared to synthetic compounds, an increased spectrum of therapeutic activities can be covered. Natural compounds can be classified by different criteria (see the classification list at ‘Search via known compounds’ on the SuperNatural Database website):

  1. Classification by structural characteristics: alkaloid, amino acid, fatty acid, etc.
  2. Classification by functional aspects: vitamin, hormone, enzyme, etc.

To find desired natural compounds, a number of search options were implemented:

  • As a starting point for screenings we compiled a searchable compendium of about 2500 well-known natural compounds characterized by a CAS-number (Chemical Abstracts), which is useful to cross-referencing other databases. This compendium contains systematic names, classification codes, empiric formulae, mixtures and synonyms (Figure 1A).
  • Similarity searches based on fingerprints and Tanimoto coefficients are implemented in the SuperNatural Database (Figure 1B).
  • Another way to perform a similarity search is the Marvin Applet, which allows the user to build or import a molecular structure and compare it with compounds of the SuperNatural Database (Figure 1C).
  • Furthermore, an algorithm developed in our group enables 3D-superpositions of two compounds to be made. The algorithm compares all conformers of two compounds to find the best structural alignment (17) (Figure 1E).
  • To identify possible applications, the user can search for similar drugs in the free drug database (SuperDrug Database) containing medical indications assigned by WHO (18).
About 300 natural compounds from the SuperNatural Database are identical to active ingredients of drugs, and 8% (3600) of the natural compounds are similar to essential marketed drugs with Tanimoto coefficients >0.85. For each natural compound, information on different structural and chemical properties (DS Viewer, Property Calculator, http://www.accelrys.com/dstudio) such as number of chiral centers, estimated logp, surface area, etc. are precalculated and given in a separate ‘FULL INFO’ window (Figure 1D). For molecular visualization of the compounds, the user needs the free Chime-Plugin from MDL (available for Windows, SGI, Mac) or the Java2 Runtime Environment. Atomic coordinates of single or superimposed compounds are available for saving in Mol-format.



View larger version (45K):
[in this window]
[in a new window]
 
Figure 1 Screenshots of the web-interface of the SuperNatural Database. (A) Navigation frame and text query options for performing a search via known natural compounds. (B) Query results with the option for a 3D superposition. The 2D similarity query shows two compounds, which have a 2D similarity of 100.00 and 87.41 to the lead-structure. The compounds can be rotated (left mouse button), different display styles are available (right mouse button) and more detailed information concerning the properties of each structure can be obtained by use of the Properties button. Both compounds are available from the supplier MicroSource. (C) Screenshot of the Java applet Marvin, which allows upload or drawing of own structures for similarity searches in the SuperNatural Database. (D) Calculated properties for one structure. (E) Results of a 3D superposition. All conformations of both structures are superimposed and the best superposition is displayed. The table separately depicts the structures and the superposition of the corresponding conformations in the middle. The (superimposed) 3D structures can be saved by right clicking on the molecule. Also, information is given about the number of superimposed atoms and the root mean square distance.

 

    PRACTICAL APPROACHES USING THE SIMILARITY SCREENING FUNCTION OF THE SuperNatural DATABASE
 TOP
 ABSTRACT
 INTRODUCTION
 THE DATABASE
 PRACTICAL APPROACHES USING THE...
 AVAILABILITY
 CONCLUSIONS AND FUTURE...
 SUPPLEMENTARY DATA
 REFERENCES
 
A detailed review of various approaches to similarity searching was given by Willet et al. (19). Screenings for new bioactive natural compounds on the basis of chemical similarity to a known ligand depend on the similar property principle of Johnson and Maggiora (20). As an example, we performed a similarity screening in the SuperNatural Database with natural compounds that are known drugs, from clinical trials or lead compounds for drug development (Tables 1 and 2 and Supplementary Data) (21). Our investigations showed that the database contains compounds that have already been investigated in clinical trials for different diseases (Table 1 and 2 and Supplementary Data) and a great number of compounds with calculated 2D similarities of ≥0.85 to the lead compounds. The SuperNatural Database contains 289 natural compounds, which are already known as drugs. Owing to the immense structural and chemical variety of natural compounds, the coverage of a great spectrum of diseases is possible, which is confirmed by the ATC classifications of the drugs (see ATC classification in the category statistics on the SuperNatural website). There are 73 different ATC classes (three letter abbreviations) covered by these 289 natural compounds. The results show that the SuperNatural Database is an excellent source for finding bioactive natural products.


View this table:
[in this window]
[in a new window]
 
Table 1 Well-known natural compounds (drugs, lead compounds for drugs or compounds in clinical trials) with antibacterial, antifungal, antiparasitic and antiviral effects and similar compounds (tanimoto ≥0.85) from the SuperNatural Database

 

View this table:
[in this window]
[in a new window]
 
Table 2 Well-known natural compounds (drugs, lead compounds for drugs or compounds in clinical trials) used in areas of neurological diseases, immunological or inflammatory processes and oncological diseases and similar compounds (tanimoto ≥0.85) from the SuperNatural Database

 

    AVAILABILITY
 TOP
 ABSTRACT
 INTRODUCTION
 THE DATABASE
 PRACTICAL APPROACHES USING THE...
 AVAILABILITY
 CONCLUSIONS AND FUTURE...
 SUPPLEMENTARY DATA
 REFERENCES
 
The database is publicly available at http://bioinformatics.charite.de/supernatural. The data will be updated twice a year.


    CONCLUSIONS AND FUTURE DIRECTIONS
 TOP
 ABSTRACT
 INTRODUCTION
 THE DATABASE
 PRACTICAL APPROACHES USING THE...
 AVAILABILITY
 CONCLUSIONS AND FUTURE...
 SUPPLEMENTARY DATA
 REFERENCES
 
The chemical diversity and unique properties of natural compounds provide a promising starting-point for developing innovations for scientific, medical and nutritional applications. The SuperNatural Database is a free resource with embedded screening functions for bioactive natural compounds. The extension of the database allows the scientific community simple access to a growing number of available natural compounds.


    SUPPLEMENTARY DATA
 TOP
 ABSTRACT
 INTRODUCTION
 THE DATABASE
 PRACTICAL APPROACHES USING THE...
 AVAILABILITY
 CONCLUSIONS AND FUTURE...
 SUPPLEMENTARY DATA
 REFERENCES
 
Supplementary Data are available at NAR Online.


    ACKNOWLEDGEMENTS
 
This work is supported by the BMBF (grant contract: 0312705B) funded Berlin Center for Genome Based Bioinformatics (BCB). We would like to thank the companies for their permissions to use their natural compound libraries. Funding to pay the Open Access publication charges for this article was provided by Universitätsmedizin, Charité.

Conflict of interest statement. None declared.


    Footnotes
 
The authors wish it to be known that, in their opinion, the first two authors should be regarded as joint First Authors


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 THE DATABASE
 PRACTICAL APPROACHES USING THE...
 AVAILABILITY
 CONCLUSIONS AND FUTURE...
 SUPPLEMENTARY DATA
 REFERENCES
 

  1. Schuler, M.A. and Werck-Reichhart, D. (2003) Functional genomics of P450s Annu. Rev. Plant Biol, . 54, 629–667[CrossRef][Medline] .

  2. Del Sorbo, G., Schoonbeek, H., De Waard, M.A. (2000) Fungal transporters involved in efflux of natural toxic compounds and fungicides Fungal Genet. Biol, . 30, 1–15[CrossRef][Web of Science][Medline] .

  3. Tulp, M. and Bohlin, L. (2005) Rediscovery of known natural compounds: nuisance or goldmine? Bioorg. Med. Chem, . 13, 5274–5282[Medline] .

  4. Piggott, A.M. and Karuso, P. (2004) Quality, not quantity: the role of natural products and chemical proteomics in modern drug discovery Comb. Chem. High Throughput Screen, . 7, 607–630[Medline] .

  5. Pommier, Y. and Cherfils, J. (2005) Interfacial inhibition of macromolecular interactions: nature's paradigm for drug discovery Trends Pharmacol. Sci, . 26, 138–145[CrossRef][Medline] .

  6. Feher, M. and Schmidt, J.M. (2003) Property distributions: differences between drugs, natural products, and molecules from combinatorial chemistry J. Chem. Inf. Comput. Sci, . 43, 218–227[CrossRef][Web of Science][Medline] .

  7. Vuorelaa, P., Leinonenb, M., Saikkuc, P., Tammelaa, P., Rauhad, J.P., Wennberge, T., Vuorela, H. (2004) Natural products in the process of finding new drug candidates Curr. Med. Chem, . 11, 1375–1389[Web of Science][Medline] .

  8. Koehn, F.E. and Carter, G.T. (2005) The evolving role of natural products in drug discovery Nature Rev. Drug. Discov, . 4, 206–220[CrossRef][Web of Science][Medline] .

  9. Newman, D.J., Cragg, G.M., Snader, K.M. (2003) Natural products as sources of new drugs over the period 1981–2002 J. Nature Prod, . 66, 1022–1037 .

  10. Newman, D.J. and Cragg, G.M. (2004) Advanced preclinical and clinical trials of natural products and related compounds from marine sources Curr. Med. Chem, . 11, 1693–1713[Medline] .

  11. Wessjohann, L.A., Ruijter, E., Garcia-Rivera, D., Brandt, W. (2005) What can a chemist learn from nature's macrocycles?—a brief, conceptual view Mol. Divers, 9, 171–186[Medline] .

  12. Qiao, X., Hou, T., Zhang, W., Guo, S., Xu, X. (2002) A 3D structure database of components from Chinese traditional medicinal herbs J. Chem. Inf. Comput. Sci, . 42, 481–489[Medline] .

  13. Lei, J. and Zhou, J. (2002) A marine natural product database J. Chem. Inf. Comput. Sci, . 42, 742–748[Medline] .

  14. Fang, X., Shao, L., Zhang, H., Wang, S. (2005) CHMIS-C: a comprehensive herbal medicine information system for cancer J. Med. Chem, . 48, 1481–1488[Medline] .

  15. Delaney, J.S. (1996) Assessing the ability of chemical similarity measures to discriminate between active and inactive compounds Mol. Divers, 1, 217–222[CrossRef][Web of Science][Medline] .

  16. Martin, Y.C., Kofron, J.L., Traphagen, L.M. (2002) Do structurally similar molecules have similar biological activity? J. Med. Chem, . 45, 4350–4358[CrossRef][Web of Science][Medline] .

  17. Thimm, M., Goede, A., Hougardy, S., Preissner, R. (2004) Comparison of 2D similarity and 3D superposition. Application to searching a conformational drug database J. Chem. Inf. Comput. Sci, . 44, 1816–1822[Medline] .

  18. Goede, A., Dunkel, M., Mester, N., Frommel, C., Preissner, R. (2005) SuperDrug: a conformational drug database Bioinformatics, 21, 1751–1753[Abstract/Free Full Text] .

  19. Willet, P., Barnard, J.M., Downs, G.M. (1998) Chemical similarity searching J. Chem. Inf. Comput. Sci, . 38, 983–996[CrossRef] .

  20. In Johnson, M.A. and Maggiora, G.M. (Eds.). Concepts and Applications of Molecular Similarity, (1990) NY Wiley .

  21. Butler, M.S. (2005) Natural products to drugs: natural product derived compounds in clinical trials Nature Prod. Rep, . 22, 162–195[CrossRef] .

  22. Datry, A. and Bart-Delabesse, E. (2005) Caspofungin: mode of action and therapeutic applications Rev. Med. Interne, . in press .

  23. Gupta, Y.K., Gupta, M., Aneja, S., Kohli, K. (2004) Current drug therapy of protozoal diarrhoea Indian J. Pediatr, . 71, 55–58[Medline] .

  24. Rathore, D., McCutchan, T.F., Sullivan, M., Kumar, S. (2005) Antimalarial drugs: current status and new developments Expert Opin. Investig. Drugs, 14, 871–883[Medline] .

  25. WHO. (2003) The selection and use of essential medicines. Report of the WHO Expert Committee, 2002 (including the 12th Model list of essential medicines) World Health Organ. Tech. Rep. Ser, . 914, 1–126 .


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Antimicrob. Agents Chemother.Home page
L. Yang, M. T. Rybtke, T. H. Jakobsen, M. Hentzer, T. Bjarnsholt, M. Givskov, and T. Tolker-Nielsen
Computer-Aided Identification of Recognized Drugs as Pseudomonas aeruginosa Quorum-Sensing Inhibitors
Antimicrob. Agents Chemother., June 1, 2009; 53(6): 2432 - 2443.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
U. Schmidt, S. Struck, B. Gruening, J. Hossbach, I. S. Jaeger, R. Parol, U. Lindequist, E. Teuscher, and R. Preissner
SuperToxic: a comprehensive database of toxic compounds
Nucleic Acids Res., January 1, 2009; 37(suppl_1): D295 - D299.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. L. Lopez-Perez, R. Theron, E. del Olmo, and D. Diaz
NAPROC-13: a database for the dereplication of natural product mixtures in bioassay-guided protocols
Bioinformatics, December 1, 2007; 23(23): 3256 - 3257.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Print PDF (874K) Freely available
Right arrow Screen PDF (261K) Freely available
Right arrow Supplementary Material
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Dunkel, M.
Right arrow Articles by Preissner, R.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Dunkel, M.
Right arrow Articles by Preissner, R.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?