Nucleic Acids Research, 2003, Vol. 31, No. 1 262-265
© 2003 Oxford University Press
The University of Minnesota Biocatalysis/Biodegradation Database: post-genomic data mining
Environmental Molecular Science Center, University of Minnesota, Minneapolis, MN 55455, USA 1 Department of Laboratory Medicine and Pathology, University of Minnesota, Minneapolis, MN 55455, USA 2 Biological Technology Institute, University of Minnesota, St Paul, MN 55108, USA
*To whom correspondence should be addressed. Tel: +1 6126259122; Fax: +1 6126251121; Email: lynda{at}tc.umn.edu
Present address: Lynda B. M. Ellis, Mayo Mail Code 511, 420 SE Deleware Street, Minneapolis, MN 55455, USA
Received September 13, 2002; Accepted September 24, 2002
ABSTRACT
The University of Minnesota Biocatalysis/Biodegradation Database (UM-BBD, http://umbbd.ahc.umn.edu/) provides curated information on microbial catabolism and related biotransformations, primarily for environmental pollutants. Currently, it contains information on over 130 metabolic pathways, 800 reactions, 750 compounds and 500 enzymes. In the past two years, it has increased its breath to include more examples of microbial metabolism of metals and metalloids; and expanded the types of information it includes to contain microbial biotransformations of, and binding interactions with many chemical elements. It has also increased the ways in which this data can be accessed (mined). Structure-based searching was added, for exact matches, similarity, or substructures. Analysis of UM-BBD reactions has lead to a prototype, guided, pathway prediction system. Guided prediction means that the user is shown all possible biotransformations at each step and guides the process to its conclusion. Mining the UM-BBD's data provides a unique view into how the microbial world recycles organic functional groups. UM-BBD users are encouraged to comment on all aspects of the database, including the information it contains and the tools by which it can be mined. The database and prediction system develop under the direction of the scientific community.
INTRODUCTION
As the University of Minnesota Biocatalysis/Biodegradation Database (UM-BBD, URL=http://umbbd.ahc.umn.edu/) begins its eighth year of operation, the world is entering the post-genomic era. Moving its focus from genome sequencing, the molecular biology community is actively mining genomic, proteomic and metabolomic data, to better understand how organisms function at the molecular level. The UM-BBD has at last matured enough for such systematic analysis. The past two years have seen many changes to it besides an increase in database size. These include a change in its underlying database management system, additional ways to access the data and an increase in the types of information it contains. Additional changes will be made in the coming two years. These are intended to facilitate mining of UM-BBD data by the scientific community and are discussed in more detail below.
PRESENT STATUS
UM-BBD data content and methods, including data format, update and access, have been previously reported (13). The UM-BBD URL was changed to http://umbbd.ahc.umn.edu/ in early 2001. At the same time, all compound and reaction pages were reviewed and updated. In mid-2002, the database was transferred to MySQL (4). By the end of 2002, the UM-BBD will have grown to contain information on over 130 metabolic pathways, 800 reactions, 750 compounds and 500 enzymes.
Emphasis on elements
The US Department of Energy now supports the growth of the UM-BBD through its Microbial Genome Program (5). Under their sponsorship, we will be adding more examples of microbial biotransformations of metals, metalloids and metal chelators. A list of such pathways now in the UM-BBD is shown in Table 1.
|
The UM-BBD has traditionally focused on the metabolism of organic compounds, with some depiction of the organometals, organometalloids and toxic heavy metals listed in Table 1. However, microbes interact with and are increasingly discovered to transform, many of the chemical elements that constitute the earth. A recent UM-BBD project depicts the chemical elements using a periodic chart that organizes elements into a framework consistent with their use and transformation by microbes. The six major elements of a microbial cell are H, O, C, N, P and S. Microbial cells almost universally use the main group elements Na, K, Ca, Mg, Cl, Se, Zn and the transition metals V, Mo,W, Fe, Mn, Co, Ni and Cu. Increasingly, specialized microorganisms have been discovered to transform radionuclides, such as uranium and technetium; precious metals such as silver and gold; and metalloid compounds such as tellurium oxyanions. These transformations or binding interactions are now included in the UM-BBD, with a convenient access from periodic charts that organize and provide a common entry point to this information (URL=http://umbbd.ahc.umn.edu/periodic/). If one or more UM-BBD compounds contain the element, a link to a dynamic list of them is available on each element page. An example page for the element arsenic is shown in Figure 1.
|
Paths to prediction
The ability to accurately predict biodegradation has enormous implications. Industries will continue to synthesize new materials faster than they, or regulatory agencies and academic researchers, can study their environmental fate. Many companies already invest significant resources to predict biodegradation of their new compounds to avoid commercialization of materials that are later found to be dangerous and subsequently need to be withdrawn from commerce.
To start on the path to prediction tools, structure-based searching was added to the UM-BBD in early 2002, based on the JChem toolkit from ChemAxon, Ltd (6). A user can either draw the compound to be searched for using a Java Applet or enter its SMILES string, then choose the type of structure search (similarity, substructure or exact) to do, and the similarity threshold to use. The structures of matching UM-BBD compounds are returned. Clicking on a returned structure leads to the corresponding UM-BBD compound page. From a compound page, one can go to UM-BBD reactions of, and biodegradation pathways for, the compound. This is a first step towards predicting a biodegradation pathway for compounds the UM-BBD does not contain, since the biodegradation pathways for UM-BBD compounds that are similar to the query compound can suggest ways the query might be degraded.
The next step in the prediction path is to analyze the reactions used by microorganisms to catabolically transform organic functions groups, identify the functional groups in an organic molecule representation, and transform each functional group, one at a time. This is guided prediction, since the user sees all possible transformations at each step and guides the process to its conclusion. Challenges include the implementation of transformation rules and incorporating functional group interactions.
A prototype system with 40 rules (approximately one-third of those initially planned for the full system) was added to the UM-BBD in late 2002. A pathway prediction is shown in Figure 2; the first 10 rules are shown in Table 2. The JChem toolkit (6) and JOElib library (URL=http://joelib.sourceforge.net/) were used in construction of this system. With input from the scientific community, future work includes development and testing of additional transformation rules and determining how functional group interactions change rules.
|
|
CONCLUSIONS
Mining the UM-BBD's data provides a unique view into how the microbial world recycles organic functional groups. Similar to the way UM-BBD users are encouraged to recommend pathways to be added to the database, they are also encouraged to submit additional information on biotransformations of elements and to test the prediction program and comment on individual prediction rules. In this way, the database and prediction system grow under the direction of the scientific community.
ACKNOWLEDGEMENTS
This research was supported by the Office of Science (BER), U.S. Department of Energy, Grant No. DE-FG02-01ER63268. We thank Prasad Kotharu for assistance with the database and John Carlis, Tony Dodge, Steve Toeniskoetter and Jennifer Dommer for helpful discussions and other work on the biochemical periodic table.
REFERENCES
- Ellis,L.B.M., Hershberger,C.D. and Wackett,L.P. (1999) The University of Minnesota Biocatalysis/Biodegradation Database: specialized metabolism for functional genomics. Nucleic Acids Res., 27, 373376.
[Abstract/Free Full Text] - Ellis,L.B.M., Hershberger,C.D. and Wackett,L.P. (2000) The University of Minnesota Biocatalysis/Biodegradation Database: microorganisms, genomics and prediction. Nucleic Acids Res., 28, 377379.
[Abstract/Free Full Text] - Ellis,L.B.M., Hershberger,C.D., Bryan,E. and Wackett,L.P. (2001) The University of Minnesota Biocatalysis/Biodegradation Database: emphasizing enzymes. Nucleic Acids Res., 29, 340343.
[Abstract/Free Full Text] - MySQL., Inc., 2510 Fairview Ave. East, Seattle, WA 98102.
- US Department of Energy Microbial Genome Program, Oak Ridge National Laboratory (ORNL), 1060 Commerce Park MS 6480, Oak Ridge, TN 37830.
- Csizmadia,F. (2000) JChem: Java Applets and Modules Supporting Chemical Database Handling from Web Browsers. J. Chem. Inform. Comput. Sci., 40, 323324.[CrossRef][Web of Science][Medline]
This article has been cited by other articles:
![]() |
J. Gao, L. B. M. Ellis, and L. P. Wackett The University of Minnesota Biocatalysis/Biodegradation Database: improving public access Nucleic Acids Res., September 18, 2009; (2009) gkp771v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. M. Hearn, D. R. Patel, and B. van den Berg Outer-membrane transport of aromatic hydrocarbons as a first step in biodegradation PNAS, June 24, 2008; 105(25): 8601 - 8606. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. Martinez-Perez, A. Lopez-Sanchez, F. Reyes-Ramirez, B. Floriano, and E. Santero Integrated Response to Inducers by Communication between a Catabolic Pathway and Its Regulatory System J. Bacteriol., May 15, 2007; 189(10): 3768 - 3775. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. E. Smith, A. P. Schwab, and M. K. Banks Phytoremediation of Polychlorinated Biphenyl (PCB)-Contaminated Sediment: A Greenhouse Feasibility Study J. Environ. Qual., January 9, 2007; 36(1): 239 - 244. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. B. M. Ellis, D. Roe, and L. P. Wackett The University of Minnesota Biocatalysis/Biodegradation Database: the first decade Nucleic Acids Res., January 1, 2006; 34(suppl_1): D517 - D521. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Pazos, D. Guijas, A. Valencia, and V. De Lorenzo MetaRouter: bioinformatics for bioremediation Nucleic Acids Res., January 1, 2005; 33(suppl_1): D588 - D592. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Pharkya, A. P. Burgard, and C. D. Maranas OptStrain: A computational framework for redesign of microbial production systems Genome Res., November 1, 2004; 14(11): 2367 - 2376. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Tropel and J. R. van der Meer Bacterial Transcriptional Regulators for Degradation Pathways of Aromatic Compounds Microbiol. Mol. Biol. Rev., September 1, 2004; 68(3): 474 - 500. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. P. Wackett, A. G. Dodge, and L. B. M. Ellis Microbial Genomics and the Periodic Table Appl. Envir. Microbiol., February 1, 2004; 70(2): 647 - 655. [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||









