| Nucleic Acids Research | Pages |
The University of Minnesota Biocatalysis/Biodegradation Database: specialized metabolism for functional genomics
Introduction
Database Content And Methods
Database format
Database update
Data access
Applications
Future directions
Conclusions
Acknowledgements
References
The University of Minnesota Biocatalysis/Biodegradation Database: specialized metabolism for functional genomics
ABSTRACT
INTRODUCTION
Current genome projects are generating a staggering number of gene sequences. GenBank (1; http://www.ncbi.nlm.nih.gov/Web/Genbank/index.html ) contained 2.4 × 106 sequences in June 1998 and is growing exponentially, with a doubling time of1.5 years (National Center for Biotechnology Information News,February, 1998, http://www.ncbi.nlm.nih.gov/Web/Newsltr/feb98.html ). While presently workers struggle just to store this massive amount of information, the next major challenge will be functional genomics-determining each gene's function. Functional genomics is relatively straightforward when dealing with genes which code for enzymes involved in intermediary metabolism, the metabolism common to most organisms, such as glycolysis and the citric acid cycle. There are several intermediary metabolism databases such as KEGG (2; http://www.genome.ad.jp/kegg/kegg.html ), many homologues for each coding sequence, and the metabolic function of each is well-studied. Functional genomics is more of a challenge when dealing with specialized metabolism. Forty-four percent of the coding sequences in the Escherichia coli genome have functions which are presently unknown (3); a large fraction of these may be involved in specialized metabolism.
The microbial world which includes E.coli flourishes in part due to its specialized catabolic metabolism. This permits microorganisms to gain energy from compounds which others cannot eat; for example, some Pseudomonas species can catabolize >1000 compounds. This catabolic metabolism is termed biodegradation. These esoteric feedstocks can include environmental pollutants, and microbes can thus be used to clean up pollution, a process called bioremediation. Specialized microbial catabolism can also be used synthetically, a process called biocatalysis. The University of Minnesota Biocatalysis/Biodegradation Database (UM-BBD, http://www.labmed.umn.edu/umbbd/index.html ) started on the web in 1995 to provide information on the specialized microbial catabolic pathways and metabolic reactions which are important in biotechnology. Besides improving bioremediation efforts and biosynthetic processes, the UM-BBD is also a resource for functional genomics. UM-BBD users should cite this paper or its successors as its primary reference.
DATABASE CONTENT AND METHODS
The UM-BBD now contains information on >400 reactions and compounds and 300 enzymes, organized in 68 annotated metabolic pathways, for compounds from acrylonitrile to xylene. Figure
Figure 1. Graphical overview of UM-BBD content. The circled compounds lead to intermediary metabolism. The UM-BBD contains information on the metabolic reactions which funnel the other compounds (and others not shown) to the circles. Solid arrows indicate reactions carried out under aerobic conditions; dashed arrows, under anaerobic conditions. UM-BBD information for the biodegradation of acrylonitrile, the boxed compound on the left, is shown in more detail in Figure 2. URL, http://www.labmed.umn.edu/umbbd/meta/meta_map.html Figure 2. Example UM-BBD pathway information. (A) Graphical pathway map for the acrylonitrile pathway; (B) reaction page for the nitrile hydrolase reaction; (C) reaction mechanism graphic for nitrile hydrolase; (D) compound page for acrylonitrile; () reaction graphic for nitrile hydrolase. URL, http://www.labmed.umn.edu/umbbd/acr/acr_map.html The UM-BBD was prototyped in HTML form. When usage demonstrated that its format was used and useful, its information was transferred to the Compounds, Organisms, Reactions and Enzyme (CORE) database management system, written in Java (4). CORE contains the information needed to generate reaction and compound pages and can dynamically generate pathway maps starting from any UM-BBD reaction (Fig. The UM-BBD is updated at least monthly. Updates include new pathways, new reactions added to existing pathways, new search tools, and other features. The pathways to be added are usually based on the scientific literature and are occasionally suggested by our users. When a contribution is based on unpublished work, our International Scientific Advisory Board [Ellis,L.B.M. (1998) UM-BBD Contributors Page. http://www.labmed.umn.edu/umbbd/contrib.html ] reviews the submission for scientific accuracy. Much of the data entry and graphics are done by database staff but some pathways are entered by distant learners enrolled in a Biocatalysis and Biodegradation course taught completely over the Internet [Ellis,L.B.M. and Wackett,L.P. (1998) BioC/MicE 5309, Biocatalysis and Biodegradation. http://www.cee.umn.edu/biodeg/ ]. The home page (http://www.labmed.umn.edu/umbbd/index.html ) contains a scrollable list of UM-BBD pathway maps, each of which links to compound and reaction pages. It also contains links to searchable lists of UM-BBD pathways, reactions, enzymes, compounds and microorganisms, and lists of pathway map and enzyme mechanism graphics. Other more specialized lists include UM-BBD reactions which have not been studied in enough detail to be assigned enzymes. Such a list can be used to define areas in specialized microbial metabolism where further work is needed. Compounds can be searched for by full or partial name, CAS Registry Number and chemical formula. Compound searches return links to UM-BBD compound pages and any UM-BBD reaction pages in which the compound is produced or consumed. Enzymes can be searched for by full or partial name and full or partial EC code. Enzyme searches return a list of links to UM-BBD reaction pages catalyzed by that enzyme or, if there is only one such reaction, link directly to that reaction page. Microorganism entries can be searched by full or partial name and return links to UM-BBD pathways that involve that microorganism. As mentioned earlier, the UM-BBD links to related databases. Some of these links are reciprocal and UM-BBD data can be accessed from those databases. Ligand Chemical Database at Kyoto University (5; http://www.genome.ad.jp/dbget/ligand.html ) links to UM-BBD enzyme pages; Entrez PubMed, Nucleotide and Protein Databases (6; http://www.ncbi.nlm.nih.gov/Entrez/ ) link to UM-BBD reaction pages; and ChemFinder fromCambridgeSoft, Inc. (7; http://chemfinder.camsoft.com/ ) links to UM-BBD compound pages. The UM-BBD can be used for understanding of basic biochemistry, biocatalysis leading to speciality chemical manufacture, and biodegradation of environmental pollutants. Based on the 1998 [Ellis,L.B.M. (1998) UM-BBD 1998 User Survey. URL, http://www.labmed.umn.edu/umbbd/stats/results3.html ] and earlier user surveys, UM-BBD information primarily supports pure and applied research. Industries increasingly need to know the environmental fate of their commercial chemicals, and this is largely governed by the metabolism of these chemicals by soil and water microorganisms. Commercial users have used our information as part of EPA reports; environmental law firms have used it to prepare their cases. Biotechnology companies increasingly turn to biocatalysis for new advances in speciality chemical manufacture. One example that predates the UM-BBD is the application of naphthalene dioxygenase, used naturally to catabolize polycyclic aromatic hydrocarbons, to produce the blue jean dye indigo in fermentation vessels (8). This is an example of the emerging field of green chemistry which will draw more heavily on specialized bacterial enzymes than on the more commonly studied enzymes of intermediary metabolism. The UM-BBD can also be used to predict biodegradation reactions, both practically and theoretically. As an example of the former, functional genomic analysis based on the UM-BBD is currently being used to predict the metabolism of Deinococcus radiodurans, an organism which is of interest because of its extreme resistance to ionizing radiation but whose metabolic pathways are poorly studied. Since numerous hazardous waste sites contain high level radioactivity and toxic organic waste, a radiation-resistant bacterium is needed for bioremediation purposes. Genome analysis of D.radiodurans revealed a paucity of biodegradative enzymes of the type found in soil Pseudomonas species and thus recombinant biodegradative genes were cloned and expressed in the organism (9). The UM-BBD was used both in the functional genomic analysis and in deciding which biodegradative genes to experimentally express in D.radiodurans. The theoretical prediction of biodegradation metabolism is also very important to industry and requires information of the type provided in the UM-BBD. The United States Environmental Protection Agency must decide annually on the environmental acceptability of perhaps 500 new compounds. In most cases, information on their pathways of biodegradation is lacking and too expensive and time-consuming to obtain experimentally. Thus, it is increasingly important to predict the biodegradative metabolism of new organic compounds based on known biodegradation reactions. This requires information databases such as the UM-BBD and expert system approaches whereby that knowledge can be used to predict new metabolism. This is in some sense the converse of the functional genomics problem in that the goal is determining the likelihood that microorganisms collectively will contain genes and enzymes to metabolize a given compound. We are beginning to predict biodegradation pathways in a project called Predict-BT [Wackett,L.P., Ellis,L.B.M., Speedie,S. and Hershberger,C.D. (1998) PredictBT: The University of Minnesota Predictive Biodegradation Project. URL, http://www.labmed.umn.edu/umbbd/predictbt/ ], building on the information contained in the UM-BBD, intermediary metabolism, and gene sequence databases. The UM-BBD now has ~400 reactions and compounds, a very small fraction of the 10 million organic compounds currently known. It will never include all compounds and that is not its goal. Instead it is to become a representative database of biodegradation, spanning known metabolic routes, organic functional groups metabolized, and environmental conditions under which biodegradation occurs. The next goal will be to use this information to predict the metabolism of compounds the UM-BBD does not contain.
Database format
Database update
Data access
Applications
Future directions
CONCLUSIONS
Functional genomics has the potential to decode the physiological meaning of an organism's genetic information. Its success will be limited without a greater knowledge, in experiment and representation, of the enzymes which mediate the more esoteric metabolism found in the bacterial world. The UM-BBD can assist in this and in fostering biotechnology and proper environmental stewardship.
ACKNOWLEDGEMENTS
We thank Jingfeng Feng and Stuart Speedie for helpful discussions. This work was supported in part by National Science Foundation Award 9630427, National Institutes of Health Award R01-GM56529 and an NIH Training Grant LM07041 traineeship to C.D.H.
REFERENCES
This article has been cited by other articles:
This page is run by Oxford University Press, Great Clarendon Street, Oxford OX2 6DP, as part of the OUP Journals
Comments and feedback: www-admin{at}oup.co.uk
Last modification: 9 Dec 1998
Copyright©Oxford University Press, 1998.
![]()
CiteULike
Connotea
Del.icio.us What's this?
![]()
![]()

![]()
![]()
![]()
J. Gao, L. B. M. Ellis, and L. P. Wackett
The University of Minnesota Biocatalysis/Biodegradation Database: improving public access
Nucleic Acids Res.,
September 18, 2009;
(2009)
gkp771v1.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
L. B. M. Ellis, D. Roe, and L. P. Wackett
The University of Minnesota Biocatalysis/Biodegradation Database: the first decade
Nucleic Acids Res.,
January 1, 2006;
34(suppl_1):
D517 - D521.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
D. Meyer, B. Witholt, and A. Schmid
Suitability of Recombinant Escherichia coli and Pseudomonas putida Strains for Selective Biotransformation of m-Nitrotoluene by Xylene Monooxygenase
Appl. Envir. Microbiol.,
November 1, 2005;
71(11):
6624 - 6632.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
L. B. M. Ellis, B. K. Hou, W. Kang, and L. P. Wackett
The University of Minnesota Biocatalysis/Biodegradation Database: post-genomic data mining
Nucleic Acids Res.,
January 1, 2003;
31(1):
262 - 265.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
L. B. M. Ellis, C. D. Hershberger, E. M. Bryan, and L. P. Wackett
The University of Minnesota Biocatalysis/Biodegradation Database: emphasizing enzymes
Nucleic Acids Res.,
January 1, 2001;
29(1):
340 - 343.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
L. B. M. Ellis, C. D. Hershberger, and L. P. Wackett
The University of Minnesota Biocatalysis/Biodegradation Database: microorganisms, genomics and prediction
Nucleic Acids Res.,
January 1, 2000;
28(1):
377 - 379.
[Abstract]
[Full Text]
[PDF]
![]()
This Article ![]()
![]()
Abstract
![]()
Print PDF (655K)
![]()
Alert me when this article is cited
![]()
Alert me if a correction is posted
![]()
Services ![]()
![]()
Email this article to a friend
![]()
Similar articles in this journal
![]()
Similar articles in ISI Web of Science
![]()
Similar articles in PubMed
![]()
Alert me to new issues of the journal
![]()
Add to My Personal Archive
![]()
Download to citation manager
![]()
Search for citing articles in:
ISI Web of Science (18)
![]()
Request Permissions ![]()
Commercial Re-use Guidelines
for Open Access NAR Content
![]()
Google Scholar ![]()
![]()
Articles by Ellis, L. B.
![]()
Articles by Wackett, L. P.
![]()
Search for Related Content
![]()
PubMed ![]()
![]()
PubMed Citation
![]()
Articles by Ellis, L. B.
![]()
Articles by Wackett, L. P.
![]()
Social Bookmarking ![]()
![]()
What's this?