Nucleic Acids Research, 2000, Vol. 28, No. 1 1-7
© 2000 Oxford University Press
The Molecular Biology Database Collection: an online compilation of relevant database resources
Genome Technology Branch, National Human Genome Research Institute, National Institutes of Health, Building 49, Room 4A-22, Bethesda, MD 20892-4470, USA
Received October 26, 1999; Accepted October 27, 1999.
| ABSTRACT |
|---|
|
|
|---|
The Molecular Biology Database Collection represents an effort geared at making molecular biology database resources more accessible to biologists. This online resource, available at http://www.oup.co.uk/nar/Volume_28/Issue_01/html/gkd115_gml.html, is intended to serve as a searchable, up-to-date, centralized jumping-off point to individual Web sites. An emphasis has also been placed on including databases where new value is added to the underlying data by virtue of curation, new data connections, or other innovative approaches.
With systematic sequencing efforts on the human and other model organisms in full swing, a tremendous flood of sequence data is being produced at breakneck speed. Considering the Human Genome Project alone, it is currently estimated that over 2 million bases are deposited into GenBank each day. This growth will only accelerate in the future, given the new goal of having a working draft covering 90% of the genome by the spring of 2000 and the complete sequence in-hand in 2002 (1). When considering all organismal sequencing efforts together, it immediately becomes obvious that there must be concrete and coherent plans in place to both warehouse and access these data, data which will become more and more important as we enter into an era of sequence-based biology.
The most recent five-year plan for the Human Genome Project (1) recognizes the importance of the connection between the in silico world and the bench, and has put forth recommendations regarding bioinformatics and computational biology, particularly with respect to databases and the development of analytical tools. Databases that grow out of the human and other systematic sequencing efforts should not be just storehouses for thousands of bases or amino acids; rather, they need to make logical connections to other types of information that are available, such as phenotypic or expression data, as these types of connections serve to increase the intrinsic value of the raw sequence data, allowing for biological discovery. Of particular importance will be the curation and annotation of specialized databases, which will help to ensure data quality as well as foster the ability to access and analyze sequence data.
Many such specialized databases have emerged over the years, and this journal has devoted its first issue over the last several years to documenting the availability and features of these databases. With the current issue, Nucleic Acids Research will be facilitating access to these databases online through a new entity called the Molecular Biology Database Collection. By doing so, it is hoped that individual investigators will be able to more easily find and use specialized databases that are appropriate to their scientific needs.
The initial set of databases included in the online compilation, available at http://www.oup.co.uk/nar/Volume_28/Issue_01/html/gkd115_gml.html, is shown in Table 1. The list includes databases for which there are full-length descriptions in the current issue, as well as other databases which represent a valuable resource to the biological community. Also included are databases collected by Christian Burks as part of the 1999 Database Issue (2). Efforts have been made to reduce redundancy in the list with respect to content wherever possible. An emphasis has also been placed on including databases where new value is added to the underlying data by virtue of curation, new data connections, or other innovative approaches. While Table 1 attempts to classify these databases by type, it is often true that these databases are more global in nature, providing multiple types of information. Links to these databases will be updated regularly. Suggestions for the inclusion of additional database resources in this online compendium are encouraged and may be directed to the author (andy@nhgri.nih.gov).
|
| FOOTNOTES |
|---|
* Tel: +1 301 496 8570; Fax: +1 301 402 6858; Email: andy@nhgri.nih.gov
| REFERENCES |
|---|
|
|
|---|
-
1 Collins,F.S., Patrinos,A., Jordan,E., Chakravarti,A., Gesteland,R., Walters,L. and members of the DOE and NIH Planning Groups (1998) Science, 282, 682689.
2 Burks,C. (1999) Nucleic Acids Res., 27, 19.
This article has been cited by other articles:
![]() |
Q. Feng, S. E. Hawes, J. E. Stern, A. Dem, P. S. Sow, B. Dembele, P. Toure, P. Sova, P. W. Laird, and N. B. Kiviat Promoter Hypermethylation of Tumor Suppressor Genes in Urine from Patients with Cervical Neoplasia Cancer Epidemiol. Biomarkers Prev., June 1, 2007; 16(6): 1178 - 1184. [Abstract] [Full Text] [PDF] |
||||
![]() |
W.-C. Suen, N. Zhang, L. Xiao, V. Madison, and A. Zaks Improved activity and thermostability of Candida antarctica lipase B by DNA family shuffling Protein Eng. Des. Sel., February 1, 2004; 17(2): 133 - 140. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y.-J. Hu Prediction of consensus structural motifs in a family of coregulated RNA sequences Nucleic Acids Res., September 1, 2002; 30(17): 3886 - 3893. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. A. Coronella, P. Telleman, G. A. Kingsbury, T. D. Truong, S. Hays, and R. P. Junghans Evidence for an Antigen-driven Humoral Immune Response in Medullary Ductal Breast Cancer Cancer Res., November 1, 2001; 61(21): 7889 - 7899. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Garcia-Martinez, I. Bescos, J. J. Rodriguez-Sala, and F. Rodriguez-Valera RISSC: a novel database for ribosomal 16S-23S RNA genes spacer regions Nucleic Acids Res., January 1, 2001; 29(1): 178 - 180. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||



