Nucleic Acids Research, 2001, Vol. 29, No. 1 1-10
© 2001 Oxford University Press
The Molecular Biology Database Collection: an updated compilation of biological database resources
Genome Technology Branch, National Human Genome Research Institute, National Institutes of Health, Building 49, Room 4A-22, Bethesda, MD 20892-4470, USA
Received November 13, 2000; Revised and Accepted November 17, 2000.
| ABSTRACT |
|---|
|
|
|---|
The Molecular Biology Database Collection is an online resource listing key databases of value to the biological community. This Collection is intended to bring fellow scientists attention to high-quality databases that are available throughout the world, rather than just be a lengthy listing of all available databases. As such, this up-to-date listing is intended to serve as the initial point from which to find specialized databases that may be of use in biological research. The databases included in this Collection provide new value to the underlying data by virtue of curation, new data connections or other innovative approaches. Short, searchable summaries of each of the databases included in the Collection are available through the Nucleic Acids Research Web site, at http://www.nar.oupjournals.org.
With the advent of the new millennium, the scientific community marked a significant milestone in the study of biologythe completion of the working draft of the human genome (1). Amongst much fanfare, the completion of the working draft was announced by President Clinton at a White House ceremony on June 26, 2000 (http://www.whitehouse.gov/WH/New/html/20000626.html). This announcement signaled that the majority of biological and biomedical research would now be conducted in a sequence-based fashion. This new approach, long-awaited and much-debated, promises to quickly lead to advances not just in the understanding of basic biological processes, but in the prevention, diagnosis and treatment of many genetic and genomic disorders. While the fruits of sequencing the human genome may not be known or appreciated for another hundred years, the implications to the basic way in which medicine will be practised in the future is staggering.
At the time of writing of this paper, the International Human Genome Sequencing Consortium had fully finished 24.7% of the human sequence, with another 66.2% of the sequence being available in draft form. In the course of this sequencing, two of the human chromosomes have been finished, namely chromosomes 21 and 22 (2,3). Even with most of the chromosomes incomplete, some interesting insights have already been made into the structure of the human genome, such as a decided down-estimate in the number of genes actually in the human genome. While most of the attention of the scientific community and the public at large has focused on the human sequence, a number of model organisms have also been sequenced, including that of the fruit fly (Drosophila melanogaster) in 2000 (4); the complete genomes of organisms such as the rat and the mouse will quickly follow over the next several years. Efforts are also focused on sequence variation, with the SNP Consortium anticipating the identification of a million single nucleotide polymorphisms (SNPs) by the end of 2000, far ahead of the initial goal of discovering 100 000 SNPs by 2003 (1).
Database efforts have kept pace with the furious rate at which this sequence data is being generated, providing investigators access to all public data in a practically instantaneous fashion (5). While most biologists are familiar with the databases comprising the International Nucleotide Sequence Database Collaboration (DDBJ, EMBL and GenBank), numerous other specialized databases have emerged. These specialized databases often arise out of a particular need, whether it be to address a particular biological question of interest or to better serve a particular segment of the biological community. This journal has devoted its first issue over the last several years to documenting the availability and features of these specialized databases in order to better serve its readership and to promote the use of these resources in the design and analysis of experiments. These reviewed databases are collectively listed in the Molecular Biology Database Collection.
The databases included in the current version of the Collection are shown in Table 1. This year, 55 new entries have been added, bringing the total number of databases listed to 281. While this number may seem large for a curated collection, these databases distinguish themselves by their approach to presenting the underlying datafor example, by adding new value to the underlying data by virtue of curation, by providing new types of data connections or by implementing other innovative approaches facilitating biological discovery. The individual entries are classified by type, but the reader should recognize that the distinctions between these classes are often arbitrary, and that many of these databases provide more than one type of information to the user.
|
In addition to the list presented in this paper, an electronic version of the Database Issue and Collection can be accessed online and is freely available to everyone, regardless of subscription status, at http://www.nar.oupjournals.org. While the list contains the databases described in the papers comprising the current issue, it should be immediately apparent to the reader that there are simply not enough pages in this journal to accommodate full-length, printed descriptions of all 281 of the databases featured here. To address this, the online version of the Collection now includes short summaries of many of the databases, the summaries having been provided directly by the investigators responsible for the individual databases. It is hoped that this approach will provide the reader with an additional source of information that will facilitate finding and selecting the sources of data that would be of most value in addressing a specific biological problem. Contributors will be encouraged to keep their entries up-to-date, as the online descriptions will be updated on a regular basis.
Suggestions for the inclusion of additional database resources in this Collection are encouraged and may be directed to the author (andy{at}nhgri.nih.gov).
| ACKNOWLEDGEMENT |
|---|
I wish to thank Yi-Chi Barash for designing the Web-based submission tool for this Collection as well as for her technical support.
| FOOTNOTES |
|---|
* Tel: +1 301 496 8570; Fax: +1 301 402 6858; Email: andy{at}nhgri.nih.gov
| REFERENCES |
|---|
|
|
|---|
-
1 Collins,F.S., Patrinos,A., Jordan,E., Chakravarti,A., Gesteland,R., Walters,L. and members of the DOE and NIH Planning Groups (1998) New goals for the U.S. Human Genome Project: 19982003. Science, 282, 682689.
2 Hattori,M., Fujiyama,A., Taylor,T.D., Watanabe,H., Yada,T., Park,H.S., Toyoda,A., Ishii,K., Totoki,Y., Choi,D.K. et al. (2000) The DNA sequence of human chromosome 21. The chromosome 21 mapping and sequencing consortium. Nature, 405, 311319.[Medline]
3 Dunham,I, Shimizu,N., Roe,B.A., Chissoe,S., Hunt,A.R., Collins,J.E., Bruskiewich,R., Beare,D.M., Clamp,M., Smink,L.J. et al. (1999) The DNA sequence of human chromosome 22. Nature, 402, 489495.[Medline]
4 Adams,M.D., Celniker,S.E., Holt,R.A., Evans,C.A., Gocayne,J.D., Amanatides,P.G., Scherer,S.E., Li,P.W., Hoskins,R.A., Galle,R.F. et al. (2000) The genome sequence of Drosophila melanogaster. Science, 287, 21852195.
5 Guyer,M.S. (1998) Statement on the rapid release of genomic DNA sequence. Genome Res., 8, 413.
This article has been cited by other articles:
![]() |
T. Mijatovic, A. Op De Beeck, E. Van Quaquebeke, J. Dewelle, F. Darro, Y. de Launoit, and R. Kiss The cardenolide UNBS1450 is able to deactivate nuclear factor {kappa}B-mediated cytoprotective effects in human non-small cell lung cancer cells. Mol. Cancer Ther., February 1, 2006; 5(2): 391 - 399. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Rojas-Cartagena, C. B. Appleyard, O. I. Santiago, and I. Flores Experimental Intestinal Endometriosis Is Characterized by Increased Levels of Soluble TNFRSF1B and Downregulation of Tnfrsf1a and Tnfrsf1b Gene Expression Biol Reprod, December 1, 2005; 73(6): 1211 - 1218. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. S. Hittel, W. E. Kraus, C. J. Tanner, J. A. Houmard, and E. P. Hoffman Exercise training increases electron and substrate shuttling proteins in muscle of overweight men and women with the metabolic syndrome J Appl Physiol, January 1, 2005; 98(1): 168 - 179. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Lefranc, T. Mijatovic, V. Mathieu, S. Rorive, C. Decaestecker, O. Debeir, J. Brotchi, P. Van Ham, I. Salmon, and R. Kiss Characterization of Gastrin-Induced Proangiogenic Effects In vivo in Orthotopic U373 Experimental Human Glioblastomas and In vitro in Human Umbilical Vein Endothelial Cells Clin. Cancer Res., December 15, 2004; 10(24): 8250 - 8265. [Abstract] [Full Text] [PDF] |
||||
![]() |
W.-M. Boon, T. Beissbarth, L. Hyde, G. Smyth, J. Gunnersen, D. A. Denton, H. Scott, and S.-S. Tan A comparative analysis of transcribed genes in the mouse hypothalamus and neocortex reveals chromosomal clustering PNAS, October 12, 2004; 101(41): 14972 - 14977. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Banan, L. J. Zhang, M. Shaikh, J. Z. Fields, A. Farhadi, and A. Keshavarzian Novel effect of NF-{kappa}B activation: carbonylation and nitration injury to cytoskeleton and disruption of monolayer barrier in intestinal epithelium Am J Physiol Cell Physiol, October 1, 2004; 287(4): C1139 - C1151. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Gellhaus, X. Dong, S. Propson, K. Maass, L. Klein-Hitpass, M. Kibschull, O. Traub, K. Willecke, B. Perbal, S. J. Lye, et al. Connexin43 Interacts with NOV: A POSSIBLE MECHANISM FOR NEGATIVE REGULATION OF CELL GROWTH IN CHORIOCARCINOMA CELLS J. Biol. Chem., August 27, 2004; 279(35): 36931 - 36942. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. E. Rogler, T. Tchaikovskaya, R. Norel, A. Massimi, C. Plescia, E. Rubashevsky, P. Siebert, and L. E. Rogler RNA expression microarrays (REMs), a high-throughput method to measure differences in gene expression in diverse biological samples Nucleic Acids Res., August 25, 2004; 32(15): e120 - e120. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Mocellin, E. Wang, M. Panelli, P. Pilati, and F. M. Marincola DNA Array-Based Gene Profiling in Tumor Immunology Clin. Cancer Res., July 15, 2004; 10(14): 4597 - 4606. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Banan, J. Z. Fields, L. J. Zhang, M. Shaikh, A. Farhadi, and A. Keshavarzian {zeta} Isoform of Protein Kinase C Prevents Oxidant-Induced Nuclear Factor-{kappa}B Activation and I-{kappa}B{alpha} Degradation: A Fundamental Mechanism for Epidermal Growth Factor Protection of the Microtubule Cytoskeleton and Intestinal Barrier Integrity J. Pharmacol. Exp. Ther., October 1, 2003; 307(1): 53 - 66. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Banan, A. Farhadi, J. Z. Fields, E. Mutlu, L. Zhang, and A. Keshavarzian Evidence That Nuclear Factor-{kappa}B Activation Is Critical in Oxidant-Induced Disruption of the Microtubule Cytoskeleton and Barrier Integrity and That Its Inactivation Is Essential in Epidermal Growth Factor-Mediated Protection of the Monolayers of Intestinal Epithelia J. Pharmacol. Exp. Ther., July 1, 2003; 306(1): 13 - 28. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Fu, E. J. Campbell, T. G. Shepherd, and M. W. Nachtigal Epigenetic Regulation of Proprotein Convertase PACE4 Gene Expression in Human Ovarian Cancer Cells Mol. Cancer Res., June 1, 2003; 1(8): 569 - 576. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. A. Irizarry, B. M. Bolstad, F. Collin, L. M. Cope, B. Hobbs, and T. P. Speed Summaries of Affymetrix GeneChip probe level data Nucleic Acids Res., February 15, 2003; 31(4): e15 - e15. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Rosato and C. P. Kyriacou Origins of Circadian Rhythmicity J Biol Rhythms, December 1, 2002; 17(6): 506 - 511. [PDF] |
||||
![]() |
A. Spector, D. Li, W. Ma, F. Sun, and P. Pavlidis Differential Amplification of Gene Expression in Lens Cell Lines Conditioned to Survive Peroxide Stress Invest. Ophthalmol. Vis. Sci., October 1, 2002; 43(10): 3251 - 3264. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Guan, J. Stege, M. Kim, Z. Dahmani, N. Fan, P. Heifetz, C. F. Barbas III, and S. P. Briggs Heritable endogenous gene regulation in plants with designed polydactyl zinc finger transcription factors PNAS, October 1, 2002; 99(20): 13296 - 13301. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. H. Selzman, S. A. Miller, M. A. Zimmerman, F. Gamboni-Robertson, A. H. Harken, and A. Banerjee Monocyte chemotactic protein-1 directly induces human vascular smooth muscle proliferation Am J Physiol Heart Circ Physiol, October 1, 2002; 283(4): H1455 - H1461. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. K. Park, L.-X. Wang, and S. Roseman Isolation of a Glucosamine-specific Kinase, a Unique Enzyme of Vibrio cholerae J. Biol. Chem., May 3, 2002; 277(18): 15573 - 15578. [Abstract] [Full Text] [PDF] |
||||
![]() |
M.-C. Daugeron, F. Mauxion, and B. Seraphin The yeast POP2 gene encodes a nuclease involved in mRNA deadenylation Nucleic Acids Res., June 15, 2001; 29(12): 2448 - 2455. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||












