Nucleic Acids Research, 2003, Vol. 31, No. 1 193-195
© 2003 Oxford University Press
MGD: the Mouse Genome Database
The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, USA
*To whom correspondence should be addressed. Tel: +1 2072886248; Fax: +1 2072886132; Email: jblake{at}informatics.jax.org
Current members of the Mouse Genome Database Group are R. M. Baldarelli, J. S. Beal, D. W. Bradt, D. L. Burkart, N. E. Butler, J. Campbell, T. Chu, L. E. Corbani, S. Cousins, H. J. Drabkin, D. Dahmen, K. Frazer, D. M. Garippa, C. W. Goldsmith, P. L. Grant, M. Lennon-Pierce, J. Lewis, I. Lu, C. M. Lutz, L. J. Maltais, P. Mani, L. M. McKenzie, L. Ni, J. E. Ormsby, A. Planchart, S. Ramachandran, D. J. Reed, D. R. Shaw, C. L. Smith, P. Szauter, P. Vanden Borre, L. Washburn and J. Winslow
Received September 23, 2002; Accepted September 27, 2002
ABSTRACT
The Mouse Genome Database (MGD) (http://www.informatics.jax.org) one component of a community database resource for the laboratory mouse, a key model organism for interpreting the human genome and for understanding human biology. MGD strives to provide an extensively integrated information resource with experimental details annotated from both literature and on-line genomic data sources. MGD curates and presents the consensus representation of genotype (sequence) to phenotype information including highly detailed information about genes and gene products. Primary foci of integration are through representations of relationships between genes, sequences and phenotypes. MGD collaborates with other bioinformatics groups to curate a definitive set of information about the laboratory mouse. Recent developments include a general implementation of database structures for controlled vocabularies and the integration of a phenotype classification system.
INTRODUCTION
The Mouse Genome Database (MGD) provides an integrated view of genetic and genomic information for the laboratory mouse (1). MGD contains information on mouse genes, genetic markers and genomic features as well as information on molecular segments (probes, primers, cDNA clones, BACs and YACs) mutant phenotypes, comparative mapping data, graphical displays of linkage, cytogenetic and physical maps, experimental mapping data, as well as strain distribution patterns for recombinant inbred strains (RIs) and cross haplotypes. MGD is updated daily (Table 1). Since it first became available on the WWW, MGD has continued to evolve, expanding its data coverage, improving data handling, and providing several new data manipulation and display tools.
|
MGD is one component of the Mouse Genome Informatics (MGI) database resource (http://www.informatics.jax.org) located at The Jackson Laboratory (http://www.jax.org). Other projects and resources that contribute to MGI include the Gene Expression Database (GXD) (2), the Mouse Genome Sequencing (MGS) (3) project and the Mouse Tumor Biology Database (MTB; http://www.informatics.jax.org/mtb) (4). The MGI consortium group participates actively in the development and implementation of the Gene Ontologies (GO) (www.geneontology.org) (5). MGI curators also collaborate extensively with SWISS-PROT (6) and with the LocusLink project at NCBI (7) to evaluate associations between genes and sequences for the mouse.
IMPROVEMENTS DURING 2002
Implementation of phenotype classifications
A broad, high-level set of phenotype terms have been developed and employed to classify phenotype data in MGD. This defined vocabulary of 105 terms can be used to search, group, compare and analyze phenotypes. These phenotype classification terms appear on the Alleles and Phenotypes Query Form (Fig. 1), and on the Genes and Marker Query Form. The complete list of terms and their accession IDs is also available by FTP. On each form, there is a link to the phenotype classification terms, complete with definitions and examples. Users of the MGI database can select one or more terms from the list to search for records associated with a particular phenotype, in combination with many other parameters on the forms. In addition, text-based searches for more specific phenotypic terms remain available.
|
A more comprehensive phenotype vocabulary continues to be developed by MGD staff and currently (September, 2002) contains over 1800 concepts. These terms are used to annotate mouse mutant phenotypes. Although these controlled terms are used to annotate mouse mutant phenotypes and can be viewed on allele detail pages, there currently is limited access to the full phenotype vocabulary as a query or analysis tool.
Improvements to the MGI : GO browser
The MGI GO Browser (http://www.informatics.jax.org/searches/GO_form.shtml) allows database users to access genes in MGI using functional annotation terms from the GO. This Browser was developed in conjunction with the GXD. A general database implementation within MGI for structured, controlled vocabularies enhances the search and recovery capabilities of this browser. The GO Browser can be accessed from gene detail or query pages as well as directly from the MGI menus. A GO Browser query returns a graph reflecting both parents and children of the query term and a link to all MGI associations with that term or any of the subterms.
Availability of MGI : GO files in various formats
MGI gene-to-GO annotations are updated daily. Various files for the MGI gene/markers with the GO associations are publicly available. These files are updated each time MGI submits a new gene association file to the GO web site (http://www.geneontology.org) and can be accessed on the MGI FTP server (ftp://www.informatics.jax.org/pub/informatics/reports/gene_association.mgi). A file of all the GO terms used by MGI in the annotation of genes and gene products is also available. MGI also provides a file to the GO database of MGI Gene : SWISS-PROT associations. This information is incorporated into the GO database and thus enables users to recover mouse sequence data as a result of a semantic search against the GO database (http://www.godatabase.org/cgi-bin/go.cgi).
OTHER INFORMATION
User input
MGD encourages user input into its gene and allele annotation efforts. On each gene detail and allele detail page, a clickable button (Your Input Welcome) brings the user to a web-based form for submitting updates to the information being viewed.
Mouse gene nomenclature
The MGD gene annotation group assigns unique symbols and names to mouse genes under the guidelines set by the International Committee on Standardized Genetic Nomenclature for mouse (http://www.informatics.jax.org/mgihome/nomen/index.shtml#mnrg) (8). Scientists can reserve symbols prior to publication using the electronic nomenclature submission form (http://www.informatics.jax.org/mgihome/nomen/nomen_submit_form.shtml) or by contacting the MGD nomenclature coordinator by email (nomen{at}informatics.jax.org).
Electronic data submission
Any type of data that MGD maintains can be submitted as an electronic contribution, although mapping data, polymorphisms, and mammalian homologies are currently the most common. Each electronic submission receives a permanent database accession ID. All data sets are associated with either an electronic submission reference or a published paper. MGD reference pages provide links to associated data sets.
Community outreach and user support
MGD provides extensive user support through online documentation and easy email or phone access to User Support Staff.
User Support WWW access, http://www.informatics.jax.org/mgihome/support/support.shtml; Email: mgi-help{at}informatics.jax.org; Tel: +1 2072886445; Fax: +1 2072886132.
Other outreach
MGI-LIST (http://www.informatics.jax.org/mgihome/lists/lists.shtml), is a moderated and active email bulletin board supported by the MGI Users Support group. Other outreach includes Online Tutorials and answers to Frequently Asked Questions, available at: http://www.informatics.jax.org/userdocs/helpdocs_menu.shtml. Lee Silver's book, Mouse Genetics, is now available in an electronic version at http://www.informatics.jax.org/silver/. The online version has been enhanced by linking genes and references to MGI and MEDLINE.
IMPLEMENTATION
MGD is implemented in the Sybase relational database system, version 12.5. A large set of CGI scripts and Java Servlets mediate the user's interaction with the database. For computational users, direct SQL access can be requested through User Support. User-requested database reports and a number of widely used data files (generated daily) are available on the FTP site (ftp://ftp.informatics.jax.org).
CITING MGD
The following citation format is suggested when referring to datasets specific to the MGD component of MGI : Mouse Genome Database (MGD), Mouse Genome Informatics, The Jackson Laboratory, Bar Harbor, Maine (URL: http://www.informatics.jax.org). [Type in date (month, year) when you retrieved the data cited.]
SUPPLEMENTARY MATERIAL
Supplementary Material is available at NAR Online.
ACKNOWLEDGEMENTS
MGD is supported by NIH/NHGRI grant HG00330. GO development and annotation efforts for MGI are supported by NIH/NHGRI grant HG02273.
REFERENCES
- Blake,J.A., Eppig,J.T., Richardson,J.E., Bult,C.J., Kadin,J.A. and Mouse Genome Database Group (2002) The Mouse Genome Database (MGD): the model organism database for the laboratory mouse. Nucleic Acids Res., 30, 113115.
[Abstract/Free Full Text] - Ringwald,M., Eppig,J.T., Begley,D.A., Corradi,J.P., McCright,I.J., Hayamizu,T.F., Hill,D.P., Kadin,J.A. and Richardson,J.E. (2001) The mouse gene expression database. Nucleic Acids Res., 29, 98101.
[Abstract/Free Full Text] - Denny,P. and Justice,M.J. (2000) Mouse as the measure of man? Trends Genet., 16, 283287.[CrossRef][Web of Science][Medline]
- Naf,D., Krupke,D.M., Sundberg,J.P., Eppig,J.T. and Bult,C.J. (2002) The Mouse Tumor Biology Database: a public resource for cancer genetics and pathology of the mouse. Cancer Res., 62, 12351240.
[Abstract/Free Full Text] - The Gene Ontology Consortium (2001) Creating the Gene Ontology Resource: design and implementation. Genome Res., 11, 14251433.
[Abstract/Free Full Text] - Bairoch,A. and Apweiler,R. (2000) The SWISS-PROT protein sequence database and its supplement TrEML in 2000. Nucleic Acid Res., 28, 4548.
[Abstract/Free Full Text] - Pruitt,K.D. and Maglott,D.R. (2001) RefSeq and LocusLink: NCBI gene-centered resources. Nucleic Acid Res., 29, 137140.
[Abstract/Free Full Text] - Maltais,L., Blake,J.A., Chu,T., Lutz,C.M., Eppig,J.T. and Jackson,I. (2002) Rules and guidelines for mouse gene, allele, and mutation nomenclature: a condensed version. Genomics, 79, 471474.[CrossRef][Web of Science][Medline]
This article has been cited by other articles:
![]() |
E. Ayroldi and C. Riccardi Glucocorticoid-induced leucine zipper (GILZ): a new important mediator of glucocorticoid action FASEB J, November 1, 2009; 23(11): 3649 - 3658. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Alizadeh, L. Z. Hong, C. B. Kaelin, T. Raudsepp, H. Manuel, and G. S. Barsh Genetics of Sex-linked yellow in the Syrian Hamster Genetics, April 1, 2009; 181(4): 1427 - 1436. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Pirooznia, T. Habib, E. J. Perkins, and Y. Deng GOfetcher: a database with complex searching facility for gene ontology Bioinformatics, November 1, 2008; 24(21): 2561 - 2563. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Yao and A. Rzhetsky Quantitative systems-level determinants of human genes targeted by successful drugs Genome Res., February 1, 2008; 18(2): 206 - 213. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Spudich, X. M. Fernandez-Suarez, and E. Birney Genome browsing with Ensembl: a practical overview Brief Funct Genomic Proteomic, October 29, 2007; (2007) elm025v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. J. Gaulton, K. L. Mohlke, and T. J. Vision A computational system to select candidate genes for complex human traits Bioinformatics, May 1, 2007; 23(9): 1132 - 1140. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. K. Gupta, N. Gao, R. K. Gorski, P. White, O. T. Hardy, K. Rafiq, J. E. Brestelli, G. Chen, C. J. Stoeckert Jr., and K. H. Kaestner Expansion of adult beta-cell mass in response to increased metabolic demand is dependent on HNF-4{alpha} Genes & Dev., April 1, 2007; 21(7): 756 - 769. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Friedman, T. Borlawsky, L. Shagina, H. R. Xing, and Y. A. Lussier Bio-Ontology and text: bridging the modeling gap Bioinformatics, October 1, 2006; 22(19): 2421 - 2429. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. H. Chadwick, L. M. Pertz, K. W. Broman, M. S. Bartolomei, and H. F. Willard Genetic Control of X Chromosome Inactivation in Mice: Definition of the Xce Candidate Interval Genetics, August 1, 2006; 173(4): 2103 - 2110. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Yahara, K. Sato, and A. Nakano The Arf1p GTPase-activating protein Glo3p executes its regulatory function through a conserved repeat motif at its C-terminus J. Cell Sci., June 15, 2006; 119(12): 2604 - 2612. [Abstract] [Full Text] [PDF] |
||||
![]() |
J.-B. Lee, J.-j. Kim, and J. C. Park Automatic extension of Gene Ontology with flexible identification of candidate terms Bioinformatics, March 15, 2006; 22(6): 665 - 670. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. L. Moran, A. D. Bolton, P. V. Tran, A. Brown, N. D. Dwyer, D. K. Manning, B. C. Bjork, C. Li, K. Montgomery, S. M. Siepka, et al. Utilization of a whole genome SNP panel for efficient genetic mapping in the mouse Genome Res., March 1, 2006; 16(3): 436 - 440. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. P. van Someren, B. L. T. Vaes, W. T. Steegenga, A. M. Sijbers, K. J. Dechering, and M. J. T. Reinders Least absolute regression network analysis of the murine osteoblast differentiation network Bioinformatics, February 15, 2006; 22(4): 477 - 484. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Basu, E. Bremer, C. Zhou, and D. F. Bogenhagen MiGenes: a searchable interspecies database of mitochondrial proteins curated using gene ontology annotation Bioinformatics, February 15, 2006; 22(4): 485 - 492. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. G. Puente, D. J. Borris, J.-F. Carriere, J. F. Kelly, and L. A. Megeney Identification of Candidate Regulators of Embryonic Stem Cell Differentiation by Comparative Phosphoprotein Affinity Profiling Mol. Cell. Proteomics, January 1, 2006; 5(1): 57 - 67. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. K. Kummerfeld and S. A. Teichmann DBD: a transcription factor prediction database Nucleic Acids Res., January 1, 2006; 34(suppl_1): D74 - D81. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. L. Wheeler, T. Barrett, D. A. Benson, S. H. Bryant, K. Canese, V. Chetvernin, D. M. Church, M. DiCuccio, R. Edgar, S. Federhen, et al. Database resources of the National Center for Biotechnology Information Nucleic Acids Res., January 1, 2006; 34(suppl_1): D173 - D180. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. L. Fink, R. N. Aturaliya, M. J. Davis, F. Zhang, K. Hanson, M. S. Teasdale, C. Kai, J. Kawai, P. Carninci, Y. Hayashizaki, et al. LOCATE: a mouse protein subcellular localization database Nucleic Acids Res., January 1, 2006; 34(suppl_1): D213 - D217. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. A. van Driel, K. Cuelenaere, P. P. C. W. Kemmeren, J. A. M. Leunissen, H. G. Brunner, and G. Vriend GeneSeeker: extraction and integration of human disease-related information from web-based genetic databases Nucleic Acids Res., July 1, 2005; 33(suppl_2): W758 - W761. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. J. Garlow, E. Boone, W. Li, M. J. Owens, and C. B. Nemeroff Genetic Analysis of the Hypothalamic Corticotropin-Releasing Factor System Endocrinology, May 1, 2005; 146(5): 2362 - 2368. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Wolstencroft, R. McEntire, R. Stevens, L. Tabernero, and A. Brass Constructing ontology-driven protein family databases Bioinformatics, April 15, 2005; 21(8): 1685 - 1692. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. L. Douglas, G. N. Bowman, H. A. Baghdoyan, and R. Lydic C57BL/6J and B6.V-LEPOB mice differ in the cholinergic modulation of sleep and breathing J Appl Physiol, March 1, 2005; 98(3): 918 - 929. [Abstract] [Full Text] [PDF] |
||||
![]() |
S.-W. Huang, R. Friedman, N. Yu, A. Yu, and W.-H. Li How Strong Is the Mutagenicity of Recombination in Mammals? Mol. Biol. Evol., March 1, 2005; 22(3): 426 - 431. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Kahraman, A. Avramov, L. G. Nashev, D. Popov, R. Ternes, H.-D. Pohlenz, and B. Weiss PhenomicDB: a multi-species genotype/phenotype database for comparative phenomics Bioinformatics, February 1, 2005; 21(3): 418 - 420. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Chen, H. Liu, and C. Friedman Gene name ambiguity of eukaryotic nomenclatures Bioinformatics, January 15, 2005; 21(2): 248 - 256. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Giudicelli, D. Chaume, and M.-P. Lefranc IMGT/GENE-DB: a comprehensive database for human and mouse immunoglobulin and T cell receptor genes Nucleic Acids Res., January 1, 2005; 33(suppl_1): D256 - D261. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. L. Ashurst, C.-K. Chen, J. G. R. Gilbert, K. Jekosch, S. Keenan, P. Meidl, S. M. Searle, J. Stalker, R. Storey, S. Trevanion, et al. The Vertebrate Genome Annotation (Vega) database Nucleic Acids Res., January 1, 2005; 33(suppl_1): D459 - D465. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. T. Eppig, C. J. Bult, J. A. Kadin, J. E. Richardson, J. A. Blake, and the Mouse Genome Database Group The Mouse Genome Database (MGD): from genes to mice--a community resource for mouse biology Nucleic Acids Res., January 1, 2005; 33(suppl_1): D471 - D475. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Deruytter, O. Boulard, and H.-J. Garchon Mapping Non-Class II H2-Linked Loci for Type 1 Diabetes in Nonobese Diabetic Mice Diabetes, December 1, 2004; 53(12): 3323 - 3327. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Funkat, C. M. Massa, V. Jovanovska, J. Proietto, and S. Andrikopoulos Metabolic Adaptations of Three Inbred Strains of Mice (C57BL/6, DBA/2, and 129T2) in Response to a High-Fat Diet J. Nutr., December 1, 2004; 134(12): 3264 - 3269. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. De Felice and R. Di Lauro Thyroid Development and Its Disorders: Genetics and Molecular Mechanisms Endocr. Rev., October 1, 2004; 25(5): 722 - 746. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. J. Goren, R. N. Kulkarni, and C. R. Kahn Glucose Homeostasis and Tissue Transcript Content of Insulin Signaling Intermediates in Four Inbred Strains of Mice: C57BL/6, C57BLKS/6, DBA/2, and 129X1 Endocrinology, July 1, 2004; 145(7): 3307 - 3323. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. S. Lein, X. Zhao, and F. H. Gage Defining a Molecular Atlas of the Hippocampus Using DNA Microarrays and High-Throughput In Situ Hybridization J. Neurosci., April 14, 2004; 24(15): 3879 - 3889. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. N. Twigger, J. Nie, V. Ruotti, J. Yu, D. Chen, D. Li, J. Mathis, V. Narayanasamy, G. R. Gopinath, D. Pasko, et al. Integrative Genomics: In Silico Coupling of Rat Physiology and Complex Traits With Mouse and Human Data Genome Res., April 1, 2004; 14(4): 651 - 660. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. M. Wain, M. J. Lush, F. Ducluzeau, V. K. Khodiyar, and S. Povey Genew: the Human Gene Nomenclature Database, 2004 updates Nucleic Acids Res., January 1, 2004; 32(90001): D255 - 257. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. Bodenreider The Unified Medical Language System (UMLS): integrating biomedical terminology Nucleic Acids Res., January 1, 2004; 32(90001): D267 - 270. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. J. Bult, J. A. Blake, J. E. Richardson, J. A. Kadin, J. T. Eppig, and the Mouse Genome Database Group The Mouse Genome Database (MGD): integrating biology with the genome Nucleic Acids Res., January 1, 2004; 32(90001): D476 - 481. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Divina and J. Forejt The Mouse SAGE Site: database of public mouse SAGE libraries Nucleic Acids Res., January 1, 2004; 32(90001): D482 - 483. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. N. Schofield, J. B. L. Bard, C. Booth, J. Boniver, V. Covelli, P. Delvenne, M. Ellender, W. Engstrom, W. Goessner, M. Gruenberger, et al. Pathbase: a database of mutant mouse pathology Nucleic Acids Res., January 1, 2004; 32(90001): D512 - 515. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Akagi, T. Suzuki, R. M. Stephens, N. A. Jenkins, and N. G. Copeland RTCGD: retroviral tagged cancer gene database Nucleic Acids Res., January 1, 2004; 32(90001): D523 - 527. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Visel, C. Thaller, and G. Eichele GenePaint.org: an atlas of gene expression patterns in the mouse embryo Nucleic Acids Res., January 1, 2004; 32(90001): D552 - 556. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. P. Hill, D. A. Begley, J. H. Finger, T. F. Hayamizu, I. J. McCright, C. M. Smith, J. S. Beal, L. E. Corbani, J. A. Blake, J. T. Eppig, et al. The mouse Gene Expression Database (GXD): updates and enhancements Nucleic Acids Res., January 1, 2004; 32(90001): D568 - 571. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
















