Published online 7 October 2005
Article |
The Subsystems Approach to Genome Annotation and its Use in the Project to Annotate 1000 Genomes
1Fellowship for Interpretation of Genomes 15W155 81st Street, Burr Ridge, IL 60527, USA 2Mathematics and Computer Science Division, Argonne National Laboratory Argonne, IL 60439, USA 3Center for Biotechnology, Institute for Genome Research, Bielefeld University 33594 Bielefeld, Germany, USA 4International NRW Graduate School in Bioinformatics & Genome Research, Institute for Genome Research, Bielefeld University 33594 Bielefeld, Germany, USA 5Emerson Hall, University of Florida PO Box 14425, Gainesville, FL 32604, USA 6Institute for Information Transmission Problems, Russian Academy of Sciences Moscow, Russia 7Center for Microbial Sciences, San Diego State University San Diego, CA 92813, USA 8The Burnham Institute San Diego CA 92037, USA 9Department of Microbiology, University of Illinois at Urbana-Champaign Urbana, IL 61801 10Computer Science Dept, Middle Tennessee State University Murfreesboro, TN 37132, USA 11Danish Genome Institute Gustav Wieds vej 10 C, DK-8000 Aarhus C, Denmark 12Computation Institute, University of Chicago Chicago, IL 60637, USA 13Departments of Microbiology and Cell Science, University of Florida Gainesville, FL 32611, USA 14Department of Horticultural Science, University of Florida Gainesville, FL 32611, USA 15Department of Chemistry, Portland State University Portland, OR 97207, USA 16Department of Chemistry and Chemical Biology, Cornell University Ithaca, NY14853, USA 17University of California San Diego, CA 92093, USA 18Cleveland BioLabs, Inc. Cleveland, OH 44106, USA
*To whom correspondence should be addressed. Tel: +1 630 325 4178; Fax: +1 630 325 4179; Email: Veronika{at}theFIG.info
Received June 9, 2005. Revised September 8, 2005. Accepted September 8, 2005.
The release of the 1000th complete microbial genome will occur in the next two to three years. In anticipation of this milestone, the Fellowship for Interpretation of Genomes (FIG) launched the Project to Annotate 1000 Genomes. The project is built around the principle that the key to improved accuracy in high-throughput annotation technology is to have experts annotate single subsystems over the complete collection of genomes, rather than having an annotation expert attempt to annotate all of the genes in a single genome. Using the subsystems approach, all of the genes implementing the subsystem are analyzed by an expert in that subsystem. An annotation environment was created where populated subsystems are curated and projected to new genomes. A portable notion of a populated subsystem was defined, and tools developed for exchanging and curating these objects. Tools were also developed to resolve conflicts between populated subsystems. The SEED is the first annotation environment that supports this model of annotation. Here, we describe the subsystem approach, and offer the first release of our growing library of populated subsystems. The initial release of data includes 180 177 distinct proteins with 2133 distinct functional roles. This data comes from 173 subsystems and 383 different organisms.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
C. Paige, S. D. Reid, P. C. Hanna, and A. Claiborne The Type III Pantothenate Kinase Encoded by coaX Is Essential for Growth of Bacillus anthracis J. Bacteriol., September 15, 2008; 190(18): 6271 - 6275. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Dalevi, N. N. Ivanova, K. Mavromatis, S. D. Hooper, E. Szeto, P. Hugenholtz, N. C. Kyrpides, and V. M. Markowitz Annotation of metagenome short reads using proxygenes Bioinformatics, August 15, 2008; 24(16): i7 - i13. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Martin, N. N. Diaz, J. Ontrup, and T. W. Nattkemper Hyperbolic SOM-based clustering of DNA fragment features for taxonomic visualization and classification Bioinformatics, July 15, 2008; 24(14): 1568 - 1574. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. M. Adin, K. L. Visick, and E. V. Stabb Identification of a Cellobiose Utilization Gene Cluster with Cryptic {beta}-Galactosidase Activity in Vibrio fischeri Appl. Envir. Microbiol., July 1, 2008; 74(13): 4059 - 4069. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. E. Martinez-Guerrero, R. Ciria, C. Abreu-Goodger, G. Moreno-Hagelsieb, and E. Merino GeConT 2: gene context analysis for orthologous proteins, conserved domains and metabolic pathways Nucleic Acids Res., July 1, 2008; 36(suppl_2): W176 - W180. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. O. Hudson, C. Gilvarg, and T. Leustek Biochemical and Phylogenetic Characterization of a Novel Diaminopimelate Biosynthesis Pathway in Prokaryotes Identifies a Diverged Form of LL-Diaminopimelate Aminotransferase J. Bacteriol., May 1, 2008; 190(9): 3256 - 3263. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. DellaPenna and R. L. Last Genome-Enabled Approaches Shed New Light on Plant Metabolism Science, April 25, 2008; 320(5875): 479 - 481. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. A. Frank, C. I. Reich, S. Sharma, J. S. Weisbaum, B. A. Wilson, and G. J. Olsen Critical Evaluation of Two Primers Commonly Used for Amplification of Bacterial 16S rRNA Genes Appl. Envir. Microbiol., April 15, 2008; 74(8): 2461 - 2470. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. A. Rodionov, X. Li, I. A. Rodionova, C. Yang, L. Sorci, E. Dervyn, D. Martynowski, H. Zhang, M. S. Gelfand, and A. L. Osterman Transcriptional regulation of NAD metabolism in bacteria: genomic reconstruction of NiaR (YrxA) regulon Nucleic Acids Res., April 1, 2008; 36(6): 2032 - 2046. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. A. Rodionov, J. De Ingeniis, C. Mancini, F. Cimadamore, H. Zhang, A. L. Osterman, and N. Raffaelli Transcriptional regulation of NAD metabolism in bacteria: NrtR family of Nudix-related regulators Nucleic Acids Res., April 1, 2008; 36(6): 2047 - 2059. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Naponelli, A. Noiriel, M. J. Ziemak, S. M. Beverley, L.-F. Lye, A. M. Plume, J. R. Botella, K. Loizeau, S. Ravanel, F. Rebeille, et al. Phylogenomic and Functional Analysis of Pterin-4a-Carbinolamine Dehydratase Family (COG2154) Proteins in Plants and Microorganisms Plant Physiology, April 1, 2008; 146(4): 1515 - 1527. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. A. Bonner, T. Disz, K. Hwang, J. Song, V. Vonstein, R. Overbeek, and R. A. Jensen Cohesion Group Approach for Evolutionary Analysis of TyrA, a Protein Family with Wide-Ranging Substrate Specificities Microbiol. Mol. Biol. Rev., March 1, 2008; 72(1): 13 - 53. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Yang, D. A. Rodionov, I. A. Rodionova, X. Li, and A. L. Osterman Glycerate 2-Kinase of Thermotoga maritima and Genomic Reconstruction of Related Metabolic Pathways J. Bacteriol., March 1, 2008; 190(5): 1773 - 1782. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Severi, A. Muller, J. R. Potts, A. Leech, D. Williamson, K. S. Wilson, and G. H. Thomas Sialic Acid Mutarotation Is Catalyzed by the Escherichia coli {beta}-Propeller Protein YjhT J. Biol. Chem., February 22, 2008; 283(8): 4841 - 4849. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Gupta, S. Tanner, N. Jaitly, J. N. Adkins, M. Lipton, R. Edwards, M. Romine, A. Osterman, V. Bafna, R. D. Smith, et al. Whole proteome analysis of post-translational modifications: Applications of mass-spectrometry for proteogenomic annotation Genome Res., September 1, 2007; 17(9): 1362 - 1377. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Lin, L. C. Johnson, H. Weissbach, N. Brot, M. O. Lively, and W. T. Lowther Free methionine-(R)-sulfoxide reductase from Escherichia coli reveals a new GAF domain function PNAS, June 5, 2007; 104(23): 9597 - 9602. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Sulakhe, M. D'Souza, M. Syed, A. Rodriguez, Y. Zhang, E. M. Glass, M. F. Romine, and N. Maltsev GNARE--a grid-based server for the analysis of user submitted genomes Nucleic Acids Res., May 25, 2007; (2007) gkm366v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Brune, N. Jochmann, K. Brinkrolf, A. T. Huser, R. Gerstmeir, B. J. Eikmanns, J. Kalinowski, A. Puhler, and A. Tauch The IclR-Type Transcriptional Repressor LtbR Regulates the Expression of Leucine and Tryptophan Biosynthesis Genes in the Amino Acid Producer Corynebacterium glutamicum J. Bacteriol., April 1, 2007; 189(7): 2720 - 2733. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. A. Rodionov, O. V. Kurnasov, B. Stec, Y. Wang, M. F. Roberts, and A. L. Osterman Genomic identification and in vitro reconstitution of a complete biosynthetic pathway for the osmolyte di-myo-inositol-phosphate PNAS, March 13, 2007; 104(11): 4279 - 4284. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Hebbeln, D. A. Rodionov, A. Alfandega, and T. Eitinger Biotin uptake in prokaryotes by solute transporters with an optional ATP-binding cassette-containing module PNAS, February 20, 2007; 104(8): 2909 - 2914. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Krause, A. C. McHardy, T. W. Nattkemper, A. Puhler, J. Stoye, and F. Meyer GISMO--gene identification using a support vector machine for ORF classification Nucleic Acids Res., January 28, 2007; 35(2): 540 - 549. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Igarashi, A. Eroshkin, S. Gramatikova, K. Gramatikoff, Y. Zhang, J. W. Smith, A. L. Osterman, and A. Godzik CutDB: a proteolytic event database Nucleic Acids Res., January 12, 2007; 35(suppl_1): D546 - D549. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. K. McNeil, C. Reich, R. K. Aziz, D. Bartels, M. Cohoon, T. Disz, R. A. Edwards, S. Gerdes, K. Hwang, M. Kubal, et al. The National Microbial Pathogen Database Resource (NMPDR): a genomics platform based on subsystem annotation Nucleic Acids Res., January 12, 2007; 35(suppl_1): D347 - D353. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. El Yacoubi, S. Bonnett, J. N. Anderson, M. A. Swairjo, D. Iwata-Reuyl, and V. de Crecy-Lagard Discovery of a New Prokaryotic Type I GTP Cyclohydrolase Family J. Biol. Chem., December 8, 2006; 281(49): 37586 - 37593. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. R. Joyce, J. L. Reed, A. White, R. Edwards, A. Osterman, T. Baba, H. Mori, S. A. Lesely, B. O. Palsson, and S. Agarwalla Experimental and Computational Assessment of Conditionally Essential Genes in Escherichia coli J. Bacteriol., December 1, 2006; 188(23): 8259 - 8271. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Brot, J.-F. Collet, L. C. Johnson, T. J. Jonsson, H. Weissbach, and W. T. Lowther The Thioredoxin Domain of Neisseria gonorrhoeae PilB Can Use Electrons from DsbD to Reduce Downstream Methionine Sulfoxide Reductases J. Biol. Chem., October 27, 2006; 281(43): 32668 - 32675. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Yang, D. A. Rodionov, X. Li, O. N. Laikova, M. S. Gelfand, O. P. Zagnitko, M. F. Romine, A. Y. Obraztsova, K. H. Nealson, and A. L. Osterman Comparative Genomics and Experimental Characterization of N-Acetylglucosamine Utilization Pathway of Shewanella oneidensis J. Biol. Chem., October 6, 2006; 281(40): 29872 - 29885. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Yang, Y. Eyobo, L. A. Brand, D. Martynowski, D. Tomchick, E. Strauss, and H. Zhang Crystal Structure of a Type III Pantothenate Kinase: Insight into the Mechanism of an Essential Coenzyme A Biosynthetic Enzyme Universally Distributed in Bacteria. J. Bacteriol., August 1, 2006; 188(15): 5532 - 5540. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Awai, C. Xu, B. Tamot, and C. Benning From the Cover: A phosphatidic acid-binding protein of the chloroplast inner envelope membrane involved in lipid trafficking PNAS, July 11, 2006; 103(28): 10817 - 10822. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Osterman A hidden metabolic pathway exposed PNAS, April 11, 2006; 103(15): 5637 - 5638. [Full Text] [PDF] |
||||
![]() |
S. Y. Gerdes, O. V. Kurnasov, K. Shatalin, B. Polanuyer, R. Sloutsky, V. Vonstein, R. Overbeek, and A. L. Osterman Comparative Genomics of NAD Biosynthesis in Cyanobacteria. J. Bacteriol., April 1, 2006; 188(8): 3012 - 3023. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. F. DeLong, C. M. Preston, T. Mincer, V. Rich, S. J. Hallam, N.-U. Frigaard, A. Martinez, M. B. Sullivan, R. Edwards, B. R. Brito, et al. Community Genomics Among Stratified Microbial Assemblages in the Ocean's Interior Science, January 27, 2006; 311(5760): 496 - 503. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Vallenet, L. Labarre, Z. Rouy, V. Barbe, S. Bocs, S. Cruveiller, A. Lajus, G. Pascal, C. Scarpelli, and C. Medigue MaGe: a microbial genome annotation system supported by synteny results Nucleic Acids Res., January 10, 2006; 34(1): 53 - 65. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. A. Rodionov, P. Hebbeln, M. S. Gelfand, and T. Eitinger Comparative and Functional Genomic Analysis of Prokaryotic Nickel and Cobalt Uptake Transporters: Evidence for a Novel Group of ATP-Binding Cassette Transporters J. Bacteriol., January 1, 2006; 188(1): 317 - 327. [Abstract] [Full Text] [PDF] |
||||









