Nucleic Acids Research, 2000, Vol. 28, No. 1 33-36
© 2000 Oxford University Press
The COG database: a tool for genome-scale analysis of protein functions and evolution
National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
Rational classification of proteins encoded in sequenced genomes is critical for making the genome sequences maximally useful for functional and evolutionary studies. The database of Clusters of Orthologous Groups of proteins (COGs) is an attempt at a phylogenetic classification of the proteins encoded in 21 complete genomes of bacteria, archaea and eukaryotes (http://www.ncbi.nlm.nih.gov/COG). The COGs were constructed by applying the criterion of consistency of genome-specific best hits to the results of an exhaustive comparison of all protein sequences from these genomes. The database comprises 2091 COGs that include 56–83% of the gene products from each of the complete bacterial and archaeal genomes and ~35% of those from the yeast Saccharomyces cerevisiae genome. The COG database is accompanied by the COGNITOR program, which is used to fit new proteins into the COGs and can be applied to functional and phylogenetic annotation of newly sequenced genomes.
* To whom correspondence should be addressed. Tel: +1 301 435 5913; Fax: +1 301 480 9241; Email: koonin@ncbi.nlm.nih.gov
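The criterion of consistency of genome-specific best hits mentioned in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it is a simplified toy version that assumes best hits are already computed, requires them to be reciprocal, looks for triangles of proteins from three different genomes, and merges clusters that share at least two proteins. The ID convention `genome|protein`, the function names and the toy data are all hypothetical.

```python
from itertools import combinations

def genome_of(protein):
    # Toy convention (an assumption): IDs look like "genome|protein".
    return protein.split("|")[0]

def find_triangles(best_hit):
    """best_hit maps (protein, target_genome) -> best-scoring protein in that genome."""
    def reciprocal(p, q):
        # p's best hit in q's genome is q, and q's best hit in p's genome is p.
        return (best_hit.get((p, genome_of(q))) == q
                and best_hit.get((q, genome_of(p))) == p)

    proteins = sorted({p for p, _ in best_hit})
    triangles = []
    for a, b, c in combinations(proteins, 3):
        # A triangle must span three distinct genomes.
        if len({genome_of(a), genome_of(b), genome_of(c)}) < 3:
            continue
        if reciprocal(a, b) and reciprocal(b, c) and reciprocal(a, c):
            triangles.append(frozenset((a, b, c)))
    return triangles

def merge_triangles(triangles):
    """Repeatedly merge clusters that have at least two proteins in common."""
    clusters = [set(t) for t in triangles]
    changed = True
    while changed:
        changed = False
        for i, j in combinations(range(len(clusters)), 2):
            if len(clusters[i] & clusters[j]) >= 2:
                clusters[i] |= clusters[j]
                del clusters[j]
                changed = True
                break
    return clusters

# Toy data: four genomes A-D; A|2 hits B|1, but the hit is not reciprocated.
toy_hits = {
    ("A|1", "B"): "B|1", ("A|1", "C"): "C|1", ("A|1", "D"): "D|1",
    ("B|1", "A"): "A|1", ("B|1", "C"): "C|1", ("B|1", "D"): "D|1",
    ("C|1", "A"): "A|1", ("C|1", "B"): "B|1",
    ("D|1", "A"): "A|1", ("D|1", "B"): "B|1",
    ("A|2", "B"): "B|1",
}
toy_triangles = find_triangles(toy_hits)
toy_clusters = merge_triangles(toy_triangles)
print(toy_clusters)
```

In this toy run the two triangles (A|1, B|1, C|1) and (A|1, B|1, D|1) share the side A|1-B|1 and merge into a single four-member cluster, while A|2 is left out because its best hit is not consistent. A real pipeline would also have to handle paralogs, multidomain proteins and manual curation, none of which is modeled here.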