Nucleic Acids Research, 2002, Vol. 30, No. 10 2212-2223
© 2002 Oxford University Press
Connected gene neighborhoods in prokaryotic genomes
National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA and 1Department of Mathematics, University of South Carolina, Columbia, SC 29208, USA
A computational method was developed for delineating connected gene neighborhoods in bacterial and archaeal genomes. These gene neighborhoods are not typically present, in their entirety, in any single genome, but are held together by overlapping, partially conserved gene arrays. The procedure was applied to comparing the orders of orthologous genes, which were extracted from the database of Clusters of Orthologous Groups of proteins (COGs), in 31 prokaryotic genomes and resulted in the identification of 188 clusters of gene arrays, which included 1001 of 2890 COGs. These clusters were projected onto actual genomes to produce extended neighborhoods including additional genes, which are adjacent to the genes from the clusters and are transcribed in the same direction, which resulted in a total of 2387 COGs being included in the neighborhoods. Most of the neighborhoods consist predominantly of genes united by a coherent functional theme, but also include a minority of genes without an obvious functional connection to the main theme. We hypothesize that although some of the latter genes might have unsuspected roles, others are maintained within gene arrays because of the advantage of expression at a level that is typical of the given neighborhood. We designate this phenomenon genomic hitchhiking. The largest neighborhood includes 79 genes (COGs) and consists of overlapping, rearranged ribosomal protein superoperons; apparent genome hitchhiking is particularly typical of this neighborhood and other neighborhoods that consist of genes coding for translation machinery components. Several neighborhoods involve previously undetected connections between genes, allowing new functional predictions. Gene neighborhoods appear to evolve via complex rearrangement, with different combinations of genes from a neighborhood fixed in different lineages.
* To whom correspondence should be addressed. Tel: +1 301 435 5913; Fax: +1 301 435 7794; Email: koonin{at}ncbi.nlm.nih.gov
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
C. E. Martinez-Guerrero, R. Ciria, C. Abreu-Goodger, G. Moreno-Hagelsieb, and E. Merino GeConT 2: gene context analysis for orthologous proteins, conserved domains and metabolic pathways Nucleic Acids Res., July 1, 2008; 36(suppl_2): W176 - W180. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Barkan, L. Klipcan, O. Ostersetzer, T. Kawamura, Y. Asakura, and K. P. Watkins The CRM domain: An RNA binding module derived from an ancient ribosome-associated protein RNA, January 1, 2007; 13(1): 55 - 64. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Mao, Z. Su, V. Olman, P. Dam, Z. Liu, and Y. Xu Mapping of orthologous genes in the context of biological pathways: An application of integer programming PNAS, January 3, 2006; 103(1): 129 - 134. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Rattei, R. Arnold, P. Tischler, D. Lindner, V. Stumpflen, and H. W. Mewes SIMAP: the similarity matrix of proteins Nucleic Acids Res., January 1, 2006; 34(suppl_1): D252 - D256. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. T. Edwards, S. C. G. Rison, N. G. Stoker, and L. Wernisch A universally applicable method of operon map prediction on minimally annotated genomes using conserved genomic context Nucleic Acids Res., June 7, 2005; 33(10): 3253 - 3262. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. N. Price, K. H. Huang, A. P. Arkin, and E. J. Alm Operon formation is driven by co-regulation and not by horizontal gene transfer Genome Res., June 1, 2005; 15(6): 809 - 819. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Choi, Y. Ma, J.-H. Choi, and S. Kim PLATCOM: a Platform for Computational Comparative Genomics Bioinformatics, May 15, 2005; 21(10): 2514 - 2516. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. C. Janga, J. Collado-Vides, and G. Moreno-Hagelsieb Nebulon: a system for the inference of functional relationships of gene products from the rearrangement of predicted operons Nucleic Acids Res., May 2, 2005; 33(8): 2521 - 2530. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Pevzner and G. Tesler Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes Genome Res., January 1, 2003; 13(1): 37 - 45. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. B. Rogozin, K. S. Makarova, D. A. Natale, A. N. Spiridonov, R. L. Tatusov, Y. I. Wolf, J. Yin, and E. V. Koonin Congruent evolution of different classes of non-coding DNA in prokaryotic genomes Nucleic Acids Res., October 1, 2002; 30(19): 4264 - 4271. [Abstract] [Full Text] [PDF] |
||||




