Published online 8 February 2005
A novel method for accurate operon predictions in all sequenced prokaryotes
1 Lawrence Berkeley National Lab, 1 Cyclotron Road, Mailstop 939R704, Berkeley, CA 94720, USA; 2 Howard Hughes Medical Institute, Berkeley, CA, USA; 3 Department of Bioengineering, University of California, Berkeley, USA
*To whom correspondence should be addressed. Tel: +1 510 843 1794; Fax: +1 510 486 6059; Email: ejalm{at}lbl.gov
Received September 14, 2004. Revised October 25, 2004. Accepted January 20, 2005.
We combine comparative genomic measures and the distance separating adjacent genes to predict operons in 124 completely sequenced prokaryotic genomes. Our method automatically tailors itself to each genome using sequence information alone, and thus can be applied to any prokaryote. For Escherichia coli K12 and Bacillus subtilis, our method is 85 and 83% accurate, respectively, which is similar to the accuracy of methods that use the same features but are trained on experimentally characterized transcripts. In Halobacterium NRC-1 and in Helicobacter pylori, our method correctly infers that genes in operons are separated by shorter distances than they are in E.coli, and its predictions using distance alone are more accurate than distance-only predictions trained on a database of E.coli transcripts. We use microarray data from six phylogenetically diverse prokaryotes to show that combining intergenic distance with comparative genomic measures further improves accuracy and that our method is broadly effective. Finally, we survey operon structure across 124 genomes, and find several surprises: H.pylori has many operons, contrary to previous reports; Bacillus anthracis has an unusual number of pseudogenes within conserved operons; and Synechocystis PCC 6803 has many operons even though it has unusually wide spacings between conserved adjacent genes.
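To make the distance feature concrete, the sketch below computes the gap separating adjacent genes on the same strand and flags closely spaced pairs as candidate operon pairs. It is only an illustration under stated assumptions, not the method described in the abstract: the `Gene` record, the function names, the toy coordinates and the fixed 50 bp cutoff are all hypothetical, whereas the abstract describes a method that tailors itself to each genome and also uses comparative genomic measures.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Gene:
    """Hypothetical gene record; coordinates are 1-based and inclusive."""
    name: str
    start: int
    end: int
    strand: str  # "+" or "-"


def adjacent_same_strand_pairs(genes: List[Gene]) -> List[Tuple[Gene, Gene, int]]:
    """Return (left gene, right gene, intergenic distance) for neighbouring
    genes that lie on the same strand. A negative distance means overlap."""
    ordered = sorted(genes, key=lambda g: g.start)
    pairs = []
    for left, right in zip(ordered, ordered[1:]):
        if left.strand == right.strand:
            distance = right.start - left.end - 1
            pairs.append((left, right, distance))
    return pairs


def predict_same_operon(distance: int, max_distance: int = 50) -> bool:
    """Illustrative rule only: a fixed cutoff (assumed value) stands in for
    the genome-specific distance model that the abstract describes."""
    return distance <= max_distance


# Toy genome with made-up coordinates.
genes = [
    Gene("a", 1, 300, "+"),
    Gene("b", 310, 900, "+"),
    Gene("c", 1200, 1800, "+"),
    Gene("d", 1900, 2500, "-"),
]

for left, right, distance in adjacent_same_strand_pairs(genes):
    print(left.name, right.name, distance, predict_same_operon(distance))
```

A single cutoff like this cannot adapt to genomes whose operon genes are spaced differently from those of E.coli, which is the limitation the abstract says its genome-specific approach addresses.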