Nucleic Acids Research, 2001, Vol. 29, No. 3 774-782
© 2001 Oxford University Press
Phylogenetic footprinting of transcription factor binding sites in proteobacterial genomes
1The Wadsworth Center for Laboratories and Research, New York State Department of Health, Albany, NY 12201, USA, 2The Department of Statistics, Harvard University, Cambridge, MA 02138, USA and 3Computer Science Department, Rensselaer Polytechnic Institute, Troy, NY 12180, USA
Toward the goal of identifying complete sets of transcription factor (TF)-binding sites in the genomes of several gamma proteobacteria, and hence describing their transcription regulatory networks, we present a phylogenetic footprinting method for identifying these sites. Probable transcription regulatory sites upstream of Escherichia coli genes were identified by cross-species comparison using an extended Gibbs sampling algorithm. Close examination of a study set of 184 genes with documented transcription regulatory sites revealed that when orthologous data were available from at least two other gamma proteobacterial species, 81% of our predictions corresponded with the documented sites, and 67% corresponded when data from only one other species were available. That the remaining predictions included bona fide TF-binding sites was proven by affinity purification of a putative transcription factor (YijC) bound to such a site upstream of the fabA gene. Predicted regulatory sites for 2097 E.coli genes are available at http://www.wadsworth.org/resnres/bioinfo/.
* To whom correspondence should be addressed at: The Wadsworth Center for Laboratories and Research, New York State Department of Health, Albany, NY 12201, USA. Tel: +1 518 402 5034; Fax: +1 518 473 2900; Email: lawrence{at}wadsworth.org
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
Y. Feng and J. E. Cronan Escherichia coli Unsaturated Fatty Acid Synthesis: COMPLEX TRANSCRIPTION OF THE fabA GENE AND IN VIVO IDENTIFICATION OF THE ESSENTIAL REACTION CATALYZED BY FabB J. Biol. Chem., October 23, 2009; 284(43): 29526 - 29535. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. N. Miller, L. R. Jarboe, L. P. Yomano, S. W. York, K. T. Shanmugam, and L. O. Ingram Silencing of NADPH-Dependent Oxidoreductase Genes (yqhD and dkgA) in Furfural-Resistant Ethanologenic Escherichia coli Appl. Envir. Microbiol., July 1, 2009; 75(13): 4315 - 4323. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Jerga and C. O. Rock Acyl-Acyl Carrier Protein Regulates Transcription of Fatty Acid Biosynthetic Genes via the FabT Repressor in Streptococcus pneumoniae J. Biol. Chem., June 5, 2009; 284(23): 15364 - 15368. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Zhang, M. Xu, S. Li, and Z. Su Genome-wide de novo prediction of cis-regulatory binding sites in prokaryotes Nucleic Acids Res., June 1, 2009; 37(10): e72 - e72. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Liu, X. Xu, and G. D. Stormo The cis-regulatory map of Shewanella genomes Nucleic Acids Res., September 1, 2008; 36(16): 5376 - 5390. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Mustonen, J. Kinney, C. G. Callan Jr, and M. Lassig Energy-dependent fitness: A quantitative model for the evolution of yeast transcription factor binding sites PNAS, August 26, 2008; 105(34): 12376 - 12381. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Romero-Zaliz, C. del Val, J. P. Cobb, and I. Zwir Onto-CC: a web server for identifying Gene Ontology conceptual clusters Nucleic Acids Res., July 1, 2008; 36(suppl_2): W352 - W357. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. A. Newberg, W. A. Thompson, S. Conlan, T. M. Smith, L. A. McCue, and C. E. Lawrence A phylogenetic Gibbs sampler that yields centroid solutions for cis-regulatory site prediction Bioinformatics, July 15, 2007; 23(14): 1718 - 1727. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. A. Thompson, L. A. Newberg, S. Conlan, L. A. McCue, and C. E. Lawrence The Gibbs Centroid Sampler Nucleic Acids Res., July 13, 2007; 35(suppl_2): W232 - W237. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Sosinsky, B. Honig, R. S. Mann, and A. Califano Discovering transcriptional regulatory regions in Drosophila by a nonalignment method for phylogenetic footprinting PNAS, April 10, 2007; 104(15): 6305 - 6310. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. G. Perez, V. E. Angarica, A. T. R. Vasconcelos, and J. Collado-Vides Tractor_DB (version 2.0): a database of regulatory interactions in gamma-proteobacterial genomes Nucleic Acids Res., January 12, 2007; 35(suppl_1): D132 - D136. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Pachkov, I. Erb, N. Molina, and E. van Nimwegen SwissRegulon: a database of genome-wide annotations of regulatory sites Nucleic Acids Res., January 12, 2007; 35(suppl_1): D127 - D131. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Haberer, M. T. Mader, P. Kosarev, M. Spannagl, L. Yang, and K. F.X. Mayer Large-Scale cis-Element Detection by Analysis of Correlated Expression and Sequence Conservation between Arabidopsis and Brassica oleracea Plant Physiology, December 1, 2006; 142(4): 1589 - 1602. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. GuhaThakurta Computational identification of transcriptional regulatory elements in DNA sequence Nucleic Acids Res., July 19, 2006; 34(12): 3585 - 3598. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Wei and S. T. Jensen GAME: detecting cis-regulatory elements using a genetic algorithm Bioinformatics, July 1, 2006; 22(13): 1577 - 1584. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. S Hon and A. N Jain A deterministic motif finding algorithm with application to the human genome Bioinformatics, May 1, 2006; 22(9): 1047 - 1054. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Bai, L. A. McCue, and K. A. McDonough Characterization of Mycobacterium tuberculosis Rv3676 (CRPMt), a Cyclic AMP Receptor Protein-Like DNA Binding Protein J. Bacteriol., November 15, 2005; 187(22): 7795 - 7804. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Zwir, H. Huang, and E. A. Groisman Analysis of differentially-regulated genes within a regulatory network by GPS genome navigation Bioinformatics, November 15, 2005; 21(22): 4073 - 4083. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Conlan, C. Lawrence, and L. A. McCue Rhodopseudomonas palustris Regulons Detected by Cross-Species Analysis of Alphaproteobacterial Genomes Appl. Envir. Microbiol., November 1, 2005; 71(11): 7442 - 7452. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. T. Jensen, L. Shen, and J. S. Liu Combining phylogenetic motif discovery and motif clustering to predict co-regulated genes Bioinformatics, October 15, 2005; 21(20): 3832 - 3839. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Gertz, L. Riles, P. Turnbaugh, S.-W. Ho, and B. A. Cohen Discovery, validation, and genetic dissection of transcription factor binding sites by comparative and functional genomics Genome Res., August 1, 2005; 15(8): 1145 - 1152. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Qian, N. Esumi, Y. Chen, Q. Wang, I. Chowers, and D. J. Zack Identification of regulatory targets of tissue-specific transcription factors: application to retina-specific gene regulation Nucleic Acids Res., June 20, 2005; 33(11): 3479 - 3491. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Gupta and J. S. Liu De novo cis-regulatory module elicitation for eukaryotic genomes PNAS, May 17, 2005; 102(20): 7079 - 7084. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Sabatti, L. Rohlin, K. Lange, and J. C. Liao Vocabulon: a dictionary model approach for reconstruction and localization of transcription factor binding sites Bioinformatics, April 1, 2005; 21(7): 922 - 931. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Zwir, D. Shin, A. Kato, K. Nishino, T. Latifi, F. Solomon, J. M. Hare, H. Huang, and E. A. Groisman Dissecting the PhoP regulatory network of Escherichia coli and Salmonella enterica PNAS, February 22, 2005; 102(8): 2862 - 2867. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. N. Price, K. H. Huang, E. J. Alm, and A. P. Arkin A novel method for accurate operon predictions in all sequenced prokaryotes Nucleic Acids Res., February 8, 2005; 33(3): 880 - 892. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Tan, L. A. McCue, and G. D. Stormo Making connections between novel transcription factors and their DNA motifs Genome Res., February 1, 2005; 15(2): 312 - 320. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. D. Gonzalez, V. Espinosa, A. T. Vasconcelos, E. Perez-Rueda, and J. Collado-Vides TRACTOR_DB: a database of regulatory networks in gamma-proteobacterial genomes Nucleic Acids Res., January 1, 2005; 33(suppl_1): D98 - D102. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. GuhaThakurta, L. A. Schriefer, R. H. Waterston, and G. D. Stormo Novel transcription regulatory elements in Caenorhabditis elegans muscle genes Genome Res., December 1, 2004; 14(12): 2457 - 2468. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. B.L. Alkema, B. Lenhard, and W. W. Wasserman Regulog Analysis: Detection of Conserved Regulatory Networks Across Bacteria: Application to Staphylococcus aureus Genome Res., July 1, 2004; 14(7): 1362 - 1373. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Rigali, M. Schlicht, P. Hoskisson, H. Nothaft, M. Merzbacher, B. Joris, and F. Titgemeyer Extending the classification of bacterial transcription factors beyond the helix-turn-helix motif as an alternative approach to discover new cis/trans relationships Nucleic Acids Res., June 24, 2004; 32(11): 3418 - 3426. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Liu, X. S. Liu, L. Wei, R. B. Altman, and S. Batzoglou Eukaryotic Regulatory Element Conservation Analysis and Identification Using Comparative Genomics Genome Res., March 1, 2004; 14(3): 451 - 458. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. C. Frith, U. Hansen, J. L. Spouge, and Z. Weng Finding functional sequence elements by multiple local alignment Nucleic Acids Res., January 2, 2004; 32(1): 189 - 200. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. P. Fischer, N. A. Brunner, B. Wieland, J. Paquette, L. Macko, K. Ziegelbauer, and C. Freiberg Identification of Antibiotic Stress-Inducible Promoters: A Systematic Approach to Novel Pathway-Specific Reporter Assays for Antibacterial Drug Discovery Genome Res., January 1, 2004; 14(1): 90 - 98. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Pritsker, Y.-C. Liu, M. A. Beer, and S. Tavazoie Whole-Genome Discovery of Transcription Factor Binding Sites by Network-Level Conservation Genome Res., January 1, 2004; 14(1): 99 - 108. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. H. Margulies, M. Blanchette, NISC Comparative Sequencing Program, D. Haussler, and E. D. Green Identification and Characterization of Multi-Species Conserved Sequences Genome Res., December 1, 2003; 13(12): 2507 - 2518. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y.-M. Zhang, H. Marrakchi, S. W. White, and C. O. Rock The application of computational methods to explore the diversity and structure of bacterial fatty acid synthase J. Lipid Res., January 1, 2003; 44(1): 1 - 10. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. V. Benos, M. L. Bulyk, and G. D. Stormo Additivity in protein-DNA interactions: how good an approximation is it? Nucleic Acids Res., October 15, 2002; 30(20): 4442 - 4451. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. A. McCue, W. Thompson, C. S. Carmack, and C. E. Lawrence Factors Influencing the Identification of Transcription Factor Binding Sites by Cross-Species Comparison Genome Res., October 1, 2002; 12(10): 1523 - 1532. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Li, V. Rhodius, C. Gross, and E. D. Siggia Identification of the binding sites of regulatory proteins in bacterial genomes PNAS, September 3, 2002; 99(18): 11772 - 11777. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. van Nimwegen, M. Zavolan, N. Rajewsky, and E. D. Siggia Probabilistic clustering of sequences: Inferring new bacterial regulons by comparative genomics PNAS, May 28, 2002; 99(11): 7323 - 7328. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y.-M. Zhang, H. Marrakchi, and C. O. Rock The FabR (YijC) Transcription Factor Regulates Unsaturated Fatty Acid Biosynthesis in Escherichia coli J. Biol. Chem., May 3, 2002; 277(18): 15558 - 15565. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. GuhaThakurta, L. Palomar, G. D. Stormo, P. Tedesco, T. E. Johnson, D. W. Walker, G. Lithgow, S. Kim, and C. D. Link Identification of a Novel cis-Regulatory Element Involved in the Heat Shock Response in Caenorhabditis elegans Using Microarray Gene Expression and Computational Methods Genome Res., May 1, 2002; 12(5): 701 - 712. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Blanchette and M. Tompa Discovery of Regulatory Elements by a Computational Method for Phylogenetic Footprinting Genome Res., May 1, 2002; 12(5): 739 - 748. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. H. Graber, G. D. McAllister, and T. F. Smith Probabilistic prediction of Saccharomyces cerevisiae mRNA 3'-processing sites Nucleic Acids Res., April 15, 2002; 30(8): 1851 - 1858. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. A. Mirny and M. S. Gelfand Structural analysis of conserved base pairs in protein-DNA complexes Nucleic Acids Res., April 1, 2002; 30(7): 1704 - 1711. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. M. Sengupta, M. Djordjevic, and B. I. Shraiman Specificity and robustness in transcription control networks PNAS, February 19, 2002; 99(4): 2072 - 2077. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. M. Panina, A. A. Mironov, and M. S. Gelfand Comparative analysis of FUR regulons in gamma-proteobacteria Nucleic Acids Res., December 15, 2001; 29(24): 5195 - 5206. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Rajewsky, N. D. Socci, M. Zapotocky, and E. D. Siggia The Evolution of DNA Regulatory Regions for Proteo-Gamma Bacteria by Interspecies Comparisons Genome Res., February 1, 2002; 12(2): 298 - 308. [Abstract] [Full Text] [PDF] |
||||








