Nucleic Acids Research, 2001, Vol. 29, No. 1 72-74
© 2001 Oxford University Press
RegulonDB (version 3.2): transcriptional regulation and operon organization in Escherichia coli K-12
Centro de Investigación sobre Fijación de Nitrógeno. UNAM A.P. 565-A Cuernavaca, Morelos 62100, México
Received October 2, 2000; Revised and Accepted October 11, 2000.
| ABSTRACT |
|---|
|
|
|---|
RegulonDB is a database on mechanisms of transcription regulation and operon organization in Escherichia coli K-12. The current version has considerably increased numbers of regulatory elements such as promoters, binding sites and terminators. The complete repertoire of known and predicted DNA-binding transcriptional regulators can be considered to be included in this version. The database now distinguishes different allosteric conformations of regulatory proteins indicating the one active in binding and regulating the different promoters. A new set of operon predictions has been incorporated. The relational design has been modified accordingly. Furthermore, a major improvement is a graphic display enabling browsing of the database with a Java-based graphic user interface with three zoom-levels connected to properties of each chromosomal element. The purpose of these modifications is to make RegulonDB a useful tool and control set for transcriptome experiments. RegulonDB can be accessed on the web at the URL: http://www.cifn.unam.mx/Computational_Biology/regulondb/
| INTRODUCTION |
|---|
|
|
|---|
RegulonDB is a relational database containing information on mechanisms at the level of transcriptional initiation as well as operon organization with their terminator signals in the Escherichia coli K12 chromosome. The database is updated constantly by searching in original publications, and it is complemented by computational predictions. All this information is mapped in the genome sequence. This accumulated knowledge includes 2996 MEDLINE and PubMed links to the literature, plus 2451 links to GenBank. Computational predictions and their methods have been published previously (1,2). Previous publications in this yearly issue explain the initial relational design and subsequent modifications (35). The relational design keeps a constant evolution given the enriched expansion of the type of information that is added.
This paper describes the major updates and changes during the previous year to the database. There has been an increase in the number of all objects, such as promoters, binding sites and operons. What we consider the complete repertoire of 314 DNA-binding transcriptional regulators, both known and predicted, has been incorporated (6). The set of predicted operons has been updated (7). Furthermore, we have modified the database to incorporate different conformations of regulatory proteins that depend on their allosteric or covalent modifications, and that enable addition of not only effector metabolites involved in the allosteric interactions, but also the changes in the conditions that provoke the regulated changes in the cell. Furthermore, an innovation in this new version is a set of Java-based graphic user interfaces starting from the whole genome, to a region encompassing on average 18 genes, down to selecting individual operons with their transcription units, and their associated regulatory elements. All these graphic descriptions enable navigation to the text describing the properties of the different chromosomal elements. Table 1 lists links to Supplementary Material illustrating several aspects mentioned here.
|
| OVERVIEW OF THE CURRENT DATA |
|---|
|
|
|---|
As in previous versions, the database contains both information gathered from the literature and with the associated MEDLINE, PubMed and/or GenBank link to the original literature. Important additions include a total of 314 transcriptional DNA-binding regulators out of which 165 have experimental evidence and the rest have been predicted based on their helixturnhelix DNA binding motif (6); we have also changed the complete set of predicted operons, to 578 predicted operons, resulting from a well-defined method based on distances in-between genes and their functional classes when available (7). Relevant information about binding sites and promoters published in a specialized Escherichia coli book on regulation (8) were all obtained and included in the database, involving additional searches to complement their relative position or regulated promoters in many cases.
Moreover, we have expanded the design of the database to take into account the different conformations available for regulatory proteins based on their interactions with small metabolites (usually allosteric interactions) or covalent modifications (phosphorylation, methylation, etc). By far most regulatory proteins have two allosteric conformations, one capable of binding to DNA and affecting transcription and the other unbinding from the DNA. There are, however, few cases of proteins interacting with more than one metabolite and affecting transcription differentially, such as the case of TyrR interacting with phenyalanine, tyrosine and tryptophan (911). The design we implemented is not limited to two conformations for each protein. For each regulatory interaction, however, only one conformation is the active one, that is the one that binds to DNA and has a defined positive or negative effect on a given promoter. This enables encoding in the database cases, for example, of a protein that binds in two conformations and activates and represses a single promoter. For instance, AraC binds to DNA and activates in the presence of arabinose the araBAD operon (12)this is one regulatory interactionwhereas in the absence of arabinose it binds to a similar site but now repressing the same promoter, and therefore supporting a different regulatory interaction as defined in the database. Similar cases are those of Ada binding unmethylated (repressing at high concentration) as well as methylated (activating) ada promoter, whereas it only activates the alkA promoter (13). Note that what we call active conformation has nothing to do with activators or repressors but with the configuration that is able to bind DNA and have a defined regulatory effect.
To date, RegulonDB contains information that is knowledge supported on the molecular biology of the different systems. We consider that it is useful, especially for analyzing transcriptome experiments, to expand the database in order to include knowledge from genetic studies in the absence of details of the molecular machinery. Thus, knowing for instance, that in the presence of lysine E.coli represses lysP gene (in the absence of a detailed mechanism) is potentially useful information (14) in the analysis and interpretation of transcriptome experiments. An important effort has been in gathering information on the effector molecules that interact with these proteins, and by these means modify the activation or repression of different genes and operons.
| COMPUTATIONAL INFRASTRUCTURE AND GRAPHIC USER INTERFACE |
|---|
|
|
|---|
We have, as mentioned above, modified the database to include the expansions described previously. These are basically the addition of a table for conformation previously of regulatory proteins, and the encoding of conditions and their link to a regulatory protein, an effector metabolite, and the effect (inducing or repressing) on the regulated genes. We can then include associations between conditions, i.e. glucose as carbon source, and its effect diminishing cAMP and by these means affecting regulation of CRP-controlled promoters. In less characterized genes there may only be an association between, for instance, high osmolarity in the medium and the repression of mdoGH genes (15).
We have spent considerable effort in improving the graphic capabilities of RegulonDB. A Java applet display (JavaTM Development Kit JDKTM 1.1.3 Product of Sun Microsystem, Inc.) starts with a circular representation of the whole E.coli K-12 genome. In this graph forward and reverse genes are easily identified. This circular representation is clickable and a new window is generated displaying all known and predicted objects within a range of 18 kb centered at the selected position, and with arrows at each extreme that enable walking through the chromosome in either direction. Genes and their clustering into operons are displayed, as well as regulatory elements. We have colored the genes based on their functional classification from Riley (16). An additional window is displayed showing the color code.
Further navigation enables the user to get the detailed information about a particular gene, or to focus on a particular operon. The window on the individual operon displays the operon organization, its regulatory elements and the different transcription units associated to it. The different elements (operons, genes, regulatory elements) are clickable in order to obtain text information about their individual properties. Briefly, this new version of RegulonDB has an increased amount of information and a much more attractive graphic browsing interface. A new server supporting the database will speed its access on the web.
| AVAILABILITY |
|---|
|
|
|---|
RegulonDB 3.2 can be accessed through the URL: http://www.cifn.unam.mx/Computational_Biology/regulondb/. We kindly ask users of RegulonDB to cite this article.
| SUPPLEMENTARY MATERIAL |
|---|
|
|
|---|
Supplementary Material is available at NAR Online.
| ACKNOWLEDGEMENTS |
|---|
This work was supported by grant number 0028 from CONACYT-Mexico, grant R01 RR07861-10 from NIH and DE-FG02-98ER62558 from the US Department of Energy.
| FOOTNOTES |
|---|
* To whom correspondence should be addressed. Tel: +527 313 2063; Fax: +527 317 5581; Email: ecoli-t1{at}cifn.unam.mx Present address: Ernesto Pérez-Rueda, Université Libre de Bruxelles av F.D. Roosevelt 50-CP 160/16 B-1050 Bruxelles, Belgium
| REFERENCES |
|---|
|
|
|---|
-
1 Blattner,F.R., Plunkett,G.,III, Bloch,C.A., Perna,N.T., Burland,V., Riley,M., Collado-Vides,J., Glasner,J.D., Rode,C.K., Mayhew,G. et al. (1997) The complete genome sequence of Escherichia coli K-12. Science, 277, 14531462.
2 Thieffry,D., Salgado,H., Huerta,A.M. and Collado-Vides,J. (1998) Prediction of transcriptional regulatory sites in the complete genome sequence of Escherichia coli K-12. Bioinformatics, 14, 391400.
3 Huerta,A.M., Salgado,H., Thieffry,D. and Collado-Vides,J. (1998) RegulonDB: a database on transcriptional regulation in Escherichia coli. Nucleic Acids Res., 26, 5559.
4 Salgado,H., Santos,A., Garza-Ramos,U., van Helden,J., Diaz,E. and Collado-Vides,J. (1999) RegulonDB (version 2.0): a database on transcriptional regulation in Escherichia coli. Nucleic Acids Res., 27, 5960.
5 Salgado,H., Santos-Zavaleta,A., Gama-Castro,S., Millan-Zarate,D., Blattner,F.R. and Collado-Vides,J. (2000) RegulonDB (version 3.0): transcriptional regulation and operon organization in Escherichia coli K-12. Nucleic Acids Res., 28, 6567.
6 Pérez-Rueda,E. and Collado-Vides,J. (2000) The repertoire of DNA-binding transcriptional regulators in Escherichia coli K-12. Nucleic Acids Res., 28, 18381847.
7 Salgado,H., Moreno-Hagelsieb,G., Smith,T.F. and Collado-Vides,J. (2000) Operons in Escherichia coli: genomic analyses and predictions. Proc. Natl Acad. Sci. USA, 97, 66526657.
8 Lin,E.C.C. and Lynch,A.S. (eds) (1996) Regulation of Gene Expression in Escherichia coli. Chapman and Hall, New York.
9 Wilson,T.J., Argaet,V.P., Howlett,G.J. and Davidson,B.E. (1995) Evidence for two aromatic amino acid-binding sites, one ATP-dependent and the other ATP-independent, in the Escherichia coli regulatory protein TyrR. Mol. Microbiol., 17, 483492.[ISI][Medline]
10 Lawley,B., Fujita,N., Ishihama,A. and Pittard,A.J. (1995) The TyrR protein of Escherichia coli is a class I transcription activator. J. Bacteriol., 177, 238241.
11 Lawley,B. and Pittard,A.J. (1994) Regulation of aroL expression by TyrR protein and Trp repressor in Escherichia coli K-12. J. Bacteriol., 176, 69216930.
12 Lobell,R.B. and Schleif,R.F. (1990) DNA looping and unlooping by AraC protein. Science, 250, 528532.
13 Saget,B.M. and Walker,G.C. (1994) The Ada protein acts as both a positive and a negative modulator of Escherichia colis response to methylating agents. Proc. Natl Acad. Sci. USA, 91, 97309734.
14 Neely,M.N. and Olson,E.R. (1996) Kinetics of expression of the Escherichia coli cad operon as a function of pH and lysine. J. Bacteriol., 178, 55225528.
15 Lacroix,J.M., Loubens,I., Tempete,M., Menichi,B. and Bohin,J.P. (1991) The mdoA locus of Escherichia coli consists of an operon under osmotic control. Mol. Microbiol., 5, 17451753.[ISI][Medline]
16 Riley,M. (1993) Functions of the gene products of Escherichia coli. Microbiol. Rev., 57, 862952.
This article has been cited by other articles:
![]() |
Z. Chen, K. A. Lewis, R. K. Shultzaberger, I. G. Lyakhov, M. Zheng, B. Doan, G. Storz, and T. D. Schneider Discovery of Fur binding site clusters in Escherichia coli by information theory models Nucleic Acids Res., November 7, 2007; 35(20): 6762 - 6777. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. M. Saul and V. Filkov Exploring biological network structure using exponential random graph models Bioinformatics, October 1, 2007; 23(19): 2604 - 2611. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. K. Shultzaberger, Z. Chen, K. A. Lewis, and T. D. Schneider Anatomy of Escherichia coli {sigma}70 promoters Nucleic Acids Res., February 16, 2007; 35(3): 771 - 788. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Ranjan, J. Seshadri, V. Vindal, S. Yellaboina, and A. Ranjan iCR: a web tool to identify conserved targets of a regulatory protein across the multiple related prokaryotic species. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W584 - W587. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Li, S. M. Shaw, M. J. Yedwabnick, and C. Chan Using a state-space model with hidden variables to infer transcription factor activities Bioinformatics, March 15, 2006; 22(6): 747 - 754. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. J. Gordon, M. W. Towsey, J. M. Hogan, S. A. Mathews, and P. Timms Improved prediction of bacterial transcription start sites Bioinformatics, January 15, 2006; 22(2): 142 - 148. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Krishna, A. M. C. Andersson, S. Semsey, and K. Sneppen Structure and function of negative feedback loops at the interface of genetic and metabolic networks. Nucleic Acids Res., January 1, 2006; 34(8): 2455 - 2462. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Salgado, S. Gama-Castro, M. Peralta-Gil, E. Diaz-Peredo, F. Sanchez-Solano, A. Santos-Zavaleta, I. Martinez-Flores, V. Jimenez-Jacinto, C. Bonavides-Martinez, J. Segura-Salazar, et al. RegulonDB (version 5.0): Escherichia coli K-12 transcriptional regulatory network, operon organization, and growth conditions Nucleic Acids Res., January 1, 2006; 34(suppl_1): D394 - D397. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. C. Kao, L. M. Tran, and J. C. Liao A Global Regulatory Role of Gluconeogenic Genes in Escherichia coli Revealed by Transcriptome Network Analysis J. Biol. Chem., October 28, 2005; 280(43): 36079 - 36087. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. H. Plantinga, C. van der Does, D. Tomkiewicz, G. van Keulen, W. N. Konings, and A. J. M. Driessen Deletion of the yiaMNO transporter genes affects the growth characteristics of Escherichia coli K-12 Microbiology, May 1, 2005; 151(5): 1683 - 1689. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Zwir, D. Shin, A. Kato, K. Nishino, T. Latifi, F. Solomon, J. M. Hare, H. Huang, and E. A. Groisman Dissecting the PhoP regulatory network of Escherichia coli and Salmonella enterica PNAS, February 22, 2005; 102(8): 2862 - 2867. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Le Gall, P. Darlu, P. Escobar-Paramo, B. Picard, and E. Denamur Selection-driven transcriptome polymorphism in Escherichia coli/Shigella species Genome Res., February 1, 2005; 15(2): 260 - 268. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Ott, A. Hansen, S.-Y. Kim, and S. Miyano Superiority of network motifs over optimal networks and an application to the revelation of gene network evolution Bioinformatics, January 15, 2005; 21(2): 227 - 238. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Lacour and P. Landini {sigma}S-Dependent Gene Expression at the Onset of Stationary Phase in Escherichia coli: Function of {sigma}S-Dependent Genes and Identification of Their Promoter Sequences J. Bacteriol., November 1, 2004; 186(21): 7186 - 7195. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Snel, V. van Noort, and M. A. Huynen Gene co-regulation is highly conserved in the evolution of eukaryotes and prokaryotes Nucleic Acids Res., September 7, 2004; 32(16): 4725 - 4731. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. P. Zimmer, O. Paliy, B. Thomas, P. Gyaneshwar, and S. Kustu Genome Image Programs: Visualization and Interpretation of Escherichia coli Microarray Experiments Genetics, August 1, 2004; 167(4): 2111 - 2119. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Yellaboina, J. Seshadri, M. S. Kumar, and A. Ranjan PredictRegulon: a web server for the prediction of the regulatory protein binding sites and operons in prokaryote genomes Nucleic Acids Res., July 1, 2004; 32(suppl_2): W318 - W320. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Chen, Z. Su, P. Dam, B. Palenik, Y. Xu, and T. Jiang Operon prediction by comparative genomics: an application to the Synechococcus sp. WH8102 genome Nucleic Acids Res., April 19, 2004; 32(7): 2147 - 2157. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Snel and M. A. Huynen Quantifying Modularity in the Evolution of Biomolecular Systems Genome Res., March 1, 2004; 14(3): 391 - 397. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. T. Brown and C. G. Callan Jr. Evolutionary comparisons suggest many novel cAMP response protein binding sites in Escherichia coli PNAS, February 24, 2004; 101(8): 2404 - 2409. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. P. Burgard, E. V. Nikolaev, C. H. Schilling, and C. D. Maranas Flux Coupling Analysis of Genome-Scale Metabolic Network Reconstructions Genome Res., February 1, 2004; 14(2): 301 - 312. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. C. Kao, Y.-L. Yang, R. Boscolo, C. Sabatti, V. Roychowdhury, and J. C. Liao Transcriptome-based determination of multiple transcription regulator activities in Escherichia coli by using network component analysis PNAS, January 13, 2004; 101(2): 641 - 646. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. R. Chaudhuri, A. M. Khan, and M. J. Pallen coliBASE: an online database for Escherichia coli, Shigella and Salmonella comparative genomics Nucleic Acids Res., January 1, 2004; 32(90001): D296 - 299. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Salgado, S. Gama-Castro, A. Martinez-Antonio, E. Diaz-Peredo, F. Sanchez-Solano, M. Peralta-Gil, D. Garcia-Alonso, V. Jimenez-Jacinto, A. Santos-Zavaleta, C. Bonavides-Martinez, et al. RegulonDB (version 4.0): transcriptional regulation, operon organization and growth conditions in Escherichia coli K-12 Nucleic Acids Res., January 1, 2004; 32(90001): D303 - 306. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Lemer, E. Antezana, F. Couche, F. Fays, X. Santolaria, R.'s Janky, Y. Deville, J. Richelle, and S. J. Wodak The aMAZE LightBench: a web interface to a relational database of cellular processes Nucleic Acids Res., January 1, 2004; 32(90001): D443 - 448. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. E. Allen, M. J. Herrgard, M. Liu, Y. Qiu, J. D. Glasner, F. R. Blattner, and B. O. Palsson Genome-Scale Analysis of the Uses of the Escherichia coli Genome: Model-Driven Analysis of Heterogeneous Data Sets J. Bacteriol., November 1, 2003; 185(21): 6392 - 6399. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. M. Gutierrez-Rios, D. A. Rosenblueth, J. A. Loza, A. M. Huerta, J. D. Glasner, F. R. Blattner, and J. Collado-Vides Regulatory Network of Escherichia coli: Consistency Between Literature Knowledge and Microarray Profiles Genome Res., November 1, 2003; 13(11): 2435 - 2443. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. J. Herrgard, M. W. Covert, and B. o. Palsson Reconciling Gene Expression Data With Known Genome-Scale Regulatory Network Structures Genome Res., November 1, 2003; 13(11): 2423 - 2434. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Sabina, N. Dover, L. J. Templeton, D. R. Smulski, D. Soll, and R. A. LaRossa Interfering with Different Steps of Protein Synthesis Explored by Transcriptional Profiling of Escherichia coli K-12 J. Bacteriol., October 15, 2003; 185(20): 6158 - 6170. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. van Helden Regulatory Sequence Analysis Tools Nucleic Acids Res., July 1, 2003; 31(13): 3593 - 3596. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Rombauts, K. Florquin, M. Lescot, K. Marchal, P. Rouze, and Y. Van de Peer Computational Approaches to Identify Promoters and cis-Regulatory Elements in Plant Genomes Plant Physiology, July 1, 2003; 132(3): 1162 - 1176. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Vogel, I. M. Axmann, H. Herzel, and W. R. Hess Experimental and computational analysis of transcriptional start sites in the cyanobacterium Prochlorococcus MED4 Nucleic Acids Res., June 1, 2003; 31(11): 2890 - 2899. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Michel, A. W. Roszak, V. Sauve, J. Maclean, A. Matte, J. R. Coggins, M. Cygler, and A. J. Lapthorn Structures of Shikimate Dehydrogenase AroE and Its Paralog YdiB: A COMMON STRUCTURAL FRAMEWORK FOR DIFFERENT ACTIVITIES J. Biol. Chem., May 23, 2003; 278(21): 19463 - 19472. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. L. Reed and B. O. Palsson Thirteen Years of Building Constraint-Based In Silico Models of Escherichia coli J. Bacteriol., May 1, 2003; 185(9): 2692 - 2699. [Full Text] [PDF] |
||||
![]() |
M. Madan Babu and S. A. Teichmann Evolution of transcription factors and the gene regulatory network in Escherichia coli Nucleic Acids Res., February 15, 2003; 31(4): 1234 - 1244. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. W. Selinger, R. M. Saxena, K. J. Cheung, G. M. Church, and C. Rosenow Global RNA Half-Life Analysis in Escherichia coli Reveals Positional Patterns of Transcript Degradation Genome Res., February 1, 2003; 13(2): 216 - 223. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Munch, K. Hiller, H. Barg, D. Heldt, S. Linz, E. Wingender, and D. Jahn PRODORIC: prokaryotic database of gene regulation Nucleic Acids Res., January 1, 2003; 31(1): 266 - 269. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Ma, A. Campbell, and S. Karlin Correlations between Shine-Dalgarno Sequences and Gene Features Such as Predicted Expression Levels and Operon Structures J. Bacteriol., October 15, 2002; 184(20): 5733 - 5745. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Weber and K. Jung Profiling Early Osmostress-Dependent Gene Expression in Escherichia coli Using DNA Macroarrays J. Bacteriol., October 1, 2002; 184(19): 5502 - 5507. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. B. Rogozin, K. S. Makarova, D. A. Natale, A. N. Spiridonov, R. L. Tatusov, Y. I. Wolf, J. Yin, and E. V. Koonin Congruent evolution of different classes of non-coding DNA in prokaryotic genomes Nucleic Acids Res., October 1, 2002; 30(19): 4264 - 4271. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Li, V. Rhodius, C. Gross, and E. D. Siggia Identification of the binding sites of regulatory proteins in bacterial genomes PNAS, September 3, 2002; 99(18): 11772 - 11777. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. W. Covert and B. O. Palsson Transcriptional Regulation in Constraints-based Metabolic Models of Escherichia coli J. Biol. Chem., July 26, 2002; 277(31): 28058 - 28064. [Abstract] |








