Skip Navigation

This Article
Right arrow Abstract Freely available
Right arrow Print PDF (23K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (16)
Right arrowRequest Permissions
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Salgado, H.
Right arrow Articles by Collado-Vides, J.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Salgado, H.
Right arrow Articles by Collado-Vides, J.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Nucleic Acids Research Pages 59-60  


RegulonDB (version 2.0): a database on transcriptional regulation in Escherichia coli
Introduction
Changes To The New Version
   Computational infrastructure
   Changes to the biological content
Availability
Acknowledgements
References


RegulonDB (version 2.0): a database on transcriptional regulation in Escherichia coli

RegulonDB (version 2.0): a database on transcriptional regulation in Escherichia coli

Heladia Salgado, Alberto Santos, Ulises Garza-Ramos, Jacques van Helden+, Edgar Díaz and Julio Collado-Vides*

Centro de Investigación sobre Fijación de Nitrógeno, UNAM A.P., 565-A Cuernavaca, Morelos 62100, México

Received October 1, 1998; Revised and Accepted October 9, 1998

ABSTRACT

RegulonDB version 2.0, a database on transcriptional regulation and operon organization in Escherichia coli, is now available on the web at the following URL: http://www.cifn.unam.mx/Computational_Biology/regulondb/ . In this paper we describe the main computational changes to the database, which include migrating the database to Sybase, providing graphical descriptions of the internal organization of operons and regulons, and direct links to MEDLINE references. The web interface offers searching either by mechanisms of regulation or by operon organization. The results of a search (operon organization, or site collection) are displayed as hypertext, and can also be displayed graphically. In terms of its contents, RegulonDB contains a large number of operons, as well as the absolute position in the completed genome sequence of sites, promoters, and individual genes of E.coli.

INTRODUCTION

RegulonDB is a relational database containing information on mechanisms of regulation at the level of transcription initiation in Escherichia coli, as well as information on the organization of genes into operons. Our previous publication in the previous Database Issue of this journal explains in detail the design of this relational database (1). The modifications and improvements to RegulonDB during the previous year are summarized below.

CHANGES TO THE NEW VERSION

Computational infrastructure

RegulonDB uses a relational database scheme. The design of the database is explained in detail in ref. 1. We have migrated the database into Unix using Sybase SQL-Server, a relational database manager system. The relational design has been kept as previously described, with the following tables: operons, promoters, genes, regulatory interactions, polypeptides and proteins, signals and conditions, and finally references.

Scripts were written in Transact-SQL language and Sybperl (http://www.mbay.net/~mpeppler/ ) in order to make the database available on the web. To generate graphical web interfaces, we used Perl (2), java applets and JDBC-driver. In addition, Reference Manager (Research Information System, Inc., Camino Corporate) has been used to organize a parallel literature database. Biological information has been gathered from the literature using Medline and PubMed (3; http://www.ncbi.nlm.nih.gov/Entrez/medline.html ), as well as other biological database sources as mentioned below.

The web interface for RegulonDB can be found at http://www.cifn.unam.mx/Computational_Biology/regulondb/ . The menu on the left of the main page offers three types of search procedures, what we call the simple search, the search for regulatory mechanisms, and the search for operon organization. These were built keeping in mind the main biological content of the database centered, as mentioned before, in mechanisms of regulation, and operon and regulon organization. Furthermore, the results of each search can be displayed graphically. These graphics are built with the feature-map program described in ref. 4 (http://copan.cifn.unam.mx/Computational_Biology/yeast-tools/ ). Browsing by operon organization, users can get a graphical automatic display of the internal organization of operons, with the genes, promoters and sites indicated, with symbols in different lines associated to their direction in the chromosome.

Browsing by regulatory mechanism permits, for instance, to search for the complete set of regulatory sites of a given protein, its alignment and weight matrix, and to see such sites in a single graph in relation to the transcription initiation of each regulated promoter. Alternatively, if the name of the promoter is indicated, all sites that regulate it can be obtained. A `help' button permits the user to go through the documentation explaining in detail the options implicit in such searching procedures, and the restrictions in the syntax for inputs in these browsing tools. The help pages include examples in order to guide new users through the basic functionality of the search forms.

All bibliographic references are now described uniformily by means of active MEDLINE numbers used as direct links to PubMed. In this way the users can go directly, for instance, from a regulatory site to the reference in Medline reporting the experimental work that supports it.

Finally, we have also developed tools for the internal management, updating and correction of the database. These include programs that check for the consistency of new data with exisiting data, ensuring data integrity. We verify, for instance, that promoters are well located upstream of the beginning of the transcribed gene; that two distinct entries do not have the same name; that genes within an operon are all transcribed in the same direction; and that one gene belongs to one and only one operon. Note that this last restriction is a consequence of the structural definition of operons implemented in RegulonDB. Complex operons can transcribe subsets of genes under different physiological conditions by means of different promoters and internal terminators. The design of RegulonDB (i.e., adding a termination feature to the genes table) can describe such complex operons, as well as the absolute position of sites, promoters, and individual genes in the genome sequence of E.coli.

Changes to the biological content

Information about the clustering of genes into operons was initially obtained from Mary Berlyn (E.coli Genetic Stock Center: http://cgsc.biology.yale.edu/ ). Based on that initial input, we searched in the literature and started correcting a large number of operons. One of the reasons that explain why this information required extensive verification is that biologists use the terms of `operon' or `transcripton unit' with different precise meanings. RegulonDB is based on the definition of an operon as a set of co-transcribed genes and their associated regulatory elements. This permits a precise notation for names of operons. For instance, if three genes such as araB, araA and araD form an operon, the operon is named araBAD. In the more complicated case of an operon formed by genes with unrelated names, we define the name of the operon concatenating the gene names. For instance, the operon containing aroF and tyrA is named as the aroF-tyrA operon; and that containing aroL, yaiA, and aroM, is named the aroL-yaiA-aroM operon.

Additional corrections come from the multiple names given to individual genes, and to the positioning of genes in the complete E.coli K-12 sequence. A brief summary of such changes is shown in Table 1. Our absolute coordinates are based on the current, so-called m52 version, of the genome described in ref. 5. New additions to the database include, among others, the identifiers for genes used in the Blattner laboratory, or `b-numbers'; and the set of predicted operons, promoters and regulatory sites. A more detailed description of the method used to predict regulatory sites is contained in ref. 6. Briefly, the strategy followed for these predictions was to complement what is already known from experiments reported in the literature. Thus, promoters were searched and predicted only in those upstream regulatory regions that do not contain a single promoter experimentally described. Similarly, genes were clustered in operons by means of an algorithm only in cases where we have no experimental information of operon organization. In this way, predictions complement the experimental knowledge, generating together an organized structural description of the complete genome into sets of operons. RegulonDB will then contain this mixture of known and predicted objects, which will be clearly distinguised by a `type' field within operons, promoters or regulatory interactions. It is important to emphasize that the set of predictions may change with time, as new improved methods will be utilized, and as we continue gathering information from the literature.

Table 1. Summary of corrections
Number of promoter corrections
Position Name Sigma factor Strand
106 15 2 3
Number of `regulatory interaction' corrections
Type of evidence Medline accession number
20 74
Number of gene corrections
Name Synonyms Strand
7 6 12

AVAILABILITY

RegulonDB 2.0 can be accessed though the URL: http://www.cifn.unam.mx/Computational_Biology/regulondb/ . We kindly ask users of RegulonDB (2.0) to cite this article.

ACKNOWLEDGEMENTS

This work was supported by grants from DGAPA and Conacyt to JC-V. We want to thank the availability of the operon information to Mary Berlyn, as well as to Monica Riley for her classification of gene functions, and function of gene product information (together with Guy Plunkett III). We thank the availability of the E.coli sequence to the group of Fred Blattner. We also appreciate useful discussions with Araceli Huerta, Peter Karp and Edgar Wingender.

REFERENCES

1. Huerta,A.M., Salgado,H., Thieffry,D. and Collado-Vides,J. (1998) Nucleic Acids Res., 26, 55-60. MEDLINE Abstract

2. Wall,L. and Schwartz,R.L. (1991) Programming Perl. O'Reilly and Associates, Inc., Sebastopol, CA.

3. Benson,D.A., Bogusky,M., Lipman,D.J. and Ostell,J. (1998) Nucleic Acids Res., 26, 1-7. MEDLINE Abstract

4. van Helden,J., André,B. and Collado-Vides,J. (1998) J. Mol. Biol., 281, 827-842.

5. Blattner,F.R., Plunkett,G.,III, Bloch,C.A., Perna,N.T., Burland,V., Riley,M., Collado-Vides,J., Glasner,J.D., Rode,C.K., Mayhew,G., Gregor,J., Davis,N.W., Kirkpatrick,H.A., Goeden,M.A., Rose,D.J., Mau,B. and Shao,Y. (1997) Science, 277, 1453-1462. MEDLINE Abstract

6. Thieffry,D., Salgado,H., Huerta,A.M. and Collado-Vides,J. (1998) Bioinformatics, 14, 391-400. MEDLINE Abstract


*To whom correspondence should be addressed. Tel: +52 7 313 2063; Fax: +52 7 317 5581; Email: ecoli-t1{at}cifn.unam.mx
+Present address: European Bioinformatics Institute, EMBL Outstation, EBI, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK


This page is run by Oxford University Press, Great Clarendon Street, Oxford OX2 6DP, as part of the OUP Journals
Comments and feedback: www-admin{at}oup.co.uk
Last modification: 9 Dec 1998
Copyright©Oxford University Press, 1998.

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Brief BioinformHome page
J. Baumbach, A. Tauch, and S. Rahmann
Towards the integrated analysis, visualization and reconstruction of microbial gene regulatory networks
Brief Bioinform, January 1, 2009; 10(1): 75 - 83.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
H. Salgado, S. Gama-Castro, M. Peralta-Gil, E. Diaz-Peredo, F. Sanchez-Solano, A. Santos-Zavaleta, I. Martinez-Flores, V. Jimenez-Jacinto, C. Bonavides-Martinez, J. Segura-Salazar, et al.
RegulonDB (version 5.0): Escherichia coli K-12 transcriptional regulatory network, operon organization, and growth conditions
Nucleic Acids Res., January 1, 2006; 34(suppl_1): D394 - D397.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
H. Salgado, S. Gama-Castro, A. Martinez-Antonio, E. Diaz-Peredo, F. Sanchez-Solano, M. Peralta-Gil, D. Garcia-Alonso, V. Jimenez-Jacinto, A. Santos-Zavaleta, C. Bonavides-Martinez, et al.
RegulonDB (version 4.0): transcriptional regulation, operon organization and growth conditions in Escherichia coli K-12
Nucleic Acids Res., January 1, 2004; 32(90001): D303 - 306.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
H. Salgado, A. Santos-Zavaleta, S. Gama-Castro, D. Millan-Zarate, E. Diaz-Peredo, F. Sanchez-Solano, E. Perez-Rueda, C. Bonavides-Martinez, and J. Collado-Vides
RegulonDB (version 3.2): transcriptional regulation and operon organization in Escherichia coli K-12
Nucleic Acids Res., January 1, 2001; 29(1): 72 - 74.
[Abstract] [Full Text] [PDF]


Home page
MicrobiologyHome page
N. Sommer, V. Salniene, E. Gineikiene, R. Nivinskas, and W. Rüger
T4 early promoter strength probed in vivo with unribosylated and ADP-ribosylated Escherichia coli RNA polymerase: a mutation analysis
Microbiology, October 1, 2000; 146(10): 2643 - 2653.
[Abstract] [Full Text]


Home page
Nucleic Acids ResHome page
E. Perez-Rueda and J. Collado-Vides
The repertoire of DNA-binding transcriptional regulators in Escherichia coli K-12
Nucleic Acids Res., April 15, 2000; 28(8): 1838 - 1847.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
H. Salgado, A. Santos-Zavaleta, S. Gama-Castro, D. Millan-Zarate, F. R. Blattner, and J. Collado-Vides
RegulonDB (version 3.0): transcriptional regulation and operon organization in Escherichia coli K-12
Nucleic Acids Res., January 1, 2000; 28(1): 65 - 67.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
J. V. Ponomarenko, G. V. Orlova, M. P. Ponomarenko, S. V. Lavryushev, A. S. Frolov, S. V. Zybova, and N. A. Kolchanov
SELEX_DB: an activated database on selected randomized DNA/RNA sequences addressed to genomic sequence annotation
Nucleic Acids Res., January 1, 2000; 28(1): 205 - 208.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
N. Stojanovic, L. Florea, C. Riemer, D. Gumucio, J. Slightom, M. Goodman, W. Miller, and R. Hardison
Comparison of five methods for finding conserved sequences in multiple alignments of gene regulatory regions
Nucleic Acids Res., October 1, 1999; 27(19): 3899 - 3910.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
H. Salgado, G. Moreno-Hagelsieb, T. F. Smith, and J. Collado-Vides
Operons in Escherichia coli: Genomic analyses and predictions
PNAS, June 6, 2000; 97(12): 6652 - 6657.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Print PDF (23K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (16)
Right arrowRequest Permissions
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Salgado, H.
Right arrow Articles by Collado-Vides, J.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Salgado, H.
Right arrow Articles by Collado-Vides, J.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?