Nucleic Acids Research, 2004, Vol. 32, Database issue D75-D77
© 2004 Oxford University Press
DBTBS: database of transcriptional regulation in Bacillus subtilis and its contribution to comparative genomics
1 Human Genome Center, The Institute of Medical Science, University of Tokyo, 4-6-1 Shirokanedai, Minato-ku, Tokyo 108-8639, Japan, 2 Department of Applied Physics, Graduate School of Engineering, Nagoya University, Chikusa-ku, Nagoya 464-8603, Japan and 3 Department of Bioinformatics and Genomics, Graduate School of Information Science, Nara Institute of Science and Technology, 8916-5 Takayama, Ikoma, Nara 630-0101, Japan
*To whom correspondence should be addressed. Tel: +81 3 5449 5619; Fax: +81 3 5449 5434; Email: knakai{at}ims.u-tokyo.ac.jp
Received September 15, 2003; Accepted September 30, 2003
| ABSTRACT |
|---|
DBTBS (http://dbtbs.hgc.jp) was originally released in 1999 as a reference database of published transcriptional regulation events in Bacillus subtilis, one of the best studied bacteria. It is essentially a compilation of transcription factors with their regulated genes as well as their recognition sequences, which were experimentally characterized and reported in the literature. Here we report its major update, which contains information on 114 transcription factors, including sigma factors, and 633 promoters of 525 genes. The number of references cited in the database has increased from 291 to 378. It also provides a function for finding putative transcription factor binding sites within input sequences using our collection of weight matrices and consensus patterns. Furthermore, although still at a preliminary stage, DBTBS now aims to contribute to comparative genomics by showing the presence or absence of potentially orthologous transcription factors and their corresponding cis-elements on the promoters of their potentially orthologously regulated genes in 50 eubacterial genomes.
| INTRODUCTION |
|---|
Bacillus subtilis is one of the most intensively studied bacteria: its genome has been completely sequenced, its essential gene set has been defined and systematic functional studies are ongoing worldwide (1–3). For a more comprehensive understanding of this organism, reference databases containing the results of previous studies are essential. SubtiList is one successful example (4), but it does not provide users with detailed information on the B.subtilis transcription system. Therefore, we have constructed a database of transcriptional regulation in B.subtilis (DBTBS) that contains transcriptional information specific to this organism (5). In this report, we introduce the recent progress of DBTBS, including the presentation of phylogenetic conservation information for both transcription factors and their recognition elements.
| UPDATES AND NEW FEATURES |
|---|
In Release 3, DBTBS contains information on 114 binding factors and 633 promoters of 525 regulated genes. These binding factors include σ factors [nine σ70-related, one σ54-related and five extracytoplasmic function (ECF) family members]. The promoters include those of six rRNA operons as well as plasmid-encoded promoters. It contains 203 annotated transcriptional start sites and 129 operons. The number of cited references now amounts to 378, which is an approximately 30% increase on the previous release. Although we are trying to keep the database as comprehensive as possible, some references remain to be incorporated. For example, information obtained from microarray experiments has not been fully included. User feedback to fix errors or to add more data is welcome.
For each transcription factor, the collected sequences around the binding sites were realigned using MEME (6) as well as by eye. A weight matrix of its sequence specificity was then constructed, with pseudocounts, if a sufficient number of examples (say, three) were available; if there were too few examples, a consensus pattern (such as TATAAT) was derived instead of a weight matrix. The resulting set of weight matrices and consensus patterns supports a new function that annotates input sequences with the hits of this set. The set will be available upon request.
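The choice between a weight matrix and a consensus pattern can be illustrated with a minimal Python sketch. It is not the DBTBS implementation: the function names, the pseudocount of 1.0, the uniform background frequencies, the log-odds scoring and the score threshold are all assumptions made for illustration, and the sites are assumed to be pre-aligned, uppercase and restricted to A, C, G and T.

```python
import math
from collections import Counter

BASES = "ACGT"

def build_weight_matrix(sites, pseudocount=1.0, background=None):
    """Log-odds weight matrix from aligned sites of equal length.

    The pseudocount and the uniform background are illustrative assumptions.
    """
    background = background or {b: 0.25 for b in BASES}
    length = len(sites[0])
    if any(len(s) != length for s in sites):
        raise ValueError("sites must be aligned to the same length")
    total = len(sites) + pseudocount * len(BASES)
    matrix = []
    for col in range(length):
        counts = Counter(s[col] for s in sites)
        matrix.append({
            b: math.log2(((counts[b] + pseudocount) / total) / background[b])
            for b in BASES
        })
    return matrix

def score(matrix, window):
    """Sum of per-position log-odds scores for one window (A/C/G/T only)."""
    return sum(col[base] for col, base in zip(matrix, window))

def scan(matrix, sequence, threshold):
    """Return (0-based offset, matched window, score) for windows at or above threshold."""
    width = len(matrix)
    hits = []
    for i in range(len(sequence) - width + 1):
        window = sequence[i:i + width]
        s = score(matrix, window)
        if s >= threshold:
            hits.append((i, window, s))
    return hits

def consensus(sites):
    """Most frequent base per column, used when too few sites are known."""
    return "".join(Counter(col).most_common(1)[0][0] for col in zip(*sites))

def model_for(sites, min_sites=3):
    """Weight matrix if at least min_sites examples exist, else a consensus pattern."""
    if len(sites) >= min_sites:
        return ("matrix", build_weight_matrix(sites))
    return ("consensus", consensus(sites))

# Hypothetical aligned sites, for illustration only.
example_sites = ["TATAAT", "TATAAT", "TACAAT", "TATACT"]
kind, model = model_for(example_sites)
hits = scan(model, "GGGTATAATGGG", threshold=5.0)
```

In practice the threshold would be tuned per factor, since a fixed cut-off trades sensitivity against the number of false positives.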
Following requests from several researchers, the position of each binding site in the genome was newly recorded in the database. These data will make it easier to obtain the surrounding sequence at any length. Such data could be used for training or evaluating motif-finding software such as MEME.
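As a rough sketch of how recorded genome positions could be used to pull out surrounding sequence of any length, the following Python function extracts a site plus flanking bases. The coordinate convention (1-based, inclusive), the function names and the toy genome string are assumptions for illustration and do not describe the DBTBS data format; the wrap-around option reflects the circular B.subtilis chromosome.

```python
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")

def reverse_complement(seq):
    """Reverse complement of a DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]

def flanking_sequence(genome, start, end, flank=0, strand="+", circular=True):
    """Return the site at [start, end] with `flank` extra bases on each side.

    Coordinates are assumed to be 1-based and inclusive on the forward strand.
    For a circular chromosome, positions past either end wrap around.
    """
    n = len(genome)
    if not (1 <= start <= end <= n):
        raise ValueError("coordinates must satisfy 1 <= start <= end <= len(genome)")
    lo = start - 1 - flank  # 0-based, may be negative
    hi = end + flank        # exclusive, may exceed the genome length
    if circular:
        region = "".join(genome[i % n] for i in range(lo, hi))
    else:
        region = genome[max(lo, 0):min(hi, n)]
    return reverse_complement(region) if strand == "-" else region

# Toy genome string, for illustration only.
toy_genome = "AACCGGTTAC"
site_with_flanks = flanking_sequence(toy_genome, 4, 6, flank=2)
```

Sequences collected this way, with a chosen flank length, could serve as input sets for motif-finding programs.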
We also redesigned the graphical representation of promoters (Fig. 1). One notable feature is that overlapping binding sites can be perceived more easily (core consensus regions are also highlighted in color). Another is that the sequence information is represented in four colors. When users move the mouse over objects, relevant information such as the gene name and the binding factor is displayed; clicking a binding site opens a list of known binding sites. The annotation style follows the convention of the sequence ontology project (http://song.sourceforge.net), so that our annotation can easily be integrated into other systems.
| PHYLOGENETIC CONSERVATION STUDY |
|---|
One of the problems of using weight matrices or consensus patterns to identify novel recognition positions of known transcription factors is that it often produces a number of false positives. To overcome this problem, sequence conservation information between closely related species is widely used, an approach called phylogenetic footprinting. For example, we predicted B.subtilis regulons based on the conservation of upstream sequence segments between B.subtilis, Bacillus halodurans and Bacillus stearothermophilus (7). In this new release, we expand such an analysis into a systematic survey of the conservation of known cis-elements, as well as their binding factors, through available eubacterial genome sequences. For this purpose, we used the orthology information of the COG database (8) and applied the above-mentioned weight matrices/consensus patterns to the upstream sequences of orthologous genes. The diagram summarizing the result is shown in Figure 2. In this diagram, each column represents a bacterium (its name is displayed in the window above when the mouse is placed over its position). In the upper table, a . or + sign shows the absence or presence of an orthologous binding factor. In the lower table, a . indicates the absence of the orthologous regulated gene; a + means the presence of the ortholog of the regulated gene but the absence of the conserved element; and a filled circle means the presence of both (by clicking the circle, users can see the sequences of all detected sites as putative binding sites). Of course, the determination of the presence or absence of orthologs is itself a highly delicate problem that depends on, say, the setting of the cut-off values. In this sense, the current version is only a starting point for more comprehensive analyses. However, such data are likely to stimulate many interesting studies on the evolution of transcriptional networks in eubacteria.
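The three-way coding of the lower table can be sketched in a few lines of Python. This is only an illustration of the logic described above: the genome labels, the upstream sequences and the use of a plain substring match for the consensus pattern are hypothetical, and the actual orthology assignments would come from a resource such as COG.

```python
def conservation_symbol(upstream, has_site):
    """Symbol for one genome in the lower table.

    upstream: upstream sequence of the orthologous regulated gene,
              or None when no ortholog was found in that genome.
    has_site: predicate that reports whether a putative cis-element is present.
    """
    if upstream is None:
        return "."   # no orthologous regulated gene
    if has_site(upstream):
        return "●"   # ortholog present and element detected
    return "+"       # ortholog present, element not detected

def conservation_row(upstreams_by_genome, has_site):
    """Map each genome label to its symbol."""
    return {
        genome: conservation_symbol(upstream, has_site)
        for genome, upstream in upstreams_by_genome.items()
    }

# Hypothetical inputs: genome labels and sequences are placeholders.
example_upstreams = {
    "genomeA": "GGTATAATGG",
    "genomeB": "GGGGGGGG",
    "genomeC": None,
}
row = conservation_row(example_upstreams, lambda seq: "TATAAT" in seq)
```

Passing the site detector as a predicate keeps the table logic independent of whether a weight matrix or a consensus pattern is used for a given factor.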
Another direction of future studies using our data is the construction of a comprehensive model of the transcription system in B.subtilis. In such studies, the incorporation of accumulating microarray data will be a challenging task. We believe that DBTBS is a useful resource for both experimental and theoretical researchers of B.subtilis as well as of other eubacteria.
| ACKNOWLEDGEMENTS |
|---|
We thank T. Ishii, G. Terai, K. Yoshida and Y. Fujita for their contribution to previous releases, and Michiel J. L. de Hoon for checking the contents of DBTBS and critically reading the manuscript. This work was supported by a Grant-in-Aid for Scientific Research on Priority Areas 'Genome Biology' from the Ministry of Education, Culture, Sports, Science and Technology of Japan.
| REFERENCES |
|---|
1. Kunst,F., Ogasawara,N., Moszer,I., Albertini,A.M., Alloni,G., Azevedo,V., Bertero,M.G., Bessieres,P., Bolotin,A., Borchert,S. et al. (1997) The complete genome sequence of the Gram-positive bacterium Bacillus subtilis. Nature, 390, 249–256.
2. Kobayashi,K., Ehrlich,S.D., Albertini,A., Amati,G., Andersen,K.K., Arnaud,M., Asai,K., Ashikaga,S., Aymerich,S., Bessieres,P. et al. (2003) Essential Bacillus subtilis genes. Proc. Natl Acad. Sci. USA, 100, 4678–4683.
3. Ogasawara,N. (2000) Systematic function analysis of Bacillus subtilis genes. Res. Microbiol., 151, 129–134.
4. Moszer,I., Jones,L.M., Moreira,S., Fabry,C. and Danchin,A. (2002) SubtiList: the reference database for the Bacillus subtilis genome. Nucleic Acids Res., 30, 62–65.
5. Ishii,T., Yoshida,K., Terai,G., Fujita,Y. and Nakai,K. (2001) DBTBS: a database of Bacillus subtilis promoters and transcription factors. Nucleic Acids Res., 29, 278–280.
6. Bailey,T.L. and Elkan,C. (1994) Fitting a mixture model by expectation maximization to discover motifs in biopolymers. In Altman,R., Brutlag,D., Karp,P., Lathrop,R. and Searls,D. (eds), Proceedings of the 2nd International Conference on Intelligent Systems for Molecular Biology. AAAI Press, Menlo Park, CA, pp. 28–36.
7. Terai,G., Takagi,T. and Nakai,K. (2001) Prediction of co-regulated genes in Bacillus subtilis on the basis of upstream elements conserved across three closely related species. Genome Biol., 2, RESEARCH0048.1–0048.12.
8. Tatusov,R.L., Fedorova,N.D., Jackson,J.J., Jacobs,A.R., Kiryutin,B., Koonin,E.V., Krylov,D.M., Mazumder,R., Mekhedov,S.L., Nikolskaya,A.N. et al. (2003) The COG database: an updated version includes eukaryotes. BMC Bioinformatics, 4, 41.