Nucleic Acids Research, 2002, Vol. 30, No. 1 332-334
© 2002 Oxford University Press
TRANSCompel®: a database on composite regulatory elements in eukaryotic genes
1BIOBASE GmbH, Halchtersche Strasse 33, D-38304 Wolfenbüttel, Germany, 2Institute of Cytology and Genetics SB RAN, 10 Lavrentyev pr., 630090, Novosibirsk, Russia and 3AG Bioinformatik, Gesellschaft für Biotechnologische Forschung mbH, Mascheroder Weg 1, D-38124 Braunschweig, Germany
Received September 24, 2001; Accepted October 1, 2001.
| ABSTRACT |
|---|
|
|
|---|
Originating from COMPEL, the TRANSCompel® database emphasizes the key role of specific interactions between transcription factors binding to their target sites providing specific features of gene regulation in a particular cellular content. Composite regulatory elements contain two closely situated binding sites for distinct transcription factors and represent minimal functional units providing combinatorial transcriptional regulation. Both specific factorDNA and factorfactor interactions contribute to the function of composite elements (CEs). Information about the structure of known CEs and specific gene regulation achieved through such CEs appears to be extremely useful for promoter prediction, for gene function prediction and for applied gene engineering as well. Each database entry corresponds to an individual CE within a particular gene and contains information about two binding sites, two corresponding transcription factors and experiments confirming cooperative action between transcription factors. The COMPEL database, equipped with the search and browse tools, is available at http://www.gene-regulation.com/pub/databases.html#transcompel. Moreover, we have developed the program CATCHTM for searching potential CEs in DNA sequences. It is freely available as CompelPatternSearch at http://compel.bionet.nsc.ru/FunSite/CompelPatternSearch.html.
| INTRODUCTION |
|---|
|
|
|---|
Based on known examples, we define a composite element (CE) as a combination of transcription factor binding sites which, as such, and through proteinprotein interactions between the transcription factors involved, provides a known regulatory feature (13). Thus, interacting factors may differ by the structure of DNA-binding, activation, oligomerization and other domains. Along with structural differences, functional properties of the transcription factors, and hence their specific contribution to the transcription regulation, may vary significantly. Co-operative action of the transcription factors within the CEs results in a new highly specific pattern of gene transcription that cannot be provided by the involved factors separately. CEs are structuralfunctional units that provide cross-coupling of gene regulatory pathways and, in particular, cross-coupling of signal transduction pathways (3).
There are two main types of CEs: synergistic and antagonistic. In synergistic CEs, simultaneous interactions of two factors with closely situated target sites results in a non-additive high level of a transcriptional activation. Within an antagonistic CE, two factors interfere with each other.
The TRANSCompel® database originated from the COMPEL database (18). Several new features have been introduced to improve representation of the composite regulatory elements, and content of the database has been significantly increased.
| CONTENT OF THE DATABASE |
|---|
|
|
|---|
During the last 2 years, the number of CEs described in the database has increased by
30%. Two freely available versions of the database have been released, versions 4.4 and 6.0 (Table 1). Distribution of genes among species is as follows: 73 human, 40 mouse, 22 rat, 8 chick and 19 others.
|
Among recent entries there are CEs containing binding sites for the following transcription factors: Smads, 14 entries; steroidogenic factor 1, 11 entries; SREBP, 8 entries; AML/PEBP, 10 entries; PU.1, 19 entries; c-Ets-1,2, 39 entries.
We have extended a description of the experimental evidences confirming factor cooperation within CEs. In a separate field we provide information about protein domains involved in the functional co-operation and/or physical interactions between transcription factors.
Another new field in the database is devoted to the information about confirmed or suggested molecular mechanisms of cooperation between transcription factors.
| CLASSIFICATION OF THE CEs |
|---|
|
|
|---|
Starting with TRANSCompel 4.4, functional classification of the CEs is one of the important features of the database. For that, we have applied a classification of CEs according to the specific transcriptional regulation they provide due to co-operative action of transcriptional factors binding to their target sites (3). In release 6.0, 200 CEs have been classified into the following five main groups.
1. Inducible/inducible: 81 CEs are formed by binding sites for two inducible factors providing cross-coupling of signal transduction pathways. To this group, we have classified, for instance, 14 CEs within different mammalian genes consisting of binding sites for Ets and AP-1 transcription factors, providing cross-coupling of Ras/Raf- and PKC-dependent signalling pathways.
2. Inducible/constitutive: 39 CEs are composed of binding sites for an inducible and a constitutive ubiquitous factor providing some additional features of the inducible regulation. For instance, within Smad/TEF3 and Smad/Sp1 CEs, Smads are inducible by TGF-ß signalling, and TEF3 and Sp1 are constitutive transcription factors. Thus, constitutive factors took an essential part in the regulation by TGF-ß.
3. Tissue-restricted/ubiquitous: 30 CEs are formed by binding sites for a tissue-enriched and a constitutive ubiquitous factor providing some additional features of the tissue-specific transcriptional regulation. For example, steroidogenic cell-restricted transcription factor SF-1 and ubiquitous Sp1 are known to synergistically activate gene expression in steroidogenic cells.
4. Inducible/tissue-restricted: 27 CEs are constituted by binding sites for a tissue-enriched and an inducible factor providing tissue-specific responses to inducing signals. This group may be illustrated by Pit1/AP-1, Pit1/Ets CEs, where Pit1 is a pituitary-restricted transcription factor, whereas AP-1 and Ets are ubiquitous inducible factors. These CEs provide pituitary-restricted induction.
5. Tissue-restricted/tissue-restricted: 23 CEs comprise binding sites for two tissue-enriched factors providing particular aspects of tissue-specific regulation. For example, Ptx-1 is expressed in all pituitary lineages and SF-1 in pituitary gonadotropes only. CEs formed by binding sites for these two factors regulate expression of genes exclusively in gonadotropes where both factors are present.
| AVAILABILITY |
|---|
|
|
|---|
Being maintained internally as a relational database, TRANSCompel® is distributed as a single ASCII flat file. Public versions 4.4 and 6.0 are available at http://www.gene-regulation.com/pub/databases.html#transcompel; the current professional version can be obtained from BIOBASE (http://www.biobase.de; four updates per year). Release COMPEL 3.0 can be found at http://compel.bionet.nsc.ru/. A detailed description of the fields is given in the database documentation. Web-based search and browse options are available.
| CONNECTED PROGRAM |
|---|
|
|
|---|
The program CATCHTM for searching potential CEs in DNA sequences has been developed. A preliminary version of this program, CompelPatternSearch, is publicly available at http://compel.bionet.nsc.ru/FunSite/CompelPatternSearch.html. The current version can be obtained from BIOBASE. A sequence under study is scanned by this program using all CEs collected in the TRANSCompel® database as individual search patterns. Several parameters are available, restricting the search: maximal mismatches in the cores of site1 and site2 comprising the CEs, maximal variation of the distance between two sites, and composite score cut-off value (6,9). The composite score reflects how well the match coincides with the known examples of the CE in TRANSCompel®. This scoring function takes into account the number of mismatches in both sites and the distance between them. All found matches are directly linked to the TRANSCompel® entries containing the corresponding CEs.
| SUPPLEMENTARY MATERIAL |
|---|
|
|
|---|
Supplementary Material is available at NAR Online.
| ACKNOWLEDGEMENTS |
|---|
Authors are grateful to N. A. Kolchanov and A. G. Romaschenko for valuable discussions on the concept of composite regulatory elements. Part of this work has been funded by Volkswagen-Stiftung (I/75941).
| FOOTNOTES |
|---|
* To whom correspondence should be addressed at present address: BIOBASE GmbH, Halchtersche Strasse 33, D-38304 Wolfenbuettel, Germany. Tel: +49 5331 858426; Fax: +49 5331 858470; Email: oke{at}biobase.de
| REFERENCES |
|---|
|
|
|---|
-
1 Kel,O.V., Romaschenko,A.G., Kel,A.E., Wingender,E. and Kolchanov,N.A. (1995) A compilation of composite regulatory elements affecting gene transcription in vertebrates. Nucleic Acids Res., 23, 40974103.
2 Kel,O.V., Romaschenko,A.G., Kel,A.E., Wingender,E. and Kolchanov,N.A. (1997) Composite regulatory elements: classification and description in the COMPEL database. Mol. Biol., 31, 498512.
3 Kel-Margoulis,O.V., Romaschenko,A.G., Kolchanov,N.A., Wingender,E. and Kel,A.E. (2000) COMPEL: a database on composite regulatory elements providing combinatorial transcriptional regulation. Nucleic Acids Res., 28, 311315.
4 Wingender,E., Kel,A.E., Kel,O.V., Karas,H., Heinemeyer,T., Dietze,P., Knüppel,R., Romaschenko,A.G. and Kolchanov,N.A. (1997) TRANSFAC, TRRD and COMPEL: towards a federated database system on transcriptional regulation. Nucleic Acids Res., 25, 265268.
5 Heinemeyer,T., Wingender,E., Reuter,I., Hermjakob,H., Kel,A.E., Kel,O.V., Ignatieva,E.V., Ananko,E.A., Podkolodnaya,O.A., Kolpakov,F.A. et al. (1998) Databases on transcriptional regulation: TRANSFAC, TRRD and COMPEL. Nucleic Acids Res., 26, 362367.
6 Kel-Margoulis,O.V., Kel,A.E., Frisch,M., Romaschenko,A.G., Kolchanov,N.A. and Wingender,E. (1998) COMPEL a database on composite regulatory elements. Proceedings of the First International Conference on Bioinformatics of Genome Regulation and Structure. ICG, Novosibirsk, Vol. 1, pp. 5457.
7 Heinemeyer,T., Chen,X., Karas,H., Kel,A.E., Kel,O.V., Liebich,I., Meinhardt,T., Reuter,I., Schacherer,F. and Wingender,E. (1999) Expanding of the TRANSFAC database towards an expert system of regulatory molecular mechanisms. Nucleic Acids Res., 27, 318322.
8 Kel-Margoulis,O.V., Romaschenko,A.G., Deineko,I.V., Kolchanov,N.A., Wingender,E. and Kel,A.E. (2000) Database on composite regulatory elements in eukaryotic genes (COMPEL). Proceedings of the Second International Conference on Bioinformatics of Genome Regulation and Structure. ICG, Novosibirsk, Vol. 1, pp. 4548.
9 Kel,A., Kel-Margoulis,O., Babenko,V. and Wingender,E. (1999) Recognition of NFATp/AP-1 composite elements within genes induced upon the activation of immune cells. J. Mol. Biol., 288, 353376.[Web of Science][Medline]
This article has been cited by other articles:
![]() |
C. S. Barr, R. L. Dvoskin, Q. Yuan, R. H. Lipsky, M. Gupte, X. Hu, Z. Zhou, M. L. Schwandt, S. G. Lindell, M. McKee, et al. CRH Haplotype as a Factor Influencing Cerebrospinal Fluid Levels of Corticotropin-Releasing Hormone, Hypothalamic-Pituitary-Adrenal Axis Activity, Temperament, and Alcohol Consumption in Rhesus Macaques Arch Gen Psychiatry, August 1, 2008; 65(8): 934 - 944. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Hu, H. Hu, and X. Li MOPAT: a graph-based method to predict recurrent cis-regulatory modules from known motifs Nucleic Acids Res., August 1, 2008; 36(13): 4488 - 4497. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Wingender The TRANSFAC project as an example of framework technology that supports the analysis of genomic regulation Brief Bioinform, July 1, 2008; 9(4): 326 - 332. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Hannenhalli Eukaryotic transcription factor binding sites--modeling and integrative search methods Bioinformatics, June 1, 2008; 24(11): 1325 - 1331. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. J. Yarwood, G. Borland, W. A. Sands, and T. M. Palmer Identification of CCAAT/Enhancer-binding Proteins as Exchange Protein Activated by cAMP-activated Transcription Factors That Mediate the Induction of the SOCS-3 Gene J. Biol. Chem., March 14, 2008; 283(11): 6843 - 6853. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. GuhaThakurta Computational identification of transcriptional regulatory elements in DNA sequence Nucleic Acids Res., July 19, 2006; 34(12): 3585 - 3598. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Waleev, D. Shtokalo, T. Konovalova, N. Voss, E. Cheremushkin, P. Stegmaier, O. Kel-Margoulis, E. Wingender, and A. Kel Composite Module Analyst: identification of transcription factor binding site combinations using genetic algorithm. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W541 - W545. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Kel, T. Konovalova, T. Waleev, E. Cheremushkin, O. Kel-Margoulis, and E. Wingender Composite Module Analyst: a fitness-based tool for identification of transcription factor binding site combinations Bioinformatics, May 15, 2006; 22(10): 1190 - 1197. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Robertson, M. Bilenky, K. Lin, A. He, W. Yuen, M. Dagpinar, R. Varhol, K. Teague, O. L. Griffith, X. Zhang, et al. cisRED: a database system for genome-scale computational discovery of regulatory elements Nucleic Acids Res., January 1, 2006; 34(suppl_1): D68 - D73. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Matys, O. V. Kel-Margoulis, E. Fricke, I. Liebich, S. Land, A. Barre-Dirrie, I. Reuter, D. Chekmenev, M. Krull, K. Hornischer, et al. TRANSFAC(R) and its module TRANSCompel(R): transcriptional gene regulation in eukaryotes Nucleic Acids Res., January 1, 2006; 34(suppl_1): D108 - D110. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. S. Chekmenev, C. Haid, and A. E. Kel P-Match: transcription factor binding site search by combining patterns and weight matrices Nucleic Acids Res., July 1, 2005; 33(suppl_2): W432 - W437. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. V. Favorov, M. S. Gelfand, A. V. Gerasimova, D. A. Ravcheev, A. A. Mironov, and V. J. Makeev A Gibbs sampler for identification of symmetrically structured, spaced DNA motifs with improved estimation of the signal length Bioinformatics, May 15, 2005; 21(10): 2240 - 2245. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Alazard, M. Blaud, S. Elbaz, C. Vossen, G. Icre, G. Joseph, L. Nieto, and M. Erard Identification of the 'NORE' (N-Oct-3 responsive element), a novel structural motif and composite element Nucleic Acids Res., March 14, 2005; 33(5): 1513 - 1523. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Kel, S. Reymann, V. Matys, P. Nettesheim, E. Wingender, and J. Borlak A Novel Computational Approach for the Prediction of Networked Transcription Factors of Aryl Hydrocarbon-Receptor-Regulated Genes Mol. Pharmacol., December 1, 2004; 66(6): 1557 - 1572. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. L. Bulyk, A. M. McGuire, N. Masuda, and G. M. Church A Motif Co-Occurrence Approach for Genome-Wide Prediction of Transcription-Factor-Binding Sites in Escherichia coli Genome Res., February 1, 2004; 14(2): 201 - 208. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. J. Makeev, A. P. Lifanov, A. G. Nazina, and D. A. Papatsenko Distance preferences in the arrangement of binding motifs and hierarchical levels in organization of transcription regulatory information Nucleic Acids Res., October 15, 2003; 31(20): 6016 - 6026. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. L. Duval, A. Jean, and A. Gutierrez-Hartmann Ras Signaling and Transcriptional Synergy at a Flexible Ets-1/Pit-1 Composite DNA Element Is Defined by the Assembly of Selective Activation Domains J. Biol. Chem., October 10, 2003; 278(41): 39684 - 39696. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. J. K. Edwards, T. J. Carver, T. Vavouri, M. Frith, M. J. Bishop, and G. Elgar Theatre: a software tool for detailed comparative analysis and visualization of genomic sequence Nucleic Acids Res., July 1, 2003; 31(13): 3510 - 3517. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Krull, N. Voss, C. Choi, S. Pistor, A. Potapov, and E. Wingender TRANSPATH(R): an integrated database on signal transduction and a tool for array analysis Nucleic Acids Res., January 1, 2003; 31(1): 97 - 100. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. A. Shahmuradov, A. J. Gammerman, J. M. Hancock, P. M. Bramley, and V. V. Solovyev PlantProm: a database of plant promoter sequences Nucleic Acids Res., January 1, 2003; 31(1): 114 - 117. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Matys, E. Fricke, R. Geffers, E. Gossling, M. Haubrock, R. Hehl, K. Hornischer, D. Karas, A. E. Kel, O. V. Kel-Margoulis, et al. TRANSFAC(R): transcriptional regulation, from patterns to profiles Nucleic Acids Res., January 1, 2003; 31(1): 374 - 378. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||






