Nucleic Acids Research, 2003, Vol. 31, No. 6 1753-1764
© 2003 Oxford University Press
Toucan: deciphering the cis-regulatory logic of coregulated genes
Department of Electrical Engineering (ESAT-SCD), Katholieke Universiteit Leuven, Kasteelpark Arenberg 10, 3001 Heverlee (Leuven), Belgium
*To whom correspondence should be addressed. Tel: +32 16321801; Fax: +32 321970; Email: stein.aerts{at}esat.kuleuven.ac.be
TOUCAN is a Java application for the rapid discovery of significant cis-regulatory elements from sets of coexpressed or coregulated genes. Biologists can automatically (i) retrieve genes and intergenic regions, (ii) identify putative regulatory regions, (iii) score sequences for known transcription factor binding sites, (iv) identify candidate motifs for unknown binding sites, and (v) detect those statistically over-represented sites that are characteristic for a gene set. Genes or intergenic regions are retrieved from Ensembl or EMBL, together with orthologs and supporting information. Orthologs are aligned and syntenic regions are selected as candidate regulatory regions. Putative sites for known transcription factors are detected using our MotifScanner, which scores position weight matrices using a probabilistic model. New motifs are detected using our MotifSampler based on Gibbs sampling. Binding sites characteristic for a gene set, and thus statistically over-represented with respect to a reference sequence set, are found using a binomial test. We have validated Toucan by analyzing muscle-specific genes, liver-specific genes and E2F target genes; we have easily detected many known binding sites within intergenic DNA and identified new biologically plausible sites for known and unknown transcription factors. Software available at http://www.esat.kuleuven.ac.be/dna/BioI/Software.html.
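The over-representation step can be illustrated with a minimal sketch of a one-sided binomial test. This is not Toucan's own code: the function names are hypothetical, and it assumes that each sequence in the gene set counts as one trial (the sequence either contains a predicted site or it does not) and that the background probability is estimated as the fraction of reference sequences containing the site.

```python
from math import comb


def binomial_tail(k: int, n: int, p: float) -> float:
    """Return P(X >= k) for X ~ Binomial(n, p)."""
    if k <= 0:
        return 1.0
    # Sum the exact binomial probabilities from k up to n.
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


def overrepresentation_pvalue(
    hits_in_set: int,
    set_size: int,
    hits_in_reference: int,
    reference_size: int,
) -> float:
    """One-sided p-value that a site is over-represented in the gene set.

    Assumption (not from the source): trials are sequences, and the
    background probability is the fraction of reference sequences with a hit.
    """
    if reference_size <= 0:
        raise ValueError("reference_size must be positive")
    p_background = hits_in_reference / reference_size
    return binomial_tail(hits_in_set, set_size, p_background)


if __name__ == "__main__":
    # Hypothetical counts: a site found in 8 of 10 coregulated genes,
    # but in only 100 of 1000 reference sequences.
    print(overrepresentation_pvalue(8, 10, 100, 1000))
```

A small p-value suggests that the site occurs more often in the gene set than the reference frequency would predict. When many position weight matrices are tested against the same gene set, the resulting p-values would normally need a multiple-testing correction.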