Nucleic Acids Research, 2000, Vol. 28, No. 8 1808-1818
© 2000 Oxford University Press
Discovering regulatory elements in non-coding sequences by analysis of spaced dyads
Unité de Conformation des Macromolécules Biologiques, Université Libre de Bruxelles, CP 160/16, 50 av. F. D. Roosevelt, B-1050 Bruxelles, Belgium and 1Centro de Investigación sobre Fijación de Nitrógeno, Universidad Nacional Autónoma de México, AP565A, Cuernavaca 62100, Morelos, Mexico
The application of microarray and related technologies is currently generating a systematic catalog of the transcriptional response of any single gene to a multiplicity of experimental conditions. Clustering genes according to the similarity of their transcriptional response provides a direct hint to the regulons of the different transcription factors, many of which have still not been characterized. We have developed a new method for deciphering the mechanism underlying the common transcriptional response of a set of genes, i.e. discovering cis-acting regulatory elements from a set of unaligned upstream sequences. This method, called dyad analysis, is based on the observation that many regulatory sites consist of a pair of highly conserved trinucleotides, spaced by a non-conserved region of fixed width. The approach is to count the number of occurrences of each possible spaced pair of trinucleotides, and to assess its statistical significance. The method is highly efficient in the detection of sites bound by C6 Zn2 binuclear cluster proteins, as well as other transcription factors. In addition, we show that the dyad and single-word analyses are efficient for the detection of regulatory patterns in gene clusters from DNA chip experiments. In combination, these programs should provide a fast and efficient way to discover new regulatory sites for as yet unknown transcription factors.
* To whom correspondence should be addressed. Tel: +32 2 650 20 13; Fax: +32 2 648 89 54; Email: jvanheld@ucmb.ulb.ac.be
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
L. A. Hazelwood, M. C. Walsh, M. A. H. Luttik, P. Daran-Lapujade, J. T. Pronk, and J.-M. Daran Identity of the Growth-Limiting Nutrient Strongly Affects Storage Carbohydrate Accumulation in Anaerobic Chemostat Cultures of Saccharomyces cerevisiae Appl. Envir. Microbiol., November 1, 2009; 75(21): 6876 - 6885. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Defrance and J. van Helden info-gibbs: a motif discovery algorithm that directly optimizes information content during sampling Bioinformatics, October 15, 2009; 25(20): 2715 - 2722. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. A. F. T. van Hijum, M. H. Medema, and O. P. Kuipers Mechanisms and Evolution of Control Logic in Prokaryotic Transcriptional Regulation Microbiol. Mol. Biol. Rev., September 1, 2009; 73(3): 481 - 509. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Narlikar and I. Ovcharenko Identifying regulatory elements in eukaryotic genomes Brief Funct Genomic Proteomic, July 1, 2009; 8(4): 215 - 230. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. D. Yokoyama, U. Ohler, and G. A. Wray Measuring spatial preferences at fine-scale resolution identifies known and novel cis-regulatory element candidates and functional motif-pair relationships Nucleic Acids Res., July 1, 2009; 37(13): e92 - e92. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. G. Roider, T. Manke, S. O'Keeffe, M. Vingron, and S. A. Haas PASTAA: identifying transcription factors associated with sets of co-regulated genes Bioinformatics, February 15, 2009; 25(4): 435 - 442. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Thomas-Chollier, O. Sand, J.-V. Turatsinze, R. Janky, M. Defrance, E. Vervisch, S. Brohee, and J. van Helden RSAT: regulatory sequence analysis tools Nucleic Acids Res., July 1, 2008; 36(suppl_2): W119 - W127. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Hernandez-Lucas, A. L. Gallego-Hernandez, S. Encarnacion, M. Fernandez-Mora, A. G. Martinez-Batallar, H. Salgado, R. Oropeza, and E. Calva The LysR-Type Transcriptional Regulator LeuO Controls Expression of Several Genes in Salmonella enterica Serovar Typhi J. Bacteriol., March 1, 2008; 190(5): 1658 - 1670. [Abstract] [Full Text] [PDF] |
||||
![]() |
C.-Y. Chen, H.-K. Tsai, C.-M. Hsu, M.-J. May Chen, H.-G. Hung, G. T.-W. Huang, and W.-H. Li Discovering gapped binding sites of yeast transcription factors PNAS, February 19, 2008; 105(7): 2527 - 2532. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Wijaya, K. Rajaraman, S.-M. Yiu, and W.-K. Sung Detection of generic spaced motifs using submotif pattern mining Bioinformatics, June 15, 2007; 23(12): 1476 - 1485. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Chakravarty, J. M. Carlson, R. S. Khetani, C. E. DeZiel, and R. H. Gross SPACER: identification of cis-regulatory elements with non-contiguous critical residues Bioinformatics, April 15, 2007; 23(8): 1029 - 1031. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Elnitski, V. X. Jin, P. J. Farnham, and S. J.M. Jones Locating mammalian transcription factor binding sites: A survey of computational and experimental techniques Genome Res., December 1, 2006; 16(12): 1455 - 1464. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. GuhaThakurta Computational identification of transcriptional regulatory elements in DNA sequence Nucleic Acids Res., July 19, 2006; 34(12): 3585 - 3598. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Waleev, D. Shtokalo, T. Konovalova, N. Voss, E. Cheremushkin, P. Stegmaier, O. Kel-Margoulis, E. Wingender, and A. Kel Composite Module Analyst: identification of transcription factor binding site combinations using genetic algorithm. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W541 - W545. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Kel, T. Konovalova, T. Waleev, E. Cheremushkin, O. Kel-Margoulis, and E. Wingender Composite Module Analyst: a fitness-based tool for identification of transcription factor binding site combinations Bioinformatics, May 15, 2006; 22(10): 1190 - 1197. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Abnizova and W. R. Gilks Studying statistical properties of regulatory DNA sequences, and their use in predicting regulatory regions in the eukaryotic genomes Brief Bioinform, March 1, 2006; 7(1): 48 - 54. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Hu, B. Li, and D. Kihara Limitations and potentials of current motif discovery algorithms Nucleic Acids Res., September 2, 2005; 33(15): 4899 - 4913. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. V. Favorov, M. S. Gelfand, A. V. Gerasimova, D. A. Ravcheev, A. A. Mironov, and V. J. Makeev A Gibbs sampler for identification of symmetrically structured, spaced DNA motifs with improved estimation of the signal length Bioinformatics, May 15, 2005; 21(10): 2240 - 2245. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Sumazin, G. Chen, N. Hata, A. D. Smith, T. Zhang, and M. Q. Zhang DWE: Discriminating Word Enumerator Bioinformatics, January 1, 2005; 21(1): 31 - 38. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Kel, S. Reymann, V. Matys, P. Nettesheim, E. Wingender, and J. Borlak A Novel Computational Approach for the Prediction of Networked Transcription Factors of Aryl Hydrocarbon-Receptor-Regulated Genes Mol. Pharmacol., December 1, 2004; 66(6): 1557 - 1572. [Abstract] [Full Text] [PDF] |
||||
![]() |
H.-D. Huang, J.-T. Horng, Y.-M. Sun, A.-P. Tsou, and S.-L. Huang Identifying transcriptional regulatory sites in the human genome using an integrated system Nucleic Acids Res., March 29, 2004; 32(6): 1948 - 1956. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. L. Bulyk, A. M. McGuire, N. Masuda, and G. M. Church A Motif Co-Occurrence Approach for Genome-Wide Prediction of Transcription-Factor-Binding Sites in Escherichia coli Genome Res., February 1, 2004; 14(2): 201 - 208. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. van Helden Regulatory Sequence Analysis Tools Nucleic Acids Res., July 1, 2003; 31(13): 3593 - 3596. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Rombauts, K. Florquin, M. Lescot, K. Marchal, P. Rouze, and Y. Van de Peer Computational Approaches to Identify Promoters and cis-Regulatory Elements in Plant Genomes Plant Physiology, July 1, 2003; 132(3): 1162 - 1176. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. M. Ettwiller, J. Rung, and E. Birney Discovering Novel cis-Regulatory Motifs Using Functional Networks Genome Res., May 1, 2003; 13(5): 883 - 895. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. J. M. Hulzink, H. Weerdesteyn, A. F. Croes, T. Gerats, M. M. A. van Herpen, and J. van Helden In Silico Identification of Putative Regulatory Sequence Elements in the 5'-Untranslated Region of Genes That Are Expressed during Male Gametogenesis Plant Physiology, May 1, 2003; 132(1): 75 - 83. [Abstract] [Full Text] [PDF] |
||||
![]() |
M.Q. ZHANG Prediction, Annotation, and Analysis of Human Promoters Cold Spring Harb Symp Quant Biol, January 1, 2003; 68(0): 217 - 226. [Abstract] [PDF] |
||||
![]() |
S. Sinha and M. Tompa Discovery of novel transcription factor binding sites by statistical overrepresentation Nucleic Acids Res., December 15, 2002; 30(24): 5549 - 5560. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Li, V. Rhodius, C. Gross, and E. D. Siggia Identification of the binding sites of regulatory proteins in bacterial genomes PNAS, September 3, 2002; 99(18): 11772 - 11777. [Abstract] [Full Text] [PDF] |
||||











