Nucleic Acids Research, 2002, Vol. 30, No. 14 3214-3224
© 2002 Oxford University Press
Statistical significance of clusters of motifs represented by position specific scoring matrices in nucleotide sequences
1 Bioinformatics Program and 2 Department of Biomedical Engineering, Boston University, 44 Cummington Street, 3 Department of Biology, Boston University, 5 Cummington Street, Boston MA 02215, USA and 4 National Center for Biotechnology Information, National Library of Medicine, Building 38A, Room 8N805, Bethesda, MD 20894, USA
*To whom correspondence should be addressed at: Bioinformatics Program, Boston University, 44 Cummington Street, Boston, MA 02215, USA. Tel: +1 617 353 3509; Fax: +1 617 353 6766; Email: zhiping{at}bu.edu
The human genome encodes the transcriptional control of its genes in clusters of cis-elements that constitute enhancers, silencers and promoter signals. The sequence motifs of individual cis- elements are usually too short and degenerate for confident detection. In most cases, the requirements for organization of cis-elements within these clusters are poorly understood. Therefore, we have developed a general method to detect local concentrations of cis-element motifs, using predetermined matrix representations of the cis-elements, and calculate the statistical significance of these motif clusters. The statistical significance calculation is highly accurate not only for idealized, pseudorandom DNA, but also for real human DNA. We use our method cluster of motifs E-value tool (COMET) to make novel predictions concerning the regulation of genes by transcription factors associated with muscle. COMET performs comparably with two alternative state-of-the-art techniques, which are more complex and lack E-value calculations. Our statistical method enables us to clarify the major bottleneck in the hard problem of detecting cis-regulatory regions, which is that many known enhancers do not contain very significant clusters of the motif types that we search for. Thus, discovery of additional signals that belong to these regulatory regions will be the key to future progress.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
D. J. Hazelett, D. L. Lakeland, and J. B. Weiss Affinity Density: a novel genomic approach to the identification of transcription factor regulatory targets Bioinformatics, July 1, 2009; 25(13): 1617 - 1624. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Fu, P. Ray, and E. P. Xing DISCOVER: a feature-based discriminative method for motif search in complex genomes Bioinformatics, June 15, 2009; 25(12): i321 - i329. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. V. Loo and P. Marynen Computational methods for the detection of cis-regulatory modules Brief Bioinform, June 4, 2009; (2009) bbp025v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Narlikar and I. Ovcharenko Identifying regulatory elements in eukaryotic genomes Brief Funct Genomic Proteomic, June 4, 2009; (2009) elp014v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Hu, H. Hu, and X. Li MOPAT: a graph-based method to predict recurrent cis-regulatory modules from known motifs Nucleic Acids Res., August 1, 2008; 36(13): 4488 - 4497. [Abstract] [Full Text] [PDF] |
||||
![]() |
K.-J. Won, A. Sandelin, T. T. Marstrand, and A. Krogh Modeling promoter grammars with evolving hidden Markov models Bioinformatics, August 1, 2008; 24(15): 1669 - 1675. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. E Balmer and R. Blomhoff Anecdotes, data and regulatory modules Biol Lett, September 22, 2006; 2(3): 431 - 434. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Chowdhary, S. L. Tan, R. A. Ali, B. Boerlage, L. Wong, and V. B Bajic Dragon Promoter Mapper (DPM): a Bayesian framework for modelling promoter structures Bioinformatics, September 15, 2006; 22(18): 2310 - 2312. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Li, A. S. L. Cheng, V. X. Jin, H. H. Paik, M. Fan, X. Li, W. Zhang, J. Robarge, C. Balch, R. V. Davuluri, et al. A mixture model-based discriminate analysis for identifying ordered transcription factor binding site pairs in gene promoters directly regulated by estrogen receptor-{alpha} Bioinformatics, September 15, 2006; 22(18): 2210 - 2216. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. GuhaThakurta Computational identification of transcriptional regulatory elements in DNA sequence Nucleic Acids Res., July 19, 2006; 34(12): 3585 - 3598. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Yu. Mitrophanov and M. Borodovsky Statistical significance in biological sequence analysis Brief Bioinform, March 1, 2006; 7(1): 2 - 24. |
||||
![]() |
M. Gupta and J. S. Liu De novo cis-regulatory module elicitation for eukaryotic genomes PNAS, May 17, 2005; 102(20): 7079 - 7084. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. GuhaThakurta, L. A. Schriefer, R. H. Waterston, and G. D. Stormo Novel transcription regulatory elements in Caenorhabditis elegans muscle genes Genome Res., December 1, 2004; 14(12): 2457 - 2468. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Thompson, M. J. Palumbo, W. W. Wasserman, J. S. Liu, and C. E. Lawrence Decoding Human Regulatory Circuits Genome Res., October 1, 2004; 14(10a): 1967 - 1974. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Hu, Y. Fu, A. S. Halees, S. M. Kielbasa, and Z. Weng SeqVISTA: a new module of integrated computational tools for studying transcriptional regulation Nucleic Acids Res., July 1, 2004; 32(suppl_2): W235 - W241. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Sharan, A. Ben-Hur, G. G. Loots, and I. Ovcharenko CREME: Cis-Regulatory Module Explorer for the human genome Nucleic Acids Res., July 1, 2004; 32(suppl_2): W253 - W256. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Kreiman Identification of sparsely distributed clusters of cis-regulatory elements in sets of co-expressed genes Nucleic Acids Res., May 20, 2004; 32(9): 2889 - 2900. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. W. Tullai, M. E. Schaffer, S. Mullenbrock, S. Kasif, and G. M. Cooper Identification of Transcription Factor Binding Sites Upstream of Human Genes Regulated by the Phosphatidylinositol 3-Kinase and MEK/ERK Signaling Pathways J. Biol. Chem., May 7, 2004; 279(19): 20167 - 20177. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. C. Frith, Y. Fu, L. Yu, J.-F. Chen, U. Hansen, and Z. Weng Detection of functional DNA motifs via statistical over-representation Nucleic Acids Res., February 26, 2004; 32(4): 1372 - 1381. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. L. Bulyk, A. M. McGuire, N. Masuda, and G. M. Church A Motif Co-Occurrence Approach for Genome-Wide Prediction of Transcription-Factor-Binding Sites in Escherichia coli Genome Res., February 1, 2004; 14(2): 201 - 208. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. C. Frith, M. C. Li, and Z. Weng Cluster-Buster: finding dense clusters of motifs in DNA sequences Nucleic Acids Res., July 1, 2003; 31(13): 3666 - 3668. [Abstract] [Full Text] [PDF] |
||||







