Skip Navigation

This Article
Right arrow Full Text Freely available
Right arrow Print PDF (313K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (38)
Right arrowRequest Permissions
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Frith, M. C.
Right arrow Articles by Weng, Z.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Frith, M. C.
Right arrow Articles by Weng, Z.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Nucleic Acids Research, 2002, Vol. 30, No. 14 3214-3224
© 2002 Oxford University Press

Statistical significance of clusters of motifs represented by position specific scoring matrices in nucleotide sequences

Martin C. Frith1, John L. Spouge4, Ulla Hansen1,3 and Zhiping Weng*,1,2

1 Bioinformatics Program and 2 Department of Biomedical Engineering, Boston University, 44 Cummington Street, 3 Department of Biology, Boston University, 5 Cummington Street, Boston MA 02215, USA and 4 National Center for Biotechnology Information, National Library of Medicine, Building 38A, Room 8N805, Bethesda, MD 20894, USA

*To whom correspondence should be addressed at: Bioinformatics Program, Boston University, 44 Cummington Street, Boston, MA 02215, USA. Tel: +1 617 353 3509; Fax: +1 617 353 6766; Email: zhiping{at}bu.edu

The human genome encodes the transcriptional control of its genes in clusters of cis-elements that constitute enhancers, silencers and promoter signals. The sequence motifs of individual cis- elements are usually too short and degenerate for confident detection. In most cases, the requirements for organization of cis-elements within these clusters are poorly understood. Therefore, we have developed a general method to detect local concentrations of cis-element motifs, using predetermined matrix representations of the cis-elements, and calculate the statistical significance of these motif clusters. The statistical significance calculation is highly accurate not only for idealized, pseudorandom DNA, but also for real human DNA. We use our method ‘cluster of motifs E-value tool’ (COMET) to make novel predictions concerning the regulation of genes by transcription factors associated with muscle. COMET performs comparably with two alternative state-of-the-art techniques, which are more complex and lack E-value calculations. Our statistical method enables us to clarify the major bottleneck in the hard problem of detecting cis-regulatory regions, which is that many known enhancers do not contain very significant clusters of the motif types that we search for. Thus, discovery of additional signals that belong to these regulatory regions will be the key to future progress.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Nucleic Acids ResHome page
J. Hu, H. Hu, and X. Li
MOPAT: a graph-based method to predict recurrent cis-regulatory modules from known motifs
Nucleic Acids Res., August 1, 2008; 36(13): 4488 - 4497.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
K.-J. Won, A. Sandelin, T. T. Marstrand, and A. Krogh
Modeling promoter grammars with evolving hidden Markov models
Bioinformatics, August 1, 2008; 24(15): 1669 - 1675.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
R. Chowdhary, S. L. Tan, R. A. Ali, B. Boerlage, L. Wong, and V. B Bajic
Dragon Promoter Mapper (DPM): a Bayesian framework for modelling promoter structures
Bioinformatics, September 15, 2006; 22(18): 2310 - 2312.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
L. Li, A. S. L. Cheng, V. X. Jin, H. H. Paik, M. Fan, X. Li, W. Zhang, J. Robarge, C. Balch, R. V. Davuluri, et al.
A mixture model-based discriminate analysis for identifying ordered transcription factor binding site pairs in gene promoters directly regulated by estrogen receptor-{alpha}
Bioinformatics, September 15, 2006; 22(18): 2210 - 2216.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
D. GuhaThakurta
Computational identification of transcriptional regulatory elements in DNA sequence
Nucleic Acids Res., July 19, 2006; 34(12): 3585 - 3598.
[Abstract] [Full Text] [PDF]


Home page
Brief BioinformHome page
A. Yu. Mitrophanov and M. Borodovsky
Statistical significance in biological sequence analysis
Brief Bioinform, March 1, 2006; 7(1): 2 - 24.



Home page
Proc. Natl. Acad. Sci. USAHome page
M. Gupta and J. S. Liu
De novo cis-regulatory module elicitation for eukaryotic genomes
PNAS, May 17, 2005; 102(20): 7079 - 7084.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
D. GuhaThakurta, L. A. Schriefer, R. H. Waterston, and G. D. Stormo
Novel transcription regulatory elements in Caenorhabditis elegans muscle genes
Genome Res., December 1, 2004; 14(12): 2457 - 2468.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
W. Thompson, M. J. Palumbo, W. W. Wasserman, J. S. Liu, and C. E. Lawrence
Decoding Human Regulatory Circuits
Genome Res., October 1, 2004; 14(10a): 1967 - 1974.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
Z. Hu, Y. Fu, A. S. Halees, S. M. Kielbasa, and Z. Weng
SeqVISTA: a new module of integrated computational tools for studying transcriptional regulation
Nucleic Acids Res., July 1, 2004; 32(suppl_2): W235 - W241.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
R. Sharan, A. Ben-Hur, G. G. Loots, and I. Ovcharenko
CREME: Cis-Regulatory Module Explorer for the human genome
Nucleic Acids Res., July 1, 2004; 32(suppl_2): W253 - W256.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
G. Kreiman
Identification of sparsely distributed clusters of cis-regulatory elements in sets of co-expressed genes
Nucleic Acids Res., May 20, 2004; 32(9): 2889 - 2900.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
J. W. Tullai, M. E. Schaffer, S. Mullenbrock, S. Kasif, and G. M. Cooper
Identification of Transcription Factor Binding Sites Upstream of Human Genes Regulated by the Phosphatidylinositol 3-Kinase and MEK/ERK Signaling Pathways
J. Biol. Chem., May 7, 2004; 279(19): 20167 - 20177.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
M. C. Frith, Y. Fu, L. Yu, J.-F. Chen, U. Hansen, and Z. Weng
Detection of functional DNA motifs via statistical over-representation
Nucleic Acids Res., February 26, 2004; 32(4): 1372 - 1381.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
M. L. Bulyk, A. M. McGuire, N. Masuda, and G. M. Church
A Motif Co-Occurrence Approach for Genome-Wide Prediction of Transcription-Factor-Binding Sites in Escherichia coli
Genome Res., February 1, 2004; 14(2): 201 - 208.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
M. C. Frith, M. C. Li, and Z. Weng
Cluster-Buster: finding dense clusters of motifs in DNA sequences
Nucleic Acids Res., July 1, 2003; 31(13): 3666 - 3668.
[Abstract] [Full Text] [PDF]



Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.