Skip Navigation

Nucleic Acids Research 2005 33(5):1445-1453; doi:10.1093/nar/gki282
This Article
Right arrow Full Text Freely available
Right arrow Print PDF (238K) Freely available
Right arrow Screen PDF (240K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (28)
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Down, T. A.
Right arrow Articles by Hubbard, T. J. P.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Down, T. A.
Right arrow Articles by Hubbard, T. J. P.
Related Collections
Right arrow Computational methods
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Published online 10 March 2005

© The Author 2005. Published by Oxford University Press. All rights reserved
The online version of this article has been published under an open access model. Users are entitled to use, reproduce, disseminate, or display the open access version of this article for non-commercial purposes provided that: the original authorship is properly and fully attributed; the Journal and Oxford University Press are attributed as the original place of publication with the correct citation details given; if an article is subsequently reproduced or disseminated not in its entirety but only in part or as a derivative work this must be clearly indicated. For commercial re-use, please contact journals.permissions{at}oupjournals.org


Article

NestedMICA: sensitive inference of over-represented motifs in nucleic acid sequence

Thomas A. Down* and Tim J. P. Hubbard

Wellcome Trust Sanger Institute, Hinxton Cambridge CB10 1SA, UK

*To whom correspondence should be addressed. Tel: +44 1223 834244; Fax: +44 1223 494919; Email: td2{at}sanger.ac.uk

Received December 14, 2004. Revised January 24, 2005. Accepted February 15, 2005.

NestedMICA is a new, scalable, pattern-discovery system for finding transcription factor binding sites and similar motifs in biological sequences. Like several previous methods, NestedMICA tackles this problem by optimizing a probabilistic mixture model to fit a set of sequences. However, the use of a newly developed inference strategy called Nested Sampling means NestedMICA is able to find optimal solutions without the need for a problematic initialization or seeding step. We investigate the performance of NestedMICA in a range scenario, on synthetic data and a well-characterized set of muscle regulatory regions, and compare it with the popular MEME program. We show that the new method is significantly more sensitive than MEME: in one case, it successfully extracted a target motif from background sequence four times longer than could be handled by the existing program. It also performs robustly on synthetic sequences containing multiple significant motifs. When tested on a real set of regulatory sequences, NestedMICA produced motifs which were good predictors for all five abundant classes of annotated binding sites.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
BioinformaticsHome page
V. X. Jin, J. Apostolos, N. S. V. R. Nagisetty, and P. J. Farnham
W-ChIPMotifs: a web application tool for de novo motif discovery from ChIP-based high-throughput data
Bioinformatics, December 1, 2009; 25(23): 3191 - 3193.
[Abstract] [Full Text] [PDF]


Home page
DevelopmentHome page
C. Collart, J. M. Ramis, T. A. Down, and J. C. Smith
Smicl is required for phosphorylation of RNA polymerase II and affects 3'-end processing of RNA at the midblastula transition in Xenopus
Development, October 15, 2009; 136(20): 3451 - 3461.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
A. W. Bruce, A. J. Lopez-Contreras, P. Flicek, T. A. Down, P. Dhami, S. C. Dillon, C. M. Koch, C. F. Langford, I. Dunham, R. M. Andrews, et al.
Functional diversity for REST (NRSF) is defined by in vivo binding affinity hierarchies at the DNA sequence level
Genome Res., June 1, 2009; 19(6): 994 - 1005.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
R. H. Morley, K. Lachani, D. Keefe, M. J. Gilchrist, P. Flicek, J. C. Smith, and F. C. Wardle
A gene regulatory network directed by zebrafish No tail accounts for its roles in mesoderm formation
PNAS, March 10, 2009; 106(10): 3829 - 3834.
[Abstract] [Full Text] [PDF]


Home page
Plant CellHome page
A. B. Rose, T. Elfersi, G. Parra, and I. Korf
Promoter-Proximal Introns in Arabidopsis thaliana Are Enriched in Dispersed Signals that Elevate Gene Expression
PLANT CELL, March 1, 2008; 20(3): 543 - 551.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
V. X. Jin, H. O'Geen, S. Iyengar, R. Green, and P. J. Farnham
Identification of an OCT4 and SRY regulatory module using integrated computational and experimental genomics approaches
Genome Res., June 1, 2007; 17(6): 807 - 817.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
E. S. Jackson and W. J. Fitzgerald
A sequential Monte Carlo EM approach to the transcription factor binding site identification problem
Bioinformatics, June 1, 2007; 23(11): 1313 - 1320.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
I. J. Donaldson and B. Gottgens
CoMoDis: composite motif discovery in mammalian genomes
Nucleic Acids Res., January 12, 2007; 35(1): e1 - e1.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
C.-H. Peng, J.-T. Hsu, Y.-S. Chung, Y.-J. Lin, W.-Y. Chow, D. F. Hsu, and C. Y. Tang
Identification of degenerate motifs using position restricted selection and hybrid ranking combination
Nucleic Acids Res., December 2, 2006; 34(22): 6379 - 6391.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
L. Elnitski, V. X. Jin, P. J. Farnham, and S. J.M. Jones
Locating mammalian transcription factor binding sites: A survey of computational and experimental techniques
Genome Res., December 1, 2006; 16(12): 1455 - 1464.
[Abstract] [Full Text] [PDF]


Home page
Brief BioinformHome page
I. Abnizova and W. R. Gilks
Studying statistical properties of regulatory DNA sequences, and their use in predicting regulatory regions in the eukaryotic genomes
Brief Bioinform, March 1, 2006; 7(1): 48 - 54.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.