Skip Navigation


Nucleic Acids Research Advance Access originally published online on November 27, 2006
Nucleic Acids Research 2006 34(22):6379-6391; doi:10.1093/nar/gkl658
This Article
Right arrow Full Text Freely available
Right arrow Print PDF (1046K) Freely available
Right arrow Screen PDF (500K) Freely available
Right arrow Supplementary Data
Right arrowOA All Versions of this Article:
34/22/6379    most recent
gkl658v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (1)
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Peng, C.-H.
Right arrow Articles by Tang, C. Y.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Peng, C.-H.
Right arrow Articles by Tang, C. Y.
Related Collections
Right arrow Computational methods
Right arrow Genomics
Right arrow Monitoring gene expression
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Nucleic Acids Research, 2006, Vol. 34, No. 22 6379-6391
© 2006 The Author(s).
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.


Computational Biology

Identification of degenerate motifs using position restricted selection and hybrid ranking combination

Chien-Hua Peng, Jeh-Ting Hsu, Yun-Sheng Chung, Yen-Jen Lin, Wei-Yuan Chow1, D. Frank Hsu2,3 and Chuan Yi Tang*

Department of Computer Science, Institute of Molecular and Cellular Biology, National Tsing Hua University Taiwan 300, Republic of China 1 Department of Life Science, Institute of Molecular and Cellular Biology, National Tsing Hua University Taiwan 300, Republic of China 2 Department of Computer and Information Science, Fordham University New York, NY 10023, USA 3 DIMACS Center, Rutgers University 96 Frelinghuysen Road, Piscataway, NJ 08854-8018, USA

*To whom correspondence should be addressed. Tel: +886 3 5731077; Fax: +886 3 5723694; Email: cytang{at}cs.nthu.edu.tw

Received July 5, 2006. Revised August 28, 2006. Accepted August 28, 2006.

The identification of regulatory elements recognized by transcription factors and chromatin remodeling factors is essential to studying the regulation of gene expression. When no auxiliary data, such as orthologous sequences or expression profiles, are used, the accuracy of most tools for motif discovery is strongly influenced by the motif degeneracy and the lengths of sequence. Since suitable auxiliary data may not always be available, more work must be conducted to enhance tool performance to identify transcription elements in the metazoan. A non-alignment-based algorithm, MotifSeeker, is proposed to enhance the accuracy of discovering degenerate motifs. MotifSeeker utilizes the property that variable sites of transcription elements are usually position-specific to reduce exposure to noise. Consequently, the efficiency and accuracy of motif identification are improved. Using data fusion, the ranking process integrates two measures of motif significance, resulting in a more robust significance measure. Testing results for the synthetic data reveal that the accuracy of MotifSeeker is less sensitive to the motif degeneracy and the length of input sequences. Furthermore, MotifSeeker has been tested on a well-known benchmark [M. Tompa, N. Li, T.L. Bailey, G.M. Church, B. De Moor, E. Eskin, A.V. Favorov, M.C. Frith, Y. Fu, W.J. Kent, et al. (2005) Nat. Biotechnol., 23, 137–144], yielding a correlation coefficient of 0.262, which compares favorably with those of other tools. The high applicability of MotifSeeker to biological data is further demonstrated experimentally on regulons of Saccharomyces cerevisiae and liver-specific genes with experimentally verified regulatory elements.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
BioinformaticsHome page
E. Wijaya, K. Rajaraman, S.-M. Yiu, and W.-K. Sung
Detection of generic spaced motifs using submotif pattern mining
Bioinformatics, June 15, 2007; 23(12): 1476 - 1485.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.