Skip Navigation


Nucleic Acids Research Advance Access originally published online on April 21, 2009
Nucleic Acids Research 2009 37(10):e72; doi:10.1093/nar/gkp248
This Article
Right arrow Full Text Freely available
Right arrow Print PDF (1751K) Freely available
Right arrow Screen PDF (918K) Freely available
Right arrow Supplementary Data
Right arrowOA All Versions of this Article:
37/10/e72    most recent
gkp248v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Zhang, S.
Right arrow Articles by Su, Z.
PubMed
Right arrow PubMed Citation
Right arrow Articles by Zhang, S.
Right arrow Articles by Su, Z.
Related Collections
Right arrow Computational methods
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Nucleic Acids Research, 2009, Vol. 37, No. 10 e72
© 2009
The Author(s) This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.


Methods Online

Genome-wide de novo prediction of cis-regulatory binding sites in prokaryotes

Shaoqiang Zhang1,2, Minli Xu1, Shan Li1 and Zhengchang Su1,*

1Department of Bioinformatics and Genomics, Bioinformatics Research Center, the University of North Carolina at Charlotte, Charlotte, NC 28223, USA and 2College of Computer and Information Engineering, Tianjin Normal University, Tianjin 300387, China

*To whom correspondence should be addressed. Tel: +1 704 678 7996; Fax: +1 704 678 6610; Email: zcsu{at}uncc.edu

Received November 25, 2008. Revised March 16, 2009. Accepted April 2, 2009.

Although cis-regulatory binding sites (CRBSs) are at least as important as the coding sequences in a genome, our general understanding of them in most sequenced genomes is very limited due to the lack of efficient and accurate experimental and computational methods for their characterization, which has largely hindered our understanding of many important biological processes. In this article, we describe a novel algorithm for genome-wide de novo prediction of CRBSs with high accuracy. We designed our algorithm to circumvent three identified difficulties for CRBS prediction using comparative genomics principles based on a new method for the selection of reference genomes, a new metric for measuring the similarity of CRBSs, and a new graph clustering procedure. When operon structures are correctly predicted, our algorithm can predict 81% of known individual binding sites belonging to 94% of known cis-regulatory motifs in the Escherichia coli K12 genome, while achieving high prediction specificity. Our algorithm has also achieved similar prediction accuracy in the Bacillus subtilis genome, suggesting that it is very robust, and thus can be applied to any other sequenced prokaryotic genome. When compared with the prior state-of-the-art algorithms, our algorithm outperforms them in both prediction sensitivity and specificity.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?




Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.