Skip Navigation

This Article
Right arrow Full Text Freely available
Right arrow Print PDF (225K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (26)
Right arrowRequest Permissions
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Hannenhalli, S. S.
Right arrow Articles by Fickett, J. W.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Hannenhalli, S. S.
Right arrow Articles by Fickett, J. W.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Nucleic Acids Research, Vol 27, Issue 17 3577-3582, Copyright © 1999 by Oxford University Press


ARTICLES

Bacterial start site prediction

SS Hannenhalli, WS Hayes, AG Hatzigeorgiou and JW Fickett
Bioinformatics, SmithKline Beecham Pharmaceuticals, 709 Swedeland Road, PO Box 1539, King of Prussia, PA 19406, USA. hannes00@mh.vs.sbphrd.com

With the growing number of completely sequenced bacterial genes, accurate gene prediction in bacterial genomes remains an important problem. Although the existing tools predict genes in bacterial genomes with high overall accuracy, their ability to pinpoint the translation start site remains unsatisfactory. In this paper, we present a novel approach to bacterial start site prediction that takes into account multiple features of a potential start site, viz., ribosome binding site (RBS) binding energy, distance of the RBS from the start codon, distance from the beginning of the maximal ORF to the start codon, the start codon itself and the coding/non-coding potential around the start site. Mixed integer programing was used to optimize the discriminatory system. The accuracy of this approach is up to 90%, compared to 70%, using the most common tools in fully automated mode (that is, without expert human post-processing of results). The approach is evaluated using Bacillus subtilis, Escherichia coli and Pyrococcus furiosus. These three genomes cover a broad spectrum of bacterial genomes, since B.subtilis is a Gram-positive bacterium, E.coli is a Gram-negative bacterium and P. furiosus is an archaebacterium. A significant problem is generating a set of 'true' start sites for algorithm training, in the absence of experimental work. We found that sequence conservation between P. furiosus and the related Pyrococcus horikoshii clearly delimited the gene start in many cases, providing a sufficient training set.
Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Biophys. JHome page
P. Singhal, B. Jayaram, S. B. Dixit, and D. L. Beveridge
Prokaryotic Gene Finding Based on Physicochemical Characteristics of Codons Calculated from Molecular Dynamics Simulations
Biophys. J., June 1, 2008; 94(11): 4173 - 4183.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
F.-B. Guo, H.-Y. Ou, and C.-T. Zhang
ZCURVE: a new system for recognizing protein-coding genes in bacterial and archaeal genomes
Nucleic Acids Res., March 15, 2003; 31(6): 1780 - 1789.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
J. A. Thanassi, S. L. Hartman-Neumann, T. J. Dougherty, B. A. Dougherty, and M. J. Pucci
Identification of 113 conserved essential genes using a high-throughput gene disruption system in Streptococcus pneumoniae
Nucleic Acids Res., July 15, 2002; 30(14): 3152 - 3162.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
M. Walker, V. Pavlovic, and S. Kasif
A comparative genomic method for computational identification of prokaryotic translation initiation sites
Nucleic Acids Res., July 15, 2002; 30(14): 3181 - 3191.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
T. Shibuya and I. Rigoutsos
Dictionary-driven prokaryotic gene finding
Nucleic Acids Res., June 15, 2002; 30(12): 2710 - 2725.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
A. Ekins and D. F. Niven
Identification of fur and fldA Homologs and a Pasteurella multocida tbpA Homolog in Histophilus ovis and Effects of Iron Availability on Their Transcription
J. Bacteriol., May 1, 2002; 184(9): 2539 - 2542.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
J. Besemer, A. Lomsadze, and M. Borodovsky
GeneMarkS: a self-training method for prediction of gene starts in microbial genomes. Implications for finding sequence motifs in regulatory regions
Nucleic Acids Res., June 15, 2001; 29(12): 2607 - 2618.
[Abstract] [Full Text] [PDF]


Home page
Appl. Environ. Microbiol.Home page
S. Fujimoto and Y. Ike
pAM401-Based Shuttle Vectors That Enable Overexpression of Promoterless Genes and One-Step Purification of Tag Fusion Proteins Directly from Enterococcus faecalis
Appl. Envir. Microbiol., March 1, 2001; 67(3): 1262 - 1267.
[Abstract] [Full Text]



Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.