Skip Navigation

This Article
Right arrow Full Text Freely available
Right arrow Print PDF (296K) Freely available
Right arrow Supplementary Material
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (21)
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Solovyev, V.
Right arrow Articles by Shahmuradov, I.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Solovyev, V.
Right arrow Articles by Shahmuradov, I.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Nucleic Acids Research, 2003, Vol. 31, No. 13 3540-3545
© 2003 Oxford University Press

PromH: promoters identification using orthologous genomic sequences

V. V. Solovyev* and I. A. Shahmuradov1

Softberry Inc., 116 Radio Circle, Suite 400, Mount Kisco, NY 10549, USA 1 Institute of Botany, Azerbaijan National Academy of Sciences, 370073 Baku, Azerbaijan

*To whom correspondence should be addressed. Tel: +1 914 242 3592; Fax: +1 914 242 3593; Email: victor{at}softberry.com
Present address: I. A. Shahmuradov, Royal Holloway, University of London, Egham, Surrey TW20 0EX, UK

Accurate prediction of promoters is fundamental for understanding gene expression patterns, cell specificity and development. In the studies of conserved features of regulatory regions of orthologous genes, it was observed that major promoter functional components such as transcription start points, TATA-boxes and regulatory motifs, are significantly more conservative than the sequences around them (70–100% compared with 30–50%). To improve promoter identification accuracy, we employed these findings in a new program, PromH, created by extending the TSSW program feature set. PromH uses linear discriminant functions that take into account conservation features and nucleotide sequences of promoter regions in pairs of orthologous genes. The program was tested on two sets of pairs of orthologous, mostly human and rodent, sequences with known transcription start sites (TSS), annotated to have TATA (21 genes, 11 orthologous pairs) and TATA-less (38 genes, 19 pairs) promoters, respectively. The program correctly predicted TSS for all 21 genes of the first set with a median deviation of 2 bp from true site location. Only for two genes, was there significant (46 and 105 bp) discrepancy between predicted and annotated TSS positions. For 38 TATA-less promoters from the second set, TSS was predicted for 27 genes, in 14 cases within 10 bp distance from annotated TSS, and in 21 cases—within 100 bp distance. Despite more discrepancies between predicted and annotated TSS for genes from the second set, these results are consistent with observations of much higher occurrence of multiple TSS in TATA-less promoters. In any case, our results show that PromH identifies TSS positions significantly more accurately than any other published promoter prediction method. The PromH program is available at http://www.softberry.com/berry.phtml?topic=promh.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Nucleic Acids ResHome page
A. Perez, F. Lankas, F. J. Luque, and M. Orozco
Towards a molecular dynamics consensus view of B-DNA flexibility
Nucleic Acids Res., April 1, 2008; 36(7): 2379 - 2394.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
T. Abeel, Y. Saeys, E. Bonnet, P. Rouze, and Y. Van de Peer
Generic eukaryotic core promoter prediction using structural features of DNA
Genome Res., February 1, 2008; 18(2): 310 - 323.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
X. Xie, S. Wu, K.-M. Lam, and H. Yan
PromoterExplorer: an effective promoter identification method based on the AdaBoost algorithm
Bioinformatics, November 15, 2006; 22(22): 2722 - 2728.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
O. V. Vishnevsky and N. A. Kolchanov
ARGO: a web system for the detection of degenerate motifs and large-scale recognition of eukaryotic promoters
Nucleic Acids Res., July 1, 2005; 33(suppl_2): W417 - W422.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
S. Burden, Y.-X. Lin, and R. Zhang
Improving promoter prediction Improving promoter prediction for the NNPP2.2 algorithm: a case study using Escherichia coli DNA sequences
Bioinformatics, March 1, 2005; 21(5): 601 - 607.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
I. A. Shahmuradov, V. V. Solovyev, and A. J. Gammerman
Plant promoter prediction with confidence estimation
Nucleic Acids Res., February 18, 2005; 33(3): 1069 - 1076.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
E. Barta, E. Sebestyen, T. B. Palfy, G. Toth, C. P. Ortutay, and L. Patthy
DoOP: Databases of Orthologous Promoters, collections of clusters of orthologous upstream sequences from chordates and plants
Nucleic Acids Res., January 1, 2005; 33(suppl_1): D86 - D90.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
J. W. Beaber and M. K. Waldor
Identification of Operators and Promoters That Control SXT Conjugative Transfer
J. Bacteriol., September 1, 2004; 186(17): 5945 - 5949.
[Abstract] [Full Text] [PDF]



Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.