Skip Navigation

Nucleic Acids Research 2004 32(17):5059-5065; doi:10.1093/nar/gkh836
This Article
Right arrow Full Text Freely available
Right arrow Print PDF (269K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (4)
Right arrowRequest Permissions
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Wu, K.-P.
Right arrow Articles by Hsu, W.-L.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Wu, K.-P.
Right arrow Articles by Hsu, W.-L.
Related Collections
Right arrow Computational methods
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Published online 24 September 2004

Nucleic Acids Research, Vol. 32 No. 17 © Oxford University Press 2004; all rights reserved

HYPROSP: a hybrid protein secondary structure prediction algorithm—a knowledge-based approach

Kuen-Pin Wu, Hsin-Nan Lin, Jia-Ming Chang, Ting-Yi Sung and Wen-Lian Hsu*

Institute of Information Science, Academia Sinica, Taipei, Taiwan

* To whom correspondence should be addressed. Tel: +886 2 27883799 ext. 1804; Fax: +886 2 27824814; Email: hsu{at}iis.sinica.edu.tw
The authors wish it to be known that, in their opinion, the first two authors should be regarded as joint First Authors

Received June 18, 2004; Revised August 4, 2004; Accepted September 2, 2004

We develop a knowledge-based approach (called PROSP) for protein secondary structure prediction. The knowledge base contains small peptide fragments together with their secondary structural information. A quantitative measure M, called match rate, is defined to measure the amount of structural information that a target protein can extract from the knowledge base. Our experimental results show that proteins with a higher match rate will likely be predicted more accurately based on PROSP. That is, there is roughly a monotone correlation between the prediction accuracy and the amount of structure matching with the knowledge base. To fully utilize the strength of our knowledge base, a hybrid prediction method is proposed as follows: if the match rate of a target protein is at least 80%, we use the extracted information to make the prediction; otherwise, we adopt a popular machine-learning approach. This comprises our hybrid protein structure prediction (HYPROSP) approach. We use the DSSP and EVA data as our datasets and PSIPRED as our underlying machine-learning algorithm. For target proteins with match rate at least 80%, the average Q3 of PROSP is 3.96 and 7.2 better than that of PSIPRED on DSSP and EVA data, respectively.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
BioinformaticsHome page
H.-N. Lin, J.-M. Chang, K.-P. Wu, T.-Y. Sung, and W.-L. Hsu
HYPROSP II-A knowledge-based hybrid method for protein secondary structure prediction based on local prediction confidence
Bioinformatics, August 1, 2005; 21(15): 3227 - 3233.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.