Skip Navigation

Nucleic Acids Research 2005 33(16):5343-5353; doi:10.1093/nar/gki842
This Article
Right arrow Full Text Freely available
Right arrow Print PDF (322K) Freely available
Right arrow Screen PDF (189K) Freely available
Right arrow Supplementary Material
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (2)
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Kruus, E.
Right arrow Articles by Wingreen, N. S.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Kruus, E.
Right arrow Articles by Wingreen, N. S.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Published online 20 September 2005

© The Author 2005. Published by Oxford University Press. All rights reserved
The online version of this article has been published under an open access model. Users are entitled to use, reproduce, disseminate, or display the open access version of this article for non-commercial purposes provided that: the original authorship is properly and fully attributed; the Journal and Oxford University Press are attributed as the original place of publication with the correct citation details given; if an article is subsequently reproduced or disseminated not in its entirety but only in part or as a derivative work this must be clearly indicated. For commercial re-use, please contact journals.permissions{at}oxfordjournals.org


Article

Gibbs sampling and helix-cap motifs

Erik Kruus1,*, Peter Thumfort1, Chao Tang1,2,3 and Ned S. Wingreen1,4

1NEC Laboratories America, Inc. 4 Independence Way, Princeton, NJ 08544, USA 2Department of Biopharmaceutical Sciences, California Institute for Quantitative Biomedical Research UCSF Box 2540 University of California San Francisco San Francisco, CA 94143-2540, USA 3Department of Biochemistry and Biophysics, California Institute for Quantitative Biomedical Research UCSF Box 2540 University of California San Francisco San Francisco, CA 94143-2540, USA 4Department of Molecular Biology, Princeton University Princeton, NJ 08544-1014, USA

*To whom correspondence should be addressed. Tel: +1 609 951 2628; Fax: +1 609 951 2482; Email: kruus{at}nec-labs.com

Received April 13, 2005. Revised August 8, 2005. Accepted August 30, 2005.

Protein backbones have characteristic secondary structures, including {alpha}-helices and ß-sheets. Which structure is adopted locally is strongly biased by the local amino acid sequence of the protein. Accurate (probabilistic) mappings from sequence to structure are valuable for both secondary-structure prediction and protein design. For the case of {alpha}-helix caps, we test whether the information content of the sequence–structure mapping can be self-consistently improved by using a relaxed definition of the structure. We derive helix-cap sequence motifs using database helix assignments for proteins of known structure. These motifs are refined using Gibbs sampling in competition with a null motif. Then Gibbs sampling is repeated, allowing for frameshifts of ±1 amino acid residue, in order to find sequence motifs of higher total information content. All helix-cap motifs were found to have good generalization capability, as judged by training on a small set of non-redundant proteins and testing on a larger set. For overall prediction purposes, frameshift motifs using all training examples yielded the best results. Frameshift motifs using a fraction of all training examples performed best in terms of true positives among top predictions. However, motifs without frameshifts also performed well, despite a roughly one-third lower total information content.


Correspondence may also be addressed to Ned S. Wingreen. Tel: +1 609 258 8476; Fax: +1 609 258 8616; Email: wingreen{at}molbio.princeton.edu


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?




Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.