Nucleic Acids Research Advance Access originally published online on November 27, 2006
Nucleic Acids Research 2006 34(22):6540-6548; doi:10.1093/nar/gkl901
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Nucleic Acids Research, 2006, Vol. 34, No. 22 6540-6548
© 2006 The Author(s).
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
Computational Biology |
Sequence comparison by sequence harmony identifies subtype-specific functional sites
Centre for Integrative Bioinformatics VU (IBIVU), Vrije Universiteit De Boelelaan 1081A, 1081 HV Amsterdam, The Netherlands
*To whom correspondence should be addressed. Tel: +31 20 59 87649; Fax: +31 20 59 87653; Email: heringa{at}few.vu.nl
Received July 10, 2006. Revised October 12, 2006. Accepted October 13, 2006.
Multiple sequence alignments are often used to reveal functionally important residues within a protein family. They can be particularly useful for the identification of key residues that determine functional differences between protein subfamilies. We present a new entropy-based method, Sequence Harmony (SH) that accurately detects subfamily-specific positions from a multiple sequence alignment. The SH algorithm implements a novel formula, able to score compositional differences between subfamilies, without imposing conservation, in a simple manner on an intuitive scale. We compare our method with the most important published methods, i.e. AMAS, TreeDet and SDP-pred, using three well-studied protein families: the receptor-binding domain (MH2) of the Smad family of transcription factors, the Ras-superfamily of small GTPases and the MIP-family of integral membrane transporters. We demonstrate that SH accurately selects known functional sites with higher coverage than the other methods for these test-cases. This shows that compositional differences between protein subfamilies provide sufficient basis for identification of functional sites. In addition, SH selects a number of sites of unknown function that could be interesting candidates for further experimental investigation.
The authors wish it to be known that, in their opinion, the first two authors should be regarded as joint First Authors
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
J. E. Donald and E. I. Shakhnovich SDR: a database of predicted specificity-determining residues in proteins Nucleic Acids Res., January 1, 2009; 37(suppl_1): D191 - D194. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Sankararaman and K. Sjolander INTREPID--INformation-theoretic TREe traversal for Protein functional site IDentification Bioinformatics, November 1, 2008; 24(21): 2445 - 2452. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. A. Capra and M. Singh Characterization and prediction of residues determining protein functional specificity Bioinformatics, July 1, 2008; 24(13): 1473 - 1480. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. J. Paukstelis and A. M. Lambowitz Identification and evolution of fungal mitochondrial tyrosyl-tRNA synthetases with group I intron splicing activity PNAS, April 22, 2008; 105(16): 6010 - 6015. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Ye, G. Vriend, and A. P. IJzerman Tracing evolutionary pressure Bioinformatics, April 1, 2008; 24(7): 908 - 915. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Ye, K. Anton Feenstra, J. Heringa, A. P. IJzerman, and E. Marchiori Multi-RELIEF: a method to recognize specificity determining residues from multiple sequence alignments using a Machine-Learning approach for feature weighting Bioinformatics, January 1, 2008; 24(1): 18 - 25. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. A. Feenstra, W. Pirovano, K. Krab, and J. Heringa Sequence harmony: detecting functional specificity from alignments Nucleic Acids Res., July 13, 2007; 35(suppl_2): W495 - W498. [Abstract] [Full Text] [PDF] |
||||


