Published online 17 May 2006
Article |
Refining multiple sequence alignments with conserved core regions
National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health Bethesda, MD, 20894, USA
*To whom correspondence should be addressed. Tel: 001 301 435 7792; Fax: 001 301 480 9241; E-mail: bryant{at}ncbi.nlm.nih.gov
Received January 11, 2006. Revised February 19, 2006. Accepted April 3, 2006.
Accurate multiple sequence alignments of proteins are very important to several areas of computational biology and provide an understanding of phylogenetic history of domain families, their identification and classification. This article presents a new algorithm, REFINER, that refines a multiple sequence alignment by iterative realignment of its individual sequences with the predetermined conserved core (block) model of a protein family. Realignment of each sequence can correct misalignments between a given sequence and the rest of the profile and at the same time preserves the family's overall block model. Large-scale benchmarking studies showed a noticeable improvement of alignment after refinement. This can be inferred from the increased alignment score and enhanced sensitivity for database searching using the sequence profiles derived from refined alignments compared with the original alignments. A standalone version of the program is available by ftp distribution (ftp://ftp.ncbi.nih.gov/pub/REFINER) and will be incorporated into the next release of the Cn3D structure/alignment viewer.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
K. Katoh and H. Toh Recent developments in the MAFFT multiple sequence alignment program Brief Bioinform, July 1, 2008; 9(4): 286 - 298. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Chakrabarti and C. J. Lanczycki Analysis and prediction of functionally important sites in proteins Protein Sci., January 1, 2007; 16(1): 4 - 13. [Abstract] [Full Text] [PDF] |
||||

