Published online 7 February 2005
Homology-extended sequence alignment
1 Bioinformatics Section, Faculty of Sciences, Vrije Universiteit De Boelelaan 1081A, 1081 HV, Amsterdam, The Netherlands 2 Centre for Integrative Bioinformatics VU (IBIVU), Faculty of Sciences and Faculty of Earth and Life Sciences, Vrije Universiteit De Boelelaan 1081A, 1081 HV, Amsterdam, The Netherlands
*To whom correspondence should be addressed. Tel: +31 0 20 598 7649; Fax: +31 0 20 598 7653; Email: heringa{at}cs.vu.nl
Received November 26, 2004. Revised January 5, 2005. Accepted January 20, 2005.
We present a profile-profile multiple alignment strategy that uses database searching to collect homologues for each sequence in a given set, in order to enrich their available evolutionary information for the alignment. For each of the alignment sequences, the putative homologous sequences that score above a pre-defined threshold are incorporated into a position-specific pre-alignment profile. The enriched position-specific profile is used for standard progressive alignment, thereby more accurately describing the characteristic features of the given sequence set. We show that owing to the incorporation of the pre-alignment information into a standard progressive multiple alignment routine, the alignment quality between distant sequences increases significantly and outperforms state-of-the-art methods, such as T-COFFEE and MUSCLE. We also show that although entirely sequence-based, our novel strategy is better at aligning distant sequences when compared with a recent contact-based alignment method. Therefore, our pre-alignment profile strategy should be advantageous for applications that rely on high alignment accuracy such as local structure prediction, comparative modelling and threading.
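As a rough illustration of the pre-alignment step described above, the following Python sketch builds a position-specific frequency profile for one query sequence from its database hits. It is not the authors' implementation: the function name `build_profile`, the `min_score` parameter, the input format (hits already aligned to the query, paired with a score) and the plain residue-frequency representation are all assumptions made for the example.

```python
from collections import Counter


def build_profile(query, hits, min_score):
    """Build a per-column residue-frequency profile for `query`.

    Hypothetical input format (an assumption, not from the paper):
    `hits` is a list of (aligned_sequence, score) pairs, where each
    aligned sequence has the same length as `query` and uses '-' for gaps.
    """
    # Keep the query itself plus every hit scoring at or above the threshold.
    rows = [query] + [seq for seq, score in hits if score >= min_score]
    for seq in rows:
        if len(seq) != len(query):
            raise ValueError("every hit must be aligned to the query length")

    profile = []
    for col in range(len(query)):
        # Count residues in this column, ignoring gap characters.
        counts = Counter(row[col] for row in rows if row[col] != "-")
        total = sum(counts.values())
        # An all-gap column yields an empty dict.
        profile.append({res: n / total for res, n in counts.items()} if total else {})
    return profile


# Toy example: two hits pass the threshold, one is discarded.
query = "ACD"
hits = [("ACE", 50.0), ("A-D", 40.0), ("GGG", 5.0)]
profile = build_profile(query, hits, min_score=30.0)
```

In a homology-extended scheme of this kind, each input sequence would be replaced by such a profile before the progressive alignment stage, so that pairs of profiles rather than pairs of single sequences are compared. The scoring of profile columns against each other is left out here because the abstract does not specify it.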





