Skip Navigation

Nucleic Acids Research 2005 33(22):7120-7128; doi:10.1093/nar/gki1020
This Article
Right arrow Full Text Freely available
Right arrow Print PDF (1357K) Freely available
Right arrow Screen PDF (311K) Freely available
Right arrow Supplementary Material
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (23)
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Lassmann, T.
Right arrow Articles by Sonnhammer, E. L. L.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Lassmann, T.
Right arrow Articles by Sonnhammer, E. L. L.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Published online 16 December 2005

© The Author 2005. Published by Oxford University Press. All rights reserved
The online version of this article has been published under an open access model. Users are entitled to use, reproduce, disseminate, or display the open access version of this article for non-commercial purposes provided that: the original authorship is properly and fully attributed; the Journal and Oxford University Press are attributed as the original place of publication with the correct citation details given; if an article is subsequently reproduced or disseminated not in its entirety but only in part or as a derivative work this must be clearly indicated. For commercial re-use, please contact journals.permissions{at}oxfordjournals.org


Article

Automatic assessment of alignment quality

Timo Lassmann* and Erik L. L. Sonnhammer

Center for Genomics and Bioinformatics, Karolinska Institutet S-17177 Stockholm, Sweden

*To whom correspondence should be addressed. Tel: +46 8 5248 6372; Fax: +46 8 337983; Email: timo.lassmann{at}cgb.ki.se

Received September 8, 2005. Revised October 21, 2005. Accepted November 30, 2005.

Multiple sequence alignments play a central role in the annotation of novel genomes. Given the biological and computational complexity of this task, the automatic generation of high-quality alignments remains challenging. Since multiple alignments are usually employed at the very start of data analysis pipelines, it is crucial to ensure high alignment quality. We describe a simple, yet elegant, solution to assess the biological accuracy of alignments automatically. Our approach is based on the comparison of several alignments of the same sequences. We introduce two functions to compare alignments: the average overlap score and the multiple overlap score. The former identifies difficult alignment cases by expressing the similarity among several alignments, while the latter estimates the biological correctness of individual alignments. We implemented both functions in the MUMSA program and demonstrate the overall robustness and accuracy of both functions on three large benchmark sets.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Syst BiolHome page
R. C. Thomson and H. B. Shaffer
Sparse Supermatrices for Phylogenetic Inference: Taxonomy, Alignment, Rogue Taxa, and the Phylogeny of Living Turtles
Syst Biol, November 11, 2009; (2009) syp075v1.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
C. Kemena and C. Notredame
Upcoming challenges for multiple sequence alignment methods in the high-throughput era
Bioinformatics, October 1, 2009; 25(19): 2455 - 2465.
[Abstract] [Full Text] [PDF]


Home page
Syst BiolHome page
B. Misof and K. Misof
A Monte Carlo Approach Successfully Identifies Randomness in Multiple Sequence Alignments: A More Objective Means of Data Exclusion
Syst Biol, May 20, 2009; (2009) syp006v1.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
T. Lassmann, O. Frings, and E. L. L. Sonnhammer
Kalign2: high-performance multiple alignment of protein and nucleotide sequences allowing external features
Nucleic Acids Res., February 1, 2009; 37(3): 858 - 865.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
V. Ahola, T. Aittokallio, M. Vihinen, and E. Uusipaikka
Model-based prediction of sequence alignment quality
Bioinformatics, October 1, 2008; 24(19): 2165 - 2171.
[Abstract] [Full Text] [PDF]


Home page
MicrobiologyHome page
A.-C. M. Toes, M. H. Daleke, J. G. Kuenen, and G. Muyzer
Expression of copA and cusA in Shewanella during copper stress
Microbiology, September 1, 2008; 154(9): 2709 - 2718.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
B. G. Hall
How Well Does the HoT Score Reflect Sequence Alignment Accuracy?
Mol. Biol. Evol., August 1, 2008; 25(8): 1576 - 1580.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
B. G. Hall
Simulating DNA Coding Sequence Evolution with EvolveAGene 3
Mol. Biol. Evol., April 1, 2008; 25(4): 688 - 695.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
G. Lunter, A. Rocco, N. Mimouni, A. Heger, A. Caldeira, and J. Hein
Uncertainty in homology inferences: Assessing and improving genomic sequence alignment
Genome Res., February 1, 2008; 18(2): 298 - 309.
[Abstract] [Full Text] [PDF]


Home page
Syst BiolHome page
E. Benavides, R. Baum, D. McClellan, and J. W. Sites
Molecular Phylogenetics of the Lizard Genus Microlophus (Squamata:Tropiduridae): Aligning and Retrieving Indel Signal from Nuclear Introns
Syst Biol, October 1, 2007; 56(5): 776 - 797.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
H. Carroll, W. Beckstead, T. O'Connor, M. Ebbert, M. Clement, Q. Snell, and D. McClellan
DNA reference alignment benchmarks based on tertiary structure of encoded proteins
Bioinformatics, October 1, 2007; 23(19): 2648 - 2649.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
S. Moretti, F. Armougom, I. M. Wallace, D. G. Higgins, C. V. Jongeneel, and C. Notredame
The M-Coffee web server: a meta-method for computing multiple sequence alignments by combining alternative alignment methods
Nucleic Acids Res., July 13, 2007; 35(suppl_2): W645 - W648.
[Abstract] [Full Text] [PDF]


Home page
Syst BiolHome page
M. M. McMahon and M. J. Sanderson
Phylogenetic Supermatrix Analysis of GenBank Sequences from 2228 Papilionoid Legumes
Syst Biol, October 1, 2006; 55(5): 818 - 836.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
T. Lassmann and E. L. L. Sonnhammer
Kalign, Kalignvu and Mumsa: web servers for multiple sequence alignment.
Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W596 - W599.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.