| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Published online 19 March 2004
Nucleic Acids Research, 2004, Vol. 32, No. 5 1792-1797
Oxford University Press
MUSCLE: multiple sequence alignment with high accuracy and high throughput
195 Roque Moraes Drive, Mill Valley, CA 94941, USA
*Email: bob{at}drive5.com
Received January 19, 2004; Revised January 30, 2004; Accepted February 24, 2004
We describe MUSCLE, a new computer program for creating multiple alignments of protein sequences. Elements of the algorithm include fast distance estimation using kmer counting, progressive alignment using a new profile function we call the log-expectation score, and refinement using tree-dependent restricted partitioning. The speed and accuracy of MUSCLE are compared with T-Coffee, MAFFT and CLUSTALW on four test sets of reference alignments: BAliBASE, SABmark, SMART and a new benchmark, PREFAB. MUSCLE achieves the highest, or joint highest, rank in accuracy on each of these sets. Without refinement, MUSCLE achieves average accuracy statistically indistinguishable from T-Coffee and MAFFT, and is the fastest of the tested methods for large numbers of sequences, aligning 5000 sequences of average length 350 in 7 min on a current desktop computer. The MUSCLE program, source code and PREFAB test data are freely available at http://www.drive5. com/muscle.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
Z.-M. Zhao, K. F. Shortridge, M. Garcia, Y. Guan, and X.-F. Wan Genotypic diversity of H5N1 highly pathogenic avian influenza viruses J. Gen. Virol., September 1, 2008; 89(9): 2182 - 2193. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. V. Sernova and M. S. Gelfand Identification of replication origins in prokaryotic genomes Brief Bioinform, September 1, 2008; 9(5): 376 - 391. [Abstract] [Full Text] [PDF] |
||||
![]() |
S.-T. Kim and M. J. Donoghue Incongruence between cpDNA and nrITS trees indicates extensive hybridization within Eupersicaria (Polygonaceae) Am. J. Botany, September 1, 2008; 95(9): 1122 - 1135. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. J. Stanley, E. S. Ehrlich, L. Short, Y. Yu, Z. Xiao, X.-F. Yu, and Y. Xiong Structural Insight into the Human Immunodeficiency Virus Vif SOCS Box and Its Role in Human E3 Ubiquitin Ligase Assembly J. Virol., September 1, 2008; 82(17): 8656 - 8663. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Savic, T. Ilic-Tomic, R. Macmaster, B. Vasiljevic, and G. L. Conn Critical Residues for Cofactor Binding and Catalytic Activity in the Aminoglycoside Resistance Methyltransferase Sgm J. Bacteriol., September 1, 2008; 190(17): 5855 - 5861. [Abstract] [Full Text] [PDF] |
||||
![]() |
J.-C. Walser, L. Ponger, and A. V. Furano CpG dinucleotides and the mutation rate of non-CpG DNA Genome Res., September 1, 2008; 18(9): 1403 - 1414. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. A. Studer, S. Penel, L. Duret, and M. Robinson-Rechavi Pervasive positive selection on duplicated and nonduplicated vertebrate protein coding genes Genome Res., September 1, 2008; 18(9): 1393 - 1402. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Akk, P. Li, J. Bracamontes, D. E. Reichert, D. F. Covey, and J. H. Steinbach Mutations of the GABA-A Receptor {alpha}1 Subunit M1 Domain Reveal Unexpected Complexity for Modulation by Neuroactive Steroids Mol. Pharmacol., September 1, 2008; 74(3): 614 - 627. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Vieira-Silva and E. P. C. Rocha An Assessment of the Impacts of Molecular Oxygen on the Evolution of Proteomes Mol. Biol. Evol., September 1, 2008; 25(9): 1931 - 1942. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Burgess and Z. Yang Estimation of Hominoid Ancestral Population Sizes under Bayesian Coalescent Models Incorporating Mutation Rate Variation and Sequencing Errors Mol. Biol. Evol., September 1, 2008; 25(9): 1979 - 1994. [Abstract] [Full Text] [PDF] |
||||
![]() |
S.-T. Kim, S. E. Sultan, and M. J. Donoghue Allopolyploid speciation in Persicaria (Polygonaceae): Insights from a low-copy nuclear region PNAS, August 26, 2008; 105(34): 12370 - 12375. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Fan, Y. Zhang, Y. Yu, S. Rounsley, M. Long, and R. A. Wing The Subtelomere of Oryza sativa Chromosome 3 Short Arm as a Hot Bed of New Gene Origination in Rice Mol Plant, August 22, 2008; (2008) ssn050v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. van den Born, M. V. Omelchenko, A. Bekkelund, V. Leihne, E. V. Koonin, V. V. Dolja, and P. O. Falnes Viral AlkB proteins repair RNA damage by oxidative demethylation Nucleic Acids Res., August 21, 2008; (2008) gkn519v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. P. Brown Efficient functional clustering of protein sequences using the Dirichlet process Bioinformatics, August 15, 2008; 24(16): 1765 - 1771. [Abstract] [PDF] |
||||
![]() |
A. D. J. van Dijk, D. Bosch, C. J. F. ter Braak, A. R. van der Krol, and R. C. H. J. van Ham Predicting sub-Golgi localization of type II membrane proteins Bioinformatics, August 15, 2008; 24(16): 1779 - 1786. [Abstract] [PDF] |
||||
![]() |
T. Rausch, A.-K. Emde, D. Weese, A. Doring, C. Notredame, and K. Reinert Segment-based multiple sequence alignment Bioinformatics, August 15, 2008; 24(16): i187 - i192. [Abstract] [PDF] |
||||
![]() |
J. M. Lambert, R. J. Siezen, W. M. de Vos, and M. Kleerebezem Improved annotation of conjugated bile acid hydrolase superfamily members in Gram-positive bacteria Microbiology, August 1, 2008; 154(8): 2492 - 2500. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Iritani, H. Vennema, J. J. Siebenga, R. J. Siezen, B. Renckens, Y. Seto, A. Kaida, and M. Koopmans Genetic Analysis of the Capsid Gene of Genotype GII.2 Noroviruses J. Virol., August 1, 2008; 82(15): 7336 - 7345. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. L. Porter and A. S. Engel Diversity of Uncultured Epsilonproteobacteria from Terrestrial Sulfidic Caves and Springs Appl. Envir. Microbiol., August 1, 2008; 74(15): 4973 - 4977. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Liu, A. Nauta, C. Francke, and R. J. Siezen Comparative Genomics of Enzymes in Flavor-Forming Pathways from Amino Acids in Lactic Acid Bacteria Appl. Envir. Microbiol., August 1, 2008; 74(15): 4590 - 4600. [Full Text] [PDF] |
||||
![]() |
K. A. Grabinska, S. K. Ghosh, Z. Guan, J. Cui, C. R. H. Raetz, P. W. Robbins, and J. Samuelson Dolichyl-Phosphate-Glucose Is Used To Make O-Glycans on Glycoproteins of Trichomonas vaginalis Eukaryot. Cell, August 1, 2008; 7(8): 1344 - 1351. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. M. Turner, E. B. Chuong, and H. E. Hoekstra Comparative Analysis of Testis Protein Evolution in Rodents Genetics, August 1, 2008; 179(4): 2075 - 2089. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Yutin, K. S. Makarova, S. L. Mekhedov, Y. I. Wolf, and E. V. Koonin The Deep Archaeal Roots of Eukaryotes Mol. Biol. Evol., August 1, 2008; 25(8): 1619 - 1630. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. E. Dutilh, B. Snel, T. J. G. Ettema, and M. A. Huynen Signature Genes as a Phylogenomic Tool Mol. Biol. Evol., August 1, 2008; 25(8): 1659 - 1667. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. Santt, T. Pfirrmann, B. Braun, J. Juretschke, P. Kimmig, H. Scheel, K. Hofmann, M. Thumm, and D. H. Wolf The Yeast GID Complex, a Novel Ubiquitin Ligase (E3) Involved in the Regulation of Carbohydrate Metabolism Mol. Biol. Cell, August 1, 2008; 19(8): 3323 - 3333. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. S. Miller and D. Eisenberg Using inferred residue contacts to distinguish between correct and incorrect protein models Bioinformatics, July 15, 2008; 24(14): 1575 - 1582. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. J. Sanderson Phylogenetic Signal in the Eukaryotic Tree of Life Science, July 4, 2008; 321(5885): 121 - 123. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Katoh and H. Toh Recent developments in the MAFFT multiple sequence alignment program Brief Bioinform, July 1, 2008; 9(4): 286 - 298. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Quan, Z.-L. Ji, X. Wang, A. M. Tartakoff, and T. Tao Evolutionary and Transcriptional Analysis of Karyopherin {beta} Superfamily Proteins Mol. Cell. Proteomics, July 1, 2008; 7(7): 1254 - 1269. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Gupta, J. Benhamida, V. Bhargava, D. Goodman, E. Kain, I. Kerman, N. Nguyen, N. Ollikainen, J. Rodriguez, J. Wang, et al. Comparative proteogenomics: Combining mass spectrometry and comparative genomics to analyze multiple genomes Genome Res., July 1, 2008; 18(7): 1133 - 1142. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Garcia-Boronat, C. M. Diez-Rivero, E. L. Reinherz, and P. A. Reche PVS: a web server for protein sequence variability analysis tuned to facilitate conserved epitope discovery Nucleic Acids Res., July 1, 2008; 36(suppl_2): W35 - W41. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Moretti, A. Wilm, D. G. Higgins, I. Xenarios, and C. Notredame R-Coffee: a web server for accurately aligning noncoding RNA sequences Nucleic Acids Res., July 1, 2008; 36(suppl_2): W10 - W13. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Pei, M. Tang, and N. V. Grishin PROMALS3D web server for accurate multiple protein sequence and structure alignments Nucleic Acids Res., July 1, 2008; 36(suppl_2): W30 - W34. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. M. Overton, C. A. J. van Niekerk, L. G. Carter, A. Dawson, D. M. A. Martin, S. Cameron, S. A. McMahon, M. F. White, W. N. Hunter, J. H. Naismith, et al. TarO: a target optimisation system for structural biology Nucleic Acids Res., July 1, 2008; 36(suppl_2): W190 - W196. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. E. Martinez-Guerrero, R. Ciria, C. Abreu-Goodger, G. Moreno-Hagelsieb, and E. Merino GeConT 2: gene context analysis for orthologous proteins, conserved domains and metabolic pathways Nucleic Acids Res., July 1, 2008; 36(suppl_2): W176 - W180. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Egea, S. Casillas, and A. Barbadilla Standard and generalized McDonald-Kreitman test: a website to detect selection by comparing different classes of DNA sites Nucleic Acids Res., July 1, 2008; 36(suppl_2): W157 - W162. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. W. Thomas, M. Caceres, J. J. Lowman, C. B. Morehouse, M. E. Short, E. L. Baldwin, D. L. Maney, and C. L. Martin The Chromosomal Polymorphism Linked to Variation in Social Behavior in the White-Throated Sparrow (Zonotrichia albicollis) Is a Complex Rearrangement and Suppressor of Recombination Genetics, July 1, 2008; 179(3): 1455 - 1468. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. M. Comeau and H. M. Krisch The Capsid of the T4 Phage Superfamily: The Evolution, Diversity, and Structure of Some of the Most Prevalent Proteins in the Biosphere Mol. Biol. Evol., July 1, 2008; 25(7): 1321 - 1332. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. F. Hudson, S. Ohta, T. Freisinger, F. MacIsaac, L. Sennels, F. Alves, F. Lai, A. Kerr, J. Rappsilber, and W. C. Earnshaw Molecular and Genetic Analysis of Condensin Function in Vertebrate Cells Mol. Biol. Cell, July 1, 2008; 19(7): 3070 - 3079. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Mignard and J.-P. Flandrois A seven-gene, multilocus, genus-wide approach to the phylogeny of mycobacteria using supertrees Int J Syst Evol Microbiol, June 1, 2008; 58(6): 1432 - 1441. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. E. Flagel, R. A. Rapp, C. E. Grover, M. P. Widrlechner, J. Hawkins, J. L. Grafenberg, I. Alvarez, G. Y. Chung, and J. F. Wendel Phylogenetic, morphological, and chemotaxonomic incongruence in the North American endemic genus Echinacea Am. J. Botany, June 1, 2008; 95(6): 756 - 765. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Orlowski and J. M. Bujnicki Structural and evolutionary classification of Type II restriction enzymes based on theoretical and experimental analyses Nucleic Acids Res., June 1, 2008; 36(11): 3552 - 3569. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. M. Whittington, A. T. Papenfuss, P. Bansal, A. M. Torres, E. S.W. Wong, J. E. Deakin, T. Graves, A. Alsop, K. Schatzkamer, C. Kremitzki, et al. Defensins and the convergent evolution of platypus and reptile venom genes Genome Res., June 1, 2008; 18(6): 986 - 994. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Bedford, I. Wapinski, and D. L. Hartl Overdispersion of the Molecular Clock Varies Between Yeast, Drosophila and Mammals Genetics, June 1, 2008; 179(2): 977 - 984. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Hernandez-Sanchez, A. Mansilla, F. de Pablo, and R. Zardoya Evolution of the Insulin Receptor Family and Receptor Isoform Expression in Vertebrates Mol. Biol. Evol., June 1, 2008; 25(6): 1043 - 1053. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Matsuzaki, H. Kuroiwa, T. Kuroiwa, K. Kita, and H. Nozaki A Cryptic Algal Group Unveiled: A Plastid Biosynthesis Pathway in the Oyster Parasite Perkinsus marinus Mol. Biol. Evol., June 1, 2008; 25(6): 1167 - 1179. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. F. Boni, Y. Zhou, J. K. Taubenberger, and E. C. Holmes Homologous Recombination Is Very Rare or Absent in Human Influenza A Virus J. Virol., May 15, 2008; 82(10): 4807 - 4811. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. J. Sammut, R. D. Finn, and A. Bateman Pfam 10 years on: 10 000 families and still growing Brief Bioinform, May 1, 2008; 9(3): 210 - 219. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Wilm, D. G. Higgins, and C. Notredame R-Coffee: a method for multiple alignment of non-coding RNA Nucleic Acids Res., May 1, 2008; 36(9): e52 - e52. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Gao, M. K. Davidson, and W. P. Wahls Distinct regions of ATF/CREB proteins Atf1 and Pcr1 control recombination hotspot ade6-M26 and the osmotic stress response Nucleic Acids Res., May 1, 2008; 36(9): 2838 - 2851. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. A. Ray, C. Feschotte, H. J.T. Pagan, J. D. Smith, E. J. Pritham, P. Arensburger, P. W. Atkinson, and N. L. Craig Multiple waves of recent DNA transposon activity in the bat, Myotis lucifugus Genome Res., May 1, 2008; 18(5): 717 - 728. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. La Scola, K. Elkarkouri, W. Li, T. Wahab, G. Fournous, J.-M. Rolain, S. Biswas, M. Drancourt, C. Robert, S. Audic, et al. Rapid comparative genomic analysis for clinical microbiology: The Francisella tularensis paradigm Genome Res., May 1, 2008; 18(5): 742 - 750. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Goudet, S. Mugnier, I. Callebaut, and P. Monget Phylogenetic Analysis and Identification of Pseudogenes Reveal a Progressive Loss of Zona Pellucida Genes During Evolution of Vertebrates Biol Reprod, May 1, 2008; 78(5): 796 - 806. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Csuros, I. B. Rogozin, and E. V. Koonin Extremely Intron-Rich Genes in the Alveolate Ancestors Inferred with a Flexible Maximum-Likelihood Approach Mol. Biol. Evol., May 1, 2008; 25(5): 903 - 911. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Warthmann, S. Das, C. Lanz, and D. Weigel Comparative Analysis of the MIR319a MicroRNA Locus in Arabidopsis and Related Brassicaceae Mol. Biol. Evol., May 1, 2008; 25(5): 892 - 902. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Pantalacci, A. Chaumot, G. Benoit, A. Sadier, F. Delsuc, E. J. P. Douzery, and V. Laudet Conserved Features and Evolutionary Shifts of the EDA Signaling Pathway Involved in Vertebrate Skin Appendage Development Mol. Biol. Evol., May 1, 2008; 25(5): 912 - 928. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Balke, J. Gomez-Zurita, I. Ribera, A. Viloria, A. Zillikens, J. Steiner, M. Garcia, L. Hendrich, and A. P. Vogler Ancient associations of aquatic beetles and tank bromeliads in the Neotropical forest canopy PNAS, April 29, 2008; 105(17): 6356 - 6361. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. J. Paukstelis and A. M. Lambowitz Identification and evolution of fungal mitochondrial tyrosyl-tRNA synthetases with group I intron splicing activity PNAS, April 22, 2008; 105(16): 6010 - 6015. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. van Ooijen, G. Mayr, M. M. A. Kasiem, M. Albrecht, B. J. C. Cornelissen, and F. L. W. Takken Structure-function analysis of the NB-ARC domain of plant disease resistance proteins J. Exp. Bot., April 4, 2008; (2008) ern045v1. [Abstract] [Full Text] [PDF] |
||||





















