Nucleic Acids Research, 2003, Vol. 31, No. 14 3982-3992
© 2003 Oxford University Press
Comparative protein structure modeling by iterative alignment, model building and model assessment
Laboratory of Molecular Biophysics, Pels Family Center for Biochemistry and Structural Biology, The Rockefeller University, New York, NY 10021, USA and 1 Departments of Biopharmaceutical Sciences and Pharmaceutical Chemistry and California Institute for Quantitative Biomedical Research, University of California at San Francisco, San Francisco, CA 94143, USA
*To whom correspondence should be addressed at Mission Bay Genentech Hall, Suite N472D, 600 16th Street, University of California at San Francisco, San Francisco, CA 94143-2240, USA. Tel: +1 415 514 4227; Fax: +1 415 514 4231; Email: sali{at}salilab.org
Comparative or homology protein structure modeling is severely limited by errors in the alignment of a modeled sequence with related proteins of known three-dimensional structure. To ameliorate this problem, we have developed an automated method that optimizes both the alignment and the model implied by it. This task is achieved by a genetic algorithm protocol that starts with a set of initial alignments and then iterates through re-alignment, model building and model assessment to optimize a model assessment score. During this iterative process: (i) new alignments are constructed by application of a number of operators, such as alignment mutations and cross-overs; (ii) comparative models corresponding to these alignments are built by satisfaction of spatial restraints, as implemented in our program MODELLER; (iii) the models are assessed by a variety of criteria, partly depending on an atomic statistical potential. When testing the procedure on a very difficult set of 19 modeling targets sharing only 427% sequence identity with their template structures, the average final alignment accuracy increased from 37 to 45% relative to the initial alignment (the alignment accuracy was measured as the percentage of positions in the tested alignment that were identical to the reference structure-based alignment). Correspondingly, the average model accuracy increased from 43 to 54% (the model accuracy was measured as the percentage of the C
atoms of the model that were within 5 Å of the corresponding C
atoms in the superposed native structure). The present method also compares favorably with two of the most successful previously described methods, PSI-BLAST and SAM. The accuracy of the final models would be increased further if a better method for ranking of the models were available.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
I. I. Serysheva, S. J. Ludtke, M. L. Baker, Y. Cong, M. Topf, D. Eramian, A. Sali, S. L. Hamilton, and W. Chiu Subnanometer-resolution electron cryomicroscopy-based domain models for the cytoplasmic region of skeletal muscle RyR channel PNAS, July 15, 2008; 105(28): 9610 - 9615. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Poleksic and M. Fienup Optimizing the size of the sequence profiles to increase the accuracy of protein sequence alignments generated by profile-profile algorithms Bioinformatics, May 1, 2008; 24(9): 1145 - 1153. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Ma, N. Sayed, P. Baskaran, A. Beuve, and F. van den Akker PAS-mediated Dimerization of Soluble Guanylyl Cyclase Revealed by Signal Transduction Histidine Kinase Domain Crystal Structure J. Biol. Chem., January 11, 2008; 283(2): 1167 - 1178. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Becker, A. Cotillard, V. Meyer, H. Madaoui, and R. Guerois HMM-Kalign: a tool for generating sub-optimal HMM alignments Bioinformatics, November 15, 2007; 23(22): 3095 - 3097. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Singh, K. J. Walker, P. S. Sijwali, A. L. Lau, and P. J. Rosenthal A chimeric cysteine protease of Plasmodium berghei engineered to resemble the Plasmodium falciparum protease falcipain-2 Protein Eng. Des. Sel., April 12, 2007; (2007) gzm009v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Chivian and D. Baker Homology modeling using parametric alignment ensemble generation with consensus and energy-based model selection Nucleic Acids Res., October 18, 2006; 34(17): e112 - e112. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. N. Offman, P. W. Fitzjohn, and P. A. Bates Developing a move-set for protein model refinement Bioinformatics, August 1, 2006; 22(15): 1838 - 1845. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Qian, A. R. Ortiz, and D. Baker Improvement of comparative model accuracy by free-energy optimization along principal components of natural structural variation PNAS, October 26, 2004; 101(43): 15346 - 15351. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. R. Chance, A. Fiser, A. Sali, U. Pieper, N. Eswar, G. Xu, J. E. Fajardo, T. Radhakannan, and N. Marinkovic High-Throughput Computational and Experimental Techniques in Structural Genomics Genome Res., October 1, 2004; 14(10b): 2145 - 2154. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y.-T. Lee, J. Jacob, W. Michowski, M. Nowotny, J. Kuznicki, and W. J. Chazin Human Sgt1 Binds HSP90 through the CHORD-Sgt1 Domain and Not the Tetratricopeptide Repeat Domain J. Biol. Chem., April 16, 2004; 279(16): 16511 - 16517. [Abstract] [Full Text] [PDF] |
||||
![]() |
U. Pieper, N. Eswar, H. Braberg, M. S. Madhusudhan, F. P. Davis, A. C. Stuart, N. Mirkovic, A. Rossi, M. A. Marti-Renom, A. Fiser, et al. MODBASE, a database of annotated comparative protein structure models, and associated resources Nucleic Acids Res., January 1, 2004; 32(90001): D217 - 222. [Abstract] [Full Text] [PDF] |
||||





