Published online 26 January 2005
Article |
Quantitative evaluation of proteinDNA interactions using an optimized knowledge-based potential
1 Computational Systems Biology Laboratory, Department of Biochemistry and Molecular Biology, University of Georgia Athens, GA 30602, USA 2 Computational Biology Institute, Oak Ridge National Laboratory Oak Ridge, TN 37831, USA
*To whom correspondence should be addressed. Tel: +1 706 542 9779; Fax: +1 706 542 9751; Email: xyn{at}bmb.uga.edu
Received November 8, 2004. Revised December 16, 2004. Accepted January 3, 2005.
Computational evaluation of proteinDNA interaction is important for the identification of DNA-binding sites and genome annotation. It could validate the predicted binding motifs by sequence-based approaches through the calculation of the binding affinity between a protein and DNA. Such an evaluation should take into account structural information to deal with the complicated effects from DNA structural deformation, distance-dependent multi-body interactions and solvation contributions. In this paper, we present a knowledge-based potential built on interactions between protein residues and DNA tri-nucleotides. The potential, which explicitly considers the distance-dependent two-body, three-body and four-body interactions between protein residues and DNA nucleotides, has been optimized in terms of a Z-score. We have applied this knowledge-based potential to evaluate the binding affinities of zinc-finger proteinDNA complexes. The predicted binding affinities are in good agreement with the experimental data (with a correlation coefficient of 0.950). On a larger test set containing 48 proteinDNA complexes with known experimental binding free energies, our potential has achieved a high correlation coefficient of 0.800, when compared with the experimental data. We have also used this potential to identify binding motifs in DNA sequences of transcription factors (TF). The TFs in 79.4% of the known TFDNA complexes have accurately found their native binding sequences from a large pool of DNA sequences. When tested in a genome-scale search for TF-binding motifs of the cyclic AMP regulatory protein (CRP) of Escherichia coli, this potential ranks all known binding motifs of CRP in the top 15% of all candidate sequences.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
N. A. Temiz and C. J. Camacho Experimentally based contact energies decode interactions responsible for protein-DNA affinity and the role of molecular waters at the binding interface Nucleic Acids Res., July 1, 2009; 37(12): 4076 - 4088. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. V. Persikov, R. Osada, and M. Singh Predicting DNA recognition by Cys2His2 zinc finger proteins Bioinformatics, January 1, 2009; 25(1): 22 - 29. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Gao and J. Skolnick DBD-Hunter: a knowledge-based method for the prediction of DNA-protein interactions Nucleic Acids Res., July 1, 2008; 36(12): 3978 - 3992. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. W. Siggers and B. Honig Structure-based prediction of C2H2 zinc-finger binding specificity: sensitivity to docking geometry Nucleic Acids Res., February 28, 2007; 35(4): 1085 - 1097. [Abstract] [Full Text] [PDF] |
||||

