Nucleic Acids Research Advance Access published online on September 18, 2009
Nucleic Acids Research, doi:10.1093/nar/gkp781
Database Issue |
3D-footprint: a database for the structural analysis of protein–DNA complexes
1Estación Experimental de Aula Dei, Consejo Superior de Investigaciones Científicas, 2Fundación ARAID, Paseo María Agustín 36, Zaragoza, Spain and 3Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, Mexico
*To whom correspondence should be addressed. Tel: +34 976716089; Email: bcontreras{at}eead.csic.es
Received August 14, 2009. Revised September 2, 2009. Accepted September 3, 2009.
3D-footprint is a living database, updated and curated on a weekly basis, which provides estimates of binding specificity for all protein–DNA complexes available at the Protein Data Bank. The web interface allows the user to: (i) browse DNA-binding proteins by keyword; (ii) find proteins that recognize a similar DNA motif and (iii) BLAST similar DNA-binding proteins, highlighting interface residues in the resulting alignments. Each complex in the database is dissected to draw interface graphs and footprint logos, and two complementary algorithms are employed to characterize binding specificity. Moreover, oligonucleotide sequences extracted from literature abstracts are reported in order to show the range of variant sites bound by each protein and other related proteins. Benchmark experiments, including comparisons with expert-curated databases RegulonDB and TRANSFAC, support the quality of structure-based estimates of specificity. The relevant content of the database is available for download as flat files and it is also possible to use the 3D-footprint pipeline to analyze protein coordinates input by the user. 3D-footprint is available at http://floresta.eead.csic.es/3dfootprint with demo buttons and a comprehensive tutorial that illustrates the main uses of this resource.