Nucleic Acids Research, 2005, Vol. 33, Database issue D413-D417
© 2005, the authors
Nucleic Acids Research, Vol. 33, Database issue © Oxford University Press 2005; all rights reserved
3did: interacting protein domains of known three-dimensional structure
1 EMBL, Meyerhofstrasse 1, 69117 Heidelberg, Germany and 2 EMBL, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton Hall, Cambridge CB10 1SD, UK
* To whom correspondence should be addressed. Tel: +49 6221 387 305; Fax: +49 6221 387 517; Email: aloy{at}embl.de
Received July 22, 2004; Revised and Accepted September 23, 2004
| ABSTRACT |
|---|
The database of 3D Interacting Domains (3did) is a collection of domain–domain interactions in proteins for which high-resolution three-dimensional structures are known. 3did exploits structural information to provide the critical molecular details necessary for understanding how interactions occur. It also offers an overview of how structurally similar the interactions are between different members of the same protein family. The database also contains Gene Ontology-based functional annotations and interactions between yeast proteins from large-scale interaction discovery studies. A web-based tool to query 3did is available at http://3did.embl.de.
| INTRODUCTION |
|---|
Proteins are social molecules and most biological processes require many of them to interact. This has encouraged many projects aimed at finding protein functions based on the detection of their relationships. Genome-scale interaction discovery approaches, such as the two-hybrid system (1–5) and affinity purification (6,7), have suggested thousands of protein–protein interactions. In silico approaches have also predicted many interactions with levels of accuracy similar to those determined experimentally (8). Put together, all these interactions have uncovered many aspects of protein connectivity, but without the critical molecular details often necessary to understand their function. Another difficulty is that it is often impossible to distinguish between direct physical interactions and functional associations that may not involve direct atomic contacts between macromolecules. Currently, atomic details of interactions are present in high-resolution three-dimensional (3D) structures of protein complexes, but this information is scarce and has been largely overlooked in large-scale studies. The database of interacting domains of known 3D structure (3did) exploits structural information to provide atomic details for thousands of direct physical interactions between proteins.
| 3did CONTENT |
|---|
Proteins are composed of modular elements (domains) that to a great extent determine their structure, function and interaction partners. We thus decided to structure our database on domains rather than full-length proteins. 3did obtains the high-resolution structures of individual proteins and complexes from the Protein Data Bank (PDB) (9). Pfam (10) domains are then assigned to each protein, the interactions between these domains are computed and the results are stored.
Currently, 3did includes information on 50 700 protein chains of known 3D structure, making a total of 48 426 domain–domain interactions. Of these, 13 482 occur between domains in the same chain (i.e. intra-molecular) and 34 944 between domains lying in different proteins (i.e. inter-molecular). We grouped these interactions into 2535 types according to the Pfam domains mediating them. Of these types, 411 always occur within the same polypeptide chain (intra-molecular), 1765 are seen only between different chains (inter-molecular) and 359 contain both intra- and inter-molecular interactions. When available, 3did also contains functional information about the interacting domains. Gene Ontology (GO) (11) terms for molecular function, biological process and cellular component could be assigned to 1325, 1122 and 480 families, respectively. The database also contains 1128 links between known structures and experimentally determined interactions between yeast proteins, as defined in MIPS (12). New 3D structures are incorporated weekly and major updates take place whenever a new version of Pfam is released. Up-to-date statistics on 3did contents can be found on the website.
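The grouping of interactions into types can be illustrated with a minimal Python sketch. It assumes each interaction is available as a tuple of two Pfam domain names plus a flag saying whether both domains lie in the same chain; this record layout, the function name and the example entries are hypothetical and are not taken from the 3did schema.

```python
from collections import defaultdict


def classify_interaction_types(interactions):
    """Group interactions by unordered Pfam domain pair and label each type.

    `interactions` is an iterable of (domain_a, domain_b, same_chain) tuples
    (a hypothetical record layout). Each type is labeled 'intra', 'inter' or
    'both', depending on where its instances were observed.
    """
    seen = defaultdict(set)
    for dom_a, dom_b, same_chain in interactions:
        pair = tuple(sorted((dom_a, dom_b)))  # the pair (A, B) equals (B, A)
        seen[pair].add("intra" if same_chain else "inter")
    return {
        pair: "both" if len(kinds) == 2 else next(iter(kinds))
        for pair, kinds in seen.items()
    }


# Toy records for illustration only; they are not database entries.
example = [
    ("SH2", "SH3_1", True),
    ("SH3_1", "SH2", False),
    ("Globin", "Globin", False),
    ("Pkinase", "SH2", True),
]
types = classify_interaction_types(example)

# Consistency check on the counts quoted in the text.
assert 411 + 1765 + 359 == 2535
assert 13482 + 34944 == 48426
```

Sorting the two domain names before grouping makes the classification independent of the order in which the domains are listed.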
| 3did USAGE AND FEATURES |
|---|
The standard way of accessing the database is by querying it with a particular domain. When doing so, 3did will show all domains that physically interact with the domain of interest and for which the 3D structure of the interaction is known. We computed physical interactions by requiring at least five contacts (hydrogen bonds, electrostatic or van der Waals interactions) between the two domains, and removed those that lack a significant interface as described previously (13). Nevertheless, it is likely that 3did still contains some non-biological contacts (e.g. from crystal packing), although we are working to remove them. The page will also show a list of the PDB codes for such domains and the associated functional GO terms, if defined. All the domain–domain interactions will also be displayed as an interactive network (Figure 1), where the user can choose the depth and a color scheme based on molecular function, biological process or cellular component as described by GO. The network also gives information on the type of contacts (i.e. intra- or inter-molecular) observed between the domains.
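As a rough illustration of the five-contact criterion, the following Python sketch counts close atom pairs between two domains. It is a simplification under stated assumptions: a single distance cutoff stands in for the separate definitions of hydrogen bonds, electrostatic and van der Waals interactions, the cutoff value is hypothetical rather than taken from the published method (13), and the interface-significance filter is not reproduced.

```python
import math

MIN_CONTACTS = 5        # threshold stated in the text
CUTOFF_ANGSTROM = 4.0   # hypothetical cutoff, NOT the published criterion


def count_contacts(atoms_a, atoms_b, cutoff=CUTOFF_ANGSTROM):
    """Count atom pairs, one atom from each domain, within `cutoff` angstroms."""
    return sum(
        1
        for a in atoms_a
        for b in atoms_b
        if math.dist(a, b) <= cutoff
    )


def domains_interact(atoms_a, atoms_b, min_contacts=MIN_CONTACTS,
                     cutoff=CUTOFF_ANGSTROM):
    """Return True if the two domains share at least `min_contacts` contacts."""
    return count_contacts(atoms_a, atoms_b, cutoff) >= min_contacts


# Toy coordinates in angstroms, for illustration only.
domain_a = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (2.0, 0.0, 0.0)]
domain_b = [(0.0, 3.0, 0.0), (1.0, 3.0, 0.0)]
distant_domain = [(50.0, 50.0, 50.0)]
```

The all-against-all loop is quadratic in the number of atoms; a real pipeline would more likely use a spatial index, which is omitted here to keep the sketch short.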
The user can then select a particular interaction among all the possibilities and retrieve the specific details stored in 3did. The output page for each domain–domain interaction displays a table with information concerning all the known 3D structures where this interaction is found (Figure 2). The table shows the exact location of the two domains in the 3D complex and gives empirical potential scores and Z-scores, which provide a measure of the number of favorable interacting residue pairs at the interface (13,14). These scores generally account for interaction specificity: the higher the Z-score, the more specific the interaction. Finally, clicking on the RasMol (15) icon displays the 3D complex. The two interacting domains are colored and shown in ribbon representation, with the residues participating in the interface (i.e. making hydrogen bonds, salt bridges or van der Waals contacts) shown in ball-and-stick (Figure 2, top right).
The table also contains links to our tool for plotting similarity in interactions (SimInt) (16). SimInt plots structural comparisons (iRMSD) of all instances of interactions of known 3D structure, highlighting those between the domains of interest (Figure 2, bottom right). This plot provides details as to how interactions involving particular families, superfamilies and folds, as defined in the SCOP database (17), can vary. Based on an analysis of hundreds of interactions, we suggested that two pairs of proteins interact in a similar way if the iRMSD is <10 Å.
We have also incorporated into 3did experimental interaction data for the yeast Saccharomyces cerevisiae from MIPS (Figure 2). For each yeast protein, we assign domains, and whenever two interacting proteins contain domains that also interact in 3did, we suggest that the interaction is likely to occur via these domains, thereby providing putative molecular details for the interaction (e.g. which residues are involved). It should be noted that some interactions in MIPS (i.e. those that come from pull-down experiments) link subunits in a complex that are not in physical contact and are thus not present in 3did.
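The mapping from an experimentally observed protein pair to candidate domain pairs can be sketched in Python as follows. The protein identifiers, the domain assignments and the set of structurally known pairs are hypothetical placeholders, not entries from MIPS or 3did, and the function is an illustration of the idea rather than the database's own code.

```python
from itertools import product


def candidate_domain_pairs(protein_a, protein_b, protein_domains, known_pairs):
    """List the domain pairs through which two proteins might interact.

    `protein_domains` maps a protein identifier to its assigned domains.
    `known_pairs` is a set of frozensets, one per domain pair whose
    interaction has a known 3D structure.
    """
    hits = set()
    for dom_a, dom_b in product(protein_domains.get(protein_a, ()),
                                protein_domains.get(protein_b, ())):
        # A frozenset ignores order and also covers homotypic pairs (A, A).
        if frozenset((dom_a, dom_b)) in known_pairs:
            hits.add(tuple(sorted((dom_a, dom_b))))
    return sorted(hits)


# Hypothetical data for illustration only.
protein_domains = {
    "PROT_A": ["Pkinase", "SH2"],
    "PROT_B": ["SH3_1"],
    "PROT_C": ["Globin"],
}
known_pairs = {frozenset(("SH2", "SH3_1")), frozenset(("Globin", "Globin"))}

pairs_ab = candidate_domain_pairs("PROT_A", "PROT_B", protein_domains, known_pairs)
pairs_ac = candidate_domain_pairs("PROT_A", "PROT_C", protein_domains, known_pairs)
```

An empty result, as for the second query, corresponds to an experimental interaction for which no structural template is available.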
The user can also choose to query 3did by pasting a protein sequence. Here, the web tool will graphically display the sequence with Pfam domains assigned automatically by means of BLAST (18) (E-value ≤ 10^-5) and links to interaction information for each domain. Alternatively, the user can search for all interactions in a given structure (Figure 3) or query 3did directly with GO or SCOP accession codes.
3did also offers the possibility to check whether there is a putative indirect interaction path across similar proteins of known structure. The search engine looks for all possible paths in 3did and displays those with the shortest length. This is particularly useful for large complexes, where components are known, but not the physical contacts. For example, in cytochrome c oxidase (COX), we can find a path between domains COX2 and COX8 since, although they do not interact directly, both interact with COX4 (Figure 3).
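The path search described above can be approximated with a breadth-first search over an undirected domain-interaction graph. The Python sketch below is one possible implementation, not the code behind the 3did search engine. Only the COX2–COX4 and COX4–COX8 edges come from the example in the text; the `DomX` and `DomY` nodes are hypothetical and serve only to add a longer alternative route.

```python
from collections import deque


def build_graph(edges):
    """Build an undirected adjacency map from (domain, domain) pairs."""
    graph = {}
    for a, b in edges:
        graph.setdefault(a, set()).add(b)
        graph.setdefault(b, set()).add(a)
    return graph


def shortest_paths(graph, start, goal):
    """Return every shortest path from `start` to `goal`, sorted."""
    if start == goal:
        return [[start]]
    dist = {start: 0}
    parents = {start: []}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        if goal in dist and dist[node] >= dist[goal]:
            break  # all predecessors of the goal have been expanded
        for nxt in graph.get(node, ()):
            if nxt not in dist:
                dist[nxt] = dist[node] + 1
                parents[nxt] = [node]
                queue.append(nxt)
            elif dist[nxt] == dist[node] + 1:
                parents[nxt].append(node)  # another equally short route
    if goal not in dist:
        return []

    paths = []

    def walk(node, suffix):
        # Backtrack from the goal to the start along recorded parents.
        if node == start:
            paths.append([start] + suffix)
            return
        for parent in parents[node]:
            walk(parent, [node] + suffix)

    walk(goal, [])
    return sorted(paths)


edges = [
    ("COX2", "COX4"), ("COX4", "COX8"),                    # from the text
    ("COX2", "DomX"), ("DomX", "DomY"), ("DomY", "COX8"),  # hypothetical
]
graph = build_graph(edges)
cox_paths = shortest_paths(graph, "COX2", "COX8")
```

Recording all parents at the same depth, rather than a single predecessor, lets the search report every path of minimal length, in line with the statement that 3did displays those with the shortest length.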
Future developments will include domain definitions from SMART (19), additional experimental interaction data and classification of interaction types (transient, tight-complexes, etc.).
| AVAILABILITY |
|---|
MySQL and flat files containing the entire database are available through the website for independent studies.
| Notes |
|---|
The online version of this article has been published under an open access model. Users are entitled to use, reproduce, disseminate, or display the open access version of this article for non-commercial purposes provided that: the original authorship is properly and fully attributed; the Journal and Oxford University Press are attributed as the original place of publication with the correct citation details given; if an article is subsequently reproduced or disseminated not in its entirety but only in part or as a derivative work this must be clearly indicated. For commercial re-use permissions, please contact journals.permissions{at}oupjournals.org.
| REFERENCES |
|---|
1. Uetz,P., Giot,L., Cagney,G., Mansfield,T.A., Judson,R.S., Knight,J.R., Lockshon,D., Narayan,V., Srinivasan,M., Pochart,P. et al. (2000) A comprehensive analysis of protein–protein interactions in Saccharomyces cerevisiae. Nature, 403, 623–627.
2. Ito,T., Chiba,T., Ozawa,R., Yoshida,M., Hattori,M. and Sakaki,Y. (2001) A comprehensive two-hybrid analysis to explore the yeast protein interactome. Proc. Natl Acad. Sci. USA, 98, 4569–4574.
3. Rain,J.C., Selig,L., De Reuse,H., Battaglia,V., Reverdy,C., Simon,S., Lenzen,G., Petel,F., Wojcik,J., Schachter,V. et al. (2001) The protein–protein interaction map of Helicobacter pylori. Nature, 409, 211–215.
4. Giot,L., Bader,J.S., Brouwer,C., Chaudhuri,A., Kuang,B., Li,Y., Hao,Y.L., Ooi,C.E., Godwin,B., Vitols,E. et al. (2003) A protein interaction map of Drosophila melanogaster. Science, 302, 1727–1736.
5. Li,S., Armstrong,C.M., Bertin,N., Ge,H., Milstein,S., Boxem,M., Vidalain,P.O., Han,J.D., Chesneau,A., Hao,T. et al. (2004) A map of the interactome network of the metazoan C. elegans. Science, 303, 540–543.
6. Ho,Y., Gruhler,A., Heilbut,A., Bader,G.D., Moore,L., Adams,S.L., Millar,A., Taylor,P., Bennett,K., Boutilier,K. et al. (2002) Systematic identification of protein complexes in Saccharomyces cerevisiae by mass spectrometry. Nature, 415, 180–183.
7. Gavin,A.C., Bosche,M., Krause,R., Grandi,P., Marzioch,M., Bauer,A., Schultz,J., Rick,J.M., Michon,A.M., Cruciat,C.M. et al. (2002) Functional organization of the yeast proteome by systematic analysis of protein complexes. Nature, 415, 141–147.
8. Von Mering,C., Krause,R., Snel,B., Cornell,M., Oliver,S.G., Fields,S. and Bork,P. (2002) Comparative assessment of large-scale data sets of protein–protein interactions. Nature, 417, 399–403.
9. Bourne,P.E., Addess,K.J., Bluhm,W.F., Chen,L., Deshpande,N., Feng,Z., Fleri,W., Green,R., Merino-Ott,J.C., Townsend-Merino,W. et al. (2004) The distribution and query systems of the RCSB Protein Data Bank. Nucleic Acids Res., 32, D223–D225.
10. Bateman,A., Coin,L., Durbin,R., Finn,R.D., Hollich,V., Griffiths-Jones,S., Khanna,A., Marshall,M., Moxon,S., Sonnhammer,E.L. et al. (2004) The Pfam protein families database. Nucleic Acids Res., 32, D138–D141.
11. Ashburner,M., Ball,C.A., Blake,J.A., Botstein,D., Butler,H., Cherry,J.M., Davis,A.P., Dolinski,K., Dwight,S.S., Eppig,J.T. et al. (2000) Gene Ontology: tool for the unification of biology. The Gene Ontology Consortium. Nature Genet., 25, 25–29.
12. Mewes,H.W., Amid,C., Arnold,R., Frishman,D., Guldener,U., Mannhaupt,G., Munsterkotter,M., Pagel,P., Strack,N., Stumpflen,V. et al. (2004) MIPS: analysis and annotation of proteins from whole genomes. Nucleic Acids Res., 32, D41–D44.
13. Aloy,P. and Russell,R.B. (2002) Interrogating protein interaction networks through structural biology. Proc. Natl Acad. Sci. USA, 99, 5896–5901.
14. Aloy,P. and Russell,R.B. (2003) InterPreTS: protein interaction prediction through tertiary structure. Bioinformatics, 19, 161–162.
15. Sayle,R.A. and Milner-White,E.J. (1995) RASMOL: biomolecular graphics for all. Trends Biochem. Sci., 20, 374.
16. Aloy,P., Ceulemans,H., Stark,A. and Russell,R.B. (2003) The relationship between sequence and interaction divergence in proteins. J. Mol. Biol., 332, 989–998.
17. Andreeva,A., Howorth,D., Brenner,S.E., Hubbard,T.J., Chothia,C. and Murzin,A.G. (2004) SCOP database in 2004: refinements integrate structure and sequence family data. Nucleic Acids Res., 32, D226–D229.
18. Altschul,S.F., Madden,T.L., Schaffer,A.A., Zhang,J., Zhang,Z., Miller,W. and Lipman,D.J. (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res., 25, 3389–3402.
19. Letunic,I., Copley,R.R., Schmidt,S., Ciccarelli,F.D., Doerks,T., Schultz,J., Ponting,C.P. and Bork,P. (2004) SMART 4.0: towards genomic data integration. Nucleic Acids Res., 32, D142–D144.