Skip Navigation

Nucleic Acids Research 2005 33(Web Server Issue):W252-W254; doi:10.1093/nar/gki362
This Article
Right arrow Abstract Freely available
Right arrow Print PDF (1397K) Freely available
Right arrow Screen PDF (155K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Vlahovicek, K.
Right arrow Articles by Pongor, S.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Vlahovicek, K.
Right arrow Articles by Pongor, S.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2005. Published by Oxford University Press. All rights reserved
The online version of this article has been published under an open access model. Users are entitled to use, reproduce, disseminate, or display the open access version of this article for non-commercial purposes provided that: the original authorship is properly and fully attributed; the Journal and Oxford University Press are attributed as the original place of publication with the correct citation details given; if an article is subsequently reproduced or disseminated not in its entirety but only in part or as a derivative work this must be clearly indicated. For commercial re-use, please contact journals.permissions{at}oupjournals.org


Article

CX, DPX and PRIDE: WWW servers for the analysis and comparison of protein 3D structures

Kristian Vlahovicek, Alessandro Pintar, Laavanya Parthasarathi, Oliviero Carugo and Sándor Pongor*

Protein Structure and Bioinformatics Group, International Centre for Genetic Engineering and Biotechnology Area Science Park, 34012 Trieste, Italy

*To whom correspondence should be addressed. Fax: +39 04 226 555; Email: pongor{at}icgeb.org

Received February 1, 2005. Revised February 23, 2005. Accepted February 23, 2005.


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 REFERENCES
 
The WWW servers at http://www.icgeb.org/protein/ are dedicated to the analysis of protein 3D structures submitted by the users as the Protein Data Bank (PDB) files. CX computes an atomic protrusion index that makes it possible to highlight the protruding atoms within a protein 3D structure. DPX calculates a depth index for the buried atoms and makes it possible to analyze the distribution of buried residues. CX and DPX return PDB files containing the calculated indices that can then be visualized using standard programs, such as Swiss-PDBviewer and Rasmol. PRIDE compares 3D structures using a fast algorithm based on the distribution of inter-atomic distances. The options include pairwise as well as multiple comparisons, and fold recognition based on searching the CATH fold database.


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 REFERENCES
 
The advent of structural genomics initiatives has led to an increase in the number of protein 3D structures and hence there is a growing need for novel analysis tools (13). Maintenance of the various analysis programs on changing computer platforms is becoming a problem for many users. The Protein tools page at ICGEB is a collection of locally developed methods designed to assist users in the analysis of 3D structures. The underlying algorithms are designed to be simple and fast. Therefore, they are particularly suited for online use and for large-scale data management. All the three servers described here were written as standard C programs with PHP front end and run on a Beowulf type Linux cluster. The servers accept the Protein Data Bank (PDB) files (4), a description of the input/output options as well as the underlying theory is provided in the form of online help files.

The CX server is a visualization tool designed to highlight protruding atoms within a protein structure. Identification of protruding, or highly convex regions in proteins is relevant to the analysis of interfaces in protein–protein complexes, in the prediction of limited proteolysis cleavage sites and in the identification of possible antigenic determinant regions. CX (13,5) calculates the ratio between the external volume and the volume occupied by the protein within a sphere centered at every protein atom. Atoms in protruding regions will have a high ratio between the external and the internal volume, i.e. a high cx protrusion index. For protein structures, cx values can vary between 0 and 15. Only two independent parameters are used by CX: the average atomic volume and the sphere radius. The default value for the average atomic volume used by CX is set to 20.1 Å3. Given the approximate nature of the method and its purposes, slight variations in the average atomic volume do not affect the results in a remarkable way. The choice of the second parameter, the sphere radius, is rather empirical. Smaller values of R will make CX more sensitive to the local environment, whereas larger values will make it more sensitive to the global shape of the protein. The default radius used by CX (10 Å) is a good compromise to highlight both backbone and side chain protruding atoms in most applications (Figure 1A).



View larger version (59K):
[in this window]
[in a new window]
 
Figure 1 The Protein tools server. (A) Title page [The SBASE (15), FTHOM (16) and P450 (17) services have been described elsewhere]. (B) Clustering of 24 CH domains by the PRIDE server. The tree is an ASCII rendering of the Newick file produced by the server. Bottom: structure of the human histone lysine N-methyltransferase SET7/9 complexed with a histone peptide and S-adenosyl-L-homocysteine (SAH), PDB: 1O9S [PDB] . (C) Output of the CX server rendered with Rasmol (5). The enzyme is shown as ribbons, the peptide as sticks and SAH as a CPK model using; the structure is colored according to the cx values, calculated using a sphere radius of 8 Å. (D) Output of the DPX server rendered with Rasmol (5), The CPK model of the enzyme is shown in slab mode in the same orientation as in the left panel, and atoms colored according to their dpx values.

 
The DPX server is designed to facilitate the analysis of buried atoms within the protein interior. Parameters, such as the solvent accessible area (6) and the occluded surface, cannot distinguish buried residues that are close to the protein surface from those that are deep inside the protein core. Depth defined as the distance between a protein atom and the nearest water molecule surrounding the protein (7) was found to be a useful descriptor of the protein interior. Depth correlates better than solvent accessibility not only with amide H/D exchange rates for several proteins, but also with the difference in the thermodynamic stability of proteins containing cavity-creating mutations and with the change in the free energy formation of protein–protein complexes (8). We have developed the DPX index defined as the distance (Å) of a non-hydrogen buried atom from its closest solvent accessible protein neighbor (9,10) where buried and accessible atoms are identified using the rolling sphere algorithm. Although some information is lost for surface atoms (all solvent exposed atoms have dpx = 0 by default), the depth calculation is very fast because neither water molecules nor surface dots are explicitly considered. The only parameter that can be varied is the radius of the probe sphere, for which the default value is set to 1.4 Å (Figure 1B).

Both CX and DPX read ATOM lines from a PDB file submitted by the user. Non-standard residues, cofactors, metal ions and water molecules described in HETATM lines are not taken into account. Each chain in the PDB file is treated as an independent molecule but the results are written into a single output file in the PDB format, in which the cx or dpx values are written in place of the atomic displacement parameters (B-factors). The output file can thus be displayed using molecular graphics programs [e.g. Rasmol (11) and Swiss-PDBviewer (12)], and atoms colored according to their cx (or dpx) value. Mean residue cx (or dpx) values are also calculated.

The PRIDE server is designed to compare the fold (backbone conformation) of protein structures [for a review, see Carugo and Pongor (2) and the Database Issue 2005 of Nucleic Acids Research for current references]. PRIDE is based on comparing distributions of intramolecular C{alpha}–C{alpha} distances using a standard statistical process, contingency table analysis which gives a probability of identity or PRIDE score (13). For the calculation, the protein 3D structure is represented by 28 different C{alpha}(i) – C{alpha}(i + n) distance distributions (3 < n < 30) and the final PRIDE score for two protein structures is the average calculated from the results of the 28 comparisons (0 ≤ PRIDE ≤ 1). The calculation is extremely fast, so pairwise as well as multiple comparisons can be compared online. As PRIDE is a metric, it can be used to cluster and classify protein 3D structures through standard cluster analysis methods. As the calculation is based only on the C{alpha} atoms, the input files may contain only the C{alpha} lines. The PRIDE pair option compares two structures. Its output contains not only the final PRIDE score, but also the values it was derived from as well as a graphic representation of the underlying histograms. In case the PRIDE cluster option is used to analyze n protein 3D structures (presented as concatenated PDB files), the server provides three, easily downloadable output files: (i) the n x n square matrix where each i-th–j-th element is the distance, defined as 1-PRIDE, between the i-th and the j-th protein 3D structures; (ii) the dendrogram that summarizes a cluster analysis performed using the neighbor program of the PHYLIP software suite by applying the neighbor-joining criterion for cluster merging (Figure 1); and (iii) a Newick-format tree description that allows one to build its own dendrograms with the help of programs, such as njplot (http://pbil.univ-lyon1.fr/software/njplot.html) and TreeView (http://taxonomy.zoology.gla.ac.uk/rod/treeview.html). In case the PRIDE/scan option is used, the database search option of the server makes it possible to compare a 3D structure with the folds of the CATH database (14). The search results are presented as a ranked list, and according to the statistical evaluation, in over 99.5 of the cases the most similar structure points to the correct topology group.


    ACKNOWLEDGEMENTS
 
Funding to pay the Open Access publication charges for this article was provided by the International Centre for Genetic Engineering and Biotechnology, Trieste, Italy.

Conflict of interest statement. None declared.


    Footnotes
 
Present addresses: L. Parthasarathi, Department of Clinical Pharmacology, Royal College of Surgeons, Dublin 2, Ireland

O. Carugo, Department of General Chemistry, University of Pavia, 27100 Pavia, Italy


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 REFERENCES
 

  1. Domingues, F.S., Koppensteiner, W.A., Sippl, M.J. (2000) The role of protein structure in genomics FEBS Lett., 476, 98–102[CrossRef][Web of Science][Medline] .

  2. Carugo, O. and Pongor, S. (2002) Recent progress in protein 3D structure comparison Curr. Protein Pept. Sci., 3, 441–449[CrossRef][Web of Science][Medline] .

  3. Carugo, O. and Pongor, S. (2002) The evolution of structural databases Trends Biotechnol., 20, 498–501[CrossRef][Web of Science][Medline] .

  4. Bernstein, F.C., Koetzle, T.F., Williams, G.J., Meyer, E.F., Jr, Brice, M.D., Rodgers, J.R., Kennard, O., Shimanouchi, T., Tasumi, M. (1977) The Protein Data Bank: a computer-based archival file for macromolecular structures J. Mol. Biol., 112, 535–542[Web of Science][Medline] .

  5. Pintar, A., Carugo, O., Pongor, S. (2002) CX, an algorithm that identifies protruding atoms in proteins Bioinformatics, 18, 980–984[Abstract/Free Full Text] .

  6. Lee, B. and Richards, F.M. (1971) The interpretation of protein structures. Estimation of static accessibility J. Mol. Biol., 55, 379–400[CrossRef][Web of Science][Medline] .

  7. Pedersen, T.G., Sigurskjold, B.W., Andersen, K.V., Kjaer, M., Poulsen, F.M., Dobson, C.M., Redfield, C. (1991) A nuclear magnetic resonance study of the hydrogen-exchange behaviour of lysozyme in crystals and solution J. Mol. Biol., 218, 413–426[CrossRef][Web of Science][Medline] .

  8. Chakravarty, S. and Varadarajan, R. (1999) Residue depth: a novel parameter for the analysis of protein structure and stability Struct. Fold Des., 7, 723–732[Medline] .

  9. Pintar, A., Carugo, O., Pongor, S. (2003) DPX: for the analysis of the protein core Bioinformatics, 19, 313–314[Abstract/Free Full Text] .

  10. Pintar, A., Carugo, O., Pongor, S. (2003) Atom depth as a descriptor of the protein interior Biophys. J., 84, 2553–2561[Web of Science][Medline] .

  11. Sayle, R.A. and Milner-White, E.J. (1995) RASMOL: biomolecular graphics for all Trends Biochem. Sci., 20, 374[CrossRef][Web of Science][Medline] .

  12. Guex, N. and Peitsch, M.C. (1997) SWISS-MODEL and the Swiss-PdbViewer: an environment for comparative protein modeling Electrophoresis, 18, 2714–2723[CrossRef][Web of Science][Medline] .

  13. Carugo, O. and Pongor, S. (2002) Protein fold similarity estimated by a probabilistic approach based on C{alpha}–C{alpha} distance comparison J. Mol. Biol., 315, 887–898[CrossRef][Web of Science][Medline] .

  14. Pearl, F., Todd, A., Sillitoe, I., Dibley, M., Redfern, O., Lewis, T., Bennett, C., Marsden, R., Grant, A., Lee, D., et al. (2005) The CATH Domain Structure Database and related resources Gene3D and DHS provide comprehensive domain family information for genome analysis Nucleic Acids Res., 33, D247–D251[Abstract/Free Full Text] .

  15. Vlahovicek, K., Kajan, L., Agoston, V., Pongor, S. (2005) The SBASE domain sequence resource, release 12: prediction of protein domain-architecture using support vector machines Nucleic Acids Res., 33, D223–D225[Abstract/Free Full Text] .

  16. Murvai, J., Vlahovicek, K., Barta, E., Parthasarathy, S., Hegyi, H., Pfeiffer, F., Pongor, S. (1999) The domain-server: direct prediction of protein domain-homologies from BLAST search Bioinformatics, 15, 343–344[Abstract/Free Full Text] .

  17. Fabian, P. and Degtyarenko, K.N. (1997) The directory of P450-containing systems in 1996 Nucleic Acids Res., 25, 274–277[Abstract/Free Full Text] .


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
BioinformaticsHome page
M. Stout, J. Bacardit, J. D. Hirst, and N. Krasnogor
Prediction of recursive convex hull class assignments for protein residues
Bioinformatics, April 1, 2008; 24(7): 916 - 923.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
E. S. C. Shih, R.-c. R. Gan, and M.-J. Hwang
OPAAS: a web server for optimal, permuted, and other alternative alignments of protein structures.
Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W95 - W98.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Print PDF (1397K) Freely available
Right arrow Screen PDF (155K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Vlahovicek, K.
Right arrow Articles by Pongor, S.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Vlahovicek, K.
Right arrow Articles by Pongor, S.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?