Nucleic Acids Research, 2001, Vol. 29, No. 8 1750-1764
© 2001 Oxford University Press
PartsList: a web-based system for dynamically ranking protein folds based on disparate attributes, including whole-genome expression and interaction information
Department of Molecular Biophysics and Biochemistry, Yale University, PO Box 208114, New Haven, CT 06520, USA, 1Department of Biochemistry and Molecular Biology, University College London, Darwin Building, Gower St, London WC1E 6BT, UK and 2European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
As the number of protein folds is quite limited, a mode of analysis that will be increasingly common in the future, especially with the advent of structural genomics, is to survey and re-survey the finite parts list of folds from an expanding number of perspectives. We have developed a new resource, called PartsList, that lets one dynamically perform these comparative fold surveys. It is available on the web at http://bioinfo.mbb.yale.edu/partslist and http://www.partslist.org. The system is based on the existing fold classifications and functions as a form of companion annotation for them, providing global views of many already completed fold surveys. The central idea in the system is that of comparison through ranking; PartsList will rank the approximately 420 folds based on more than 180 attributes. These include: (i) occurrence in a number of completely sequenced genomes (e.g. it will show the most common folds in the worm versus yeast); (ii) occurrence in the structure databank (e.g. most common folds in the PDB); (iii) both absolute and relative gene expression information (e.g. most changing folds in expression over the cell cycle); (iv) proteinprotein interactions, based on experimental data in yeast and comprehensive PDB surveys (e.g. most interacting fold); (v) sensitivity to inserted transposons; (vi) the number of functions associated with the fold (e.g. most multi-functional folds); (vii) amino acid composition (e.g. most Cys-rich folds); (viii) protein motions (e.g. most mobile folds); and (ix) the level of similarity based on a comprehensive set of structural alignments (e.g. most structurally variable folds). The integration of whole-genome expression and proteinprotein interaction data with structural information is a particularly novel feature of our system. We provide three ways of visualizing the rankings: a profiler emphasizing the progression of high and low ranks across many pre-selected attributes, a dynamic comparer for custom comparisons and a numerical rankings correlator. These allow one to directly compare very different attributes of a fold (e.g. expression level, genome occurrence and maximum motion) in the uniform numerical format of ranks. This uniform framework, in turn, highlights the way that the frequency of many of the attributes falls off with approximate power-law behavior (i.e. according to Vb, for attribute value V and constant exponent b), with a few folds having large values and most having small values.
* To whom correspondence should be addressed. Tel: +1 203 432 6105; Fax: +1 360 838 7861; Email: mark.gerstein{at}yale.edu
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
S. Flores, N. Echols, D. Milburn, B. Hespenheide, K. Keating, J. Lu, S. Wells, E. Z. Yu, M. Thorpe, and M. Gerstein The Database of Macromolecular Motions: new features added at the decade mark Nucleic Acids Res., January 1, 2006; 34(suppl_1): D296 - D301. [Abstract] [Full Text] [PDF] |
||||
![]() |
C.-S. Goh, N. Lan, N. Echols, S. M. Douglas, D. Milburn, P. Bertone, R. Xiao, L.-C. Ma, D. Zheng, Z. Wunderlich, et al. SPINE 2: a system for collaborative structural proteomics within a federated database framework Nucleic Acids Res., June 1, 2003; 31(11): 2833 - 2838. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Echols, D. Milburn, and M. Gerstein MolMovDB: analysis and visualization of conformational change and structural flexibility Nucleic Acids Res., January 1, 2003; 31(1): 478 - 482. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Muller, R. M. MacCallum, and M. J.E. Sternberg Structural Characterization of the Human Proteome Genome Res., November 1, 2002; 12(11): 1625 - 1641. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Lin, J. Qian, D. Greenbaum, P. Bertone, R. Das, N. Echols, A. Senes, B. Stenger, and M. Gerstein GeneCensus: genome comparisons in terms of metabolic pathway activity and protein family sharing Nucleic Acids Res., October 15, 2002; 30(20): 4574 - 4582. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Echols, P. Harrison, S. Balasubramanian, N. M. Luscombe, P. Bertone, Z. Zhang, and M. Gerstein Comprehensive analysis of amino acid and nucleotide composition in eukaryotic genomes, comparing genes and pseudogenes Nucleic Acids Res., June 1, 2002; 30(11): 2515 - 2523. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Lo Conte, S. E. Brenner, T. J. P. Hubbard, C. Chothia, and A. G. Murzin SCOP database in 2002: refinements accommodate structural genomics Nucleic Acids Res., January 1, 2002; 30(1): 264 - 267. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Hegyi and M. Gerstein Annotation Transfer for Genomics: Measuring Functional Divergence in Multi-Domain Proteins Genome Res., October 1, 2001; 11(10): 1632 - 1640. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Greenbaum, N. M. Luscombe, R. Jansen, J. Qian, and M. Gerstein Interrelating Different Types of Genomic Data, from Proteome to Secretome: 'Oming in on Function Genome Res., September 1, 2001; 11(9): 1463 - 1468. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Bertone, Y. Kluger, N. Lan, D. Zheng, D. Christendat, A. Yee, A. M. Edwards, C. H. Arrowsmith, G. T. Montelione, and M. Gerstein SPINE: an integrated tracking database and data mining approach for identifying feasible targets in high-throughput structural proteomics Nucleic Acids Res., July 1, 2001; 29(13): 2884 - 2898. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Mushegian and R. Medzhitov Evolutionary perspective on innate immune recognition J. Cell Biol., November 26, 2001; 155(5): 705 - 710. [Abstract] [Full Text] [PDF] |
||||


