Nucleic Acids Research, 2005, Vol. 33, Database issue D216-D218
© 2005, the authors
Nucleic Acids Research, Vol. 33, Database issue © Oxford University Press 2005; all rights reserved
ProtoNet 4.0: A hierarchical classification of one million protein sequences
Department of Biological Chemistry, Institute of Life Sciences and ¹School of Computer Science and Engineering, The Hebrew University of Jerusalem, Israel
* To whom correspondence should be addressed at The Hebrew University, Department of Biological Chemistry, Givat Ram Campus, Jerusalem, Israel, 91904. Tel: +972 2 6585433; Fax: +972 2 6586448; Email: kaplann{at}cc.huji.ac.il
Received September 10, 2004; Accepted September 14, 2004
ProtoNet is an automatic hierarchical classification of the protein sequence space. In 2004, ProtoNet (version 4.0) presents the analysis of over one million proteins merged from the SwissProt and TrEMBL databases. In addition to rich visualization and analysis tools for navigating the clustering hierarchy, we incorporated several improvements that allow a simplified view of the scaffold of the proteins. We developed an unsupervised, biologically valid method that condenses the ProtoNet hierarchy to only 12% of the clusters. A large portion of these clusters was automatically assigned high-confidence biological names according to their correspondence with functional annotations. ProtoNet is available at: http://www.protonet.cs.huji.ac.il.
The online version of this article has been published under an open access model. Users are entitled to use, reproduce, disseminate, or display the open access version of this article for non-commercial purposes provided that: the original authorship is properly and fully attributed; the Journal and Oxford University Press are attributed as the original place of publication with the correct citation details given; if an article is subsequently reproduced or disseminated not in its entirety but only in part or as a derivative work this must be clearly indicated. For commercial re-use permissions, please contact journals.permissions{at}oupjournals.org.