Proteome Analyst: custom predictions with explanations in a web-based tool for high-throughput proteome annotations
Department of Computing Science, University of Alberta, Edmonton, AB, T6G 2E8, Canada
* To whom correspondence should be addressed. Tel: +1 780 492 7760; Fax: +1 780 492 1071; Email paullu{at}cs.ualberta.ca
Correspondence may also be addressed to Duane Szafron. Email: duane{at}cs.uslberta.ca
The authors wish it to be known that, in their opinion, the first two authors should be regarded as joint First Authors
Received February 15, 2004; Revised April 19, 2004; Accepted May 5, 2004
Proteome Analyst (PA) (http://www.cs.ualberta.ca/~bioinfo/PA/) is a publicly available, high-throughput, web-based system for predicting various properties of each protein in an entire proteome. Using machine-learned classifiers, PA can predict, for example, the GeneQuiz general function and Gene Ontology (GO) molecular function of a protein. In addition, PA is currently the most accurate and most comprehensive system for predicting subcellular localization, the location within a cell where a protein performs its main function. Two other capabilities of PA are notable. First, PA can create a custom classifier to predict a new property, without requiring any programming, based on labeled training data (i.e. a set of examples, each with the correct classification label) provided by a user. PA has been used to create custom classifiers for potassium-ion channel proteins and other general function ontologies. Second, PA provides a sophisticated explanation feature that shows why one prediction is chosen over another. The PA system produces a Naïve Bayes classifier, which is amenable to a graphical and interactive approach to explanations for its predictions; transparent predictions increase the user's confidence in, and understanding of, PA.
The online version of this article has been published under an open access model. Users are entitled to use, reproduce, disseminate, or display the open access version of this article provided that: the original authorship is properly and fully attributed; the Journal and Oxford University Press are attributed as the original place of publication with the correct citation details given; if an article is subsequently reproduced or disseminated not in its entirety but only in part or as a derivative work this must be clearly indicated.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
D. Paape, C. Lippuner, M. Schmid, R. Ackermann, M. E. Barrios-Llerena, U. Zimny-Arndt, V. Brinkmann, B. Arndt, K. P. Pleissner, P. R. Jungblut, et al. Transgenic, Fluorescent Leishmania mexicana Allow Direct Analysis of the Proteome of Intracellular Amastigotes Mol. Cell. Proteomics, September 1, 2008; 7(9): 1688 - 1701. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Casadio, P. L. Martelli, and A. Pierleoni The prediction of protein subcellular localization from sequence: a shortcut to functional genome annotation Brief Funct Genomic Proteomic, February 18, 2008; (2008) eln003v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Friedberg Automated protein function prediction--the genomic challenge Brief Bioinform, September 1, 2006; 7(3): 225 - 242. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Lu, D. Szafron, R. Greiner, D. S. Wishart, A. Fyshe, B. Pearcy, B. Poulin, R. Eisner, D. Ngo, and N. Lamb PA-GOSUB: a searchable database of model organism protein sequences with their predicted Gene Ontology molecular function and subcellular localization Nucleic Acids Res., January 1, 2005; 33(suppl_1): D147 - D153. [Abstract] [Full Text] [PDF] |
||||



