Skip Navigation

Nucleic Acids Research 2004 32(Web Server Issue):W414-W419; doi:10.1093/nar/gkh350
This Article
Right arrow Full Text Freely available
Right arrow Print PDF (382K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Bhasin, M.
Right arrow Articles by Raghava, G. P. S.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Bhasin, M.
Right arrow Articles by Raghava, G. P. S.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© 2004, the authors
Nucleic Acids Research, Vol. 32, Web Server issue © Oxford University Press 2004; all rights reserved

ESLpred: SVM-based method for subcellular localization of eukaryotic proteins using dipeptide composition and PSI-BLAST

Manoj Bhasin and G. P. S. Raghava*

Bioinformatics Centre, Institute of Microbial Technology, Sector 39A, Chandigarh, India

* To whom correspondence should be addressed. Tel: +91 172 2690557, 2695225; Fax: +91 172 2690632, 2690585; Email: raghava{at}imtech.res.in; Web: http://imtech.res.in/raghava/

Received January 14, 2004; Accepted February 9, 2004

Automated prediction of subcellular localization of proteins is an important step in the functional annotation of genomes. The existing subcellular localization prediction methods are based on either amino acid composition or N-terminal characteristics of the proteins. In this paper, support vector machine (SVM) has been used to predict the subcellular location of eukaryotic proteins from their different features such as amino acid composition, dipeptide composition and physico-chemical properties. The SVM module based on dipeptide composition performed better than the SVM modules based on amino acid composition or physico-chemical properties. In addition, PSI-BLAST was also used to search the query sequence against the dataset of proteins (experimentally annotated proteins) to predict its subcellular location. In order to improve the prediction accuracy, we developed a hybrid module using all features of a protein, which consisted of an input vector of 458 dimensions (400 dipeptide compositions, 33 properties, 20 amino acid compositions of the protein and 5 from PSI-BLAST output). Using this hybrid approach, the prediction accuracies of nuclear, cytoplasmic, mitochondrial and extracellular proteins reached 95.3, 85.2, 68.2 and 88.9%, respectively. The overall prediction accuracy of SVM modules based on amino acid composition, physico-chemical properties, dipeptide composition and the hybrid approach was 78.1, 77.8, 82.9 and 88.0%, respectively. The accuracy of all the modules was evaluated using a 5-fold cross-validation technique. Assigning a reliability index (reliability index ≥3), 73.5% of prediction can be made with an accuracy of 96.4%. Based on the above approach, an online web server ESLpred was developed, which is available at http://www.imtech.res.in/raghava/eslpred/.


The online version of this article has been published under an open access model. Users are entitled to use, reproduce, disseminate, or display the open access version of this article provided that: the original authorship is properly and fully attributed; the Journal and Oxford University Press are attributed as the original place of publication with the correct citation details given; if an article is subsequently reproduced or disseminated not in its entirety but only in part or as a derivative work this must be clearly indicated.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
DNA ResHome page
S. Keerthikumar, S. Bhadra, K. Kandasamy, R. Raju, Y.L. Ramachandra, C. Bhattacharyya, K. Imai, O. Ohara, S. Mohan, and A. Pandey
Prediction of Candidate Primary Immunodeficiency Disease Genes Using a Support Vector Machine Learning Approach
DNA Res, December 1, 2009; 16(6): 345 - 351.
[Abstract] [Full Text] [PDF]


Home page
Am. J. Physiol. Cell Physiol.Home page
A. V. Andreeva, M. A. Kutuzov, V. A. Tkachuk, and T. A. Voyno-Yasenetskaya
T-cadherin is located in the nucleus and centrosomes in endothelial cells
Am J Physiol Cell Physiol, November 1, 2009; 297(5): C1168 - C1177.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
S. Mintz-Oron, A. Aharoni, E. Ruppin, and T. Shlomi
Network-based prediction of metabolic enzymes' subcellular localization
Bioinformatics, June 15, 2009; 25(12): i247 - i1252.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
A. Urban, I. Behm-Ansmant, C. Branlant, and Y. Motorin
RNA Sequence and Two-dimensional Structure Features Required for Efficient Substrate Modification by the Saccharomyces cerevisiae RNA:{Psi}-Synthase Pus7p
J. Biol. Chem., February 27, 2009; 284(9): 5845 - 5858.
[Abstract] [Full Text] [PDF]


Home page
Mol PlantHome page
C. Yoshihara, K. Inoue, D. Schichnes, S. Ruzin, W. Inwood, and S. Kustu
An Rh1-GFP Fusion Protein Is in the Cytoplasmic Membrane of a White Mutant Strain of Chlamydomonas reinhardtii
Mol Plant, November 14, 2008; (2008) ssn074v1.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
K. Lee, H.-Y. Chuang, A. Beyer, M.-K. Sung, W.-K. Huh, B. Lee, and T. Ideker
Protein networks markedly improve prediction of subcellular localization in multiple eukaryotic species
Nucleic Acids Res., November 1, 2008; 36(20): e136 - e136.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
P. Jiang, H. Wu, J. Wei, F. Sang, X. Sun, and Z. Lu
RF-DYMHC: detecting the yeast meiotic recombination hotspots and coldspots by random forest model using gapped dinucleotide composition features
Nucleic Acids Res., July 13, 2007; 35(suppl_2): W47 - W51.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
A. Pierleoni, P. L. Martelli, P. Fariselli, and R. Casadio
eSLDB: eukaryotic subcellular localization database
Nucleic Acids Res., January 12, 2007; 35(suppl_1): D208 - D212.
[Abstract] [Full Text] [PDF]


Home page
J. Histochem. Cytochem.Home page
Z. Hao, X. Li, T. Qiao, R. Du, G. Zhang, and D. Fan
Subcellular Localization of CIAPIN1
J. Histochem. Cytochem., December 1, 2006; 54(12): 1437 - 1444.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. Guo and Y. Lin
TSSub: eukaryotic protein subcellular localization by extracting features from profiles
Bioinformatics, July 15, 2006; 22(14): 1784 - 1785.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
A. Hoglund, P. Donnes, T. Blum, H.-W. Adolph, and O. Kohlbacher
MultiLoc: prediction of protein subcellular localization using N-terminal targeting sequences, sequence motifs and amino acid composition
Bioinformatics, May 15, 2006; 22(10): 1158 - 1165.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
M. Kumar, R. Verma, and G. P. S. Raghava
Prediction of Mitochondrial Proteins Using Support Vector Machine and Hidden Markov Model
J. Biol. Chem., March 3, 2006; 281(9): 5357 - 5363.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
S. Idicula-Thomas, A. J. Kulkarni, B. D. Kulkarni, V. K. Jayaraman, and P. V. Balaji
A support vector machine-based method for predicting the propensity of a protein to be soluble or to form inclusion body on overexpression in Escherichia coli
Bioinformatics, February 1, 2006; 22(3): 278 - 284.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
D. Xie, A. Li, M. Wang, Z. Fan, and H. Feng
LOCSVMPSI: a web server for subcellular localization of eukaryotic proteins using SVM and profile of PSI-BLAST
Nucleic Acids Res., July 1, 2005; 33(suppl_2): W105 - W110.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
M. Bhasin and G. P. S. Raghava
GPCRsclass: a web tool for the classification of amine type of G-protein-coupled receptors
Nucleic Acids Res., July 1, 2005; 33(suppl_2): W143 - W147.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
L. Bao and Y. Cui
Prediction of the phenotypic effects of non-synonymous single nucleotide polymorphisms using structural and evolutionary information
Bioinformatics, May 15, 2005; 21(10): 2185 - 2190.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
M. Bhasin and G. P. S. Raghava
GPCRpred: an SVM-based method for prediction of families and subfamilies of G-protein coupled receptors
Nucleic Acids Res., July 1, 2004; 32(suppl_2): W383 - W389.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.