Skip Navigation

Nucleic Acids Research 2005 33(Database Issue):D178-D182; doi:10.1093/nar/gki060
This Article
Right arrow Full Text Freely available
Right arrow Print PDF (295K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Su, Q. J.
Right arrow Articles by Brutlag, D. L.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Su, Q. J.
Right arrow Articles by Brutlag, D. L.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Nucleic Acids Research, 2005, Vol. 33, Database issue D178-D182
© 2005, the authors
Nucleic Acids Research, Vol. 33, Database issue © Oxford University Press 2005; all rights reserved

eBLOCKs: enumerating conserved protein blocks to achieve maximal sensitivity and specificity

Qiaojuan Jane Su, Lin Lu1, Serge Saxonov2,3 and Douglas L. Brutlag2,*

Abgenix, Inc., 6701 Kaiser Drive, MS 11, Fremont, CA 94555, USA, 1 Oracle Corporation, 400 Oracle Parkway, Redwood shores, CA 94065, USA, 2 Department of Biochemistry and 3 Biomedical Informatics, Stanford University, Stanford, CA 94305, USA

* To whom correspondence should be addressed. Tel: +1 650 723 6593; Fax: +1 650 723 6783; Email: brutlag{at}stanford.edu

Received August 16, 2004; Revised and Accepted October 4, 2004

Classifying proteins into families and superfamilies allows identification of functionally important conserved domains. The motifs and scoring matrices derived from such conserved regions provide computational tools that recognize similar patterns in novel sequences, and thus enable the prediction of protein function for genomes. The eBLOCKs database enumerates a cascade of protein blocks with varied conservation levels for each functional domain. A biologically important region is most stringently conserved among a smaller family of highly similar proteins. The same region is often found in a larger group of more remotely related proteins with a reduced stringency. Through enumeration, highly specific signatures can be generated from blocks with more columns and fewer family members, while highly sensitive signatures can be derived from blocks with fewer columns and more members as in a superfamily. By applying PSI-BLAST and a modified K-means clustering algorithm, eBLOCKs automatically groups protein sequences according to different levels of similarity. Multiple sequence alignments are made and trimmed into a series of ungapped blocks. Motifs and position-specific scoring matrices were derived from eBLOCKs and made available for sequence search and annotation. The eBLOCKs database provides a tool for high-throughput genome annotation with maximal specificity and sensitivity. The eBLOCKs database is freely available on the World Wide Web at http://motif.stanford.edu/eblocks/ to all users for online usage. Academic and not-for-profit institutions wishing copies of the program may contact Douglas L. Brutlag (brutlag{at}stanford.edu). Commercial firms wishing copies of the program for internal installation may contact Jacqueline Tay at the Stanford Office of Technology Licensing (jacqueline.tay{at}stanford.edu; http://otl.stanford.edu/).


The online version of this article has been published under an open access model. Users are entitled to use, reproduce, disseminate, or display the open access version of this article for non-commercial purposes provided that: the original authorship is properly and fully attributed; the Journal and Oxford University Press are attributed as the original place of publication with the correct citation details given; if an article is subsequently reproduced or disseminated not in its entirety but only in part or as a derivative work this must be clearly indicated. For commercial re-use permissions, please contact journals.permissions{at}oupjournals.org.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Nucleic Acids ResHome page
S. Rajasekaran, S. Balla, P. Gradie, M. R. Gryk, K. Kadaveru, V. Kundeti, M. W. Maciejewski, T. Mi, N. Rubino, J. Vyas, et al.
Minimotif miner 2nd release: a database and web system for motif search
Nucleic Acids Res., January 1, 2009; 37(suppl_1): D185 - D190.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
C.-M. Hsu, C.-Y. Chen, and B.-J. Liu
Corrigendum
Nucleic Acids Res., March 27, 2008; 36(4): 1400 - 1406.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
C.-M. Hsu, C.-Y. Chen, and B.-J. Liu
MAGIIC-PRO: detecting functional signatures by efficient discovery of long patterns in protein sequences.
Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W356 - W361.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
R. Gutman, C. Berezin, R. Wollman, Y. Rosenberg, and N. Ben-Tal
QuasiMotiFinder: protein annotation by searching for evolutionarily conserved motif-like patterns
Nucleic Acids Res., July 1, 2005; 33(suppl_2): W255 - W261.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.