Skip Navigation

This Article
Right arrow Abstract Freely available
Right arrow Print PDF (69K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (115)
Right arrowRequest Permissions
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Wu, C. H.
Right arrow Articles by Barker, W. C.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Wu, C. H.
Right arrow Articles by Barker, W. C.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Nucleic Acids Research, 2002, Vol. 30, No. 1 35-37
© 2002 Oxford University Press

The Protein Information Resource: an integrated public resource of functional annotation of proteins

Cathy H. Wu*, Hongzhan Huang, Leslie Arminski, Jorge Castro-Alvear, Yongxing Chen, Zhang-Zhi Hu, Robert S. Ledley, Kali C. Lewis, Hans-Werner Mewes1, Bruce C. Orcutt, Baris E. Suzek, Akira Tsugita2, C. R. Vinayaka, Lai-Su L. Yeh, Jian Zhang and Winona C. Barker

National Biomedical Research Foundation, Georgetown University Medical Center, 3900 Reservoir Road, NW, Washington, DC 20007, USA, 1GSF-Forschungszentrum f. Umwelt und Gesundheit, Munich Information Center for Protein Sequences am Max–Planck Instut für Biochemie, Am Klopferspitz 18, D-82152 Martinsried, Germany and 2Japan International Protein Information Database, Science University of Tokyo, Noda, Japan

Received September 17, 2001; Revised and Accepted October 10, 2001.


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 PIR-INTERNATIONAL PROTEIN...
 INTEGRATED PROTEIN...
 PIR-NREF
 AVAILABILITY
 REFERENCES
 
The Protein Information Resource (PIR) serves as an integrated public resource of functional annotation of protein data to support genomic/proteomic research and scientific discovery. The PIR, in collaboration with the Munich Information Center for Protein Sequences (MIPS) and the Japan International Protein Information Database (JIPID), produces the PIR-International Protein Sequence Database (PSD), the major annotated protein sequence database in the public domain, containing about 250 000 proteins. To improve protein annotation and the coverage of experimentally validated data, a bibliography submission system is developed for scientists to submit, categorize and retrieve literature information. Comprehensive protein information is available from iProClass, which includes family classification at the superfamily, domain and motif levels, structural and functional features of proteins, as well as cross-references to over 40 biological databases. To provide timely and comprehensive protein data with source attribution, we have introduced a non-redundant reference protein database, PIR-NREF. The database consists of about 800 000 proteins collected from PIR-PSD, SWISS-PROT, TrEMBL, GenPept, RefSeq and PDB, with composite protein names and literature data. To promote database interoperability, we provide XML data distribution and open database schema, and adopt common ontologies. The PIR web site (http://pir.georgetown.edu/) features data mining and sequence analysis tools for information retrieval and functional identification of proteins based on both sequence and annotation information. The PIR databases and other files are also available by FTP (ftp://nbrfa.georgetown.edu/pir_databases).


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 PIR-INTERNATIONAL PROTEIN...
 INTEGRATED PROTEIN...
 PIR-NREF
 AVAILABILITY
 REFERENCES
 
The Protein Information Resource (PIR) has been providing the scientific community with annotated protein databases and analysis tools for over three decades. To better support research in functional genomics and proteomics and facilitate knowledge discovery, we have made several new advances in the last year, in addition to further enhancing the PIR-International Protein Sequence Database. Some key developments include: launch of a new submission mechanism for literature data, distribution of a new non-redundant reference protein database, enhancement of the integrated classification database, and redesign of the web site for easy navigation, information retrieval and sequence analysis.


    PIR-INTERNATIONAL PROTEIN SEQUENCE DATABASE
 TOP
 ABSTRACT
 INTRODUCTION
 PIR-INTERNATIONAL PROTEIN...
 INTEGRATED PROTEIN...
 PIR-NREF
 AVAILABILITY
 REFERENCES
 
The PIR, along with the Munich Information Center for Protein Sequences (MIPS) and the Japan International Protein Information Database (JIPID), continues to enhance and distribute the PIR-International Protein Sequence Database (PSD), a non-redundant, expertly annotated, fully classified and extensively cross-referenced protein sequence database in the public domain. It contains about 250 000 protein sequences with comprehensive coverage across the entire taxonomic range, including sequences from all the publicly available complete genomes.

Superfamily classification
A unique characteristic of the PIR-PSD is the superfamily/family classification (1) that provides complete and non-overlapping clustering of proteins based on global (end-to-end) sequence similarity. Sequences in the same superfamily share common domain architecture (i.e. have the same number, order and types of domains) and do not differ excessively in overall length unless they are fragments or result from alternate splicing or initiators. The automated classification system places new members into existing superfamilies and defines new superfamily clusters using parameters including the percentage of sequence identity, overlap length ratio, distance to neighboring superfamily clusters, and overall domain arrangement. Currently, >99% of sequences are classified into families of closely related sequences (at least 45% identical), and over two-thirds of sequences are classified into over 33 000 superfamilies. The automated classification is being augmented by manual curation of superfamilies, starting with those containing at least one definable domain, to provide superfamily names, brief descriptions, bibliography, list of representative and seed members, as well as domain and motif architecture characteristic of the superfamily.

Bibliography submission and literature mapping
Linking protein data to literature data that describes or characterizes the proteins is crucial for us to increase the amount of experimentally verified data and to improve the quality of protein annotation. Attribution of protein annotations to validated experimental sources provides effective means to avoid propagation of errors that may have resulted from large-scale genome annotation. We have developed a bibliography submission system for the scientific community to submit, categorize and retrieve literature information for PSD protein entries. The submission interface guides users through steps in mapping the paper citation to given protein entries, entering the literature data, and summarizing the literature data using categories such as genetics, tissue/cellular localization, molecular complex or interaction, function, regulation and disease. Also included is a literature information page that provides literature data mining and displays both references cited in PIR and submitted by users.


    INTEGRATED PROTEIN CLASSIFICATION DATABASE
 TOP
 ABSTRACT
 INTRODUCTION
 PIR-INTERNATIONAL PROTEIN...
 INTEGRATED PROTEIN...
 PIR-NREF
 AVAILABILITY
 REFERENCES
 
The iProClass (integrated Protein Classification) database (2) is designed to provide comprehensive descriptions of all proteins and to serve as a framework for data integration in a distributed networking environment. The database describes family relationships at both global (whole protein) and local (domain, motif, site) levels, as well as structural and functional classifications and features of proteins. The current version (Release 1.0, August 2001) consists of more than 270 000 non-redundant PIR-PSD and SWISS-PROT proteins organized with more than 33 000 PIR superfamilies, 100 000 families, 3400 PIR homology and Pfam domains (3), 1300 ProClass/ProSite motifs (4,5), 280 PIR post-translational modification sites, and links to over 40 databases of protein families, structures, functions, genes, genomes, literature and taxonomy. Protein sequence and superfamily summary reports provide rich annotations such as membership information with length, taxonomy and keyword statistics, extensive cross-references and graphical display of domain and motif regions. Directly linked to the iProClass sequence report are two additional PIR databases, ASDB and RESID (6). PIR-Annotation and Similarity Database (ASDB) lists pre-computed, biweekly updated FASTA neighbors of all PSD sequences with annotation information and graphical displays of sequence similarity matches. PIR-RESID documents over 280 post-translational modifications and links to PSD entries containing either experimentally determined or computationally predicted modifications with evidence tags. Future versions of iProClass and ASDB will be based on the new PIR Non-redundant Reference Protein database (NREF).


    PIR-NREF
 TOP
 ABSTRACT
 INTRODUCTION
 PIR-INTERNATIONAL PROTEIN...
 INTEGRATED PROTEIN...
 PIR-NREF
 AVAILABILITY
 REFERENCES
 
As a major resource of protein information, one of our primary aims is to provide a timely and comprehensive collection of all protein sequence data that keeps pace with the genome sequencing projects and contains source attribution and minimal redundancy. The PIR-NREF protein database includes sequences from PIR, SWISS-PROT (7), TrEMBL (7), RefSeq (8), GenPept, PDB (9) and other protein databases. The NREF entries, each representing an identical amino acid sequence from the same source organism redundantly presented in one or more underlying protein databases, can serve as the basic unit for protein annotation. The NCBI taxonomy (http://www.ncbi.nlm.nih.gov/Taxonomy/taxonomyhome.html/) is used as the ontology for matching source organism names at the species or strain (if known) levels. The NREF report provides source attribution (containing protein IDs, accession numbers and protein names from underlying databases), in addition to taxonomy, amino acid sequence and composite literature data. The composite protein names, including synonyms, alternate names and even misspellings, can be used to assist the ontology development on protein names and the identification of mis-annotated proteins. Related sequences, including identical sequences from different organisms and closely related sequences within the same organism, are also listed. The database presently consists of about 800 000 entries and is updated biweekly.


    AVAILABILITY
 TOP
 ABSTRACT
 INTRODUCTION
 PIR-INTERNATIONAL PROTEIN...
 INTEGRATED PROTEIN...
 PIR-NREF
 AVAILABILITY
 REFERENCES
 
PIR web site
The PIR web site (http://pir.georgetown.edu) (10) connects data mining and sequence analysis tools to underlying databases for exploration of protein information and discovery of new knowledge. The site has been redesigned to include a user-friendly navigation system and more graphical interfaces and analysis tools. The PIR-PSD and iProClass pages represent primary entry points in the PIR web site. A list of the major PIR pages is shown in Table 1.


View this table:
[in this window]
[in a new window]
 
Table 1. Major PIR web pages for data mining and sequence analysis
 
The PIR-PSD interface provides entry retrieval, batch retrieval, basic or advanced text searches, and various sequence searches. The iProClass interface also includes both sequence and text searches. The BLAST search (11) returns best-matched proteins and superfamilies, while peptide match allows protein identification based on peptide sequences. Text search involves direct search of the underlying Oracle tables using unique identifiers or combinations of text strings. The NREF database is searchable by BLAST search, peptide match and direct report retrieval based on the NREF ID or the entry identifiers of the source databases. Other sequence searches supported on the PIR web site include FASTA (12), pattern matching, hidden Markov model (HMM) (13) domain and motif search, Smith–Waterman (14) pair-wise alignment, CLUSTALW (15) multiple alignment and GeneFIND (16) family identification.

PIR FTP site
The PIR anonymous FTP site (ftp://nbrfa.georgetown.edu/pir_databases) provides direct file transfer. Files distributed include the PIR-PSD (quarterly release and interim updates), PIR-NREF, other auxiliary databases, other documents, files and software programs. The PIR-PSD is distributed as flat files in NBRF and CODATA formats, with corresponding sequences in FASTA format. Both PIR-PSD and PIR-NREF are also distributed in XML format with the associated document type definition (DTD) file.

The PIR-PSD, iProClass and PIR-NREF databases have been implemented in Oracle 8i object-relational database system on our Unix server. To enable open source distribution, the databases are being mapped to MySQL and ported to Linux system. To establish reciprocal links to PIR databases, to host a PIR mirror web site or to request PIR database schema, please contact pirmail{at}nbrf.georgetown.edu.


    ACKNOWLEDGEMENTS
 
PIR is a registered mark of National Biomedical Research Foundation (NBRF). The PIR is supported by grant P41 LM05978 from the National Library of Medicine, National Institutes of Health. The iProClass and RESID databases are supported by DBI-9974855 and DBI-9808414 from the National Science Foundation.


    FOOTNOTES
 
* To whom correspondence should be addressed. Tel: +1 202 687 2121; Fax: +1 202 687 1662; Email: pirmail{at}nbrf.georgetown.edu Back


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 PIR-INTERNATIONAL PROTEIN...
 INTEGRATED PROTEIN...
 PIR-NREF
 AVAILABILITY
 REFERENCES
 

    1 Barker,W.C., Pfeiffer,F. and George,D.G. (1996) Superfamily classification in PIR-International Protein Sequence Database. Methods Enzymol., 266, 59–71.

    2 Wu,C.H., Xiao,C., Hou,Z., Huang,H. and Barker,W.C. (2001) iProClass: an integrated, comprehensive, and annotated protein classification database. Nucleic Acids Res., 29, 52–54.[Abstract/Free Full Text]

    3 Bateman,A., Birney,E., Durbin,R., Eddy,S.R., Howe,K.L. and Sonnhammer,E.L.L. (2000) The Pfam protein families database. Nucleic Acids Res., 28, 263–266. Updated article in this issue: Nucleic Acids Res. (2002), 30, 276–280.[Abstract/Free Full Text]

    4 Huang,H., Xiao,C. and Wu,C.H. (2000) ProClass protein family database. Nucleic Acids Res., 28, 273–276.[Abstract/Free Full Text]

    5 Hofmann,K., Bucher,P., Falquet,L. and Bairoch,A. (1999) The PROSITE database, its status in 1999. Nucleic Acids Res., 27, 215–219. Updated article in this issue: Nucleic Acids Res. (2002), 30, 235–238.[Abstract/Free Full Text]

    6 Garavelli,J.S., Hou,Z., Pattabiraman,N. and Stephens,R.M. (2001) The RESID database of protein structure modifications and the NRL-3D sequence-structure database. Nucleic Acids Res., 29, 199–201.[Abstract/Free Full Text]

    7 Bairoch,A. and Apweiler,R. (2000) The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000. Nucleic Acids Res., 28, 45–48.[Abstract/Free Full Text]

    8 Pruitt,K.D. and Maglott,D.R. (2001) RefSeq and LocusLink: NCBI gene-centered resources. Nucleic Acids Res., 29, 137–140.[Abstract/Free Full Text]

    9 Berman,H.M., Westbrook,J., Feng,Z., Gilliland,G., Bhat,T.N., Weissig,H., Shindyalov,I.N. and Bourne,P.E. (2000) The Protein Data Bank. Nucleic Acids Res., 28, 235–242. Updated article in this issue: Nucleic Acids Res. (2002), 30, 245–248.[Abstract/Free Full Text]

    10 McGarvey,P., Huang,H., Barker,W.C., Orcutt,B.C. and Wu,C.H. (2000) The PIR web site: new resource for bioinformatics. Bioinformatics, 16, 290–291.[Abstract/Free Full Text]

    11 Altschul,S.F., Madden,T.L., Schaffer,A.A., Zhang,J., Zhang,Z., Miller,W. and Lipman,D.J. (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res., 25, 3389–3402.[Abstract/Free Full Text]

    12 Pearson,W.R. and Lipman,D.J. (1988) Improved tools for biological sequence comparision. Proc. Natl Acad. Sci. USA, 85, 2444–2448.[Abstract/Free Full Text]

    13 Eddy,S.R., Mitchison,G. and Durbin,R. (1995) Maximum Discrimination hidden Markov models of sequence consensus. J. Comp. Biol., 2, 9–23.

    14 Smith,T.F. and Waterman,M.S. (1981) Comparison of bio-sequences. Adv. Appl. Math., 2, 482–489.

    15 Thompson,J.D., Higgins,D.G. and Gibson,T.J. (1994) CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res., 22, 4673–4680.[Abstract/Free Full Text]

    16 Wu,C.H., Huang,H. and McLarty,J. (1999) Gene family identification network design for protein sequence analysis. Int. J. Artif. Intell. Tools, 8, 419–432.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Brief BioinformHome page
C. M. Song, S. J. Lim, and J. C. Tong
Recent advances in computer-aided drug design
Brief Bioinform, May 11, 2009; (2009) bbp023v1.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
S. R. Baerson, F. E. Dayan, A. M. Rimando, N. P. D. Nanayakkara, C.-J. Liu, J. Schroder, M. Fishbein, Z. Pan, I. A. Kagan, L. H. Pratt, et al.
A Functional Genomics Investigation of Allelochemical Biosynthesis in Sorghum bicolor Root Hairs
J. Biol. Chem., February 8, 2008; 283(6): 3231 - 3247.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
F. Ferre, Y. Ponty, W. A. Lorenz, and P. Clote
DIAL: a web server for the pairwise alignment of two RNA three-dimensional structures using nucleotide, dihedral angle and base-pairing similarities
Nucleic Acids Res., July 13, 2007; 35(suppl_2): W659 - W668.
[Abstract] [Full Text] [PDF]


Home page
Molecular Cancer TherapeuticsHome page
E. E. Evans, A. D. Henn, A. Jonason, M. J. Paris, L. M. Schiffhauer, M. A. Borrello, E. S. Smith, D. M. Sahasrabudhe, and M. Zauderer
C35 (C17orf37) is a novel tumor biomarker abundantly expressed in breast cancer.
Mol. Cancer Ther., November 1, 2006; 5(11): 2919 - 2930.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
U. Guldener, M. Munsterkotter, M. Oesterheld, P. Pagel, A. Ruepp, H.-W. Mewes, and V. Stumpflen
MPact: the MIPS protein interaction resource on yeast
Nucleic Acids Res., January 1, 2006; 34(suppl_1): D436 - D441.
[Abstract] [Full Text] [PDF]


Home page
J. Immunol.Home page
A. Lucchese, J. Willers, A. Mittelman, D. Kanduc, and R. Dummer
Proteomic Scan for Tyrosinase Peptide Antigenic Pattern in Vitiligo and Melanoma: Role of Sequence Similarity and HLA-DR1 Affinity
J. Immunol., November 15, 2005; 175(10): 7009 - 7020.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
L. H. Pratt, C. Liang, M. Shah, F. Sun, H. Wang, St. P. Reid, A. R. Gingle, A. H. Paterson, R. Wing, R. Dean, et al.
Sorghum Expressed Sequence Tags Identify Signature Genes for Drought, Pathogenesis, and Skotomorphogenesis from a Milestone Set of 16,801 Unique Transcripts
Plant Physiology, October 1, 2005; 139(2): 869 - 884.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
X.-Y. Ren, M. W.E.J. Fiers, W. J. Stiekema, and J.-P. Nap
Local Coexpression Domains of Two to Four Genes in the Genome of Arabidopsis
Plant Physiology, June 1, 2005; 138(2): 923 - 934.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
R. A. Salzman, J. A. Brady, S. A. Finlayson, C. D. Buchanan, E. J. Summer, F. Sun, P. E. Klein, R. R. Klein, L. H. Pratt, M.-M. Cordonnier-Pratt, et al.
Transcriptional Profiling of Sorghum Induced by Methyl Jasmonate, Salicylic Acid, and Aminocyclopropane Carboxylic Acid Reveals Cooperative Regulation and Novel Gene Responses
Plant Physiology, May 1, 2005; 138(1): 352 - 368.
[Abstract] [Full Text] [PDF]


Home page
Clin. Cancer Res.Home page
E. N. Klyushnenkova, J. Link, W. T. Oberle, J. Kodak, C. Rich, A. A. Vandenbark, and R. B. Alexander
Identification of HLA-DRB1*1501-Restricted T-cell Epitopes from Prostate-Specific Antigen
Clin. Cancer Res., April 15, 2005; 11(8): 2853 - 2861.
[Abstract] [Full Text] [PDF]


Home page
Protein Eng Des SelHome page
O. Keskin and R. Nussinov
Favorable scaffolds: proteins with different sequence, structure and function may associate in similar ways
Protein Eng. Des. Sel., January 1, 2005; 18(1): 11 - 24.
[Abstract] [Full Text] [PDF]


Home page
J. Cell Sci.Home page
J. G. Turner, R. Engel, J. A. Derderian, R. Jove, and D. M. Sullivan
Human topoisomerase II{alpha} nuclear export is mediated by two CRM-1-dependent nuclear export signals
J. Cell Sci., June 15, 2004; 117(14): 3061 - 3071.
[Abstract] [Full Text] [PDF]


Home page
Biol. Reprod.Home page
D. Boerboom, K. A. Brown, D. Vaillancourt, P. Poitras, A. K. Goff, K. Watanabe, M. Dore, and J. Sirois
Expression of Key Prostaglandin Synthases in Equine Endometrium During Late Diestrus and Early Pregnancy
Biol Reprod, February 1, 2004; 70(2): 391 - 399.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
T. Guo, S. Hua, X. Ji, and Z. Sun
DBSubLoc: database of protein subcellular localization
Nucleic Acids Res., January 1, 2004; 32(90001): D122 - 124.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
C. J. Krieger, P. Zhang, L. A. Mueller, A. Wang, S. Paley, M. Arnaud, J. Pick, S. Y. Rhee, and P. D. Karp
MetaCyc: a multiorganism database of metabolic pathways and enzymes
Nucleic Acids Res., January 1, 2004; 32(90001): D438 - 442.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
R. Chaloupkova, J. Sykorova, Z. Prokop, A. Jesenska, M. Monincova, M. Pavlova, M. Tsuda, Y. Nagata, and J. Damborsky
Modification of Activity and Specificity of Haloalkane Dehalogenase from Sphingomonas paucimobilis UT26 by Engineering of Its Entrance Tunnel
J. Biol. Chem., December 26, 2003; 278(52): 52622 - 52628.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
B. de Bono and C. Chothia
Exegesis: a procedure to improve gene predictions and its use to find immunoglobulin superfamily proteins in the human and mouse genomes
Nucleic Acids Res., November 1, 2003; 31(21): 6096 - 6103.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
B. P. Kelley, R. Sharan, R. M. Karp, T. Sittler, D. E. Root, B. R. Stockwell, and T. Ideker
Conserved pathways within bacteria and yeast as revealed by global protein network alignment
PNAS, September 30, 2003; 100(20): 11394 - 11399.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
M.-Y. Choi, E.-J. Fuerst, A. Rafaeli, and R. Jurenka
Identification of a G protein-coupled receptor for pheromone biosynthesis activating neuropeptide from pheromone glands of the moth Helicoverpa zea
PNAS, August 19, 2003; 100(17): 9721 - 9726.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
C. Ramu
SIRW: a web server for the Simple Indexing and Retrieval System that combines sequence motif searches with keyword searches
Nucleic Acids Res., July 1, 2003; 31(13): 3771 - 3774.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
V. Brusic, R. S. Pillai, D. G. Silva, N. Petrovsky, RIKEN GER Group, GSL Members, and C. Schonbach
Cytokine-Related Genes Identified From the RIKEN Full-Length Mouse cDNA Data Set
Genome Res., June 1, 2003; 13(6): 1307 - 1317.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
T. Kasukawa, M. Furuno, I. Nikaido, H. Bono, D. A. Hume, C. Bult, D. P. Hill, R. Baldarelli, J. Gough, A. Kanapin, et al.
Development and Evaluation of an Automated Annotation Pipeline and cDNA Annotation System
Genome Res., June 1, 2003; 13(6): 1542 - 1551.
[Abstract] [Full Text] [PDF]


Home page
Circ. Res.Home page
R. L. Winslow and M. S. Boguski
Genome Informatics: Current Status and Future Prospects
Circ. Res., May 16, 2003; 92(9): 953 - 961.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
S. D. Mooney, T. E. Klein, R. B. Altman, M. A. Trifiro, and B. Gottlieb
A functional analysis of disease-associated mutations in the androgen receptor gene
Nucleic Acids Res., April 15, 2003; 31(8): e42 - e42.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
R. Overbeek, N. Larsen, T. Walunas, M. D'Souza, G. Pusch, E. Selkov Jr, K. Liolios, V. Joukov, D. Kaznadzey, I. Anderson, et al.
The ERGOTM genome analysis and discovery system
Nucleic Acids Res., January 1, 2003; 31(1): 164 - 171.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
O. Ivanciuc, C. H. Schein, and W. Braun
SDAP: database and computational tools for allergenic proteins
Nucleic Acids Res., January 1, 2003; 31(1): 359 - 362.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
H. Huang, W. C. Barker, Y. Chen, and C. H. Wu
iProClass: an integrated database of protein family, function and structure information
Nucleic Acids Res., January 1, 2003; 31(1): 390 - 392.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
T. la Cour, R. Gupta, K. Rapacki, K. Skriver, F. M. Poulsen, and S. Brunak
NESbase version 1.0: a database of nuclear export signals
Nucleic Acids Res., January 1, 2003; 31(1): 393 - 396.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
K. Vlahovicek, L. Kajan, J. Murvai, Z. Heged""s, and S. Pongor
The SBASE domain sequence library, release 10: domain architecture prediction
Nucleic Acids Res., January 1, 2003; 31(1): 403 - 405.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
M. Ikeda, M. Arai, T. Okuno, and T. Shimizu
TMPDB: a database of experimentally-characterized transmembrane topologies
Nucleic Acids Res., January 1, 2003; 31(1): 406 - 409.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
Mammalian Gene Collection Program Team*, R. L. Strausberg, E. A. Feingold, L. H. Grouse, J. G. Derge, R. D. Klausner, F. S. Collins, L. Wagner, C. M. Shenmen, G. D. Schuler, et al.
Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences
PNAS, December 24, 2002; 99(26): 16899 - 16903.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Print PDF (69K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (115)
Right arrowRequest Permissions
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Wu, C. H.
Right arrow Articles by Barker, W. C.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Wu, C. H.
Right arrow Articles by Barker, W. C.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?