Skip Navigation

This Article
Right arrow Abstract Freely available
Right arrow Print PDF (334K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (46)
Right arrowRequest Permissions
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Hodges, P. E.
Right arrow Articles by Garrels, J. I.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Hodges, P. E.
Right arrow Articles by Garrels, J. I.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Nucleic Acids Research Pages 68-72  


The Yeast Protein Database (YPD): a curated proteome database for Saccharomyces cerevisiae
Introduction
The Yeast Protein Report
YPD Curation
Using The Yeast Protein Database
YPD Protein Minireviews: Adding A New Layer Of Information
Future Directions For YPD
How To Submit Protein Data To YPD
Citing YPD
Acknowledgements
References


The Yeast Protein Database (YPD): a curated proteome database for Saccharomyces cerevisiae

The Yeast Protein Database (YPD): a curated proteome database for Saccharomyces cerevisiae

Peter E. Hodges, William E. Payne, James I. Garrels*

Proteome, Inc., 200 Cummings Center, Suite 425C, Beverly, MA 01915, USA

Received October 27, 1997; Accepted November 4, 1997

ABSTRACT

The Yeast Protein Database (YPD) is a curated database for the proteome of Saccharomyces cerevisiae. It consists of ~6000 Yeast Protein Reports, one for each of the known or predicted yeast proteins. Each Yeast Protein Report is a one-page presentation of protein properties, annotation lines that summarize findings from the literature, and references. In the past year, the number of annotation lines has grown from 25 000 to ~35 000, and the number of articles curated has grown from ~3500 to >5000. Recently, new data types have been included in YPD: protein-protein interactions, genetic interactions, and regulators of gene expression. Finally, a new layer of information, the YPD Protein Minireviews, has recently been introduced. The Yeast Protein Database can be found on the Web at http://www.proteome.com/YPDhome.html

INTRODUCTION

Since completion of the genome sequence of Saccharomyces cerevisiae in 1996 (1), yeast has been the lead organism for post-genomic analysis. Large-scale methods are being used to study the mutant phenotype of deletion of each gene (2-4), to monitor the expression level of the genes (5), to detect the protein-protein interactions (6), and to measure directly the abundance, localization and modification of the proteins (7-10). As large-scale study of proteins (proteomics) increases in importance, new challenges arise for proteome databases.

At present, the greatest body of proteome information for yeast is the research literature. The Yeast Protein Database (YPD) is a curated proteome database that seeks to compile, organize and present in a convenient format the current knowledge of yeast protein functions (11,12). The ways that proteins work together to carry out the functions of the cell are better understood for yeast than for any other cell. More than 45% of the yeast proteins have been characterized by focused investigations already (see Table 1). The pace of characterization of yeast proteins has increased rapidly in recent years, and all of the genes/proteins will soon be characterized to some degree by large-scale methods. The YPD staff gathers the new information for each yeast protein, combines it with the prior knowledge, and presents the information for each protein as a one-page Yeast Protein Report.

Table 1. Growth of YPD content by release number
YPD
release
Date Total
proteins
Knownfunctiona Similarityb Unknown
functionc
1.2 Nov. 23, 1994 3020 1729 387 904
2.0 Dec. 8, 1994 3142 1750 450 942
3.0 Feb. 1, 1995 3512 1871 524 1117
4.0 Jun. 6, 1995 4046 1951 667 1428
4.1 Jul. 7, 1995 4305 2012 729 1564
5.0 Nov. 30, 1995 4559 2187 859 1913
6.0 Aug. 3, 1996 6021 2369 1231 2421
7.01 Jan. 21, 1997 6045 2507 1192 2346
7.42 Oct. 21, 1997 6089 2786 1084 2219
aProteins characterized through genetic or biochemical experiments.
bProteins that have not been characterized but have sequence similarity to characterized proteins.
cProteins of unknown function.

This summary updates the status of YPD and introduces several new features. The often numerous free-text annotation lines in each Yeast Protein Report are now sorted and placed under topic headings. Pop-up windows have been added that summarize known physical and genetic interactions and the known inducers or repressors of mRNA expression for each gene. Each fact reported in YPD is associated with a reference, and the reference numbers are linked to PubMed (13) where abstracts and sometimes links to full text articles are found. Finally, this report introduces a new layer of yeast protein information, the YPD Protein Minireviews.

THE YEAST PROTEIN REPORT

The Yeast Protein Report (Fig. 1) uses a standardized format for presentation of a maximal amount of protein information on one Web page. Links can take the user to pages with a greater level of detail, such as sequence database pages (14-17), PubMed abstracts (13), or to pages with a higher degree of overview, such as YPD Protein Minireviews (see below). The four sections of each Yeast Protein Report are: (i) the one-to-two line descriptive phrase (title line), (ii) the formatted protein property section, (iii) the annotations divided by topic headings, and (iv) the reference list. The title line is modified frequently as more is learned about the role of each protein. The protein property section includes alternate names, links to other sequence and model organism databases, properties calculated from the protein sequence, and experimentally determined properties. In YPD, the subcellular localizations, functional categories and post-translational modifications are based on experimental evidence or strong predictions (based on close family similarity).


Figure 1 A representative YPD Yeast Protein Report page describing actin, encoded by the ACT1 gene. Users can access this page via the World Wide Web at http://www.proteome.com/YPD/ACT1.html. The page includes a title and description (uppermost), a table of protein properties (upper table), annotations from the literature sorted with topic headings (main body, not completely shown here) and a reference list (not shown here). All of the color-highlighted properties, gene names, protein names and reference numbers are linked to their corresponding pop-up windows (see Figs 2 and 3), Yeast Protein Report pages, PubMed reference pages, or other databases.


Figure 2 Representative Interactions and Regulations pop-up windows from the actin Yeast Protein Report page. The Interactions window is subdivided into Associations, Synthetic Lethals, and Complexes. The gene name corresponding to the interacting gene or protein is presented, along with references, experimental method for determining associations and the name of the multiprotein complex. The Regulations window lists factors that regulate the expression of this protein, grouped as inducers or repressors. They include regulatory proteins (listed by their gene name), environmental conditions or small molecules. References are linked and regulatory proteins are linked to their own Yeast Protein Report page.

The newest additions to YPD are the Interaction Reports and the Regulation Reports (Fig. 2). These new reports can be viewed in pop-up windows without leaving the Yeast Protein Report. The Interaction Report presents current information on physical interactions (protein-protein associations and multiprotein complexes) and genetic interactions, with references. The Regulation Report summarizes the current knowledge of agents (environmental conditions, small molecules, or transcription factors) that induce or repress the protein of interest at the transcriptional level.

YPD CURATION

The defining characteristics of YPD is its collection of annotations derived from the yeast literature. The annotations represent the work of curators who read the literature, extract the protein properties and restate the key biological results as free-text annotations. YPD curators are experienced PhD-level yeast researchers who are trained in the style and depth of coverage expected for YPD. Each curator is supported by a variety of tools for searching the yeast sequence and reference databases, and the curator has access to a developer's version of YPD that contains curators notes and annotations from unpublished work. Curators choose their own topics, starting with the new literature for the current week and tracing back to related papers from the older literature. Most recent papers are curated by reading the full paper; many older papers are curated from their abstract only. The YPD reference database contains ~12 000 references to papers dealing with yeast proteins. It also tracks the curation status of each paper (reviewed in full, reviewed in abstract only, or not yet done). With the current staff of eight curators, it is our goal to complete the curation of the yeast literature within the next two years.

The integrity of all data entered into YPD is ensured by an editorial step. All notes submitted by curators are reviewed by a YPD editor for proper style and clarity. Editors often put in additional cross-referencing, correct the nomenclature and style, and resolve occasional inconsistencies. The final level of review is by the YPD users, many of whom submit comments, corrections, and new data. The YPD staff welcomes Email at ypd@proteome.com.

USING THE YEAST PROTEIN DATABASE

The YPD Web pages are intended for use by yeast biologists and researchers in other fields who can benefit from the yeast model system. YPD can serve (i) to acquaint yeast researchers with newly characterized proteins, (ii) to enumerate proteins that share a particular property or function, (iii) as a guide to the yeast research literature, and (iv) as an entry point for scientists working with other organisms who seek clues to the structure, function, or interactions of an unknown protein based on the knowledge of similar proteins in yeast. Users can access a single Yeast Protein Report directly (for example, actin's page at http://www.proteome.com/YPD/ACT1.html). However, most users will be interested in accessing the Reports through the search form and summary pages available through the YPD home page (http://www.proteome.com/YPDhome.html). The YPD search form allows searches by gene names, keywords, protein properties, functional categories and amino acid sequence. Since the proteome of S.cerevisiae is complete, such searches take on an added power. For example, a search of protein kinases brings up every protein kinase that is encoded by the yeast genome (18).

YPD PROTEIN MINIREVIEWS: ADDING A NEW LAYER OF INFORMATION

The YPD database is built from separate Yeast Protein Reports, and within each report are long lists of annotations. To summarize these numerous, independent facts, and to present results that cover multiple related proteins, we have introduced the YPD Protein Minireviews. Each is a short summary with graphics covering a group of proteins related as part of a complex, pathway or protein family, and each is written by an expert. Each protein mentioned in a Minireview is linked to the corresponding Yeast Protein Report, and each reference citations is linked to PubMed. Keywords and the gene names of the proteins mentioned in each Minireview are indexed and searchable. Unlike a traditional review, the Minireviews are dynamic. The Yeast Protein Reports to which they link are updated daily, and each Minireview itself will be periodically updated to reflect the new information. An example shown in Figure 3, `The KTR Family of Putative Golgi Mannosyltransferases' written by Marc Lussier, Anne-Marie Sdicu and Howard Bussey, is already available (http://www.proteome.com/YPD/YPM/mannosyltransferases.html), and many more will be added in the next year.


Figure 3 Portion of a representative YPD Protein Minireview. Usersmay access this Minireview at http://www.proteome.com/YPD/YPM/mannosyltransferases.html . The title and authors are given, with an Email reply directly to the authors. Keywords and a listing of the genes mentioned in the review are indexed at the beginning of the review. A table of contents lets the user skip to later sections. Each mention of a protein in the text is linked to that protein's Yeast Protein Report. References are linked to their PubMed abstract. A variety of graphics are presented to display the information.

FUTURE DIRECTIONS FOR YPD

YPD is dedicated to completing the curation of all relevant, published data on yeast proteins, and to building a substantial collection of YPD Protein Minireviews within the next two years. We plan to continue to add new features, including a frequently updated table of related proteins from other model organisms. YPD is also expanding to encompass data from large-scale functional genomic and proteomic studies. Results of these projects will be integrated into the individual Yeast Protein Reports, with links to specialized Web pages posted by other groups. Already YPD has included the data from systematic projects to determine the phenotype of knockout mutants (2-4), two-hybrid interactions (6), the level of gene expression at the RNA level (5), and direct protein analysis through mass spectroscopy (9) and two-dimensional gel electrophoresis (9,10). YPD has been designed to interface closely with two-dimensional gel analysis, both to help with interpretation of 2D gels (including spot identification) and as a repository of new information on protein expression and modification. A two-dimensional gel database (10) is underway that will be fully integrated with YPD.

A further role for YPD is as a model for new proteome databases. These can be generated in the near future for other microorganisms with sequenced genomes, using the same tools and methods that were developed for YPD. The development of proteome databases for other model organisms, the interconnection of these model organism databases, and the development of satellite databases for pathogenic microorganisms with links to the model organism databases will help to relate the power of genomic/proteomic approaches to biological and medical problems.

HOW TO SUBMIT PROTEIN DATA TO YPD

To supplement our curation, we appreciate feedback from our users on new data submission, additions and corrections, including personal communication of unpublished result (which will be cited as such). Any correspondence should be directed to ypd@proteome.comd or by mail to the address of the authors.

CITING YPD

Authors who make use of the information provided by YPD should cite this article as a general reference for the access to and content of YPD. To cite a YPD Protein Minireview, please use the following format, or a similar format dictated by the journal of publication:

Lussier, M., Sdicu, A.-M., and Bussey, H. (1997) The KTR Family of Putative Golgi Mannosyltransferases (YPD Protein Minireviews, http://www.proteome.com/YPD/YPM/mannosyltransferases.html ).

ACKNOWLEDGEMENTS

The development of the YPD has been funded by a Small Business Innovation Research grant, Phase II, (2 R44 GM54110-02) from the National Institute of General Medical Sciences. YPD relies on the expertise of a large number of scientists, and we are extremely grateful for their contributions. The quality of our annotation is due to the diligence of our curators and consultants: Maria Costanzo, Andrew McKee, Michael Cusick, David Gonda, Susan Bromberg, George Boguslawski and Rachelle Hecht. We thanks our panel of scientific experts for this contributions to the direction, scope and content of the database: Bruno André, Charles Cole, Michael Cusick, Les Grivell, Sepp Kohlwein and Jon Warner. We thank Marc Lussier, Anne-Marie Sdicu and Howard Bussey for contributing the initial YPD Protein Minireview. We thank our computer programmers Michael Benoit and Michael Tillberg, administrative assistant and reference librarian Cheryl Lengieza, and administrative assistant Shelley Lengieza. We appreciate the help and cooperative spirit of Mike Cherry, Cathy Ball, Caroline Adler, Selina Dwight, Shuai Weng and the staff at SGD, and Werner Mewes, Alfred Zollner, Jean Hani and the staff at MIPS. Most importantly, we thank our users in the scientific community. It is their findings that make up YPD, and their comments, corrections and additions that keep it accurate.

REFERENCES

1. Goffeau, A., Barrell, B.G., Bussey, H., Davis, R.W., Dujon, B., Feldmann, H., Galibert, F., Hoheisel, J.D., Jacq, C., Johnston, M. et al.) (1996) Science, 274, 546-567. MEDLINE Abstract

2. Smith, V., Chou, K.N., Lashkari, D., Botstein, D. and Brown, P.O. (1996) Science, 274, 2069-2074. MEDLINE Abstract

3. Shoemaker, D.D., Lashkari, D.A., Morris, D., Mittmann, M. and Davis, R.W. (1996) Nature Genet., 14, 450-456. MEDLINE Abstract

4. http://genome.wustl.edu/gsc/yeast/chromosome8ORFs.html

5. Velculescu, V.E., Zhang, L., Zhou, W., Vogelstein, J., Basrai, M.A., Bassett, D.E., Jr, Hieter, P., Vogelstein, B. and Kinzler, K.W. (1997) Cell, 88, 243-251. MEDLINE Abstract

6. Fromont-Racine, M., Rain, J.C. and Legrain, P. (1997) Nature Genet., 16, 277-282. MEDLINE Abstract

7. Burns, N., Grimwade, B., Ross-Macdonald, P.B., Choi, E.Y., Finberg, K., Roeder, G.S. and Snyder, M. (1994) Genes Dev., 8, 1087-1105. MEDLINE Abstract

8. Niedenthal, R.K., Riles, L., Johnston, M. and Hegemann, J.H. (1996) Yeast, 12, 773-786. MEDLINE Abstract

9. Shevchenko, A., Jensen, O.N., Podtelejnikov, A.V., Sagliocco, F., Wilm, M., Vorm, O., Mortensen, P., Shevchenko, A., Boucherie, H. and Mann, M. (1996) Proc. Natl. Acad. Sci. USA, 93, 14440-14445. MEDLINE Abstract

10. Garrels, J.I., McLaughlin, C.S., Warner, J.R., Futcher, B., Latter, G.I., Kobayashi, R., Schwender, B., Volpe, T., Anderson, D.S., Mesquita-Fuentes, R. and Payne, W.E. (1997) Electrophoresis, 18, 1347-1360. MEDLINE Abstract

11. Garrels, J.I. (1996) Nucleic Acids Res., 24, 46-49. MEDLINE Abstract

12. Payne, W.E. and Garrels, J.I. (1997) Nucleic Acids Res., 25, 57-62. MEDLINE Abstract

13. http://www.ncbi.nlm.nih.gov/PubMed/

14. http://www.ncbi.nlm.nih.gov/Web/Genbank/

15. http://expasy.hcuge.ch/sprot/sprot-top.html

16. http://www.mips.biochem.mpg.de/mips/yeast/

17. http://genome-www.stanford.edu/Saccharomyces/

18. Hunter, T. and Plowman, G.D. (1997) Trends Biochem. Sci., 22, 18-22. MEDLINE Abstract


*To whom correspondence should be addressed. Tel: +1 978 922 1643; Fax: +1 978 922 3971; Email: jg@proteome.com


This page is run by Oxford University Press, Great Clarendon Street, Oxford OX2 6DP, as part of the OUP Journals Comments and feedback: www-admin{at}oup.co.uk
Last modification: 16 Dec 1997
Copyright© Oxford University Press, 1998.

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Mol. Cell. ProteomicsHome page
J. Reinders, K. Wagner, R. P. Zahedi, D. Stojanovski, B. Eyrich, M. van der Laan, P. Rehling, A. Sickmann, N. Pfanner, and C. Meisinger
Profiling Phosphoproteins of Yeast Mitochondria Reveals a Role of Phosphorylation in Assembly of the ATP Synthase
Mol. Cell. Proteomics, November 1, 2007; 6(11): 1896 - 1906.
[Abstract] [Full Text] [PDF]


Home page
Appl. Environ. Microbiol.Home page
Y.-S. Jin, J. M. Laplaza, and T. W. Jeffries
Saccharomyces cerevisiae Engineered for Xylose Metabolism Exhibits a Respiratory Response
Appl. Envir. Microbiol., November 1, 2004; 70(11): 6816 - 6825.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
N. Wicker, D. Dembele, W. Raffelsberger, and O. Poch
Density of points clustering, application to transcriptomic data analysis
Nucleic Acids Res., September 15, 2002; 30(18): 3992 - 4000.
[Abstract] [Full Text] [PDF]


Home page
Mol. Cell. ProteomicsHome page
T. J. Griffin, S. P. Gygi, T. Ideker, B. Rist, J. Eng, L. Hood, and R. Aebersold
Complementary Profiling of Gene Expression at the Transcriptome and Proteome Levels in Saccharomyces cerevisiae
Mol. Cell. Proteomics, April 1, 2002; 1(4): 323 - 333.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
S. S. Dwight, M. A. Harris, K. Dolinski, C. A. Ball, G. Binkley, K. R. Christie, D. G. Fisk, L. Issel-Tarver, M. Schroeder, G. Sherlock, et al.
Saccharomyces Genome Database (SGD) provides secondary gene annotation using the Gene Ontology (GO)
Nucleic Acids Res., January 1, 2002; 30(1): 69 - 72.
[Abstract] [Full Text] [PDF]


Home page
Mol. Pharmacol.Home page
J. P. Basilion, A. R. Schievella, E. Burns, P. Rioux, J. C. Olson, B. P. Monia, K. M. Lemonidis, V. P. Stanton Jr., and D. E. Housman
Selective Killing of Cancer Cells Based on Loss of Heterozygosity and Normal Variation in the Human Genome: A New Paradigm for Anticancer Drug Therapy
Mol. Pharmacol., August 1, 1999; 56(2): 359 - 369.
[Abstract] [Full Text]


Home page
J. Biol. Chem.Home page
S. Santagata, D. Bhattacharyya, F.-H. Wang, N. Singha, A. Hodtsev, and E. Spanopoulou
Molecular Cloning and Characterization of a Mouse Homolog of Bacterial ClpX, a Novel Mammalian Class II Member of the Hsp100/Clp Chaperone Family
J. Biol. Chem., June 4, 1999; 274(23): 16311 - 16319.
[Abstract] [Full Text] [PDF]


Home page
Mol. Cell. Biol.Home page
S. P. Gygi, Y. Rochon, B. R. Franza, and R. Aebersold
Correlation between Protein and mRNA Abundance in Yeast
Mol. Cell. Biol., March 1, 1999; 19(3): 1720 - 1730.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Print PDF (334K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (46)
Right arrowRequest Permissions
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Hodges, P. E.
Right arrow Articles by Garrels, J. I.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Hodges, P. E.
Right arrow Articles by Garrels, J. I.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?