| Nucleic Acids Research | Pages |
The Yeast Protein Database (YPD): a curated proteome database for Saccharomyces cerevisiae
Introduction
The Yeast Protein Report
YPD Curation
Using The Yeast Protein Database
YPD Protein Minireviews: Adding A New Layer Of Information
Future Directions For YPD
How To Submit Protein Data To YPD
Citing YPD
Acknowledgements
References
The Yeast Protein Database (YPD): a curated proteome database for Saccharomyces cerevisiae
ABSTRACT
INTRODUCTION
Since completion of the genome sequence of Saccharomyces cerevisiae in 1996 (1), yeast has been the lead organism for post-genomic analysis. Large-scale methods are being used to study the mutant phenotype of deletion of each gene (2-4), to monitor the expression level of the genes (5), to detect the protein-protein interactions (6), and to measure directly the abundance, localization and modification of the proteins (7-10). As large-scale study of proteins (proteomics) increases in importance, new challenges arise for proteome databases.
At present, the greatest body of proteome information for yeast is the research literature. The Yeast Protein Database (YPD) is a curated proteome database that seeks to compile, organize and present in a convenient format the current knowledge of yeast protein functions (11,12). The ways that proteins work together to carry out the functions of the cell are better understood for yeast than for any other cell. More than 45% of the yeast proteins have been characterized by focused investigations already (see Table 1). The pace of characterization of yeast proteins has increased rapidly in recent years, and all of the genes/proteins will soon be characterized to some degree by large-scale methods. The YPD staff gathers the new information for each yeast protein, combines it with the prior knowledge, and presents the information for each protein as a one-page Yeast Protein Report.
Table
YPD
releaseDate
Total
proteinsKnownfunctiona
Similarityb
Unknown
functionc
1.2
Nov. 23, 1994
3020
1729
387
904
2.0
Dec. 8, 1994
3142
1750
450
942
3.0
Feb. 1, 1995
3512
1871
524
1117
4.0
Jun. 6, 1995
4046
1951
667
1428
4.1
Jul. 7, 1995
4305
2012
729
1564
5.0
Nov. 30, 1995
4559
2187
859
1913
6.0
Aug. 3, 1996
6021
2369
1231
2421
7.01
Jan. 21, 1997
6045
2507
1192
2346
7.42
Oct. 21, 1997
6089
2786
1084
2219
This summary updates the status of YPD and introduces several new features. The often numerous free-text annotation lines in each Yeast Protein Report are now sorted and placed under topic headings. Pop-up windows have been added that summarize known physical and genetic interactions and the known inducers or repressors of mRNA expression for each gene. Each fact reported in YPD is associated with a reference, and the reference numbers are linked to PubMed (13) where abstracts and sometimes links to full text articles are found. Finally, this report introduces a new layer of yeast protein information, the YPD Protein Minireviews.
THE YEAST PROTEIN REPORT
The Yeast Protein Report (Fig. 1) uses a standardized format for presentation of a maximal amount of protein information on one Web page. Links can take the user to pages with a greater level of detail, such as sequence database pages (14-17), PubMed abstracts (13), or to pages with a higher degree of overview, such as YPD Protein Minireviews (see below). The four sections of each Yeast Protein Report are: (i) the one-to-two line descriptive phrase (title line), (ii) the formatted protein property section, (iii) the annotations divided by topic headings, and (iv) the reference list. The title line is modified frequently as more is learned about the role of each protein. The protein property section includes alternate names, links to other sequence and model organism databases, properties calculated from the protein sequence, and experimentally determined properties. In YPD, the subcellular localizations, functional categories and post-translational modifications are based on experimental evidence or strong predictions (based on close family similarity).
Figure
Figure
The newest additions to YPD are the Interaction Reports and the Regulation Reports (Fig. 2). These new reports can be viewed in pop-up windows without leaving the Yeast Protein Report. The Interaction Report presents current information on physical interactions (protein-protein associations and multiprotein complexes) and genetic interactions, with references. The Regulation Report summarizes the current knowledge of agents (environmental conditions, small molecules, or transcription factors) that induce or repress the protein of interest at the transcriptional level.


YPD CURATION
The defining characteristics of YPD is its collection of annotations derived from the yeast literature. The annotations represent the work of curators who read the literature, extract the protein properties and restate the key biological results as free-text annotations. YPD curators are experienced PhD-level yeast researchers who are trained in the style and depth of coverage expected for YPD. Each curator is supported by a variety of tools for searching the yeast sequence and reference databases, and the curator has access to a developer's version of YPD that contains curators notes and annotations from unpublished work. Curators choose their own topics, starting with the new literature for the current week and tracing back to related papers from the older literature. Most recent papers are curated by reading the full paper; many older papers are curated from their abstract only. The YPD reference database contains ~12 000 references to papers dealing with yeast proteins. It also tracks the curation status of each paper (reviewed in full, reviewed in abstract only, or not yet done). With the current staff of eight curators, it is our goal to complete the curation of the yeast literature within the next two years.
The integrity of all data entered into YPD is ensured by an editorial step. All notes submitted by curators are reviewed by a YPD editor for proper style and clarity. Editors often put in additional cross-referencing, correct the nomenclature and style, and resolve occasional inconsistencies. The final level of review is by the YPD users, many of whom submit comments, corrections, and new data. The YPD staff welcomes Email at ypd@proteome.com.
USING THE YEAST PROTEIN DATABASE
The YPD Web pages are intended for use by yeast biologists and researchers in other fields who can benefit from the yeast model system. YPD can serve (i) to acquaint yeast researchers with newly characterized proteins, (ii) to enumerate proteins that share a particular property or function, (iii) as a guide to the yeast research literature, and (iv) as an entry point for scientists working with other organisms who seek clues to the structure, function, or interactions of an unknown protein based on the knowledge of similar proteins in yeast. Users can access a single Yeast Protein Report directly (for example, actin's page at http://www.proteome.com/YPD/ACT1.html). However, most users will be interested in accessing the Reports through the search form and summary pages available through the YPD home page (http://www.proteome.com/YPDhome.html). The YPD search form allows searches by gene names, keywords, protein properties, functional categories and amino acid sequence. Since the proteome of S.cerevisiae is complete, such searches take on an added power. For example, a search of protein kinases brings up every protein kinase that is encoded by the yeast genome (18).
YPD PROTEIN MINIREVIEWS: ADDING A NEW LAYER OF INFORMATION
The YPD database is built from separate Yeast Protein Reports, and within each report are long lists of annotations. To summarize these numerous, independent facts, and to present results that cover multiple related proteins, we have introduced the YPD Protein Minireviews. Each is a short summary with graphics covering a group of proteins related as part of a complex, pathway or protein family, and each is written by an expert. Each protein mentioned in a Minireview is linked to the corresponding Yeast Protein Report, and each reference citations is linked to PubMed. Keywords and the gene names of the proteins mentioned in each Minireview are indexed and searchable. Unlike a traditional review, the Minireviews are dynamic. The Yeast Protein Reports to which they link are updated daily, and each Minireview itself will be periodically updated to reflect the new information. An example shown in Figure 3, `The KTR Family of Putative Golgi Mannosyltransferases' written by Marc Lussier, Anne-Marie Sdicu and Howard Bussey, is already available (http://www.proteome.com/YPD/YPM/mannosyltransferases.html), and many more will be added in the next year.
Figure

FUTURE DIRECTIONS FOR YPD
YPD is dedicated to completing the curation of all relevant, published data on yeast proteins, and to building a substantial collection of YPD Protein Minireviews within the next two years. We plan to continue to add new features, including a frequently updated table of related proteins from other model organisms. YPD is also expanding to encompass data from large-scale functional genomic and proteomic studies. Results of these projects will be integrated into the individual Yeast Protein Reports, with links to specialized Web pages posted by other groups. Already YPD has included the data from systematic projects to determine the phenotype of knockout mutants (2-4), two-hybrid interactions (6), the level of gene expression at the RNA level (5), and direct protein analysis through mass spectroscopy (9) and two-dimensional gel electrophoresis (9,10). YPD has been designed to interface closely with two-dimensional gel analysis, both to help with interpretation of 2D gels (including spot identification) and as a repository of new information on protein expression and modification. A two-dimensional gel database (10) is underway that will be fully integrated with YPD.
A further role for YPD is as a model for new proteome databases. These can be generated in the near future for other microorganisms with sequenced genomes, using the same tools and methods that were developed for YPD. The development of proteome databases for other model organisms, the interconnection of these model organism databases, and the development of satellite databases for pathogenic microorganisms with links to the model organism databases will help to relate the power of genomic/proteomic approaches to biological and medical problems.
HOW TO SUBMIT PROTEIN DATA TO YPD
To supplement our curation, we appreciate feedback from our users on new data submission, additions and corrections, including personal communication of unpublished result (which will be cited as such). Any correspondence should be directed to ypd@proteome.comd or by mail to the address of the authors.
CITING YPD
Authors who make use of the information provided by YPD should cite this article as a general reference for the access to and content of YPD. To cite a YPD Protein Minireview, please use the following format, or a similar format dictated by the journal of publication:
Lussier, M., Sdicu, A.-M., and Bussey, H. (1997) The KTR Family of Putative Golgi Mannosyltransferases (YPD Protein Minireviews, http://www.proteome.com/YPD/YPM/mannosyltransferases.html ).
ACKNOWLEDGEMENTS
The development of the YPD has been funded by a Small Business Innovation Research grant, Phase II, (2 R44 GM54110-02) from the National Institute of General Medical Sciences. YPD relies on the expertise of a large number of scientists, and we are extremely grateful for their contributions. The quality of our annotation is due to the diligence of our curators and consultants: Maria Costanzo, Andrew McKee, Michael Cusick, David Gonda, Susan Bromberg, George Boguslawski and Rachelle Hecht. We thanks our panel of scientific experts for this contributions to the direction, scope and content of the database: Bruno André, Charles Cole, Michael Cusick, Les Grivell, Sepp Kohlwein and Jon Warner. We thank Marc Lussier, Anne-Marie Sdicu and Howard Bussey for contributing the initial YPD Protein Minireview. We thank our computer programmers Michael Benoit and Michael Tillberg, administrative assistant and reference librarian Cheryl Lengieza, and administrative assistant Shelley Lengieza. We appreciate the help and cooperative spirit of Mike Cherry, Cathy Ball, Caroline Adler, Selina Dwight, Shuai Weng and the staff at SGD, and Werner Mewes, Alfred Zollner, Jean Hani and the staff at MIPS. Most importantly, we thank our users in the scientific community. It is their findings that make up YPD, and their comments, corrections and additions that keep it accurate.
REFERENCES
This page is run by Oxford University Press, Great Clarendon Street, Oxford OX2 6DP, as part of the OUP Journals Comments and feedback: www-admin{at}oup.co.uk
Last modification: 16 Dec 1997
Copyright© Oxford University Press, 1998.
This article has been cited by other articles:
![]() |
J. Reinders, K. Wagner, R. P. Zahedi, D. Stojanovski, B. Eyrich, M. van der Laan, P. Rehling, A. Sickmann, N. Pfanner, and C. Meisinger Profiling Phosphoproteins of Yeast Mitochondria Reveals a Role of Phosphorylation in Assembly of the ATP Synthase Mol. Cell. Proteomics, November 1, 2007; 6(11): 1896 - 1906. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y.-S. Jin, J. M. Laplaza, and T. W. Jeffries Saccharomyces cerevisiae Engineered for Xylose Metabolism Exhibits a Respiratory Response Appl. Envir. Microbiol., November 1, 2004; 70(11): 6816 - 6825. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Wicker, D. Dembele, W. Raffelsberger, and O. Poch Density of points clustering, application to transcriptomic data analysis Nucleic Acids Res., September 15, 2002; 30(18): 3992 - 4000. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. J. Griffin, S. P. Gygi, T. Ideker, B. Rist, J. Eng, L. Hood, and R. Aebersold Complementary Profiling of Gene Expression at the Transcriptome and Proteome Levels in Saccharomyces cerevisiae Mol. Cell. Proteomics, April 1, 2002; 1(4): 323 - 333. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. S. Dwight, M. A. Harris, K. Dolinski, C. A. Ball, G. Binkley, K. R. Christie, D. G. Fisk, L. Issel-Tarver, M. Schroeder, G. Sherlock, et al. Saccharomyces Genome Database (SGD) provides secondary gene annotation using the Gene Ontology (GO) Nucleic Acids Res., January 1, 2002; 30(1): 69 - 72. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. P. Basilion, A. R. Schievella, E. Burns, P. Rioux, J. C. Olson, B. P. Monia, K. M. Lemonidis, V. P. Stanton Jr., and D. E. Housman Selective Killing of Cancer Cells Based on Loss of Heterozygosity and Normal Variation in the Human Genome: A New Paradigm for Anticancer Drug Therapy Mol. Pharmacol., August 1, 1999; 56(2): 359 - 369. [Abstract] [Full Text] |
||||
![]() |
S. Santagata, D. Bhattacharyya, F.-H. Wang, N. Singha, A. Hodtsev, and E. Spanopoulou Molecular Cloning and Characterization of a Mouse Homolog of Bacterial ClpX, a Novel Mammalian Class II Member of the Hsp100/Clp Chaperone Family J. Biol. Chem., June 4, 1999; 274(23): 16311 - 16319. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. P. Gygi, Y. Rochon, B. R. Franza, and R. Aebersold Correlation between Protein and mRNA Abundance in Yeast Mol. Cell. Biol., March 1, 1999; 19(3): 1720 - 1730. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||





