Skip Navigation

This Article
Right arrow Full Text Freely available
Right arrow Print PDF (460K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (64)
Right arrowRequest Permissions
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Jansen, R.
Right arrow Articles by Gerstein, M.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Jansen, R.
Right arrow Articles by Gerstein, M.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Nucleic Acids Research, 2000, Vol. 28, No. 6 1481-1488
© 2000 Oxford University Press

Analysis of the yeast transcriptome with structural and functional categories: characterizing highly expressed proteins

Ronald Jansen and Mark Gerstein*

Department of Molecular Biophysics and Biochemistry, 266 Whitney Avenue, Yale University, PO Box 208114, New Haven, CT 06520, USA

We analyzed 10 genome expression data sets by large-scale cross-referencing against broad structural and functional categories. The data sets, generated by different techniques (e.g. SAGE and gene chips), provide various representations of the yeast transcriptome (the set of all yeast genes, weighted by transcript abundance). Our analysis enabled us to determine features more prevalent in the transcriptome than the genome: i.e. those that are common to highly expressed proteins. Starting with simplest categories, we find that, relative to the genome, the transcriptome is enriched in Ala and Gly and depleted in Asn and very long proteins. We find, furthermore, that protein length and maximum expression level have a roughly inverse relationship. To relate expression level and protein structure, we assigned transmembrane helices and known folds (using PSI-blast) to each protein in the genome; this allowed us to determine that the transcrip­tome is enriched in mixed {alpha}–ß structures and depleted in membrane proteins relative to the genome. In particular, some enzymatic folds, such as the TIM barrel and the G3P dehydrogenase fold, are much more prevalent in the transcriptome than the genome, whereas others, such as the protein-kinase and leucine-zipper folds, are depleted. The TIM barrel, in fact, is overwhelmingly the ‘top fold’ in the transcriptome, while it only ranks fifth in the genome. The most highly enriched functional categories in the transcriptome (based on the MIPS system) are energy production and protein synthesis, while categories such as transcription, transport and signaling are depleted. Furthermore, for a given functional category, transcriptome enrichment varies quite substantially between the different expression data sets, with a variation an order of magnitude larger than for the other categories cross-referenced (e.g. amino acids). One can readily see how the enrichment and depletion of the various functional categories relates directly to that of particular folds. Further information can be found at http://bioinfo.mbb.yale.edu/genome/expression

* To whom correspondence should be addressed. Tel: +1 203 432 6105; Fax: +1 360 838 7861; Email: mark.gerstein@yale.edu


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Eukaryot CellHome page
H. E. Hallen and F. Trail
The L-Type Calcium Ion Channel Cch1 Affects Ascospore Discharge and Mycelial Growth in the Filamentous Fungus Gibberella zeae (Anamorph Fusarium graminearum)
Eukaryot. Cell, February 1, 2008; 7(2): 415 - 424.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
P. K. Ingvarsson
Gene Expression and Protein Length Influence Codon Usage and Rates of Sequence Evolution in Populus tremula
Mol. Biol. Evol., March 1, 2007; 24(3): 836 - 844.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
J. D. Bloom, D. A. Drummond, F. H. Arnold, and C. O. Wilke
Structural Determinants of the Rate of Protein Evolution in Yeast
Mol. Biol. Evol., September 1, 2006; 23(9): 1751 - 1761.
[Abstract] [Full Text] [PDF]


Home page
Toxicol SciHome page
X. Yu, W. C. Griffith, K. Hanspers, J. F. Dillman III, H. Ong, M. A. Vredevoogd, and E. M. Faustman
A System-Based Approach to Interpret Dose- and Time-Dependent Microarray Data: Quantitative Integration of Gene Ontology Analysis for Risk Assessment
Toxicol. Sci., August 1, 2006; 92(2): 560 - 577.
[Abstract] [Full Text] [PDF]


Home page
Eukaryot CellHome page
R. Caesar, J. Warringer, and A. Blomberg
Physiological Importance and Identification of Novel Targets for the N-Terminal Acetyltransferase NatB
Eukaryot. Cell, February 1, 2006; 5(2): 368 - 378.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
V. Poroyko, L.G. Hejlek, W.G. Spollen, G.K. Springer, H.T. Nguyen, R.E. Sharp, and H.J. Bohnert
The Maize Root Transcriptome by Serial Analysis of Gene Expression
Plant Physiology, July 1, 2005; 138(3): 1700 - 1710.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
B. Lemos, B. R. Bettencourt, C. D. Meiklejohn, and D. L. Hartl
Evolution of Proteins and Gene Expression Levels are Coupled in Drosophila and are Independently Associated with mRNA Abundance, Protein Length, and Number of Protein-Protein Interactions
Mol. Biol. Evol., May 1, 2005; 22(5): 1345 - 1354.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
A. Sethi, P. O'Donoghue, and Z. Luthey-Schulten
Evolutionary profiles from the QR factorization of multiple sequence alignments
PNAS, March 15, 2005; 102(11): 4045 - 4050.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
A. Ruepp, A. Zollner, D. Maier, K. Albermann, J. Hani, M. Mokrejs, I. Tetko, U. Guldener, G. Mannhaupt, M. Munsterkotter, et al.
The FunCat, a functional annotation scheme for systematic classification of proteins from whole genomes
Nucleic Acids Res., October 14, 2004; 32(18): 5539 - 5545.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
J. M. Comeron
Selective and Mutational Patterns Associated With Gene Expression in Humans: Influences on Synonymous Composition and Intron Presence
Genetics, July 1, 2004; 167(3): 1293 - 1304.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
A. O. Urrutia and L. D. Hurst
The Signature of Selection Mediated by Expression on Human Genes
Genome Res., October 1, 2003; 13(10): 2260 - 2264.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
H. Akashi
Translational Selection and Yeast Proteome Evolution
Genetics, August 1, 2003; 164(4): 1291 - 1303.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
N. M. Luscombe, T. E. Royce, P. Bertone, N. Echols, C. E. Horak, J. T. Chang, M. Snyder, and M. Gerstein
ExpressYourself: a modular platform for processing and visualizing microarray data
Nucleic Acids Res., July 1, 2003; 31(13): 3477 - 3482.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
R. Jansen, H. J. Bussemaker, and M. Gerstein
Revisiting the codon adaptation index from a whole-genome perspective: analyzing the relationship between gene expression and codon occurrence in yeast using a variety of models
Nucleic Acids Res., April 15, 2003; 31(8): 2242 - 2251.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
N. Echols, P. Harrison, S. Balasubramanian, N. M. Luscombe, P. Bertone, Z. Zhang, and M. Gerstein
Comprehensive analysis of amino acid and nucleotide composition in eukaryotic genomes, comparing genes and pseudogenes
Nucleic Acids Res., June 1, 2002; 30(11): 2515 - 2523.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
K. Lin, Y. Kuang, J. S. Joseph, and P. R. Kolatkar
Conserved codon composition of ribosomal protein coding genes in Escherichia coli, Mycobacterium tuberculosis and Saccharomyces cerevisiae: lessons from supervised machine learning in functional genomics
Nucleic Acids Res., June 1, 2002; 30(11): 2599 - 2607.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
H. Akashi and T. Gojobori
Metabolic efficiency and amino acid composition in the proteomes of Escherichia coli and Bacillus subtilis
PNAS, March 19, 2002; 99(6): 3695 - 3700.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
R. Jansen, D. Greenbaum, and M. Gerstein
Relating Whole-Genome Expression Data with Protein-Protein Interactions
Genome Res., January 1, 2002; 12(1): 37 - 46.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
D. Greenbaum, N. M. Luscombe, R. Jansen, J. Qian, and M. Gerstein
Interrelating Different Types of Genomic Data, from Proteome to Secretome: 'Oming in on Function
Genome Res., September 1, 2001; 11(9): 1463 - 1468.
[Abstract] [Full Text] [PDF]


Home page
BloodHome page
Z. Lian, L. Wang, S. Yamaga, W. Bonds, Y. Beazer-Barclay, Y. Kluger, M. Gerstein, P. E. Newburger, N. Berliner, and S. M. Weissman
Genomic and proteomic analysis of the myeloid differentiation program
Blood, August 1, 2001; 98(3): 513 - 524.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
J. Qian, B. Stenger, C. A. Wilson, J. Lin, R. Jansen, S. A. Teichmann, J. Park, W. G. Krebs, H. Yu, V. Alexandrov, et al.
PartsList: a web-based system for dynamically ranking protein folds based on disparate attributes, including whole-genome expression and interaction information
Nucleic Acids Res., April 15, 2001; 29(8): 1750 - 1764.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
P. M. Harrison, N. Echols, and M. B. Gerstein
Digging for dead genes: an analysis of the characteristics of the pseudogene population in the Caenorhabditis elegans genome
Nucleic Acids Res., February 1, 2001; 29(3): 818 - 830.
[Abstract] [Full Text] [PDF]



Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.