Nucleic Acids Research, 2003, Vol. 31, No. 8 2242-2251
© 2003 Oxford University Press
Revisiting the codon adaptation index from a whole-genome perspective: analyzing the relationship between gene expression and codon occurrence in yeast using a variety of models
1 Department of Molecular Biophysics and Biochemistry, 2 Computer Science, 266 Whitney Avenue, Yale University, PO Box 208114, New Haven, CT 06520, USA, 3 Department of Biological Sciences and Center for Computational Biology and Bioinformatics, Columbia University, 1212 Amsterdam Avenue MC2441, New York, NY 10027, USA
Ronald Jansen, Computational Biology Center, Memorial Sloan-Kettering Cancer Center, 307 East 63rd Street, New York, NY 10021, USA
Highly expressed genes in many bacteria and small eukaryotes often have a strong compositional bias, in terms of codon usage. Two widely used numerical indices, the codon adaptation index (CAI) and the codon usage, use this bias to predict the expression level of genes. When these indices were first introduced, they were based on fairly simple assumptions about which genes are most highly expressed: the CAI was originally based on the codon composition of a set of only 24 highly expressed genes, and the codon usage on assumptions about which functional classes of genes are highly expressed in fast-growing bacteria. Given the recent advent of genome-wide expression data, we should be able to improve on these assumptions. Here, we measure, in yeast, the degree to which consideration of the current genome-wide expression data sets improves the performance of both numerical indices. Indeed, we find that by changing the parameterization of each model its correlation with actual expression levels can be somewhat improved, although both indices are fairly insensitive to the exact way they are parameterized. This insensitivity indicates a consistent codon bias amongst highly expressed genes. We also attempt direct linear regression of codon composition against genome-wide expression levels (and protein abundance data). This has some similarity with the CAI formalism and yields an alternative model for the prediction of expression levels based on the coding sequences of genes. More information is available at http://bioinfo.mbb.yale.edu/expression/codons.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
C. E. Popescu, T. Borza, J. P. Bielawski, and R. W. Lee Evolutionary Rates and Expression Level in Chlamydomonas Genetics, March 1, 2006; 172(3): 1567 - 1576. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Hanfrey, K. A. Elliott, M. Franceschetti, M. J. Mayer, C. Illingworth, and A. J. Michael A Dual Upstream Open Reading Frame-based Autoregulatory Circuit Controlling Polyamine-responsive Translation J. Biol. Chem., November 25, 2005; 280(47): 39229 - 39237. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Wang, J. T. Prince, and E. M. Marcotte Mass spectrometry of the M. smegmatis proteome: Protein expression levels correlate with function, operons, and codon bias Genome Res., August 1, 2005; 15(8): 1118 - 1126. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Wu, D. E. Culley, and W. Zhang Predicted highly expressed genes in the genomes of Streptomyces coelicolor and Streptomyces avermitilis and the implications for their metabolism Microbiology, July 1, 2005; 151(7): 2175 - 2187. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Carbone, F. Kepes, and A. Zinovyev Codon Bias Signatures, Organization of Microorganisms in Codon Space, and Lifestyle Mol. Biol. Evol., March 1, 2005; 22(3): 547 - 561. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. P.C. Rocha Codon usage bias from tRNA's point of view: Redundancy, specialization, and efficient decoding for translation optimization Genome Res., November 1, 2004; 14(11): 2279 - 2286. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. R. Gerbasi, C. M. Weaver, S. Hill, D. B. Friedman, and A. J. Link Yeast Asc1p and Mammalian RACK1 Are Functionally Orthologous Core 40S Ribosomal Proteins That Repress Gene Expression Mol. Cell. Biol., September 15, 2004; 24(18): 8276 - 8287. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Karlin, J. Theriot, and J. Mrazek Comparative analysis of gene expression among low G+C gram-positive genomes PNAS, April 20, 2004; 101(16): 6182 - 6187. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Graumann, L. A. Dunipace, J. H. Seol, W. H. McDonald, J. R. Yates III, B. J. Wold, and R. J. Deshaies Applicability of Tandem Affinity Purification MudPIT to Pathway Proteomics in Yeast Mol. Cell. Proteomics, March 1, 2004; 3(3): 226 - 237. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. P. C. Rocha and A. Danchin Gene essentiality determines chromosome organisation in bacteria Nucleic Acids Res., November 15, 2003; 31(22): 6570 - 6577. [Abstract] [Full Text] [PDF] |
||||








