Skip Navigation

This Article
Right arrow Abstract Freely available
Right arrow Print PDF (79K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (45)
Right arrowRequest Permissions
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Nakamura, Y
Right arrow Articles by Ikemura, T
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Nakamura, Y
Right arrow Articles by Ikemura, T
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© 1996 Oxford University Press 214-215

Footnote

Codon usage tabulated from the international DNA sequence databases

Codon usage tabulated from the international DNA sequence databases Yasukazu Nakamura , Ken-nosuke Wada 1 , Yoshiko Wada , Hirofumi Doi 2 , Shigehiko Kanaya 3 , Takashi Gojobori and Toshimichi Ikemura*

National Institute of Genetics and The Graduate University for Advanced Studies, Mishima, Shizuoka 411, Japan, 1 ATR Limited, Seika, Kyoto 619-02, Japan, 2 Fujitsu Laboratories Ltd, Chiba 261, Japan and 3 Yamagata University, Yonezawa, Yamagata 992, Japan

Received August 31, 1995 ; Revised and Accepted October 4, 1995

ABSTRACT

Codon usage in 87 602 genes has been calculated using the nucleotide sequence data obtained from the GenBank Genetic Sequence Data Bank (Release 90.0; September 1995). The database is called the CUTG Database; the complete form of the database can be obtained by anonymous ftp from DDBJ and a part of the database, which lists the frequency of codon use in each organism, is made searchable through our World Wide Web server.

SOURCE AND METHODS

Codon usage in individual genes has been calculated using the nucleotide sequence data obtained from the GenBank Genetic Sequence Data Bank (Release 90.0; September 1995). The compilation of codon usage is synchronized with each major release of GenBank. The resulting database is called the CUTG database ( 1 - 5 ).

In selecting protein coding sequences we relied on the FEATURES tables of GenBank, and only complete genes without unambiguous bases were used in the analysis. In GenBank, a group of consecutive genes whose entire region had been sequenced were registered under one LOCUS name. To distinguish the different genes belonging to a single LOCUS, the symbol # followed by a number is added after the LOCUS name; the numbers represent the order of the CDS registered in the FEATURES table of GenBank. When introns of a gene have not been completely sequenced, some of its exons are registered in separate entries (LOCUS) in GenBank. These exons, belonging to the same gene but having different LOCUS names, were combined into one entry and the first LOCUS name is added.

For the biological significance of codon usage, see Ikemura ( 6 ) and Aota and Ikemura ( 7 , 8 ).

FILES

Files of the present database, containing codon usage of 87 602 CDSs of 4805 species, are available by anonymous ftp from DDBJ. Files named as gb***.codon list the codon use in each gene registered in the GenBank Sequence files (gb***.seq). The LOCUS names given in GenBank were used to designate individual genes. Each LOCUS name is followed by fields of information extracted from the FEATURES of each CDS for defining each open reading frame analyzed here. The order of the codons in the table is the same as the previous compilation (see the CODON_LABEL file or REFERENCES).

To reveal the characteristics of codon use of a wide range of organisms, as well as viruses and organella, the frequency (per 1000) of codon use in 461 organisms for which >20 genes are available was calculated by summing up numbers of codon used. World Wide Web clients, such as NCSA Mosaic and Netscape, may be used to query this file. A user can display a codon usage table by clicking an anchor for selecting species or searching with species' name (Fig. 1 ).


Figure 1 . Snapshot of the CUTG home page.

DISTRIBUTION AND ACCESS

Complete form of the database is available by anonymous ftp from DDBJ:

ftp://ftp.nig.ac.jp/pub/db/codon/GB90.

The file README contains the latest information on the database in plain text format.

The frequencies of codon use in 461 organisms for which >20 genes are available can be accessed on the following WWW server:

http://tisun4a.lab.nig.ac.jp/codon/CUTG.html.

Comments on the database can be sent to cutg{at}lab.nig.ac.jp by e-mail.

ACKNOWLEDGEMENTS

The authors wish to thank Dr Y. Ugawa at the DNA Information and Stock Center, National Institute of Agrobiological Resources, for help in constructing the database. This work was supported by a grant-in-aid for creative basic research (Human Genome Project) and for scientific research on a priority area (Genome Informatics) and by a grant-in-aid of scientific research from the Ministry of Education, Science and Culture of Japan.

REFERENCES

1 Maruyama, T., Gojobori, T., Aota, S. and Ikemura, T. (1986) Nucleic Acids Res., 14, r151-r197.

2 Aota, S. Gojobori, T., Ishibashi, F., Maruyama, T. and Ikemura, T. (1988) Nucleic Acids Res., 16, r315-r402.

3 Wada, K., Aota, S., Tsuchiya, R., Ishibashi, F., Gojobori. and Ikemura, T. (1990) Nucleic Acids Res., 18, 2367-2441. MEDLINE Abstract

4 Wada, K., Wada, Y., Doi, H., Ishibashi, F., Gojobori, T. and Ikemura, T. (1991) Nucleic Acids Res., 19, 1991-1986.

5 Wada, K., Wada, Y., Ishibashi, F., Gojobori, T. and Ikemura, T. (1992) Nucleic Acids Res., 20, 2111-2118. MEDLINE Abstract

6 Ikemura, T. (1985) Mol. Biol. Evol., 2, 13-24. MEDLINE Abstract

7 Aota, S. and Ikemura, T. (1986) Nucleic Acids Res., 14, 6345-6355. MEDLINE Abstract

8 Aota, S. and Ikemura, T. (1986) Nucleic Acids Res., 14, 8702.


Return

* To whom correspondence should be addressed
Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
J. Biol. Chem.Home page
P. Hariharan, W. Liang, S.-H. Chou, and D.-H. Chin
A New Model for Ligand Release: ROLE OF SIDE CHAIN IN GATING THE ENEDIYNE ANTIBIOTIC
J. Biol. Chem., June 9, 2006; 281(23): 16025 - 16033.
[Abstract] [Full Text] [PDF]


Home page
ScienceHome page
J. Elf, D. Nilsson, T. Tenson, and M. Ehrenberg
Selective Charging of tRNA Isoacceptors Explains Patterns of Codon Usage
Science, June 13, 2003; 300(5626): 1718 - 1722.
[Abstract] [Full Text] [PDF]


Home page
J. Virol.Home page
J. Zhou, W. J. Liu, S. W. Peng, X. Y. Sun, and I. Frazer
Papillomavirus Capsid Protein Expression Level Depends on the Match between Codon Usage and tRNA Availability
J. Virol., June 1, 1999; 73(6): 4972 - 4982.
[Abstract] [Full Text]


Home page
Proc. Natl. Acad. Sci. USAHome page
P. Tranque, M. C.-Y. Hu, G. M. Edelman, and V. P. Mauro
rRNA complementarity within mRNAs: A possible basis for mRNA-ribosome interactions and translational control
PNAS, October 13, 1998; 95(21): 12238 - 12243.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
D. Parsonage, J. Luba, T. C. Mallett, and A. Claiborne
The Soluble alpha -Glycerophosphate Oxidase from Enterococcus casseliflavus. SEQUENCE HOMOLOGY WITH THE MEMBRANE-ASSOCIATED DEHYDROGENASE AND KINETIC ANALYSIS OF THE RECOMBINANT ENZYME
J. Biol. Chem., September 11, 1998; 273(37): 23812 - 23822.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
M. Platzer, G. Rotman, D. Bauer, T. Uziel, K. Savitsky, A. Bar-Shira, S. Gilad, Y. Shiloh, and A. Rosenthal
Ataxia-Telangiectasia Locus: Sequence Analysis of 184 kb of Human Genomic DNA Containing the Entire ATM Gene
Genome Res., June 1, 1997; 7(6): 592 - 605.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
M. F. Linton, M. Raabe, V. Pierotti, and S. G. Young
Reading-frame Restoration by Transcriptional Slippage at Long Stretches of Adenine Residues in Mammalian Cells
J. Biol. Chem., May 30, 1997; 272(22): 14127 - 14132.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
V. Charrier, E. Buckley, D. Parsonage, A. Galinier, E. Darbon, M. Jaquinod, E. Forest, J. Deutscher, and A. Claiborne
Cloning and Sequencing of two Enterococcal glpK Genes and Regulation of the Encoded Glycerol Kinases by Phosphoenolpyruvate-dependent, Phosphotransferase System-catalyzed Phosphorylation of a Single Histidyl Residue
J. Biol. Chem., May 30, 1997; 272(22): 14166 - 14174.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Print PDF (79K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (45)
Right arrowRequest Permissions
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Nakamura, Y
Right arrow Articles by Ikemura, T
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Nakamura, Y
Right arrow Articles by Ikemura, T
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?