Nucleic Acids Research Advance Access originally published online on October 5, 2006
Nucleic Acids Research 2006 34(19):5623-5630; doi:10.1093/nar/gkl723
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Nucleic Acids Research, 2006, Vol. 34, No. 19 5623-5630
© 2006 The Author(s)
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
Computational Biology |
MetaGene: prokaryotic gene finding from environmental genome shotgun sequences
Department of Computational Biology, Graduate School of Frontier Sciences, University of Tokyo Kashiwa, Chiba 277-8562, Japan
*To whom correspondence should be addressed. Tel: +81 4 7136 3973; Fax: +81 4 7136 4100; Email: hide{at}cb.k.u-tokyo.ac.jp
Received March 18, 2006. Revised September 1, 2006. Accepted September 19, 2006.
Exhaustive gene identification is a fundamental goal in all metagenomics projects. However, most metagenomic sequences are unassembled anonymous fragments, and conventional gene-finding methods cannot be applied. We have developed a prokaryotic gene-finding program, MetaGene, which utilizes di-codon frequencies estimated by the GC content of a given sequence with other various measures. MetaGene can predict a whole range of prokaryotic genes based on the anonymous genomic sequences of a few hundred bases, with a sensitivity of 95% and a specificity of 90% for artificial shotgun sequences (700 bp fragments from 12 species). MetaGene has two sets of codon frequency interpolations, one for bacteria and one for archaea, and automatically selects the proper set for a given sequence using the domain classification method we propose. The domain classification works properly, correctly assigning domain information to more than 90% of the artificial shotgun sequences. Applied to the Sargasso Sea dataset, MetaGene predicted almost all of the annotated genes and a notable number of novel genes. MetaGene can be applied to wide variety of metagenomic projects and expands the utility of metagenomics.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
V. K. Sharma, N. Kumar, T. Prakash, and T. D. Taylor MetaBioME: a database to explore commercially useful enzymes in metagenomic datasets Nucleic Acids Res., November 11, 2009; (2009) gkp1001v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Kosakovsky Pond, S. Wadhawan, F. Chiaromonte, G. Ananda, W.-Y. Chung, J. Taylor, A. Nekrutenko, and The Galaxy Team Windshield splatter analysis with the Galaxy metagenomic pipeline Genome Res., November 1, 2009; 19(11): 2144 - 2153. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. M. Lauro, D. McDougald, T. Thomas, T. J. Williams, S. Egan, S. Rice, M. Z. DeMaere, L. Ting, H. Ertan, J. Johnson, et al. From the Cover: Feature Article: The genomic basis of trophic strategy in marine bacteria PNAS, September 15, 2009; 106(37): 15527 - 15533. [Abstract] [Full Text] [PDF] |
||||
![]() |
G.-Q. Hu, J.-T. Guo, Y.-C. Liu, and H. Zhu MetaTISA: Metagenomic Translation Initiation Site Annotator for improving gene start prediction Bioinformatics, July 15, 2009; 25(14): 1843 - 1845. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. J. Hoff, T. Lingner, P. Meinicke, and M. Tech Orphelia: predicting genes in metagenomic sequencing reads Nucleic Acids Res., July 1, 2009; 37(suppl_2): W101 - W105. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Hattori and T. D. Taylor The Human Intestinal Microbiome: A New Frontier of Human Biology DNA Res, February 1, 2009; 16(1): 1 - 12. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Kunin, A. Copeland, A. Lapidus, K. Mavromatis, and P. Hugenholtz A Bioinformatician's Guide to Metagenomics Microbiol. Mol. Biol. Rev., December 1, 2008; 72(4): 557 - 578. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Noguchi, T. Taniguchi, and T. Itoh MetaGeneAnnotator: Detecting Species-Specific Patterns of Ribosomal Binding Site for Precise Gene Prediction in Anonymous Prokaryotic and Phage Genomes DNA Res, December 1, 2008; 15(6): 387 - 396. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Kurokawa, T. Itoh, T. Kuwahara, K. Oshima, H. Toh, A. Toyoda, H. Takami, H. Morita, V. K. Sharma, T. P. Srivastava, et al. Comparative Metagenomics Revealed Commonly Enriched Gene Sets in Human Gut Microbiomes DNA Res, October 16, 2007; (2007) dsm018v2. [Abstract] [Full Text] [PDF] |
||||





