Skip Navigation


Nucleic Acids Research Advance Access originally published online on November 28, 2008
Nucleic Acids Research 2009 37(1):289-297; doi:10.1093/nar/gkn916
This Article
Right arrow Full Text Freely available
Right arrow Print PDF (533K) Freely available
Right arrow Screen PDF (194K) Freely available
Right arrow Supplementary Data
Right arrowOA All Versions of this Article:
37/1/289    most recent
gkn916v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (1)
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Parra, G.
Right arrow Articles by Korf, I.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Parra, G.
Right arrow Articles by Korf, I.
Related Collections
Right arrow Computational methods
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Nucleic Acids Research, 2009, Vol. 37, No. 1 289-297
© 2008 The Author(s)
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.


Genomics

Assessing the gene space in draft genomes

Genis Parra1, Keith Bradnam1, Zemin Ning2, Thomas Keane2 and Ian Korf1,*

1UC Davis Genome Center, University of California Davis, Davis, CA, USA and 2The Wellcome Trust Sanger Institute, Genome Campus, Hinxton, CB10 1SA, UK

*To whom correspondence should be addressed. Tel: +1 530 754 4989; Email: ifkorf{at}ucdavis.edu

Received July 17, 2008. Revised October 28, 2008. Accepted October 30, 2008.

Genome sequencing projects have been initiated for a wide range of eukaryotes. A few projects have reached completion, but most exist as draft assemblies. As one of the main reasons to sequence a genome is to obtain its catalog of genes, an important question is how complete or completable the catalog is in unfinished genomes. To answer this question, we have identified a set of core eukaryotic genes (CEGs), that are extremely highly conserved and which we believe are present in low copy numbers in higher eukaryotes. From an analysis of a phylogenetically diverse set of eukaryotic genome assemblies, we found that the proportion of CEGs mapped in draft genomes provides a useful metric for describing the gene space, and complements the commonly used N50 length and x-fold coverage values.


The authors wish it to be known that, in their opinion, the first two authors should be regarded as joint First Authors.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?




Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.