Skip Navigation


Nucleic Acids Research Advance Access originally published online on November 7, 2006
Nucleic Acids Research 2007 35(Database issue):D332-D338; doi:10.1093/nar/gkl828
This Article
Right arrow Abstract Freely available
Right arrow Print PDF (147K) Freely available
Right arrow Screen PDF (150K) Freely available
Right arrowOA All Versions of this Article:
35/suppl_1/D332    most recent
gkl828v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Gregory, T. R.
Right arrow Articles by Bennett, M. D.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Gregory, T. R.
Right arrow Articles by Bennett, M. D.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Nucleic Acids Research, 2007, Vol. 35, Database issue D332-D338
© 2006 The Author(s)
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.


Articles

Eukaryotic genome size databases

T. Ryan Gregory*, James A. Nicol1, Heidi Tamm2, Bellis Kullman2, Kaur Kullman3, Ilia J. Leitch4, Brian G. Murray5, Donald F. Kapraun6, Johann Greilhuber7 and Michael D. Bennett4

Department of Integrative Biology, University of Guelph Guelph, Ontario, N1G 2W1, Canada 1 Glossopteris Web Design and Development, Sydney Australia 2 Institute of Agricultural and Environmental Sciences Estonian University of Life Sciences, 181 Riia Street, 51014 Tartu, Estonia 3 Trump Trading Ltd, Tallinn Estonia 4 Jodrell Laboratory, Royal Botanic Gardens Kew, Richmond, Surrey TW9 3AB, UK 5 School of Biological Sciences, University of Auckland Private Bag 92019, Auckland, New Zealand 6 Department of Biological Sciences, University of North Carolina-Wilmington 601 South College Road, Wilmington, NC 28403-3915, USA 7 Institute of Botany and Botanical Garden of the University of Vienna, Rennweg 14 A 1030 Vienna, Austria

*To whom correspondence should be addressed. Tel: +1 519 824 4120, ext. 58053; Fax: +1 519 767 1656; Email: rgregory{at}uoguelph.ca

Received August 14, 2006. Accepted October 4, 2006.


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 PLANT DNA C-VALUES DATABASE
 USAGE OF THE DATABASE
 ANIMAL GENOME SIZE DATABASE
 FUNGAL GENOME SIZE DATABASE
 FUTURE PROSPECTS
 REFERENCES
 
Three independent databases of eukaryotic genome size information have been launched or re-released in updated form since 2005: the Plant DNA C-values Database (www.kew.org/genomesize/homepage.html), the Animal Genome Size Database (www.genomesize.com) and the Fungal Genome Size Database (www.zbi.ee/fungal-genomesize/). In total, these databases provide freely accessible genome size data for >10 000 species of eukaryotes assembled from more than 50 years' worth of literature. Such data are of significant importance to the genomics and broader scientific community as fundamental features of genome structure, for genomics-based comparative biodiversity studies, and as direct estimators of the cost of complete sequencing programs.


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 PLANT DNA C-VALUES DATABASE
 USAGE OF THE DATABASE
 ANIMAL GENOME SIZE DATABASE
 FUNGAL GENOME SIZE DATABASE
 FUTURE PROSPECTS
 REFERENCES
 
Eukaryotic genome size data are becoming increasingly important both as the basis for comparative research into genome evolution and as direct estimators of the cost and difficulty of genome sequencing programs for an expanding sphere of non-model organisms (13). Nuclear DNA content data for >10 000 species of plants, animals and fungi are made freely available through three independent databases of eukaryotic genome size that have been either launched or re-released since 2005: the Plant DNA C-values Database (http://www.kew.org/genomesize/homepage.html) (4), the Animal Genome Size Database (www.genomesize.com) (5) and the Fungal Genome Size Database (www.zbi.ee/fungal-genomesize) (6).

Genome sizes are typically given as gametic nuclear DNA contents (‘C-values’) either in units of mass (picograms, where 1 pg = 10–12 g) or in number of base pairs (in eukaryotes, most often in megabases, where 1 Mb = 106 bases). These are directly interconvertible as 1 pg = 978 Mb (or 1 Mb = 1.022 x 10–3 pg) (7). The majority of modern genome size estimates are based on either Feulgen densitometry (more recently using computerized image analysis) or flow cytometry, although DNA reassociation kinetics, bulk fluorometry, static fluorometry, electrophoretic methods, quantitative real-time PCR and complete genome sequencing have also been used. Data from all such measurements are compiled into the databases along with updated taxonomy, analytical details and other relevant information (e.g. chromosome number) where available.

The first genome size estimates were conducted in the late 1940s, and the earliest attempt at a comprehensive list was provided ~25 years later by Sparrow et al. (8). M.D. Bennett and colleagues carried on this important effort by publishing a series of lists for botanical genome size data beginning in 1976. Unfortunately, zoological and mycological counterparts were not forthcoming for another 30 years, aside from a few taxon-specific compilations based on a small number of sources [e.g. (9)] or online lists of limited scope [e.g. Database of Genome Sizes (http://www.cbs.dtu.dk/databases/DOGS/) and DBA Mammalian Genome Size Database (http://www.unipv.it/webbio/dbagsdb.htm)]. The databases described below, therefore, provide the first truly comprehensive catalogues of eukaryotic genome size data and represent a much-needed resource for members of the genomics community.


    PLANT DNA C-VALUES DATABASE
 TOP
 ABSTRACT
 INTRODUCTION
 PLANT DNA C-VALUES DATABASE
 USAGE OF THE DATABASE
 ANIMAL GENOME SIZE DATABASE
 FUNGAL GENOME SIZE DATABASE
 FUTURE PROSPECTS
 REFERENCES
 
Early development
By 1995, four major lists of angiosperm genome sizes had been published, which together contained data for 2802 species (1013). Although the lists were well used (collectively, they have been cited >1500 times as on August 2006), it became increasingly cumbersome to determine whether a particular species was listed. It was therefore decided to pool the values into a single database and release them on the internet. The resulting Angiosperm DNA C-values Database was compiled by M.D. Bennett and I.J. Leitch and was coded by the Information Services Department at the Royal Botanic Gardens, Kew; it went live in April 1997. Between 1997 and 2001, two updates of the Angiosperm DNA C-values Database were released and the Pteridophyte DNA C-values Database was added.

Based on the evident utility and high usage of these two databases, efforts were initiated to construct counterparts for other plant taxa as data became available. Ultimately, this led to the assembly of the overarching Plant DNA C-values Database, which was made available through Java-based queries of a SyBase database and was released in September 2001. Initially, it contained C-values for 3864 species from the four land plant groups [angiosperms, gymnosperms, pteridophytes (comprising monilophytes and lycophytes) and bryophytes], with available genome size estimates for three algal groups (Rhodophyta, Chlorophyta and Phaeophyta) added to Release 3.0 in December 2004.

Coverage and features of the Plant DNA C-values Database, Release 4.0
Release 4.0 of the Plant DNA C-values Database, launched in October 2005, contains genome sizes for 5150 species including 4427 angiosperms, 207 gymnosperms, 87 pteridophytes, 176 bryophytes, and 253 algae compiled from over 550 publications or personal communications (4). Tables 1 and 2 provide a breakdown of absolute and relative coverage of the major plant groups. Table 1 also gives the minimum, maximum and mean C-values for each of the major groups and shows that genome sizes in plants range from 0.01 pg in some unicellular algae (e.g. Cyanidium caldarium) to 127.4 pg in the tetraploid angiosperm Fritillaria assyriaca. Most of these data have been acquired through either Feulgen densitometry of stained root tip squashes (63%) or flow cytometry of freshly chopped leaf material (31%) using a range of plant calibration standards. These data are accessible through a variety of search options that allow users to either analyze C-value data across different groups of plants (by clicking on the Plant DNA C-values Database icon), or by searching within taxonomically specific subsections of the database (by clicking on the appropriate plant group icon).


View this table:
[in this window]
[in a new window]

 
Table 1 Minimum, maximum and mean 1C DNA amounts for each of the major plant groups in the Plant DNA C-values Database (Release 4.0, October 2005), together with the current level of species representation of C-value data

 


View this table:
[in this window]
[in a new window]

 
Table 2 Representation of gymnosperms and angiosperms at different taxonomic levels in the Plant DNA C-values Database (Release 4.0, October 2005)

 
The database contains information where available for the following fields:
  1. Plant group (e.g. angiosperm, gymnosperm and pteridophyte).
  2. Family.
  3. Genus.
  4. Species.
  5. Taxonomic authority.
  6. Genome size, for which users have the choice of outputting data in pg or Mb and choosing among 1C, 2C and 4C DNA content. (In plants, the measurement of genome size by Feulgen microdensitometry involves determining the amount of staining in mitotic or meiotic dividing cells, typically prepared from actively dividing root tips. The most suitable stage for measurement was considered to be prophase, as chromatin in metaphase or telophase is too condensed. As prophase cells contain a fully replicated genome with a 4C DNA amount it is these values that were reported in publications. Today, 2C values are usually given.)
  1. Ploidy level.
  2. Chromosome number.
  3. Method used to estimate the genome size.
  4. Information on taxonomic vouchers that may exist for the species analyzed.
  5. The full bibliographic reference from which the original data were taken.

At the end of each output, the number of records returned and summary statistics (minimum, maximum, mean and standard deviations) of these records are given.

Additional search options further enhance the flexibility of the database:

  1. All versus prime estimates. Where multiple genome size estimates exist for a given species, users have the choice of outputting all estimates or only the ‘prime’ estimate. The availability of additional, non-prime estimates for a species provides the user with an indication of the range of values that have been reported. In some cases the differences point to genuine intraspecific variation (e.g. Zea mays) but in others they highlight discrepancies attributable to either taxonomic or methodological errors in genome size estimation (14,15). Recent reviews covering potential problems in genome size estimation include those by Greilhuber (15) for Feulgen densitometry and Dolezel and Bartos (16) for flow cytometry.
  2. Wild card searches. An asterisk (*) can be used to indicate wild cards in searches that include only partial names.
  3. From/to searches. To restrict searches based on numeric data (i.e. chromosome number, ploidy level, DNA amount), users can set criteria in the ‘from’ and ‘to’ boxes of the query page. As examples, user may use this feature:
    • To limit the results of a query to taxa with diploid chromosome numbers between 18 and 36 (inclusive) by entering 18 in the ‘from’ box and 36 in the ‘to’ box for chromosome numbers.
    • To limit the search to taxa with only 18 chromosomes, by placing this number in both the ‘from’ and ‘to’ boxes.
    • To select all records having a diploid number of 18 or greater, by entering 18 in the ‘from’ box and leaving the ‘to’ box empty.

  4. Sorting results. The results of searches are automatically sorted by increasing 1C DNA amounts in picograms. To sort the results by family, genus, species, taxonomic authority, chromosome number or ploidy level, users can select their appropriate choice from the drop-down box under the option ‘Sort by’ at the bottom of the Query form.

In addition to searching the entire database in this way, users can choose to search subsections of the database by selecting the specific plant group of interest (i.e. angiosperms, gymnosperms, pteridophytes, bryophytes or algae) from the homepage. In doing so, the user is provided with additional options for querying and/or outputting that are of unique relevance to each taxon:

  1. Angiosperms:
    • Angiosperm group (i.e. monocots, eudicots or basal angiosperms).
    • Life cycle type (i.e. annual, biennial and perennial).
    • Family. In particular, users have the choice of displaying either the family name given in the original source of the genome size data or the assigned family following the Angiosperm Phylogeny Group (APG) circumscription (17).

  2. Gymnosperms:
    • Gymnosperm group [i.e. Cycadales, Ginkgoales, Gnetales, Pinaceae or Coniferales II (all conifer families excluding Pinaceae)].
    • Sperm flagella number (i.e. multiflagellate or none).

  3. Pteridophytes:
    • Pteridophyte group.
    • Spore type (i.e. homosporous or heterosporous).
    • Sporangium type (i.e. eusporangiate or leptosporangiate).
    • Sperm flagella number (i.e. biflagellate or multiflagellate).

  4. Bryophytes:
    • Bryophyte group (i.e. hornwort, liverwort or moss).

  5. Algae:
    • Algal group (i.e. Chlorophyta, Phaeophyta or Rhodophyta).

Users are required to provide an email address to query the database, which aids in the tracking of usage and in the protection of intellectual property, but otherwise there are no restrictions whatsoever on access.

Besides genome size data, the database includes a summary of the development and release history of the database, instructions on how to search the database, author contact information, links to other databases containing genome size data, and the meeting reports from the international Plant Genome Size meetings, two of which have been held to date (in 1997 and 2003) at the Royal Botanic Gardens, Kew.


    USAGE OF THE DATABASE
 TOP
 ABSTRACT
 INTRODUCTION
 PLANT DNA C-VALUES DATABASE
 USAGE OF THE DATABASE
 ANIMAL GENOME SIZE DATABASE
 FUNGAL GENOME SIZE DATABASE
 FUTURE PROSPECTS
 REFERENCES
 
The Plant DNA C-values Database has been widely used, with >110 000 hits from over 55 countries since its (re-)launch in 2001. On average, the database receives 2000–3000 hits per month with a mean of >60 queries per day, with each query downloading on average 110 genome size estimates. As on August 2006, the database has been cited in ~130 publications since its initial launch as the Angiosperm DNA C-values Database in 1997.


    ANIMAL GENOME SIZE DATABASE
 TOP
 ABSTRACT
 INTRODUCTION
 PLANT DNA C-VALUES DATABASE
 USAGE OF THE DATABASE
 ANIMAL GENOME SIZE DATABASE
 FUNGAL GENOME SIZE DATABASE
 FUTURE PROSPECTS
 REFERENCES
 
Early development
The first large-scale compilation of animal genome size data was created for an analysis of the correlation between genome size and erythrocyte size in mammals (18), which was later expanded for a similar study in birds (19). Recognizing the severe limitations on the study of animal genome size variation posed by the lack of access to such data, these unpublished datasets were expanded to include data from both vertebrates and invertebrates and were posted online as the Animal Genome Size Database on January 10, 2001. This initial release consisted only of flat text tables and included ~2900 animal species. As data continued to be added over the ensuing 5 years, the flat table format became increasingly cumbersome in terms of both updates and for the growing number of users.

Coverage and features of the Animal Genome Size Database, Release 2.0
A completely redesigned Release 2.0 of the Animal Genome Size Database was launched on December 24, 2005, meant to coincide approximately with the 5-year anniversary of the database (5). Rather than flat tables, the database has been converted to a MySQL database accessed through a user-friendly website coded in XHTML, CSS, Javascript and PHP. Its search tools also employ some AJAX (Asynchronous Javascript and XML) features, and some Flash charts are used in information display. At the time of this writing, the database contains 5677 records from 601 sources, covering 2953 species of vertebrates and 1323 invertebrates. Reported animal genome sizes range >4000-fold, from ~0.03 pg in the root-knot nematode Meloidogyne graminicola to ~133 pg in the marbled lungfish Protopterus aethiopicus. Table 3 provides more detailed breakdowns of the available data, including the ranges, means, and absolute and relative coverage of the major animal groups.


View this table:
[in this window]
[in a new window]

 
Table 3 Summary of the content of the Animal Genome Size Database as on August 2006, showing the number of records (i.e. including multiple entries for the same species), species coverage in absolute numbers and percentage of described diversity (in parentheses; note that for many invertebrate taxa only a minority of species have been described), ranges in reported genome sizes, and mean of available genome size data

 
Animal genome size data are accessed through either browse or search functions. The browse function allows users to select an entire group of animals (e.g. mammals, insects), or to select subsections of the database using progressive pull-down menus ranging in specificity from phylum to species. The advanced search feature allows a variety of queries, including genus, species or common name, as well as options to select genome sizes equal to, less than/greater than or between user-specified values. Finally, it is also possible to retrieve all records generated using a given method, standard species or cell type.

Data are returned in customizable dynamic tables, with users specifying the number of records displayed per page (100, 250, 500 or All). The default results page includes taxonomic details (Phylum/Subphylum, Class, Order, Family, Genus, Species, common name), C-value in pg, chromosome number (where available), and the method, cell type and standard species used in the analysis. The source is given as a numbered reference with a hotlink to the full citation. Two courses of action are possible from this results table: (i) the data can be downloaded and can be viewed using Excel (with the spreadsheet following the same customized format as the dynamic tables), or (ii) users can click on species names to enter individual species pages. The latter option provides a detailed record for the species of choice, including taxonomic and methodological details, the C-value estimate from the chosen record as well as links to other available records for the same species, chromosome number, the full source citation and both internal links (e.g. to call up data for all members of the genus, family, order, etc.) and external links [e.g. to NCBI, image searches and both general (e.g. the Integrated Taxonomic Information Service) and specific (e.g. FishBase, AmphibiaWeb) taxonomic databases as applicable]. There are no limitations on browsing or searching the database, but downloading data to Excel requires users to input a name and valid email address as a digital signature of a data sharing agreement. A randomized and limited-duration link to the compiled spreadsheet is then emailed to the input address as a means of protecting intellectual property without hindering access to information.

Release 2.0 of the Animal Genome Size Database also provides users with up-to-the-minute summary statistics for the entire database and each major taxonomic group and subgroup therein, number of species covered, min/max, mean ± standard error, a breakdown of methods, cell types, standards used for all records in the given group, and a brief summary of the major patterns and correlates reported to date for the taxon in question. Other features available to users include a real-time Flash-based graphical summary of the total dataset, relevant announcements and a list of the 10 most recently added records on the main page, as well as a fully searchable reference list, an FAQ, author contact information, links to related sites and a genome size discussion forum.

Usage of the database
Traffic at the Animal Genome Size Database has increased steadily since its launch in 2001, and the main page now receives 50–100 unique visitors per day. Records regarding individual queries are not kept, but a typical data download includes all data for one or more entire groups of animals (i.e. up to several hundred species for a particular vertebrate group). The database has been cited in ~90 publications since 2001.


    FUNGAL GENOME SIZE DATABASE
 TOP
 ABSTRACT
 INTRODUCTION
 PLANT DNA C-VALUES DATABASE
 USAGE OF THE DATABASE
 ANIMAL GENOME SIZE DATABASE
 FUNGAL GENOME SIZE DATABASE
 FUTURE PROSPECTS
 REFERENCES
 
Development and coverage of the Fungal Genome Size Database, Release 1.0
In a discussion of the plant and animal genome size databases penned in mid-2004, it was noted that ‘unfortunately, equivalent databases have not yet been compiled for fungi or "protists", although this would clearly be a worthy project for experts in those groups to undertake' (3). On March 20, 2005, a major portion of this gap had been filled with the launch of the Fungal Genome Size Database (6).

Numerous relative genome sizes (i.e. in arbitrary units) had been estimated in the late 1980s and early 1990s by researchers at the University of Regensburg in Germany using a classical cytophotometry technique, including 287 records for Basidiomycetes (20,21) and 743 for Ascomycetes (22). Using the same method as well as flow cytometry and image cytometry, and by employing an internal standard (Saccharomyces cerevisiae), it became possible to convert these estimates from arbitrary units into far more informative absolute genome sizes in Mb (2325). These converted data formed the basis of the Fungal Genome Size Database, which has since been expanded to include 1298 records covering 739 species and 335 genera from 40 orders (Table 4) based on the taxonomy of the Index Fungorum Partnership (www.indexfungorum.org) (26).


View this table:
[in this window]
[in a new window]

 
Table 4 Number of records in fungal genome size database

 
Data from the Fungal Genome Size Database are made available through queries (PHP, HTML) of a MySQL database. The user and administrative interfaces for the database are generated by a CMS system developed by Trump Trading Ltd (TTCMS). The data can be queried by different taxonomic levels (phylum, order, genus, species epithet, variety) as well as by ploidy level, chromosome number, chromosome size range, method of genome size estimation, standard specimens used, cell type analyzed and source reference. Responses to queries are presented as HTML tables, with detailed information about given records (e.g. herbarium index, original reference and additional remarks) provided in a separate pop-up window accessed by clicking on a given genus or species name in the main table.

Compared with plants and animals, fungi display very small genomes: ~90% of the available fungal data lie within the range of 1C = 10–60 Mb, with an average of ~37 Mb and a median of 28 Mb (Figure 1). The largest fungal genome size reported to date, that of Scutellospora castanea (Diversisporales) is a mere 795 Mb (0.81 pg) (27), whereas the smallest, 6.5 Mb (0.007 pg) in Pneumocystis carinii f. sp. muris (Pneumocystidales), is far more miniscule than even the most streamlined animal or non-algal plant genomes (www.broad.mit.edu/annotation/fungi/fgi/FGI_01_whitepaper_2002.pdf) (28).


Figure 1
View larger version (13K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Figure 1 Histogram presenting fungal genome sizes (Mb) in the database. A majority of genome size estimates cover the range from 10 to 60 Mb. The odd values are labeled with species names.

 
As with plants (and to a far lesser but not insignificant degree with animals), ploidy level variability is an important consideration in fungi. Ploidy level (x) has been estimated for 1036 (80%) of the records in the database, and varies from 1x to 50x. Diploidy (2x) is the single most commonly observed level (36% of records), although haploidy (1x) is also common; a level of 50x has been reported for only one species, Neottiella rutilans (22). Chromosome numbers have been reported for 81 of the species included in the database, ranging from n = 3 in Schizosaccharomyces pombe (Schizosaccharomycetales) (29) to n = 20 in Ustilago hordei (Ustilaginales) and Batrachochytrium dendrobatidis (Chytridiales) (28,30).

In both plants and animals, the majority of variation among estimates for individual species is attributed to experimental error (3,14,15). In fungi, however, it remains unclear to what extent apparent intraspecific variation is non-artifactual as data regarding heteroploidy in this group remain controversial (20,31,32). There is evidence that interspecific hybrids may occur in most fungal phyla, with both sexual and asexual origins evident among the growing list of apparent fungal hybrids (33). Hybrids may be diploid or maintain the dikaryotic state, they may undergo karyogamy and normal meiosis to reconstitute the euploid state, or they may undergo abnormal meiosis to yield a heteroploid hybrid. During vegetative growth, chromosomes and chromosome segments can be lost at random, which would generate legitimate variation in estimated genome sizes.

Electrophoretic karyotyping has shown that variation in chromosome number and size is a rule rather than an exception for many, mostly asexual, species (32). This method indicated that genome size in Pleurotus ostreatus (Agaricales) ranges from 20.8 to 35.1 Mb (0.021–0.036 pg, a relative difference of >60%) and chromosome number ranges from 6 to 11 (34,35). Using flow cytometry, genome size in the same species appears to range from 18.5 to 28.7 Mb (0.019–0.021 pg, a 55% difference) (B. Kullman, unpublished data), whereas microfluorometric measurements resulted in a reported range of 24.0–27.53 Mb (0.025–0.028 pg, a 15% difference) (21). It bears noting, however, that even small absolute differences among estimates that might be considered within the margin of measurement error in plants or animals (e.g. 0.01 pg) translate into substantial relative differences in species with such tiny genomes.

Usage of the database
At this early stage, the database receives ~10–20 unique hits per day, and at the time of this writing has been visited by >9000 visitors from around the world.


    FUTURE PROSPECTS
 TOP
 ABSTRACT
 INTRODUCTION
 PLANT DNA C-VALUES DATABASE
 USAGE OF THE DATABASE
 ANIMAL GENOME SIZE DATABASE
 FUNGAL GENOME SIZE DATABASE
 FUTURE PROSPECTS
 REFERENCES
 
Taken together, the three eukaryotic genome size databases represent some of the broadest genetic datasets available, covering >10 000 species. In relative terms, however, this comprises a very small minority of eukaryotic diversity. It is therefore a primary objective of modern genome size research to greatly increase the coverage of taxa in all three kingdoms. Perhaps the least well studied of all, however, are the members of the extremely diverse (and paraphyletic) assemblage commonly known as ‘protists’. The construction of a database of genome sizes for this group, and subsequent efforts to fill the gaps therein, represents an equivalently high priority. Overall, the release of these databases has proved to be a boon for the advancement of knowledge about eukaryotic genome structure and evolution, and has made it possible for the first time to identify the key areas still in need of intensive study.


    ACKNOWLEDGEMENTS
 
The authors wish to thank their many colleagues and collaborators for assistance with various aspects of the construction and maintenance of the genome size databases. Work on the Animal Genome Size Database has been supported by the Natural Sciences and Engineering Research Council of Canada in the form of several scholarships, fellowships and grants to T.R.G. Research leading to the development of the Fungal Genome Size Database was supported by Estonian Science Foundation grant number 4989 to B.K. The Open Access publication charges for this article were waived by Oxford University Press.

Conflict of interest statement. None declared.


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 PLANT DNA C-VALUES DATABASE
 USAGE OF THE DATABASE
 ANIMAL GENOME SIZE DATABASE
 FUNGAL GENOME SIZE DATABASE
 FUTURE PROSPECTS
 REFERENCES
 

  1. Bennett, M.D. and Leitch, I.J. (2005) Genome size evolution in plants In Gregory, T.R. (Ed.). The Evolution of the Genome, San Diego, CA Elsevier pp. 89–162 .

  2. Gregory, T.R. (2005) Synergy between sequence and size in large-scale genomics Nature Rev. Genet, . 6, 699–708[Medline] .

  3. Gregory, T.R. (2005) Genome size evolution in animals In Gregory, T.R. (Ed.). The Evolution of the Genome, San Diego, CA Elsevier pp. 3–87 .

  4. Bennett, M.D. and Leitch, I.J. (2005) Plant DNA C-values Database .

  5. Gregory, T.R. (2005) Animal Genome Size Database .

  6. Kullman, B., Tamm, H., Kullman, K. (2005) Fungal Genome Size Database .

  7. Dolezel, J., Bartos, J., Voglmayr, H., Greilhuber, J. (2003) Nuclear DNA content and genome size of trout and human Cytometry, 51A, 127–128 .

  8. Sparrow, A.H., Price, H.J., Underbink, A.G. (1972) A survey of DNA content per cell and per chromosome of prokaryotic and eukaryotic organisms: some evolutionary considerations In Smith, H.H. (Ed.). Evolution of Genetic Systems, New York Gordon and Breach pp. 451–494 .

  9. Tiersch, T.R. and Wachtel, S.S. (1991) On the evolution of genome size of birds J. Hered, . 82, 363–368[Abstract/Free Full Text] .

  10. Bennett, M.D. and Smith, J.B. (1976) Nuclear DNA amounts in angiosperms Philos. Trans. R. Soc. Lond. Ser. B, 274, 227–274[Web of Science][Medline] .

  11. Bennett, M.D., Smith, J.B., Heslop-Harrison, J.S. (1982) Nuclear DNA amounts in angiosperms Proc. R. Soc. Lond. B, 216, 179–199[Abstract/Free Full Text] .

  12. Bennett, M.D. and Smith, J.B. (1991) Nuclear DNA amounts in angiosperms Philos. Trans. R. Soc. Lond. Ser. B, 334, 309–345[CrossRef] .

  13. Bennett, M.D. and Leitch, I.J. (1995) Nuclear DNA amounts in angiosperms Ann. Bot, . 76, 113–176[Abstract/Free Full Text] .

  14. Greilhuber, J. (1998) Intraspecific variation in genome size: a critical reassessment Ann. Bot, . 82, Suppl. A, 27–35[Abstract/Free Full Text] .

  15. Greilhuber, J. (2005) Intraspecific variation in genome size in angiosperms—identifying its existence Ann. Bot, . 95, 91–98[Abstract/Free Full Text] .

  16. Dolezel, J. and Bartos, J. (2005) Plant DNA flow cytometry and estimation of nuclear genome size Ann. Bot, . 95, 99–110[Abstract/Free Full Text] .

  17. Angiosperm Phylogeny Group II. (2003) An update of the Angiosperm Phylogeny Group classification for the orders and families of flowering plants Bot. J. Linnean Soc, . 141, 399–436[CrossRef] .

  18. Gregory, T.R. (2000) Nucleotypic effects without nuclei: genome size and erythrocyte size in mammals Genome, 43, 895–901[Medline] .

  19. Gregory, T.R. (2002) A bird's-eye view of the C-value enigma: genome size, cell size, and metabolic rate in the class Aves Evolution, 56, 121–130[Web of Science][Medline] .

  20. Bresinsky, A., Wittmann-Meixner, B., Weber, E., Fischer, M. (1987) Karyologische Untersuchungen an Pilzen mittels Fluoreszenzmikroskopie Z. Mykol, . 53, 303–318 .

  21. Wittmann-Meixner, B. (1989) Polyploidie bei Pilzen Biblioth. Mycol, . 131, 1–163 .

  22. Weber, E. (1992) Untersuchungen zu Fortpflanzung und Ploidie verschiedener Ascomyceten Biblioth. Mycol, . 140, 1–186 .

  23. Kullman, B. (2000) Application of flow cytometry for measurement of nuclear DNA content in fungi Folia Cryptog. Estonica, 36, 31–46 .

  24. Kullman, B. (2002) Nuclear DNA content, life cycle and ploidy in two Neottiella species (Pezizales, Ascomycetes) Persoonia, 18, 103–115 .

  25. Kullman, B. and Teterin, W. (2006) Estimation of fungal genome size: comparison of image cytometry and photometric cytometry Folia Cryptog. Estonica, 42, 43–56 .

  26. Index Fungorum Partnership. (2004) Index Fungorum. Custodians CABI Bioscience, CBS and Landcare Research .

  27. Hijri, M. and Sanders, J.R. (2005) Low gene copy number shows that arbuscular mycorrhizal fungi inherit genetically different nuclei Nature, 433, 160–163[CrossRef][Medline] .

  28. Birren, B., Fink, G., Lander, E. (2002) Fungal Genome Initiative White Paper .

  29. Wood, V., Gwilliam, R., Rajandream, M.A. (2002) The genome sequence of Schizosaccharomyces pombe Nature, 415, 871–880[CrossRef][Medline] .

  30. McCluskey, K. and Mills, D. (1990) Identification and characterization of chromosome length polymorphisms among strains representing fourteen races of Ustilago hordei Mol. Plant- Micr. Interact, . 3, 336–373 .

  31. Tolmsoff, W.J. (1983) Heteroploidy as a mechanism of variability among fungi Annu. Rev. Phytopathol, . 21, 317–340[CrossRef][Web of Science] .

  32. Beadle, J., Wright, M., McNeely, L., Bennett, J.W. (2003) Electrophoretic karyotype analysis in fungi Adv. Appl. Microbiol, . 53, 243–270[Web of Science][Medline] .

  33. Schardl, C.L. and Craven, K.D. (2003) Interspecific hybridization in plant-associated fungi and oomycetes: a review Mol. Ecol, . 12, 2861–2873[CrossRef][Medline] .

  34. Sagawa, I. and Nagata, Y. (1992) Analysis of chromosomal DNA of mushrooms in genus Pleurotus by pulsed field gel electrophoresis J. Gen. Appl. Microbiol, . 38, 47–52 .

  35. Ramírez, L., Larraya, L.M., Pisabarro, A.G. (2000) Molecular tools for breeding basidiomycetes Int. Microbiol, . 3, 147–152[Medline] .

  36. Kapraun, D.F. (2005) Nuclear DNA content estimates in multicellular eukaryotic green, red and brown algae: phylogenetic considerations Ann. Bot, . 95, 7–44[Abstract/Free Full Text] .

  37. Qiu, Y.L. and Palmer, J.D. (1999) Phylogeny of early land plants: insights from genes and genomes Trends. Plant Sci, . 4, 26–30[CrossRef][Web of Science][Medline] .

  38. Murray, B.G., Leitch, I.J., Bennett, M.D. (2001) Gymnosperm DNA C-values Database .

  39. Greilhuber, J., Borsch, T., Müller, K., Worberg, A., Porembski, S., Barthlott, W. (2006) Smallest angiosperm genomes found in Lentibulariaceae with chromosomes of bacterial size Plant Biol, . in press .


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Gen Biol EvolHome page
A. T. Beckenbach and J. B. Joy
Evolution of the Mitochondrial Genomes of Gall Midges (Diptera: Cecidomyiidae): Rearrangement and Severe Truncation of tRNA Genes
Gen Biol Evol, August 21, 2009; 2009(0): 278 - 287.
[Abstract] [Full Text] [PDF]


Home page
J. Gen. Virol.Home page
T. M. Work, J. Dagenais, G. H. Balazs, J. Schumacher, T. D. Lewis, J.-A. C. Leong, R. N. Casey, and J. W. Casey
In vitro biology of fibropapilloma-associated turtle herpesvirus and host cells in Hawaiian green turtles (Chelonia mydas)
J. Gen. Virol., August 1, 2009; 90(8): 1943 - 1950.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
F. Zhao, J. Qi, and S. C. Schuster
Tracking the past: Interspersed repeats in an extinct Afrotherian mammal, Mammuthus primigenius
Genome Res., August 1, 2009; 19(8): 1384 - 1392.
[Abstract] [Full Text] [PDF]


Home page
Biol LettHome page
J. D.L. Smith and T. R. Gregory
The genome sizes of megabats (Chiroptera: Pteropodidae) are remarkably constrained
Biol Lett, June 23, 2009; 5(3): 347 - 351.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
R. W. Shultz, T.-J. Lee, G. C. Allen, W. F. Thompson, and L. Hanley-Bowdoin
Dynamic Localization of the DNA Replication Proteins MCM5 and MCM7 in Plants
Plant Physiology, June 1, 2009; 150(2): 658 - 669.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
G. Parra, K. Bradnam, Z. Ning, T. Keane, and I. Korf
Assessing the gene space in draft genomes
Nucleic Acids Res., January 1, 2009; 37(1): 289 - 297.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
J. J. Grzymski, A. E. Murray, B. J. Campbell, M. Kaplarevic, G. R. Gao, C. Lee, R. Daniel, A. Ghadiri, R. A. Feldman, and S. C. Cary
Metagenome analysis of an extreme microbial symbiosis reveals eurythermal adaptation and metabolic flexibility
PNAS, November 11, 2008; 105(45): 17516 - 17521.
[Abstract] [Full Text] [PDF]


Home page
ANN BOT (LOND)Home page
P. Smarda, P. Bures, L. Horova, and O. Rotreklova
Intrapopulation Genome Size Dynamics in Festuca pallens
Ann. Bot., October 1, 2008; 102(4): 599 - 607.
[Abstract] [Full Text] [PDF]


Home page
ScienceHome page
B. S. Gaut and J. Ross-Ibarra
Selection on Major Components of Angiosperm Genomes
Science, April 25, 2008; 320(5875): 484 - 486.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
E. Gladyshev and M. Meselson
Extreme resistance of bdelloid rotifers to ionizing radiation
PNAS, April 1, 2008; 105(13): 5139 - 5144.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
T. J. Buza, F. M. McCarthy, N. Wang, S. M. Bridges, and S. C. Burgess
Gene Ontology annotation quality analysis in model eukaryotes
Nucleic Acids Res., February 2, 2008; 36(2): e12 - e12.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
M. Xie, A. Mosig, X. Qi, Y. Li, P. F. Stadler, and J. J.-L. Chen
Structure and Function of the Smallest Vertebrate Telomerase RNA from Teleost Fish
J. Biol. Chem., January 25, 2008; 283(4): 2049 - 2059.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
Y. Nakatani, H. Takeda, Y. Kohara, and S. Morishita
Reconstruction of the vertebrate ancestral genome reveals dynamic genome reorganization in early vertebrates
Genome Res., September 1, 2007; 17(9): 1254 - 1265.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
C. M. Miller-Butterworth, W. J. Murphy, S. J. O'Brien, D. S. Jacobs, M. S. Springer, and E. C. Teeling
A Family Matter: Conclusive Resolution of the Taxonomic Position of the Long-Fingered Bats, Miniopterus
Mol. Biol. Evol., July 1, 2007; 24(7): 1553 - 1561.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Print PDF (147K) Freely available
Right arrow Screen PDF (150K) Freely available
Right arrowOA All Versions of this Article:
35/suppl_1/D332    most recent
gkl828v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Gregory, T. R.
Right arrow Articles by Bennett, M. D.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Gregory, T. R.
Right arrow Articles by Bennett, M. D.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?