Skip Navigation

Nucleic Acids Research 2006 34(Database Issue):D452-D455; doi:10.1093/nar/gkj041
This Article
Right arrow Abstract Freely available
Right arrow Print PDF (2433K) Freely available
Right arrow Screen PDF (467K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Byrne, K. P.
Right arrow Articles by Wolfe, K. H.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Byrne, K. P.
Right arrow Articles by Wolfe, K. H.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Nucleic Acids Research, 2006, Vol. 34, Database issue D452-D455
© The Author 2006. Published by Oxford University Press. All rights reserved
The online version of this article has been published under an open access model. Users are entitled to use, reproduce, disseminate, or display the open access version of this article for non-commercial purposes provided that: the original authorship is properly and fully attributed; the Journal and Oxford University Press are attributed as the original place of publication with the correct citation details given; if an article is subsequently reproduced or disseminated not in its entirety but only in part or as a derivative work this must be clearly indicated. For commercial re-use, please contact journals.permissions{at}oxfordjournals.org


Article

Visualizing syntenic relationships among the hemiascomycetes with the Yeast Gene Order Browser

Kevin P. Byrne* and Kenneth H. Wolfe

Department of Genetics, Smurfit Institute, University of Dublin, Trinity College Dublin 2, Ireland

*To whom correspondence should be addressed. Tel: +353 1 608 1288; Fax: +353 1 679 8558; Email: kevin.byrne{at}tcd.ie

Received August 10, 2005. Revised October 3, 2005. Accepted October 3, 2005.


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 VISUAL USER INTERFACE
 GENOME BROWSING
 BIOINFORMATICS UTILITIES
 DATA RETRIEVAL
 CONCLUSION
 REFERENCES
 
The Yeast Gene Order Browser (YGOB) is an online tool designed to facilitate the comparative genomic visualization and appraisal of synteny within and between the genomes of seven hemiascomycete yeast species. Three of these genomes are polyploid, and hence contain intra-genomic syntenic regions, the correct assembly of which is a particular success of YGOB. Designed to accurately assemble, display and score gene order relationships, YGOB is both an interactive tool for browsing genomic data, and a software engine now being used for evolutionary analyses on a whole-genome scale. Underlying the online interface is the YGOB database, which consists of homology assignments across the species, extensively curated based on sequence similarity and novelly, an appraisal of genomic context (synteny) in multiple genomes. Currently the YGOB database incorporates genome data from Saccharomyces cerevisiae, Candida glabrata, Saccharomyces castellii, Ashbya gossypii, Kluyveromyces lactis, Kluyveromyces waltii and Saccharomyces kluyveri, but the system is scaleable to accommodate additional genomes. This paper discusses the usage and utility of version 1.0 of YGOB, which is publicly available at http://wolfe.gen.tcd.ie/ygob.


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 VISUAL USER INTERFACE
 GENOME BROWSING
 BIOINFORMATICS UTILITIES
 DATA RETRIEVAL
 CONCLUSION
 REFERENCES
 
The Yeast Gene Order Browser (YGOB) has been developed to take advantage of the potential the hemiascomycete yeasts offer for studying many aspects of genome evolution, including in particular the whole-genome duplication (WGD) event we proposed in 1997 (1) as having occurred in an ancestor of Saccharomyces cerevisiae, and which was conclusively confirmed in 2004 (24).

YGOB version 1.0 features seven genomes: three from lineages that separated after the WGD [‘post-WGD’ species: S.cerevisiae (5), Saccharomyces castellii (6) and Candida glabrata (4)], and four from outgroups [‘pre-WGD’ species: Ashbya gossypii (3), Kluyveromyces lactis (4), Kluyveromyces waltii (2) and Saccharomyces kluyveri (6)]. The phylogenetic relationship among these species and the position of the WGD event are shown in Figure 1A. YGOB consists of a curated database of homology assignments across these genomes, an intuitive and interactive online interface and software that accurately assesses and displays gene order (synteny).



View larger version (46K):
[in this window]
[in a new window]
 
Figure 1 (A) Approximate phylogenetic relationship of the yeasts included in YGOB, with the WGD event marked as a closed circle. The tree is based on Kurtzman et al. (10). (B) YGOB screenshot focused on the A.gossypii gene AGL112C with a window size of 6. Tracks are labeled at right. At the bottom of the interface is the control console, which allows users to select the window size and the gene to focus on. Each box in the display represents a gene, and each color, a chromosome. The ‘b’ buttons open a window with BLASTP results against YGOB's database, ‘S’ buttons click through to protein sequences, ‘T’ buttons draw approximate phylogenetic trees and ‘+’ buttons output YGOB data in a tabulated text format.

 
YGOB is publicly available online (wolfe.gen.tcd.ie/ygob) and is being used by our laboratory for a number of research projects. We believe it will be of similar utility to yeast researchers in general, and of interest to evolutionary biologists as a pioneering comparative genomics tool.

The algorithms involved in accurately assigning synteny and aligning chromosomal fragments, information on the curation of the genomic data and research carried out using YGOB are presented in detail elsewhere (7).


    VISUAL USER INTERFACE
 TOP
 ABSTRACT
 INTRODUCTION
 VISUAL USER INTERFACE
 GENOME BROWSING
 BIOINFORMATICS UTILITIES
 DATA RETRIEVAL
 CONCLUSION
 REFERENCES
 
Figure 1B shows a YGOB screenshot with a window size of 6. The tracks are labeled at right and show from top to bottom: the A tracks of the three post-WGD species, the four single-track pre-WGD species and the B tracks of the post-WGD species. Genes are represented by boxes stating the gene name, chromosome or scaffold name, and species name (except for S.cerevisiae, where systematic and genetic names are both shown instead). Each genome has a color palette that is used to distinguish genes from different chromosomes or contigs; for example, the three differently colored S.kluyveri contigs in Figure 1B. Genes without synteny are colored gray, for example the S.kluyveri gene 950.1 in the column marked ‘b’ in Figure 1B. The in-focus gene, which was used to compose the entire display, is highlighted by an orange border.

Connectors join nearby genes: solid connectors join adjacent genes, two small bars connect genes <5 genes apart, and one small bar connects genes <20 genes apart. These connectors are usually colored black, but are highlighted in orange if they denote an inversion. When there is an intervening space between genes, for example in the post-WGD tracks in Figure 1B, then the connectors are extended in gray over that space. The end of a chromosome or contig is denoted by a brace (e.g. S.kluyveri contigs in Figure 1B). The arrows under a gene box denote its relative transcriptional orientation. Whether a rightwards arrow corresponds to a Watson or a Crick strand gene is arbitrary and can be changed by using the ‘twist’ option, which reverses the left–right sequence of the columns. For this reason, no orientation arrow is shown below genes without synteny (i.e. those colored gray).

The online interface is a Perl (www.perl.org) application with the visual output created using the GD package (www.boutell.com/gd). Some advanced aspects of the control console (discussed below), but none of the features in the main visual interface, require users to have JavaScript enabled in their web browser. The YGOB source code is available on request (kevin.byrne{at}tcd.ie).


    GENOME BROWSING
 TOP
 ABSTRACT
 INTRODUCTION
 VISUAL USER INTERFACE
 GENOME BROWSING
 BIOINFORMATICS UTILITIES
 DATA RETRIEVAL
 CONCLUSION
 REFERENCES
 
The user can click on any other gene in the display (Figure 1B) to refocus the browser on it. This allows users to walk along chromosomes by repeatedly refocusing on genes at the edge of the display; these effectively serve as ‘step right’ and ‘step left’ buttons. Browsing a genomic region when focused on a pre-WGD genome often gives the best view of genes' syntenic context. This is because of the way the pre-WGD genomes act as a ‘scaffold’ for the sister genomic regions from post-WGD genomes (2,3,7). YGOB's display of homology information in the syntenic context of many genomes makes features such as annotation problems, fast evolving loci, species-specific genes, gene clusters, chromosomal rearrangements, ohnologs (paralogs arising from a WGD, e.g. column ‘a’ in Figure 1B), or cases of differential gene loss (e.g. column ‘c’ in Figure 1B) immediately apparent. A number of these features, in particular the patterns of gene loss, have been examined by us elsewhere (7).

The control console, which is at the bottom of YGOB's web interface (Figure 1B), allows a user to select several parameters: which genomes to display; the window size (between 4 and 50 columns can be shown on each side of the in-focus gene's column); and which gene to focus the display on. A new in-focus gene can be chosen by typing its name into the text box, as an alternative to clicking on one of the genes currently on screen. Gene naming conventions are explained in the online help, and synonyms are supported. The control console includes buttons that allow a user to invert the whole display output (‘twist’) or to swap the A and B tracks for the post-WGD species (‘flip’).


    BIOINFORMATICS UTILITIES
 TOP
 ABSTRACT
 INTRODUCTION
 VISUAL USER INTERFACE
 GENOME BROWSING
 BIOINFORMATICS UTILITIES
 DATA RETRIEVAL
 CONCLUSION
 REFERENCES
 
Every homology pillar and gene box in the YGOB visual interface (Figure 1B) features buttons offering ready access to standard bioinformatics tools (Figure 2). The ‘S’ (Sequence) button below a column clicks through to its protein sequences (Figure 2A). S.cerevisiae gene boxes have an ‘i’ (Information) button that clicks through to the Saccharomyces Genome Database (SGD) description (8) of the protein and its Gene Ontology terms (Figure 2B). Each gene box has a ‘b’ (BLAST) button, which opens a window with BLASTP results for that gene's protein against all the proteins in YGOB's database. In the pop-up window of BLASTP results (Figure 2C), gene names are highlighted in red (seen as dark gray highlighting in Figure 2C) if they are in the same column as the query gene; blue if they appear on screen but not in its column, and orange if they are a tandem copy of the query gene. Normally colored white, the ‘b’ button is orange when the gene has a tandem repeat on screen nearby. Finally, where there are ≥3 genes in a column, the ‘T’ (Tree) button draws an approximate phylogenetic tree (Figure 2D), generated on the fly using a T-Coffee alignment (9) and Phylip's Neighbor-Joining program, rooted with pre-WGD species if possible. Rolling the computer mouse over a gene's box will also display brief information for that gene, for example from SGD title lines in the case of S.cerevisiae, and GenBank annotation tags for other genomes.



View larger version (56K):
[in this window]
[in a new window]
 
Figure 2 YGOB bioinformatics utilities screenshots for the homology pillar containing the S.cerevisiae gene REF2, and labeled ‘c’ in Figure 1B. Clockwise from top left: (A) protein sequences for the pillar; (B) SGD and Gene Ontology information for REF2; (C) BLASTP results for Ref2 against the YGOB database; and (D) approximate phylogenetic tree for genes in the pillar.

 

    DATA RETRIEVAL
 TOP
 ABSTRACT
 INTRODUCTION
 VISUAL USER INTERFACE
 GENOME BROWSING
 BIOINFORMATICS UTILITIES
 DATA RETRIEVAL
 CONCLUSION
 REFERENCES
 
Every visualized YGOB page (Figure 1B) has a ‘+’ button in its bottom left-hand corner. Clicking the button displays the same YGOB data in a tabulated text format, showing each track as a column and each pillar as a row. This allows a user to easily retrieve the raw syntenic context information around a gene or region of interest. The YGOB website also hosts interactive lists of loci classed by syntenic locus type, allowing researchers to quickly acquire lists of genes/loci from particular classes or alternatively to browse through these lists online (e.g. a list of all loci that are single copy in S.cerevisiae but retained in duplicate in C.glabrata and S.castellii can be retrieved). In addition to this the website features lists of genes from locus classes further subdivided by Gene Ontology (www.geneontology.org) terms whereas interactive diagrams show the physical distribution of all locus classes over the three pre-WGD species.


    CONCLUSION
 TOP
 ABSTRACT
 INTRODUCTION
 VISUAL USER INTERFACE
 GENOME BROWSING
 BIOINFORMATICS UTILITIES
 DATA RETRIEVAL
 CONCLUSION
 REFERENCES
 
The YGOB database consists of a set of extensively curated homology assignments across the seven yeast genomes currently included. The use of syntenic information in assigning homology means these pillars are virtually free from the demonstrated limitations of BLAST. The YGOB online interface (wolfe.gen.tcd.ie/ygob) provides browsing access to this database via an interface that is intuitive to use, with ready access to various bioinformatics utilities. YGOB has been primarily designed to facilitate the use of gene order in the study of polyploid genomes and post-polyploid evolution, but by showing genes and homology in an accurate syntenic context, YGOB makes the characteristics of genomic regions or loci of interest immediately apparent. The success of this contextual syntenic approach in polyploid yeasts encourages us to extend it to other groups of species for which there is sufficient genomic data.


    ACKNOWLEDGEMENTS
 
We thank D. R. Scannell, J. L. Gordon and S. Wong for assistance with the development of YGOB. This study was supported by Science Foundation Ireland. Funding to pay the Open Access publication charges for this article was provided by Science Foundation Ireland.

Conflict of interest statement. None declared.


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 VISUAL USER INTERFACE
 GENOME BROWSING
 BIOINFORMATICS UTILITIES
 DATA RETRIEVAL
 CONCLUSION
 REFERENCES
 

  1. Wolfe, K.H. and Shields, D.C. (1997) Molecular evidence for an ancient duplication of the entire yeast genome Nature, 387, 708–713[CrossRef][Medline] .

  2. Kellis, M., Birren, B.W., Lander, E.S. (2004) Proof and evolutionary analysis of ancient genome duplication in the yeast Saccharomyces cerevisiae Nature, 428, 617–624[CrossRef][Medline] .

  3. Dietrich, F.S., Voegeli, S., Brachat, S., Lerch, A., Gates, K., Steiner, S., Mohr, C., Pohlmann, R., Luedi, P., Choi, S., et al. (2004) The Ashbya gossypii genome as a tool for mapping the ancient Saccharomyces cerevisiae genome Science, 304, 304–307[Abstract/Free Full Text] .

  4. Dujon, B., Sherman, D., Fischer, G., Durrens, P., Casaregola, S., Lafontaine, I., De Montigny, J., Marck, C., Neuveglise, C., Talla, E., et al. (2004) Genome evolution in yeasts Nature, 430, 35–44[CrossRef][Medline] .

  5. Goffeau, A., Barrell, B.G., Bussey, H., Davis, R.W., Dujon, B., Feldmann, H., Galibert, F., Hoheisel, J.D., Jacq, C., Johnston, M., et al. (1996) Life with 6000 genes Science, 274, 546 563–547[Abstract/Free Full Text] .

  6. Cliften, P., Sudarsanam, P., Desikan, A., Fulton, L., Fulton, B., Majors, J., Waterston, R., Cohen, B.A., Johnston, M. (2003) Finding functional features in Saccharomyces genomes by phylogenetic footprinting Science, 301, 71–76[Abstract/Free Full Text] .

  7. Byrne, K.P. and Wolfe, K.H. (2005) The Yeast Gene Order Browser: combining curated homology and syntenic context reveals gene fate in polyploid species Genome Res, . 15, 1456–1461[Abstract/Free Full Text] .

  8. Christie, K.R., Weng, S., Balakrishnan, R., Costanzo, M.C., Dolinski, K., Dwight, S.S., Engel, S.R., Feierbach, B., Fisk, D.G., Hirschman, J.E., et al. (2004) Saccharomyces Genome Database (SGD) provides tools to identify and analyze sequences from Saccharomyces cerevisiae and related sequences from other organisms Nucleic Acids Res, . 32, D311–D314[Abstract/Free Full Text] .

  9. Notredame, C., Higgins, D.G., Heringa, J. (2000) T-Coffee: a novel method for fast and accurate multiple sequence alignment J. Mol. Biol, . 302, 205–217[CrossRef][Web of Science][Medline] .

  10. Kurtzman, C.P. and Robnett, C.J. (2003) Phylogenetic relationships among yeasts of the ‘Saccharomyces complex’ determined from multigene sequence analyses FEMS Yeast Res, . 3, 417–432[CrossRef][Web of Science][Medline] .


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
MicrobiologyHome page
Z. Luo and H. J. J. van Vuuren
Functional analyses of PAU genes in Saccharomyces cerevisiae
Microbiology, December 1, 2009; 155(12): 4036 - 4049.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
N. Conde e Silva, I. R. Goncalves, M. Lemaire, E. Lesuisse, J. M. Camadro, and P. L. Blaiseau
KlAft, the Kluyveromyces lactis Ortholog of Aft1 and Aft2, Mediates Activation of Iron-Responsive Transcription Through the PuCACCC Aft-Type Sequence
Genetics, September 1, 2009; 183(1): 93 - 106.
[Abstract] [Full Text] [PDF]


Home page
Mol. Biol. CellHome page
Y. Nakajima, R. G. Tyers, C. C.L. Wong, J. R. Yates III, D. G. Drubin, and G. Barnes
Nbl1p: A Borealin/Dasra/CSC-1-like Protein Essential for Aurora/Ipl1 Complex Function and Integrity in Saccharomyces cerevisiae
Mol. Biol. Cell, March 15, 2009; 20(6): 1772 - 1784.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
M. J. A. van Hoek and P. Hogeweg
The Role of Mutational Dynamics in Genome Shrinkage
Mol. Biol. Evol., November 1, 2007; 24(11): 2485 - 2494.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
C. J. Penkett, J. A. Morris, V. Wood, and J. Bahler
YOGY: a web-based, integrated database to retrieve protein orthologs and associated Gene Ontology terms.
Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W330 - W334.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
M. Y. Galperin
The Molecular Biology Database Collection: 2006 update.
Nucleic Acids Res., January 1, 2006; 34(Database issue): D3 - D5.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Print PDF (2433K) Freely available
Right arrow Screen PDF (467K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Byrne, K. P.
Right arrow Articles by Wolfe, K. H.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Byrne, K. P.
Right arrow Articles by Wolfe, K. H.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?