Nucleic Acids Research, 2005, Vol. 33, Database issue D358-D363
© 2005, the authors
Nucleic Acids Research, Vol. 33, Database issue © Oxford University Press 2005; all rights reserved
The Candida Genome Database (CGD), a community resource for Candida albicans gene and protein information
Department of Genetics, Stanford University School of Medicine, CCSR 2255, 269 Campus Drive, Stanford, CA 94305-5120, USA
* To whom correspondence should be addressed. Tel: +1 650 736 0075; Fax: +1 650 724 3701; Email: arnaudm{at}genome.stanford.edu
Received August 13, 2004; Revised August 30, 2004; Accepted September 8, 2004
| ABSTRACT |
|---|
|
|
|---|
The Candida Genome Database (CGD) is a new database that contains genomic information about the opportunistic fungal pathogen Candida albicans. CGD is a public resource for the research community that is interested in the molecular biology of this fungus. CGD curators are in the process of combing the scientific literature to collect all C.albicans gene names and aliases; to assign gene ontology terms that describe the molecular function, biological process, and subcellular localization of each gene product; to annotate mutant phenotypes; and to summarize the function and biological context of each gene product in free-text description lines. CGD also provides community resources, including a reservation system for gene names and a colleague registry through which Candida researchers can share contact information and research interests. CGD is publicly funded (by NIH grant R01 DE15873-01 from the NIDCR) and is freely available at http://www.candidagenome.org/.
| INTRODUCTION |
|---|
|
|
|---|
Candida albicans is the best studied of the human fungal pathogens, and it serves as a model organism for the study of other pathogenic fungi. In recent years, the frequency of fungal infections has steadily grown and although these infections are generally less frequent than bacterial infections, at least two aspects make them increasingly important. First, opportunistic infections in immunocompromised patients represent an increasingly common cause of mortality and morbidity (1,2). Second, many of the currently used antifungal compounds (3,4) are often of limited use because of their toxicity and side effects (5). Furthermore, within the last decade there has been an emergence of anti-fungal drug resistance, which was a rarity in the past (610). By serving as a resource for scientists who study fungal biology and pathogenesis, the Candida Genome Database (CGD) aims to facilitate progress toward more complete understanding of and effective treatment for fungal diseases.
Before CGD was created, three web sites contained information about the C.albicans genome sequence and about C.albicans gene products. The Stanford Genome Technology Center undertook the process of sequencing and the difficult challenge of assembling the sequence of this diploid organism (11), and their web site provides options for searching and downloading the genome sequence. CandidaDB, at the Pasteur Institute, was the first freely available C.albicans database; it contains sequence-based annotation for assemblies 6 and 19 of the genome sequence (http://genolist.pasteur.fr/CandidaDB/). The third resource was developed by the Candida Annotation Working Group, colleagues who came together on a volunteer basis, to analyze the C.albicans sequence produced by the Stanford Genome Technology Center. The results of the Annotation Working Group's efforts include a high quality set of gene annotations and gene ontology (GO) terms assigned by sequence-based prediction. The Annotation Working Group's annotation and sequence analysis tools are accessible on a web site hosted at the Biotechnology Research Institute of the National Research Council in Canada (http://candida.bri.nrc.ca/candida/index.cfm).
The Candida research community expressed a need for a database with additional features: comprehensive literature curation, to complement the high quality sequence-based annotation already available; a more extensive set of sequence retrieval and analysis tools, similar to those provided at the Saccharomyces Genome Database (SGD) (12); and centralized community information, such as a colleague directory and a gene name registry. CGD was proposed to meet these needs. CGD is based on the framework of SGD, using the same software, user interfaces, and underlying schema. The format and tools will therefore be familiar to CGD users who are already users of SGD. CGD started with the Candida Annotation Working Group's informative data set, and the CGD curators are now adding published material from the literature.
| LOCUS PAGE |
|---|
|
|
|---|
Similar to SGD, CGD contains gene information organized around locus pages. An example locus page is shown in Figure 1.
|
The locus page displays basic information about a gene and its product. The gene name is displayed prominently at the top of the page along with all aliases, including names assigned during sequencing and sequence assembly. Also found near the top of the page is the description, which is a concise statement of the most important information known about the gene and the gene product, especially its function, biological context, and physical characteristics. Each gene product is assigned GO terms (13) that describe its molecular functions, its location within the cell, and the biological processes in which it participates. The GO annotation section of each locus page contains a link to the GO annotation page, which shows all GO terms along with the references that were used to make each assignment and the type of evidence that supports it. An example GO annotation page is shown in Figure 2. Each GO term name, both on the locus page and on the GO annotation page, links to a graphical view that allows users to see parent and child relationships for each term, to navigate within the ontologies, and to view summary information about all of the Candida genes assigned to any given GO term.
|
Initially, CGD GO curation has focused on one or a few references that describe each gene product. With time, CGD will collect GO terms comprehensively, such that the database will list all of the papers that support assignment of each term, rather than listing only a more limited set of representative papers. The rationale for assigning GO terms from each paper is that the number of independent pieces of evidence for assignment to a particular GO term can be a measure of confidence in that assignment.
The locus page also contains a mutant phenotype section. This section lists the type of mutation (e.g. homozygous null, heterozygous null, or overexpression) and any corresponding phenotype. At this time, phenotypes are collected from the literature as free-text descriptions. Each phenotype that is displayed on the locus page is hyperlinked to a list of all C.albicans genes that share that mutant phenotype. The locus page also presents a link to a page that lists the references in which specific phenotypes are described. This page also contains phenotype details, including additional information about the specific conditions under which some phenotypes have been observed.
| LITERATURE INTERFACE |
|---|
|
|
|---|
CGD contains a wealth of information about the C.albicans scientific literature. This information is available in several formats within the literature guide. The literature guide, which is accessed from the menu on the right-hand side of each locus page, provides a list of papers that characterize a particular gene. These lists were generated by using an automated search of the PubMed database at NCBI, and have been manually screened to eliminate spurious references. Next to each reference there is a list of all the genes described in the paper, and each gene name is hyperlinked to its corresponding locus page. As each reference is curated, curators note whether the paper pertains to any of a set of 45 literature topics. These topics are based on the set that is used by SGD, but have been expanded to include additional topics of special interest to the Candida research community. The topics include filamentous growth, phenotypic switching, adherence and biofilms, as well as more generalized topics such as function/process, protein physical properties, proteinprotein interactions, proteinnucleic acid interactions, post-translational modifications, transcriptional regulation and translational regulation. The complete set of CGD literature guide topics is listed in Table 1. Within the literature guide interface, the reference list may be sorted according to the topic or by curation status (curated or not yet curated). Alternately, users may choose to focus on individual papers. The curated paper view displays the reference information and the abstract, along with a summary of literature guide topics that are assigned to every gene characterized in the paper.
|
| COMMUNITY RESOURCES |
|---|
|
|
|---|
CGD seeks to facilitate an interaction among the members of the C.albicans research community. Thus, CGD has implemented a colleague registry, by which researchers may share contact information and find others who share research interests or who are experts in a particular topic.
CGD also serves as the keeper of gene name reservations prior to publication. The community conferred this privilege upon CGD at the ASM meeting on Candida and Candidiasis in March 2004. Having a reservation system for gene names benefits the entire community because it helps to reduce conflicts in gene names and prevents the introduction of confusing synonyms into the literature. CGD does not itself assign gene names, but rather collects and maintains a list of current reservations and attempts to mediate resolution of any disputes that may arise. CGD follows gene name guidelines that are based on those used by the Saccharomyces cerevisiae research community. Detailed information about choosing and reserving a gene name is found on the CGD web site under nomenclature guide.
CGD also hosts a web page with Candida community news and a list of meetings, courses, and related web sites of interest.
| CURRENT PROGRESS AND FUTURE DIRECTIONS |
|---|
|
|
|---|
The CGD project began in April 2004, and is progressing rapidly. However, there is still much to be done, and there are plans to add additional information and new features.
CGD literature curation is now in progress. As of August 2004, CGD contained more than 900 gene product descriptions, and
1500 mutant phenotype descriptions and 1500 GO term assignments. The initial release of CGD contained locus pages for genes that have been characterized in the literature. The C.albicans genome contains
6400 homologous pairs of genes (11); the majority of these genes have not yet been characterized. CGD will contain the entire SC5314 gene complement, with locus pages and sequences for all the genes that were identified in the genome-sequencing project, although not all of this information was included in CGD upon the initial database release. CGD will contain the reference sequence of the strain SC5314 (11). The C.albicans genomic sequence data is scheduled to be added to CGD in the autumn of 2004. Once this information has been incorporated, CGD will provide access to sequence analysis and visualization tools that are similar to those available at SGD, including tools for viewing multiple versions of sequences that have been updated since the original sequence was published. Each locus page currently provides a hyperlink to the C.albicans BLAST tool at the Biotechnology Research Institute of the National Research Council in Canada.
In addition, CGD will also provide links between CGD and SGD locus pages, which will provide instant access to information about the S.cerevisiae orthologs of C.albicans proteins. All CGD data will available for free download at an ftp site that will be linked from our home page.
The current curation efforts are focused on the body of scientific literature that deals with specific C.albicans genes by name. However, an additional set of literature exists that concerns more generalized C.albicans biology, e.g. drug sensitivity studies or morphological descriptions that do not examine the role of any specific gene product. CGD plans to include these papers in the database and to make literature guide topic assignments. The current set of literature topics may need to be expanded to capture information from this set of papers more effectively. The CGD group seeks input from the research community as to what types of information would be most useful for CGD to collect from such papers.
Within the next year, CGD plans to begin curation of metabolic pathway information. CGD will use the Pathway Tools software (14) to make pathway predictions, and will supplement and validate these predictions by curating pathway information from the published literature.
| SUMMARY AND AVAILABILITY |
|---|
|
|
|---|
In summary, the Candida Genomic Database is a resource modeled after the Saccharomyces Genome Database. The CGD contains information about C.albicans genes and gene products. CGD is freely available on the web at www.candidagenome.org. CGD also facilitates community interaction by providing a colleague registry and a gene name registry. CGD is being actively developed, and the CGD project staff would like to solicit advice from Candida researchers about ways in which CGD may best serve the C.albicans research community. Users are encouraged to contact CGD at candida-curator{at}genome.stanford.edu with comments or suggestions.
| ACKNOWLEDGEMENTS |
|---|
CGD thanks the Candida Annotation Working Group, especially Andre Nantel, for their generosity in sharing their entire data set and their enthusiasm for letting CGD house this resource; Judith Berman, Burk Braun, Neil Gow, Pete Magee and Aaron Mitchell for their advice and support; and the Saccharomyces Genome Database group for their resources and invaluable assistance. CGD is supported by grant R01 DE15873-01 from the National Institute of Dental and Craniofacial Research at the US National Institutes of Health.
| Notes |
|---|
The online version of this article has been published under an open access model. Users are entitled to use, reproduce, disseminate, or display the open access version of this article for non-commercial purposes provided that: the original authorship is properly and fully attributed; the Journal and Oxford University Press are attributed as the original place of publication with the correct citation details given; if an article is subsequently reproduced or disseminated not in its entirety but only in part or as a derivative work this must be clearly indicated. For commercial re-use permissions, please contact journals.permissions{at}oupjournals.org.
| REFERENCES |
|---|
|
|
|---|
- Fisher-Hoch,S.P. and Hutwagner,L. ( (1995) ) Opportunistic candidiasis: an epidemic of the 1980s. Clin. Infect. Dis., , 21, , 897904.[Web of Science][Medline] .
- Groll,A.H., De Lucca,A.J. and Walsh,T.J. ( (1998) ) Emerging targets for the development of novel antifungal therapeutics. Trends Microbiol., , 6, , 117124.[CrossRef][Web of Science][Medline] .
- Vanden Bossche,H. ( (1995) ) In Lyr,H. (ed.), Modern Selective Fungicides: Properties, Applications, Mechanism of Actions. Gustav Fisher Verlag, Jena, pp. 431484. .
- Bennet,J.E. ( (1996) ) In Taylor,P. (ed.), Goodman and Gilman's The Pharmacological Basis of Therapeutics. Pergamon, Elmsford, NY, pp. 11651181. .
- Georgopapadakou,N.H. and Walsh,T.J. ( (1996) ) Antifungal agents: chemotherapeutic targets and immunologic strategies. Antimicrob. Agents Chemother., , 40, , 279291.[Web of Science][Medline] .
- Smith,D., Boag,F., Midgley,J. and Gazzard,B. ( (1991) ) Fluconazole resistant candida in AIDS. J. Infect., , 23, , 345346.[CrossRef][Web of Science][Medline] .
- Siegman-Igra,Y. and Rabaw,M.Y. ( (1992) ) Failure of fluconazole in systemic candidiasis. Eur. J. Clin. Microbiol. Infect. Dis., , 11, , 201202.[CrossRef][Web of Science][Medline] .
- Johnson,E.M., Warnock,D.W., Luker,J., Porter,S.R. and Scully,C. ( (1995) ) Emergence of azole drug resistance in Candida species from HIV-infected patients receiving prolonged fluconazole therapy for oral candidosis. J. Antimicrob. Chemother., , 35, , 103114.
[Abstract/Free Full Text] . - Denning,D.W. ( (1995) ) Can we prevent azole resistance in fungi? Lancet, , 346, , 454455.[CrossRef][Web of Science][Medline] .
- Boschman,C.R., Bodnar,U.R., Tornatore,M.A., Obias,A.A., Noskin,G.A., Englund,K., Postelnick,M.A., Suriano,T. and Peterson,L.R. ( (1998) ) Thirteen-year evolution of azole resistance in yeast isolates and prevalence of resistant strains carried by cancer patients at a large medical center. Antimicrob. Agents Chemother., , 42, , 734738.
[Abstract/Free Full Text] . - Jones,T., Federspiel,N.A., Chibana,H., Dungan,J., Kalman,S., Magee,B.B., Newport,G., Thorstenson,Y.R., Agabian,N., Magee,P.T. et al. ( (2004) ) The diploid genome sequence of Candida albicans. Proc. Natl Acad. Sci. USA, , 101, , 73297334.
[Abstract/Free Full Text] . - Christie,K.R., Weng,S., Balakrishnan,R., Costanzo,M.C., Dolinski,K., Dwight,S.S., Engel,S.R., Feierbach,B., Fisk,D.G., Hirschman,J.E. et al. ( (2004) ) Saccharomyces Genome Database (SGD) provides tools to identify and analyze sequences from Saccharomyces cerevisiae and related sequences from other organisms. Nucleic Acids Res., , 32, (Database issue), D311D314.
[Abstract/Free Full Text] . - Ashburner,M., Ball,C.A., Blake,J.A., Botstein,D., Butler,H., Cherry,J.M., Davis,A.P., Dolinski,K., Dwight,S.S., Eppig,J.T. et al. ( (2000) ) Gene ontology: tool for the unification of biology. The Gene Ontology consortium. Nature Genet., , 25, , 2529.[CrossRef][Web of Science][Medline] .
- Karp,P.D., Paley,S. and Romero,P. ( (2002) ) The Pathway Tools software. Bioinformatics, , 18, (Suppl. 1), S225232.[Abstract]
.
This article has been cited by other articles:
![]() |
G. C. Conant and P. F. Stadler Solvent Exposure Imparts Similar Selective Pressures across a Range of Yeast Proteins Mol. Biol. Evol., May 1, 2009; 26(5): 1155 - 1161. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. M. Mora-Montes, O. Bader, E. Lopez-Romero, S. Zinker, P. Ponce-Noyola, B. Hube, N. A. R. Gow, and A. Flores-Carreon Kex2 protease converts the endoplasmic reticulum {alpha}1,2-mannosidase of Candida albicans into a soluble cytosolic form Microbiology, December 1, 2008; 154(12): 3782 - 3794. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. L. Chaffin Candida albicans Cell Wall Proteins Microbiol. Mol. Biol. Rev., September 1, 2008; 72(3): 495 - 544. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Goto, D. Carter, and S. G. Reed Immunological Dominance of Trypanosoma cruzi Tandem Repeat Proteins Infect. Immun., September 1, 2008; 76(9): 3967 - 3974. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. K. Sherwood and R. J. Bennett Microtubule Motor Protein Kar3 Is Required for Normal Mitotic Division and Morphogenesis in Candida albicans Eukaryot. Cell, September 1, 2008; 7(9): 1460 - 1474. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Romero-Zaliz, C. del Val, J. P. Cobb, and I. Zwir Onto-CC: a web server for identifying Gene Ontology conceptual clusters Nucleic Acids Res., July 1, 2008; 36(suppl_2): W352 - W357. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. W. Kebaara, M. L. Langford, D. H. M. L. P. Navarathna, R. Dumitru, K. W. Nickerson, and A. L. Atkin Candida albicans Tup1 Is Involved in Farnesol-Mediated Inhibition of Filamentous-Growth Induction Eukaryot. Cell, June 1, 2008; 7(6): 980 - 987. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. S. Noffz, V. Liedschulte, K. Lengeler, and J. F. Ernst Functional Mapping of the Candida albicans Efg1 Regulator Eukaryot. Cell, May 1, 2008; 7(5): 881 - 893. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Banerjee, D. S. Thompson, A. Lazzell, P. L. Carlisle, C. Pierce, C. Monteagudo, J. L. Lopez-Ribot, and D. Kadosh UME6, a Novel Filament-specific Regulator of Candida albicans Hyphal Extension and Virulence Mol. Biol. Cell, April 1, 2008; 19(4): 1354 - 1365. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. G. Oliver, P. M. Silver, C. Marie, S. J. Hoot, S. E. Leyde, and T. C. White Tetracycline alters drug susceptibility in Candida albicans and other pathogenic fungi Microbiology, March 1, 2008; 154(3): 960 - 970. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Rossignol, P. Lechat, C. Cuomo, Q. Zeng, I. Moszer, and C. d'Enfert CandidaDB: a multi-genome database for Candida species and related Saccharomycotina Nucleic Acids Res., January 11, 2008; 36(suppl_1): D557 - D561. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Banerjee, G. Lelandais, S. Shukla, G. Mukhopadhyay, C. Jacq, F. Devaux, and R. Prasad Responses of Pathogenic and Nonpathogenic Yeast Species to Steroids Reveal the Functioning and Evolution of Multidrug Resistance Transcriptional Networks Eukaryot. Cell, January 1, 2008; 7(1): 68 - 77. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. J. Wolyniak and P. Sundstrom Role of Actin Cytoskeletal Dynamics in Activation of the Cyclic AMP Pathway and HWP1 Gene Expression in Candida albicans Eukaryot. Cell, October 1, 2007; 6(10): 1824 - 1840. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Jezewski, M. von der Heide, S. Poltermann, A. Hartl, W. Kunkel, P. F. Zipfel, and R. Eck Role of the Vps34p-interacting protein Ade5,7p in hyphal growth and virulence of Candida albicans Microbiology, July 1, 2007; 153(7): 2351 - 2362. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. E. McBride, C. Zurita-Lopez, A. Regis, E. Blum, A. Conboy, S. Elf, and S. Clarke Protein Arginine Methylation in Candida albicans: Role in Nuclear Transport Eukaryot. Cell, July 1, 2007; 6(7): 1119 - 1129. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Fernandez-Arenas, V. Cabezon, C. Bermejo, J. Arroyo, C. Nombela, R. Diez-Orejas, and C. Gil Integrated Proteomics and Genomics Strategies Bring New Insight into Candida albicans Response upon Macrophage Interaction Mol. Cell. Proteomics, March 1, 2007; 6(3): 460 - 478. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. B. Arnaud, M. C. Costanzo, M. S. Skrzypek, P. Shah, G. Binkley, C. Lane, S. R. Miyasato, and G. Sherlock Sequence resources at the Candida Genome Database Nucleic Acids Res., January 12, 2007; 35(suppl_1): D452 - D456. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. D. Khot, P. A. Suci, R. L. Miller, R. D. Nelson, and B. J. Tyler A Small Subpopulation of Blastospores in Candida albicans Biofilms Exhibit Resistance to Amphotericin B Associated with Differential Regulation of Ergosterol and {beta}-1,6-Glucan Pathway Genes Antimicrob. Agents Chemother., November 1, 2006; 50(11): 3708 - 3716. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Brown, J. A. Sexton, and M. Johnston A Glucose Sensor in Candida albicans. Eukaryot. Cell, October 1, 2006; 5(10): 1726 - 1737. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. A. B. Knight and A. Dancis Reduction of 2,3-bis(2-methoxy-4-nitro-5-sulfophenyl)-2H-tetrazolium-5-carboxanilide inner salt (XTT) is dependent on CaFRE10 ferric reductase for Candida albicans grown in unbuffered media. Microbiology, August 1, 2006; 152(Pt 8): 2301 - 2308. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Uppuluri, B. Sarmah, and W. L. Chaffin Candida albicans SNO1 and SNZ1 expressed in stationary-phase planktonic yeast cells and base of biofilm Microbiology, July 1, 2006; 152(7): 2031 - 2038. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Gebhart, A. K. Bahrami, and A. Sil Identification of a Copper-Inducible Promoter for Use in Ectopic Expression in the Fungal Pathogen Histoplasma capsulatum Eukaryot. Cell, June 1, 2006; 5(6): 935 - 944. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Niimi, K. Maki, F. Ikeda, A. R. Holmes, E. Lamping, M. Niimi, B. C. Monk, and R. D. Cannon Overexpression of Candida albicans CDR1, CDR2, or MDR1 Does Not Produce Significant Changes in Echinocandin Susceptibility. Antimicrob. Agents Chemother., April 1, 2006; 50(4): 1148 - 1155. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Kadosh and A. D. Johnson Induction of the Candida albicans Filamentous Growth Program by Relief of Transcriptional Repression: A Genome-wide Analysis Mol. Biol. Cell, June 1, 2005; 16(6): 2903 - 2912. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||










