Nucleic Acids Research, 2004, Vol. 32, Database issue D277-D280
© 2004 Oxford University Press
The KEGG resource for deciphering the genome
Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji, Kyoto 611-0011, Japan
*To whom correspondence should be addressed. Tel: +81 774 38 3270; Fax: +81 774 38 3269; Email: kanehisa{at}kuicr.kyoto-u.ac.jp
Received September 15, 2003; Revised and Accepted September 25, 2003
| ABSTRACT |
|---|
|
|
|---|
A grand challenge in the post-genomic era is a complete computer representation of the cell and the organism, which will enable computational prediction of higher-level complexity of cellular processes and organism behavior from genomic information. Toward this end we have been developing a knowledge-based approach for network prediction, which is to predict, given a complete set of genes in the genome, the protein interaction networks that are responsible for various cellular processes. KEGG at http://www.genome.ad.jp/kegg/ is the reference knowledge base that integrates current knowledge on molecular interaction networks such as pathways and complexes (PATHWAY database), information about genes and proteins generated by genome projects (GENES/SSDB/KO databases) and information about biochemical compounds and reactions (COMPOUND/GLYCAN/REACTION databases). These three types of database actually represent three graph objects, called the protein network, the gene universe and the chemical universe. New efforts are being made to abstract knowledge, both computationally and manually, about ortholog clusters in the KO (KEGG Orthology) database, and to collect and analyze carbohydrate structures in the GLYCAN database.
| INTRODUCTION |
|---|
|
|
|---|
KEGG (Kyoto Encyclopedia of Genes and Genomes) is a bioinformatics resource for understanding higher-order functional meanings and utilities of the cell or the organism from its genome information. It is an integrated resource consisting of three types of database for genomic, chemical and network information, and associated software, which are all developed by the Kanehisa Laboratory (now part of the Bioinformatics Center) in the Institute for Chemical Research, Kyoto University. While KEGG has cross-references to numerous outside databases, it is intended to be a self-sufficient system for linking genomes to life at the cellular level, containing a complete set of building blocks (genes and molecules) and wiring diagrams (interaction networks) for cellular functions. Eventually, this self-sufficient system will become a computer representation of the cell and the organism, and perhaps the biosphere as well, which will enable in silico analysis of biological systems (1). Even at the current primitive stage, KEGG is widely used for analysis of various types of molecular biological data in order to obtain clues to higher-order functions.
During the past 2 years we have worked to make the KEGG resource (2,3) more accessible to automated analysis. For example, the XML representation of KEGG pathway diagrams is useful for automatic drawing of potential networks identified by two-hybrid experiments as an extension to known networks. Access to KEGG can now be made through the KEGG API (SOAP interface to KEGG), which means that the user can write a program to analyze microarray gene expression data or to annotate a newly sequenced genome by automating KEGG queries. In addition, we have released two new database components: KO for ortholog grouping and hierarchical classification of genes and GLYCAN for carbohydrate structures. Here we describe the current status and future plans of the KEGG resource.
| THE KEGG DATABASES |
|---|
|
|
|---|
Graph representation
To understand the overall architecture, it is useful to know that KEGG consists of three graph objects for representation and manipulation of genomic, chemical and network data. Mathematically, a graph is a set of nodes (building blocks) and edges (interactions or relations). As shown in Table 1 the three graph objects are called the gene universe (GENES, SSDB and KO databases), the chemical universe (COMPOUND, GLYCAN and REACTION databases), and the protein network (PATHWAY database). The gene universe is a conceptual graph object representing ortholog/paralog relations, operon information and other relationships between genes in all the completely sequenced genomes. The chemical universe is another conceptual graph object representing chemical reactions and structural/functional relations among metabolites and other biochemical compounds. In contrast, the protein network is based on biological phenomena, representing known molecular interaction networks in various cellular processes (2).
|
Network hierarchy
Another important aspect in the overall architecture of KEGG is network hierarchy. The protein network, which is the most unique data object in KEGG, is stored as a collection of pathway maps in the PATHWAY database, representing wiring diagrams of proteins and other gene products responsible for various cellular functions. Reflecting the map resolution and functional modules at different levels, these pathway maps are hierarchically classified. There are five categories in the top level (metabolism, genetic information processing, environmental information processing, cellular processes and human diseases) and 24 subcategories in the second level. The third level in the hierarchy corresponds to individual pathway maps. When the protein network is linked to the gene universe, the fourth level corresponds to KO (KEGG Orthology) entries. Thus the hierarchy of gene functions in KEGG is based on the hierarchy of the protein network as shown in Table 2.
|
| GENE UNIVERSE |
|---|
|
|
|---|
GENES database
The information about individual genes is stored in the GENES database. As of September 12, 2003 the GENES database contains 572 881 genes in 155 organisms. The GENES entries are generated semi-automatically by selecting and combining various sources including authors submissions to GenBank (ftp://ftp.ncbi.nih.gov/genbank/genomes/), the NCBI RefSeq database (ftp://ftp.ncbi.nih.gov/genomes/), the EMBL database (ftp://ftp.ebi.ac.uk/pub/databases/embl/ genomes/) and publicly available organism-specific databases. They are then subjected to internal re-annotation, in which we just assign K numbers for the KO grouping of genes without updating the description of the genes. Our KO assignment appears in the ORTHOLOG line of the GENES entry.
SSDB database
SSDB was originally a sequence similarity database containing precomputed similarity scores by the SSEARCH program with additional information about best hits and best-best hits in pairwise genome comparisons (2). We have recently implemented an automatic procedure, based on a graph analytical method, to computationally generate ortholog clusters (OCs) and paralog clusters (PCs) from the huge SSDB graph, currently containing 200 million edges. The resulting ortholog clusters can be examined by clicking on the ORTHOLOG link in the GENES entry.
KO database
When the KEGG project was initiated in 1995, the integration of genomic information and network information was achieved via the EC numbers. The EC numbers were common identifiers for matching genes in the genome and gene products (enzymes) in the metabolic pathway. Then, a new scheme using ortholog identifiers was introduced (3) to extend the matching procedure to regulatory pathways and to overcome various problems inherent in the enzyme nomenclature. KO is a further extension of this scheme based on computational analysis, as well as manual curation, of SSDB ortholog clusters in order to classify all gene functions and explore unknown pathways. Each KO entry is identified by the K number (accession number) with the previous ortholog identifier as an alternative name. In future releases of KEGG, the KO hierarchy (Table 2) will become more complete, and the current ortholog group tables will be part of the KO database.
| CHEMICAL UNIVERSE |
|---|
|
|
|---|
COMPOUND database
The COMPOUND database contains chemical structures of most known metabolic compounds and some pharmaceutical and environmental compounds. All chemical structures are manually entered, computationally verified and continuously updated. Currently the database contains 10 739 entries, each of which is identified by the C number (accession number). A new feature implemented in the COMPOUND database is the KCF (KEGG Chemical Function) representation of chemical structures shown in Figure 1 and the resulting graph- based chemical structure comparison method (4). The COMPOUND/REACTION databases are being moved from the ISIS system (5) to the in-house-developed relational database system, in order to integrate with the GLYCAN database. Once this is done the substructure search against the database will be performed more rigorously by our graph comparison algorithm rather than the bit string comparison in the ISIS system.
|
GLYCAN database
GLYCAN is a new addition to the KEGG suite of databases. We have initiated efforts to collect carbohydrate structures because of the lack of a publicly available database after the termination of the CarbBank project (6). The pathway diagrams for metabolism of complex carbohydrates and metabolism of complex lipids are now linked to individual entries of carbohydrate structures in the GLYCAN database. The reactions catalyzed by glycosyltransferases and other sugar-related enzymes are represented in the REACTION database in a simpler form of carbohydrate structures (Fig. 1) rather than the all-atom representation. Each GLYCAN entry is identified by the G number (accession number) and the current total is 10 445 entries, among which only a few hundred were manually entered and linked to KEGG pathways. The rest represents unique structures derived from CarbBank. The GLYCAN database is maintained in a relational database with the structure drawing tool in Java. A database search is also made available based on newly developed algorithms for tree structure comparisons.
REACTION database
The REACTION database contains reaction formulas for enzymic reactions, currently totaling 5799 entries. Each entry is identified by the R number (accession number) representing a unique reaction corresponding to sets of reactants and products represented by the C number in the COMPOUND database or the G number in the GLYCAN database. This should be compared with the EC number, which may correspond to multiple reaction formulas. The EC number hierarchy is supposed to represent aspects of enzymatic reactions, but in reality it often contains aspects of enzyme molecules. Within the KEGG resource, these two aspects of EC numbers are clearly distinguished: R numbers for reactions and K numbers for molecules. We are working to develop a new hierarchy, tentatively called RC (Reaction Classification), for understanding chemistry of enzymic reactions.
ENZYME database
The ENZYME database contains enzyme nomenclature with numerous links to KEGG databases. It is generated semi-automatically from the enzyme nomenclature website (http://www.chem.qmul.ac.uk/iubmb/enzyme/). The role of this database within KEGG has diminished, but the EC number is still the simplest way to link to KEGG from outside resources.
| PROTEIN NETWORK |
|---|
|
|
|---|
PATHWAY database
The protein network in KEGG is an abstract network of gene products, representing not only the pathway or the complex resulting from direct proteinprotein interactions, but also the metabolic network viewed as a network of enzymes, and the gene regulatory network viewed as a network of transcription factors and target products (2,3). The PATHWAY database is a collection of manually drawn diagrams called the KEGG reference pathway diagrams (maps), each corresponding to a known network of functional significance. The PATHWAY database also contains organism-specific pathways, which are automatically generated by superimposing (coloring) genes in given organisms. The database currently contains 13 457 entries including 235 reference pathway diagrams.
In the past, the pathway diagrams were available only in GIF (or PNG) image files. Although the coordinates of nodes (boxes) could be obtained from the HTML file, it was not possible to reconstruct the pathway because the edge information was not readily available. We have released the KEGG Markup Language (KGML) as a specification of graph objects in KEGG. All metabolic pathways and some regulatory pathways are now available in KGML, enabling computational reconstruction and manipulation of KEGG pathways.
| ACCESS METHODS |
|---|
|
|
|---|
The primary mode of access to KEGG is through the GenomeNet website at http://www.genome.ad.jp/kegg/. Different components of the KEGG resource can most conveniently be accessed from the KEGG table of contents page at http://www.genome.ad.jp/kegg/kegg2.html. The four databases for the chemical universe, COMPOUND, GLYCAN, REACTION and ENZYME are collectively called the LIGAND database with a separate home page at http://www.genome.ad.jp/ligand/. Table 3 summarizes these and other useful URLs including additional databases not covered in the present article.
|
For computerized access to KEGG, the SOAP server is open to academic users at http://www.genome.ad.jp/kegg/soap/. All the KEGG databases, except SSDB, are also available to academic users by anonymous FTP at http://www.genome. ad.jp/anonftp/.
| ACKNOWLEDGEMENTS |
|---|
The computational resource was provided by the Bioinformatics Center, Institute for Chemical Research, Kyoto University. This work was supported by grants from the Ministry of Education, Culture, Sports, Science and Technology of Japan, the Japan Society for the Promotion of Science and the Japan Science and Technology Agency.
| REFERENCES |
|---|
|
|
|---|
- Kanehisa,M. and Bork,P. (2003) Bioinformatics in the post-sequence era. Nature Genet., 33, 305310.
- Kanehisa,M., Goto,S., Kawashima,S. and Nakaya,A. (2002) The KEGG databases at GenomeNet. Nucleic Acids Res., 30, 4246.
[Abstract/Free Full Text] - Kanehisa,M. and Goto,S. (2000) KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Res., 28, 2730.
[Abstract/Free Full Text] - Hattori,M., Okuno,Y., Goto,S. and Kanehisa,M. (2003) Development of a chemical structure comparison method for integrated analysis of chemical and genomic information in the metabolic pathways. J. Am. Chem. Soc., 125, 1185311865.[CrossRef][Web of Science][Medline]
- Goto,S., Okuno,Y., Hattori,M., Nishioka,T. and Kanehisa,M. (2001) LIGAND: database of chemical compounds and reactions in biological pathways. Nucleic Acids Res., 30, 402404.
- Doubet,S., Bock,K., Smith,D., Darvill,A. and Albersheim,P. (1989) The complex carbohydrate structure database. Trends Biochem. Sci., 14, 475477.[CrossRef][Web of Science][Medline]
This article has been cited by other articles:
![]() |
Y. Wang, E. Bolton, S. Dracheva, K. Karapetyan, B. A. Shoemaker, T. O. Suzek, J. Wang, J. Xiao, J. Zhang, and S. H. Bryant An overview of the PubChem BioAssay resource Nucleic Acids Res., November 19, 2009; (2009) gkp965v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Senf and X.-w. Chen Identification of genes involved in the same pathways using a Hidden Markov Model-based approach Bioinformatics, November 15, 2009; 25(22): 2945 - 2954. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. S. Dehal, M. P. Joachimiak, M. N. Price, J. T. Bates, J. K. Baumohl, D. Chivian, G. D. Friedland, K. H. Huang, K. Keller, P. S. Novichkov, et al. MicrobesOnline: an integrated portal for comparative and functional genomics Nucleic Acids Res., November 11, 2009; (2009) gkp919v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. J. Turnbaugh, V. K. Ridaura, J. J. Faith, F. E. Rey, R. Knight, and J. I. Gordon The Effect of Diet on the Human Gut Microbiome: A Metagenomic Analysis in Humanized Gnotobiotic Mice Science Translational Medicine, November 11, 2009; 1(6): 6ra14 - 6ra14. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Hu, Y. Wang, L. Ding, P. Lu, S. Atkinson, and S. Chen Positive regulation of flhDC expression by OmpR in Yersinia pseudotuberculosis Microbiology, November 1, 2009; 155(11): 3622 - 3631. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Hernandez, O. Valdes-Lopez, M. Ramirez, N. Goffard, G. Weiller, R. Aparicio-Fabre, S. I. Fuentes, A. Erban, J. Kopka, M. K. Udvardi, et al. Global Changes in the Transcript and Metabolic Profiles during Symbiotic Nitrogen Fixation in Phosphorus-Stressed Common Bean Plants Plant Physiology, November 1, 2009; 151(3): 1221 - 1238. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Beloqui, M.-E. Guazzaroni, F. Pazos, J. M. Vieites, M. Godoy, O. V. Golyshina, T. N. Chernikova, A. Waliczek, R. Silva-Rocha, Y. Al-ramahi, et al. Reactome Array: Forging a Link Between Metabolome and Genome Science, October 9, 2009; 326(5950): 252 - 257. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. R.M. Tillier and R. L. Charlebois The human protein coevolution network Genome Res., October 1, 2009; 19(10): 1861 - 1871. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. S. Najafabadi, H. Goodarzi, and R. Salavati Universal function-specificity of codon usage Nucleic Acids Res., September 22, 2009; (2009) gkp792v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. M. Song, S. J. Lim, and J. C. Tong Recent advances in computer-aided drug design Brief Bioinform, September 1, 2009; 10(5): 579 - 591. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Benita, H. Kikuchi, A. D. Smith, M. Q. Zhang, D. C. Chung, and R. J. Xavier An integrative genomics approach identifies Hypoxia Inducible Factor-1 (HIF-1)-target genes that form the core response to hypoxia Nucleic Acids Res., August 1, 2009; 37(14): 4587 - 4602. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Skolnick and M. Brylinski FINDSITE: a combined evolution/structure-based approach to protein function prediction Brief Bioinform, July 1, 2009; 10(4): 378 - 391. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Nam, M. Li, K. Choi, C. Balch, S. Kim, and K. P. Nephew MicroRNA and mRNA integrated analysis (MMIA): a web tool for examining biological functions of microRNA expression Nucleic Acids Res., July 1, 2009; 37(suppl_2): W356 - W362. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Medina, D. Montaner, N. Bonifaci, M. A. Pujana, J. Carbonell, J. Tarraga, F. Al-Shahrour, and J. Dopazo Gene set-based analysis of polymorphisms: finding pathways or biological processes associated to traits in genome-wide association studies Nucleic Acids Res., July 1, 2009; 37(suppl_2): W340 - W344. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Robinzon, A. Dafa-Berger, M. D. Dyer, B. Paeper, S. C. Proll, T. H. Teal, S. Rom, D. Fishman, B. Rager-Zisman, and M. G. Katze Impaired Cholesterol Biosynthesis in a Neuronal Cell Line Persistently Infected with Measles Virus J. Virol., June 1, 2009; 83(11): 5495 - 5504. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. E. Lewis, B.-K. Cho, E. M. Knight, and B. O. Palsson Gene Expression Profiling and the Use of Genome-Scale In Silico Models of Escherichia coli for Analysis: Providing Context for Content J. Bacteriol., June 1, 2009; 191(11): 3437 - 3444. [Full Text] [PDF] |
||||
![]() |
Z. Tu, C. Argmann, K. K. Wong, L. J. Mitnaul, S. Edwards, I. C. Sach, J. Zhu, and E. E. Schadt Integrating siRNA and protein-protein interaction data to identify an expanded insulin signaling network Genome Res., June 1, 2009; 19(6): 1057 - 1067. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. L. Calicchio, T. Collins, and H. P. Kozakewich Identification of Signaling Systems in Proliferating and Involuting Phase Infantile Hemangiomas by Genome-Wide Transcriptional Profiling Am. J. Pathol., May 1, 2009; 174(5): 1638 - 1649. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. C. Daniel, L. Marchionni, J. S. Hierman, J. T. Rhodes, W. L. Devereux, C. M. Rudin, R. Yung, G. Parmigiani, M. Dorsch, C. D. Peacock, et al. A Primary Xenograft Model of Small-Cell Lung Cancer Reveals Irreversible Changes in Gene Expression Imposed by Culture In vitro Cancer Res., April 15, 2009; 69(8): 3364 - 3373. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. A. Pratilas, B. S. Taylor, Q. Ye, A. Viale, C. Sander, D. B. Solit, and N. Rosen V600EBRAF is associated with disabled feedback inhibition of RAF-MEK signaling and elevated transcriptional output of the pathway PNAS, March 17, 2009; 106(11): 4519 - 4524. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Wu and M Watson CORNA: testing gene lists for regulation by microRNAs Bioinformatics, March 15, 2009; 25(6): 832 - 833. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Pan, C. Kumar, S. Bohl, U. Klingmueller, and M. Mann Comparative Proteomic Phenotyping of Cell Lines and Primary Cells to Assess Preservation of Cell Type-specific Functions Mol. Cell. Proteomics, March 1, 2009; 8(3): 443 - 450. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. C. Johnson, S. Corthals, C. Ramos, A. Hoering, K. Cocks, N. J. Dickens, J. Haessler, H. Goldschmidt, J. A. Child, S. E. Bell, et al. Genetic associations with thalidomide mediated venous thrombotic events in myeloma identified using targeted genotyping Blood, December 15, 2008; 112(13): 4924 - 4934. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Dinh, G. M. Scholz, and J. A. Hamilton Regulation of WAVE1 expression in macrophages at multiple levels J. Leukoc. Biol., December 1, 2008; 84(6): 1483 - 1491. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Jacob and J.-P. Vert Protein-ligand interaction prediction: an improved chemogenomics approach Bioinformatics, October 1, 2008; 24(19): 2149 - 2156. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. T. Chung, J. S. Yoo, H. B. Oh, Y. S. Lee, S. H. Cha, S. J. Kim, and C. K. Yoo Complete Genome Sequence of Neisseria gonorrhoeae NCCP11945 J. Bacteriol., September 1, 2008; 190(17): 6035 - 6036. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Schmitz-Esser, I. Haferkamp, S. Knab, T. Penz, M. Ast, C. Kohl, M. Wagner, and M. Horn Lawsonia intracellularis Contains a Gene Encoding a Functional Rickettsia-Like ATP/ADP Translocase for Host Exploitation J. Bacteriol., September 1, 2008; 190(17): 5746 - 5752. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. F. Saccone, N. L. Saccone, G. E. Swan, P. A. F. Madden, A. M. Goate, J. P. Rice, and L. J. Bierut Systematic biological prioritization after a genome-wide association study: an application to nicotine dependence Bioinformatics, August 15, 2008; 24(16): 1805 - 1811. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y.-H. Chen, C.-K. Liu, S.-C. Chang, Y.-J. Lin, M.-F. Tsai, Y.-T. Chen, and A. Yao GenoWatch: a disease gene mining browser for association study Nucleic Acids Res., July 1, 2008; 36(suppl_2): W336 - W340. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Al-Shahrour, J. Carbonell, P. Minguez, S. Goetz, A. Conesa, J. Tarraga, I. Medina, E. Alloza, D. Montaner, and J. Dopazo Babelomics: advanced functional profiling of transcriptomics, proteomics and genomics experiments Nucleic Acids Res., July 1, 2008; 36(suppl_2): W341 - W346. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Takarada, M. Sekine, H. Kosugi, Y. Matsuo, T. Fujisawa, S. Omata, E. Kishi, A. Shimizu, N. Tsukatani, S. Tanikawa, et al. Complete Genome Sequence of the Soil Actinomycete Kocuria rhizophila J. Bacteriol., June 15, 2008; 190(12): 4139 - 4146. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. X. Cordero, B. Snel, and P. Hogeweg Coevolution of gene families in prokaryotes Genome Res., March 1, 2008; 18(3): 462 - 468. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. R Kensche, V. van Noort, B. E Dutilh, and M. A Huynen Practical and theoretical advances in predicting the function of a protein by its phylogenetic distribution J R Soc Interface, February 6, 2008; 5(19): 151 - 170. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Juan, F. Pazos, and A. Valencia High-confidence prediction of global interactomes based on genome-wide coevolutionary networks PNAS, January 22, 2008; 105(3): 934 - 939. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. M. Markowitz, N. N. Ivanova, E. Szeto, K. Palaniappan, K. Chu, D. Dalevi, I-M. A. Chen, Y. Grechkin, I. Dubchak, I. Anderson, et al. IMG/M: a data management and analysis system for metagenomes Nucleic Acids Res., January 11, 2008; 36(suppl_1): D534 - D538. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. M. Markowitz, E. Szeto, K. Palaniappan, Y. Grechkin, K. Chu, I-M. A. Chen, I. Dubchak, I. Anderson, A. Lykidis, K. Mavromatis, et al. The integrated microbial genomes (IMG) system in 2007: data content and analysis tool extensions Nucleic Acids Res., January 11, 2008; 36(suppl_1): D528 - D533. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Nam, B. Kim, S. Shin, and S. Lee miRGator: an integrated system for functional annotation of microRNAs Nucleic Acids Res., January 11, 2008; 36(suppl_1): D159 - D164. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. G. Conte, S. Gaillard, N. Lanau, M. Rouard, and C. Perin GreenPhylDB: a database for plant comparative genomics Nucleic Acids Res., January 11, 2008; 36(suppl_1): D991 - D998. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Okuno, A. Tamon, H. Yabuuchi, S. Niijima, Y. Minowa, K. Tonomura, R. Kunimoto, and C. Feng GLIDA: GPCR ligand database for chemical genomics drug discovery database and tools update Nucleic Acids Res., January 11, 2008; 36(suppl_1): D907 - D912. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Donitz, B. Goemann, M. Lize, H. Michael, N. Sasse, E. Wingender, and A. P. Potapov EndoNet: an information resource about regulatory networks of cell-to-cell communication Nucleic Acids Res., January 11, 2008; 36(suppl_1): D689 - D694. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Shin, A. M. Lisewski, and O. Lichtarge Graph sharpening plus graph integration: a synergy that improves protein functional classification Bioinformatics, December 1, 2007; 23(23): 3217 - 3224. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. J. Bootsma, M. Egmont-Petersen, and P. W. M. Hermans Analysis of the In Vitro Transcriptional Response of Human Pharyngeal Epithelial Cells to Adherent Streptococcus pneumoniae: Evidence for a Distinct Response to Encapsulated Strains Infect. Immun., November 1, 2007; 75(11): 5489 - 5499. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Cakmak and G. Ozsoyoglu Mining biological networks for unknown pathways Bioinformatics, October 15, 2007; 23(20): 2775 - 2783. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. D. Harrington, A. H. Singh, T. Doerks, I. Letunic, C. von Mering, L. J. Jensen, J. Raes, and P. Bork Quantitative assessment of protein function prediction from metagenomics shotgun sequences PNAS, August 28, 2007; 104(35): 13913 - 13918. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Al-Shahrour, P. Minguez, J. Tarraga, I. Medina, E. Alloza, D. Montaner, and J. Dopazo FatiGO +: a functional profiling tool for genomic data. Integration of functional annotation, regulatory motifs and interaction data with microarray experiments Nucleic Acids Res., July 13, 2007; 35(suppl_2): W91 - W96. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Bleakley, G. Biau, and J.-P. Vert Supervised reconstruction of biological networks with local models Bioinformatics, July 1, 2007; 23(13): i57 - i65. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. W. Udwary, L. Zeigler, R. N. Asolkar, V. Singan, A. Lapidus, W. Fenical, P. R. Jensen, and B. S. Moore Genome sequencing reveals complex secondary metabolome in the marine actinomycete Salinispora tropica PNAS, June 19, 2007; 104(25): 10376 - 10381. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Hernandez, M. Ramirez, O. Valdes-Lopez, M. Tesfaye, M. A. Graham, T. Czechowski, A. Schlereth, M. Wandrey, A. Erban, F. Cheung, et al. Phosphorus Stress in Common Bean: Root Transcript and Metabolic Responses Plant Physiology, June 1, 2007; 144(2): 752 - 767. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Masoudi-Nejad, S. Goto, R. Jauregui, M. Ito, S. Kawashima, Y. Moriya, T. R. Endo, and M. Kanehisa EGENES: Transcriptome-Based Plant Database of Genes with Metabolic Pathway Information and Expressed Sequence Tag Indices in KEGG Plant Physiology, June 1, 2007; 144(2): 857 - 866. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. E. van Noorden, T. Kerim, N. Goffard, R. Wiblin, F. I. Pellerone, B. G. Rolfe, and U. Mathesius Overlap of Proteome Changes in Medicago truncatula in Response to Auxin and Sinorhizobium meliloti Plant Physiology, June 1, 2007; 144(2): 1115 - 1131. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Caetano-Anolles, H. S. Kim, and J. E. Mittenthal The origin of modern metabolic networks inferred from phylogenomic analysis of protein architecture PNAS, May 29, 2007; 104(22): 9358 - 9363. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Sulakhe, M. D'Souza, M. Syed, A. Rodriguez, Y. Zhang, E. M. Glass, M. F. Romine, and N. Maltsev GNARE--a grid-based server for the analysis of user submitted genomes Nucleic Acids Res., May 25, 2007; (2007) gkm366v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Yamanishi, F. Bach, and J.-P. Vert Glycan classification with tree kernels Bioinformatics, May 15, 2007; 23(10): 1211 - 1216. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Johnson, O. Fletcher, C. Palles, M. Rudd, E. Webb, G. Sellick, I. dos Santos Silva, V. McCormack, L. Gibson, A. Fraser, et al. Counting potentially functional variants in BRCA1, BRCA2 and ATM predicts breast cancer susceptibility Hum. Mol. Genet., May 1, 2007; 16(9): 1051 - 1057. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Palenik, J. Grimwood, A. Aerts, P. Rouze, A. Salamov, N. Putnam, C. Dupont, R. Jorgensen, E. Derelle, S. Rombauts, et al. The tiny eukaryote Ostreococcus provides genomic insights into the paradox of plankton speciation PNAS, May 1, 2007; 104(18): 7705 - 7710. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. J. Gaulton, K. L. Mohlke, and T. J. Vision A computational system to select candidate genes for complex human traits Bioinformatics, May 1, 2007; 23(9): 1132 - 1140. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Frickey and G. Weiller Analyzing microarray data using CLANS Bioinformatics, May 1, 2007; 23(9): 1170 - 1171. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Ganti, R. C. Hunt, S. K. Parapuram, and D. M. Hunt Vitreous Modulation of Gene Expression in Low-Passage Human Retinal Pigment Epithelial Cells Invest. Ophthalmol. Vis. Sci., April 1, 2007; 48(4): 1853 - 1863. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Feng, W. Wang, J. Cheng, Y. Ren, G. Zhao, C. Gao, Y. Tang, X. Liu, W. Han, X. Peng, et al. Genome and proteome of long-chain alkane degrading Geobacillus thermodenitrificans NG80-2 isolated from a deep-subsurface oil reservoir PNAS, March 27, 2007; 104(13): 5602 - 5607. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Pierrou, P. Broberg, R. A. O'Donnell, K. Pawlowski, R. Virtala, E. Lindqvist, A. Richter, S. J. Wilson, G. Angco, S. Moller, et al. Expression of Genes Involved in Oxidative Stress Responses in Airway Epithelial Cells of Smokers with Chronic Obstructive Pulmonary Disease Am. J. Respir. Crit. Care Med., March 15, 2007; 175(6): 577 - 586. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. J. Nijland, N. E. Schlabritz-Loutsevitch, G. B. Hubbard, P. W. Nathanielsz, and L. A. Cox Non-human primate fetal kidney transcriptome analysis indicates mammalian target of rapamycin (mTOR) is a central nutrient-responsive pathway J. Physiol., March 15, 2007; 579(3): 643 - 656. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. S. Wishart, D. Tzur, C. Knox, R. Eisner, A. C. Guo, N. Young, D. Cheng, K. Jewell, D. Arndt, S. Sawhney, et al. HMDB: the Human Metabolome Database Nucleic Acids Res., January 12, 2007; 35(suppl_1): D521 - D526. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. L. Holliday, D. E. Almonacid, G. J. Bartlett, N. M. O'Boyle, J. W. Torrance, P. Murray-Rust, J. B. O. Mitchell, and J. M. Thornton MACiE (Mechanism, Annotation and Classification in Enzymes): novel tools for searching catalytic mechanisms Nucleic Acids Res., January 12, 2007; 35(suppl_1): D515 - D520. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. D'Souza, E. M. Glass, M. H. Syed, Y. Zhang, A. Rodriguez, N. Maltsev, and M. Y. Galperin Sentra: a database of signal transduction proteins for comparative genome analysis Nucleic Acids Res., January 12, 2007; 35(suppl_1): D271 - D273. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. K. McNeil, C. Reich, R. K. Aziz, D. Bartels, M. Cohoon, T. Disz, R. A. Edwards, S. Gerdes, K. Hwang, M. Kubal, et al. The National Microbial Pathogen Database Resource (NMPDR): a genomics platform based on subsystem annotation Nucleic Acids Res., January 12, 2007; 35(suppl_1): D347 - D353. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Ng, B. Bursteinas, Q. Gao, E. Mollison, and M. Zvelebil Resources for integrative systems biology: from data through databases to networks and dynamic system models Brief Bioinform, December 1, 2006; 7(4): 318 - 330. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Suen, J. S. Jakobsen, B. S. Goldman, M. Singer, A. G. Garza, and R. D. Welch Bacterial Postgenomics: the Promise and Peril of Systems Biology{triangledown} J. Bacteriol., December 1, 2006; 188(23): 7999 - 8004. [Full Text] [PDF] |
||||
![]() |
N. Goffard and G. Weiller Extending MapMan: application to legume genome arrays Bioinformatics, December 1, 2006; 22(23): 2958 - 2959. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. A. George, J. Y. Liu, L. L. Feng, R. J. Bryson-Richardson, D. Fatkin, and M. A. Wouters Analysis of protein sequence and interaction data for candidate disease gene prediction Nucleic Acids Res., November 14, 2006; 34(19): e130 - e130. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. L. Reed, T. R. Patel, K. H. Chen, A. R. Joyce, M. K. Applebee, C. D. Herring, O. T. Bui, E. M. Knight, S. S. Fong, and B. O. Palsson Systems approach to refining genome annotation PNAS, November 14, 2006; 103(46): 17480 - 17484. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. L. Webb, M. F. Rudd, G. S. Sellick, R. El Galta, L. Bethke, W. Wood, O. Fletcher, S. Penegar, L. Withey, M. Qureshi, et al. Search for low penetrance alleles for colorectal cancer through a scan of 1467 non-synonymous SNPs in 2575 cases and 2707 controls with validation by kin-cohort analysis of 14 704 first-degree relatives Hum. Mol. Genet., November 1, 2006; 15(21): 3263 - 3271. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. C. Janga, W. F. Lamboy, A. M. Huerta, and G. Moreno-Hagelsieb The distinctive signatures of promoter regions and operon junctions across prokaryotes Nucleic Acids Res., September 1, 2006; 34(14): 3980 - 3987. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Pang, A. Lin, M. Holford, B. E. Enerson, B. Lu, M. P. Lawton, E. Floyd, and H. Zhao Pathway analysis using random forests classification and regression Bioinformatics, August 15, 2006; 22(16): 2028 - 2036. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Shioda, J. Chesnes, K. R. Coser, L. Zou, J. Hur, K. L. Dean, C. Sonnenschein, A. M. Soto, and K. J. Isselbacher Importance of dosage standardization for interpreting transcriptomal signature profiles: Evidence from studies of xenoestrogens PNAS, August 8, 2006; 103(32): 12033 - 12038. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. L. Green and P. D. Karp The outcomes of pathway database computations depend on pathway ontology Nucleic Acids Res., August 7, 2006; 34(13): 3687 - 3697. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. M. Paley and P. D. Karp The Pathway Tools cellular overview diagram and Omics Viewer Nucleic Acids Res., August 7, 2006; 34(13): 3771 - 3778. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Bryson, V. Loux, R. Bossy, P. Nicolas, S. Chaillou, M. van de Guchte, S. Penaud, E. Maguin, M. Hoebeke, P. Bessieres, et al. AGMIAL: implementing an annotation strategy for prokaryote genomes as a distributed system Nucleic Acids Res., July 19, 2006; 34(12): 3533 - 3545. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. A. Rahman and D. Schomburg Observing local and global properties of metabolic pathways: 'load points' and 'choke points' in the metabolic networks Bioinformatics, July 15, 2006; 22(14): 1767 - 1774. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Liu, S. Sivaganesan, K. Y. Yeung, J. Guo, R. E. Bumgarner, and M. Medvedovic Context-specific infinite mixtures for clustering gene expression profiles across diverse microarray dataset Bioinformatics, July 15, 2006; 22(14): 1737 - 1744. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. F. Rudd, G. S. Sellick, E. L. Webb, D. Catovsky, and R. S. Houlston Variants in the ATM-BRCA2-CHEK2 axis predispose to chronic lymphocytic leukemia Blood, July 15, 2006; 108(2): 638 - 644. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Lozada-Chavez, S. C. Janga, and J. Collado-Vides Bacterial regulatory networks are extremely flexible in evolution Nucleic Acids Res., July 13, 2006; 34(12): 3434 - 3445. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||






















