Skip Navigation

This Article
Right arrow Abstract Freely available
Right arrow Print PDF (56K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Kanehisa, M.
Right arrow Articles by Hattori, M.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Kanehisa, M.
Right arrow Articles by Hattori, M.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Nucleic Acids Research, 2004, Vol. 32, Database issue D277-D280
© 2004 Oxford University Press

The KEGG resource for deciphering the genome

Minoru Kanehisa*, Susumu Goto, Shuichi Kawashima, Yasushi Okuno and Masahiro Hattori

Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji, Kyoto 611-0011, Japan

*To whom correspondence should be addressed. Tel: +81 774 38 3270; Fax: +81 774 38 3269; Email: kanehisa{at}kuicr.kyoto-u.ac.jp

Received September 15, 2003; Revised and Accepted September 25, 2003


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 THE KEGG DATABASES
 GENE UNIVERSE
 CHEMICAL UNIVERSE
 PROTEIN NETWORK
 ACCESS METHODS
 REFERENCES
 
A grand challenge in the post-genomic era is a complete computer representation of the cell and the organism, which will enable computational prediction of higher-level complexity of cellular processes and organism behavior from genomic information. Toward this end we have been developing a knowledge-based approach for network prediction, which is to predict, given a complete set of genes in the genome, the protein interaction networks that are responsible for various cellular processes. KEGG at http://www.genome.ad.jp/kegg/ is the reference knowledge base that integrates current knowledge on molecular interaction networks such as pathways and complexes (PATHWAY database), information about genes and proteins generated by genome projects (GENES/SSDB/KO databases) and information about biochemical compounds and reactions (COMPOUND/GLYCAN/REACTION databases). These three types of database actually represent three graph objects, called the protein network, the gene universe and the chemical universe. New efforts are being made to abstract knowledge, both computationally and manually, about ortholog clusters in the KO (KEGG Orthology) database, and to collect and analyze carbohydrate structures in the GLYCAN database.


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 THE KEGG DATABASES
 GENE UNIVERSE
 CHEMICAL UNIVERSE
 PROTEIN NETWORK
 ACCESS METHODS
 REFERENCES
 
KEGG (Kyoto Encyclopedia of Genes and Genomes) is a bioinformatics resource for understanding higher-order functional meanings and utilities of the cell or the organism from its genome information. It is an integrated resource consisting of three types of database for genomic, chemical and network information, and associated software, which are all developed by the Kanehisa Laboratory (now part of the Bioinformatics Center) in the Institute for Chemical Research, Kyoto University. While KEGG has cross-references to numerous outside databases, it is intended to be a self-sufficient system for linking genomes to life at the cellular level, containing a complete set of building blocks (genes and molecules) and wiring diagrams (interaction networks) for cellular functions. Eventually, this self-sufficient system will become a computer representation of the cell and the organism, and perhaps the biosphere as well, which will enable in silico analysis of biological systems (1). Even at the current primitive stage, KEGG is widely used for analysis of various types of molecular biological data in order to obtain clues to higher-order functions.

During the past 2 years we have worked to make the KEGG resource (2,3) more accessible to automated analysis. For example, the XML representation of KEGG pathway diagrams is useful for automatic drawing of potential networks identified by two-hybrid experiments as an extension to known networks. Access to KEGG can now be made through the KEGG API (SOAP interface to KEGG), which means that the user can write a program to analyze microarray gene expression data or to annotate a newly sequenced genome by automating KEGG queries. In addition, we have released two new database components: KO for ortholog grouping and hierarchical classification of genes and GLYCAN for carbohydrate structures. Here we describe the current status and future plans of the KEGG resource.


    THE KEGG DATABASES
 TOP
 ABSTRACT
 INTRODUCTION
 THE KEGG DATABASES
 GENE UNIVERSE
 CHEMICAL UNIVERSE
 PROTEIN NETWORK
 ACCESS METHODS
 REFERENCES
 
Graph representation
To understand the overall architecture, it is useful to know that KEGG consists of three graph objects for representation and manipulation of genomic, chemical and network data. Mathematically, a graph is a set of nodes (building blocks) and edges (interactions or relations). As shown in Table 1 the three graph objects are called the gene universe (GENES, SSDB and KO databases), the chemical universe (COMPOUND, GLYCAN and REACTION databases), and the protein network (PATHWAY database). The gene universe is a conceptual graph object representing ortholog/paralog relations, operon information and other relationships between genes in all the completely sequenced genomes. The chemical universe is another conceptual graph object representing chemical reactions and structural/functional relations among metabolites and other biochemical compounds. In contrast, the protein network is based on biological phenomena, representing known molecular interaction networks in various cellular processes (2).


View this table:
[in this window]
[in a new window]
 
Table 1. The three graph objects in KEGG
 
Network hierarchy
Another important aspect in the overall architecture of KEGG is network hierarchy. The protein network, which is the most unique data object in KEGG, is stored as a collection of pathway maps in the PATHWAY database, representing wiring diagrams of proteins and other gene products responsible for various cellular functions. Reflecting the map resolution and functional modules at different levels, these pathway maps are hierarchically classified. There are five categories in the top level (metabolism, genetic information processing, environmental information processing, cellular processes and human diseases) and 24 subcategories in the second level. The third level in the hierarchy corresponds to individual pathway maps. When the protein network is linked to the gene universe, the fourth level corresponds to KO (KEGG Orthology) entries. Thus the hierarchy of gene functions in KEGG is based on the hierarchy of the protein network as shown in Table 2.


View this table:
[in this window]
[in a new window]
 
Table 2. The hierarchy of KEGG orthology (KO)
 

    GENE UNIVERSE
 TOP
 ABSTRACT
 INTRODUCTION
 THE KEGG DATABASES
 GENE UNIVERSE
 CHEMICAL UNIVERSE
 PROTEIN NETWORK
 ACCESS METHODS
 REFERENCES
 
GENES database
The information about individual genes is stored in the GENES database. As of September 12, 2003 the GENES database contains 572 881 genes in 155 organisms. The GENES entries are generated semi-automatically by selecting and combining various sources including authors’ submissions to GenBank (ftp://ftp.ncbi.nih.gov/genbank/genomes/), the NCBI RefSeq database (ftp://ftp.ncbi.nih.gov/genomes/), the EMBL database (ftp://ftp.ebi.ac.uk/pub/databases/embl/ genomes/) and publicly available organism-specific databases. They are then subjected to internal re-annotation, in which we just assign K numbers for the KO grouping of genes without updating the description of the genes. Our KO assignment appears in the ORTHOLOG line of the GENES entry.

SSDB database
SSDB was originally a sequence similarity database containing precomputed similarity scores by the SSEARCH program with additional information about best hits and best-best hits in pairwise genome comparisons (2). We have recently implemented an automatic procedure, based on a graph analytical method, to computationally generate ortholog clusters (OCs) and paralog clusters (PCs) from the huge SSDB graph, currently containing 200 million edges. The resulting ortholog clusters can be examined by clicking on the ORTHOLOG link in the GENES entry.

KO database
When the KEGG project was initiated in 1995, the integration of genomic information and network information was achieved via the EC numbers. The EC numbers were common identifiers for matching genes in the genome and gene products (enzymes) in the metabolic pathway. Then, a new scheme using ortholog identifiers was introduced (3) to extend the matching procedure to regulatory pathways and to overcome various problems inherent in the enzyme nomenclature. KO is a further extension of this scheme based on computational analysis, as well as manual curation, of SSDB ortholog clusters in order to classify all gene functions and explore unknown pathways. Each KO entry is identified by the K number (accession number) with the previous ortholog identifier as an alternative name. In future releases of KEGG, the KO hierarchy (Table 2) will become more complete, and the current ortholog group tables will be part of the KO database.


    CHEMICAL UNIVERSE
 TOP
 ABSTRACT
 INTRODUCTION
 THE KEGG DATABASES
 GENE UNIVERSE
 CHEMICAL UNIVERSE
 PROTEIN NETWORK
 ACCESS METHODS
 REFERENCES
 
COMPOUND database
The COMPOUND database contains chemical structures of most known metabolic compounds and some pharmaceutical and environmental compounds. All chemical structures are manually entered, computationally verified and continuously updated. Currently the database contains 10 739 entries, each of which is identified by the C number (accession number). A new feature implemented in the COMPOUND database is the KCF (KEGG Chemical Function) representation of chemical structures shown in Figure 1 and the resulting graph- based chemical structure comparison method (4). The COMPOUND/REACTION databases are being moved from the ISIS system (5) to the in-house-developed relational database system, in order to integrate with the GLYCAN database. Once this is done the substructure search against the database will be performed more rigorously by our graph comparison algorithm rather than the bit string comparison in the ISIS system.



View larger version (17K):
[in this window]
[in a new window]
 
Figure 1. Chemical compound structures in the COMPOUND database and carbohydrate structures in the GLYCAN database are graph objects, where the nodes are either atoms or monosaccharides and the edges are covalent bonds. For the purpose of chemical structure comparison, the chemical structure is converted to the KCF (KEGG Chemical Function) representation where the same atoms are distinguished by their environments.

 
GLYCAN database
GLYCAN is a new addition to the KEGG suite of databases. We have initiated efforts to collect carbohydrate structures because of the lack of a publicly available database after the termination of the CarbBank project (6). The pathway diagrams for metabolism of complex carbohydrates and metabolism of complex lipids are now linked to individual entries of carbohydrate structures in the GLYCAN database. The reactions catalyzed by glycosyltransferases and other sugar-related enzymes are represented in the REACTION database in a simpler form of carbohydrate structures (Fig. 1) rather than the all-atom representation. Each GLYCAN entry is identified by the G number (accession number) and the current total is 10 445 entries, among which only a few hundred were manually entered and linked to KEGG pathways. The rest represents unique structures derived from CarbBank. The GLYCAN database is maintained in a relational database with the structure drawing tool in Java. A database search is also made available based on newly developed algorithms for tree structure comparisons.

REACTION database
The REACTION database contains reaction formulas for enzymic reactions, currently totaling 5799 entries. Each entry is identified by the R number (accession number) representing a unique reaction corresponding to sets of reactants and products represented by the C number in the COMPOUND database or the G number in the GLYCAN database. This should be compared with the EC number, which may correspond to multiple reaction formulas. The EC number hierarchy is supposed to represent aspects of enzymatic reactions, but in reality it often contains aspects of enzyme molecules. Within the KEGG resource, these two aspects of EC numbers are clearly distinguished: R numbers for reactions and K numbers for molecules. We are working to develop a new hierarchy, tentatively called RC (Reaction Classification), for understanding chemistry of enzymic reactions.

ENZYME database
The ENZYME database contains enzyme nomenclature with numerous links to KEGG databases. It is generated semi-automatically from the enzyme nomenclature website (http://www.chem.qmul.ac.uk/iubmb/enzyme/). The role of this database within KEGG has diminished, but the EC number is still the simplest way to link to KEGG from outside resources.


    PROTEIN NETWORK
 TOP
 ABSTRACT
 INTRODUCTION
 THE KEGG DATABASES
 GENE UNIVERSE
 CHEMICAL UNIVERSE
 PROTEIN NETWORK
 ACCESS METHODS
 REFERENCES
 
PATHWAY database
The protein network in KEGG is an abstract network of gene products, representing not only the pathway or the complex resulting from direct protein–protein interactions, but also the metabolic network viewed as a network of enzymes, and the gene regulatory network viewed as a network of transcription factors and target products (2,3). The PATHWAY database is a collection of manually drawn diagrams called the KEGG reference pathway diagrams (maps), each corresponding to a known network of functional significance. The PATHWAY database also contains organism-specific pathways, which are automatically generated by superimposing (coloring) genes in given organisms. The database currently contains 13 457 entries including 235 reference pathway diagrams.

In the past, the pathway diagrams were available only in GIF (or PNG) image files. Although the coordinates of nodes (boxes) could be obtained from the HTML file, it was not possible to reconstruct the pathway because the edge information was not readily available. We have released the KEGG Markup Language (KGML) as a specification of graph objects in KEGG. All metabolic pathways and some regulatory pathways are now available in KGML, enabling computational reconstruction and manipulation of KEGG pathways.


    ACCESS METHODS
 TOP
 ABSTRACT
 INTRODUCTION
 THE KEGG DATABASES
 GENE UNIVERSE
 CHEMICAL UNIVERSE
 PROTEIN NETWORK
 ACCESS METHODS
 REFERENCES
 
The primary mode of access to KEGG is through the GenomeNet website at http://www.genome.ad.jp/kegg/. Different components of the KEGG resource can most conveniently be accessed from the KEGG table of contents page at http://www.genome.ad.jp/kegg/kegg2.html. The four databases for the chemical universe, COMPOUND, GLYCAN, REACTION and ENZYME are collectively called the LIGAND database with a separate home page at http://www.genome.ad.jp/ligand/. Table 3 summarizes these and other useful URLs including additional databases not covered in the present article.


View this table:
[in this window]
[in a new window]
 
Table 3. URLs for the KEGG resource
 
For computerized access to KEGG, the SOAP server is open to academic users at http://www.genome.ad.jp/kegg/soap/. All the KEGG databases, except SSDB, are also available to academic users by anonymous FTP at http://www.genome. ad.jp/anonftp/.


    ACKNOWLEDGEMENTS
 
The computational resource was provided by the Bioinformatics Center, Institute for Chemical Research, Kyoto University. This work was supported by grants from the Ministry of Education, Culture, Sports, Science and Technology of Japan, the Japan Society for the Promotion of Science and the Japan Science and Technology Agency.


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 THE KEGG DATABASES
 GENE UNIVERSE
 CHEMICAL UNIVERSE
 PROTEIN NETWORK
 ACCESS METHODS
 REFERENCES
 

  1. Kanehisa,M. and Bork,P. (2003) Bioinformatics in the post-sequence era. Nature Genet., 33, 305–310.

  2. Kanehisa,M., Goto,S., Kawashima,S. and Nakaya,A. (2002) The KEGG databases at GenomeNet. Nucleic Acids Res., 30, 42–46.[Abstract/Free Full Text]

  3. Kanehisa,M. and Goto,S. (2000) KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Res., 28, 27–30.[Abstract/Free Full Text]

  4. Hattori,M., Okuno,Y., Goto,S. and Kanehisa,M. (2003) Development of a chemical structure comparison method for integrated analysis of chemical and genomic information in the metabolic pathways. J. Am. Chem. Soc., 125, 11853–11865.[CrossRef][Web of Science][Medline]

  5. Goto,S., Okuno,Y., Hattori,M., Nishioka,T. and Kanehisa,M. (2001) LIGAND: database of chemical compounds and reactions in biological pathways. Nucleic Acids Res., 30, 402–404.

  6. Doubet,S., Bock,K., Smith,D., Darvill,A. and Albersheim,P. (1989) The complex carbohydrate structure database. Trends Biochem. Sci., 14, 475–477.[CrossRef][Web of Science][Medline]


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Nucleic Acids ResHome page
Y. Wang, E. Bolton, S. Dracheva, K. Karapetyan, B. A. Shoemaker, T. O. Suzek, J. Wang, J. Xiao, J. Zhang, and S. H. Bryant
An overview of the PubChem BioAssay resource
Nucleic Acids Res., November 19, 2009; (2009) gkp965v1.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
A. Senf and X.-w. Chen
Identification of genes involved in the same pathways using a Hidden Markov Model-based approach
Bioinformatics, November 15, 2009; 25(22): 2945 - 2954.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
P. S. Dehal, M. P. Joachimiak, M. N. Price, J. T. Bates, J. K. Baumohl, D. Chivian, G. D. Friedland, K. H. Huang, K. Keller, P. S. Novichkov, et al.
MicrobesOnline: an integrated portal for comparative and functional genomics
Nucleic Acids Res., November 11, 2009; (2009) gkp919v1.
[Abstract] [Full Text] [PDF]


Home page
Sci Transl MedHome page
P. J. Turnbaugh, V. K. Ridaura, J. J. Faith, F. E. Rey, R. Knight, and J. I. Gordon
The Effect of Diet on the Human Gut Microbiome: A Metagenomic Analysis in Humanized Gnotobiotic Mice
Science Translational Medicine, November 11, 2009; 1(6): 6ra14 - 6ra14.
[Abstract] [Full Text] [PDF]


Home page
MicrobiologyHome page
Y. Hu, Y. Wang, L. Ding, P. Lu, S. Atkinson, and S. Chen
Positive regulation of flhDC expression by OmpR in Yersinia pseudotuberculosis
Microbiology, November 1, 2009; 155(11): 3622 - 3631.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
G. Hernandez, O. Valdes-Lopez, M. Ramirez, N. Goffard, G. Weiller, R. Aparicio-Fabre, S. I. Fuentes, A. Erban, J. Kopka, M. K. Udvardi, et al.
Global Changes in the Transcript and Metabolic Profiles during Symbiotic Nitrogen Fixation in Phosphorus-Stressed Common Bean Plants
Plant Physiology, November 1, 2009; 151(3): 1221 - 1238.
[Abstract] [Full Text] [PDF]


Home page
ScienceHome page
A. Beloqui, M.-E. Guazzaroni, F. Pazos, J. M. Vieites, M. Godoy, O. V. Golyshina, T. N. Chernikova, A. Waliczek, R. Silva-Rocha, Y. Al-ramahi, et al.
Reactome Array: Forging a Link Between Metabolome and Genome
Science, October 9, 2009; 326(5950): 252 - 257.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
E. R.M. Tillier and R. L. Charlebois
The human protein coevolution network
Genome Res., October 1, 2009; 19(10): 1861 - 1871.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
H. S. Najafabadi, H. Goodarzi, and R. Salavati
Universal function-specificity of codon usage
Nucleic Acids Res., September 22, 2009; (2009) gkp792v1.
[Abstract] [Full Text] [PDF]


Home page
Brief BioinformHome page
C. M. Song, S. J. Lim, and J. C. Tong
Recent advances in computer-aided drug design
Brief Bioinform, September 1, 2009; 10(5): 579 - 591.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
Y. Benita, H. Kikuchi, A. D. Smith, M. Q. Zhang, D. C. Chung, and R. J. Xavier
An integrative genomics approach identifies Hypoxia Inducible Factor-1 (HIF-1)-target genes that form the core response to hypoxia
Nucleic Acids Res., August 1, 2009; 37(14): 4587 - 4602.
[Abstract] [Full Text] [PDF]


Home page
Brief BioinformHome page
J. Skolnick and M. Brylinski
FINDSITE: a combined evolution/structure-based approach to protein function prediction
Brief Bioinform, July 1, 2009; 10(4): 378 - 391.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
S. Nam, M. Li, K. Choi, C. Balch, S. Kim, and K. P. Nephew
MicroRNA and mRNA integrated analysis (MMIA): a web tool for examining biological functions of microRNA expression
Nucleic Acids Res., July 1, 2009; 37(suppl_2): W356 - W362.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
I. Medina, D. Montaner, N. Bonifaci, M. A. Pujana, J. Carbonell, J. Tarraga, F. Al-Shahrour, and J. Dopazo
Gene set-based analysis of polymorphisms: finding pathways or biological processes associated to traits in genome-wide association studies
Nucleic Acids Res., July 1, 2009; 37(suppl_2): W340 - W344.
[Abstract] [Full Text] [PDF]


Home page
J. Virol.Home page
S. Robinzon, A. Dafa-Berger, M. D. Dyer, B. Paeper, S. C. Proll, T. H. Teal, S. Rom, D. Fishman, B. Rager-Zisman, and M. G. Katze
Impaired Cholesterol Biosynthesis in a Neuronal Cell Line Persistently Infected with Measles Virus
J. Virol., June 1, 2009; 83(11): 5495 - 5504.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
N. E. Lewis, B.-K. Cho, E. M. Knight, and B. O. Palsson
Gene Expression Profiling and the Use of Genome-Scale In Silico Models of Escherichia coli for Analysis: Providing Context for Content
J. Bacteriol., June 1, 2009; 191(11): 3437 - 3444.
[Full Text] [PDF]


Home page
Genome ResHome page
Z. Tu, C. Argmann, K. K. Wong, L. J. Mitnaul, S. Edwards, I. C. Sach, J. Zhu, and E. E. Schadt
Integrating siRNA and protein-protein interaction data to identify an expanded insulin signaling network
Genome Res., June 1, 2009; 19(6): 1057 - 1067.
[Abstract] [Full Text] [PDF]


Home page
Am. J. Pathol.Home page
M. L. Calicchio, T. Collins, and H. P. Kozakewich
Identification of Signaling Systems in Proliferating and Involuting Phase Infantile Hemangiomas by Genome-Wide Transcriptional Profiling
Am. J. Pathol., May 1, 2009; 174(5): 1638 - 1649.
[Abstract] [Full Text] [PDF]


Home page
Cancer Res.Home page
V. C. Daniel, L. Marchionni, J. S. Hierman, J. T. Rhodes, W. L. Devereux, C. M. Rudin, R. Yung, G. Parmigiani, M. Dorsch, C. D. Peacock, et al.
A Primary Xenograft Model of Small-Cell Lung Cancer Reveals Irreversible Changes in Gene Expression Imposed by Culture In vitro
Cancer Res., April 15, 2009; 69(8): 3364 - 3373.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
C. A. Pratilas, B. S. Taylor, Q. Ye, A. Viale, C. Sander, D. B. Solit, and N. Rosen
V600EBRAF is associated with disabled feedback inhibition of RAF-MEK signaling and elevated transcriptional output of the pathway
PNAS, March 17, 2009; 106(11): 4519 - 4524.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
X. Wu and M Watson
CORNA: testing gene lists for regulation by microRNAs
Bioinformatics, March 15, 2009; 25(6): 832 - 833.
[Abstract] [Full Text] [PDF]


Home page
Mol. Cell. ProteomicsHome page
C. Pan, C. Kumar, S. Bohl, U. Klingmueller, and M. Mann
Comparative Proteomic Phenotyping of Cell Lines and Primary Cells to Assess Preservation of Cell Type-specific Functions
Mol. Cell. Proteomics, March 1, 2009; 8(3): 443 - 450.
[Abstract] [Full Text] [PDF]


Home page
BloodHome page
D. C. Johnson, S. Corthals, C. Ramos, A. Hoering, K. Cocks, N. J. Dickens, J. Haessler, H. Goldschmidt, J. A. Child, S. E. Bell, et al.
Genetic associations with thalidomide mediated venous thrombotic events in myeloma identified using targeted genotyping
Blood, December 15, 2008; 112(13): 4924 - 4934.
[Abstract] [Full Text] [PDF]


Home page
J. Leukoc. Biol.Home page
H. Dinh, G. M. Scholz, and J. A. Hamilton
Regulation of WAVE1 expression in macrophages at multiple levels
J. Leukoc. Biol., December 1, 2008; 84(6): 1483 - 1491.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
L. Jacob and J.-P. Vert
Protein-ligand interaction prediction: an improved chemogenomics approach
Bioinformatics, October 1, 2008; 24(19): 2149 - 2156.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
G. T. Chung, J. S. Yoo, H. B. Oh, Y. S. Lee, S. H. Cha, S. J. Kim, and C. K. Yoo
Complete Genome Sequence of Neisseria gonorrhoeae NCCP11945
J. Bacteriol., September 1, 2008; 190(17): 6035 - 6036.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
S. Schmitz-Esser, I. Haferkamp, S. Knab, T. Penz, M. Ast, C. Kohl, M. Wagner, and M. Horn
Lawsonia intracellularis Contains a Gene Encoding a Functional Rickettsia-Like ATP/ADP Translocase for Host Exploitation
J. Bacteriol., September 1, 2008; 190(17): 5746 - 5752.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
S. F. Saccone, N. L. Saccone, G. E. Swan, P. A. F. Madden, A. M. Goate, J. P. Rice, and L. J. Bierut
Systematic biological prioritization after a genome-wide association study: an application to nicotine dependence
Bioinformatics, August 15, 2008; 24(16): 1805 - 1811.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
Y.-H. Chen, C.-K. Liu, S.-C. Chang, Y.-J. Lin, M.-F. Tsai, Y.-T. Chen, and A. Yao
GenoWatch: a disease gene mining browser for association study
Nucleic Acids Res., July 1, 2008; 36(suppl_2): W336 - W340.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
F. Al-Shahrour, J. Carbonell, P. Minguez, S. Goetz, A. Conesa, J. Tarraga, I. Medina, E. Alloza, D. Montaner, and J. Dopazo
Babelomics: advanced functional profiling of transcriptomics, proteomics and genomics experiments
Nucleic Acids Res., July 1, 2008; 36(suppl_2): W341 - W346.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
H. Takarada, M. Sekine, H. Kosugi, Y. Matsuo, T. Fujisawa, S. Omata, E. Kishi, A. Shimizu, N. Tsukatani, S. Tanikawa, et al.
Complete Genome Sequence of the Soil Actinomycete Kocuria rhizophila
J. Bacteriol., June 15, 2008; 190(12): 4139 - 4146.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
O. X. Cordero, B. Snel, and P. Hogeweg
Coevolution of gene families in prokaryotes
Genome Res., March 1, 2008; 18(3): 462 - 468.
[Abstract] [Full Text] [PDF]


Home page
J R Soc InterfaceHome page
P. R Kensche, V. van Noort, B. E Dutilh, and M. A Huynen
Practical and theoretical advances in predicting the function of a protein by its phylogenetic distribution
J R Soc Interface, February 6, 2008; 5(19): 151 - 170.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
D. Juan, F. Pazos, and A. Valencia
High-confidence prediction of global interactomes based on genome-wide coevolutionary networks
PNAS, January 22, 2008; 105(3): 934 - 939.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
V. M. Markowitz, N. N. Ivanova, E. Szeto, K. Palaniappan, K. Chu, D. Dalevi, I-M. A. Chen, Y. Grechkin, I. Dubchak, I. Anderson, et al.
IMG/M: a data management and analysis system for metagenomes
Nucleic Acids Res., January 11, 2008; 36(suppl_1): D534 - D538.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
V. M. Markowitz, E. Szeto, K. Palaniappan, Y. Grechkin, K. Chu, I-M. A. Chen, I. Dubchak, I. Anderson, A. Lykidis, K. Mavromatis, et al.
The integrated microbial genomes (IMG) system in 2007: data content and analysis tool extensions
Nucleic Acids Res., January 11, 2008; 36(suppl_1): D528 - D533.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
S. Nam, B. Kim, S. Shin, and S. Lee
miRGator: an integrated system for functional annotation of microRNAs
Nucleic Acids Res., January 11, 2008; 36(suppl_1): D159 - D164.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
M. G. Conte, S. Gaillard, N. Lanau, M. Rouard, and C. Perin
GreenPhylDB: a database for plant comparative genomics
Nucleic Acids Res., January 11, 2008; 36(suppl_1): D991 - D998.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
Y. Okuno, A. Tamon, H. Yabuuchi, S. Niijima, Y. Minowa, K. Tonomura, R. Kunimoto, and C. Feng
GLIDA: GPCR ligand database for chemical genomics drug discovery database and tools update
Nucleic Acids Res., January 11, 2008; 36(suppl_1): D907 - D912.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
J. Donitz, B. Goemann, M. Lize, H. Michael, N. Sasse, E. Wingender, and A. P. Potapov
EndoNet: an information resource about regulatory networks of cell-to-cell communication
Nucleic Acids Res., January 11, 2008; 36(suppl_1): D689 - D694.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
H. Shin, A. M. Lisewski, and O. Lichtarge
Graph sharpening plus graph integration: a synergy that improves protein functional classification
Bioinformatics, December 1, 2007; 23(23): 3217 - 3224.
[Abstract] [Full Text] [PDF]


Home page
Infect. Immun.Home page
H. J. Bootsma, M. Egmont-Petersen, and P. W. M. Hermans
Analysis of the In Vitro Transcriptional Response of Human Pharyngeal Epithelial Cells to Adherent Streptococcus pneumoniae: Evidence for a Distinct Response to Encapsulated Strains
Infect. Immun., November 1, 2007; 75(11): 5489 - 5499.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
A. Cakmak and G. Ozsoyoglu
Mining biological networks for unknown pathways
Bioinformatics, October 15, 2007; 23(20): 2775 - 2783.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
E. D. Harrington, A. H. Singh, T. Doerks, I. Letunic, C. von Mering, L. J. Jensen, J. Raes, and P. Bork
Quantitative assessment of protein function prediction from metagenomics shotgun sequences
PNAS, August 28, 2007; 104(35): 13913 - 13918.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
F. Al-Shahrour, P. Minguez, J. Tarraga, I. Medina, E. Alloza, D. Montaner, and J. Dopazo
FatiGO +: a functional profiling tool for genomic data. Integration of functional annotation, regulatory motifs and interaction data with microarray experiments
Nucleic Acids Res., July 13, 2007; 35(suppl_2): W91 - W96.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
K. Bleakley, G. Biau, and J.-P. Vert
Supervised reconstruction of biological networks with local models
Bioinformatics, July 1, 2007; 23(13): i57 - i65.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
D. W. Udwary, L. Zeigler, R. N. Asolkar, V. Singan, A. Lapidus, W. Fenical, P. R. Jensen, and B. S. Moore
Genome sequencing reveals complex secondary metabolome in the marine actinomycete Salinispora tropica
PNAS, June 19, 2007; 104(25): 10376 - 10381.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
G. Hernandez, M. Ramirez, O. Valdes-Lopez, M. Tesfaye, M. A. Graham, T. Czechowski, A. Schlereth, M. Wandrey, A. Erban, F. Cheung, et al.
Phosphorus Stress in Common Bean: Root Transcript and Metabolic Responses
Plant Physiology, June 1, 2007; 144(2): 752 - 767.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
A. Masoudi-Nejad, S. Goto, R. Jauregui, M. Ito, S. Kawashima, Y. Moriya, T. R. Endo, and M. Kanehisa
EGENES: Transcriptome-Based Plant Database of Genes with Metabolic Pathway Information and Expressed Sequence Tag Indices in KEGG
Plant Physiology, June 1, 2007; 144(2): 857 - 866.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
G. E. van Noorden, T. Kerim, N. Goffard, R. Wiblin, F. I. Pellerone, B. G. Rolfe, and U. Mathesius
Overlap of Proteome Changes in Medicago truncatula in Response to Auxin and Sinorhizobium meliloti
Plant Physiology, June 1, 2007; 144(2): 1115 - 1131.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
G. Caetano-Anolles, H. S. Kim, and J. E. Mittenthal
The origin of modern metabolic networks inferred from phylogenomic analysis of protein architecture
PNAS, May 29, 2007; 104(22): 9358 - 9363.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
D. Sulakhe, M. D'Souza, M. Syed, A. Rodriguez, Y. Zhang, E. M. Glass, M. F. Romine, and N. Maltsev
GNARE--a grid-based server for the analysis of user submitted genomes
Nucleic Acids Res., May 25, 2007; (2007) gkm366v1.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
Y. Yamanishi, F. Bach, and J.-P. Vert
Glycan classification with tree kernels
Bioinformatics, May 15, 2007; 23(10): 1211 - 1216.
[Abstract] [Full Text] [PDF]


Home page
Hum Mol GenetHome page
N. Johnson, O. Fletcher, C. Palles, M. Rudd, E. Webb, G. Sellick, I. dos Santos Silva, V. McCormack, L. Gibson, A. Fraser, et al.
Counting potentially functional variants in BRCA1, BRCA2 and ATM predicts breast cancer susceptibility
Hum. Mol. Genet., May 1, 2007; 16(9): 1051 - 1057.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
B. Palenik, J. Grimwood, A. Aerts, P. Rouze, A. Salamov, N. Putnam, C. Dupont, R. Jorgensen, E. Derelle, S. Rombauts, et al.
The tiny eukaryote Ostreococcus provides genomic insights into the paradox of plankton speciation
PNAS, May 1, 2007; 104(18): 7705 - 7710.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
K. J. Gaulton, K. L. Mohlke, and T. J. Vision
A computational system to select candidate genes for complex human traits
Bioinformatics, May 1, 2007; 23(9): 1132 - 1140.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
T. Frickey and G. Weiller
Analyzing microarray data using CLANS
Bioinformatics, May 1, 2007; 23(9): 1170 - 1171.
[Abstract] [Full Text] [PDF]


Home page
IOVSHome page
R. Ganti, R. C. Hunt, S. K. Parapuram, and D. M. Hunt
Vitreous Modulation of Gene Expression in Low-Passage Human Retinal Pigment Epithelial Cells
Invest. Ophthalmol. Vis. Sci., April 1, 2007; 48(4): 1853 - 1863.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
L. Feng, W. Wang, J. Cheng, Y. Ren, G. Zhao, C. Gao, Y. Tang, X. Liu, W. Han, X. Peng, et al.
Genome and proteome of long-chain alkane degrading Geobacillus thermodenitrificans NG80-2 isolated from a deep-subsurface oil reservoir
PNAS, March 27, 2007; 104(13): 5602 - 5607.
[Abstract] [Full Text] [PDF]


Home page
Am. J. Respir. Crit. Care Med.Home page
S. Pierrou, P. Broberg, R. A. O'Donnell, K. Pawlowski, R. Virtala, E. Lindqvist, A. Richter, S. J. Wilson, G. Angco, S. Moller, et al.
Expression of Genes Involved in Oxidative Stress Responses in Airway Epithelial Cells of Smokers with Chronic Obstructive Pulmonary Disease
Am. J. Respir. Crit. Care Med., March 15, 2007; 175(6): 577 - 586.
[Abstract] [Full Text] [PDF]


Home page
J. Physiol.Home page
M. J. Nijland, N. E. Schlabritz-Loutsevitch, G. B. Hubbard, P. W. Nathanielsz, and L. A. Cox
Non-human primate fetal kidney transcriptome analysis indicates mammalian target of rapamycin (mTOR) is a central nutrient-responsive pathway
J. Physiol., March 15, 2007; 579(3): 643 - 656.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
D. S. Wishart, D. Tzur, C. Knox, R. Eisner, A. C. Guo, N. Young, D. Cheng, K. Jewell, D. Arndt, S. Sawhney, et al.
HMDB: the Human Metabolome Database
Nucleic Acids Res., January 12, 2007; 35(suppl_1): D521 - D526.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
G. L. Holliday, D. E. Almonacid, G. J. Bartlett, N. M. O'Boyle, J. W. Torrance, P. Murray-Rust, J. B. O. Mitchell, and J. M. Thornton
MACiE (Mechanism, Annotation and Classification in Enzymes): novel tools for searching catalytic mechanisms
Nucleic Acids Res., January 12, 2007; 35(suppl_1): D515 - D520.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
M. D'Souza, E. M. Glass, M. H. Syed, Y. Zhang, A. Rodriguez, N. Maltsev, and M. Y. Galperin
Sentra: a database of signal transduction proteins for comparative genome analysis
Nucleic Acids Res., January 12, 2007; 35(suppl_1): D271 - D273.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
L. K. McNeil, C. Reich, R. K. Aziz, D. Bartels, M. Cohoon, T. Disz, R. A. Edwards, S. Gerdes, K. Hwang, M. Kubal, et al.
The National Microbial Pathogen Database Resource (NMPDR): a genomics platform based on subsystem annotation
Nucleic Acids Res., January 12, 2007; 35(suppl_1): D347 - D353.
[Abstract] [Full Text] [PDF]


Home page
Brief BioinformHome page
A. Ng, B. Bursteinas, Q. Gao, E. Mollison, and M. Zvelebil
Resources for integrative systems biology: from data through databases to networks and dynamic system models
Brief Bioinform, December 1, 2006; 7(4): 318 - 330.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
G. Suen, J. S. Jakobsen, B. S. Goldman, M. Singer, A. G. Garza, and R. D. Welch
Bacterial Postgenomics: the Promise and Peril of Systems Biology{triangledown}
J. Bacteriol., December 1, 2006; 188(23): 7999 - 8004.
[Full Text] [PDF]


Home page
BioinformaticsHome page
N. Goffard and G. Weiller
Extending MapMan: application to legume genome arrays
Bioinformatics, December 1, 2006; 22(23): 2958 - 2959.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
R. A. George, J. Y. Liu, L. L. Feng, R. J. Bryson-Richardson, D. Fatkin, and M. A. Wouters
Analysis of protein sequence and interaction data for candidate disease gene prediction
Nucleic Acids Res., November 14, 2006; 34(19): e130 - e130.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
J. L. Reed, T. R. Patel, K. H. Chen, A. R. Joyce, M. K. Applebee, C. D. Herring, O. T. Bui, E. M. Knight, S. S. Fong, and B. O. Palsson
Systems approach to refining genome annotation
PNAS, November 14, 2006; 103(46): 17480 - 17484.
[Abstract] [Full Text] [PDF]


Home page
Hum Mol GenetHome page
E. L. Webb, M. F. Rudd, G. S. Sellick, R. El Galta, L. Bethke, W. Wood, O. Fletcher, S. Penegar, L. Withey, M. Qureshi, et al.
Search for low penetrance alleles for colorectal cancer through a scan of 1467 non-synonymous SNPs in 2575 cases and 2707 controls with validation by kin-cohort analysis of 14 704 first-degree relatives
Hum. Mol. Genet., November 1, 2006; 15(21): 3263 - 3271.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
S. C. Janga, W. F. Lamboy, A. M. Huerta, and G. Moreno-Hagelsieb
The distinctive signatures of promoter regions and operon junctions across prokaryotes
Nucleic Acids Res., September 1, 2006; 34(14): 3980 - 3987.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
H. Pang, A. Lin, M. Holford, B. E. Enerson, B. Lu, M. P. Lawton, E. Floyd, and H. Zhao
Pathway analysis using random forests classification and regression
Bioinformatics, August 15, 2006; 22(16): 2028 - 2036.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
T. Shioda, J. Chesnes, K. R. Coser, L. Zou, J. Hur, K. L. Dean, C. Sonnenschein, A. M. Soto, and K. J. Isselbacher
Importance of dosage standardization for interpreting transcriptomal signature profiles: Evidence from studies of xenoestrogens
PNAS, August 8, 2006; 103(32): 12033 - 12038.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
M. L. Green and P. D. Karp
The outcomes of pathway database computations depend on pathway ontology
Nucleic Acids Res., August 7, 2006; 34(13): 3687 - 3697.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
S. M. Paley and P. D. Karp
The Pathway Tools cellular overview diagram and Omics Viewer
Nucleic Acids Res., August 7, 2006; 34(13): 3771 - 3778.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
K. Bryson, V. Loux, R. Bossy, P. Nicolas, S. Chaillou, M. van de Guchte, S. Penaud, E. Maguin, M. Hoebeke, P. Bessieres, et al.
AGMIAL: implementing an annotation strategy for prokaryote genomes as a distributed system
Nucleic Acids Res., July 19, 2006; 34(12): 3533 - 3545.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
S. A. Rahman and D. Schomburg
Observing local and global properties of metabolic pathways: 'load points' and 'choke points' in the metabolic networks
Bioinformatics, July 15, 2006; 22(14): 1767 - 1774.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
X. Liu, S. Sivaganesan, K. Y. Yeung, J. Guo, R. E. Bumgarner, and M. Medvedovic
Context-specific infinite mixtures for clustering gene expression profiles across diverse microarray dataset
Bioinformatics, July 15, 2006; 22(14): 1737 - 1744.
[Abstract] [Full Text] [PDF]


Home page
BloodHome page
M. F. Rudd, G. S. Sellick, E. L. Webb, D. Catovsky, and R. S. Houlston
Variants in the ATM-BRCA2-CHEK2 axis predispose to chronic lymphocytic leukemia
Blood, July 15, 2006; 108(2): 638 - 644.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
I. Lozada-Chavez, S. C. Janga, and J. Collado-Vides
Bacterial regulatory networks are extremely flexible in evolution
Nucleic Acids Res., July 13, 2006; 34(12): 3434 - 3445.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Print PDF (56K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Kanehisa, M.
Right arrow Articles by Hattori, M.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Kanehisa, M.
Right arrow Articles by Hattori, M.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?