Nucleic Acids Research, 2002, Vol. 30, No. 1 402-404
© 2002 Oxford University Press
LIGAND: database of chemical compounds and reactions in biological pathways
Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji, Kyoto 611-0011, Japan and 1Graduate School of Agricultural Sciences, Kyoto University, Sakyo-ku, Kyoto 606-8502, Japan
Received September 19, 2001; Accepted September 26, 2001.
| ABSTRACT |
|---|
|
|
|---|
LIGAND is a composite database comprising three sections: COMPOUND for the information about metabolites and other chemical compounds, REACTION for the collection of substrateproduct relations representing metabolic and other reactions, and ENZYME for the information about enzyme molecules. The current release (as of September 7, 2001) includes 7298 compounds, 5166 reactions and 3829 enzymes. In addition to the keyword search provided by the DBGET/LinkDB system, a substructure search to the COMPOUND and REACTION sections is now available through the World Wide Web (http://www.genome.ad.jp/ligand/). LIGAND may be also downloaded by anonymous FTP (ftp://ftp.genome.ad.jp/pub/kegg/ligand/).
| INTRODUCTION |
|---|
|
|
|---|
The completion of the human genome sequence and those of many other organisms, including several dozens of bacteria, accelerated post-genome projects aimed at elucidating the blueprint of life from a scientific point of view. They are also aimed at discovering new drugs and other useful materials, and at deriving biodegradation pathways of xenobiotic chemicals such as pollutants and toxins from medical, industrial and environmental viewpoints. All of them require chemical information, which is not stored in the genome, in addition to information about genes and proteins, which is derived from the genome, and chembioinformatics has been considered as one of the important research fields in the post-genome era. The LIGAND database (1) has been organized to fill in the gap between genomic information and chemical information, and applied to actual reconstruction of metabolic pathways in the completely sequenced organisms in the Kyoto Encyclopedia of Genes and Genomes (KEGG) (2,3).
The LIGAND database is a composite database comprising three sections: COMPOUND, for the information about metabolites and other chemical compounds; REACTION, for the collection of substrateproduct relationships representing metabolic and other reactions; and ENZYME, for the information about enzyme molecules. We report here the current status of the LIGAND database, where efforts are being made to add more data in the COMPOUND and REACTION sections, and the new features of the two sections including the substructure search facility.
| CURRENT STATUS OF LIGAND |
|---|
|
|
|---|
The COMPOUND and ENZYME sections are constructed as flat-file databases and the data format of each section is similar to those of GenBank (4) flat files: a fixed number of columns is assigned to specify each field of entry (1). COMPOUND and REACTION sections are now organized and maintained as ISIS format (see below).
The COMPOUND section contains a collection of chemical compounds that are found in the KEGG/PATHWAY database and in the ENZYME section, as well as other compounds found in literature. The REACTION section is a collection of chemical reactions, mostly enzymatic reactions, represented as conversions of chemical structures. The ENZYME section is based on the enzyme nomenclature of the International Union of Biochemistry and Molecular Biology (IUBMB) (5), which is also available from the World Wide Web (http://www.chem.qmw.ac.uk/iubmb/enzyme/). We have added several links to other databases such as OMIM (6) for human genetic diseases, PROSITE (7) for amino acid sequence motifs, PDB (8) for protein structures, in addition to PATHWAY and GENES databases in KEGG.
The number of entries in the current release is summarized in Table 1.
|
| NEW FEATURES OF COMPOUND AND REACTION |
|---|
|
|
|---|
New types of compounds: drugs and xenobiotic chemicals
The COMPOUND section was originally created by extracting chemical compounds from the metabolic pathways of the KEGG/PATHWAY database, as well as the ENZYME section of LIGAND. The current version of COMPOUND includes xenobiotic chemicals such as environmental pollutants and toxins, because KEGG has an agreement with UM-BBD (9) to include biodegradation pathways of xenobiotic chemicals in KEGG/PATHWAY. Efforts have also been made to add more drug-related chemicals, and their ratio is increasing. They will be used, for example, as the starting compounds to search possible degradation pathways, which will connect to the existing pathways presented in the KEGG/PATHWAY database.
Compounds and reactions in the ISIS database
The COMPOUND and REACTION sections are now managed as MDB and RXN formats, respectively, which are database formats for handling chemical structures by the ISIS/HOST database (Fig. 1, upper part). ISIS automatically computes molecular formulae and weights once the curator of the COMPOUND database inputs a chemical structure. Curators of REACTION update the information about the substrates and products by the compound IDs, not by their structures, because we developed a program which automatically imports the compound structures of each reaction from the COMPOUND section.
|
Substructure searches using ISIS database and Chemscape
Because COMPOUND and REACTION are stored in the ISIS/HOST database, they can be accessed through the Chemscape server. This enables users to search by compound structures, in addition to the keyword search originally provided by the DBGET/LinkDB system (10,11). Although the Chime plug-in and ISIS/Draw are required for the structure search, they are freely available from the MDL web site (http://www.mdli.com/) for academic users. The relationship between the ISIS version and the DBGET version of LIGAND is summarized in Figure 1.
| AVAILABILITY |
|---|
|
|
|---|
The LIGAND database is accessible through the World Wide Web at http://www.genome.ad.jp/ligand/. The user can then invoke the DBGET/LinkDB system to retrieve COMPOUND and ENZYME, the ISIS/Chemscape-based system to retrieve COMPOUND and REACTION by substructure or chemical formula.
The LIGAND database can be downloaded via anonymous FTP at ftp://ftp.genome.ad.jp/pub/kegg/ligand/. This directory contains all sections, COMPOUND, ENZYME and REACTION, including GIF image files and MDL-MOL files for compound structures. The same data set is mirrored at the NCBI repository ftp://ncbi.nlm.nih.gov/repository/LIGAND/.
The basic concept of the LIGAND database has been published elsewhere (1). The present article reflects the most up-to-date version of the database and should be cited accordingly.
| ACKNOWLEDGEMENTS |
|---|
We thank Nobue Takeuchi, Tomoko Komeno, Rumiko Yamamoto and Yuriko Matsuura for inputting the compound and reaction data. We also thank Koichiro Tonomura for developing the interface of the ISIS system for searching and updating COMPOUND and REACTION. The computational resource was provided by the Bioinformatics Center, Institute for Chemical Research, Kyoto University. This work was supported by the grants from the Ministry of Education, Culture, Sports, Science and Technology of Japan, Japan Society for the Promotion of Science, and Japan Science and Technology Corporation.
| FOOTNOTES |
|---|
* To whom correspondence should be addressed. Tel: +81 774 38 3271; Fax: +81 774 38 3269; Email: goto{at}kuicr.kyoto-u.ac.jp
| REFERENCES |
|---|
|
|
|---|
-
1 Goto,S., Nishioka,T. and Kanehisa,M. (1998) LIGAND: chemical database for enzyme reactions. Bioinformatics, 14, 591599.
2 Kanehisa,M., Goto,S., Kawashima,S. and Nakaya,A. (2002) The KEGG databases at GenomeNet. Nucleic Acids Res., 30, 4246.
3 Kanehisa,M. (1997) A database for post-genome analysis. Trends Genet., 13, 375376.[Web of Science][Medline]
4 Benson,D.A., Karsch-Mizrachi,I., Lipman,D.J., Ostell,J., Rapp,B.A. and Wheeler,D.L. (2002) GenBank. Nucleic Acids Res., 30, 1720.
5 International Union of Biochemistry and Molecular Biology (1992) Enzyme Nomenclature: Recommendations (1992) of the Nomenclature Committee of the International Union of Biochemistry and Molecular Biology. Academic Press, New York.
6 Pearson,P., Francomano,C., Foster,P., Bocchini,C., Li,P. and McKusick,V. (1994) The status of online Mendelian inheritance in man (OMIM) medio 1994. Nucleic Acids Res., 22, 34703473. Updated article in this issue: Nucleic Acids Res. (2002), 30, 5255.
7 Falquet,L., Pagni,M., Bucher,P., Hulo,N., Sigrist,C.J.A., Hofmann,K. and Bairoch,A. (2002) The PROSITE database, its status in 2002. Nucleic Acids Res., 30, 235238.
8 Westbrook,J., Feng,Z., Jain,S., Bhat,T.N., Thanki,N., Ravichandran,V., Gilliland,G.L., Bluhm,W., Weissig,H., Greer,D.S., Bowne,P.E., Berman,H.M. (2002) The Protein Data Bank: unifying the archive. Nucleic Acids Res., 30, 245248.
9 Ellis,L.B.M., Hershberger,C.D., Bryan,E.M. and Wackett,L.P. (2001) The University of Minnesota Biocatalysis/Biodegradation Database: emphasizing enzymes. Nucleic Acids Res., 29, 340343.
10 Fujibuchi,W., Goto,S., Migimatsu,H., Uchiyama,I., Ogiwara,A., Akiyama,Y. and Kanehisa,M. (1998) DBGET/LinkDB: an integrated database retrieval system. Pac. Symp. Biocomput., 683694.
11 Kanehisa,M. (1997) Linking databases and organisms: GenomeNet resources in Japan. Trends Biochem. Sci., 22, 442444.[Web of Science][Medline]
This article has been cited by other articles:
![]() |
K. M. Hettne, R. H. Stierum, M. J. Schuemie, P. J. M. Hendriksen, B. J. A. Schijvenaars, E. M. v. Mulligen, J. Kleinjans, and J. A. Kors A dictionary to identify small molecules and drugs in free text Bioinformatics, November 15, 2009; 25(22): 2983 - 2991. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. M. Song, S. J. Lim, and J. C. Tong Recent advances in computer-aided drug design Brief Bioinform, September 1, 2009; 10(5): 579 - 591. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Goffard, T. Frickey, and G. Weiller PathExpress update: the enzyme neighbourhood method of associating gene-expression data with metabolic pathways Nucleic Acids Res., July 1, 2009; 37(suppl_2): W335 - W339. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Saito, M. Robert, H. Kochi, G. Matsuo, Y. Kakazu, T. Soga, and M. Tomita Metabolite Profiling Reveals YihU as a Novel Hydroxybutyrate Dehydrogenase for Alternative Succinic Semialdehyde Metabolism in Escherichia coli J. Biol. Chem., June 12, 2009; 284(24): 16442 - 16451. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. J. Nelson, J. P. Otis, S. L. Martin, and H. V. Carey Analysis of the hibernation cycle using LC-MS-based metabolomics in ground squirrel liver Physiol Genomics, March 3, 2009; 37(1): 43 - 51. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Kumar, B. C. Han, Z. Shi, J. Jia, Y. P. Wang, Y. T. Zhang, L. Liang, Q. F. Liu, Z. L. Ji, and Y. Z. Chen Update of KDBI: Kinetic Data of Bio-molecular Interaction database Nucleic Acids Res., January 1, 2009; 37(suppl_1): D636 - D641. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. G. Toyota, C. L. Berthold, A. Gruez, S. Jonsson, Y. Lindqvist, C. Cambillau, and N. G. J. Richards Differential Substrate Specificity and Kinetic Behavior of Escherichia coli YfdW and Oxalobacter formigenes Formyl Coenzyme A Transferase J. Bacteriol., April 1, 2008; 190(7): 2556 - 2564. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. J. Cornell, I. Alam, D. M. Soanes, H. M. Wong, C. Hedeler, N. W. Paton, M. Rattray, S. J. Hubbard, N. J. Talbot, and S. G. Oliver Comparative genome analysis across a kingdom of eukaryotic organisms: Specialization and diversification in the Fungi Genome Res., December 1, 2007; 17(12): 1809 - 1822. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Mu, R. F. Williams, C. J. Unkefer, P. J. Unkefer, J. R. Faeder, and W. S. Hlavacek Carbon-fate maps for metabolic reactions Bioinformatics, December 1, 2007; 23(23): 3193 - 3199. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Goffard and G. Weiller PathExpress: a web-based tool to identify relevant pathways in gene expression data Nucleic Acids Res., July 13, 2007; 35(suppl_2): W176 - W181. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Masoudi-Nejad, S. Goto, R. Jauregui, M. Ito, S. Kawashima, Y. Moriya, T. R. Endo, and M. Kanehisa EGENES: Transcriptome-Based Plant Database of Genes with Metabolic Pathway Information and Expressed Sequence Tag Indices in KEGG Plant Physiology, June 1, 2007; 144(2): 857 - 866. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Mu, P. J. Unkefer, C. J. Unkefer, and W. S. Hlavacek Prediction of oxidoreductase-catalyzed reactions based on atomic properties of metabolites Bioinformatics, December 15, 2006; 22(24): 3082 - 3088. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. A. Rahman and D. Schomburg Observing local and global properties of metabolic pathways: 'load points' and 'choke points' in the metabolic networks Bioinformatics, July 15, 2006; 22(14): 1767 - 1774. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Soga, R. Baran, M. Suematsu, Y. Ueno, S. Ikeda, T. Sakurakawa, Y. Kakazu, T. Ishikawa, M. Robert, T. Nishioka, et al. Differential Metabolomics Reveals Ophthalmic Acid as an Oxidative Stress Biomarker Indicating Hepatic Glutathione Consumption J. Biol. Chem., June 16, 2006; 281(24): 16768 - 16776. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Mahadevan, D. R. Bond, J. E. Butler, A. Esteve-Nunez, M. V. Coppi, B. O. Palsson, C. H. Schilling, and D. R. Lovley Characterization of Metabolism in the Fe(III)-Reducing Organism Geobacter sulfurreducens by Constraint-Based Modeling Appl. Envir. Microbiol., February 1, 2006; 72(2): 1558 - 1568. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Okuno, J. Yang, K. Taneishi, H. Yabuuchi, and G. Tsujimoto GLIDA: GPCR-ligand database for chemical genomic drug discovery Nucleic Acids Res., January 1, 2006; 34(suppl_1): D673 - D677. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Tokimatsu, N. Sakurai, H. Suzuki, H. Ohta, K. Nishitani, T. Koyama, T. Umezawa, N. Misawa, K. Saito, and D. Shibata KaPPA-View. A Web-Based Analysis Tool for Integration of Transcript and Metabolite Data on Plant Metabolic Pathway Maps Plant Physiology, July 1, 2005; 138(3): 1289 - 1300. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. O. Jonsdottir, F. S. Jorgensen, and S. Brunak Prediction methods and databases within chemoinformatics: emphasis on drugs and drug candidates Bioinformatics, May 15, 2005; 21(10): 2145 - 2160. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Ogata and J.-M. Claverie Metagrowth: a new resource for the building of metabolic hypotheses in microbiology Nucleic Acids Res., January 1, 2005; 33(suppl_1): D321 - D324. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. P. Wackett, A. G. Dodge, and L. B. M. Ellis Microbial Genomics and the Periodic Table Appl. Envir. Microbiol., February 1, 2004; 70(2): 647 - 655. [Full Text] [PDF] |
||||
![]() |
T. Noguchi and Y. Akiyama PDB-REPRDB: a database of representative protein chains from the Protein Data Bank (PDB) in 2003 Nucleic Acids Res., January 1, 2003; 31(1): 492 - 493. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Kanehisa, S. Goto, S. Kawashima, and A. Nakaya The KEGG databases at GenomeNet Nucleic Acids Res., January 1, 2002; 30(1): 42 - 46. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||









