Skip Navigation

This Article
Right arrow Abstract Freely available
Right arrow Print PDF (152K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Fleischmann, A.
Right arrow Articles by Apweiler, R.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Fleischmann, A.
Right arrow Articles by Apweiler, R.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Nucleic Acids Research, 2004, Vol. 32, Database issue D434-D437
© 2004 Oxford University Press

IntEnz, the integrated relational enzyme database

Astrid Fleischmann*, Michael Darsow, Kirill Degtyarenko, Wolfgang Fleischmann, Sinéad Boyce1, Kristian B. Axelsen2, Amos Bairoch2, Dietmar Schomburg3, Keith F. Tipton1 and Rolf Apweiler

EMBL Outstation—European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK, 1 The University of Dublin, Trinity College, Dublin 2, Ireland, 2 Swiss Institute of Bioinformatics (SIB), Geneva, Switzerland and 3 Cologne University, Cologne, Germany

*To whom correspondence should be addressed. Tel: +44 1223 494444; Fax: +44 1223 494468; Email: Astrid.Fleischmann{at}ebi.ac.uk

Received August 15, 2003; Revised and Accepted October 20, 2003


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 CLASSIFICATION OF ENZYMES
 DATA IN IntEnz
 DATABASE DEVELOPMENT
 PUBLIC AND PRODUCTION DATABASE
 SEARCH TOOLS
 FUTURE DEVELOPMENT
 REFERENCES
 
IntEnz is the name for the Integrated relational Enzyme database and is the official version of the Enzyme Nomenclature. The Enzyme Nomenclature comprises recommendations of the Nomenclature Committee of the International Union of Bio chemistry and Molecular Biology (NC-IUBMB) on the nomenclature and classification of enzyme-catalysed reactions. IntEnz is supported by NC-IUBMB and contains enzyme data curated and approved by this committee. The database IntEnz is available at http://www.ebi.ac.uk/intenz.


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 CLASSIFICATION OF ENZYMES
 DATA IN IntEnz
 DATABASE DEVELOPMENT
 PUBLIC AND PRODUCTION DATABASE
 SEARCH TOOLS
 FUTURE DEVELOPMENT
 REFERENCES
 
The explosion of structural data that has resulted from gene-sequencing and proteomic studies has emphasized the need to relate structures to functions. It is important that these structural data are integrated with the functional data on catalytic proteins that are available from a number of sources, such as the Nomenclature Committee of the International Union of Biochemistry and Molecular Biology (NC-IUBMB) Enzyme List (1), ENZYME (2) and BRENDA (3) databases. The Enzyme List represents a well established and officially recognized functional classification system.

The Integrated relational Enzyme database IntEnz provides a complete, freely available database storing the most up-to-date version of the Enzyme List, which contains enzyme data curated and approved by the NC-IUBMB.

In addition tools have been developed to maintain the database and allow propagation of new and updated biochemical terminology, which has been integrated into the database. These tools are connected to the ChEBI database, which provides a definitive dictionary of compounds, to improve the quality of the IntEnz vocabulary. ChEBI stands for dictionary of Chemical Compounds of Biological Interest. The ChEBI database is also hosted at the European Bioinformatics Institute (EBI), but is still under development.

Enzyme data will be represented in a number of different ways, one of which is the official NC-IUBMB view, which has been developed and can be accessed at http://www.ebi.ac.uk/intenz.


    CLASSIFICATION OF ENZYMES
 TOP
 ABSTRACT
 INTRODUCTION
 CLASSIFICATION OF ENZYMES
 DATA IN IntEnz
 DATABASE DEVELOPMENT
 PUBLIC AND PRODUCTION DATABASE
 SEARCH TOOLS
 FUTURE DEVELOPMENT
 REFERENCES
 
The NC-IUBMB classification and nomenclature of enzymes, commonly known as the Enzyme List, differs from most biochemical classification systems in that it is based on function, i.e. the reaction catalysed, rather than structure. The basis of the system is the assignment of a specific numerical identifier, the EC number, which identifies the enzyme in terms of the reaction catalysed. The first digit represents the type of reaction catalysed. At present six such classes of reaction are recognized, as summarized in Table 1. The second digit of the EC number refers to the subclass, which generally contains information about the type of compound or group involved. The third digit, the sub-subclass, further specifies the nature of the reaction and the fourth digit is a serial number that is used to identify the individual enzyme within a sub-subclass. The classification of enzymes is described in detail elsewhere (4,5).


View this table:
[in this window]
[in a new window]
 
Table 1. The six reaction classes used in the enzyme list
 
Although several distinct proteins may catalyse the same reaction, they would all be ascribed the same EC number, since the Enzyme List is a system based upon the reaction catalysed. Information about structural diversity is obtained by linkage of enzymes in the List to related entries in other databases. The information for each enzyme within the Enzyme List is contained in seven distinct fields, as summarized in Table 2. There is also a glossary of compound names, which may be used to search the Enzyme List. The Enzyme List is maintained by Trinity College Dublin on behalf of the NC-IUBMB. Suggestions for new entries or for modifications to existing entries can be found on the IntEnz homepage.


View this table:
[in this window]
[in a new window]
 
Table 2. Fields used in the IUBMB Enzyme Lista
 

    DATA IN IntEnz
 TOP
 ABSTRACT
 INTRODUCTION
 CLASSIFICATION OF ENZYMES
 DATA IN IntEnz
 DATABASE DEVELOPMENT
 PUBLIC AND PRODUCTION DATABASE
 SEARCH TOOLS
 FUTURE DEVELOPMENT
 REFERENCES
 
IntEnz is supported by the NC-IUBMB and contains enzyme data that is curated and approved by the Nomenclature Committee. The relational database (IntEnz) implemented and supported by the EBI is the master copy of the Enzyme Nomenclature data. The goal is to create a single relational database containing enzyme data from three different sources.

(i) Trinity College Dublin (TCD) maintains on behalf of the NC-IUBMB the Enzyme List and is involved in many other aspects of biochemical nomenclature.

(ii) The Swiss Institute of Bioinformatics (SIB) produces an Enzyme Nomenclature database (ENZYME).

(iii) The University of Cologne produces an enzyme function database (BRENDA), which contains a large amount of information on substrates, products and inhibitors.

The EBI has already implemented a relational database version storing the most up-to-date version of the Enzyme List, which contains enzyme data curated and approved by NC-IUBMB. Figure 1 shows a screenshot of the classification of EC 1.1.1.51 [EC] .



View larger version (37K):
[in this window]
[in a new window]
 
Figure 1. Screenshot of the Enzyme List entry for EC 1.1.1.51 [EC] .

 

    DATABASE DEVELOPMENT
 TOP
 ABSTRACT
 INTRODUCTION
 CLASSIFICATION OF ENZYMES
 DATA IN IntEnz
 DATABASE DEVELOPMENT
 PUBLIC AND PRODUCTION DATABASE
 SEARCH TOOLS
 FUTURE DEVELOPMENT
 REFERENCES
 
The database is implemented using a relational database management system (ORACLE) and consists of development, production and public versions of the database.

Web servers build a bridge between the database server machines and the users on the internet. They accept user requests for data, interrogate the appropriate database, summarize the result in a form fit for human consumption and deliver it to the user’s internet browser.

The web application consists of a back- and a front-end. The back-end is responsible for database-related functionality (retrieving and updating) and processing of data. It is connected with the production and public database to allow access either for adding/modifying or for retrieving data. Representing data and providing an interface for user interaction are the fundamental features of the front-end. The interface is compatible with all common browsers (e.g. Microsoft Internet Explorer, Netscape Mozilla, OmniWeb, Safari). The front-end is implemented using HTML and JavaServerPages. The functionality of the back-end is mainly achieved using Java Servlets.


    PUBLIC AND PRODUCTION DATABASE
 TOP
 ABSTRACT
 INTRODUCTION
 CLASSIFICATION OF ENZYMES
 DATA IN IntEnz
 DATABASE DEVELOPMENT
 PUBLIC AND PRODUCTION DATABASE
 SEARCH TOOLS
 FUTURE DEVELOPMENT
 REFERENCES
 
The users of the web application are categorized into two groups: the general user and the IntEnz curator. The general user has access to the public database to retrieve information of interest, but will not be able to modify or add data. The IntEnz curator is able to access the production database to perform changes to the data.

The reasons for the separation of production data and publicly accessible data are as follows.

(i) Security: only IntEnz curators access the data on the production database. The public access the public web server and the public database, which hold a copy of the production data.

(ii) Fail over: in emergencies and during scheduled maintenance sessions, one set of machines can serve both the production and public versions of the database, albeit at a reduced performance rate.

(iii) Technical: there are different usage patterns for production and public machines. Usage of the production machines can be split into interactive use during European office hours and bulk updates during the night, while the public machines have lower, but more constant access rates to serve the world wide web.

(iv) Scientific: new and updated data on the production machines needs to be kept separate until it is peer reviewed by the NC-IUBMB who are in editorial control of the data. Only then can it be copied to the public database and public web servers. This process is crucial to maintaining the consistency of the database through editorial control by the NC-IUBMB.

The building of such a robust database and tools environment automates database updating as much as possible. It also provides the general user and the IntEnz curator with stable search and curation tools.


    SEARCH TOOLS
 TOP
 ABSTRACT
 INTRODUCTION
 CLASSIFICATION OF ENZYMES
 DATA IN IntEnz
 DATABASE DEVELOPMENT
 PUBLIC AND PRODUCTION DATABASE
 SEARCH TOOLS
 FUTURE DEVELOPMENT
 REFERENCES
 
The database is provided with an advanced search tool that enables users to exploit the data contained in the database. The search tool permits the user to:

(i) limit searches to defined fields;

(ii) combine search terms or search results with Boolean operators;

(iii) use word expansion operators, e.g. the wildcard operator or the fuzzy operator.


    FUTURE DEVELOPMENT
 TOP
 ABSTRACT
 INTRODUCTION
 CLASSIFICATION OF ENZYMES
 DATA IN IntEnz
 DATABASE DEVELOPMENT
 PUBLIC AND PRODUCTION DATABASE
 SEARCH TOOLS
 FUTURE DEVELOPMENT
 REFERENCES
 
IntEnz will be further populated by corresponding data from the ENZYME database of the SIB and from the BRENDA database of the University of Cologne. The discrepancies between the equivalent data fields from these data sets are to be resolved as the next step towards a controlled vocabulary for biochemical terminology. These common data comprise the EC number, common name of an enzyme, systematic name of an enzyme, other name(s) of an enzyme, reaction catalysed and comments.

The representation of enzyme data will be done in a number of different ways. We already have a NC-IUBMB authorized view of the data. After resolving the discrepancies between the equivalent data fields we will also be able to retrieve a flat-file output of IntEnz, which will replace the ENZYME.


    ACKNOWLEDGEMENTS
 
We thank Nicola Mulder, Maria Krestyaninova, Sandra Orchard and Bob Vaughan who helped us with the annotation of entries. We are also grateful for the support from David Binns, Christian Gran, Henning Hermjakob, Alexander Kanapin, Rodrigo Lopez and Muruli Rao. The IntEnz project is supported by the BioBabel grant (no. QLRT-2000-00981) of the European Commission.


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 CLASSIFICATION OF ENZYMES
 DATA IN IntEnz
 DATABASE DEVELOPMENT
 PUBLIC AND PRODUCTION DATABASE
 SEARCH TOOLS
 FUTURE DEVELOPMENT
 REFERENCES
 

  1. IUBMB (1992) Enzyme Nomenclature: Recommendations (1992) of the Nomenclature Committee of the International Union of Biochemistry and Molecular Biology. Academic Press, San Diego, CA.

  2. Bairoch,A. (2000) The ENZYME database in 2000. Nucleic Acids Res., 28, 304–305.[Abstract/Free Full Text]

  3. Schomburg,I., Chang,A. and Schomburg,D. (2002) BRENDA, enzyme data and metabolic information. Nucleic Acids Res., 30, 47–49.[Abstract/Free Full Text]

  4. Boyce,S. and Tipton,K.F. (2000) History of the enzyme nomenclature system. Bioinformatics, 16, 34–40.[Abstract/Free Full Text]

  5. Tipton,K.F. and Boyce,S. (2000) Enzyme classification and nomenclature. In Nature Encyclopedia of Life Sciences. Nature Publishing Group, London. http://www.els.net/ [doi:10.1038/npg.els.0000710]


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Nucleic Acids ResHome page
B. Aranda, P. Achuthan, Y. Alam-Faruque, I. Armean, A. Bridge, C. Derow, M. Feuermann, A. T. Ghanbarian, S. Kerrien, J. Khadake, et al.
The IntAct molecular interaction database in 2010
Nucleic Acids Res., October 22, 2009; (2009) gkp878v1.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
J. Gao, L. B. M. Ellis, and L. P. Wackett
The University of Minnesota Biocatalysis/Biodegradation Database: improving public access
Nucleic Acids Res., September 18, 2009; (2009) gkp771v1.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
V. M. Markowitz, K. Mavromatis, N. N. Ivanova, I-M. A. Chen, K. Chu, and N. C. Kyrpides
IMG ER: a system for microbial genome annotation expert review and curation
Bioinformatics, September 1, 2009; 25(17): 2271 - 2278.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
D. Koua, L. Cerutti, L. Falquet, C. J. A. Sigrist, G. Theiler, N. Hulo, and C. Dunand
PeroxiBase: a database with new tools for peroxidase family classification
Nucleic Acids Res., January 1, 2009; 37(suppl_1): D261 - D266.
[Abstract] [Full Text] [PDF]


Home page
Mol. Cell. ProteomicsHome page
S. B. Quintaje and S. Orchard
The Annotation of Both Human and Mouse Kinomes in UniProtKB/Swiss-Prot: One Small Step in Manual Annotation, One Giant Leap for Full Comprehension of Genomes
Mol. Cell. Proteomics, August 1, 2008; 7(8): 1409 - 1419.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
S. Patient, D. Wieser, M. Kleen, E. Kretschmann, M. Jesus Martin, and R. Apweiler
UniProtJAPI: a remote API for accessing UniProt data
Bioinformatics, May 15, 2008; 24(10): 1321 - 1322.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
K. Degtyarenko, P. de Matos, M. Ennis, J. Hastings, M. Zbinden, A. McNaught, R. Alcantara, M. Darsow, M. Guedj, and M. Ashburner
ChEBI: a database and ontology for chemical entities of biological interest
Nucleic Acids Res., January 11, 2008; 36(suppl_1): D344 - D350.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
M. Y. Niv, D. R. Ripoll, J. A. Vila, A. Liwo, E. S. Vanamee, A. K. Aggarwal, H. Weinstein, and H. A. Scheraga
Topology of Type II REases revisited; structural classes and the common conserved core
Nucleic Acids Res., April 1, 2007; 35(7): 2227 - 2237.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
G. L. Holliday, D. E. Almonacid, G. J. Bartlett, N. M. O'Boyle, J. W. Torrance, P. Murray-Rust, J. B. O. Mitchell, and J. M. Thornton
MACiE (Mechanism, Annotation and Classification in Enzymes): novel tools for searching catalytic mechanisms
Nucleic Acids Res., January 12, 2007; 35(suppl_1): D515 - D520.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
C. Kanz, P. Aldebert, N. Althorpe, W. Baker, A. Baldwin, K. Bates, P. Browne, A. van den Broek, M. Castro, G. Cochrane, et al.
The EMBL Nucleotide Sequence Database
Nucleic Acids Res., January 1, 2005; 33(suppl_1): D29 - D33.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
C. Brooksbank, G. Cameron, and J. Thornton
The European Bioinformatics Institute's data resources: towards systems biology
Nucleic Acids Res., January 1, 2005; 33(suppl_1): D46 - D53.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
S. Velankar, P. McNeil, V. Mittard-Runte, A. Suarez, D. Barrell, R. Apweiler, and K. Henrick
E-MSD: an integrated data resource for bioinformatics
Nucleic Acids Res., January 1, 2005; 33(suppl_1): D262 - D265.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Print PDF (152K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Fleischmann, A.
Right arrow Articles by Apweiler, R.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Fleischmann, A.
Right arrow Articles by Apweiler, R.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?