Nucleic Acids Research, Vol 27, Issue 1 275-279, Copyright © 1999 by Oxford University Press
CA Orengo, FM Pearl, JE Bray, AE Todd, AC Martin, L Lo Conte and JM Thornton
We report the latest release (version 1.4) of the CATH protein domains
database (http://www.biochem.ucl.ac.uk/bsm/cath). This is a hierarchical
classification of 13 359 protein domain structures into evolutionary
families and structural groupings. We currently identify 827 homologous
families in which the proteins have both structural similarity and sequence
and/or functional similarity. These can be further clustered into 593 fold
groups and 32 distinct architectures. Using our structural classification
and the associated data on protein function stored in the database (EC
identifiers, SWISS-PROT keywords and information from the Enzyme database
and the literature), we have been able to analyse the correlation between 3D
structure and function. More than 96% of folds in the PDB are associated
with a single homologous family. However, within the superfolds, three or
more different functions are observed. Considering enzyme functions, more
than 95% of clearly homologous families exhibit either single or closely
related functions, as demonstrated by the EC identifiers of their
relatives. Our analysis supports the view that determining structures, for
example as part of a 'structural genomics' initiative, will make a major
contribution to interpreting genome data.
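
As an illustration of the kind of count behind the statement that most folds are associated with a single homologous family, the following minimal sketch groups domains by fold and reports the fraction of folds containing exactly one family. The domain, fold and family identifiers are invented toy values, not CATH entries, and the function names are hypothetical; this is not the procedure used to build the database.

```python
from collections import defaultdict

# Hypothetical toy data: (domain_id, fold_id, homologous_family_id).
# These identifiers are invented for illustration and are not CATH entries.
DOMAINS = [
    ("d1", "fold_A", "fam_1"),
    ("d2", "fold_A", "fam_1"),
    ("d3", "fold_B", "fam_2"),
    ("d4", "fold_C", "fam_3"),
    ("d5", "fold_C", "fam_4"),
    ("d6", "fold_C", "fam_5"),
    ("d7", "fold_D", "fam_6"),
]


def families_per_fold(domains):
    """Map each fold to the set of homologous families observed in it."""
    fold_to_families = defaultdict(set)
    for _domain, fold, family in domains:
        fold_to_families[fold].add(family)
    return dict(fold_to_families)


def fraction_single_family(fold_to_families):
    """Fraction of folds that contain exactly one homologous family."""
    if not fold_to_families:
        return 0.0
    single = sum(1 for fams in fold_to_families.values() if len(fams) == 1)
    return single / len(fold_to_families)


fold_map = families_per_fold(DOMAINS)
print(f"{fraction_single_family(fold_map):.0%} of toy folds have a single family")
```

In this toy set, a fold such as `fold_C` that groups three families plays the role of a superfold, while the remaining folds each map to one family.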
ARTICLES
The CATH Database provides insights into protein structure/function relationships
Department of Biochemistry and Molecular Biology, Darwin Building, University College London, Gower Street, London WC1E 6BT, UK. orengo@biochem.ucl.ac.uk