Nucleic Acids Research, 2000, Vol. 28, No. 1 257-259
© 2000 Oxford University Press
SCOP: a Structural Classification of Proteins database
MRC Laboratory of Molecular Biology and ¹Centre for Protein Engineering, Hills Road, Cambridge CB2 2QH, UK, ²Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridgeshire CB10 1SA, UK and ³Department of Structural Biology, Stanford University, Stanford, CA 94305-5400, USA
Received October 12, 1999; Accepted October 13, 1999.
| ABSTRACT |
|---|
The Structural Classification of Proteins (SCOP) database provides a detailed and comprehensive description of the relationships of known protein structures. The classification is on hierarchical levels: the first two levels, family and superfamily, describe near and distant evolutionary relationships; the third, fold, describes geometrical relationships. The distinction between evolutionary relationships and those that arise from the physics and chemistry of proteins is a feature that is unique to this database so far. The sequences of proteins in SCOP provide the basis of the ASTRAL sequence libraries that can be used as a source of data to calibrate sequence search algorithms and for the generation of statistics on, or selections of, protein structures. Links can be made from SCOP to PDB-ISL: a library containing sequences homologous to proteins of known structure. Sequences of proteins of unknown structure can be matched to distantly related proteins of known structure by using pairwise sequence comparison methods to find homologues in PDB-ISL. The database and its associated files are freely accessible from a number of WWW sites mirrored from URL http://scop.mrc-lmb.cam.ac.uk/scop/
| INTRODUCTION |
|---|
At present (October, 1999) the Brookhaven Protein Databank (PDB) (1) contains nearly 10 000 protein entries and the number is increasing by ~200 a month. These proteins have structural similarities with other proteins and, in many cases, share a common evolutionary origin. To facilitate access to this information, we constructed the Structural Classification of Proteins (SCOP) database (2). It includes not only the proteins in the current version of the PDB, but many proteins for which there are published descriptions but whose co-ordinates are not yet available.
The classification of proteins in SCOP has been constructed by visual inspection and comparison of structures (3). Given the current limitations of purely automatic procedures, we believe this approach produces the most accurate and useful results. The unit of classification is usually the protein domain. Small proteins, and most of those of medium size, have a single domain and are, therefore, treated as a whole. The domains in large proteins are usually classified individually.
| CLASSIFICATION |
|---|
The classification of the proteins in SCOP is on hierarchical levels as follows:
Family. Proteins are clustered together into families on the basis of one of two criteria that imply their having a common evolutionary origin: first, all proteins that have residue identities of 30% and greater; second, proteins with lower sequence identities but whose functions and structures are very similar; for example, globins with sequence identities of 15%.
Superfamily. Families whose proteins have low sequence identities but whose structures and, in many cases, functional features suggest that a common evolutionary origin is probable, are placed together in superfamilies; for example, the variable and constant domains of immunoglobulins.
Common fold. Superfamilies and families are defined as having a common fold if their proteins have the same major secondary structures in the same arrangement and with the same topological connections. The structural similarities of proteins in the same fold category probably arise from the physics and chemistry of proteins favouring certain packing arrangements and chain topologies.
Class. The different folds have been grouped into classes. Most of the folds are assigned to one of the five structural classes:
1. all-α, those whose structure is essentially formed by α-helices;
2. all-β, those whose structure is essentially formed by β-sheets;
3. α/β, those with α-helices and β-strands;
4. α+β, those in which α-helices and β-strands are largely segregated;
5. multi-domain, those with domains of different fold and for which no homologues are known at present.
Other classes have been assigned for peptides, small proteins, theoretical models, nucleic acids and carbohydrates.
There are now a number of other databases which classify protein structures, such as CATH (4), FSSP (5), Entrez (6) and DDBASE (7). However, the distinction between evolutionary relationships and those that arise from the physics and chemistry of proteins is a feature that is so far unique to SCOP. Because functional similarity is implied by an evolutionary relationship but not necessarily by a physical relationship, we believe that this classification level is of considerable value, for example as a way of reliably linking very distant sequence families.
| ORGANISATION AND FACILITIES OF SCOP |
|---|
The SCOP database is available as a set of tightly coupled hypertext pages on the WWW via the URL: http://scop.mrc-lmb.cam.ac.uk/scop/. The interface to SCOP has been designed to facilitate both detailed searching of particular families and browsing of the whole database. To this end, there are a variety of different techniques for navigation:
Browsing through the SCOP hierarchy. SCOP is organised as a tree structure. Entering at the top of the hierarchy the user can navigate through the levels of Class, Fold, Superfamily, Family and Species to the leaves of the tree which are structural domains of individual PDB entries. An alternative hierarchy of Folds, Superfamilies and Families by the date of solution of the first representative structure is also provided.
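The tree described above can be pictured with a small data structure. The following is only an illustrative sketch, not the actual SCOP implementation: the `Node` class, the `insert_domain` and `leaves` helpers, and all names such as `fold-A` or `domain-1` are hypothetical placeholders introduced here.

```python
from dataclasses import dataclass, field

# Levels named in the text, ordered from the top of the tree to the leaves.
LEVELS = ["Class", "Fold", "Superfamily", "Family", "Species", "Domain"]


@dataclass
class Node:
    level: str
    name: str
    children: dict = field(default_factory=dict)

    def child(self, level, name):
        """Return the named child, creating it if it does not exist yet."""
        if name not in self.children:
            self.children[name] = Node(level, name)
        return self.children[name]


def insert_domain(root, path):
    """Insert one domain given its name at every level, Class first."""
    if len(path) != len(LEVELS):
        raise ValueError("path must name every level")
    node = root
    for level, name in zip(LEVELS, path):
        node = node.child(level, name)
    return node


def leaves(node):
    """Yield every leaf (structural domain) below the given node."""
    if not node.children:
        yield node
    else:
        for c in node.children.values():
            yield from leaves(c)


# Hypothetical placeholder entries, not real SCOP classifications.
root = Node("Root", "SCOP")
insert_domain(root, ["class-1", "fold-A", "superfamily-A1",
                     "family-A1a", "species-X", "domain-1"])
insert_domain(root, ["class-1", "fold-A", "superfamily-A1",
                     "family-A1b", "species-Y", "domain-2"])
print([leaf.name for leaf in leaves(root)])
```

Navigating from the root through `children` mirrors browsing from Class down to individual domains; two domains that share a superfamily but not a family diverge only at the Family level.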
From an amino acid sequence. The Sequence similarity search facility allows any sequence of interest to be searched against databases of protein sequences classified in SCOP using the algorithms BLAST (8), FASTA or SSEARCH (9). SCOP can then be entered from the list of PDB chains found to be similar and the similarity can be displayed visually.
From a keyword. The keyword search facility returns a list of SCOP pages containing the word entered or combinations of words separated by a series of boolean operators.
From a PDB identifier. The PDB entry viewer links PDB entries to various graphical views, external databases and SCOP itself.
By history. Pages are provided that order folds, superfamilies and families by date of entry into PDB or publication. This is both for interest and to make it easier to keep up to date with the appearance of new folds or significant new members of existing folds.
In addition to the information on structural and evolutionary relationships contained within SCOP, each entry (for which co-ordinates are available) has links to images of the structure, interactive molecular viewers, the atomic co-ordinates, data on functional conformational changes, sequence data and homologues and MEDLINE abstracts.
To facilitate rapid and effective access to SCOP, a number of mirrors have been established, a full current list of which can be found via the above URL. The facilities provided by the various sites are always the same, so you will lose nothing by accessing your nearest mirror. The implementation does differ: for example, sequence similarity searching is currently always carried out at the main site, scop.mrc-lmb.cam.ac.uk. This is transparent to the user, who is always returned a search results page marked up with links to pages on the mirror from which they started.
| OTHER USES OF SCOP |
|---|
Non-redundant sequence databases and the evaluation of sequence alignment methods
The clustering of sequences of protein chains of known structures at different levels of sequence similarity gives a series of non-redundant sequence databases known as PDB40, PDB90, PDB95 etc. (the number refers to maximum percentage sequence identity of any pair of sequences in the sequence databases) and these are available from SCOP. The current versions are produced by the ASTRAL procedure (10).
These databases contain large sets of sequences whose evolutionary relationships are known unambiguously and are, therefore, suitable test data in the calibration of sequence searching algorithms. They form the basis of a calibration of the pairwise sequence methods (11) and of methods that use multiple sequences (12). The particular databases used for these studies are available via the SCOP URL.
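The idea of a set in which no pair of sequences exceeds a given percentage identity can be sketched as a greedy filter. This is not the ASTRAL procedure (10), which is not described here; it is a minimal illustration that assumes the sequences are already aligned to equal length, and the function names and toy sequences are hypothetical.

```python
def percent_identity(a, b):
    """Percent of identical positions in two pre-aligned, equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be pre-aligned to equal length")
    if not a:
        return 0.0
    # Gap characters ('-') never count as matches.
    matches = sum(1 for x, y in zip(a, b) if x == y and x != "-")
    return 100.0 * matches / len(a)


def non_redundant(seqs, max_identity):
    """Greedily keep sequences so that no kept pair exceeds max_identity percent."""
    kept = []
    for s in seqs:
        if all(percent_identity(s, k) <= max_identity for k in kept):
            kept.append(s)
    return kept


# Toy sequences: the first two differ at one position out of ten (90% identity).
toy = ["ACDEFGHIKL", "ACDEFGHIKV", "MNPQRSTVWY"]
print(non_redundant(toy, 40))
print(non_redundant(toy, 95))
```

With a threshold of 40 the second toy sequence is dropped as redundant with the first, whereas a threshold of 95 keeps all three. The result of a greedy filter depends on input order, which is one reason a real procedure needs a rule for choosing representatives.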
Assignment of protein structures to sequences using the intermediate sequence library PDB-ISL
Two homologous sequences, which have diverged beyond the point where their homology can be recognised by a simple direct comparison, can be related through one or more other sequences that are suitably intermediate between the two. A library containing potential intermediate sequences for proteins of known structure (PDB-ISL) has been constructed (13) and can be accessed directly or through SCOP. The sequences in the library were collected from a large sequence database using the sequences of the domains of proteins of known structure as the query sequences and the program PSI-BLAST (14). Sequences of proteins of unknown structure can be matched to distantly related proteins of known structure by using pairwise sequence comparison methods to find homologues in PDB-ISL. For a given error rate the number of correct matches found is the same as that found using PSI-BLAST and a large sequence database. The advantage of this library is that, because it uses pairwise sequence comparison methods such as FASTA or BLAST, it can be searched easily and, in most cases, much more quickly (13).
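The principle of matching through an intermediate sequence can be illustrated as follows. This is a toy sketch under stated assumptions, not the PDB-ISL software: `library` is a hypothetical mapping from each intermediate sequence to the domain of known structure it was collected for, and `toy_match` is a stand-in for a real pairwise comparison such as FASTA or BLAST.

```python
def toy_match(a, b, threshold=50.0):
    """Stand-in for a pairwise search: True if equal-length sequences
    share at least `threshold` percent identical positions."""
    if len(a) != len(b) or not a:
        return False
    same = sum(1 for x, y in zip(a, b) if x == y)
    return 100.0 * same / len(a) >= threshold


def assign_structures(query, library, is_match):
    """Return the known-structure domains whose library sequences match the query."""
    hits = set()
    for inter_seq, domain in library.items():
        if is_match(query, inter_seq):
            hits.add(domain)
    return sorted(hits)


# Hypothetical toy data: the query does not match the structure's own
# sequence directly, but matches a sequence intermediate between the two.
structure_seq = "AAAAAAAAAA"
library = {
    "AAAAABBBBB": "domain-1",   # intermediate related to structure_seq
    "DDDDDDDDDD": "domain-2",   # unrelated library entry
}
query = "CCCCCBBBBB"

print(toy_match(query, structure_seq))
print(assign_structures(query, library, toy_match))
```

Here the direct comparison fails, yet the query is assigned to `domain-1` because both it and the structure's sequence are recognisably similar to the same intermediate. Each library lookup is a simple pairwise comparison, which is the property the text credits for the speed of searching.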
| ACKNOWLEDGEMENT |
|---|
AGM is grateful to the MRC for financial support.
| FOOTNOTES |
|---|
* To whom correspondence should be addressed. Tel: +44 1223 402010; Fax: +44 1223 213556; Email: loredana@mrc-lmb.cam.ac.uk
| REFERENCES |
|---|
1 Abola,E., Bernstein,F.C., Bryant,S.H., Koetzle,T.F. and Weng,J. (1987) In Allen,F.H., Bergerhoff,G. and Sievers,R. (eds), Crystallographic Databases – Information Content, Software Systems, Scientific Applications. Data Commission of the International Union of Crystallography, Bonn/Cambridge/Chester, pp. 107–132.
2 Murzin,A., Brenner,S.E., Hubbard,T.J.P. and Chothia,C. (1995) J. Mol. Biol., 247, 536–540.
3 Brenner,S.E., Chothia,C., Hubbard,T.J.P. and Murzin,A. (1995) Methods Enzymol., 266, 635–653.
4 Orengo,C.A., Michie,A.D., Jones,S., Jones,D.T., Swindells,M.B. and Thornton,J.M. (1997) Structure, 5, 1093–1108.
5 Holm,L. and Sander,C. (1994) Nucleic Acids Res., 22, 3600–3609.
6 Hogue,C., Ohkawa,H. and Bryant,S.H. (1996) Trends Biochem. Sci., 21, 226–229.
7 Sowdhamini,R., Rufino,S.D. and Blundell,T.L. (1996) Folding Des., 1, 209–220.
8 Altschul,S.F., Gish,W., Miller,W., Myers,E.W. and Lipman,D.J. (1990) J. Mol. Biol., 215, 403–410.
9 Pearson,W.R. (1996) Methods Enzymol., 266, 227–258.
10 Brenner,S.E., Koehl,P. and Levitt,M. (2000) Nucleic Acids Res., 28, 254–256 (this issue).
11 Brenner,S.E., Chothia,C. and Hubbard,T.J.P. (1998) Proc. Natl Acad. Sci. USA, 95, 6073–6078.
12 Park,J., Karplus,K., Barrett,C., Hughey,R., Haussler,D., Hubbard,T. and Chothia,C. (1998) J. Mol. Biol., 284, 1201–1210.
13 Teichmann,S.A., Chothia,C., Church,G.M. and Park,J. (2000) Bioinformatics, in press.
14 Altschul,S.F., Madden,T.L., Schaffer,A.A., Zhang,J.H., Zhang,Z., Miller,W. and Lipman,D.J. (1997) Nucleic Acids Res., 25, 3389–3402.