Nucleic Acids Research, 2004, Vol. 32, Database issue D226-D229
© 2004 Oxford University Press
SCOP database in 2004: refinements integrate structure and sequence family data
MRC Centre for Protein Engineering and 3 MRC Laboratory of Molecular Biology, Hills Road, Cambridge CB2 2QH, UK, 1 Department of Plant and Microbial Biology, 461A Koshland Hall 3102, University of California, Berkeley, CA 94720-3102, USA and 2 Wellcome Trust Sanger Institute, Hinxton, Cambridge CB10 1SA, UK
*To whom correspondence should be addressed. Tel: +44 1223 402132; Fax: +44 1223 402140; Email: agm{at}mrc-lmb.cam.ac.uk
Received September 17, 2003; Accepted September 18, 2003
| ABSTRACT |
|---|
|
|
|---|
The Structural Classification of Proteins (SCOP) database is a comprehensive ordering of all proteins of known structure, according to their evolutionary and structural relationships. Protein domains in SCOP are hierarchically classified into families, superfamilies, folds and classes. The continual accumulation of sequence and structural data allows more rigorous analysis and provides important information for understanding the protein world and its evolutionary repertoire. SCOP participates in a project that aims to rationalize and integrate the data on proteins held in several sequence and structure databases. As part of this project, starting with release 1.63, we have initiated a refinement of the SCOP classification, which introduces a number of changes mostly at the levels below superfamily. The pending SCOP reclassification will be carried out gradually through a number of future releases. In addition to the expanded set of static links to external resources, available at the level of domain entries, we have started modernization of the interface capabilities of SCOP allowing more dynamic links with other databases. SCOP can be accessed at http://scop.mrc-lmb.cam.ac.uk/scop.
| BACKGROUND |
|---|
|
|
|---|
The SCOP (Structural Classification of Proteins) database is developed as an evolutionary classification, in which the main focus is to place the proteins in a coherent evolutionary framework, based on their conserved structural features. The database aims to provide a comprehensive and detailed description of the relationships between all proteins whose 3D structures have been determined. A fundamental unit of classification in the SCOP database is the protein domain. A domain is defined as an evolutionary unit observed in nature either in isolation or in more than one context in multidomain proteins. The protein domains are classified hierarchically into families, superfamilies, folds and classes, whose meaning has been discussed before (1,2).
An advantage of the SCOP database is that it embeds a theory of protein evolution as defined by human experts rather than by empirical rules implemented in a variety of bioinformatics algorithms and tools. Computational support in SCOP is used to extend the human ability to analyse and interpret the data and to make the invaluable knowledge of protein evolutionary repertoire broadly available to scientific researchers.
The first official SCOP release 9 years ago comprised 3179 protein domains grouped into 498 families, 366 superfamilies and 279 folds (1). The seven main classes in the latest release (1.65) contain 40 452 domains organized into 2327 families, 1294 superfamilies and 800 folds. These domains correspond to 20 619 entries in the Protein Data Bank (PDB) (3,4) and one literature reference to a structure with unpublished coordinates. Statistics of the current and previous releases, summaries and full histories of changes and other information are available from the SCOP website (http://scop.mrc-lmb. cam.ac.uk/scop/) together with parsable files encoding all SCOP data (5). The sequences and structures of SCOP domains are available from the ASTRAL compendium (6), and hidden Markov models of SCOP domains are available from the SUPERFAMILY database (7).
Here we present further improvements and new features implemented in SCOP since the previous update (5). Starting with release 1.63, large parts of the SCOP classification are being reorganized to facilitate the integration of structural classification with the contemporary sequence and functional classification schemes. On the top levels of the SCOP hierarchy these changes will affect only a small number of entries (
20 folds and superfamilies in SCOP have been reclassified so far). The more substantial but not so apparent rearrangements are being carried out at the lower levels and are aimed at the refinement of relationships amongst proteins and protein families. Major changes introduced in SCOP 1.63 and 1.65 are described in more detail below.
| RECLASSIFICATION |
|---|
|
|
|---|
The dynamic nature of SCOP is one of its main features and needs to be taken into account in applications that use the SCOP database. The continual accumulation of sequence and structural data nowadays allows more rigorous analysis and provides important information for understanding the protein world and its evolutionary repertoire. If there is new evidence about protein relationships, then this may result in a redefinition of domain boundaries and/or rearrangements of nodes in the SCOP hierarchy. A typical example is when a part of a large novel protein first classified as a single multidomain entry is subsequently observed as a stand-alone protein or in a combination of different domain types and therefore it is reclassified as a separate domain. Frequently two separately classified proteins are shown to be related through an intermediate, the structure of which has been determined more recently. The appearance of such proteins in the structural databases can help to identify more distant relationships between protein domains and thus can lead to a rearrangement that unifies distinct protein superfamilies.
Another factor influencing reclassification is integration with other databases. A project has started during the past year that aims to rationalize and integrate the SCOP information with the data about protein families housed by prominent sequence and structural databases, including InterPro (8), Pfam (9), CATH (10) and MSD (11). A milestone in this ambitious goal is the provision of stricter and more precise definitions behind the different classification schemes used in these different databases. In response to these requirements, starting with release 1.63, we have initiated a refinement of the SCOP classification that introduces a number of changes mostly at the levels below superfamily.
Membrane all-
proteins
One of the major rearrangements in the SCOP 1.63 release was a revision of the so-called Membrane all-alpha fold. Created in SCOP when there were a handful of known membrane protein structures, this fold listed protein domains classified solely on the basis of their secondary structural content without explicit consideration of their fold topologies. Prompted by the rapid progress of membrane protein crystallography, a comprehensive analysis of these domains has been undertaken and the membrane all-
proteins have been reclassified from scratch into 24 new or already existing folds in the SCOP database. Currently these protein folds are encompassed under more precise fold definitions based on the number of helices that span the membrane. New structural and probable evolutionary relationships have been discovered during the reclassification. The discovery of a new haem-binding fold is arguably the most interesting. This protein fold comprises four transmembrane helices arranged in an up-and-down bundle with the haem groups bound in between the helices. The haem-binding four-helical fold is observed in the structures of the cytochrome b subunit of the bovine cytochrome bc1 complex (1be3
[PDB]
:C) (12), the
subunit of Escherichia coli formate dehydrogenase N (1kqf
[PDB]
:C) (13) and in the transmembrane subunits of fumarate reductase respiratory complex (1qla
[PDB]
:C) (14). Three of the four haem ligands are conserved between the cytochrome bc1 complex and formate dehydrogenase N subunits and occupy structurally equivalent sites with the haem-binding modes of both proteins being very similar. These features considered in conjunction with good overall structural similarity of the four-helical domains could be interpreted as evidence for their common evolutionary origin. Currently these protein domains constitute a superfamily of transmembrane di-haem cytochromes.
Viral capsid and coat proteins
The former SCOP classification of viral capsid and coat proteins was based on the assumption that viruses co-evolved with their hosts. The protein domains of this fold were classified into a number of families according to the infected host. However, the increasing amount of available data on virus structures and genome sequences has caused a reassessment of the old classification concept. Mammalian picornaviruses (positive-stranded ssRNA viruses) for instance are morphologically and genetically very similar to small so-called Cricket paralysis-like viruses and to a number of plant viruses (15,16). Their coat proteins form similar heterooligomeric assemblies and display several conserved characteristic features in their folds. These similarities between mammalian and insect viruses extend to the post-processing of structural polyproteins. In SCOP release 1.65 these protein domains are grouped together and classified as belonging to the superfamily of positive-stranded ssRNA viruses. The reclassification of the viral capsid and coat protein fold results in four new superfamilies and 11 new families. The new classification explicitly follows the naming convention and virus taxonomy established by the International Committee on Taxonomy of Viruses (ICTV) (17). In addition to the internal reorganization, this protein fold was been merged with the former nucleoplasmin/PNGase F-like fold.
Antibody domains
Antibodies and their fragments are the largest group of homologous proteins of known structure. In SCOP, there are more than 2000 antibody domains organized previously in 228 separate species of variable domain combinations and 185 species of constant domain combinations. In SCOP release 1.65 all variable and constant domains have been reclassified according to their chain and source organism. The constant domains have been additionally sorted by their chain order. Our main goal was to provide a more comprehensive and systematic characterization of the structural repertoire of variable domainsa task that is not easy, having in mind the number of engineered antibody structures deposited in the PDB. In our analysis we excluded the 51 hybrid and artificial variable domains from the domain set and classified them as a separate engineered species. In order to identify different groups among antibody variable domains we performed a two-phase sequence clustering. First the sequences corresponding to the germline segments were clustered using a threshold of 85% identity for the inclusion of a protein sequence to the cluster set. Then the segments were sorted according to the size of the CDR1 and CDR2 regions. We anticipate that the resulting clusters might correspond to the putative germline families in the species genomes.
E-set domains
The E-set domains are presumed to be early domains of the immunoglobulin-like fold and may be the evolutionary link between the immunoglobulin and fibronectin type III domain superfamilies. In release 1.63 the former E-set domains family was taken out of the immunoglobulin superfamily in SCOP and transformed into a superfamily. The constituent domains were reorganized into 15 new families. The C-terminal domain of mollusc haemocyanin sharing only a partial structural similarity with the immunoglobulin-like domain of arthropod haemocyanin was reclassified into a new fold.
Protein kinases
In SCOP release 1.65 the related catalytic domains of Tyr and Thr/Ser kinases were merged into a single family of protein kinases. In fact their close relationship was confirmed by the structure determination of type I TGF-ß receptor R4, a Thr/Ser protein kinase that is more similar in sequence to Tyr kinases than to the other Thr/Ser kinases (18). Even though the catalytic domains of protein kinases are very similar, there are certain motifs that can be identified in their sequences and used to characterize the functional properties of each distinct kinase. We used these specific features to assign all protein kinase domains of known structure to the major groups, defined by the substrate specificity and/or mode of regulation, and then by functional subfamilies (19). For each protein kinase, SCOP now provides a detailed description in the annotation field. This field is searchable and allows users to extract a protein set of particular interest.
Non-coordinate entries
Early SCOP releases provided classification of dozens of protein structures published in the literature but not available at the time from PDB. Classified as literature references, these structures were sole representatives of their protein families at the time. After adoption of the policy of linking the publication of structure with the obligatory release of coordinates by most of the scientific journals, classification of new non-coordinate entries in SCOP was discontinued, and their number gradually decreased. Twenty-seven of the 28 remaining proteins in SCOP 1.63 were found by a recent inspection to have closely related representative structures in PDB and were made obsolete. In the latest release, there is just one literature reference (20) representing a unique superfamily.
| TECHNICAL DEVELOPMENTS |
|---|
|
|
|---|
As part of the database integration project we have started to modernize the interface capabilities of SCOP and to link the databases dynamically. An initial step suggested by MSD was to implement an on-demand server of SCOP domain definitions. This is intended to avoid synchronization problems arising from the different release schedules of the various databases. It is based on Simple Object Access Protocol (SOAP) technology and is currently used by the Pfam team to display comparisons of domains in the CATH, Pfam and SCOP databases. Further developments are expected and will be made available to other interested parties.
| CONCLUDING REMARKS |
|---|
|
|
|---|
From the beginning, the main focus of SCOP was on the probable evolutionary relationships between proteins that were undetectable by sequence comparison methods. The hierarchically organized structural data played a major part in the development of contemporary sequence-based methods with improved sensitivity. These methods allowed clustering of the multitude of known and hypothetical proteins in the sequence databases in a relatively small number of protein sequence families. The availability of complete genome sequences allowed the exploration of evolutionary and structural repertoires of different organisms and the refinement of their phylogeny. A large fraction of the protein families of unknown structure can be assigned with confidence into the existing SCOP superfamilies. The continual accumulation of structure, sequence and genome data will allow SCOP and related databases to play an increasingly effective role in the integration of these data.
| ACKNOWLEDGEMENTS |
|---|
We acknowledge Dr Loredana Lo Contes contribution to maintenance and development of the SCOP database. This work was supported by the MRC strategic grant G0100305.
| REFERENCES |
|---|
|
|
|---|
- Murzin,A., Brenner,S.E., Hubbard,T.J.P. and Chothia,C. (1995) SCOP: a Structural Classification of Proteins database for the investigation of sequences and structures. J. Mol. Biol., 247, 536540.[CrossRef][Web of Science][Medline]
- Brenner,S.E., Chothia,C., Hubbard,T.J.P. and Murzin,A. (1996) Understanding protein structure: using SCOP for fold interpretation. Methods Enzymol., 266, 635643.[CrossRef][Web of Science][Medline]
- Bernstein,F.C., Koetzle,T.F., Williams,G.J.B., Meyer,E.F., Brice,M.D., Rodgers,J.R., Kennard,O., Shimanouchi,T. and Tasumi,M. (1977) The Protein Data Bank: a computer-based archival file for macromolecular structures. J. Mol. Biol., 112, 535542.[Web of Science][Medline]
- Westbrook,J., Feng,Z., Jain,S., Bhat,T.N., Thanki,N., Ravichandran,V., Gilliland,G.L., Bluhm,W., Weissig,H., Greer,D.S. et al. (2002) The Protein Data Bank: unifying the archive. Nucleic Acids Res., 30, 245248.
[Abstract/Free Full Text] - Lo Conte,L., Brenner,S.E., Hubbard,T.J.P., Chothia,C. and Murzin,A.G. (2002) SCOP database in 2002: refinements accommodate structural genomics. Nucleic Acids Res., 30, 264267.
[Abstract/Free Full Text] - Chandonia,J.-M., Hon,G., Walker,N.S., Lo Conte,L., Koehl,P., Levitt,M. and Brenner,S.E. (2004) The ASTRAL compendium in 2004. Nucleic Acids Res., 32, D189D192.
[Abstract/Free Full Text] - Gough,J., Karplus,K., Hughey,R. and Chothia,C. (2001) Assignment of homology to genome sequences using a library of hidden Markov models that represent all proteins of known structure. J. Mol. Biol., 313, 903919.[CrossRef][Web of Science][Medline]
- Mulder,N.J., Apweiler,R., Attwood,T.K., Bairoch,A., Barrell,D., Bateman,A., Binns,D., Biswas,M., Bradley,P., Bork,P. et al. (2003) The InterPro Database, 2003 brings increased coverage and new features. Nucleic Acids Res., 31, 315318.
[Abstract/Free Full Text] - Bateman,A., Coin,L., Durbin,R., Finn,R.D., Hollich,V., Griffith-Jones,S., Khanna,A., Marshall,M., Moxon,S., Sonnhammer,E.L.L. et al. (2004) The Pfam protein families database. Nucleic Acids Res., 32, D138D141.
[Abstract/Free Full Text] - Orengo,C.A., Michie,A.D., Jones,S., Jones,D.T., Swindells,M.B. and Thornton,J.M. (1997) CATH: A hierarchic classification of protein domain structures. Structure, 5, 10931108.[Medline]
- Golovin,A., Oldfield,T.J., Tate,J.G., Velankar,S., Barton,G.J., Boutselakis,H., Dimitropoulos,D., Fillon,J., Hussain,A., Ionides,J.M.C. et al. (2004) E-MSD: an integrated resourse for bioinformatics. Nucleic Acids Res., 32, D211D214.
[Abstract/Free Full Text] - Iwata,S., Lee,J.W., Okada,K., Lee,J.K., Iwata,M., Rasmussen,B., Link,T.A., Ramaswamy,S. and Jap,B.K. (1998) Complete structure of the 11-subunit bovine mitochondrial cytochrome bc1 complex. Science, 281, 6471.
[Abstract/Free Full Text] - Jormakka,M., Tornroth,S., Byrne,B. and Iwata,S. (2002) Molecular basis of proton motive force generation: structure of formate dehydrogenase-N. Science, 295, 18631868.
[Abstract/Free Full Text] - Lancaster,C.R., Kroger,A., Auer,M. and Michel,H. (1999) Structure of fumarate reductase from Wolinella succinogenes at 2.2 Å resolution. Nature, 402, 377385.[CrossRef][Medline]
- Liljas,L., Tate,J., Lin,T., Christian,P. and Johnson,J.E. (2002) Evolutionary and taxonomic implications of conserved structural motifs between picornaviruses and insect picorna-like viruses. Arch. Virol., 147, 5984.[CrossRef][Web of Science][Medline]
- Chandrasekar,V. and Johnson,J.E. (1998) The structure of tobacco ringspot virus: a link in the evolution of icosahedral capsids in the picornavirus superfamily. Structure, 6, 157171.[Medline]
- van Regenmortel,M.H.V., Fauquet,C.M., Bishop,D.H.L., Carstens,E.B., Estes,M.K., Lemon,S.M., Maniloff,J., Mayo,M.A., McGeoch,D.J., Pringle,C.R. et al. (Eds) (2000). Virus Taxonomy: the Seventh Report of the International Committee on Taxonomy of Viruses. Academic Press, San Diego, CA.
- Huse,M., Chen,Y.G., Massague,J. and Kuriyan,J. (1999) Crystal structure of the cytoplasmic domain of the type I TGF ß receptor in complex with FKBP12. Cell, 96, 425436.[CrossRef][Web of Science][Medline]
- Hanks,S.K. (2003) Genomic analysis of the eukaryotic protein kinase superfamily: a perspective. Genome Biol., 4, 111.[CrossRef][Medline]
- Hoess,A., Watson,S., Siber,G.R. and Liddington,R. (1993) Crystal structure of an endotoxin-neutralizing protein from the horseshoe crab, Limulus anti-LPS factor, at 1.5 Å resolution. EMBO J., 12, 33513356.[Web of Science][Medline]
This article has been cited by other articles:
![]() |
C.-C. Chen, C.-Y. Lin, Y.-S. Lo, and J.-M. Yang PPISearch: a web server for searching homologous protein-protein interactions across multiple species Nucleic Acids Res., July 1, 2009; 37(suppl_2): W369 - W375. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Margraf, G. Schenk, and A. E. Torda The SALAMI protein structure search server Nucleic Acids Res., July 1, 2009; 37(suppl_2): W480 - W484. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Bjorkholm, P. Daniluk, A. Kryshtafovych, K. Fidelis, R. Andersson, and T. R. Hvidsten Using multi-data hidden Markov models trained on local neighborhoods of protein structure to predict residue-residue contacts Bioinformatics, May 15, 2009; 25(10): 1264 - 1270. [Abstract] [Full Text] [PDF] |
||||
![]() |
K.-i. Cho, D. Kim, and D. Lee A feature-based approach to modeling protein-protein interaction hot spots Nucleic Acids Res., May 1, 2009; 37(8): 2672 - 2687. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. A Reeves, D. Talavera, and J. M Thornton Genome and proteome annotation: organization, interpretation and integration J R Soc Interface, February 6, 2009; 6(31): 129 - 147. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Baumbach, A. Tauch, and S. Rahmann Towards the integrated analysis, visualization and reconstruction of microbial gene regulatory networks Brief Bioinform, January 1, 2009; 10(1): 75 - 83. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. A. Bauer, S. Gunther, D. Jansen, C. Heeger, P. F. Thaben, and R. Preissner SuperSite: dictionary of metabolite and drug binding sites in proteins Nucleic Acids Res., January 1, 2009; 37(suppl_1): D195 - D200. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Sirocco and S. C. E. Tosatto TESE: generating specific protein structure test set ensembles Bioinformatics, November 15, 2008; 24(22): 2632 - 2633. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. C. Lee, M. L. Reniere, E. P. Skaar, and M. E. P. Murphy Ruffling of Metalloporphyrins Bound to IsdG and IsdI, Two Heme-degrading Enzymes in Staphylococcus aureus J. Biol. Chem., November 7, 2008; 283(45): 30957 - 30963. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Araiso, S. Palioura, R. Ishitani, R. L. Sherrer, P. O'Donoghue, J. Yuan, H. Oshikane, N. Domae, J. DeFranco, D. Soll, et al. Structural insights into RNA-dependent eukaryal and archaeal selenocysteine formation Nucleic Acids Res., March 27, 2008; 36(4): 1187 - 1199. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Shu, T. Zhou, and S. Hovmoller Prediction of zinc-binding sites in proteins from sequence Bioinformatics, March 15, 2008; 24(6): 775 - 782. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. D. Tinoco, C. W. Peterson, B. Lucchese, R. P. Doyle, and A. M. Valentine On the evolutionary significance and metal-binding characteristics of a monolobal transferrin from Ciona intestinalis PNAS, March 4, 2008; 105(9): 3268 - 3273. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. R Kensche, V. van Noort, B. E Dutilh, and M. A Huynen Practical and theoretical advances in predicting the function of a protein by its phylogenetic distribution J R Soc Interface, February 6, 2008; 5(19): 151 - 170. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Birzele, G. Csaba, and R. Zimmer Alternative splicing and protein structure evolution Nucleic Acids Res., February 2, 2008; 36(2): 550 - 558. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. A. Adams, Y. Luo, B. Hove-Jensen, S.-M. He, L. M. van Staalduinen, D. L. Zechel, and Z. Jia Crystal Structure of PhnH: an Essential Component of Carbon-Phosphorus Lyase in Escherichia coli J. Bacteriol., February 1, 2008; 190(3): 1072 - 1083. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Cheng, B.-H. Kim, and N. V. Grishin MALISAM: a database of structurally analogous motifs in proteins Nucleic Acids Res., January 11, 2008; 36(suppl_1): D211 - D217. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Birzele, J. E. Gewehr, and R. Zimmer AutoPSI: a database for automatic structural classification of protein sequences and structures Nucleic Acids Res., January 11, 2008; 36(suppl_1): D398 - D401. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Heger, E. Korpelainen, T. Hupponen, K. Mattila, V. Ollikainen, and L. Holm PairsDB atlas of protein sequence space Nucleic Acids Res., January 11, 2008; 36(suppl_1): D276 - D280. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Andreeva, D. Howorth, J.-M. Chandonia, S. E. Brenner, T. J. P. Hubbard, C. Chothia, and A. G. Murzin Data growth and its impact on the SCOP database: new developments Nucleic Acids Res., January 11, 2008; 36(suppl_1): D419 - D425. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. D. Finn, J. Tate, J. Mistry, P. C. Coggill, S. J. Sammut, H.-R. Hotz, G. Ceric, K. Forslund, S. R. Eddy, E. L. L. Sonnhammer, et al. The Pfam protein families database Nucleic Acids Res., January 11, 2008; 36(suppl_1): D281 - D288. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Birzele, R. Kuffner, F. Meier, F. Oefinger, C. Potthast, and R. Zimmer ProSAS: a database for analyzing alternative splicing in the context of protein structures Nucleic Acids Res., January 1, 2008; 36(suppl_1): D63 - D68. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. D. Rawlings, F. R. Morton, C. Y. Kok, J. Kong, and A. J. Barrett MEROPS: the peptidase database Nucleic Acids Res., January 1, 2008; 36(suppl_1): D320 - D325. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Hollich and E. L.L. Sonnhammer PfamAlyzer: domain-centric homology search Bioinformatics, December 15, 2007; 23(24): 3382 - 3383. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. R. Tabita, T. E. Hanson, H. Li, S. Satagopan, J. Singh, and S. Chan Function, Structure, and Evolution of the RubisCO-Like Proteins and Their RubisCO Homologs Microbiol. Mol. Biol. Rev., December 1, 2007; 71(4): 576 - 599. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. S. Domingues, J. Rahnenfuhrer, and T. Lengauer Conformational analysis of alternative protein structures Bioinformatics, December 1, 2007; 23(23): 3131 - 3138. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Wang, L. S. Yafremava, D. Caetano-Anolles, J. E. Mittenthal, and G. Caetano-Anolles Reductive evolution of architectural repertoires in proteomes and the birth of the tripartite world Genome Res., November 1, 2007; 17(11): 1572 - 1585. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Chen and L. Kurgan PFRES: protein fold classification by using evolutionary information and predicted secondary structure Bioinformatics, November 1, 2007; 23(21): 2843 - 2850. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Huhne, F.-T. Koch, and J. Suhnel A comparative view at comprehensive information resources on three-dimensional structures of biological macro-molecules Brief Funct Genomic Proteomic, October 23, 2007; (2007) elm020v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Heger, S. Mallick, C. Wilton, and L. Holm The global trace graph, a novel paradigm for searching protein sequence databases Bioinformatics, September 15, 2007; 23(18): 2361 - 2367. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Fukuda, S. Kawai, and K. Murata NADP(H) Phosphatase Activities of Archaeal Inositol Monophosphatase and Eubacterial 3'-Phosphoadenosine 5'-Phosphate Phosphatase Appl. Envir. Microbiol., September 1, 2007; 73(17): 5447 - 5452. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Guharoy and P. Chakrabarti Secondary structure based analysis and classification of biological interfaces: identification of binding motifs in protein protein interactions Bioinformatics, August 1, 2007; 23(15): 1909 - 1918. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. A. Marti-Renom, U. Pieper, M. S. Madhusudhan, A. Rossi, N. Eswar, F. P. Davis, F. Al-Shahrour, J. Dopazo, and A. Sali DBAli tools: mining the protein structure space Nucleic Acids Res., July 13, 2007; 35(suppl_2): W393 - W397. [Abstract] [Full Text] [PDF] |
||||
![]() |
C.-H. Tung and J.-M. Yang fastSCOP: a fast web server for recognizing protein structural domains and SCOP superfamilies Nucleic Acids Res., July 13, 2007; 35(suppl_2): W438 - W443. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. I. Sadreyev, M. Tang, B.-H. Kim, and N. V. Grishin COMPASS server for remote homology inference Nucleic Acids Res., July 13, 2007; 35(suppl_2): W653 - W658. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y.-C. Chen, Y.-S. Lo, W.-C. Hsu, and J.-M. Yang 3D-partner: a web server to infer interacting partners and binding models Nucleic Acids Res., July 13, 2007; 35(suppl_2): W561 - W567. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Ferre, Y. Ponty, W. A. Lorenz, and P. Clote DIAL: a web server for the pairwise alignment of two RNA three-dimensional structures using nucleotide, dihedral angle and base-pairing similarities Nucleic Acids Res., July 13, 2007; 35(suppl_2): W659 - W668. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Lerman and B. E. Shakhnovich Defining functional distance using manifold embeddings of gene ontology annotations PNAS, July 3, 2007; 104(27): 11334 - 11339. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Lopez-Vinas, A. Bentebibel, C. Gurunathan, M. Morillas, D. de Arriaga, D. Serra, G. Asins, F. G. Hegardt, and P. Gomez-Puertas Definition by Functional and Structural Analysis of Two Malonyl-CoA Sites in Carnitine Palmitoyltransferase 1A J. Biol. Chem., June 22, 2007; 282(25): 18212 - 18224. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Shi, Y. Zhong, I. Majumdar, S. Sri Krishna, and N. V. Grishin Searching for three-dimensional secondary structural patterns in proteins with ProSMoS Bioinformatics, June 1, 2007; 23(11): 1331 - 1338. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Iwaki, T. Muraki, S. Ishihara, Y. Hasegawa, K. N. Rankin, T. Sulea, J. Boyd, and P. C. K. Lau Characterization of a Pseudomonad 2-Nitrobenzoate Nitroreductase and Its Catabolic Pathway-Associated 2-Hydroxylaminobenzoate Mutase and a Chemoreceptor Involved in 2-Nitrobenzoate Chemotaxis J. Bacteriol., May 1, 2007; 189(9): 3502 - 3514. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Y. Niv, D. R. Ripoll, J. A. Vila, A. Liwo, E. S. Vanamee, A. K. Aggarwal, H. Weinstein, and H. A. Scheraga Topology of Type II REases revisited; structural classes and the common conserved core Nucleic Acids Res., April 1, 2007; 35(7): 2227 - 2237. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Yu, P.-A. Genest, B. ter Riet, K. Sweeney, C. DiPaolo, R. Kieft, E. Christodoulou, A. Perrakis, J. M. Simmons, R. P. Hausinger, et al. The protein that binds to DNA base J in trypanosomatids has features of a thymidine hydroxylase Nucleic Acids Res., April 1, 2007; 35(7): 2107 - 2115. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Dalhus, I. H. Helle, P. H. Backe, I. Alseth, T. Rognes, M. Bjoras, and J. K. Laerdahl Structural insight into repair of alkylated DNA by a new superfamily of DNA glycosylases comprising HEAT-like repeats Nucleic Acids Res., April 1, 2007; 35(7): 2451 - 2459. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Bateman and R. D. Finn SCOOP: a simple method for identification of novel protein superfamily relationships Bioinformatics, April 1, 2007; 23(7): 809 - 814. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Fariselli, I. Rossi, E. Capriotti, and R. Casadio The WWWH of remote homolog detection: The state of the art Brief Bioinform, March 1, 2007; 8(2): 78 - 87. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. J. Cockell, B. Oliva, and R. M. Jackson Structure-based evaluation of in silico predictions of protein protein interactions using Comparative Docking Bioinformatics, March 1, 2007; 23(5): 573 - 581. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Levitt Growth of novel protein structural data PNAS, February 27, 2007; 104(9): 3183 - 3188. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. J. Suhrer, M. Wiederstein, and M. J. Sippl QSCOP--SCOP quantified by structural relationships Bioinformatics, February 15, 2007; 23(4): 513 - 514. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Rueda, C. Ferrer-Costa, T. Meyer, A. Perez, J. Camps, A. Hospital, J. L. Gelpi, and M. Orozco A consensus view of protein dynamics PNAS, January 16, 2007; 104(3): 796 - 801. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. R. Jefferson, T. P. Walsh, T. J. Roberts, and G. J. Barton SNAPPI-DB: a database and API of Structures, iNterfaces and Alignments for Protein-Protein Interactions Nucleic Acids Res., January 12, 2007; 35(suppl_1): D580 - D589. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. J. Kundrotas and E. Alexov PROTCOM: searchable database of protein complexes enhanced with domain-domain structures Nucleic Acids Res., January 12, 2007; 35(suppl_1): D575 - D579. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Wilson, M. Madera, C. Vogel, C. Chothia, and J. Gough The SUPERFAMILY database in 2007: families and functions Nucleic Acids Res., January 12, 2007; 35(suppl_1): D308 - D313. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Sonego, M. Pacurar, S. Dhir, A. Kertesz-Farkas, A. Kocsor, Z. Gaspari, J. A.M. Leunissen, and S. Pongor A Protein Classification Benchmark collection for machine learning Nucleic Acids Res., January 12, 2007; 35(suppl_1): D232 - D236. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. F. Fulton, M. A. Bate, N. G. Faux, K. Mahmood, C. Betts, and A. M. Buckle Protein Folding Database (PFD 2.0): an online environment for the International Foldeomics Consortium Nucleic Acids Res., January 12, 2007; 35(suppl_1): D304 - D307. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Hollunder, M. Friedel, A. Beyer, C. T. Workman, and T. Wilhelm DASS: efficient discovery and p-value calculation of substructures in unordered data Bioinformatics, January 1, 2007; 23(1): 77 - 83. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Holmes and G. Jogl Crystal Structure of Inositol Phosphate Multikinase 2 and Implications for Substrate Specificity J. Biol. Chem., December 8, 2006; 281(49): 38109 - 38116. [Abstract] [Full Text] [PDF] |
||||
![]() |
Q. Xu, A. Canutescu, Z. Obradovic, and R. L. Dunbrack Jr ProtBuD: a database of biological unit structures of protein families and superfamilies Bioinformatics, December 1, 2006; 22(23): 2876 - 2882. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Kajan, A. Kertesz-Farkas, D. Franklin, N. Ivanova, A. Kocsor, and S. Pongor Application of a simple likelihood ratio approximant to protein sequence classification Bioinformatics, December 1, 2006; 22(23): 2865 - 2869. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Mackenzie, L. Pedersen, S. Arent, and A. Henriksen Controlling Electron Transfer in Acyl-CoA Oxidases and Dehydrogenases: A STRUCTURAL VIEW J. Biol. Chem., October 13, 2006; 281(41): 31012 - 31020. [Abstract] [Full Text] [PDF] |
||||
![]() |
I.-G. Choi and S.-H. Kim Evolution of protein structural classes and protein sequence families PNAS, September 19, 2006; 103(38): 14056 - 14061. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Espadaler, E. Querol, F. X. Aviles, and B. Oliva Identification of function-associated loop motifs and application to protein function prediction Bioinformatics, September 15, 2006; 22(18): 2237 - 2243. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Flannick, A. Novak, B. S. Srinivasan, H. H. McAdams, and S. Batzoglou Graemlin: General and robust alignment of multiple large interaction networks Genome Res., September 1, 2006; 16(9): 1169 - 1181. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Lin, L. Zhu, and D.-Y. Zhang An initial strategy for comparing proteins at the domain architecture level Bioinformatics, September 1, 2006; 22(17): 2081 - 2086. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Hasegawa, S. Fukuda, K. Shimokawa, S. Kondo, N. Maeda, and Y. Hayashizaki A RecA-mediated exon profiling method Nucleic Acids Res., August 8, 2006; 34(13): e97 - e97. [Abstract] [Full Text] [PDF] |
||||
![]() |
J.-M. Yang and C.-H. Tung Protein structure database search and evolutionary classification Nucleic Acids Res., August 2, 2006; 34(13): 3646 - 3659. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Yang, Y. Eyobo, L. A. Brand, D. Martynowski, D. Tomchick, E. Strauss, and H. Zhang Crystal Structure of a Type III Pantothenate Kinase: Insight into the Mechanism of an Essential Coenzyme A Biosynthetic Enzyme Universally Distributed in Bacteria. J. Bacteriol., August 1, 2006; 188(15): 5532 - 5540. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Bryson, V. Loux, R. Bossy, P. Nicolas, S. Chaillou, M. van de Guchte, S. Penaud, E. Maguin, M. Hoebeke, P. Bessieres, et al. AGMIAL: implementing an annotation strategy for prokaryote genomes as a distributed system Nucleic Acids Res., July 19, 2006; 34(12): 3533 - 3545. [Abstract] [Full Text] [PDF] |
||||
![]() |
H.-B. Shen and K.-C. Chou Ensemble classifier for protein fold pattern recognition Bioinformatics, July 15, 2006; 22(14): 1717 - 1722. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Soding, M. Remmert, and A. Biegert HHrep: de novo protein repeat detection and the origin of TIM barrels. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W137 - W142. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Pasek, J.-L. Risler, and P. Brezellec Gene fusion/fission is a major contributor to evolution of multi-domain bacterial proteins Bioinformatics, June 15, 2006; 22(12): 1418 - 1423. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||










