Nucleic Acids Research, 2004, Vol. 32, Database issue D235-D239
© 2004 Oxford University Press
The SUPERFAMILY database in 2004: additions and improvements
MRC Laboratory of Molecular Biology, Hills Road, Cambridge CB2 2QH, UK, 1 Department of Structural Biology, School of Medicine, Stanford University, Stanford, CA 94305-5125, USA and 2 RIKEN Genomic Sciences Centre, W121 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama 230-0045, Japan
*To whom correspondence should be addressed. Tel: +44 1223 402479; Fax: +44 1223 213556; Email: mm238{at}mrc-lmb.cam.ac.uk
The SUPERFAMILY database provides structural assignments to protein sequences and a framework for analysis of the results. At the core of the database is a library of profile Hidden Markov Models that represent all proteins of known structure. The library is based on the SCOP classification of proteins: each model corresponds to a SCOP domain and aims to represent an entire superfamily. We have applied the library to predicted proteins from all completely sequenced genomes (currently 154), the Swiss-Prot and TrEMBL databases and other sequence collections. Close to 60% of all proteins have at least one match, and one half of all residues are covered by assignments. All models and full results are available for download and online browsing at http://supfam.org. Users can study the distribution of their superfamily of interest across all completely sequenced genomes, investigate with which other superfamilies it combines and retrieve proteins in which it occurs. Alternatively, concentrating on a particular genome as a whole, it is possible first, to find out its superfamily composition, and secondly, to compare it with that of other genomes to detect superfamilies that are over- or under-represented. In addition, the webserver provides the following standard services: sequence search; keyword search for genomes, superfamilies and sequence identifiers; and multiple alignment of genomic, PDB and custom sequences.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
X. Zhai, M. L. Malakhova, H. M. Pike, L. M. Benson, H. R. Bergen III, I. P. Sugar, L. Malinina, D. J. Patel, and R. E. Brown Glycolipid Acquisition by Human Glycolipid Transfer Protein Dramatically Alters Intrinsic Tryptophan Fluorescence: INSIGHTS INTO GLYCOLIPID BINDING AFFINITY J. Biol. Chem., May 15, 2009; 284(20): 13620 - 13628. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Niwa, B.-W. Ying, K. Saito, W. Jin, S. Takada, T. Ueda, and H. Taguchi Bimodal protein solubility distribution revealed by an aggregation analysis of the entire ensemble of Escherichia coli proteins PNAS, March 17, 2009; 106(11): 4201 - 4206. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Ruiz, Y. Coute, B. Sanchez, C. G. de los Reyes-Gavilan, J.-C. Sanchez, and A. Margolles The cell-envelope proteome of Bifidobacterium longum in an in vitro bile environment Microbiology, March 1, 2009; 155(3): 957 - 967. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Iqbal, A. A. Freitas, C. G. Johnson, and M. Vergassola Message-passing algorithms for the prediction of protein domain interactions from protein-protein interaction data Bioinformatics, September 15, 2008; 24(18): 2064 - 2070. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Brinkrolf, S. Ploger, S. Solle, I. Brune, S. S. Nentwich, A. T. Huser, J. Kalinowski, A. Puhler, and A. Tauch The LacI/GalR family transcriptional regulator UriR negatively controls uridine utilization of Corynebacterium glutamicum by binding to catabolite-responsive element (cre)-like sequences Microbiology, April 1, 2008; 154(4): 1068 - 1081. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Paffett-Lugassy, N. Hsia, P. G. Fraenkel, B. Paw, I. Leshinsky, B. Barut, N. Bahary, J. Caro, R. Handin, and L. I. Zon Functional conservation of erythropoietin signaling in zebrafish Blood, October 1, 2007; 110(7): 2718 - 2726. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Kong, Y. Zhang, Z.-Q. Ye, X.-Q. Liu, S.-Q. Zhao, L. Wei, and G. Gao CPC: assess the protein-coding potential of transcripts using sequence features and support vector machine Nucleic Acids Res., July 13, 2007; 35(suppl_2): W345 - W349. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Smits, J. A. M. Smeitink, L. P. van den Heuvel, M. A. Huynen, and T. J. G. Ettema Reconstructing the evolution of the mitochondrial ribosomal proteome Nucleic Acids Res., July 9, 2007; 35(14): 4686 - 4703. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Saveanu, J.-C. Rousselle, P. Lenormand, A. Namane, A. Jacquier, and M. Fromont-Racine The p21-Activated Protein Kinase Inhibitor Skb15 and Its Budding Yeast Homologue Are 60S Ribosome Assembly Factors Mol. Cell. Biol., April 15, 2007; 27(8): 2897 - 2909. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Brune, N. Jochmann, K. Brinkrolf, A. T. Huser, R. Gerstmeir, B. J. Eikmanns, J. Kalinowski, A. Puhler, and A. Tauch The IclR-Type Transcriptional Repressor LtbR Regulates the Expression of Leucine and Tryptophan Biosynthesis Genes in the Amino Acid Producer Corynebacterium glutamicum J. Bacteriol., April 1, 2007; 189(7): 2720 - 2733. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Sun and J. Buhler Designing patterns for profile HMM search Bioinformatics, January 15, 2007; 23(2): e36 - e43. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Wilson, M. Madera, C. Vogel, C. Chothia, and J. Gough The SUPERFAMILY database in 2007: families and functions Nucleic Acids Res., January 12, 2007; 35(suppl_1): D308 - D313. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Hollunder, M. Friedel, A. Beyer, C. T. Workman, and T. Wilhelm DASS: efficient discovery and p-value calculation of substructures in unordered data Bioinformatics, January 1, 2007; 23(1): 77 - 83. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. G. Muszynski, T. Dam, B. Li, D. M. Shirbroun, Z. Hou, E. Bruggemann, R. Archibald, E. V. Ananiev, and O. N. Danilevskaya delayed flowering1 Encodes a Basic Leucine Zipper Protein That Mediates Floral Inductive Signals at the Shoot Apex in Maize Plant Physiology, December 1, 2006; 142(4): 1523 - 1536. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Lin, L. Zhu, and D.-Y. Zhang An initial strategy for comparing proteins at the domain architecture level Bioinformatics, September 1, 2006; 22(17): 2081 - 2086. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Nolting and S. Poggeler A MADS Box Protein Interacts with a Mating-Type Protein and Is Required for Fruiting Body Development in the Homothallic Ascomycete Sordaria macrospora. Eukaryot. Cell, July 1, 2006; 5(7): 1043 - 1056. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Tangrot, L. Wang, B. Kagstrom, and U. H. Sauer FISH--family identification of sequence homologues using structure anchored hidden Markov models. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W10 - W14. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. A. Fodor and R. W. Aldrich Statistical Limits to the Identification of Ion Channel Domains by Sequence Similarity J. Gen. Physiol., May 30, 2006; 127(6): 755 - 766. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. A. Reeves, J. M. Thornton, and the BioSapiens Network of Excellence Integrating biological data through the genome Hum. Mol. Genet., April 15, 2006; 15(suppl_1): R81 - R87. [Abstract] [Full Text] [PDF] |
||||
![]() |
G.-S. Han, W.-I Wu, and G. M. Carman The Saccharomyces cerevisiae Lipin Homolog Is a Mg2+-dependent Phosphatidate Phosphatase Enzyme J. Biol. Chem., April 7, 2006; 281(14): 9210 - 9218. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. M. Babu, M. L. Priya, A. T. Selvan, M. Madera, J. Gough, L. Aravind, and K. Sankaran A Database of Bacterial Lipoproteins (DOLOP) with Functional Assignments to Predicted Lipoproteins. J. Bacteriol., April 1, 2006; 188(8): 2761 - 2773. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. L Marsden, J. A.G Ranea, A. Sillero, O. Redfern, C. Yeats, M. Maibaum, D. Lee, S. Addou, G. A Reeves, T. J Dallman, et al. Exploiting protein structure data to explore the evolution of protein function and biological complexity Phil Trans R Soc B, March 29, 2006; 361(1467): 425 - 440. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Fleming, L. A Kelley, S. A Islam, R. M MacCallum, A. Muller, F. Pazos, and M. J.E Sternberg The proteome: structure, function and evolution Phil Trans R Soc B, March 29, 2006; 361(1467): 441 - 451. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. L. Marsden, D. Lee, M. Maibaum, C. Yeats, and C. A. Orengo Comprehensive genome analysis of 203 genomes provides structural genomics with new insights into protein family space Nucleic Acids Res., February 15, 2006; 34(3): 1066 - 1080. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Riley, T. Abe, M. B. Arnaud, M. K.B. Berlyn, F. R. Blattner, R. R. Chaudhuri, J. D. Glasner, T. Horiuchi, I. M. Keseler, T. Kosuge, et al. Escherichia coli K-12: a cooperatively developed annotation snapshot--2005 Nucleic Acids Res., January 5, 2006; 34(1): 1 - 9. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. K. Kummerfeld and S. A. Teichmann DBD: a transcription factor prediction database Nucleic Acids Res., January 1, 2006; 34(suppl_1): D74 - D81. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. D. Finn, J. Mistry, B. Schuster-Bockler, S. Griffiths-Jones, V. Hollich, T. Lassmann, S. Moxon, M. Marshall, A. Khanna, R. Durbin, et al. Pfam: clans, web tools and services Nucleic Acids Res., January 1, 2006; 34(suppl_1): D247 - D251. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Yeats, M. Maibaum, R. Marsden, M. Dibley, D. Lee, S. Addou, and C. A. Orengo Gene3D: modelling protein structure, function and evolution Nucleic Acids Res., January 1, 2006; 34(suppl_1): D281 - D284. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Kouranov, L. Xie, J. de la Cruz, L. Chen, J. Westbrook, P. E. Bourne, and H. M. Berman The RCSB PDB information portal for structural genomics Nucleic Acids Res., January 1, 2006; 34(suppl_1): D302 - D305. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. S. Hinrichs, D. Karolchik, R. Baertsch, G. P. Barber, G. Bejerano, H. Clawson, M. Diekhans, T. S. Furey, R. A. Harte, F. Hsu, et al. The UCSC Genome Browser Database: update 2006 Nucleic Acids Res., January 1, 2006; 34(suppl_1): D590 - D598. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. A. Wilson, S. D. Forney, A. M. Ricci, E. G. Allen, K. L. Hefferon, and L. K. Miller Expression and Mutational Analysis of Autographa californica Nucleopolyhedrovirus HCF-1: Functional Requirements for Cysteine Residues J. Virol., November 15, 2005; 79(22): 13900 - 13914. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. J. Davison and N. D. Stow New Genes from Old: Redeployment of dUTPase by Herpesviruses J. Virol., October 15, 2005; 79(20): 12880 - 12892. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. A. Laskowski, J. D. Watson, and J. M. Thornton ProFunc: a server for predicting protein function from 3D structure Nucleic Acids Res., July 1, 2005; 33(suppl_2): W89 - W93. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. S. Moon, J. Bhak, K. H. Lee, and D. Lee Architecture of basic building blocks in protein and domain structural interaction networks Bioinformatics, April 15, 2005; 21(8): 1479 - 1486. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. B. Pereira-Leal and S. A. Teichmann Novel specificities emerge by stepwise duplication of functional modules Genome Res., April 1, 2005; 15(4): 552 - 559. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Tan, L. A. McCue, and G. D. Stormo Making connections between novel transcription factors and their DNA motifs Genome Res., February 1, 2005; 15(2): 312 - 320. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Pugalenthi, A. Bhaduri, and R. Sowdhamini GenDiS: Genomic Distribution of protein structural domain Superfamilies Nucleic Acids Res., January 1, 2005; 33(suppl_1): D252 - D255. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. C. Janga and G. Moreno-Hagelsieb Conservation of adjacency as evidence of paralogous operons Nucleic Acids Res., October 11, 2004; 32(18): 5392 - 5397. [Abstract] [Full Text] [PDF] |
||||














