Skip Navigation

This Article
Right arrow Abstract Freely available
Right arrow Print PDF (191K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Bateman, A.
Right arrow Articles by Eddy, S. R.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Bateman, A.
Right arrow Articles by Eddy, S. R.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Nucleic Acids Research, 2004, Vol. 32, Database issue D138-D141
© 2004 Oxford University Press

The Pfam protein families database

Alex Bateman*, Lachlan Coin, Richard Durbin, Robert D. Finn, Volker Hollich1, Sam Griffiths-Jones, Ajay Khanna2, Mhairi Marshall, Simon Moxon, Erik L. L. Sonnhammer1, David J. Studholme, Corin Yeats and Sean R. Eddy2

Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, 1 Center for Genomics and Bioinformatics, Karolinska Institutet, S-171 77 Stockholm, Sweden and 2 Howard Hughes Medical Institute and Department of Genetics, Washington University School of Medicine, St Louis, MO 63110, USA

*To whom correspondence should be addressed. Tel: +44 1223 494950; Fax: +44 1223 494919; Email: agb{at}sanger.ac.uk

Received September 17, 2003; Accepted October 20, 2003


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 GROWTH OF Pfam
 IMPROVED MODELLING OF DOMAINS...
 IMPROVED FAMILY MEMBERSHIP
 IMPROVED STRUCTURE IMAGES
 AVAILABILITY
 REFERENCES
 
Pfam is a large collection of protein families and domains. Over the past 2 years the number of families in Pfam has doubled and now stands at 6190 (version 10.0). Methodology improvements for searching the Pfam collection locally as well as via the web are described. Other recent innovations include modelling of discontinuous domains allowing Pfam domain definitions to be closer to those found in structure databases. Pfam is available on the web in the UK (http://www.sanger.ac.uk/Software/Pfam/), the USA (http://pfam.wustl.edu/), France (http://pfam.jouy.inra.fr/) and Sweden (http://Pfam.cgb.ki.se/).


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 GROWTH OF Pfam
 IMPROVED MODELLING OF DOMAINS...
 IMPROVED FAMILY MEMBERSHIP
 IMPROVED STRUCTURE IMAGES
 AVAILABILITY
 REFERENCES
 
Pfam is a comprehensive collection of protein domains and families, with a range of well-established uses including genome annotation. Each family in Pfam is represented by two multiple sequence alignments and two profile-Hidden Markov Models (profile-HMMs). The functionality, use and philosophy of Pfam have been discussed in previous publications (1,2) and will not be discussed at length here. In the following sections we describe the most important improvements that have been made to the database in the past 2 years.


    GROWTH OF Pfam
 TOP
 ABSTRACT
 INTRODUCTION
 GROWTH OF Pfam
 IMPROVED MODELLING OF DOMAINS...
 IMPROVED FAMILY MEMBERSHIP
 IMPROVED STRUCTURE IMAGES
 AVAILABILITY
 REFERENCES
 
As of release 10.0, Pfam contains 6190 Pfam families. Pfam families match 75% of protein sequences in Swiss-Prot and TrEMBL (3) (and 53% of all residues). This compares with 3071 families and 69% coverage at release 6.6, 2 years ago (2). For those protein sequences that do not belong to any Pfam family, we derive automatically generated Pfam-B families. The Pfam-B families are derived from ProDom (4), a comprehensive set of protein domain families automatically generated from the Swiss-Prot and TrEMBL sequence databases. Many multi-domain protein sequences contain (non-overlapping) matches to both Pfam and Pfam-B families. The combination of Pfam and Pfam-B covers 82% of protein sequences in Swiss-Prot and TrEMBL. Every Pfam release is now built on the latest versions of Swiss-Prot and TrEMBL minimizing problems with out-of-date sequence entries.

Pfam has two large series of functionally uncharacterized families, known as Domains of Unknown Function (DUFs) and Uncharacterized Protein Families (UPFs). DUFs are families that have been created by Pfam whereas UPFs are those created by Swiss-Prot and added to Pfam. Tracking the number of DUFs and UPFs gives us some idea of how many families in Pfam are uncharacterized and how this number has changed over time. As of Pfam 6.6 there were 272 DUF and UPF families out of a total of 3071. Pfam release 10.0 contains 1004 DUF and UPF families out of 6190. Eighty-nine of the original 272 have been annotated. Of these, 20 were merged with other families and 69 were annotated with a function. Hence, on average, around 37 new domains of unknown function are added to Pfam every month and six are annotated with a function. The proportion of DUF and UPF families in the Pfam database has increased from 9% to 16%. However, the number of DUF and UPF matches to Swiss-Prot compared with the number of hits from annotated families has increased only marginally over this period. This reflects the increasing tendency for completely undescribed families to be small and specific to a few genomes.

To cope with the increased computational burden that doubling the number of families and therefore profile-HMMs creates we have two innovations to aid users. First, HMMER, the freely available profile-HMM software, used to construct and search Pfam, has been upgraded to a version 2.3 lineage (the current release is 2.3.1, see http://hmmer.wustl.edu). The principal difference between HMMER 2.3 and previous versions is a 2- to 3-fold speed-up on most platforms because of performance optimizations, and ~8-fold on Mac OS/X thanks to code contributed by Erik Lindahl at Stanford University. These performance improvements accelerate Pfam searches, and help keep pace with the growing size of the database. Secondly, users can now carry out batch searches of up to 1000 sequences at a time on the UK web server, with results being returned by email. This service means that users with moderate requirements do not need to install a local copy of Pfam and HMMER.


    IMPROVED MODELLING OF DOMAINS IN Pfam
 TOP
 ABSTRACT
 INTRODUCTION
 GROWTH OF Pfam
 IMPROVED MODELLING OF DOMAINS...
 IMPROVED FAMILY MEMBERSHIP
 IMPROVED STRUCTURE IMAGES
 AVAILABILITY
 REFERENCES
 
Pfam aims to be a database of accurate protein domain definitions. In the past 2 years we have split many existing families into structural domains. Collaborations with the structural protein domain databases SCOP (5) and CATH (6) have enabled the development of a domain comparison tool to aid this process (see Fig. 1). This tool allows the relationship of the structural domain architecture defined by CATH and SCOP to be compared with each other and Pfam. Such comparisons help ensure consistency of domain definitions in the three independent databases and facilitate their linking at a common level. The domain comparison tool uses web services to retrieve the domain boundaries from CATH and SCOP on user request. These web services are maintained by each database ensuring up to date data and minimizing discrepancies between database versions. Web users can view structures marked up according to domain boundaries with Rasmol or RasWin (7).



View larger version (82K):
[in this window]
[in a new window]
 
Figure 1. Comparison of Pfam, SCOP and CATH domain definitions for S-adenosylmethionine synthetase. The definitions of SCOP and Pfam are very similar, but rather different from the stricter structural definition of CATH. The definitions can be compared at the level of structure for each database. The two Rasmol windows on the right show the Pfam definition above the CATH definition.

 
An area of significant difference in domain definitions between Pfam and the structural databases is due to discontinuous domains defined in SCOP and CATH. A discontinuous domain is one where the linear sequence of the domain is interrupted by another inserted domain. For example, the IMPDH domain (Pfam accession PF00478) is found as a continuous domain in the GuaB protein, and with a pair of inserted CBS domains (PF00571) in inosine monophosphate dehydrogenases. Currently, there are 29 examples of discontinuous domains in Pfam. Modelling of discontinuous domains is achieved by forcing the profile-HMM to allow the inserted domain as a long insertion. We do this by using the –hand option in the HMMER software along with a # = GC RF line. For clarity the sequence of the inserted domain is also masked with X characters and the presence of a nested domain is indicated in the flat files by an NE tag. This improved modelling has allowed more accurate description of discontinuous domains as we see them in protein structures, and leads to increased search sensitivity.


    IMPROVED FAMILY MEMBERSHIP
 TOP
 ABSTRACT
 INTRODUCTION
 GROWTH OF Pfam
 IMPROVED MODELLING OF DOMAINS...
 IMPROVED FAMILY MEMBERSHIP
 IMPROVED STRUCTURE IMAGES
 AVAILABILITY
 REFERENCES
 
To provide users with a more unified view of protein domains, we have implemented two web-based innovations. First, the SMART database of protein domains (8) and Pfam each contain many entries that are not available in the other, and in other cases the family memberships differ markedly. The two databases exchange lists of matches and present these matches on the Pfam and SMART websites.

Secondly, we have applied language modelling techniques from speech recognition to identify weak domain hits (9). When the HMMER score is below the curated threshold, there is often sufficient contextual information contained in the other domain hits to the protein to increase this score above the threshold. Language modelling is applied as a post-processing step to the set of all HMMER identified matches with an E-value of <1000. A dynamic programming algorithm is used to find the highest scoring domain architecture for a protein, taking into account both HMMER and context scores. We find 32 587 additional domain occurrences in this way, accounting for an additional 0.5% residue coverage in Pfam release 10.0.


    IMPROVED STRUCTURE IMAGES
 TOP
 ABSTRACT
 INTRODUCTION
 GROWTH OF Pfam
 IMPROVED MODELLING OF DOMAINS...
 IMPROVED FAMILY MEMBERSHIP
 IMPROVED STRUCTURE IMAGES
 AVAILABILITY
 REFERENCES
 
Despite the large increase in the number of Pfam entries, just over one-third of entries contain at least one protein of known 3D structure. Previously, Pfam used structure images kindly provided by the PDBsum database (10). To make the images more informative with respect to Pfam we now colour the structures by Pfam domain. This domain mark-up of structures was greatly aided by the mapping of PDB sequences to Swiss-Prot sequences provided by the EBI Macromolecular Structure Database (E-MSD) (11). The static images are generated using Molscript (12) and rendered using Raster3D (13). Each image is accompanied by a brief description of the structure, followed by the domain mark-up key, which contains links to the family pages for all the domains in the structure (Fig. 2).



View larger version (118K):
[in this window]
[in a new window]
 
Figure 2. Structure image coloured by Pfam domain, including a coloured domain key below. This structure shows the complex of P27, cyclin and CDK2. Clicking on the image links to the PDBsum resource (10).

 

    AVAILABILITY
 TOP
 ABSTRACT
 INTRODUCTION
 GROWTH OF Pfam
 IMPROVED MODELLING OF DOMAINS...
 IMPROVED FAMILY MEMBERSHIP
 IMPROVED STRUCTURE IMAGES
 AVAILABILITY
 REFERENCES
 
The Pfam database is freely available on the web in the UK (http://www.sanger.ac.uk/Software/Pfam/), the USA (http://pfam.wustl.edu/), France (http://pfam.jouy.inra.fr/) and Sweden (http://Pfam.cgb.ki.se/). All data are available for download in flat file form from the FTP sites linked from each Pfam website, and also as a set of MySQL relational database files.


    ACKNOWLEDGEMENTS
 
We would like to thank William Mifsud, Nicola Kerrison, David Waterfield and Ben Vella Briffa for adding many of the new families to Pfam. We are grateful to Kevin Howe for useful discussions and advice. We would also like to thank Timo Lassmann and Markus Wistrand for help maintaining the Sweden Pfam website and Lorenzo Cerutti for maintaining the French Pfam website. This work was funded by the The Wellcome Trust and and an MRC (UK) E-science grant.


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 GROWTH OF Pfam
 IMPROVED MODELLING OF DOMAINS...
 IMPROVED FAMILY MEMBERSHIP
 IMPROVED STRUCTURE IMAGES
 AVAILABILITY
 REFERENCES
 

  1. Bateman,A., Birney,E., Durbin,R., Eddy,S.R., Howe,K.L. and Sonnhammer,E.L.L. (2000) The Pfam protein families database. Nucleic Acids Res., 28, 263–266.[Abstract/Free Full Text]

  2. Bateman,A., Birney,E., Cerruti,L., Durbin,R., Etwiller,L., Eddy,S.R., Griffiths-Jones,S., Howe,K.L., Marshall,M. and Sonnhammer,E.L.L. (2002) The Pfam protein families database. Nucleic Acids Res., 30, 276–280.[Abstract/Free Full Text]

  3. Boeckmann,B., Bairoch,A., Apweiler,R., Blatter,M.C., Estreicher,A., Gasteiger,E., Martin,M.J., Michoud,K., O’Donovan,C., Phan,I. et al. (2003) The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003. Nucleic Acids Res., 31, 365–370.[Abstract/Free Full Text]

  4. Corpet,F., Servant,F., Gouzy,J. and Kahn,D. (2000) ProDom and ProDom-CG: tools for protein domain analysis and whole genome comparisons. Nucleic Acids Res., 28, 267–269.[Abstract/Free Full Text]

  5. Lo Conte,L., Brenner,S.E., Hubbard,T.J., Chothia,C. and Murzin,A.G. (2002) SCOP database in 2002: refinements accommodate structural genomics. Nucleic Acids Res., 30, 264–267.[Abstract/Free Full Text]

  6. Pearl,F.M., Bennett,C.F., Bray,J.E., Harrison,A.P., Martin,N., Shepherd,A., Sillitoe,I., Thornton,J. and Orengo,C.A. (2003) The CATH database: an extended protein family resource for structural and functional genomics. Nucleic Acids Res., 31, 452–455.[Abstract/Free Full Text]

  7. Sayle,R. and Milner-White,E. (1995) RASMOL: biomolecular graphics for all. Trends Biochem. Sci., 20, 374–374.[CrossRef][Web of Science][Medline]

  8. Letunic,I., Goodstadt,L., Dickens,N.J., Doerks,T., Schultz,J., Mott,R., Ciccarelli,F., Copley,R.R., Ponting,C.P. and Bork,P. (2002) Recent improvements to the SMART domain-based sequence annotation resource. Nucleic Acids Res., 30, 242–244.[Abstract/Free Full Text]

  9. Coin,L., Bateman,A. and Durbin,R. (2003) Enhanced protein domain discovery by using language modeling techniques from speech recognition. Proc. Natl Acad. Sci. USA, 100, 4516–4520.[Abstract/Free Full Text]

  10. Laskowski,R.A. (2001) PDBsum: summaries and analyses of PDB structures. Nucleic Acids Res., 29, 221–222.[Abstract/Free Full Text]

  11. Boutselakis,H., Dimitropoulos,D., Fillon,J., Golovin,A., Henrick,K., Hussain,A., Ionides,J., John,M., Keller,P.A., Krissinel,E. et al. (2003) E-MSD: the European Bioinformatics Institute Macromolecular Structure Database. Nucleic Acids Res., 31, 458–462.[Abstract/Free Full Text]

  12. Kraulis,P. (1991) MOLSCRIPT: A program to produce both detailed and schematic plots of protein structures. J. Appl. Crystallogr., 24, 946–950.[CrossRef]

  13. Bacon,D. and Anderson,W. (1988) A fast algorithm for rendering space-filling molecule pictures. J. Mol. Graph., 6, 219–220.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Protein Eng Des SelHome page
E. A. Champion, L. Kundrat, L. Regan, and S. J. Baserga
A structural model for the HAT domain of Utp6 incorporating bioinformatics and genetics
Protein Eng. Des. Sel., July 1, 2009; 22(7): 431 - 439.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
P. H. Degnan, Y. Yu, N. Sisneros, R. A. Wing, and N. A. Moran
Hamiltonella defensa, genome evolution of protective bacterial endosymbiont from pathogenic ancestors
PNAS, June 2, 2009; 106(22): 9063 - 9068.
[Abstract] [Full Text] [PDF]


Home page
IOVSHome page
M. Moosajee, M. Tulloch, R. A. Baron, C. Y. Gregory-Evans, J. B. Pereira-Leal, and M. C. Seabra
Single choroideremia Gene in Nonmammalian Vertebrates Explains Early Embryonic Lethality of the Zebrafish Model of Choroideremia
Invest. Ophthalmol. Vis. Sci., June 1, 2009; 50(6): 3009 - 3016.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
P. J. A. Cock, T. Antao, J. T. Chang, B. A. Chapman, C. J. Cox, A. Dalke, I. Friedberg, T. Hamelryck, F. Kauff, B. Wilczynski, et al.
Biopython: freely available Python tools for computational molecular biology and bioinformatics
Bioinformatics, June 1, 2009; 25(11): 1422 - 1423.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
W. Zhang, A. Lorence, H. A. Gruszewski, B. I. Chevone, and C. L. Nessler
AMR1, an Arabidopsis Gene That Coordinately and Negatively Regulates the Mannose/L-Galactose Ascorbic Acid Biosynthetic Pathway
Plant Physiology, June 1, 2009; 150(2): 942 - 950.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
Y. Kawabe, T. Morio, J. L. James, A. R. Prescott, Y. Tanaka, and P. Schaap
Activated cAMP receptors switch encystation into sporulation
PNAS, April 28, 2009; 106(17): 7089 - 7094.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
T. M. Alleyne, L. Pena-Castillo, G. Badis, S. Talukder, M. F. Berger, A. R. Gehrke, A. A. Philippakis, M. L. Bulyk, Q. D. Morris, and T. R. Hughes
Predicting the binding preference of transcription factors to individual DNA k-mers
Bioinformatics, April 15, 2009; 25(8): 1012 - 1018.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
X. Hou, K. Xie, J. Yao, Z. Qi, and L. Xiong
A homolog of human ski-interacting protein in rice positively regulates cell viability and stress tolerance
PNAS, April 14, 2009; 106(15): 6410 - 6415.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
L. Klasson, J. Westberg, P. Sapountzis, K. Naslund, Y. Lutnaes, A. C. Darby, Z. Veneti, L. Chen, H. R. Braig, R. Garrett, et al.
The mosaic genome structure of the Wolbachia wRi strain infecting Drosophila simulans
PNAS, April 7, 2009; 106(14): 5725 - 5730.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
S. D. Hooper, I. J. Anderson, A. Pati, D. Dalevi, K. Mavromatis, and N. C. Kyrpides
Integration of phenotypic metadata and protein similarity in Archaea using a spectral bipartitioning approach
Nucleic Acids Res., April 1, 2009; 37(7): 2096 - 2104.
[Abstract] [Full Text] [PDF]


Home page
MicrobiologyHome page
L. Ruiz, Y. Coute, B. Sanchez, C. G. de los Reyes-Gavilan, J.-C. Sanchez, and A. Margolles
The cell-envelope proteome of Bifidobacterium longum in an in vitro bile environment
Microbiology, March 1, 2009; 155(3): 957 - 967.
[Abstract] [Full Text] [PDF]


Home page
ANN BOT (LOND)Home page
S. Schuette, A. J. Wood, M. Geisler, J. Geisler-Lee, R. Ligrone, and K. S. Renzaglia
Novel localization of callose in the spores of Physcomitrella patens and phylogenomics of the callose synthase gene family
Ann. Bot., March 1, 2009; 103(5): 749 - 756.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
D. Chandran, Y. C. Tai, G. Hather, J. Dewdney, C. Denoux, D. G. Burgess, F. M. Ausubel, T. P. Speed, and M. C. Wildermuth
Temporal Global Expression Data Reveal Known and Novel Salicylate-Impacted Processes and Regulators Mediating Powdery Mildew Growth and Reproduction on Arabidopsis
Plant Physiology, March 1, 2009; 149(3): 1435 - 1451.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
V. Sorokin, K. Severinov, and M. S. Gelfand
Systematic prediction of control proteins and their DNA binding sites
Nucleic Acids Res., February 1, 2009; 37(2): 441 - 451.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
H. Huang and J. S. Bader
Precision and recall estimates for two-hybrid screens
Bioinformatics, February 1, 2009; 25(3): 372 - 378.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
C. Brochier-Armanet, E. Talla, and S. Gribaldo
The Multiple Evolutionary Histories of Dioxygen Reductases: Implications for the Origin and Evolution of Aerobic Respiration
Mol. Biol. Evol., February 1, 2009; 26(2): 285 - 297.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
K. Kashiyama, T. Seki, H. Numata, and S. G. Goto
Molecular Characterization of Visual Pigments in Branchiopoda and the Evolution of Opsins in Arthropoda
Mol. Biol. Evol., February 1, 2009; 26(2): 299 - 311.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
J. H. Hur, K. Van Doninck, M. L. Mandigo, and M. Meselson
Degenerate Tetraploidy Was Established Before Bdelloid Rotifer Families Diverged
Mol. Biol. Evol., February 1, 2009; 26(2): 375 - 383.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
J. D. King, E. Vinogradov, A. Preston, J. Li, and D. J. Maskell
Post-assembly Modification of Bordetella bronchiseptica O Polysaccharide by a Novel Periplasmic Enzyme Encoded by wbmE
J. Biol. Chem., January 16, 2009; 284(3): 1474 - 1483.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
S. Pina, A. Olvera, A. Barcelo, and A. Bensaid
Trimeric Autotransporters of Haemophilus parasuis: Generation of an Extensive Passenger Domain Repertoire Specific for Pathogenic Strains
J. Bacteriol., January 15, 2009; 191(2): 576 - 587.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
J. Martin, S. Abubucker, T. Wylie, Y. Yin, Z. Wang, and M. Mitreva
Nematode.net update 2008: improvements enabling more efficient data mining and comparative nematode genomics
Nucleic Acids Res., January 1, 2009; 37(suppl_1): D571 - D578.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
Y. Ahmad, F.-M. Boisvert, P. Gregor, A. Cobley, and A. I. Lamond
NOPdb: Nucleolar Proteome Database--2008 update
Nucleic Acids Res., January 1, 2009; 37(suppl_1): D181 - D184.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
A. L. Mallam, E. R. Morris, and S. E. Jackson
Exploring knotting mechanisms in protein folding
PNAS, December 2, 2008; 105(48): 18740 - 18745.
[Abstract] [Full Text] [PDF]


Home page
Eukaryot CellHome page
T. M. Mittelmeier, P. Berthold, A. Danon, M. R. Lamb, A. Levitan, M. E. Rice, and C. L. Dieckmann
C2 Domain Protein MIN1 Promotes Eyespot Organization in Chlamydomonas reinhardtii
Eukaryot. Cell, December 1, 2008; 7(12): 2100 - 2112.
[Abstract] [Full Text] [PDF]


Home page
Appl. Environ. Microbiol.Home page
P. H. Degnan and N. A. Moran
Diverse Phage-Encoded Toxins in a Protective Insect Endosymbiont
Appl. Envir. Microbiol., November 1, 2008; 74(21): 6782 - 6791.
[Abstract] [Full Text] [PDF]


Home page
Mol. Cell. Biol.Home page
S. A. Mikolajczak, H. Silva-Rivera, X. Peng, A. S. Tarun, N. Camargo, V. Jacobs-Lorena, T. M. Daly, L. W. Bergman, P. de la Vega, J. Williams, et al.
Distinct Malaria Parasite Sporozoites Reveal Transcriptional Changes That Cause Differential Tissue Infection Competence in the Mosquito Vector and Mammalian Host
Mol. Cell. Biol., October 15, 2008; 28(20): 6196 - 6207.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
Y. Bromberg, G. Yachdav, and B. Rost
SNAP predicts effect of mutations on protein function
Bioinformatics, October 15, 2008; 24(20): 2397 - 2398.
[Abstract] [Full Text] [PDF]


Home page
Mol. Cell. ProteomicsHome page
Y. J. Huang, D. Hang, L. J. Lu, L. Tong, M. B. Gerstein, and G. T. Montelione
Targeting the Human Cancer Pathway Protein Interaction Network by Structural Genomics
Mol. Cell. Proteomics, October 1, 2008; 7(10): 2048 - 2060.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
F. Morcos, C. Lamanna, M. Sikora, and J. Izaguirre
Cytoprophet: a Cytoscape plug-in for protein and domain interaction networks inference
Bioinformatics, October 1, 2008; 24(19): 2265 - 2266.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
A. V. Ritchie, S. van Es, C. Fouquet, and P. Schaap
From Drought Sensing to Developmental Control: Evolution of Cyclic AMP Signaling in Social Amoebas
Mol. Biol. Evol., October 1, 2008; 25(10): 2109 - 2118.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
E. Akiva, Z. Itzhaki, and H. Margalit
Built-in loops allow versatility in domain-domain interactions: Lessons from self-interacting domains
PNAS, September 9, 2008; 105(36): 13292 - 13297.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
M. Letek, A. A. Ocampo-Sosa, M. Sanders, U. Fogarty, T. Buckley, D. P. Leadon, P. Gonzalez, M. Scortti, W. G. Meijer, J. Parkhill, et al.
Evolution of the Rhodococcus equi vap Pathogenicity Island Seen through Comparison of Host-Associated vapA and vapB Virulence Plasmids
J. Bacteriol., September 1, 2008; 190(17): 5797 - 5805.
[Abstract] [Full Text] [PDF]


Home page
Mol PlantHome page
L.-R. Zeng, C. H. Park, R.C. Venu, J. Gough, and G.-L. Wang
Classification, Expression Pattern, and E3 Ligase Activity Assay of Rice U-Box-Containing Proteins
Mol Plant, September 1, 2008; 1(5): 800 - 815.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
S. Greiner, X. Wang, R. G. Herrmann, U. Rauwolf, K. Mayer, G. Haberer, and J. Meurer
The Complete Nucleotide Sequences of the 5 Genetically Distinct Plastid Genomes of Oenothera, Subsection Oenothera: II. A Microevolutionary View Using Bioinformatics and Formal Genetic Data
Mol. Biol. Evol., September 1, 2008; 25(9): 2019 - 2030.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
S. Yang, M. Gao, C. Xu, J. Gao, S. Deshpande, S. Lin, B. A. Roe, and H. Zhu
Alfalfa benefits from Medicago truncatula: The RCT1 gene from M. truncatula confers broad-spectrum resistance to anthracnose in alfalfa
PNAS, August 26, 2008; 105(34): 12164 - 12169.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
D. Dalevi, N. N. Ivanova, K. Mavromatis, S. D. Hooper, E. Szeto, P. Hugenholtz, N. C. Kyrpides, and V. M. Markowitz
Annotation of metagenome short reads using proxygenes
Bioinformatics, August 15, 2008; 24(16): i7 - i13.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. Pandey, M. Koyuturk, S. Subramaniam, and A. Grama
Functional coherence in domain interaction networks
Bioinformatics, August 15, 2008; 24(16): i28 - i34.
[Abstract] [Full Text] [PDF]


Home page
Appl. Environ. Microbiol.Home page
M. Liu, A. Nauta, C. Francke, and R. J. Siezen
Comparative Genomics of Enzymes in Flavor-Forming Pathways from Amino Acids in Lactic Acid Bacteria
Appl. Envir. Microbiol., August 1, 2008; 74(15): 4590 - 4600.
[Full Text] [PDF]


Home page
GlycobiologyHome page
N. Deshpande, M. R Wilkins, N. Packer, and H. Nevalainen
Protein glycosylation pathways in filamentous fungi
Glycobiology, August 1, 2008; 18(8): 626 - 637.
[Abstract] [Full Text] [PDF]


Home page
Eukaryot CellHome page
C. Schilde, K. Lutter, R. Kissmehl, and H. Plattner
Molecular Identification of a SNAP-25-Like SNARE Protein in Paramecium
Eukaryot. Cell, August 1, 2008; 7(8): 1387 - 1402.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
R. Mor-Cohen, N. Rosenberg, M. Landau, J. Lahav, and U. Seligsohn
Specific Cysteines in {beta}3 Are Involved in Disulfide Bond Exchange-dependent and -independent Activation of {alpha}IIb{beta}3
J. Biol. Chem., July 11, 2008; 283(28): 19235 - 19244.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
S. Wong and M. A. Ragan
MACHOS: Markov clusters of homologous subsequences
Bioinformatics, July 1, 2008; 24(13): i77 - i85.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
J. Pei, M. Tang, and N. V. Grishin
PROMALS3D web server for accurate multiple protein sequence and structure alignments
Nucleic Acids Res., July 1, 2008; 36(suppl_2): W30 - W34.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
M. von Grotthuss, D. Plewczynski, G. Vriend, and L. Rychlewski
3D-Fun: predicting enzyme function from structure
Nucleic Acids Res., July 1, 2008; 36(suppl_2): W303 - W307.
[Abstract] [Full Text] [PDF]


Home page
Appl. Environ. Microbiol.Home page
M. A. Anwar, S. Kralj, M. J. E. C. van der Maarel, and L. Dijkhuizen
The Probiotic Lactobacillus johnsonii NCC 533 Produces High-Molecular-Mass Inulin from Sucrose by Using an Inulosucrase Enzyme
Appl. Envir. Microbiol., June 1, 2008; 74(11): 3426 - 3433.
[Abstract] [Full Text] [PDF]


Home page
Syst BiolHome page
M. J. Sanderson, D. Boss, D. Chen, K. A. Cranston, and A. Wehe
The PhyLoTA Browser: Processing GenBank for Molecular Phylogenetics Research
Syst Biol, June 1, 2008; 57(3): 335 - 346.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
M. Zebisch and N. Strater
Structural insight into signal conversion and inactivation by NTPDase2 in purinergic signaling
PNAS, May 13, 2008; 105(19): 6882 - 6887.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
J. D. Schrag, S. Jiralerspong, M. Banville, M. L. Jaramillo, and M. D. O'Connor-McCourt
The crystal structure and dimerization interface of GADD45{gamma}
PNAS, May 6, 2008; 105(18): 6566 - 6571.
[Abstract] [Full Text] [PDF]


Home page
MicrobiologyHome page
M. Balado, C. R. Osorio, and M. L. Lemos
Biosynthetic and regulatory elements involved in the production of the siderophore vanchrobactin in Vibrio anguillarum
Microbiology, May 1, 2008; 154(5): 1400 - 1413.
[Abstract] [Full Text] [PDF]


Home page
Infect. Immun.Home page
D. F. Aubert, R. S. Flannagan, and M. A. Valvano
A Novel Sensor Kinase-Response Regulator Hybrid Controls Biofilm Formation and Type VI Secretion System Activity in Burkholderia cenocepacia
Infect. Immun., May 1, 2008; 76(5): 1979 - 1991.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
M. Pilhofer, K. Rappl, C. Eckl, A. P. Bauer, W. Ludwig, K.-H. Schleifer, and G. Petroni
Characterization and Evolution of Cell Division and Cell Wall Synthesis Genes in the Bacterial Phyla Verrucomicrobia, Lentisphaerae, Chlamydiae, and Planctomycetes and Phylogenetic Comparison with rRNA Genes
J. Bacteriol., May 1, 2008; 190(9): 3192 - 3202.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
M. Lloret-Llinares, C. Carre, A. Vaquero, N. de Olano, and F. Azorin
Characterization of Drosophila melanogaster JmjC+N histone demethylases
Nucleic Acids Res., May 1, 2008; 36(9): 2852 - 2863.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
S. L. Zimmer, Z. Fei, and D. B. Stern
Genome-Based Analysis of Chlamydomonas reinhardtii Exoribonucleases and Poly(A) Polymerases Predicts Unexpected Organellar and Exosomal Features
Genetics, May 1, 2008; 179(1): 125 - 136.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
X. Xu, J. Wu, J. Xiao, Y. Tan, Q. Bao, F. Zhao, and X. Li
PlasmoGF: an integrated system for comparative genomics and phylogenetic analysis of Plasmodium gene families
Bioinformatics, May 1, 2008; 24(9): 1217 - 1220.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
K. Horan, C. Jang, J. Bailey-Serres, R. Mittler, C. Shelton, J. F. Harper, J.-K. Zhu, J. C. Cushman, M. Gollery, and T. Girke
Annotating Genes of Known and Unknown Function by Large-Scale Coexpression Analysis
Plant Physiology, May 1, 2008; 147(1): 41 - 57.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
L. Neumann, F. Spinozzi, R. Sinibaldi, F. Rustichelli, M. Potter, and A. Steinbuchel
Binding of the Major Phasin, PhaP1, from Ralstonia eutropha H16 to Poly(3-Hydroxybutyrate) Granules
J. Bacteriol., April 15, 2008; 190(8): 2911 - 2919.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
K. Ye, G. Vriend, and A. P. IJzerman
Tracing evolutionary pressure
Bioinformatics, April 1, 2008; 24(7): 908 - 915.
[Abstract] [Full Text] [PDF]


Home page
Plant CellHome page
E. Magnani and S. Hake
KNOX Lost the OX: The Arabidopsis KNATM Gene Defines a Novel Class of KNOX Transcriptional Regulators Missing the Homeodomain
PLANT CELL, April 1, 2008; 20(4): 875 - 887.
[Abstract] [Full Text] [PDF]


Home page
Biol. Reprod.Home page
E. De La Chesnaye, B. Kerr, A. Paredes, H. Merchant-Larios, J. P. Mendez, and S. R. Ojeda
Fbxw15/Fbxo12J Is an F-Box Protein-Encoding Gene Selectively Expressed in Oocytes of the Mouse Ovary
Biol Reprod, April 1, 2008; 78(4): 714 - 725.
[Abstract] [Full Text] [PDF]


Home page
Brief Funct Genomic ProteomicHome page
C. Ansong, S. O. Purvine, J. N. Adkins, M. S. Lipton, and R. D. Smith
Proteogenomics: needs and roles to be filled by proteomics in genome annotation
Brief Funct Genomic Proteomic, March 10, 2008; (2008) eln010v1.
[Abstract] [Full Text] [PDF]


Home page
J Antimicrob ChemotherHome page
S. K. P. Lau, G. K. M. Wong, M. W. S. Li, P. C. Y. Woo, and K.-y. Yuen
Distribution and molecular characterization of tetracycline resistance in Laribacter hongkongensis
J. Antimicrob. Chemother., March 1, 2008; 61(3): 488 - 497.
[Abstract] [Full Text] [PDF]


Home page
ScienceHome page
M. Abedin and N. King
The Premetazoan Ancestry of Cadherins
Science, February 15, 2008; 319(5865): 946 - 948.
[Abstract] [Full Text] [PDF]


Home page
J. Cell Sci.Home page
L. Bengtsson and H. Otto
LUMA interacts with emerin and influences its distribution at the inner nuclear membrane
J. Cell Sci., February 15, 2008; 121(4): 536 - 548.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
V. Kunin, S. He, F. Warnecke, S. B. Peterson, H. Garcia Martin, M. Haynes, N. Ivanova, L. L. Blackall, M. Breitbart, F. Rohwer, et al.
A bacterial metapopulation adapts locally to phage predation despite global dispersal
Genome Res., February 1, 2008; 18(2): 293 - 297.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
D. Kerk, G. Templeton, and G. B.G. Moorhead
Evolutionary Radiation Pattern of Novel Protein Phosphatases Revealed by Analysis of Protein Data from the Completely Sequenced Genomes of Humans, Green Algae, and Higher Plants
Plant Physiology, February 1, 2008; 146(2): 351 - 367.
[Abstract] [Full Text] [PDF]


Home page
Appl. Environ. Microbiol.Home page
R. J. Siezen, M. J. C. Starrenburg, J. Boekhorst, B. Renckens, D. Molenaar, and J. E. T. van Hylckama Vlieg
Genome-Scale Genotype-Phenotype Matching of Two Lactococcus lactis Isolates from Plants Identifies Mechanisms of Adaptation to the Plant Niche
Appl. Envir. Microbiol., January 15, 2008; 74(2): 424 - 436.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
H. Cheng, B.-H. Kim, and N. V. Grishin
MALISAM: a database of structurally analogous motifs in proteins
Nucleic Acids Res., January 11, 2008; 36(suppl_1): D211 - D217.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
S. M.C. Robb, E. Ross, and A. S. Alvarado
SmedGD: the Schmidtea mediterranea genome database
Nucleic Acids Res., January 11, 2008; 36(suppl_1): D599 - D606.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
V. M. Markowitz, N. N. Ivanova, E. Szeto, K. Palaniappan, K. Chu, D. Dalevi, I-M. A. Chen, Y. Grechkin, I. Dubchak, I. Anderson, et al.
IMG/M: a data management and analysis system for metagenomes
Nucleic Acids Res., January 11, 2008; 36(suppl_1): D534 - D538.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
V. M. Markowitz, E. Szeto, K. Palaniappan, Y. Grechkin, K. Chu, I-M. A. Chen, I. Dubchak, I. Anderson, A. Lykidis, K. Mavromatis, et al.
The integrated microbial genomes (IMG) system in 2007: data content and analysis tool extensions
Nucleic Acids Res., January 11, 2008; 36(suppl_1): D528 - D533.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
T. Rossignol, P. Lechat, C. Cuomo, Q. Zeng, I. Moszer, and C. d'Enfert
CandidaDB: a multi-genome database for Candida species and related Saccharomycotina
Nucleic Acids Res., January 11, 2008; 36(suppl_1): D557 - D561.
[Abstract] [Full Text] [PDF]


Home page
Brief BioinformHome page
M. Brilli, R. Fani, and P. Lio
Current trends in the bioinformatic sequence analysis of metabolic pathways in prokaryotes
Brief Bioinform, January 1, 2008; 9(1): 34 - 45.
[Abstract] [Full Text] [PDF]


Home page
J. Lipid Res.Home page
K. Hashimoto, A. C. Yoshizawa, S. Okuda, K. Kuma, S. Goto, and M. Kanehisa
The repertoire of desaturases and elongases reveals fatty acid variations in 56 eukaryotic genomes
J. Lipid Res., January 1, 2008; 49(1): 183 - 191.
[Abstract] [Full Text] [PDF]


Home page
Mol. Cell. ProteomicsHome page
J. J. Cragnolini and J. A. L. de Castro
Identification of Endogenously Presented Peptides from Chlamydia trachomatis with High Homology to Human Proteins and to a Natural Self-ligand of HLA-B27
Mol. Cell. Proteomics, January 1, 2008; 7(1): 170 - 180.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
B. L. Cantarel, I. Korf, S. M.C. Robb, G. Parra, E. Ross, B. Moore, C. Holt, A. Sanchez Alvarado, and M. Yandell
MAKER: An easy-to-use annotation pipeline designed for emerging model organism genomes
Genome Res., January 1, 2008; 18(1): 188 - 196.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
N. Molina and E. van Nimwegen
Universal patterns of purifying selection at noncoding positions in bacteria
Genome Res., January 1, 2008; 18(1): 148 - 160.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
M. Landau, K. Herz, E. Padan, and N. Ben-Tal
Model Structure of the Na+/H+ Exchanger 1 (NHE1): FUNCTIONAL AND CLINICAL IMPLICATIONS
J. Biol. Chem., December 28, 2007; 282(52): 37854 - 37863.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
H. N. Chua, W.-K. Sung, and L. Wong
An efficient strategy for extensive integration of diverse biological data for protein function prediction
Bioinformatics, December 15, 2007; 23(24): 3364 - 3373.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
J. P. McCutcheon and N. A. Moran
Parallel genomic evolution and metabolic interdependence in an ancient symbiosis
PNAS, December 4, 2007; 104(49): 19392 - 19397.
[Abstract] [Full Text] [PDF]


Home page
Appl. Environ. Microbiol.Home page
T. H. Hazen, D. Wu, J. A. Eisen, and P. A. Sobecky
Sequence Characterization and Comparative Analysis of Three Plasmids Isolated from Environmental Vibrio spp.
Appl. Envir. Microbiol., December 1, 2007; 73(23): 7703 - 7710.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
N. Bhardwaj, R. V. Stahelin, G. Zhao, W. Cho, and H. Lu
MeTaDoR: a comprehensive resource for membrane targeting domains and their host proteins
Bioinformatics, November 15, 2007; 23(22): 3110 - 3112.
[Abstract] [Full Text] [PDF]


Home page
Brief BioinformHome page
X. Liu and H. A. Karimi
High-throughput modeling and analysis of protein structural dynamics
Brief Bioinform, November 1, 2007; 8(6): 432 - 445.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
A. Mavroidi, D. M. Aanensen, D. Godoy, I. C. Skovsted, M. S. Kaltoft, P. R. Reeves, S. D. Bentley, and B. G. Spratt
Genetic Relatedness of the Streptococcus pneumoniae Capsular Biosynthetic Loci
J. Bacteriol., November 1, 2007; 189(21): 7841 - 7855.
[Abstract] [Full Text] [PDF]


Home page
Plant CellHome page
J. K. Hane, R. G.T. Lowe, P. S. Solomon, K.-C. Tan, C. L. Schoch, J. W. Spatafora, P. W. Crous, C. Kodira, B. W. Birren, J. E. Galagan, et al.
Dothideomycete Plant Interactions Illuminated by Genome Sequencing and EST Analysis of the Wheat Pathogen Stagonospora nodorum
PLANT CELL, November 1, 2007; 19(11): 3347 - 3368.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
C. A. Lee and A. D. Grossman
Identification of the Origin of Transfer (oriT) and DNA Relaxase Required for Conjugation of the Integrative and Conjugative Element ICEBs1 of Bacillus subtilis
J. Bacteriol., October 15, 2007; 189(20): 7254 - 7261.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Print PDF (191K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Bateman, A.
Right arrow Articles by Eddy, S. R.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Bateman, A.
Right arrow Articles by Eddy, S. R.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?