Skip Navigation

This Article
Right arrow Abstract Freely available
Right arrow Print PDF (156K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (280)
Right arrowRequest Permissions
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Sonnhammer, E. L.
Right arrow Articles by Durbin, R.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Sonnhammer, E. L.
Right arrow Articles by Durbin, R.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Nucleic Acids Research Pages 320-322  


Pfam: multiple sequence alignments and HMM-profiles of protein domains
Introduction
Description OF THE DATA
   Pfam-B
  Sequence database coverage
Searching Pfam
World Wide Web Servers, FTP Access And Format
Acknowledgements
References


Pfam: multiple sequence alignments and HMM-profiles of protein domains

Pfam: multiple sequence alignments and HMM-profiles of protein domains

Erik L. L. Sonnhammer*, Sean R. Eddy1, Ewan Birney2, Alex Bateman2, Richard Durbin2

Computational Biology Branch, National Center for Biotechnology Information, National Library of Medicine, Building 38A, Room 8N805, National Institutes of Health, Bethesda, MD 20894, USA, 1Department of Genetics, Washington University School of Medicine, St Louis, MO 63110, USA and 2Sanger Centre, Hinxton Hall, Cambridge CB10 1SA, UK

Received September 8, 1997; Revised and Accepted October 8, 1997

ABSTRACT

Pfam contains multiple alignments and hidden Markov model based profiles (HMM-profiles) of complete protein domains. The definition of domain boundaries, family members and alignment is done semi-automatically based on expert knowledge, sequence similarity, other protein family databases and the ability of HMM-profiles to correctly identify and align the members. Release 2.0 of Pfam contains 527 manually verified families which are available for browsing and on-line searching via the World Wide Web in the UK at http://www.sanger.ac.uk/Pfam/ and in the US at http://genome.wustl.edu/Pfam/ Pfam 2.0 matches one or more domains in 50% of Swissprot-34 sequences, and 25% of a large sample of predicted proteins from the Caenorhabditis elegans genome.

INTRODUCTION

A relatively small number of structural and functional domains are used in a large number of different proteins. Particularly for protein analysis and annotation in large-scale sequencing projects, there is a growing need for easily interpretable and sensitive detection of common protein domains. A protein containing one or more common domains can produce a morass of hundreds or thousands of BLAST hits when searching single sequence databases (e.g. GenBank, Swissprot, PIR). Although searches can be augmented by tools that condense and summarise results (1), satisfactory annotation of such proteins often becomes a time-consuming and error-prone process. Instead, a search of an organised database of protein domain families can produce more concise results which simplify annotation, domain parsing and functional prediction for a query sequence (2-5). Protein family databases are typically based on multiple sequence alignments of known family members. Conserved features can be recognised in the alignment and given higher weight in searches, which for distant similarities can often render the comparison more sensitive than pairwise alignment approaches.

We present here Pfam (6) release 2.0. Pfam was developed in order to use HMM-profile analysis to complement BLAST analysis in the Caenorhabditis elegans genome project. The main distinction between Pfam and most other protein family databases is that for all of Pfam, both the family definition and the search method span entire domains, including not only conserved motifs but also less-conserved regions, insertions and deletions. HMM-profile methods allow variable conservation and insertions/deletions to be dealt with in a fairly robust way (7,8). Modelling of complete domains should facilitate more biologically meaningful sequence annotation, and, in some cases, more sensitive detection.

DESCRIPTION OF THE DATA

For each protein domain family in Pfam, there are three important files. The seed alignment is a manually verified multiple alignment of a representative set of sequences (Fig. 1). An HMM-profile is built from the seed alignment for database searching and alignment purposes. A full alignment is generated automatically from the seed HMM-profile by searching Swissprot for all detectable members and aligning them to the HMM-profile. The distinction between seed and full alignments facilitates updating the database; the seed alignments are stable resources, whereas full alignments and HMM-profiles can be generated automatically for any new Swissprot (or other sequence database) release.


Figure 1 Example of a typical Pfam entry, the SH2 family. Shown is the flat file record including a reduced version of the seed alignment.

Each family has a name, a permanent accession number and a record of the methods used to identify the family members and create the alignments. There is also either a brief description of the usual function and structure of the domain, or (more often) links to other on-line documentation resources such as Prosite and Prints.

Both the seed and the full alignments are subjected to a small array of `quality control' procedures, to verify that the alignments are sensible, that the HMM-detected sequences in the full alignment include all presumed members of the family in Swissprot and no other sequences, and that the family does not overlap with other Pfam families. The process of generating the Pfam family is iterated, if necessary, until all quality requirements are met.

Most Pfam families are based on, and cross-referenced to, corresponding Prosite or Prints entries. In many cases, however, the definition of which sequences belong to a family differs between the databases. This is a pragmatic consequence of the different search methods used. Prosite and Prints detection relies primarily on short conserved patterns corresponding to superfamily motifs. A Prosite pattern or Prints fingerprint may recognise a highly conserved motif shared amongst an otherwise highly diverged superfamily that Pfam splits into several families; conversely, Pfam may recognise a superfamily that Prosite and Prints classify into several distinct families with distinct motif signatures. For some protein domain families, there may be no motif sufficiently conserved to make a discriminative pattern or fingerprint. (Prosite is increasingly incorporating profiles for these families; these Prosite profiles are very similar to Pfam models.) Only the largest (>15 members) Prosite families were systematically used to construct Pfam entries. For smaller families, constructing an HMM-profile is of less value since the sensitivity is unlikely to improve relative to single-sequence searching, and because a small sample is often non-representative. Of the 71 Pfam families with no corresponding Prosite or Prints entry, 55 were `discovered' as large clusters in Pfam-B (see below). 24 Pfam families contain links to other World Wide Web (WWW) protein family documentation resources, some of which were gleaned from the ProWeb server (9).

Pfam 2.0 contains 527 families, comprising 39 113 sequence segments and 6.8 million residues in the full alignments. All sequences were taken from Swissprot 34 (10). The alignments are on average 275 residues wide, including gaps. There are on average ~75 members per family in full alignments, and ~22 in seed alignments.

Pfam-B

For comprehensiveness, all Swissprot sequences not in Pfam are clustered automatically by the program Domainer (2), which also constructs multiple alignments automatically and is the basis for the ProDom protein family database. The quality of these alignments tends to be low, but domain-based automated clustering is a convenient method of identifying large obvious families that need to be targeted for Pfam model construction. Although we do not stably maintain, annotate or produce HMM-profiles of these clusters, we make them available as Pfam-B. Pfam-B 2.0 contains 13 289 clusters, 62 611 subsequences, and 8.2 million residues. On average, alignments are 146 residues wide (including gaps) and contain five members.


Figure 2 Pfam 2.0 contains domains from nearly half of all Swissprot 34 proteins. The automatic clusters in Pfam-B 2.0 contain domains from 33% of the Swissprot proteins that do not contain Pfam domains. When counting residue-by-residue, roughly a third of Swissprot is covered by Pfam and Pfam-B each. Pfam-B does not include proteins known to be fragments or segments shorter than 30 residues; the figures for unique sequences are therefore overestimated.

Sequence database coverage

As shown in Figure 2, 48% of the sequences and 32% of the residues in Swissprot 34 are included in annotated Pfam alignments. If unannotated Pfam-B clusters are also taken into account, 81% of sequences and 71% of residues in Swissprot 34 are included in Pfam. In searches of a large and presumably unbiased set of predicted protein sequences from the C.elegans genome, 25% of sequences and 13% of residues show significant hits to Pfam HMM-profiles. The numbers are slightly lower for prokaryotic genomes.


Figure 3 Tabular output (a) and schematic output (b) from a Pfam search with the C.elegans protein E01H11.1 as query. Both pictures were taken from the Washington University WWW server.

SEARCHING Pfam

The US and UK Pfam WWW servers provide users the ability to search query protein sequences against one, all, or a few Pfam HMMs. Results are returned in tabular format, and both GIF- and Java-based graphical representations are available optionally. An example of the results from such a search is shown in Figure 3. Here, the C.elegans Kin-11 gene product (E01H11.1) is shown to possess a duplicated phorbol esters/diacylglycerol binding domain (DAG/PE-bind), a C2 domain, a protein kinase catalytic domain (pkinase) and a duplicated domain frequently associated C-terminally to protein kinase domains (pkinase_C).

Users can also use Pfam HMM-profiles to search protein sequences locally using the freely available HMMER software package at http://genome.wustl.edu/eddy/hmmer.html#hmmer For comparing genomic and EST data to Pfam HMM-profiles, the programs GeneWise and ESTWise (11) are available at http://www.sanger.ac.uk/Software/Wise2/

WORLD WIDE WEB SERVERS, FTP ACCESS AND FORMAT

The Pfam home pages are http://www.sanger.ac.uk/Pfam/ at the Sanger Centre in the UK and http://genome.wustl.edu/Pfam/ at Washington University in the USA. The two servers are separately maintained and differ slightly in their services and capabilities, but are based on the same underlying Pfam database. Both servers support HMM searching, browsing of the family alignments and documentation and lookup of the domain organisation of proteins in Swissprot.

The entire database, including accessory data files such as Pfam schematics for Swissprot proteins, is also available as flat file format ASCII files by anonymous FTP at ftp.sanger.ac.uk and genome.wustl.edu in /pub/databases/Pfam/

The format of the Pfam alignment flat files is based on the EMBL/Swissprot two-character field labels. The following Pfam-specific labels are used: AL, alignment method of seed members; AM, alignment method of full alignment; AU, author responsible for the alignments; GA, gathering method/search program and cutoffs used to build full alignment; SE, source suggesting the seed members belong to the same family; SQ, number of sequences (and last line before the alignment starts). The alignment is in a simple format (Fig. 1) which consists of one line per subsequence containing the Swissprot sequence ID, start and end of the segment, and the aligned subsequence itself (no length limit). In the Pfam flat file, the corresponding Swissprot accession number is added to the right of each alignment line. Users of the Pfam database or WWW servers should cite this article as the appropriate reference.

ACKNOWLEDGEMENTS

We thank Robert Finn for preparing most of the new families for Pfam 2.0, and Jose Aguilar for writing and maintaining the Washington University Pfam server. Pfam development in SRE's group is supported by grant R01-HG01363 from the NIH National Human Genome Research Institute. Pfam development at the Sanger Centre is supported by the Wellcome Trust.

REFERENCES

1. Sonnhammer,E.L.L. and Durbin,R. (1994) Comput. Appl. Biosci., 10, 301-307. MEDLINE Abstract

2. Sonnhammer,E.L.L. and Kahn,D. (1994) Protein Sci., 3, 482-492. MEDLINE Abstract

3. Attwood,T.K., Beck,M.E., Bleasby,A.J., Degtyarenko,K., Michie,A.D. and Parry-Smith,D.J. (1997) Nucleic Acids Res., 25, 212-217 [see also this issue (1998) Nucleic Acids Res. 26, 304-308].

4. Bairoch,A., Bucher,P. and Hofmann,K. (1997) Nucleic Acids Res., 25, 217-221. MEDLINE Abstract

5. Henikoff,J.G., Pietrokovski,S. and Henikoff,S. (1997) Nucleic Acids Res., 25, 222-226 [see also this issue (1998) Nucleic Acids Res. 26, 309-312].

6. Sonnhammer,E.L.L., Eddy,S.R. and Durbin,R. (1997) Proteins, 28, 405-420.

7. Krogh,A., Brown,M., Mian,I.S., Sjoelander,K. and Haussler,D. (1994) J. Mol. Biol., 235, 1501-1531. MEDLINE Abstract

8. Eddy,S.R. (1996) Curr. Opin. Struct. Biol., 6, 361-365. MEDLINE Abstract

9. Henikoff,S., Endow,S.A. and Greene,E.A. (1996) Trends Biochem. Sci., 21, 444-445. MEDLINE Abstract

10. Bairoch,A. and Apweiler,R. (1997) Nucleic Acids Res., 25, 31-36 [see also this issue (1998) Nucleic Acids Res. 26, 38-42].

11. Birney,E. and Durbin,R. (1997) In ISMB-97; Proceedings Fifth International Conference on Intelligent Systems for Molecular Biology. AAAI Press, Menlo Park, pp. 56-64.


*To whom correspondence should be addressed. Tel: +1 301 435 5930; Fax: +1 301 480 9241; Email: sonnhammer@ncbi.nlm.nih.gov


This page is run by Oxford University Press, Great Clarendon Street, Oxford OX2 6DP, as part of the OUP Journals Comments and feedback: www-admin{at}oup.co.uk
Last modification: 17 Dec 1997
Copyright© Oxford University Press, 1998.

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Nucleic Acids ResHome page
Y. Y. Tseng, Z. J. Chen, and W.-H. Li
fPOP: footprinting functional pockets of proteins by comparative spatial patterns
Nucleic Acids Res., October 30, 2009; (2009) gkp900v1.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
K. Forslund and E. L. Sonnhammer
Benchmarking homology detection procedures with low complexity filters
Bioinformatics, October 1, 2009; 25(19): 2500 - 2505.
[Abstract] [Full Text] [PDF]


Home page
J. Neurosci.Home page
H. Fu, J. Cai, H. Clevers, E. Fast, S. Gray, R. Greenberg, M. K. Jain, Q. Ma, M. Qiu, D. H. Rowitch, et al.
A Genome-Wide Screen for Spatially Restricted Expression Patterns Identifies Transcription Factors That Regulate Glial Development
J. Neurosci., September 9, 2009; 29(36): 11399 - 11408.
[Abstract] [Full Text] [PDF]


Home page
Mol. Cell. ProteomicsHome page
J. A. DeGrasse, K. N. DuBois, D. Devos, T. N. Siegel, A. Sali, M. C. Field, M. P. Rout, and B. T. Chait
Evidence for a Shared Nuclear Pore Complex Architecture That Is Conserved from the Last Common Eukaryotic Ancestor
Mol. Cell. Proteomics, September 1, 2009; 8(9): 2119 - 2130.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
M. Levitt
Nature of the protein universe
PNAS, July 7, 2009; 106(27): 11079 - 11084.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
S. Das, T. Kanamoto, X. Ge, P. Xu, T. Unoki, C. L. Munro, and T. Kitten
Contribution of Lipoproteins and Lipoprotein Processing to Endocarditis Virulence in Streptococcus sanguinis
J. Bacteriol., July 1, 2009; 191(13): 4166 - 4179.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
A. M. Moses and R. Durbin
Inferring Selection on Amino Acid Preference in Protein Domains
Mol. Biol. Evol., March 1, 2009; 26(3): 527 - 536.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
H. M. Berman, J. D. Westbrook, M. J. Gabanyi, W. Tao, R. Shah, A. Kouranov, T. Schwede, K. Arnold, F. Kiefer, L. Bordoli, et al.
The protein structure initiative structural genomics knowledgebase
Nucleic Acids Res., January 1, 2009; 37(suppl_1): D365 - D368.
[Abstract] [Full Text] [PDF]


Home page
MicrobiologyHome page
X. Zou, Y. Zhu, E. L. Pohlmann, J. Li, Y. Zhang, and G. P. Roberts
Identification and functional characterization of NifA variants that are independent of GlnB activation in the photosynthetic bacterium Rhodospirillum rubrum
Microbiology, September 1, 2008; 154(9): 2689 - 2699.
[Abstract] [Full Text] [PDF]


Home page
Mol PlantHome page
L.-R. Zeng, C. H. Park, R.C. Venu, J. Gough, and G.-L. Wang
Classification, Expression Pattern, and E3 Ligase Activity Assay of Rice U-Box-Containing Proteins
Mol Plant, September 1, 2008; 1(5): 800 - 815.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
K. Forslund and E. L. L. Sonnhammer
Predicting protein function from domain content
Bioinformatics, August 1, 2008; 24(15): 1681 - 1687.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
I. M. Overton, C. A. J. van Niekerk, L. G. Carter, A. Dawson, D. M. A. Martin, S. Cameron, S. A. McMahon, M. F. White, W. N. Hunter, J. H. Naismith, et al.
TarO: a target optimisation system for structural biology
Nucleic Acids Res., July 1, 2008; 36(suppl_2): W190 - W196.
[Abstract] [Full Text] [PDF]


Home page
J HeredHome page
C.-M. Valcu, C. Lalanne, G. Muller-Starck, C. Plomion, and K. Schlink
Protein Polymorphism between 2 Picea abies Populations Revealed by 2-Dimensional Gel Electrophoresis and Tandem Mass Spectrometry
J. Hered., March 15, 2008; (2008) esn007v1.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
A. Biegert and J. Soding
De novo identification of highly diverged protein repeats by probabilistic consistency
Bioinformatics, March 15, 2008; 24(6): 807 - 814.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
T. Kosaka, S. Kato, T. Shimoyama, S. Ishii, T. Abe, and K. Watanabe
The genome of Pelotomaculum thermopropionicum reveals niche-associated evolution in anaerobic microbiota
Genome Res., March 1, 2008; 18(3): 442 - 448.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
M. E. Primo, S. Klinke, M. P. Sica, F. A. Goldbaum, J. Jakoncic, E. Poskus, and M. R. Ermacora
Structure of the Mature Ectodomain of the Human Receptor-type Protein-tyrosine Phosphatase IA-2
J. Biol. Chem., February 22, 2008; 283(8): 4674 - 4681.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
B. Saunders, S. Lyon, M. Day, B. Riley, E. Chenette, and S. Subramaniam
The Molecule Pages database
Nucleic Acids Res., January 11, 2008; 36(suppl_1): D700 - D706.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
I. Hughes, M. Saito, P. H. Schlesinger, and D. M. Ornitz
Otopetrin 1 activation by purinergic nucleotides regulates intracellular calcium
PNAS, July 17, 2007; 104(29): 12023 - 12028.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
N.-H. Cho, H.-R. Kim, J.-H. Lee, S.-Y. Kim, J. Kim, S. Cha, S.-Y. Kim, A. C. Darby, H.-H. Fuxelius, J. Yin, et al.
The Orientia tsutsugamushi genome reveals massive proliferation of conjugative type IV secretion system and host cell interaction genes
PNAS, May 8, 2007; 104(19): 7981 - 7986.
[Abstract] [Full Text] [PDF]


Home page
MicrobiologyHome page
N.-H. Hsiao, J. Soding, D. Linke, C. Lange, C. Hertweck, W. Wohlleben, and E. Takano
ScbA from Streptomyces coelicolor A3(2) has homology to fatty acid synthases and is able to synthesize {gamma}-butyrolactones
Microbiology, May 1, 2007; 153(5): 1394 - 1404.
[Abstract] [Full Text] [PDF]


Home page
Mol. Cell. ProteomicsHome page
T. M. Casey, J. L. Meade, and E. W. Hewitt
Organelle Proteomics: Identification of the Exocytic Machinery Associated with the Natural Killer Cell Secretory Lysosome
Mol. Cell. Proteomics, May 1, 2007; 6(5): 767 - 780.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
T. M. Phuong, C. B. Do, R. C. Edgar, and S. Batzoglou
Multiple alignment of protein sequences with repeats and rearrangements
Nucleic Acids Res., November 6, 2006; 34(20): 5932 - 5942.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
P. Xu, Y. Zhang, L. Kang, M. J. Roossinck, and K. S. Mysore
Computational Estimation and Experimental Verification of Off-Target Silencing during Posttranscriptional Gene Silencing in Plants
Plant Physiology, October 1, 2006; 142(2): 429 - 440.
[Abstract] [Full Text] [PDF]


Home page
J. Virol.Home page
G. Delhon, E. R. Tulman, C. L. Afonso, Z. Lu, J. J. Becnel, B. A. Moser, G. F. Kutish, and D. L. Rock
Genome of invertebrate iridescent virus type 3 (mosquito iridescent virus).
J. Virol., September 1, 2006; 80(17): 8439 - 8449.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
J. Soding, M. Remmert, and A. Biegert
HHrep: de novo protein repeat detection and the origin of TIM barrels.
Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W137 - W142.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
R. Bhadra, S. Sandhya, K. R. Abhinandan, S. Chakrabarti, R. Sowdhamini, and N. Srinivasan
Cascade PSI-BLAST web server: a remote homology search tool for relating protein domains.
Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W143 - W146.
[Abstract] [Full Text] [PDF]


Home page
Microbiol. Mol. Biol. Rev.Home page
M.-J. Han and S. Y. Lee
The Escherichia coli Proteome: Past, Present, and Future Prospects
Microbiol. Mol. Biol. Rev., June 1, 2006; 70(2): 362 - 439.
[Abstract] [Full Text] [PDF]


Home page
J. Virol.Home page
C. L. Afonso, E. R. Tulman, G. Delhon, Z. Lu, G. J. Viljoen, D. B. Wallace, G. F. Kutish, and D. L. Rock
Genome of crocodilepox virus.
J. Virol., May 1, 2006; 80(10): 4978 - 4991.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
N. M. Wills, B. Moore, A. Hammer, R. F. Gesteland, and J. F. Atkins
A Functional -1 Ribosomal Frameshift Signal in the Human Paraneoplastic Ma3 Gene
J. Biol. Chem., March 17, 2006; 281(11): 7082 - 7088.
[Abstract] [Full Text] [PDF]


Home page
Physiol. GenomicsHome page
F. Zhao, X. Zhang, C. Liang, J. Wu, Q. Bao, and S. Qin
Genome-wide analysis of restriction-modification system in unicellular and filamentous cyanobacteria
Physiol Genomics, February 23, 2006; 24(3): 181 - 190.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
D. H. Ardell and S. G. E. Andersson
TFAM detects co-evolution of tRNA identity rules with lateral transfer of histidyl-tRNA synthetase
Nucleic Acids Res., February 9, 2006; 34(3): 893 - 904.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
R. H. Y. Jiang, B. M. Tyler, S. C. Whisson, A. R. Hardham, and F. Govers
Ancient Origin of Elicitin Gene Clusters in Phytophthora Genomes
Mol. Biol. Evol., February 1, 2006; 23(2): 338 - 351.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
R. D. Finn, J. Mistry, B. Schuster-Bockler, S. Griffiths-Jones, V. Hollich, T. Lassmann, S. Moxon, M. Marshall, A. Khanna, R. Durbin, et al.
Pfam: clans, web tools and services
Nucleic Acids Res., January 1, 2006; 34(suppl_1): D247 - D251.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
J. Perry, N. Kleckner, and G. V. Borner
Bioinformatic analyses implicate the collaborating meiotic crossover/chiasma proteins Zip2, Zip3, and Spo22/Zip4 in ubiquitin labeling
PNAS, December 6, 2005; 102(49): 17594 - 17599.
[Abstract] [Full Text] [PDF]


Home page
J. Lipid Res.Home page
A. C. Lake, Y. Sun, J.-L. Li, J. E. Kim, J. W. Johnson, D. Li, T. Revett, H. H. Shih, W. Liu, J. E. Paulsen, et al.
Expression, regulation, and triglyceride hydrolase activity of Adiponutrin family members
J. Lipid Res., November 1, 2005; 46(11): 2477 - 2487.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
S.-H. Shiu, M.-C. Shih, and W.-H. Li
Transcription Factor Families Have Much Higher Expansion Rates in Plants than in Animals
Plant Physiology, September 1, 2005; 139(1): 18 - 26.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
J. Soding, A. Biegert, and A. N. Lupas
The HHpred interactive server for protein homology detection and structure prediction
Nucleic Acids Res., July 1, 2005; 33(suppl_2): W244 - W248.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
F. D. Ivey and C. S. Hoffman
Direct activation of fission yeast adenylate cyclase by the Gpa2 G{alpha} of the glucose signaling pathway
PNAS, April 26, 2005; 102(17): 6108 - 6113.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
J. Hou, S.-R. Jun, C. Zhang, and S.-H. Kim
From The Cover: Global mapping of the protein structure space and application in structure-based inference of protein function
PNAS, March 8, 2005; 102(10): 3651 - 3656.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
B. E. Shakhnovich, E. Deeds, C. Delisi, and E. Shakhnovich
Protein structure and evolutionary history determine sequence space topology
Genome Res., March 1, 2005; 15(3): 385 - 392.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
E. M. Zdobnov, Món. Campillos, E. D. Harrington, D. Torrents, and P. Bork
Protein coding potential of retroviruses and other transposable elements in vertebrate genomes
Nucleic Acids Res., February 16, 2005; 33(3): 946 - 954.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
C. B. Do, M. S.P. Mahabhashyam, M. Brudno, and S. Batzoglou
ProbCons: Probabilistic consistency-based multiple sequence alignment
Genome Res., February 1, 2005; 15(2): 330 - 340.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
T. D. Sharkey, S. Yeh, A. E. Wiberley, T. G. Falbel, D. Gong, and D. E. Fernandez
Evolution of the Isoprene Biosynthetic Pathway in Kudzu
Plant Physiology, February 1, 2005; 137(2): 700 - 712.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
R. Schulein, P. Guye, T. A. Rhomberg, M. C. Schmid, G. Schroder, A. C. Vergunst, I. Carena, and C. Dehio
A bipartite signal mediates the transfer of type IV secretion substrates of Bartonella henselae into human cells
PNAS, January 18, 2005; 102(3): 856 - 861.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
M. Liu, H. Walch, S. Wu, and A. Grigoriev
Significant expansion of exon-bordering protein domains during animal proteome evolution
Nucleic Acids Res., January 7, 2005; 33(1): 95 - 105.
[Abstract] [Full Text] [PDF]


Home page
Plant CellHome page
N. Nagata, R. Tanaka, S. Satoh, and A. Tanaka
Identification of a Vinyl Reductase Gene for Chlorophyll Synthesis in Arabidopsis thaliana and Implications for the Evolution of Prochlorococcus Species
PLANT CELL, January 1, 2005; 17(1): 233 - 240.
[Abstract] [Full Text] [PDF]


Home page
Plant CellHome page
O. Rowland, A. A. Ludwig, C. J. Merrick, F. Baillieul, F. E. Tracy, W. E. Durrant, L. Fritz-Laylin, V. Nekrasov, K. Sjolander, H. Yoshioka, et al.
Functional Analysis of Avr9/Cf-9 Rapidly Elicited Genes Identifies a Protein Kinase, ACIK1, That Is Essential for Full Cf-9-Dependent Disease Resistance in Tomato
PLANT CELL, January 1, 2005; 17(1): 295 - 310.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
Z. Xie, Z.-L. Zhang, X. Zou, J. Huang, P. Ruas, D. Thompson, and Q. J. Shen
Annotations and Functional Analyses of the Rice WRKY Gene Superfamily Reveal Positive and Negative Regulators of Abscisic Acid Signaling in Aleurone Cells
Plant Physiology, January 1, 2005; 137(1): 176 - 189.
[Abstract] [Full Text] [PDF]


Home page
ScienceHome page
P. A. Gray, H. Fu, P. Luo, Q. Zhao, J. Yu, A. Ferrari, T. Tenzen, D.-i. Yuk, E. F. Tsung, Z. Cai, et al.
Mouse Brain Organization Revealed Through Direct Genome-Scale TF Expression Analysis
Science, December 24, 2004; 306(5705): 2255 - 2257.
[Abstract] [Full Text] [PDF]


Home page
Endocr Relat CancerHome page
P. Carter, L. Smith, and M. Ryan
Identification and validation of cell surface antigens for antibody targeting in oncology
Endocr. Relat. Cancer, December 1, 2004; 11(4): 659 - 687.
[Abstract] [Full Text] [PDF]


Home page
MicrobiologyHome page
J. Boekhorst, R. J. Siezen, M.-C. Zwahlen, D. Vilanova, R. D. Pridmore, A. Mercenier, M. Kleerebezem, W. M. de Vos, H. Brussow, and F. Desiere
The complete genomes of Lactobacillus plantarum and Lactobacillus johnsonii reveal extensive differences in chromosome organization and gene content
Microbiology, November 1, 2004; 150(11): 3601 - 3611.
[Abstract] [Full Text] [PDF]


Home page
Plant CellHome page
S.-H. Shiu, W. M. Karlowski, R. Pan, Y.-H. Tzeng, K. F. X. Mayer, and W.-H. Li
Comparative Analysis of the Receptor-Like Kinase Family in Arabidopsis and Rice
PLANT CELL, May 1, 2004; 16(5): 1220 - 1234.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
S.-H. Shiu and W.-H. Li
Origins, Lineage-Specific Expansions, and Multiple Losses of Tyrosine Kinases in Eukaryotes
Mol. Biol. Evol., May 1, 2004; 21(5): 828 - 840.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
R. D. Pridmore, B. Berger, F. Desiere, D. Vilanova, C. Barretto, A.-C. Pittet, M.-C. Zwahlen, M. Rouvet, E. Altermann, R. Barrangou, et al.
The genome sequence of the probiotic intestinal bacterium Lactobacillus johnsonii NCC 533
PNAS, February 24, 2004; 101(8): 2512 - 2517.
[Abstract] [Full Text] [PDF]


Home page
Am. J. Respir. Cell Mol. Bio.Home page
Y. Chen, Y. H. Zhao, T. B. Kalaslavadi, E. Hamati, K. Nehrke, A. D. Le, D. K. Ann, and R. Wu
Genome-Wide Search and Identification of a Novel Gel-Forming Mucin MUC19/Muc19 in Glandular Tissues
Am. J. Respir. Cell Mol. Biol., February 1, 2004; 30(2): 155 - 165.
[Abstract] [Full Text] [PDF]


Home page
Appl. Environ. Microbiol.Home page
S. Y. McLoughlin, C. Jackson, J.-W. Liu, and D. L. Ollis
Growth of Escherichia coli Coexpressing Phosphotriesterase and Glycerophosphodiester Phosphodiesterase, Using Paraoxon as the Sole Phosphorus Source
Appl. Envir. Microbiol., January 1, 2004; 70(1): 404 - 412.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
Y. Mudgil, S.-H. Shiu, S. L. Stone, J. N. Salt, and D. R. Goring
A Large Complement of the Predicted Arabidopsis ARM Repeat Proteins Are Members of the U-Box E3 Ubiquitin Ligase Family
Plant Physiology, January 1, 2004; 134(1): 59 - 66.
[Abstract] [Full Text] [PDF]


Home page
J. Virol.Home page
G. Delhon, M. P. Moraes, Z. Lu, C. L. Afonso, E. F. Flores, R. Weiblen, G. F. Kutish, and D. L. Rock
Genome of Bovine Herpesvirus 5
J. Virol., October 1, 2003; 77(19): 10339 - 10347.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
H. F. Clark, A. L. Gurney, E. Abaya, K. Baker, D. Baldwin, J. Brush, J. Chen, B. Chow, C. Chui, C. Crowley, et al.
The Secreted Protein Discovery Initiative (SPDI), a Large-Scale Effort to Identify Novel Human Secreted and Transmembrane Proteins: A Bioinformatics Assessment
Genome Res., October 1, 2003; 13(10): 2265 - 2270.
[Abstract] [Full Text] [PDF]


Home page
Infect. Immun.Home page
N. Ariel, A. Zvi, K. S. Makarova, T. Chitlaru, E. Elhanany, B. Velan, S. Cohen, A. M. Friedlander, and A. Shafferman
Genome-Based Bioinformatic Selection of Chromosomal Bacillus anthracis Putative Vaccine Candidates Coupled with Proteomic Identification of Surface-Associated Antigens
Infect. Immun., August 1, 2003; 71(8): 4563 - 4579.
[Abstract] [Full Text] [PDF]


Home page
Mol. Cell. ProteomicsHome page
Y. A. Goo, E. C. Yi, N. S. Baliga, W. A. Tao, M. Pan, R. Aebersold, D. R. Goodlett, L. Hood, and W. V. Ng
Proteomic Analysis of an Extreme Halophilic Archaeon, Halobacterium sp. NRC-1
Mol. Cell. Proteomics, August 1, 2003; 2(8): 506 - 524.
[Abstract] [Full Text] [PDF]


Home page
Mol. Biol. CellHome page
T. Sandmann, J. M. Herrmann, J. Dengjel, H. Schwarz, and A. Spang
Suppression of Coatomer Mutants by a New Protein Family with COPI and COPII Binding Motifs in Saccharomyces cerevisiae
Mol. Biol. Cell, August 1, 2003; 14(8): 3097 - 3113.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
J. R. Wortman, B. J. Haas, L. I. Hannick, R. K. Smith Jr., R. Maiti, C. M. Ronning, A. P. Chan, C. Yu, M. Ayele, C. A. Whitelaw, et al.
Annotation of the Arabidopsis Genome
Plant Physiology, June 1, 2003; 132(2): 461 - 468.
[Full Text] [PDF]


Home page
Plant Physiol.Home page
S.-H. Shiu and A. B. Bleecker
Expansion of the Receptor-Like Kinase/Pelle Gene Family and Receptor-Like Proteins in Arabidopsis
Plant Physiology, June 1, 2003; 132(2): 530 - 543.
[Abstract] [Full Text] [PDF]


Home page
Plant CellHome page
K. Meierhoff, S. Felder, T. Nakamura, N. Bechtold, and G. Schuster
HCF152, an Arabidopsis RNA Binding Pentatricopeptide Repeat Protein Involved in the Processing of Chloroplast psbB-psbT-psbH-petB-petD RNAs
PLANT CELL, June 1, 2003; 15(6): 1480 - 1495.
[Abstract] [Full Text]


Home page
JCBHome page
L. H. Engelholm, K. List, S. Netzel-Arnett, E. Cukierman, D. J. Mitola, H. Aaronson, L. Kjoller, J. K. Larsen, K. M. Yamada, D. K. Strickland, et al.
uPARAP/Endo180 is essential for cellular uptake of collagen and promotes fibroblast collagen adhesion
J. Cell Biol., March 31, 2003; 160(7): 1009 - 1015.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
V. S. Gowri, S. B. Pandit, P. S. Karthik, N. Srinivasan, and S. Balaji
Integration of related sequences with protein three-dimensional structural families in an updated version of PALI database
Nucleic Acids Res., January 1, 2003; 31(1): 486 - 488.
[Abstract] [Full Text] [PDF]


Home page
Eukaryot CellHome page
D. R. Williams and J. R. McIntosh
mcl1+, the Schizosaccharomyces pombe Homologue of CTF4, Is Important for Chromosome Replication, Cohesion, and Segregation
Eukaryot. Cell, October 1, 2002; 1(5): 758 - 773.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
M. Mondragon-Palomino, B. C. Meyers, R. W. Michelmore, and B. S. Gaut
Patterns of Positive Selection in the Complete NBS-LRR Gene Family of Arabidopsis thaliana
Genome Res., September 1, 2002; 12(9): 1305 - 1315.
[Abstract] [Full Text] [PDF]


Home page
Mol. Cell. ProteomicsHome page
M. Gronborg, T. Z. Kristiansen, A. Stensballe, J. S. Andersen, O. Ohara, M. Mann, O. N. Jensen, and A. Pandey
A Mass Spectrometry-based Proteomic Approach for Identification of Serine/Threonine-phosphorylated Proteins by Enrichment with Phospho-specific Antibodies: Identification of a Novel Protein, Frigg, as a Protein Kinase A Substrate
Mol. Cell. Proteomics, July 1, 2002; 1(7): 517 - 527.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
P. Bradley, P. S. Kim, and B. Berger
From the Cover: TRILOGY: Discovery of sequence-structure patterns across diverse proteins
PNAS, June 25, 2002; 99(13): 8500 - 8505.
[Abstract] [Full Text] [PDF]


Home page
Eukaryot CellHome page
L. Sperling, P. Dessen, M. Zagulski, R. E. Pearlman, A. Migdalski, R. Gromadka, M. Froissard, A.-M. Keller, and J. Cohen
Random Sequencing of Paramecium Somatic DNA
Eukaryot. Cell, June 1, 2002; 1(3): 341 - 352.
[Abstract] [Full Text] [PDF]


Home page
Sci SignalHome page
S.-H. Shiu and A. B. Bleecker
Plant Receptor-Like Kinase Gene Family: Diversity, Function, and Signaling
Sci. Signal., December 18, 2001; 2001(113): re22 - re22.
[Abstract] [Full Text] [PDF]


Home page
Hum Mol GenetHome page
J. Gao, L. Estrada, S. Cho, R. E. Ellis, and J. L. Gorski
The Caenorhabditis elegans homolog of FGD1, the human Cdc42 GEF gene responsible for faciogenital dysplasia, is critical for excretory cell morphogenesis
Hum. Mol. Genet., December 1, 2001; 10(26): 3049 - 3062.
[Abstract] [Full Text] [PDF]


Home page
J. Virol.Home page
C. L. Afonso, E. R. Tulman, Z. Lu, C. A. Balinsky, B. A. Moser, J. J. Becnel, D. L. Rock, and G. F. Kutish
Genome Sequence of a Baculovirus Pathogenic for Culex nigripalpus
J. Virol., November 15, 2001; 75(22): 11157 - 11165.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
P. Vaccaro, B. Brannetti, L. Montecchi-Palazzi, S. Philipp, M. H. Citterich, G. Cesareni, and L. Dente
Distinct Binding Specificity of the Multiple PDZ Domains of INADL, a Human Protein with Homology to INAD from Drosophila melanogaster
J. Biol. Chem., November 2, 2001; 276(45): 42122 - 42130.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
Y. Pouliot, J. Gao, Q. J. Su, G. G. Liu, and X. B. Ling
DIAN: A Novel Algorithm for Genome Ontological Classification
Genome Res., October 1, 2001; 11(10): 1766 - 1779.
[Abstract] [Full Text] [PDF]


Home page
MicrobiologyHome page
P. J. Fernandes, J. A. C. Powell, and J. A. C. Archer
Construction of Rhodococcus random mutagenesis libraries using Tn5 transposition complexes
Microbiology, September 1, 2001; 147(9): 2529 - 2536.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
S.-H. Shiu and A. B. Bleecker
Receptor-like kinases from Arabidopsis form a monophyletic gene family related to animal receptor kinases
PNAS, August 23, 2001; (2001) 181141598.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
J. Murvai, K. Vlahovicek, C. Szepesvari, and S. Pongor
Prediction of Protein Functional Domains from Sequences Using Artificial Neural Networks
Genome Res., August 1, 2001; 11(8): 1410 - 1417.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
I. Chen and E. C. Gotschlich
ComE, a Competence Protein from Neisseria gonorrhoeae with DNA-Binding Activity
J. Bacteriol., May 15, 2001; 183(10): 3160 - 3168.
[Abstract] [Full Text]


Home page
Nucleic Acids ResHome page
I. Chambaud, R. Heilig, S. Ferris, V. Barbe, D. Samson, F. Galisson, I. Moszer, K. Dybvig, H. Wroblewski, A. Viari, et al.
The complete genome sequence of the murine respiratory pathogen Mycoplasma pulmonis
Nucleic Acids Res., May 15, 2001; 29(10): 2145 - 2153.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
H. Yang, I. T. Phan, S. Fitz-Gibbon, M. K. K. Shivji, R. D. Wood, W. M. Clendenin, E. C. Hyman, and J. H. Miller
A thermostable endonuclease III homolog from the archaeon Pyrobaculum aerophilum
Nucleic Acids Res., February 1, 2001; 29(3): 604 - 613.
[Abstract] [Full Text] [PDF]


Home page
J. Virol.Home page
C. L. Afonso, E. R. Tulman, Z. Lu, L. Zsak, D. L. Rock, and G. F. Kutish
The Genome of Turkey Herpesvirus
J. Virol., January 15, 2001; 75(2): 971 - 978.
[Abstract] [Full Text]


Home page
Infect. Immun.Home page
R. J. M. Bongaerts, H.-P. Heinz, U. Hadding, and G. Zysk
Antigenicity, Expression, and Molecular Characterization of Surface-Located Pullulanase of Streptococcus pneumoniae
Infect. Immun., December 1, 2000; 68(12): 7141 - 7143.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
C. G. Friedrich, A. Quentmeier, F. Bardischewsky, D. Rother, R. Kraft, S. Kostka, and H. Prinz
Novel Genes Coding for Lithotrophic Sulfur Oxidation of Paracoccus pantotrophus GB17
J. Bacteriol., September 1, 2000; 182(17): 4677 - 4687.
[Abstract] [Full Text]


Home page
Mol. Cell. Biol.Home page
A. M. DeSandro, U. M. Nagarajan, and J. M. Boss
Associations and Interactions between Bare Lymphocyte Syndrome Factors
Mol. Cell. Biol., September 1, 2000; 20(17): 6587 - 6599.
[Abstract] [Full Text]


Home page
Genome ResHome page
M. G. Reese, G. Hartzell, N. L. Harris, U. Ohler, J. F. Abril, and S. E. Lewis
Genome Annotation Assessment in Drosophila melanogaster
Genome Res., April 1, 2000; 10(4): 483 - 501.
[Abstract] [Full Text]


Home page
J. Biol. Chem.Home page
H. Yang, M. M. Slupska, Y.-F. Wei, J. H. Tai, W. M. Luther, Y.-R. Xia, D. M. Shih, J.-H. Chiang, C. Baikalov, S. Fitz-Gibbon, et al.
Cloning and Characterization of a New Member of the Nudix Hydrolases from Human and Mouse
J. Biol. Chem., March 17, 2000; 275(12): 8844 - 8853.
[Abstract] [Full Text] [PDF]


Home page
Infect. Immun.Home page
G. Subramanian, E. V. Koonin, and L. Aravind
Comparative Genome Analysis of the Pathogenic Spirochetes Borrelia burgdorferi and Treponema pallidum
Infect. Immun., March 1, 2000; 68(3): 1633 - 1648.
[Abstract] [Full Text] [PDF]


Home page
ScienceHome page
H. Hutter, B. E. Vogel, J. D. Plenefisch, C. R. Norris, R. B. Proenca, J. Spieth, C. Guo, S. Mastwal, X. Zhu, I. S. A. J. Scheel, et al.
Conservation and Novelty in the Evolution of Cell Adhesion and Extracellular Matrix Genes
Science, February 11, 2000; 287(5455): 989 - 994.
[Abstract] [Full Text]


Home page
Protein Eng Des SelHome page
A.C. Camproux, P. Tuffery, J.P. Chevrolat, J.F. Boisvieux, and S. Hazout
Hidden Markov model approach for identifying the modular framework of the protein backbone
Protein Eng. Des. Sel., December 1, 1999; 12(12): 1063 - 1073.
[Abstract] [Full Text] [PDF]


Home page
J. Virol.Home page
B. D. Lindenbach and C. M. Rice
Genetic Interaction of Flavivirus Nonstructural Proteins NS1 and NS4A as a Determinant of Replicase Function
J. Virol., June 1, 1999; 73(6): 4611 - 4621.
[Abstract] [Full Text]


Home page
GeneticsHome page
B. M. Cali, S. L. Kuchma, J. Latham, and P. Anderson
smg-7 Is Required for mRNA Surveillance in Caenorhabditis elegans
Genetics, February 1, 1999; 151(2): 605 - 616.
[Abstract] [Full Text]


Home page
ScienceHome page
The C. elegans Sequencing Consortium
Genome Sequence of the Nematode C. elegans: A Platform for Investigating Biology
Science, December 11, 1998; 282(5396): 2012 - 2018.
[Abstract] [Full Text]


This Article
Right arrow Abstract Freely available
Right arrow Print PDF (156K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (280)
Right arrowRequest Permissions
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Sonnhammer, E. L.
Right arrow Articles by Durbin, R.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Sonnhammer, E. L.
Right arrow Articles by Durbin, R.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?