| Nucleic Acids Research | Pages |
Pfam: multiple sequence alignments and HMM-profiles of protein domains
Introduction
Description OF THE DATA
Pfam-B
Sequence database coverage
Searching Pfam
World Wide Web Servers, FTP Access And Format
Acknowledgements
References
Pfam: multiple sequence alignments and HMM-profiles of protein domains
ABSTRACT
INTRODUCTION
A relatively small number of structural and functional domains are used in a large number of different proteins. Particularly for protein analysis and annotation in large-scale sequencing projects, there is a growing need for easily interpretable and sensitive detection of common protein domains. A protein containing one or more common domains can produce a morass of hundreds or thousands of BLAST hits when searching single sequence databases (e.g. GenBank, Swissprot, PIR). Although searches can be augmented by tools that condense and summarise results (1), satisfactory annotation of such proteins often becomes a time-consuming and error-prone process. Instead, a search of an organised database of protein domain families can produce more concise results which simplify annotation, domain parsing and functional prediction for a query sequence (2-5). Protein family databases are typically based on multiple sequence alignments of known family members. Conserved features can be recognised in the alignment and given higher weight in searches, which for distant similarities can often render the comparison more sensitive than pairwise alignment approaches.
We present here Pfam (6) release 2.0. Pfam was developed in order to use HMM-profile analysis to complement BLAST analysis in the Caenorhabditis elegans genome project. The main distinction between Pfam and most other protein family databases is that for all of Pfam, both the family definition and the search method span entire domains, including not only conserved motifs but also less-conserved regions, insertions and deletions. HMM-profile methods allow variable conservation and insertions/deletions to be dealt with in a fairly robust way (7,8). Modelling of complete domains should facilitate more biologically meaningful sequence annotation, and, in some cases, more sensitive detection.
DESCRIPTION OF THE DATA
For each protein domain family in Pfam, there are three important files. The seed alignment is a manually verified multiple alignment of a representative set of sequences (Fig. 1). An HMM-profile is built from the seed alignment for database searching and alignment purposes. A full alignment is generated automatically from the seed HMM-profile by searching Swissprot for all detectable members and aligning them to the HMM-profile. The distinction between seed and full alignments facilitates updating the database; the seed alignments are stable resources, whereas full alignments and HMM-profiles can be generated automatically for any new Swissprot (or other sequence database) release.
Figure
Each family has a name, a permanent accession number and a record of the methods used to identify the family members and create the alignments. There is also either a brief description of the usual function and structure of the domain, or (more often) links to other on-line documentation resources such as Prosite and Prints. Both the seed and the full alignments are subjected to a small array of `quality control' procedures, to verify that the alignments are sensible, that the HMM-detected sequences in the full alignment include all presumed members of the family in Swissprot and no other sequences, and that the family does not overlap with other Pfam families. The process of generating the Pfam family is iterated, if necessary, until all quality requirements are met. Most Pfam families are based on, and cross-referenced to, corresponding Prosite or Prints entries. In many cases, however, the definition of which sequences belong to a family differs between the databases. This is a pragmatic consequence of the different search methods used. Prosite and Prints detection relies primarily on short conserved patterns corresponding to superfamily motifs. A Prosite pattern or Prints fingerprint may recognise a highly conserved motif shared amongst an otherwise highly diverged superfamily that Pfam splits into several families; conversely, Pfam may recognise a superfamily that Prosite and Prints classify into several distinct families with distinct motif signatures. For some protein domain families, there may be no motif sufficiently conserved to make a discriminative pattern or fingerprint. (Prosite is increasingly incorporating profiles for these families; these Prosite profiles are very similar to Pfam models.) Only the largest (>15 members) Prosite families were systematically used to construct Pfam entries. For smaller families, constructing an HMM-profile is of less value since the sensitivity is unlikely to improve relative to single-sequence searching, and because a small sample is often non-representative. Of the 71 Pfam families with no corresponding Prosite or Prints entry, 55 were `discovered' as large clusters in Pfam-B (see below). 24 Pfam families contain links to other World Wide Web (WWW) protein family documentation resources, some of which were gleaned from the ProWeb server (9). Pfam 2.0 contains 527 families, comprising 39 113 sequence segments and 6.8 million residues in the full alignments. All sequences were taken from Swissprot 34 (10). The alignments are on average 275 residues wide, including gaps. There are on average ~75 members per family in full alignments, and ~22 in seed alignments. For comprehensiveness, all Swissprot sequences not in Pfam are clustered automatically by the program Domainer (2), which also constructs multiple alignments automatically and is the basis for the ProDom protein family database. The quality of these alignments tends to be low, but domain-based automated clustering is a convenient method of identifying large obvious families that need to be targeted for Pfam model construction. Although we do not stably maintain, annotate or produce HMM-profiles of these clusters, we make them available as Pfam-B. Pfam-B 2.0 contains 13 289 clusters, 62 611 subsequences, and 8.2 million residues. On average, alignments are 146 residues wide (including gaps) and contain five members.
Figure
As shown in Figure 2, 48% of the sequences and 32% of the residues in Swissprot 34 are included in annotated Pfam alignments. If unannotated Pfam-B clusters are also taken into account, 81% of sequences and 71% of residues in Swissprot 34 are included in Pfam. In searches of a large and presumably unbiased set of predicted protein sequences from the C.elegans genome, 25% of sequences and 13% of residues show significant hits to Pfam HMM-profiles. The numbers are slightly lower for prokaryotic genomes.
Figure

Pfam-B

Sequence database coverage

SEARCHING Pfam
The US and UK Pfam WWW servers provide users the ability to search query protein sequences against one, all, or a few Pfam HMMs. Results are returned in tabular format, and both GIF- and Java-based graphical representations are available optionally. An example of the results from such a search is shown in Figure 3. Here, the C.elegans Kin-11 gene product (E01H11.1) is shown to possess a duplicated phorbol esters/diacylglycerol binding domain (DAG/PE-bind), a C2 domain, a protein kinase catalytic domain (pkinase) and a duplicated domain frequently associated C-terminally to protein kinase domains (pkinase_C).
Users can also use Pfam HMM-profiles to search protein sequences locally using the freely available HMMER software package at http://genome.wustl.edu/eddy/hmmer.html#hmmer For comparing genomic and EST data to Pfam HMM-profiles, the programs GeneWise and ESTWise (11) are available at http://www.sanger.ac.uk/Software/Wise2/
WORLD WIDE WEB SERVERS, FTP ACCESS AND FORMAT
The Pfam home pages are http://www.sanger.ac.uk/Pfam/ at the Sanger Centre in the UK and http://genome.wustl.edu/Pfam/ at Washington University in the USA. The two servers are separately maintained and differ slightly in their services and capabilities, but are based on the same underlying Pfam database. Both servers support HMM searching, browsing of the family alignments and documentation and lookup of the domain organisation of proteins in Swissprot.
The entire database, including accessory data files such as Pfam schematics for Swissprot proteins, is also available as flat file format ASCII files by anonymous FTP at ftp.sanger.ac.uk and genome.wustl.edu in /pub/databases/Pfam/
The format of the Pfam alignment flat files is based on the EMBL/Swissprot two-character field labels. The following Pfam-specific labels are used: AL, alignment method of seed members; AM, alignment method of full alignment; AU, author responsible for the alignments; GA, gathering method/search program and cutoffs used to build full alignment; SE, source suggesting the seed members belong to the same family; SQ, number of sequences (and last line before the alignment starts). The alignment is in a simple format (Fig. 1) which consists of one line per subsequence containing the Swissprot sequence ID, start and end of the segment, and the aligned subsequence itself (no length limit). In the Pfam flat file, the corresponding Swissprot accession number is added to the right of each alignment line. Users of the Pfam database or WWW servers should cite this article as the appropriate reference.
ACKNOWLEDGEMENTS
We thank Robert Finn for preparing most of the new families for Pfam 2.0, and Jose Aguilar for writing and maintaining the Washington University Pfam server. Pfam development in SRE's group is supported by grant R01-HG01363 from the NIH National Human Genome Research Institute. Pfam development at the Sanger Centre is supported by the Wellcome Trust.
REFERENCES
This page is run by Oxford University Press, Great Clarendon Street, Oxford OX2 6DP, as part of the OUP Journals Comments and feedback: www-admin{at}oup.co.uk
Last modification: 17 Dec 1997
Copyright© Oxford University Press, 1998.
This article has been cited by other articles:
![]() |
Y. Y. Tseng, Z. J. Chen, and W.-H. Li fPOP: footprinting functional pockets of proteins by comparative spatial patterns Nucleic Acids Res., October 30, 2009; (2009) gkp900v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Forslund and E. L. Sonnhammer Benchmarking homology detection procedures with low complexity filters Bioinformatics, October 1, 2009; 25(19): 2500 - 2505. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Fu, J. Cai, H. Clevers, E. Fast, S. Gray, R. Greenberg, M. K. Jain, Q. Ma, M. Qiu, D. H. Rowitch, et al. A Genome-Wide Screen for Spatially Restricted Expression Patterns Identifies Transcription Factors That Regulate Glial Development J. Neurosci., September 9, 2009; 29(36): 11399 - 11408. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. A. DeGrasse, K. N. DuBois, D. Devos, T. N. Siegel, A. Sali, M. C. Field, M. P. Rout, and B. T. Chait Evidence for a Shared Nuclear Pore Complex Architecture That Is Conserved from the Last Common Eukaryotic Ancestor Mol. Cell. Proteomics, September 1, 2009; 8(9): 2119 - 2130. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Levitt Nature of the protein universe PNAS, July 7, 2009; 106(27): 11079 - 11084. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Das, T. Kanamoto, X. Ge, P. Xu, T. Unoki, C. L. Munro, and T. Kitten Contribution of Lipoproteins and Lipoprotein Processing to Endocarditis Virulence in Streptococcus sanguinis J. Bacteriol., July 1, 2009; 191(13): 4166 - 4179. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. M. Moses and R. Durbin Inferring Selection on Amino Acid Preference in Protein Domains Mol. Biol. Evol., March 1, 2009; 26(3): 527 - 536. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. M. Berman, J. D. Westbrook, M. J. Gabanyi, W. Tao, R. Shah, A. Kouranov, T. Schwede, K. Arnold, F. Kiefer, L. Bordoli, et al. The protein structure initiative structural genomics knowledgebase Nucleic Acids Res., January 1, 2009; 37(suppl_1): D365 - D368. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Zou, Y. Zhu, E. L. Pohlmann, J. Li, Y. Zhang, and G. P. Roberts Identification and functional characterization of NifA variants that are independent of GlnB activation in the photosynthetic bacterium Rhodospirillum rubrum Microbiology, September 1, 2008; 154(9): 2689 - 2699. [Abstract] [Full Text] [PDF] |
||||
![]() |
L.-R. Zeng, C. H. Park, R.C. Venu, J. Gough, and G.-L. Wang Classification, Expression Pattern, and E3 Ligase Activity Assay of Rice U-Box-Containing Proteins Mol Plant, September 1, 2008; 1(5): 800 - 815. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Forslund and E. L. L. Sonnhammer Predicting protein function from domain content Bioinformatics, August 1, 2008; 24(15): 1681 - 1687. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. M. Overton, C. A. J. van Niekerk, L. G. Carter, A. Dawson, D. M. A. Martin, S. Cameron, S. A. McMahon, M. F. White, W. N. Hunter, J. H. Naismith, et al. TarO: a target optimisation system for structural biology Nucleic Acids Res., July 1, 2008; 36(suppl_2): W190 - W196. [Abstract] [Full Text] [PDF] |
||||
![]() |
C.-M. Valcu, C. Lalanne, G. Muller-Starck, C. Plomion, and K. Schlink Protein Polymorphism between 2 Picea abies Populations Revealed by 2-Dimensional Gel Electrophoresis and Tandem Mass Spectrometry J. Hered., March 15, 2008; (2008) esn007v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Biegert and J. Soding De novo identification of highly diverged protein repeats by probabilistic consistency Bioinformatics, March 15, 2008; 24(6): 807 - 814. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Kosaka, S. Kato, T. Shimoyama, S. Ishii, T. Abe, and K. Watanabe The genome of Pelotomaculum thermopropionicum reveals niche-associated evolution in anaerobic microbiota Genome Res., March 1, 2008; 18(3): 442 - 448. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. E. Primo, S. Klinke, M. P. Sica, F. A. Goldbaum, J. Jakoncic, E. Poskus, and M. R. Ermacora Structure of the Mature Ectodomain of the Human Receptor-type Protein-tyrosine Phosphatase IA-2 J. Biol. Chem., February 22, 2008; 283(8): 4674 - 4681. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Saunders, S. Lyon, M. Day, B. Riley, E. Chenette, and S. Subramaniam The Molecule Pages database Nucleic Acids Res., January 11, 2008; 36(suppl_1): D700 - D706. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Hughes, M. Saito, P. H. Schlesinger, and D. M. Ornitz Otopetrin 1 activation by purinergic nucleotides regulates intracellular calcium PNAS, July 17, 2007; 104(29): 12023 - 12028. [Abstract] [Full Text] [PDF] |
||||
![]() |
N.-H. Cho, H.-R. Kim, J.-H. Lee, S.-Y. Kim, J. Kim, S. Cha, S.-Y. Kim, A. C. Darby, H.-H. Fuxelius, J. Yin, et al. The Orientia tsutsugamushi genome reveals massive proliferation of conjugative type IV secretion system and host cell interaction genes PNAS, May 8, 2007; 104(19): 7981 - 7986. [Abstract] [Full Text] [PDF] |
||||
![]() |
N.-H. Hsiao, J. Soding, D. Linke, C. Lange, C. Hertweck, W. Wohlleben, and E. Takano ScbA from Streptomyces coelicolor A3(2) has homology to fatty acid synthases and is able to synthesize {gamma}-butyrolactones Microbiology, May 1, 2007; 153(5): 1394 - 1404. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. M. Casey, J. L. Meade, and E. W. Hewitt Organelle Proteomics: Identification of the Exocytic Machinery Associated with the Natural Killer Cell Secretory Lysosome Mol. Cell. Proteomics, May 1, 2007; 6(5): 767 - 780. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. M. Phuong, C. B. Do, R. C. Edgar, and S. Batzoglou Multiple alignment of protein sequences with repeats and rearrangements Nucleic Acids Res., November 6, 2006; 34(20): 5932 - 5942. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Xu, Y. Zhang, L. Kang, M. J. Roossinck, and K. S. Mysore Computational Estimation and Experimental Verification of Off-Target Silencing during Posttranscriptional Gene Silencing in Plants Plant Physiology, October 1, 2006; 142(2): 429 - 440. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Delhon, E. R. Tulman, C. L. Afonso, Z. Lu, J. J. Becnel, B. A. Moser, G. F. Kutish, and D. L. Rock Genome of invertebrate iridescent virus type 3 (mosquito iridescent virus). J. Virol., September 1, 2006; 80(17): 8439 - 8449. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Soding, M. Remmert, and A. Biegert HHrep: de novo protein repeat detection and the origin of TIM barrels. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W137 - W142. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Bhadra, S. Sandhya, K. R. Abhinandan, S. Chakrabarti, R. Sowdhamini, and N. Srinivasan Cascade PSI-BLAST web server: a remote homology search tool for relating protein domains. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W143 - W146. [Abstract] [Full Text] [PDF] |
||||
![]() |
M.-J. Han and S. Y. Lee The Escherichia coli Proteome: Past, Present, and Future Prospects Microbiol. Mol. Biol. Rev., June 1, 2006; 70(2): 362 - 439. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. L. Afonso, E. R. Tulman, G. Delhon, Z. Lu, G. J. Viljoen, D. B. Wallace, G. F. Kutish, and D. L. Rock Genome of crocodilepox virus. J. Virol., May 1, 2006; 80(10): 4978 - 4991. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. M. Wills, B. Moore, A. Hammer, R. F. Gesteland, and J. F. Atkins A Functional -1 Ribosomal Frameshift Signal in the Human Paraneoplastic Ma3 Gene J. Biol. Chem., March 17, 2006; 281(11): 7082 - 7088. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Zhao, X. Zhang, C. Liang, J. Wu, Q. Bao, and S. Qin Genome-wide analysis of restriction-modification system in unicellular and filamentous cyanobacteria Physiol Genomics, February 23, 2006; 24(3): 181 - 190. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. H. Ardell and S. G. E. Andersson TFAM detects co-evolution of tRNA identity rules with lateral transfer of histidyl-tRNA synthetase Nucleic Acids Res., February 9, 2006; 34(3): 893 - 904. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. H. Y. Jiang, B. M. Tyler, S. C. Whisson, A. R. Hardham, and F. Govers Ancient Origin of Elicitin Gene Clusters in Phytophthora Genomes Mol. Biol. Evol., February 1, 2006; 23(2): 338 - 351. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. D. Finn, J. Mistry, B. Schuster-Bockler, S. Griffiths-Jones, V. Hollich, T. Lassmann, S. Moxon, M. Marshall, A. Khanna, R. Durbin, et al. Pfam: clans, web tools and services Nucleic Acids Res., January 1, 2006; 34(suppl_1): D247 - D251. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Perry, N. Kleckner, and G. V. Borner Bioinformatic analyses implicate the collaborating meiotic crossover/chiasma proteins Zip2, Zip3, and Spo22/Zip4 in ubiquitin labeling PNAS, December 6, 2005; 102(49): 17594 - 17599. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. C. Lake, Y. Sun, J.-L. Li, J. E. Kim, J. W. Johnson, D. Li, T. Revett, H. H. Shih, W. Liu, J. E. Paulsen, et al. Expression, regulation, and triglyceride hydrolase activity of Adiponutrin family members J. Lipid Res., November 1, 2005; 46(11): 2477 - 2487. [Abstract] [Full Text] [PDF] |
||||
![]() |
S.-H. Shiu, M.-C. Shih, and W.-H. Li Transcription Factor Families Have Much Higher Expansion Rates in Plants than in Animals Plant Physiology, September 1, 2005; 139(1): 18 - 26. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Soding, A. Biegert, and A. N. Lupas The HHpred interactive server for protein homology detection and structure prediction Nucleic Acids Res., July 1, 2005; 33(suppl_2): W244 - W248. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. D. Ivey and C. S. Hoffman Direct activation of fission yeast adenylate cyclase by the Gpa2 G{alpha} of the glucose signaling pathway PNAS, April 26, 2005; 102(17): 6108 - 6113. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Hou, S.-R. Jun, C. Zhang, and S.-H. Kim From The Cover: Global mapping of the protein structure space and application in structure-based inference of protein function PNAS, March 8, 2005; 102(10): 3651 - 3656. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. E. Shakhnovich, E. Deeds, C. Delisi, and E. Shakhnovich Protein structure and evolutionary history determine sequence space topology Genome Res., March 1, 2005; 15(3): 385 - 392. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. M. Zdobnov, Món. Campillos, E. D. Harrington, D. Torrents, and P. Bork Protein coding potential of retroviruses and other transposable elements in vertebrate genomes Nucleic Acids Res., February 16, 2005; 33(3): 946 - 954. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. B. Do, M. S.P. Mahabhashyam, M. Brudno, and S. Batzoglou ProbCons: Probabilistic consistency-based multiple sequence alignment Genome Res., February 1, 2005; 15(2): 330 - 340. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. D. Sharkey, S. Yeh, A. E. Wiberley, T. G. Falbel, D. Gong, and D. E. Fernandez Evolution of the Isoprene Biosynthetic Pathway in Kudzu Plant Physiology, February 1, 2005; 137(2): 700 - 712. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Schulein, P. Guye, T. A. Rhomberg, M. C. Schmid, G. Schroder, A. C. Vergunst, I. Carena, and C. Dehio A bipartite signal mediates the transfer of type IV secretion substrates of Bartonella henselae into human cells PNAS, January 18, 2005; 102(3): 856 - 861. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Liu, H. Walch, S. Wu, and A. Grigoriev Significant expansion of exon-bordering protein domains during animal proteome evolution Nucleic Acids Res., January 7, 2005; 33(1): 95 - 105. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Nagata, R. Tanaka, S. Satoh, and A. Tanaka Identification of a Vinyl Reductase Gene for Chlorophyll Synthesis in Arabidopsis thaliana and Implications for the Evolution of Prochlorococcus Species PLANT CELL, January 1, 2005; 17(1): 233 - 240. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. Rowland, A. A. Ludwig, C. J. Merrick, F. Baillieul, F. E. Tracy, W. E. Durrant, L. Fritz-Laylin, V. Nekrasov, K. Sjolander, H. Yoshioka, et al. Functional Analysis of Avr9/Cf-9 Rapidly Elicited Genes Identifies a Protein Kinase, ACIK1, That Is Essential for Full Cf-9-Dependent Disease Resistance in Tomato PLANT CELL, January 1, 2005; 17(1): 295 - 310. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Xie, Z.-L. Zhang, X. Zou, J. Huang, P. Ruas, D. Thompson, and Q. J. Shen Annotations and Functional Analyses of the Rice WRKY Gene Superfamily Reveal Positive and Negative Regulators of Abscisic Acid Signaling in Aleurone Cells Plant Physiology, January 1, 2005; 137(1): 176 - 189. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. A. Gray, H. Fu, P. Luo, Q. Zhao, J. Yu, A. Ferrari, T. Tenzen, D.-i. Yuk, E. F. Tsung, Z. Cai, et al. Mouse Brain Organization Revealed Through Direct Genome-Scale TF Expression Analysis Science, December 24, 2004; 306(5705): 2255 - 2257. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Carter, L. Smith, and M. Ryan Identification and validation of cell surface antigens for antibody targeting in oncology Endocr. Relat. Cancer, December 1, 2004; 11(4): 659 - 687. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Boekhorst, R. J. Siezen, M.-C. Zwahlen, D. Vilanova, R. D. Pridmore, A. Mercenier, M. Kleerebezem, W. M. de Vos, H. Brussow, and F. Desiere The complete genomes of Lactobacillus plantarum and Lactobacillus johnsonii reveal extensive differences in chromosome organization and gene content Microbiology, November 1, 2004; 150(11): 3601 - 3611. [Abstract] [Full Text] [PDF] |
||||
![]() |
S.-H. Shiu, W. M. Karlowski, R. Pan, Y.-H. Tzeng, K. F. X. Mayer, and W.-H. Li Comparative Analysis of the Receptor-Like Kinase Family in Arabidopsis and Rice PLANT CELL, May 1, 2004; 16(5): 1220 - 1234. [Abstract] [Full Text] [PDF] |
||||
![]() |
S.-H. Shiu and W.-H. Li Origins, Lineage-Specific Expansions, and Multiple Losses of Tyrosine Kinases in Eukaryotes Mol. Biol. Evol., May 1, 2004; 21(5): 828 - 840. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. D. Pridmore, B. Berger, F. Desiere, D. Vilanova, C. Barretto, A.-C. Pittet, M.-C. Zwahlen, M. Rouvet, E. Altermann, R. Barrangou, et al. The genome sequence of the probiotic intestinal bacterium Lactobacillus johnsonii NCC 533 PNAS, February 24, 2004; 101(8): 2512 - 2517. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Chen, Y. H. Zhao, T. B. Kalaslavadi, E. Hamati, K. Nehrke, A. D. Le, D. K. Ann, and R. Wu Genome-Wide Search and Identification of a Novel Gel-Forming Mucin MUC19/Muc19 in Glandular Tissues Am. J. Respir. Cell Mol. Biol., February 1, 2004; 30(2): 155 - 165. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Y. McLoughlin, C. Jackson, J.-W. Liu, and D. L. Ollis Growth of Escherichia coli Coexpressing Phosphotriesterase and Glycerophosphodiester Phosphodiesterase, Using Paraoxon as the Sole Phosphorus Source Appl. Envir. Microbiol., January 1, 2004; 70(1): 404 - 412. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Mudgil, S.-H. Shiu, S. L. Stone, J. N. Salt, and D. R. Goring A Large Complement of the Predicted Arabidopsis ARM Repeat Proteins Are Members of the U-Box E3 Ubiquitin Ligase Family Plant Physiology, January 1, 2004; 134(1): 59 - 66. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Delhon, M. P. Moraes, Z. Lu, C. L. Afonso, E. F. Flores, R. Weiblen, G. F. Kutish, and D. L. Rock Genome of Bovine Herpesvirus 5 J. Virol., October 1, 2003; 77(19): 10339 - 10347. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. F. Clark, A. L. Gurney, E. Abaya, K. Baker, D. Baldwin, J. Brush, J. Chen, B. Chow, C. Chui, C. Crowley, et al. The Secreted Protein Discovery Initiative (SPDI), a Large-Scale Effort to Identify Novel Human Secreted and Transmembrane Proteins: A Bioinformatics Assessment Genome Res., October 1, 2003; 13(10): 2265 - 2270. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Ariel, A. Zvi, K. S. Makarova, T. Chitlaru, E. Elhanany, B. Velan, S. Cohen, A. M. Friedlander, and A. Shafferman Genome-Based Bioinformatic Selection of Chromosomal Bacillus anthracis Putative Vaccine Candidates Coupled with Proteomic Identification of Surface-Associated Antigens Infect. Immun., August 1, 2003; 71(8): 4563 - 4579. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. A. Goo, E. C. Yi, N. S. Baliga, W. A. Tao, M. Pan, R. Aebersold, D. R. Goodlett, L. Hood, and W. V. Ng Proteomic Analysis of an Extreme Halophilic Archaeon, Halobacterium sp. NRC-1 Mol. Cell. Proteomics, August 1, 2003; 2(8): 506 - 524. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Sandmann, J. M. Herrmann, J. Dengjel, H. Schwarz, and A. Spang Suppression of Coatomer Mutants by a New Protein Family with COPI and COPII Binding Motifs in Saccharomyces cerevisiae Mol. Biol. Cell, August 1, 2003; 14(8): 3097 - 3113. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. R. Wortman, B. J. Haas, L. I. Hannick, R. K. Smith Jr., R. Maiti, C. M. Ronning, A. P. Chan, C. Yu, M. Ayele, C. A. Whitelaw, et al. Annotation of the Arabidopsis Genome Plant Physiology, June 1, 2003; 132(2): 461 - 468. [Full Text] [PDF] |
||||
![]() |
S.-H. Shiu and A. B. Bleecker Expansion of the Receptor-Like Kinase/Pelle Gene Family and Receptor-Like Proteins in Arabidopsis Plant Physiology, June 1, 2003; 132(2): 530 - 543. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Meierhoff, S. Felder, T. Nakamura, N. Bechtold, and G. Schuster HCF152, an Arabidopsis RNA Binding Pentatricopeptide Repeat Protein Involved in the Processing of Chloroplast psbB-psbT-psbH-petB-petD RNAs PLANT CELL, June 1, 2003; 15(6): 1480 - 1495. [Abstract] [Full Text] |
||||
![]() |
L. H. Engelholm, K. List, S. Netzel-Arnett, E. Cukierman, D. J. Mitola, H. Aaronson, L. Kjoller, J. K. Larsen, K. M. Yamada, D. K. Strickland, et al. uPARAP/Endo180 is essential for cellular uptake of collagen and promotes fibroblast collagen adhesion J. Cell Biol., March 31, 2003; 160(7): 1009 - 1015. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. S. Gowri, S. B. Pandit, P. S. Karthik, N. Srinivasan, and S. Balaji Integration of related sequences with protein three-dimensional structural families in an updated version of PALI database Nucleic Acids Res., January 1, 2003; 31(1): 486 - 488. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. R. Williams and J. R. McIntosh mcl1+, the Schizosaccharomyces pombe Homologue of CTF4, Is Important for Chromosome Replication, Cohesion, and Segregation Eukaryot. Cell, October 1, 2002; 1(5): 758 - 773. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Mondragon-Palomino, B. C. Meyers, R. W. Michelmore, and B. S. Gaut Patterns of Positive Selection in the Complete NBS-LRR Gene Family of Arabidopsis thaliana Genome Res., September 1, 2002; 12(9): 1305 - 1315. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Gronborg, T. Z. Kristiansen, A. Stensballe, J. S. Andersen, O. Ohara, M. Mann, O. N. Jensen, and A. Pandey A Mass Spectrometry-based Proteomic Approach for Identification of Serine/Threonine-phosphorylated Proteins by Enrichment with Phospho-specific Antibodies: Identification of a Novel Protein, Frigg, as a Protein Kinase A Substrate Mol. Cell. Proteomics, July 1, 2002; 1(7): 517 - 527. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Bradley, P. S. Kim, and B. Berger From the Cover: TRILOGY: Discovery of sequence-structure patterns across diverse proteins PNAS, June 25, 2002; 99(13): 8500 - 8505. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Sperling, P. Dessen, M. Zagulski, R. E. Pearlman, A. Migdalski, R. Gromadka, M. Froissard, A.-M. Keller, and J. Cohen Random Sequencing of Paramecium Somatic DNA Eukaryot. Cell, June 1, 2002; 1(3): 341 - 352. [Abstract] [Full Text] [PDF] |
||||
![]() |
S.-H. Shiu and A. B. Bleecker Plant Receptor-Like Kinase Gene Family: Diversity, Function, and Signaling Sci. Signal., December 18, 2001; 2001(113): re22 - re22. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Gao, L. Estrada, S. Cho, R. E. Ellis, and J. L. Gorski The Caenorhabditis elegans homolog of FGD1, the human Cdc42 GEF gene responsible for faciogenital dysplasia, is critical for excretory cell morphogenesis Hum. Mol. Genet., December 1, 2001; 10(26): 3049 - 3062. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. L. Afonso, E. R. Tulman, Z. Lu, C. A. Balinsky, B. A. Moser, J. J. Becnel, D. L. Rock, and G. F. Kutish Genome Sequence of a Baculovirus Pathogenic for Culex nigripalpus J. Virol., November 15, 2001; 75(22): 11157 - 11165. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Vaccaro, B. Brannetti, L. Montecchi-Palazzi, S. Philipp, M. H. Citterich, G. Cesareni, and L. Dente Distinct Binding Specificity of the Multiple PDZ Domains of INADL, a Human Protein with Homology to INAD from Drosophila melanogaster J. Biol. Chem., November 2, 2001; 276(45): 42122 - 42130. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Pouliot, J. Gao, Q. J. Su, G. G. Liu, and X. B. Ling DIAN: A Novel Algorithm for Genome Ontological Classification Genome Res., October 1, 2001; 11(10): 1766 - 1779. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. J. Fernandes, J. A. C. Powell, and J. A. C. Archer Construction of Rhodococcus random mutagenesis libraries using Tn5 transposition complexes Microbiology, September 1, 2001; 147(9): 2529 - 2536. [Abstract] [Full Text] [PDF] |
||||
![]() |
S.-H. Shiu and A. B. Bleecker Receptor-like kinases from Arabidopsis form a monophyletic gene family related to animal receptor kinases PNAS, August 23, 2001; (2001) 181141598. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Murvai, K. Vlahovicek, C. Szepesvari, and S. Pongor Prediction of Protein Functional Domains from Sequences Using Artificial Neural Networks Genome Res., August 1, 2001; 11(8): 1410 - 1417. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Chen and E. C. Gotschlich ComE, a Competence Protein from Neisseria gonorrhoeae with DNA-Binding Activity J. Bacteriol., May 15, 2001; 183(10): 3160 - 3168. [Abstract] [Full Text] |
||||
![]() |
I. Chambaud, R. Heilig, S. Ferris, V. Barbe, D. Samson, F. Galisson, I. Moszer, K. Dybvig, H. Wroblewski, A. Viari, et al. The complete genome sequence of the murine respiratory pathogen Mycoplasma pulmonis Nucleic Acids Res., May 15, 2001; 29(10): 2145 - 2153. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Yang, I. T. Phan, S. Fitz-Gibbon, M. K. K. Shivji, R. D. Wood, W. M. Clendenin, E. C. Hyman, and J. H. Miller A thermostable endonuclease III homolog from the archaeon Pyrobaculum aerophilum Nucleic Acids Res., February 1, 2001; 29(3): 604 - 613. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. L. Afonso, E. R. Tulman, Z. Lu, L. Zsak, D. L. Rock, and G. F. Kutish The Genome of Turkey Herpesvirus J. Virol., January 15, 2001; 75(2): 971 - 978. [Abstract] [Full Text] |
||||
![]() |
R. J. M. Bongaerts, H.-P. Heinz, U. Hadding, and G. Zysk Antigenicity, Expression, and Molecular Characterization of Surface-Located Pullulanase of Streptococcus pneumoniae Infect. Immun., December 1, 2000; 68(12): 7141 - 7143. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. G. Friedrich, A. Quentmeier, F. Bardischewsky, D. Rother, R. Kraft, S. Kostka, and H. Prinz Novel Genes Coding for Lithotrophic Sulfur Oxidation of Paracoccus pantotrophus GB17 J. Bacteriol., September 1, 2000; 182(17): 4677 - 4687. [Abstract] [Full Text] |
||||
![]() |
A. M. DeSandro, U. M. Nagarajan, and J. M. Boss Associations and Interactions between Bare Lymphocyte Syndrome Factors Mol. Cell. Biol., September 1, 2000; 20(17): 6587 - 6599. [Abstract] [Full Text] |
||||
![]() |
M. G. Reese, G. Hartzell, N. L. Harris, U. Ohler, J. F. Abril, and S. E. Lewis Genome Annotation Assessment in Drosophila melanogaster Genome Res., April 1, 2000; 10(4): 483 - 501. [Abstract] [Full Text] |
||||
![]() |
H. Yang, M. M. Slupska, Y.-F. Wei, J. H. Tai, W. M. Luther, Y.-R. Xia, D. M. Shih, J.-H. Chiang, C. Baikalov, S. Fitz-Gibbon, et al. Cloning and Characterization of a New Member of the Nudix Hydrolases from Human and Mouse J. Biol. Chem., March 17, 2000; 275(12): 8844 - 8853. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Subramanian, E. V. Koonin, and L. Aravind Comparative Genome Analysis of the Pathogenic Spirochetes Borrelia burgdorferi and Treponema pallidum Infect. Immun., March 1, 2000; 68(3): 1633 - 1648. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Hutter, B. E. Vogel, J. D. Plenefisch, C. R. Norris, R. B. Proenca, J. Spieth, C. Guo, S. Mastwal, X. Zhu, I. S. A. J. Scheel, et al. Conservation and Novelty in the Evolution of Cell Adhesion and Extracellular Matrix Genes Science, February 11, 2000; 287(5455): 989 - 994. [Abstract] [Full Text] |
||||
![]() |
A.C. Camproux, P. Tuffery, J.P. Chevrolat, J.F. Boisvieux, and S. Hazout Hidden Markov model approach for identifying the modular framework of the protein backbone Protein Eng. Des. Sel., December 1, 1999; 12(12): 1063 - 1073. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. D. Lindenbach and C. M. Rice Genetic Interaction of Flavivirus Nonstructural Proteins NS1 and NS4A as a Determinant of Replicase Function J. Virol., June 1, 1999; 73(6): 4611 - 4621. [Abstract] [Full Text] |
||||
![]() |
B. M. Cali, S. L. Kuchma, J. Latham, and P. Anderson smg-7 Is Required for mRNA Surveillance in Caenorhabditis elegans Genetics, February 1, 1999; 151(2): 605 - 616. [Abstract] [Full Text] |
||||
![]() |
The C. elegans Sequencing Consortium Genome Sequence of the Nematode C. elegans: A Platform for Investigating Biology Science, December 11, 1998; 282(5396): 2012 - 2018. [Abstract] [Full Text] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||






























